Science.gov

Sample records for gene cluster haplotypes

  1. Fine mapping of disease genes via haplotype clustering.

    PubMed

    Waldron, E R B; Whittaker, J C; Balding, D J

    2006-02-01

    We propose an algorithm for analysing SNP-based population association studies, which is a development of that introduced by Molitor et al. [2003: Am J Hum Genet 73:1368-1384]. It uses clustering of haplotypes to overcome the major limitations of many current haplotype-based approaches. We define a between-haplotype score that is simple, yet appears to capture much of the information about evolutionary relatedness of the haplotypes in the vicinity of a (unobserved) putative causal locus. Haplotype clusters can then be defined via a putative ancestral haplotype and a cut-off distance. The number of an individual's two haplotypes that lie within the cluster predicts the individual's genotype at the causal locus. This predicted genotype can then be investigated for association with the phenotype of interest. We implement our approach within a Markov-chain Monte Carlo algorithm that, in effect, searches over locations and ancestral haplotypes to identify large, case-rich clusters. The algorithm successfully fine-maps a causal mutation in a test analysis using real data, and achieves almost 98% accuracy in predicting the genotype at the causal locus. A simulation study indicates that the new algorithm is substantially superior to alternative approaches, and it also allows us to identify situations in which multi-point approaches can substantially improve over single-SNP analyses. Our algorithm runs quickly and there is scope for extension to a wide range of disease models and genomic scales.

  2. β-globin gene cluster haplotypes in ethnic minority populations of southwest China

    PubMed Central

    Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

    2017-01-01

    The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625

  3. beta(S)-Globin gene cluster haplotypes in the West Bank of Palestine.

    PubMed

    Samarah, Fekri; Ayesh, Suhail; Athanasiou, Miranda; Christakis, John; Vavatsi, Norma

    2009-01-01

    Sickle cell disease is an inherited autosomal recessive disorder of the beta-globin chain. In Palestine it is accompanied by a low level of Hb F (mean 5.14%) and a severe clinical presentation. In this study, 59 Palestinian patients, homozygotes for Hb S were studied for their haplotype background. Eight polymorphic sites in the beta-globin gene cluster were examined. The Benin haplotype was predominant with a frequency of 88.1%, followed by a frequency of 5.1% for the Bantu haplotype. One chromosome was found to carry the Cameroon haplotype (0.85%). Three atypical haplotypes were also found (5.95%). Heterogeneity was observed in Hb F production, ranging between 1.5 and 17.0%, whereas the (G)gamma ratio was homogeneous among all haplotypes with a normal amount of about 41%. Our results are in agreement with previous reports of the Benin haplotype origin in the Mediterranean.

  4. Beta-globin gene cluster haplotypes in sickle cell patients from southwest Iran.

    PubMed

    Rahimi, Z; Karimi, M; Haghshenass, M; Merat, A

    2003-11-01

    Sickle cell anemia in Iran is accompanied by a high level of HbF and mild clinical presentation. Here we report haplotypes of the beta gene cluster found in 81 randomly selected sickle cell patients, including 47 sickle cell anemia (SS), 17 sickle cell trait (AS), and 17 sickle/thalassemia (S/thal) from southwest Iran. We found all five common typical haplotypes as well as five atypical haplotypes in our patients. Except for four patients with homozygous Benin haplotype, none of the other African typical haplotypes were found in a homozygous state. Arab-Indian was found to be the most prevalent haplotype in the study population. This haplotype accounted for 51.1% as the homozygous form in SS patients, where 69.1% of chromosomes in these patients had the Arab-Indian haplotype. Bantu A2 was the second most prevalent haplotype among all patients. The mean %HbF in SS patients was 27.83 and in the homozygous Arab-Indian haplotype it was still higher (30.40%), while in AS patients the %HbF was only 1.20. The high %Ggamma chain (71.81) in the Arab-Indian homozygous haplotype was concomitant with the presence of an Xmn I site in both chromosomes. The presence of the Arab-Indian haplotype as the predominant haplotype might be suggestive of a gene flow to/from Saudi Arabia or India. More haplotype investigations of a normal population can clarify the high incidence of Bantu A2 haplotype in our population.

  5. Nonblack patients with sickle cell disease have African. beta. sup s gene cluster haplotypes

    SciTech Connect

    Rogers, Z.R.; Powars, D.R.; Williams, W.D. ); Kinney, T.R. ); Schroeder, W.A. )

    1989-05-26

    Of 18 nonblack patients with sickle cell disease, 14 had sickle cell anemia, 2 had hemoglobin SC disease, and 2 had hemoglobin S-{beta}{sup o}-thalassemia. The {beta}{sup s} gene cluster haplotypes that were determined in 7 patients were of African origin and were identified as Central African Republic, Central African Republic minor II, Benin, and Senegal. The haplotype Central African Republic minor II was present on the {beta}{sup o}-thalassemia chromosome in 2 patients. None of 10 patients whose {alpha}-gene status was determined had {alpha}-thalassemia-2. These data strongly support the concept that the {beta}{sup s} gene on chromosome 11 of these individuals is of African origin and that the {alpha}-gene locus on chromosome 16 is of white or native American origin. The clinical severity of the disease in these nonblack patients is appropriate to their haplotype without {alpha}-thalassemia-2 and is comparable with that of black patients. All persons with congenital hemolytic anemia should be examined for the presence of sickle cell disease regardless of physical appearance or ethnic background.

  6. Beta-globin gene cluster haplotypes in Afro-Uruguayans from two geographical regions (South and North).

    PubMed

    Da Luz, Julio; Kimura, Elza Miyuki; Costa, Fernando Ferreira; Sonati, Maria de Fatima; Sans, Mónica

    2010-01-01

    The beta-globin gene cluster haplotypes were identified in 52 and 40 chromosomes from two Afro-Uruguayan populations located in the South and North of the country, respectively. In both regions, the 5' haplotype 2 (+ - - - -), characteristic of non-African populations, was the most frequent, reflecting a strong process of admixture in Afro-Uruguayans (0.355 and 0.262, respectively). The haplotypes 3 (- - - - +) and 4 (- + - - +), characteristics of African sub-Saharan populations, present inverse frequencies in North and South: whereas in the South haplotype 3 is the second most frequent (0.232), and haplotype 4 presents a low frequency (0.019), in the North haplotype 4 is the third most frequent (0.140), and haplotype 3 only reaches an intermediate frequency (0.088). The pairwise F(ST) and the exact test of differentiation show genetic heterogeneity between both regions. Nei's genetic distance show that South and North present affinities with Bantu groups, although the North present the smallest genetic distance with the Mandenka, a Senegalese population. With respect to 3' haplotypes, haplotype I was the most frequent in both populations, followed by haplotype II, characteristic of sub-Saharan Africans. The high frequencies of haplotype III-Asian could indicate admixture with Native American populations. The differences observed between both Uruguayan regions could be explained by microevolutionary events as genetic drift, founder effects, differential admixture, and/or distinct origin of the African slaves introduced in those regions.

  7. Haplotyping Problem, A Clustering Approach

    SciTech Connect

    Eslahchi, Changiz; Sadeghi, Mehdi; Pezeshk, Hamid; Kargar, Mehdi; Poormohammadi, Hadi

    2007-09-06

    Construction of two haplotypes from a set of Single Nucleotide Polymorphism (SNP) fragments is called haplotype reconstruction problem. One of the most popular computational model for this problem is Minimum Error Correction (MEC). Since MEC is an NP-hard problem, here we propose a novel heuristic algorithm based on clustering analysis in data mining for haplotype reconstruction problem. Based on hamming distance and similarity between two fragments, our iterative algorithm produces two clusters of fragments; then, in each iteration, the algorithm assigns a fragment to one of the clusters. Our results suggest that the algorithm has less reconstruction error rate in comparison with other algorithms.

  8. Haplotyping Problem, A Clustering Approach

    NASA Astrophysics Data System (ADS)

    Eslahchi, Changiz; Sadeghi, Mehdi; Pezeshk, Hamid; Kargar, Mehdi; Poormohammadi, Hadi

    2007-09-01

    Construction of two haplotypes from a set of Single Nucleotide Polymorphism (SNP) fragments is called haplotype reconstruction problem. One of the most popular computational model for this problem is Minimum Error Correction (MEC). Since MEC is an NP-hard problem, here we propose a novel heuristic algorithm based on clustering analysis in data mining for haplotype reconstruction problem. Based on hamming distance and similarity between two fragments, our iterative algorithm produces two clusters of fragments; then, in each iteration, the algorithm assigns a fragment to one of the clusters. Our results suggest that the algorithm has less reconstruction error rate in comparison with other algorithms.

  9. Effect of beta-globin gene cluster haplotype on the hematological and clinical features of sickle cell anemia.

    PubMed

    Rieder, R F; Safaya, S; Gillette, P; Fryd, S; Hsu, H; Adams, J G; Steinberg, M H

    1991-03-01

    In 113 black American adults with sickle cell anemia (HbSS), we examined nine polymorphic restriction sites, including the Xmnl site 5' to the G gamma gene, to see whether haplotype is related to the level of HbF and the proportion of G gamma chains or if it influences the hematological and clinical features of the disease. Seventy-five percent of the patients were homozygous or heterozygous for the Benin (no. 19) or Central African Republic (Bantu, no. 20) haplotypes; 13.3% were homozygous or heterozygous for the Senegal (no. 3) haplotype, while 11.5% had other genotypes. Of the subjects, 14.2% were either homozygous or heterozygous for the Xmnl restriction site 5' to the G gamma gene. We found no effect of haplotype on HbF levels. The level of G gamma chains was 60.5% +/- 17.0% in individuals heterozygous or homozygous for haplotype no. 3 and was 46.9% +/- 11.6% in individuals with other haplotypes. Subjects with the Xmnl site 5' to the G gamma gene had G gamma globin levels of 59.5% +/- 16.7% while those lacking that site had an average of 47.2% +/- 12.1%. There were no significant differences among these groups in hemoglobin concentration, packed cell volume, mean cell volume, or clinical indicators of vaso-occlusive severity, including crises, hospitalizations per year, aseptic bone necrosis, acute chest syndrome, or leg ulcers. While the presence of haplotype 3 and the 5' G gamma Xmnl site were associated with increased G gamma chains, there was no effect on HbF level or other hematological and clinical features that might reflect disease severity. It is likely that determinants unrelated to haplotype, linked or unlinked to the beta-globin gene cluster, are the major effectors of differences in the levels of HbF in American patients with sickle cell anemia.

  10. Linkage disequilibria and haplotype structure of four SNPs of the interleukin 1 gene cluster in seven Asian Indian populations.

    PubMed

    Raj, Srilakshmi M; Chakraborty, Ranajit; Wang, Ning; Govindaraju, Diddahally R

    2006-02-01

    Variation at four single nucleotide polymorphism (SNP) sites of the interleukin 1 (IL1) gene cluster was investigated among 280 unrelated individuals, representing 7 caste groups from the state of Karnataka, India, and one European American community of Boston, Massachusetts. Allele and haplotype frequencies, strength of linkage disequilibrium, and signatures of recombination varied considerably among populations. Variable community sizes and traditions of consanguinity may account for the observed variation.

  11. Haplotypes in the APOA1-C3-A4-A5 gene cluster affect plasma lipids in both humans and baboons

    SciTech Connect

    Wang, Qian-fei; Liu, Xin; O'Connell, Jeff; Peng, Ze; Krauss, Ronald M.; Rainwater, David L.; VandeBerg, John L.; Rubin, Edward M.; Cheng, Jan-Fang; Pennacchio, Len A.

    2003-09-15

    Genetic studies in non-human primates serve as a potential strategy for identifying genomic intervals where polymorphisms impact upon human disease-related phenotypes. It remains unclear, however, whether independently arising polymorphisms in orthologous regions of non-human primates leads to similar variation in a quantitative trait found in both species. To explore this paradigm, we studied a baboon apolipoprotein gene cluster (APOA1/C3/A4/A5) for which the human gene orthologs have well established roles in influencing plasma HDL-cholesterol and triglyceride concentrations. Our extensive polymorphism analysis of this 68 kb gene cluster in 96 pedigreed baboons identified several haplotype blocks each with limited diversity, consistent with haplotype findings in humans. To determine whether baboons, like humans, also have particular haplotypes associated with lipid phenotypes, we genotyped 634 well characterized baboons using 16 haplotype tagging SNPs. Genetic analysis of single SNPs, as well as haplotypes, revealed an association of APOA5 and APOC3 variants with HDL cholesterol and triglyceride concentrations, respectively. Thus, independent variation in orthologous genomic intervals does associate with similar quantitative lipid traits in both species, supporting the possibility of uncovering human QTL genes in a highly controlled non-human primate model.

  12. Population structure with localized haplotype clusters.

    PubMed

    Browning, Sharon R; Weir, Bruce S

    2010-08-01

    We propose a multilocus version of F(ST) and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific F(ST) estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based F(ST) than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of F(ST) and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data.

  13. Haplotype diversity of VvTFL1A gene and association with cluster traits in grapevine (V. vinifera)

    PubMed Central

    2014-01-01

    Background Interaction between TERMINAL FLOWER 1 (TFL1) and LEAFY (LFY) seem to determine the inflorescence architecture in Arabidopsis. In a parallel way, overexpression of VvTFL1A, a grapevine TFL1 homolog, causes delayed flowering and production of a ramose cluster in the reiterated reproductive meristem (RRM) somatic variant of cultivar Carignan. To analyze the possible contribution of this gene to cluster phenotypic variation in a diversity panel of cultivated grapevine (Vitis vinifera L. subsp. vinifera) its nucleotide diversity was characterized and association analyses among detected sequence polymorphisms and phenology and cluster traits was carried out. Results A total of 3.6 kb of the VvTFL1A gene, including its promoter, was sequenced in a core collection of 140 individuals designed to maximize phenotypic variation at agronomical relevant traits. Nucleotide variation for VvTFL1A within this collection was higher in the promoter and intron sequences than in the exon regions; where few polymorphisms were located in agreement with a high conservation of coding sequence. Characterization of the VvTFL1A haplotype network identified three major haplogroups, consistent with the geographic origins and the use of the cultivars that could correspond to three major ancestral alleles or evolutionary branches, based on the existence of mutations in linkage disequilibrium. Genetic association studies with cluster traits revealed the presence of major INDEL polymorphisms, explaining 16%, 13% and 25% of flowering time, cluster width and berry weight, respectively, and also structuring the three haplogroups. Conclusions At least three major VvTFL1A haplogroups are present in cultivated grapevines, which are defined by the presence of three main polymorphism LD blocks and associated to characteristic phenotypic values for flowering time, cluster width and berry size. Phenotypic differences between haplogroups are consistent with differences observed between Eastern and

  14. Beta-globin gene cluster haplotypes and alpha-thalassemia in sickle cell disease patients from Trinidad.

    PubMed

    Jones-Lecointe, Altheia; Smith, Erskine; Romana, Marc; Gilbert, Marie-Georges; Charles, Waveney P; Saint-Martin, Christian; Kéclard, Lisiane

    2008-01-01

    In this study, we have determined the frequency of beta(S) haplotypes in 163 sickle cell disease patients from Trinidad. The alpha(3.7) globin gene deletion status was also studied with an observed gene frequency of 0.17. Among the 283 beta(S) chromosomes analyzed, the Benin haplotype was the most prevalent (61.8%) followed by Bantu (17.3%), Senegal (8.5%), Cameroon (3.5%), and Arab-Indian (3.2%), while 5.7% of them were atypical. This beta(S) haplotypes distribution differed from those previously described in other Caribbean islands (Jamaica, Guadeloupe, and Cuba), in agreement with the known involvement of the major colonial powers (Spain, France, and Great Britain) in the slave trade in Trinidad and documented an Indian origin of the beta(S) gene.

  15. Physical and genetic mapping of the serpin gene cluster at 14q32.1: allelic association and a unique haplotype associated with alpha 1-antitrypsin deficiency.

    PubMed Central

    Byth, B. C.; Billingsley, G. D.; Cox, D. W.

    1994-01-01

    The alpha 1-antitrypsin (PI) gene is part of a cluster of structurally related serine protease inhibitor genes localized at chromosome 14q32.1, a cluster that includes the alpha 1-antichymotrypsin (AACT), protein C inhibitor (PCI), and corticosteroid-binding globulin (CBG) genes and the alpha 1-antitrypsin-like pseudogene (PIL). The order of the genes is refined here by genetic mapping using simple tandem repeat polymorphisms (STRPs) and by physical mapping in YACs. The order of the genes is (centromere)-CBG-PIL-PI-PCI-AACT-(telomere). Analysis of DNA haplotypes comprising STRP and RFLP markers in the serpin genes reveals considerable allelic association throughout the cluster. Furthermore, the common alpha 1-antitrypsin deficiency allele, PI*Z, has a unique DNA haplotype at the CBG, PIL, and PI loci, which extends over 60 kb in 97% of cases and in 44% of cases includes the PCI and AACT loci. This unique haplotype will be of use in examining a number of other diseases, particularly those with an inflammatory component, thought to be associated with alpha 1-antitrypsin deficiency or partial deficiency. Images Figure 1 Figure 3 PMID:7912884

  16. The beta-globin gene cluster haplotypes in sickle cell anemia patients from Northeast Brazil: a clinical and molecular view.

    PubMed

    Adorno, Elisângela Vitória; Zanette, Angela; Lyra, Isa; Souza, Cyntia Cajado; Santos, Leandro Ferraz; Menezes, Joelma Figueiredo; Dupuit, Marie France; Almeida, Mari Ney Tavares; Reis, Mitermayer Galvão; Gonçalves, Marilda Souza

    2004-08-01

    The beta(S)-globin haplotypes were studied in 78 sickle cell Brazilian patients from Bahia, Northeast Brazil, that has a large population of African origin. Hemoglobin (Hb) profiles were developed by high-performance liquid chromatography (HPLC), and beta(S)-globin gene haplotypes were determined by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) techniques. We identified 44 (55.0%) patients with the CAR/Ben (Central African Republic/Benin) genotype, 16 (20.0%) Ben/Ben, 13 (16.2%) CAR/CAR and seven (8.8%) with other genotypes. Analyses of the phenotypes showed clinical differences related only to Hb F levels and blood transfusion therapy; the presence of -alpha(-3.7)-thalassemia (thal) demonstrated statistical significance when associated with hematocrit (p=0.044), MCV (p=0.0007), MCH (p=0.012) and spleen sequestration events. The haplotype diversity found in the present study can be justified by information about the origin of the slave traffic period in Bahia during the 19th century. The specific characteristics described among the Bahian sickle cell patients could be confirmed by increasing the number of patients with specific genotypes and further studies of genetic markers.

  17. Frequency and origin of haplotypes associated with the beta-globin gene cluster in individuals with trait and sickle cell anemia in the Atlantic and Pacific coastal regions of Colombia

    PubMed Central

    Fong, Cristian; Lizarralde-Iragorri, María Alejandra; Rojas-Gallardo, Diana; Barreto, Guillermo

    2013-01-01

    Sickle cell anemia is a genetic disease with high prevalence in people of African descent. There are five typical haplotypes associated with this disease and the haplotypes associated with the beta-globin gene cluster have been used to establish the origin of African-descendant people in America. In this work, we determined the frequency and the origin of haplotypes associated with hemoglobin S in a sample of individuals with sickle cell anemia (HbSS) and sickle cell hemoglobin trait (HbAS) in coastal regions of Colombia. Blood samples from 71 HbAS and 79 HbSS individuals were obtained. Haplotypes were determined based on the presence of variable restriction sites within the β-globin gene cluster. On the Pacific coast of Colombia the most frequent haplotype was Benin, while on the Atlantic coast Bantu was marginally higher than Benin. Eight atypical haplotypes were observed on both coasts, being more diverse in the Atlantic than in the Pacific region. These results suggest a differential settlement of the coasts, dependent on where slaves were brought from, either from the Gulf of Guinea or from Angola, where the haplotype distributions are similar. Atypical haplotypes probably originated from point mutations that lost or gained a restriction site and/or by recombination events. PMID:24385850

  18. Frequency and origin of haplotypes associated with the beta-globin gene cluster in individuals with trait and sickle cell anemia in the Atlantic and Pacific coastal regions of Colombia.

    PubMed

    Fong, Cristian; Lizarralde-Iragorri, María Alejandra; Rojas-Gallardo, Diana; Barreto, Guillermo

    2013-12-01

    Sickle cell anemia is a genetic disease with high prevalence in people of African descent. There are five typical haplotypes associated with this disease and the haplotypes associated with the beta-globin gene cluster have been used to establish the origin of African-descendant people in America. In this work, we determined the frequency and the origin of haplotypes associated with hemoglobin S in a sample of individuals with sickle cell anemia (HbSS) and sickle cell hemoglobin trait (HbAS) in coastal regions of Colombia. Blood samples from 71 HbAS and 79 HbSS individuals were obtained. Haplotypes were determined based on the presence of variable restriction sites within the β-globin gene cluster. On the Pacific coast of Colombia the most frequent haplotype was Benin, while on the Atlantic coast Bantu was marginally higher than Benin. Eight atypical haplotypes were observed on both coasts, being more diverse in the Atlantic than in the Pacific region. These results suggest a differential settlement of the coasts, dependent on where slaves were brought from, either from the Gulf of Guinea or from Angola, where the haplotype distributions are similar. Atypical haplotypes probably originated from point mutations that lost or gained a restriction site and/or by recombination events.

  19. Beta-globin gene cluster haplotypes and HbF levels are not the only modulators of sickle cell disease in Lebanon.

    PubMed

    Inati, A; Taher, A; Bou Alawi, W; Koussa, S; Kaspar, H; Shbaklo, H; Zalloua, P A

    2003-02-01

    Sickle cell disease (SCD) is an inherited autosomal recessive disorder of the beta-globin chain. Despite the fact that all subjects with SCD have the same single base pair mutation, the severity of the clinical and hematological manifestations is extremely variable. This study examined for the first time in Lebanon the correlation between the clinical manifestation of SCD and the beta-globin gene haplotypes. The haplotypes of 50 patients diagnosed with SCD were determined using polymerase chain reaction amplification of fragments containing nine polymorphic restriction sites around and within the epsilon-Ggamma-Agamma-psibeta-delta-beta-globin gene complex. Most reported haplotypes were found in our population with the Benin haplotype as the most prevalent one. When the patients were divided according to their HbF levels into three groups (Group A: HbF < 5%, Group B: HbF between 5 and 15%, and Group C: HbF > 15%), surprisingly, the highest levels of HbF were associated with the most severe clinical cases. Our findings suggest that fetal hemoglobin levels are important but not the only parameters that affect the severity of the disease. In addition, the high levels of HbF in patients with CAR haplotypes did not seem to ameliorate the severity of symptoms, suggesting that genetic factors other than haplotypes are the major determinants of increased HbF levels in Lebanon.

  20. HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.

    PubMed

    Lin, Yen-Jen; Chen, Yu-Tin; Hsu, Shu-Ni; Peng, Chien-Hua; Tang, Chuan-Yi; Yen, Tzu-Chen; Hsieh, Wen-Ping

    2014-01-01

    Copy number variation (CNV) has been reported to be associated with disease and various cancers. Hence, identifying the accurate position and the type of CNV is currently a critical issue. There are many tools targeting on detecting CNV regions, constructing haplotype phases on CNV regions, or estimating the numerical copy numbers. However, none of them can do all of the three tasks at the same time. This paper presents a method based on Hidden Markov Model to detect parent specific copy number change on both chromosomes with signals from SNP arrays. A haplotype tree is constructed with dynamic branch merging to model the transition of the copy number status of the two alleles assessed at each SNP locus. The emission models are constructed for the genotypes formed with the two haplotypes. The proposed method can provide the segmentation points of the CNV regions as well as the haplotype phasing for the allelic status on each chromosome. The estimated copy numbers are provided as fractional numbers, which can accommodate the somatic mutation in cancer specimens that usually consist of heterogeneous cell populations. The algorithm is evaluated on simulated data and the previously published regions of CNV of the 270 HapMap individuals. The results were compared with five popular methods: PennCNV, genoCN, COKGEN, QuantiSNP and cnvHap. The application on oral cancer samples demonstrates how the proposed method can facilitate clinical association studies. The proposed algorithm exhibits comparable sensitivity of the CNV regions to the best algorithm in our genome-wide study and demonstrates the highest detection rate in SNP dense regions. In addition, we provide better haplotype phasing accuracy than similar approaches. The clinical association carried out with our fractional estimate of copy numbers in the cancer samples provides better detection power than that with integer copy number states.

  1. Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

    PubMed

    Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

    2009-02-01

    High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.

  2. Genetic origin of Behçet's disease population in Denizli, Turkey; population genetics data analysis; historical demography and geographical perspectives based on β-globin gene cluster haplotype variation.

    PubMed

    Ozturk, O; Arikan, S; Bahadir, A; Atalay, A; Atalay, E O

    2017-01-01

    In our study, we aimed to investigate the possible genetic drift, relationships, expansion and historical origin based on haplotype frequencies of the β-globin gene cluster of normal and Behçet's disease (BD) population in Denizli, Turkey. We examined blood DNA samples obtained from our DNA bank. The association of population genetic parameters such as haplotypes, diversity, differentiation, Hardy-Weinberg equilibrium and demographic analysis for two populations was performed by Arlequin ver. 3.5. Our results show that both populations have high similarity in genetic parameters in terms of development and expansion based on haplotype diversity through the history. We found that historical levels of gene flow were significantly higher between the two populations. According to historical population, growth parameter of τ values for normal and BD populations dated approximately 42 000 to 38 000 ybp, respectively. In conclusion, historically, two populations show similar genetic parameters and unimodal growth distribution. Our results are consistent with the view that the BD may have occurred in area, independent from Silk Road.

  3. Variation analysis and gene annotation of eight MHC haplotypes: the MHC Haplotype Project.

    PubMed

    Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J; Almeida, Jeff; Forbes, Simon; Gilbert, James G R; Halls, Karen; Harrow, Jennifer L; Hart, Elizabeth; Howe, Kevin; Jackson, David K; Palmer, Sophie; Roberts, Anne N; Sims, Sarah; Stewart, C Andrew; Traherne, James A; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J; Elliott, John F; Sawcer, Stephen; Todd, John A; Trowsdale, John; Beck, Stephan

    2008-01-01

    The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine.

  4. Functional Haplotypes in Interleukin 4 Gene Associated with Periodontitis

    PubMed Central

    Mayer, Marcia Pinto Alves; Rossa, Carlos

    2017-01-01

    Chronic periodontitis (CP) is an infectious inflammatory disease that affects tooth-supporting structures and in which dental plaque bacteria, immune mechanisms and genetic predisposition play important roles. Interleukin 4 (IL-4) is a key anti-inflammatory cytokine with relevant action in imbalances in inflamed periodontal tissue. Individuals carrying the TCI/CCI genotype (S-haplotype) of the IL-4 gene are 5 times more susceptible to CP, whereas the CTI/TTD genotype (P-haplotype) confers protection against CP. Compared with the S-haplotype, subjects with the P-haplotype produce higher levels of the IL-4 protein after non-surgical periodontal therapy. The present in vitro study aimed to investigate the functionality of IL-4 haplotypes in immune cells to obtain insight into the influence of these genetic variations in regulating immune responses to CP-associated bacteria. Peripheral blood was collected from 6 subjects carrying each haplotype, and their immune cells were challenged with periodontopathogens to compare responses of the different haplotypes with regard to gene expression, protein secretion and the immunophenotype of T helper responses. We found higher IL-4 mRNA and protein levels in the P-haplotype, which also presented higher levels of anti-inflammatory cytokines. In contrast, cells from S-haplotype subjects responded with higher levels of pro-inflammatory cytokines. S-haplotype individuals exhibited significantly greater polarization toward the Th1 phenotype, whereas the P-haplotype was associated with an attenuated response to periodontopathogens, with suggestive skewing toward Th2/M2 phenotypes. In conclusion, IL-4 genetic variations associated with susceptibility to or protection against chronic periodontitis are directly associated with influencing the response of immune cells to periodontopathogens. PMID:28114408

  5. Haplotype-based quantitative trait mapping using a clustering algorithm

    PubMed Central

    Li, Jing; Zhou, Yingyao; Elston, Robert C

    2006-01-01

    Background With the availability of large-scale, high-density single-nucleotide polymorphism (SNP) markers, substantial effort has been made in identifying disease-causing genes using linkage disequilibrium (LD) mapping by haplotype analysis of unrelated individuals. In addition to complex diseases, many continuously distributed quantitative traits are of primary clinical and health significance. However the development of association mapping methods using unrelated individuals for quantitative traits has received relatively less attention. Results We recently developed an association mapping method for complex diseases by mining the sharing of haplotype segments (i.e., phased genotype pairs) in affected individuals that are rarely present in normal individuals. In this paper, we extend our previous work to address the problem of quantitative trait mapping from unrelated individuals. The method is non-parametric in nature, and statistical significance can be obtained by a permutation test. It can also be incorporated into the one-way ANCOVA (analysis of covariance) framework so that other factors and covariates can be easily incorporated. The effectiveness of the approach is demonstrated by extensive experimental studies using both simulated and real data sets. The results show that our haplotype-based approach is more robust than two statistical methods based on single markers: a single SNP association test (SSA) and the Mann-Whitney U-test (MWU). The algorithm has been incorporated into our existing software package called HapMiner, which is available from our website at . Conclusion For QTL (quantitative trait loci) fine mapping, to identify QTNs (quantitative trait nucleotides) with realistic effects (the contribution of each QTN less than 10% of total variance of the trait), large samples sizes (≥ 500) are needed for all the methods. The overall performance of HapMiner is better than that of the other two methods. Its effectiveness further depends on other

  6. iHAP – integrated haplotype analysis pipeline for characterizing the haplotype structure of genes

    PubMed Central

    Song, Chun Meng; Yeo, Boon Huat; Tantoso, Erwin; Yang, Yuchen; Lim, Yun Ping; Li, Kuo-Bin; Rajagopal, Gunaretnam

    2006-01-01

    Background The advent of genotype data from large-scale efforts that catalog the genetic variants of different populations have given rise to new avenues for multifactorial disease association studies. Recent work shows that genotype data from the International HapMap Project have a high degree of transferability to the wider population. This implies that the design of genotyping studies on local populations may be facilitated through inferences drawn from information contained in HapMap populations. Results To facilitate analysis of HapMap data for characterizing the haplotype structure of genes or any chromosomal regions, we have developed an integrated web-based resource, iHAP. In addition to incorporating genotype and haplotype data from the International HapMap Project and gene information from the UCSC Genome Browser Database, iHAP also provides capabilities for inferring haplotype blocks and selecting tag SNPs that are representative of haplotype patterns. These include block partitioning algorithms, block definitions, tag SNP definitions, as well as SNPs to be "force included" as tags. Based on the parameters defined at the input stage, iHAP performs on-the-fly analysis and displays the result graphically as a webpage. To facilitate analysis, intermediate and final result files can be downloaded. Conclusion The iHAP resource, available at , provides a convenient yet flexible approach for the user community to analyze HapMap data and identify candidate targets for genotyping studies. PMID:17137522

  7. Analysis of MHC class I genes across horse MHC haplotypes

    PubMed Central

    Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

    2010-01-01

    The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063

  8. Gene expression of a gene family in maize based on noncollinear haplotypes

    PubMed Central

    Song, Rentao; Messing, Joachim

    2003-01-01

    Genomic regions of nearly every species diverged into different haplotypes, mostly based on point mutations, small deletions, and insertions that do not affect the collinearity of genes within a species. However, the same genomic interval containing the z1C gene cluster of two inbred lines of Zea mays significantly lost their gene collinearity and also differed in the regulation of each remaining gene set. Furthermore, when inbreds were reciprocally crossed, hybrids exhibited an unexpected shift of expression patterns so that “overdominance” instead of “dominance complementation” of allelic and nonallelic gene expression occurred. The same interval also differed in length (360 vs. 263 kb). Segmental rearrangements led to sequence changes, which were further enhanced by the insertion of different transposable elements. Changes in gene order affected not only z1C genes but also three unrelated genes. However, the orthologous interval between two subspecies of rice (not rice cultivars) was conserved in length and gene order, whereas changes between two maize inbreds were as drastic as changes between maize and sorghum. Given that chromosomes could conceivably consist of intervals of haplotypes that are highly diverged, one could envision endless breeding opportunities because of their linear arrangement along a chromosome and their expression potential in hybrid combinations (“binary” systems). The implication of such a hypothesis for heterosis is discussed. PMID:12853580

  9. The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes

    PubMed Central

    Lindenau, Juliana D.; Wagner, Sandrine C.; de Castro, Simone M.; Hutz, Mara H.

    2016-01-01

    Abstract Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migrations waves. Since these haplotypes show different clinical characteristics, these recent migrations events raise the necessity to develop optimized public health programs for sickle cell disease screening and management. PMID:27706371

  10. The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes.

    PubMed

    Lindenau, Juliana D; Wagner, Sandrine C; Castro, Simone M de; Hutz, Mara H

    2016-01-01

    Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migrations waves. Since these haplotypes show different clinical characteristics, these recent migrations events raise the necessity to develop optimized public health programs for sickle cell disease screening and management.

  11. Semi-supervised clustering algorithm for haplotype assembly problem based on MEC model.

    PubMed

    Xu, Xin-Shun; Li, Ying-Xin

    2012-01-01

    Haplotype assembly is to infer a pair of haplotypes from localized polymorphism data. In this paper, a semi-supervised clustering algorithm-SSK (semi-supervised K-means) is proposed for it, which, to our knowledge, is the first semi-supervised clustering method for it. In SSK, some positive information is firstly extracted. The information is then used to help k-means to cluster all SNP fragments into two sets from which two haplotypes can be reconstructed. The performance of SSK is tested on both real data and simulated data. The results show that it outperforms several state-of-the-art algorithms on minimum error correction (MEC) model.

  12. Clustering, haplotype diversity and locations of MIC-3: a unique root-specific defense-related gene family in upland cotton (Gossypium hirsutum L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MIC-3-related genes of cotton (Gossypium spp.) were identified and shown to have root-specific expression, associated with pathogen defense-related function and specifically increased expression in root-knot nematode (RKN) resistant plants after nematode infection. Here we cloned and sequenced MIC-...

  13. Characterisation of SNP haplotype structure in chemokine and chemokine receptor genes using CEPH pedigrees and statistical estimation.

    PubMed

    Clark, Vanessa J; Dean, Michael

    2004-03-01

    Chemokine signals and their cell-surface receptors are important modulators of HIV-1 disease and cancer. To aid future case/control association studies, aim to further characterise the haplotype structure of variation in chemokine and chemokine receptor genes. To perform haplotype analysis in a population-based association study, haplotypes must be determined by estimation, in the absence of family information or laboratory methods to establish phase. Here, test the accuracy of estimates of haplotype frequency and linkage disequilibrium by comparing estimated haplotypes generated with the expectation maximisation (EM) algorithm to haplotypes determined from Centre d'Etude Polymorphisme Humain (CEPH) pedigree data. To do this, they have characterised haplotypes comprising alleles at 11 biallelic loci in four chemokine receptor genes (CCR3, CCR2, CCR5 and CCRL2), which span 150 kb on chromosome 3p21, and haplotyes of nine biallelic loci in six chemokine genes [MCP-1(CCL2), Eotaxin(CCL11), RANTES(CCL5), MPIF-1(CCL23), PARC(CCL18) and MIP-1alpha(CCL3)] on chromosome 17q11-12. Forty multi-generation CEPH families, totalling 489 individuals, were genotyped by the TaqMan 5'-nuclease assay. Phased haplotypes and haplotypes estimated from unphased genotypes were compared in 103 grandparents who were assumed to have mated at random. For the 3p21 single nucleotide polymorphism (SNP) data, haplotypes determined by pedigree analysis and haplotypes generated by the EM algorithm were nearly identical. Linkage disequilibrium, measured by the D' statistic, was nearly maximal across the 150 kb region, with complete disequilibrium maintained at the extremes between CCR3-Y17Y and CCRL2-I243V. D'-values calculated from estimated haplotypes on 3p21 had high concordance with pairwise comparisons between pedigree-phased chromosomes. Conversely, there was less agreement between analyses of haplotype frequencies and linkage disequilibrium using estimated haplotypes when compared with

  14. Beta-thalassemia genes in French-Canadians: haplotype and mutation analysis of Portneuf chromosomes.

    PubMed Central

    Kaplan, F; Kokotsis, G; DeBraekeleer, M; Morgan, K; Scriver, C R

    1990-01-01

    beta-Thalassemia minor occurs at approximately 1% frequency in French-Canadians--in families residing in Portneuf County (population approximately 40,000) of Quebec province. We found eight different RFLP haplotypes at the beta-globin gene cluster in 37 normal persons and in 12 beta-thalassemia heterozygotes from six families. beta-Thalassemia genes in these families associated with two haplotypes only: Mediterranean I and Mediterranean II. There were two different beta-thalassemia mutations segregating in the Portneuf population: an RNA processing mutation (beta(+)IVS-1,nt110) on haplotype I (five families) and a point mutation leading to chain termination (beta(0) nonsense codon 39) on haplotype II (one family). The distribution of 5' haplotypes on normal beta A Portneuf chromosomes compared with other European populations was most similar to that in British subjects (data for French subjects have not yet been reported). Genealogical reconstructions traced the ancestry of carrier couples to settlers emigrating from several different regions of France to New France in the 17th century. These findings indicate genetic diversity of a greater degree among French-Canadians than recognized heretofore. Images Figure 4 PMID:1967205

  15. Beta-thalassemia genes in French-Canadians: haplotype and mutation analysis of Portneuf chromosomes.

    PubMed

    Kaplan, F; Kokotsis, G; DeBraekeleer, M; Morgan, K; Scriver, C R

    1990-01-01

    beta-Thalassemia minor occurs at approximately 1% frequency in French-Canadians--in families residing in Portneuf County (population approximately 40,000) of Quebec province. We found eight different RFLP haplotypes at the beta-globin gene cluster in 37 normal persons and in 12 beta-thalassemia heterozygotes from six families. beta-Thalassemia genes in these families associated with two haplotypes only: Mediterranean I and Mediterranean II. There were two different beta-thalassemia mutations segregating in the Portneuf population: an RNA processing mutation (beta(+)IVS-1,nt110) on haplotype I (five families) and a point mutation leading to chain termination (beta(0) nonsense codon 39) on haplotype II (one family). The distribution of 5' haplotypes on normal beta A Portneuf chromosomes compared with other European populations was most similar to that in British subjects (data for French subjects have not yet been reported). Genealogical reconstructions traced the ancestry of carrier couples to settlers emigrating from several different regions of France to New France in the 17th century. These findings indicate genetic diversity of a greater degree among French-Canadians than recognized heretofore.

  16. Genomic organization and characterization of a three-gene rat adult beta-globin haplotype.

    PubMed

    Au, D M; Wong, W M; Tam, J W; Cheng, L Y; Lam, V M

    1995-11-20

    The isolation and detailed characterization of a three-beta-globin gene (GloB) haplotype in the Sprague-Dawley (S-D) rat is described. An enriched library, lambda SDHelib, was screened with a human GloB probe, humbg44, and from which a beta minor gene, Rathbbz, was isolated, sequenced and characterized. A S-D rat GloB-specific probe, Ratbgze12, derived from the Rathbbz gene, was then used to screen a S-D rat genomic library, lambda SDglib. The clone T1510 was isolated and identified to include the entire Rathbbz gene and part of another GloB gene, Rathbby, which was 5' upstream from Rathbbz. Chromosomal walking upstream using the riboprobe, rnaT71, led to the isolation of an overlapping clone, Ta49, which was shown to include two full-length GloB genes; the most 5' was Rathbbx followed by Rathbby. Sequence data suggests that Rathbbx is a beta major gene, whereas Rathbby is a hybrid gene of Rathbbx and Rathbbz. Genomic hybridization confirmed this particular three-gene haplotype in the S-D rat. This haplotype, a1, may be the prototype of the GloB cluster in rat.

  17. Haplotype Association Mapping Identifies a Candidate Gene Region in Mice Infected With Staphylococcus aureus.

    PubMed

    Johnson, Nicole V; Ahn, Sun Hee; Deshmukh, Hitesh; Levin, Mikhail K; Nelson, Charlotte L; Scott, William K; Allen, Andrew; Fowler, Vance G; Cowell, Lindsay G

    2012-06-01

    Exposure to Staphylococcus aureus has a variety of outcomes, from asymptomatic colonization to fatal infection. Strong evidence suggests that host genetics play an important role in susceptibility, but the specific host genetic factors involved are not known. The availability of genome-wide single nucleotide polymorphism (SNP) data for inbred Mus musculus strains means that haplotype association mapping can be used to identify candidate susceptibility genes. We applied haplotype association mapping to Perlegen SNP data and kidney bacterial counts from Staphylococcus aureus-infected mice from 13 inbred strains and detected an associated block on chromosome 7. Strong experimental evidence supports the result: a separate study demonstrated the presence of a susceptibility locus on chromosome 7 using consomic mice. The associated block contains no genes, but lies within the gene cluster of the 26-member extended kallikrein gene family, whose members have well-recognized roles in the generation of antimicrobial peptides and the regulation of inflammation. Efficient mixed-model association (EMMA) testing of all SNPs with two alleles and located within the gene cluster boundaries finds two significant associations: one of the three polymorphisms defining the associated block and one in the gene closest to the block, Klk1b11. In addition, we find that 7 of the 26 kallikrein genes are differentially expressed between susceptible and resistant mice, including the Klk1b11 gene. These genes represent a promising set of candidate genes influencing susceptibility to Staphylococcus aureus.

  18. Beta-globin gene haplotypes among cameroonians and review of the global distribution: is there a case for a single sickle mutation origin in Africa?

    PubMed

    Bitoungui, Valentina J Ngo; Pule, Gift D; Hanchard, Neil; Ngogang, Jeanne; Wonkam, Ambroise

    2015-03-01

    Studies of hemoglobin S haplotypes in African subpopulations have potential implications for patient care and our understanding of genetic factors that have shaped the prevalence of sickle cell disease (SCD). We evaluated HBB gene cluster haplotypes in SCD patients from Cameroon, and reviewed the literature for a global distribution. We reviewed medical records to obtain pertinent socio-demographic and clinical features for 610 Cameroonian SCD patients, including hemoglobin electrophoresis and full blood counts. RFLP-PCR was used to determine the HBB gene haplotype on 1082 chromosomes. A systematic review of the current literature was undertaken to catalogue HBB haplotype frequencies in SCD populations around the world. Benin (74%; n = 799) and Cameroon (19%; n = 207) were the most prevalent haplotypes observed among Cameroonian patients. There was no significant association between HBB haplotypes and clinical life events, anthropometric measures, hematological parameters, or fetal hemoglobin (HbF) levels. The literature review of the global haplotype distributions was consistent with known historical migrations of the people of Africa. Previously reported data from Sudan showed a distinctly unusual pattern; all four classical haplotypes were reported, with an exceptionally high proportion of the Senegal, Cameroon, and atypical haplotypes. We did not observe any significant associations between HBB haplotype and SCD disease course in this cohort. Taken together, the data from Cameroon and from the wider literature suggest that a careful reassessment of African HBB haplotypes may shed further light on the evolutionary dynamics of the sickle allele, which could suggest a single origin of the sickle mutation.

  19. Haplotyping algorithms

    SciTech Connect

    Sobel, E.; Lange, K.; O`Connell, J.R.

    1996-12-31

    Haplotyping is the logical process of inferring gene flow in a pedigree based on phenotyping results at a small number of genetic loci. This paper formalizes the haplotyping problem and suggests four algorithms for haplotype reconstruction. These algorithms range from exhaustive enumeration of all haplotype vectors to combinatorial optimization by simulated annealing. Application of the algorithms to published genetic analyses shows that manual haplotyping is often erroneous. Haplotyping is employed in screening pedigrees for phenotyping errors and in positional cloning of disease genes from conserved haplotypes in population isolates. 26 refs., 6 figs., 3 tabs.

  20. Non-random association of the Rsa I polymorphic site 5' to the beta-globin gene with major sickle cell haplotypes.

    PubMed

    Sharon, B; Poncz, M; Surrey, S; Schwartz, E

    1988-01-01

    There are three main African haplotypes associated with the sickle mutation on chromosome 11. We have examined an Rsa I polymorphism 550 bp 5' to the beta-globin gene to study the degree of linkage disequilibrium between this Rsa I site and the three haplotypes. This Rsa I site is contained within the 10.3 kb or less area of randomization separating the 5'- and 3'-haplotype clusters. The beta S-containing chromosomes of the Benin and Senegal haplotypes are not cut, while those of the Central African Republic are cleaved by Rsa I at this site. Possible explanations of these findings are discussed.

  1. Glucocorticoid receptor gene haplotype structure and steroid therapy outcome in IBD patients

    PubMed Central

    Mwinyi, Jessica; Wenger, Christa; Eloranta, Jyrki J; Kullak-Ublick, Gerd A

    2010-01-01

    AIM: To study whether the glucocorticoid receptor (GR/NR3C1) gene haplotypes influence the steroid therapy outcome in inflammatory bowel disease (IBD). METHODS: We sequenced all coding exons and flanking intronic sequences of the NR3C1 gene in 181 IBD patients, determined the single nucleotide polymorphisms, and predicted the NR3C1 haplotypes. Furthermore, we investigated whether certain NR3C1 haplotypes are significantly associated with steroid therapy outcomes. RESULTS: We detected 13 NR3C1 variants, which led to the formation of 17 different haplotypes with a certainty of > 95% in 173 individuals. The three most commonly occurring haplotypes were included in the association analysis of the influence of haplotype on steroid therapy outcome or IBD activity. None of the NR3C1 haplotypes showed statistically significant association with glucocorticoid therapy success. CONCLUSION: NR3C1 haplotypes are not related to steroid therapy outcome. PMID:20712049

  2. Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2014-07-01

    The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement.

  3. Genetic link between Asians and native Americans: evidence from HLA genes and haplotypes.

    PubMed

    Tokunaga, K; Ohashi, J; Bannai, M; Juji, T

    2001-09-01

    We have been studying polymorphisms of HLA class I and II genes in East Asians including Buryat in Siberia, Mongolian, Han Chinese, Man Chinese, Korean Chinese, South Korean, and Taiwan indigenous populations in collaboration with many Asian scientists. Regional populations in Japan, Hondo-Japanese, Ryukyuan, and Ainu, were also studied. HLA-A, -B, and -DRB1 gene frequencies were subjected to the correspondence analysis and calculation of DA distances. The correspondence analysis demonstrated several major clusters of human populations in the world. "Mongoloid" populations were highly diversified, in which several clusters such as Northeast Asians, Southeast Asians, Oceanians, and Native Americans were observed. Interestingly, an indigenous population in North Japan, Ainu, was placed relatively close to Native Americans in the correspondence analysis. Distribution of particular HLA-A, -B, -DRB1 alleles and haplotypes was also analyzed in relation to migration and dispersal routes of ancestral populations. A number of alleles and haplotypes showed characteristic patterns of regional distribution. For example, B39-HR5-DQ7 (B*3901-DRB1*1406-DQB1*0301) was shared by Ainu and Native Americans. A24-Cw8-B48 was commonly observed in Taiwan indigenous populations, Maori in New Zealand, Orochon in Northeast China, Inuit, and Tlingit. These findings further support the genetic link between East Asians and Native Americans. We have proposed that various ancestral populations in East Asia, marked by different HLA haplotypes, had migrated and dispersed through multiple routes. Moreover, relatively small genetic distances and the sharing of several HLA haplotypes between Ainu and Native Americans suggest that these populations are descendants of some Upper Paleolithic populations of East Asia.

  4. Implications of the genetic epidemiology of globin haplotypes linked to the sickle cell gene in southern Iran.

    PubMed

    Rahimi, Zohreh; Merat, Ahmad; Gerard, Nathalie; Krishnamoorthy, Rajagopal; Nagel, Ronald L

    2006-12-01

    To determine the origin of sickle cell mutation in different ethnic groups living in southern Iran, we studied the haplotype background of the betaS and betaA genes in subjects from the provinces of Fars, Khuzestan, Bushehr, Hormozgan, and Kerman and from the islands of Khark and Qeshm. beta-globin gene cluster haplotypes were determined using the PCR-RFLP technique. Detection of -alpha 3.7 deletion and beta-thalassemia mutations were defined by PCR and reverse dot blot techniques, respectively. The framework of the beta-globin gene was determined using denaturing gradient gel electrophoresis. We found that the betaS mutation in southern Iran is associated with multiple mutational events. Most of the patients were from two ethnic groups: Farsi speakers (presumably Persian in origin) from Fars province and patients of Arab origin from Khuzestan province. In both ethnic groups the Arab-Indian haplotype was the most prevalent. The frequencies of the Arab-Indian and African haplotypes in sickle cell anemia patients from the provinces of Fars and Khuzestan were similar. Among betaA chromosomes the Bantu A2 haplotype was the most prevalent. The decrease in alpha-globin production in SS patients and AS individuals appeared to be related to the reduction in mean cell volume and mean cell hemoglobin. The Arab-Indian haplotype gene flow into this region of Iran can be traced to the Sassanian Empire. It is likely that the influx of betaS genes linked to the Benin and Bantu haplotypes, of African origin, must have occurred during the Arab slave trade.

  5. Expected size of shared haplotypes surrounding a common disease gene

    SciTech Connect

    Meerman, G.J. te; Meulen, M.A. van der; Sandkuijl, L.A.

    1994-09-01

    If two persons in a founder population share a rare disease, they may share genes involved in that disease Identical By Descent. We have calculated the probability of the size of the region IBD on either side of a shared common gene. Probabilities are plotted for various values of the meiotic count: the number of independent meioses connecting the persons. Even if this number is quite large, the shared area will, given the present density of markers, contain several markers. To be 95% certain that the area surrounding a gene can be delimited to less than 1 cM, approximately 500 meioses need to be observed. The many generations that are required before a gene is separated from its surrounding polymorphisms indicate that association between disease and marker alleles can be explained as IBD around a common gene. In founder populations apparantly unrelated affected persons will likely share disease genes introduced or mutated between 10 and 40 generations ago. Analyzing the overlap of haplotypes gives excellent opportunities to observe implicitly the many meioses required for genetic fine mapping.

  6. Beta-Globin Gene Haplotypes Among Cameroonians and Review of the Global Distribution: Is There a Case for a Single Sickle Mutation Origin in Africa?

    PubMed Central

    Bitoungui, Valentina J. Ngo; Pule, Gift D.; Hanchard, Neil; Ngogang, Jeanne

    2015-01-01

    Abstract Studies of hemoglobin S haplotypes in African subpopulations have potential implications for patient care and our understanding of genetic factors that have shaped the prevalence of sickle cell disease (SCD). We evaluated HBB gene cluster haplotypes in SCD patients from Cameroon, and reviewed the literature for a global distribution. We reviewed medical records to obtain pertinent socio-demographic and clinical features for 610 Cameroonian SCD patients, including hemoglobin electrophoresis and full blood counts. RFLP-PCR was used to determine the HBB gene haplotype on 1082 chromosomes. A systematic review of the current literature was undertaken to catalogue HBB haplotype frequencies in SCD populations around the world. Benin (74%; n=799) and Cameroon (19%; n=207) were the most prevalent haplotypes observed among Cameroonian patients. There was no significant association between HBB haplotypes and clinical life events, anthropometric measures, hematological parameters, or fetal hemoglobin (HbF) levels. The literature review of the global haplotype distributions was consistent with known historical migrations of the people of Africa. Previously reported data from Sudan showed a distinctly unusual pattern; all four classical haplotypes were reported, with an exceptionally high proportion of the Senegal, Cameroon, and atypical haplotypes. We did not observe any significant associations between HBB haplotype and SCD disease course in this cohort. Taken together, the data from Cameroon and from the wider literature suggest that a careful reassessment of African HBB haplotypes may shed further light on the evolutionary dynamics of the sickle allele, which could suggest a single origin of the sickle mutation. PMID:25748438

  7. Human dopamine transporter gene: differential regulation of 18-kb haplotypes

    PubMed Central

    Zhao, Ying; Xiong, Nian; Liu, Yang; Zhou, Yanhong; Li, Nuomin; Qing, Hong; Lin, Zhicheng

    2013-01-01

    Aim Since previous functional studies of short haplotypes and polymorphic sites of SLC6A3 have shown variant-dependent and drug-sensitive promoter activity, this study aimed to understand whether a large SLC6A3 regulatory region, containing these small haplotypes and polymorphic sites, can display haplotype-dependent promoter activity in a drug-sensitive and pathway-related manner. Materials & methods By creating and using a single copy number luciferase-reporter vector, we examined regulation of two different SLC6A3 haplotypes (A and B) of the 5′ 18-kb promoter and two known downstream regulatory variable number tandem repeats by 17 drugs in four different cellular models. Results The two regulatory haplotypes displayed up to 3.2-fold difference in promoter activity. The regulations were drug selective (37.5% of the drugs showed effects), and both haplotype and cell type dependent. Pathway analysis revealed at least 13 main signaling hubs targeting SLC6A3, including histone deacetylation, AKT, PKC and CK2 α-chains. Conclusion SLC6A3 may be regulated via either its promoter or the variable number tandem repeats independently by specific signaling pathways and in a haplotype-dependent manner. Furthermore, we have developed the first pathway map for SLC6A3 regulation. These findings provide a framework for understanding complex and variant-dependent regulations of SLC6A3. PMID:24024899

  8. Haplotype sharing analysis with SNPs in candidate genes: the Genetic Analysis Workshop 12 example.

    PubMed

    Fischer, Christine; Beckmann, Lars; Majoram, Paul; te Meerman, Gerard; Chang-Claude, Jenny

    2003-01-01

    Haplotype sharing analysis was used to investigate the association of affection status with single nucleotide polymorphism (SNP) haplotypes within candidate gene 1 in one sample each from the isolated and the general population of Genetic Analysis Workshop (GAW) 12 simulated data. Gene 1 has direct influence on affection and harbors more than 70 SNPs. Haplotype sharing analysis depends heavily on previous haplotype estimation. Using GENEHUNTER haplotypes, strong evidence was found for most SNPs in the isolated population sample, thus providing evidence for an involvement of this gene, but the maximum -log(10)(p) values for the haplotype sharing statistics (HSS) test statistic did not correspond to the location of the true variant in either population. In comparison, transmission disequilibrium test (TDT) analysis showed the strongest results at the disease-causing variant in both populations, and these were outstanding in the general population. In this example, TDT analysis appears to perform better than HSS in identifying the disease-causing variant, using SNPs within a candidate gene in an outbred population. Simulations showed that the performance of HSS is hampered by closely spaced SNPs in strong linkage disequilibrium with the functional variant and by ambiguous haplotypes.

  9. Acute chest syndrome is associated with single nucleotide polymorphism-defined beta globin cluster haplotype in children with sickle cell anaemia.

    PubMed

    Bean, Christopher J; Boulet, Sheree L; Yang, Genyan; Payne, Amanda B; Ghaji, Nafisa; Pyle, Meredith E; Hooper, W Craig; Bhatnagar, Pallav; Keefer, Jeffrey; Barron-Casella, Emily A; Casella, James F; Debaun, Michael R

    2013-10-01

    Genetic diversity at the human β-globin locus has been implicated as a modifier of sickle cell anaemia (SCA) severity. However, haplotypes defined by restriction fragment length polymorphism sites across the β-globin locus have not been consistently associated with clinical phenotypes. To define the genetic structure at the β-globin locus more thoroughly, we performed high-density single nucleotide polymorphism (SNP) mapping in 820 children who were homozygous for the sickle cell mutation (HbSS). Genotyping results revealed very high linkage disequilibrium across a large region spanning the locus control region and the HBB (β-globin gene) cluster. We identified three predominant haplotypes accounting for 96% of the β(S) -carrying chromosomes in this population that could be distinguished using a minimal set of common SNPs. Consistent with previous studies, fetal haemoglobin level was significantly associated with β(S) -haplotypes. After controlling for covariates, an association was detected between haplotype and rate of hospitalization for acute chest syndrome (ACS) (incidence rate ratio 0·51, 95% confidence interval 0·29-0·89) but not incidence rate of vaso-occlusive pain or presence of silent cerebral infarct (SCI). Our results suggest that these SNP-defined β(S) -haplotypes may be associated with ACS, but not pain or SCI in a study population of children with SCA.

  10. Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.

    PubMed

    Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng

    2010-02-01

    Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evolution, expression and distribution of Ppd-D1 haplotypes were consistent evidentially with each other. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.

  11. Predicting childhood effortful control from interactions between early parenting quality and children's dopamine transporter gene haplotypes.

    PubMed

    Li, Yi; Sulik, Michael J; Eisenberg, Nancy; Spinrad, Tracy L; Lemery-Chalfant, Kathryn; Stover, Daryn A; Verrelli, Brian C

    2016-02-01

    Children's observed effortful control (EC) at 30, 42, and 54 months (n = 145) was predicted from the interaction between mothers' observed parenting with their 30-month-olds and three variants of the solute carrier family C6, member 3 (SLC6A3) dopamine transporter gene (single nucleotide polymorphisms in intron8 and intron13, and a 40 base pair variable number tandem repeat [VNTR] in the 3'-untranslated region [UTR]), as well as haplotypes of these variants. Significant moderating effects were found. Children without the intron8-A/intron13-G, intron8-A/3'-UTR VNTR-10, or intron13-G/3'-UTR VNTR-10 haplotypes (i.e., haplotypes associated with the reduced SLC6A3 gene expression and thus lower dopamine functioning) appeared to demonstrate altered levels of EC as a function of maternal parenting quality, whereas children with these haplotypes demonstrated a similar EC level regardless of the parenting quality. Children with these haplotypes demonstrated a trade-off, such that they showed higher EC, relative to their counterparts without these haplotypes, when exposed to less supportive maternal parenting. The findings revealed a diathesis-stress pattern and suggested that different SLC6A3 haplotypes, but not single variants, might represent different levels of young children's sensitivity/responsivity to early parenting.

  12. Haplotypes of the porcine peroxisome proliferator-activated receptor delta gene are associated with backfat thickness

    PubMed Central

    2009-01-01

    Background Peroxisome proliferator-activated receptor delta belongs to the nuclear receptor superfamily of ligand-inducible transcription factors. It is a key regulator of lipid metabolism. The peroxisome proliferator-activated receptor delta gene (PPARD) has been assigned to a region on porcine chromosome 7, which harbours a quantitative trait locus for backfat. Thus, PPARD is considered a functional and positional candidate gene for backfat thickness. The purpose of this study was to test this candidate gene hypothesis in a cross of breeds that were highly divergent in lipid deposition characteristics. Results Screening for genetic variation in porcine PPARD revealed only silent mutations. Nevertheless, significant associations between PPARD haplotypes and backfat thickness were observed in the F2 generation of the Mangalitsa × Piétrain cross as well as a commercial German Landrace population. Haplotype 5 is associated with increased backfat in F2 Mangalitsa × Piétrain pigs, whereas haplotype 4 is associated with lower backfat thickness in the German Landrace population. Haplotype 4 and 5 carry the same alleles at all but one SNP. Interestingly, the opposite effects of PPARD haplotypes 4 and 5 on backfat thickness are reflected by opposite effects of these two haplotypes on PPAR-δ mRNA levels. Haplotype 4 significantly increases PPAR-δ mRNA levels, whereas haplotype 5 decreases mRNA levels of PPAR-δ. Conclusion This study provides evidence for an association between PPARD and backfat thickness. The association is substantiated by mRNA quantification. Further studies are required to clarify, whether the observed associations are caused by PPARD or are the result of linkage disequilibrium with a causal variant in a neighbouring gene. PMID:19943979

  13. Rare missense variants within a single gene form yin yang haplotypes.

    PubMed

    Curtis, David

    2016-01-01

    Yin yang haplotype pairs differ at every SNP. They would not be accounted for by population models that incorporate sequential mutation, with or without recombination. Previous reports have claimed that there is a tendency for common SNPs to form yin yang haplotypes more often than would be expected by sequential mutation or by a random sample of all possible haplotypic arrangements of alleles. In the course of analysing next-generation sequencing data, instances of yin yang haplotypes being formed by very rare variants within a single gene were observed. As an example, this report describes a completely yin yang haplotype formed by eight rare missense variants in the ABCA13 gene. Of 1000 genome subjects, 21 have a copy of the alternate allele at all eight of these positions and a single subject is homozygous for all of them. None of the other 1070 subjects possesses any of the altetrnates. Thus, the eight alternate alleles are always found together and never occur separately. The existence of such yin yang haplotypes has important implications for statistical methods for analysing rare variants. Also, they may be of use for gaining a better understanding of the history of human populations.

  14. Association of MAPT haplotypes with Alzheimer’s disease risk and MAPT brain gene expression levels

    PubMed Central

    2014-01-01

    Introduction MAPT encodes for tau, the predominant component of neurofibrillary tangles that are neuropathological hallmarks of Alzheimer’s disease (AD). Genetic association of MAPT variants with late-onset AD (LOAD) risk has been inconsistent, although insufficient power and incomplete assessment of MAPT haplotypes may account for this. Methods We examined the association of MAPT haplotypes with LOAD risk in more than 20,000 subjects (n-cases = 9,814, n-controls = 11,550) from Mayo Clinic (n-cases = 2,052, n-controls = 3,406) and the Alzheimer’s Disease Genetics Consortium (ADGC, n-cases = 7,762, n-controls = 8,144). We also assessed associations with brain MAPT gene expression levels measured in the cerebellum (n = 197) and temporal cortex (n = 202) of LOAD subjects. Six single nucleotide polymorphisms (SNPs) which tag MAPT haplotypes with frequencies greater than 1% were evaluated. Results H2-haplotype tagging rs8070723-G allele associated with reduced risk of LOAD (odds ratio, OR = 0.90, 95% confidence interval, CI = 0.85-0.95, p = 5.2E-05) with consistent results in the Mayo (OR = 0.81, p = 7.0E-04) and ADGC (OR = 0.89, p = 1.26E-04) cohorts. rs3785883-A allele was also nominally significantly associated with LOAD risk (OR = 1.06, 95% CI = 1.01-1.13, p = 0.034). Haplotype analysis revealed significant global association with LOAD risk in the combined cohort (p = 0.033), with significant association of the H2 haplotype with reduced risk of LOAD as expected (p = 1.53E-04) and suggestive association with additional haplotypes. MAPT SNPs and haplotypes also associated with brain MAPT levels in the cerebellum and temporal cortex of AD subjects with the strongest associations observed for the H2 haplotype and reduced brain MAPT levels (β = -0.16 to -0.20, p = 1.0E-03 to 3.0E-03). Conclusions These results confirm the previously reported MAPT H2 associations with LOAD risk in two large series, that this haplotype has the strongest

  15. Atypical haplotypes linked to the beta S gene in Africa are likely to be the product of recombination.

    PubMed

    Srinivas, R; Dunda, O; Krishnamoorthy, R; Fabry, M E; Georges, A; Labie, D; Nagel, R L

    1988-09-01

    We report here the haplotypes of 10 MstII-defined SS patients and a S/beta o thalassemia from the Central African Republic, exhibiting 7 different atypical haplotypes that are different from the typical Bantu haplotype that characterize over 93% of the beta s bearing chromosomes in that region of Africa. Of the seven atypical haplotypes, six can be easily interpreted as the result of recombination around the "hot spot" 5' of the beta gene, between a typical Bantu haplotype and other haplotypes available in the normal population. Except for one case that requires further study, this result demonstrates that the main mutational event leading to sickle hemoglobin in Bantu-speaking Africa was the mutation of the beta gene in a Bantu haplotype background.

  16. Autism Associated Haplotype Affects the Regulation of the Homeobox Gene, ENGRAILED 2

    PubMed Central

    Benayed, Rym; Choi, Jiyeon; Matteson, Paul G; Gharani, Neda; Kamdar, Silky; Brzustowicz, Linda M; Millonig, James H

    2009-01-01

    Background Association analysis identified the homeobox transcription factor, ENGRAILED 2 (EN2), as a possible Autism Spectrum Disorder (ASD) susceptibility gene (ASD [MIM 608636]; EN2 [MIM 131310]). The common alleles (underlined) of two intronic SNPs, rs1861972 (A/G) and rs1861973 (C/T), are over-transmitted to affected individuals both singly and as a haplotype in three separate datasets (518 families total, haplotype P=0.00000035). Methods: Further support that EN2 is a possible ASD susceptibility gene requires the identification of a risk allele, a DNA variant that is consistently associated with ASD but is also functional. To identify possible risk alleles, additional association analysis and LD mapping were performed. Candidate polymorphisms were then tested for functional differences by luciferase (luc) reporter transfections and Electrophoretic Mobility Shift Assays (EMSAs). Results: Association analysis of additional EN2 polymorphisms and LD mapping with Hapmap SNPs identified the rs1861972-rs1861973 haplotype as the most appropriate candidate to test for functional differences. Luc reporters for the two common rs1861972-rs1861973 haplotypes (A-C and G-T) were then transfected into human and rat cell lines as well as primary mouse neuronal cultures. In all cases the A-C haplotype resulted in a significant increase in luc levels (P<.005). EMSAs were then performed and nuclear factors bound specifically to the A and C alleles of both SNPs. Conclusions: These data indicate the AC haplotype is functional and together with the association and LD mapping results support EN2 as a likely ASD susceptibility gene and the A-C haplotype as a possible risk allele. PMID:19615670

  17. Parallel antibody germline gene and haplotype analyses support the validity of immunoglobulin germline gene inference and discovery.

    PubMed

    Kirik, Ufuk; Greiff, Lennart; Levander, Fredrik; Ohlin, Mats

    2017-04-04

    Analysis of antibody repertoire development and specific antibody responses important for e.g. autoimmune conditions, allergy, and protection against disease is supported by high throughput sequencing and associated bioinformatics pipelines that describe the diversity of the encoded antibody variable domains. Proper assignment of sequences to germline genes are important for many such processes, for instance in the analysis of somatic hypermutation. Germline gene inference from antibody-encoding transcriptomes, by using tools such as TIgGER or IgDiscover, has a potential to enhance the quality of such analyses. These tools may also be used to identify germline genes not previously known. In this study, we exploited such software for germline gene inference and define aspects of analysis settings and pre-existing knowledge of germline genes that affect the outcome of gene inference. Furthermore, we demonstrate the capacity of IGHJ and IGHD haplotype inference, whenever subjects are heterozygous with respect to such genes, to lend support to IGHV gene inference in general, and to the identification of novel alleles presently not recognized by germline gene reference directories. We propose that such haplotype analysis shall, whenever possible, be used in future best practice to support the outcome of germline gene inference. IGHJ-directed haplotype inference was also used to identify haplotypes not expressing some IGHV germline genes. In particular, we identified a haplotype that did not express several major germline genes such as IGHV1-8, IGHV3-9, IGHV3-15, IGHV1-18, IGHV3-21, and IGHV3-23. We envisage that haplotype analysis will provide an efficient approach to identify subjects for further studies of the link between the available immunoglobulin repertoire and outcomes of immune responses.

  18. SNP/haplotype associations in cytokine and cytokine receptor genes and immunity to rubella vaccine.

    PubMed

    Dhiman, Neelam; Haralambieva, Iana H; Kennedy, Richard B; Vierkant, Robert A; O'Byrne, Megan M; Ovsyannikova, Inna G; Jacobson, Robert M; Poland, Gregory A

    2010-04-01

    An effective immune response to vaccination is, in part, a complex interaction of alleles of multiple genes regulating cytokine networks. We conducted a genotyping study of Th1/Th2/inflammatory cytokines/cytokine receptors in healthy children (n = 738, 11-19 years) to determine associations between individual single-nucleotide polymorphisms (SNPs)/haplotypes and immune outcomes after two doses of rubella vaccine. SNPs (n = 501) were selected using the ldSelect-approach and genotyped using Illumina GoldenGate and TaqMan assays. Rubella-IgG levels were measured by immunoassay and secreted cytokines by ELISA. Linear regression and post hoc haplotype analyses were used to determine associations between single SNPs/haplotypes and immune outcomes. Increased carriage of minor alleles for the promoter SNPs (rs2844482 and rs2857708) of the TNFA gene were associated with dose-related increases in rubella antibodies. IL-6 secretion was co-directionally associated (p < or = 0.01) with five intronic SNPs in the TNFRSF1B gene in an allele dose-related manner, while five promoter/intronic SNPs in the IL12B gene were associated with variations in IL-6 secretion. TNFA haplotype AAACGGGGC (t-statistic = 3.32) and IL12B promoter haplotype TAG (t-statistic = 2.66) were associated with higher levels of (p < or = 0.01) rubella-IgG and IL-6 secretion, respectively. We identified individual SNPs/haplotypes in TNFA/TNFRSF1B and IL12B genes that appear to modulate immunity to rubella vaccination. Identification of such "genetic fingerprints" may predict the outcome of vaccine response and inform new vaccine strategies.

  19. The role of the JAK2 GGCC haplotype and the TET2 gene in familial myeloproliferative neoplasms

    PubMed Central

    Olcaydu, Damla; Rumi, Elisa; Harutyunyan, Ashot; Passamonti, Francesco; Pietra, Daniela; Pascutto, Cristiana; Berg, Tiina; Jäger, Roland; Hammond, Emma; Cazzola, Mario; Kralovics, Robert

    2011-01-01

    Background Myeloproliferative neoplasms constitute a group of diverse chronic myeloid malignancies that share pathogenic features such as acquired mutations in the JAK2, TET2, CBL and MPL genes. There are recent reports that a JAK2 gene haplotype (GGCC or 46/1) confers susceptibility to JAK2 mutation-positive myeloproliferative neoplasms. The aim of this study was to examine the role of the JAK2 GGCC haplotype and germline mutations of TET2, CBL and MPL in familial myeloproliferative neoplasms. Design and Methods We investigated patients with familial (n=88) or sporadic (n=684) myeloproliferative neoplasms, and a control population (n=203) from the same demographic area in Italy. Association analysis was performed using tagged single nucleotide polymorphisms (rs10974944 and rs12343867) of the JAK2 haplotype. Sequence analysis of TET2, CBL and MPL was conducted in the 88 patients with familial myeloproliferative neoplasms. Results Association analysis revealed no difference in haplotype frequency between familial and sporadic cases of myeloproliferative neoplasms (P=0.6529). No germline mutations in TET2, CBL or MPL that segregate with the disease phenotype were identified. As we observed variability in somatic mutations in the affected members of a pedigree with myeloproliferative neoplasms, we postulated that somatic mutagenesis is increased in familial myeloproliferative neoplasms. Accordingly, we compared the incidence of malignant disorders between sporadic and familial patients. Although the overall incidence of malignant disorders did not differ significantly between cases of familial and sporadic myeloproliferative neoplasms, malignancies were more frequent in patients with familial disease aged between 50 to 70 years (P=0.0198) than in patients in the same age range with sporadic myeloproliferative neoplasms. Conclusions We conclude that the JAK2 GGCC haplotype and germline mutations of TET2, CBL or MPL do not explain familial clustering of

  20. Haplotype analysis of the folate-related genes MTHFR, MTRR, and MTR and migraine with aura

    PubMed Central

    Roecklein, Kathryn A.; Scher, Ann I.; Smith, Albert; Harris, Tamara; Eiriksdottir, Gudny; Garcia, Melissa; Gudnason, Villi; Launer, Lenore J.

    2014-01-01

    Aims The C677T variant in the methylenetetrahydrofolate reductase (MTHFR; EC 1.5.1.20) enzyme, a key player in the folate metabolic pathway, has been associated with increased risk of migraine with aura. Other genes encoding molecular components of this pathway include Methionine synthase (MTR; EC 2.1.1.13), and Methionine synthase reductase (MTRR; EC 2.1.1.135) among others. We performed a haplotype analysis of migraine risk and MTHFR, MTR, and MTRR. Methods Study participants are from a random sub-sample participating in the population-based AGES-Reykjavik Study, including subjects with non-migraine headache (n=367), migraine without aura (n=85), migraine with aura (n=167), and no headache (n=1347). Haplotypes spanning each gene were constructed using Haploview. Association testing was performed on single SNPs and haplotypes using logistic regression, controlling for demographic and cardiovascular risk factors and correcting for multiple testing Results Haplotype analysis suggested an association between MTRR haplotypes and reduced risk of migraine with aura. All other associations were not significant after correcting for multiple testing. Conclusions These results suggest that MTRR variants may protect against migraine with aura in an older population. PMID:23430981

  1. Nitrogen-metabolism related genes in barley - haplotype diversity, linkage mapping and associations with malting and kernel quality parameters

    PubMed Central

    2013-01-01

    Background Several studies report about intra-specific trait variation of nitrogen-metabolism related traits, such as N(itrogen)-use efficiency, protein content, N-storage and remobilization in barley and related grass species. The goal of this study was to assess the intra-specific genetic diversity present in primary N-metabolism genes of barley and to investigate the associations of the detected haplotype diversity with malting and kernel quality related traits. Results Partial sequences of five genes related to N-metabolism in barley (Hordeum vulgare L.) were obtained, i.e. nitrate reductase 1, glutamine synthetase 2, ferredoxin-dependent glutamate synthase, aspartate aminotransferase and asparaginase. Two to five haplotypes in each gene were discovered in a set of 190 various varieties. The development of 33 SNP markers allowed the genotyping of all these barley varieties consisting of spring and winter types. Furthermore, these markers could be mapped in several doubled haploid populations. Cluster analysis based on haplotypes revealed a more uniform pattern of the spring barleys as compared to the winter barleys. Based on linear model approaches associations to several malting and kernel quality traits including soluble N and protein were identified. Conclusions A study was conducted to investigate the presence of sequence variation of several genes related to the primary N-metabolism in barley. The detected diversity could be related to particular phenotypic traits. Specific differences between spring and winter barleys most likely reflect different breeding aims. The developed markers can be used as tool for further genetic studies and marker-assisted selection during breeding of barley. PMID:24007272

  2. Variants and Haplotypes in Angiotensinogen Gene Are Associated With Plasmatic Angiotensinogen Level in Mexican Population

    PubMed Central

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Alfaro-Ruiz, Luis; Carrillo, Karol; Elizalde, Adela; Gil, Trinidad; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2011-01-01

    Introduction The plasmatic angiotensinogen (AGT) level has been associated with essential hypertension. Linkage analysis has found a relationship between the AGT gene locus and hypertension in the Mexican-American population, but studies have failed to identify genetic variants associated with hypertension or plasma AGT levels. This study analyzes the relationship between polymorphisms in the AGT gene and plasmatic AGT levels in Mexican population. Methods Nine polymorphisms in AGT gene were genotyped, and plasma AGT level was determined by enzyme-linked immunosorbent assay. Results Differences in AGT plasma levels were associated with 2 polymorphisms: T-20G, TT = 25.3 ± 8.3 versus TG + GG = 21.6 ± 8.8 μg/mL; P = 0.008 and C3389T (T174M), CC = 25.8 ± 9.9 versus TC + TT = 20.5 ± 5.4 μg/mL; P = 0.0002. Haplotype 2 was associated with low plasma AGT (−5.1 μg/mL [95% confidence interval: −8.6 to −1.6], P = 0.004) and Haplotype 8 was associated with high plasma AGT (6.5 μg/mL [95% confidence interval: 2.5 to 10.6], P = 0.001). This association remained after adjustment for covariates. A Likelihood Ratio Test for haplotype-phenotype association adjusted for covariates resulted in χ2 = 38.9, P = 0.0005. The total effect of the haplotypes on plasma AGT level variance was 19.5%. No association was identified between haplotypes and quantitative traits of blood pressure. Conclusions Two polymorphisms (T-20G and C3389T) and 2 haplotypes (H2 and H8) showed an association with plasma AGT levels in Mexican population. PMID:21629041

  3. Haplotype and linkage disequilibrium analysis of the CRMP1 and EVC genes.

    PubMed

    Sivakumaran, Theru A; Lesperance, Marci M

    2004-11-01

    In this report, we present the haplotype and linkage disequilibrium (LD) pattern in the Collapsin Response Mediator Protein 1 (CRMP1) and Ellis-van Creveld syndrome (EVC) gene region. We genotyped eight different single nucleotide polymorphisms (SNPs) in the CRMP1 and EVC genes in 90 control individuals of diverse ethnicity. The minor allele frequencies ranged from 3.3-49.4%, with most having a frequency >25%. A total of 37 haplotypes were derived from these eight polymorphisms, with only one haplotype having a frequency >10%. Pairwise LD analysis showed a weak but significant LD between markers located about 243 kb apart in this region. The LD was significant between markers spaced about 208 kb apart in EVC, whereas no LD was found between a pair of markers located about 5 kb apart in CRMP1. However, in general, LD correlated with the distance between loci. The CRMP1 and EVC genes are located near WFS1, the Wolfram syndrome type 1 gene, in which mutations also cause low frequency sensorineural hearing loss (LFSNHL). The haplotypes obtained from these polymorphisms will be useful to track the segregation of phenotypes in families with Ellis-van Creveld syndrome, Weyers acrodental dysostosis, LFSNHL and Wolfram syndrome type 1.

  4. Interactions “Candidatus Liberibacter solanacearum”—Bactericera cockerelli: Haplotype Effect on Vector Fitness and Gene Expression Analyses

    PubMed Central

    Yao, Jianxiu; Saenkham, Panatda; Levy, Julien; Ibanez, Freddy; Noroy, Christophe; Mendoza, Azucena; Huot, Ordom; Meyer, Damien F.; Tamborindeguy, Cecilia

    2016-01-01

    “Candidatus Liberibacter solanacearum” (Lso) has emerged as a serious threat world-wide. Five Lso haplotypes have been identified so far. Haplotypes A and B are present in the Americas and/or New Zealand, where they are vectored to solanaceous plants by the potato psyllid, Bactericera cockerelli (Šulc) (Hemiptera: Triozidae). The fastidious nature of these pathogens has hindered the study of the interactions with their eukaryotic hosts (vector and plant). To understand the strategies used by these pathogens to infect their vector, the effects of each Lso haplotype (A or B) on psyllid fitness was investigated, and genome-wide transcriptomic and RT-qPCR analyses were performed to evaluate Lso gene expression in association with its vector. Results showed that psyllids infected with haplotype B had significantly lower percentage of nymphal survival compared to psyllids infected with haplotype A. Although overall gene expression across Lso genome was similar between the two Lso haplotypes, differences in the expression of key candidate genes were found. Among the 16 putative type IV effector genes tested, four of them were differentially expressed between Lso haplotypes, while no differences in gene expression were measured by qPCR or transcriptomic analysis for the rest of the genes. This study provides new information regarding the pathogenesis of Lso haplotypes in their insect vector. PMID:27376032

  5. Association, haplotype, and gene-gene interactions of the HPA axis genes with suicidal behaviour in affective disorders.

    PubMed

    Leszczyńska-Rodziewicz, Anna; Szczepankiewicz, Aleksandra; Pawlak, Joanna; Dmitrzak-Weglarz, Monika; Hauser, Joanna

    2013-01-01

    Family twin and adoption studies have noted the heritability of specific biological factors that influence suicidal behaviour. Exposure to stress is one of the factors that strongly contribute to suicide attempts. The biological response to stress involves the hypothalamic-pituitary-adrenal axis (HPA). Therefore, we found it interesting to study polymorphisms of genes involved in the HPA axis (CRHR1, NR3C1, and AVPBR1). The study was performed on 597 patients, 225 of whom had a history of suicide attempts. We did not observe any significant differences in the studied polymorphisms between the group of patients with a history of suicide attempts and the control subjects. Our haplotype analysis of the AVPR1b gene revealed an association between the GCA haplotype and suicide attempts; however, this association was not significant after correcting for multiple testing. We did not observe any other association in haplotype and MDR analysis. We report here a comprehensive analysis of the HPA axis genes and a lack of association for genetic variations regarding the risk of suicide attempts in affective disorder patients. Nonetheless, the inconsistencies with the previously published results indicate the importance of the further investigation of these polymorphisms with respect to the risk of suicide attempts.

  6. Maize haplotype with a helitron-amplified cytidine deaminase gene copy

    PubMed Central

    Xu, Jian-Hong; Messing, Joachim

    2006-01-01

    Background Genetic maps are based on recombination of orthologous gene sequences between different strains of the same species. Therefore, it was unexpected to find extensive non-collinearity of genes between different inbred strains of maize. Interestingly, disruption of gene collinearity can be caused among others by a rolling circle-type copy and paste mechanism facilitated by Helitrons. However, understanding the role of this type of gene amplification has been hampered by the lack of finding intact gene sequences within Helitrons. Results By aligning two haplotypes of the z1C1 locus of maize we found a Helitron that contains two genes, one encoding a putative cytidine deaminase and one a hypothetical protein with part of a 40S ribosomal protein. The cytidine deaminase gene, called ZmCDA3, has been copied from the ZmCDA1 gene on maize chromosome 7 about 4.5 million years ago (mya) after maize was formed by whole-genome duplication from two progenitors. Inbred lines contain gene copies of both progenitors, the ZmCDA1 and ZmCDA2 genes. Both genes diverged when the progenitors of maize split and are derived from the same progenitor as the rice OsCDA1 gene. The ZmCDA1 and ZmCDA2 genes are both transcribed in leaf and seed tissue, but transcripts of the paralogous ZmCDA3 gene have not been found yet. Based on their protein structure the maize CDA genes encode a nucleoside deaminase that is found in bacterial systems and is distinct from the mammalian RNA and/or DNA modifying enzymes. Conclusion The conservation of a paralogous gene sequence encoding a cytidine deaminase gene over 4.5 million years suggests that Helitrons could add functional gene sequences to new chromosomal positions and thereby create new haplotypes. However, the function of such paralogous gene copies cannot be essential because they are not present in all maize strains. However, it is interesting to note that maize hybrids can outperform their inbred parents. Therefore, certain haplotypes may

  7. Molecular evolution and functional characterisation of haplotypes of an important rubber biosynthesis gene in Hevea brasiliensis.

    PubMed

    Uthup, T K; Rajamani, A; Ravindran, M; Saha, T

    2016-07-01

    Hydroxy-methylglutaryl coenzyme-A synthase (HMGS) is a rate-limiting enzyme in the cytoplasmic isoprenoid biosynthesis pathway leading to natural rubber production in Hevea brasiliensis (rubber). Analysis of the structural variants of this gene is imperative to understand their functional significance in rubber biosynthesis so that they can be properly utilised for ongoing crop improvement programmes in Hevea. We report here allele richness and diversity of the HMGS gene in selected popular rubber clones. Haplotypes consisting of single nucleotide polymorphisms (SNPs) from the coding and non-coding regions with a high degree of heterozygosity were identified. Segregation and linkage disequilibrium analysis confirmed that recombination is the major contributor to the generation of allelic diversity, rather than point mutations. The evolutionarily conserved nature of some SNPs was identified by comparative DNA sequence analysis of HMGS orthologues from diverse taxa, demonstrating the molecular evolution of rubber biosynthesis genes in general. In silico three-dimensional structural studies highlighting the structural positioning of non-synonymous SNPs from different HMGS haplotypes revealed that the ligand-binding site on the enzyme remains impervious to the reported sequence variations. In contrast, gene expression results indicated the possibility of association between specific haplotypes and HMGS expression in Hevea clones, which may have a downstream impact up to the level of rubber production. Moreover, haplotype diversity of the HMGS gene and its putative association with gene expression can be the basis for further genetic association studies in rubber. Furthermore, the data also show the role of SNPs in the evolution of candidate genes coding for functional traits in plants.

  8. Origins of Wohlfahrtia magnifica in Italy based on the identification of mitochondrial cytochrome b gene haplotypes.

    PubMed

    Marangi, Marianna; Hall, Martin J R; Aitken, Alex; Ready, Paul D; Giangaspero, Annunziata

    2016-02-01

    To identify the geographical origins of larvae of Wohlfahrtia magnifica (Diptera: Sarcophagidae) causing myiasis of sheep in Italy, comparative DNA sequence analysis of the mitochondrial cytochrome b gene was performed, based on gene fragments amplified by PCR from genomic DNA isolated from individual specimens. DNA extractions of 19 larvae from Lazio, Molise, Puglia, and Sicilia generated 17 readable sequences homologous to 2 haplotypes, either CB_magn01 or CB_magn02; DNA extracts from 4 adult flies from Calabria (reared from larvae) produced 4 readable sequences belonging to the haplotype CB_magn01. The two haplotypes found represent both the East and West phylogenetic lineages of W. magnifica, which is consistent with the species' arrival from central/southeast Europe (East lineage) and/or from southwest Europe/northwest Africa (West lineage). This is the first report of the sympatric occurrence of the two lineages, which could have resulted from natural or human-assisted dispersal. Polymorphic nuclear loci will have to be characterized in order to explain the origins and lack of mitochondrial haplotype diversity of this pest in Italy, where it poses increasing veterinary problems.

  9. Haplotype test reveals departure from neutrality in a segment of the white gene of Drosophila melanogaster

    SciTech Connect

    Kirby, D.A.; Stephan, W.

    1995-12-01

    Restriction map studies previously revealed extensive linkage disequilibria in the transcriptional unit of the white locus in natural Drosophila melanogaster populations. To understand the causes of these disequilibria, we sequenced a 4722-bp region of the white gene from 15 lines of D. melanogaster and 1 line of Drosophila simulans. Statistical tests applied to the entire 4722-bp region do not reject neutrality. In contrast, a test for high-frequency haplotypes ({open_quotes}Haplotype test{close_quotes}) revealed an 834-bp segment, encompassing the 3{prime} end of intron 1 to the 3{prime} end of intron 2, in which the structure of variation deviates significantly from the predictions of a neutral equilibrium model. The variants in this 834-bp segment segregate as single haplotype blocks. We propose that these unusually large haplotype blocks are due to positive selection on polymorphisms within the white gene, including a replacement polymorphism, Arg{yields}Leu, within this segment. 45 refs., 4 figs., 1 tab.

  10. Catechol-O-methyltransferase gene haplotypes in Mexican and Spanish patients with fibromyalgia

    PubMed Central

    Vargas-Alarcón, Gilberto; Fragoso, José-Manuel; Cruz-Robles, David; Vargas, Angélica; Vargas, Alfonso; Lao-Villadóniga, José-Ignacio; García-Fructuoso, Ferrán; Ramos-Kuri, Manuel; Hernández, Fernando; Springall, Rashidi; Bojalil, Rafael; Vallejo, Maite; Martínez-Lavín, Manuel

    2007-01-01

    Autonomic dysfunction is frequent in patients with fibromyalgia (FM). Heart rate variability analyses have demonstrated signs of ongoing sympathetic hyperactivity. Catecholamines are sympathetic neurotransmitters. Catechol-O-methyltransferase (COMT), an enzyme, is the major catecholamine-clearing pathway. There are several single-nucleotide polymorphisms (SNPs) in the COMT gene associated with the different catecholamine-clearing abilities of the COMT enzyme. These SNPs are in linkage disequilibrium and segregate as 'haplotypes'. Healthy females with a particular COMT gene haplotype (ACCG) producing a defective enzyme are more sensitive to painful stimuli. The objective of our study was to define whether women with FM, from two different countries (Mexico and Spain), have the COMT gene haplotypes that have been previously associated with greater sensitivity to pain. All the individuals in the study were female. Fifty-seven Mexican patients and 78 Spanish patients were compared with their respective healthy control groups. All participants filled out the Fibromyalgia Impact Questionnaire (FIQ). Six COMT SNPs (rs2097903, rs6269, rs4633, rs4818, rs4680, and rs165599) were genotyped from peripheral blood DNA. In Spanish patients, there was a significant association between three SNPs (rs6269, rs4818, and rs4680) and the presence of FM when compared with healthy controls. Moreover, in Spanish patients with the 'high pain sensitivity' haplotype (ACCG), the disease, as assessed by the FIQ, was more severe. By contrast, Mexican patients displayed only a weak association between rs6269 and rs165599, and some FIQ subscales. In our group of Spanish patients, there was an association between FM and the COMT haplotype previously associated with high pain sensitivity. This association was not observed in Mexican patients. Studies with a larger sample size are needed in order to verify or amend these preliminary results. PMID:17961261

  11. Multiple haplotype-resolved genomes reveal population patterns of gene and protein diplotypes.

    PubMed

    Hoehe, Margret R; Church, George M; Lehrach, Hans; Kroslak, Thomas; Palczewski, Stefanie; Nowick, Katja; Schulz, Sabrina; Suk, Eun-Kyung; Huebsch, Thomas

    2014-11-26

    To fully understand human biology and link genotype to phenotype, the phase of DNA variants must be known. Here we present a comprehensive analysis of haplotype-resolved genomes to assess the nature and variation of haplotypes and their pairs, diplotypes, in European population samples. We use a set of 14 haplotype-resolved genomes generated by fosmid clone-based sequencing, complemented and expanded by up to 372 statistically resolved genomes from the 1000 Genomes Project. We find immense diversity of both haploid and diploid gene forms, up to 4.1 and 3.9 million corresponding to 249 and 235 per gene on average. Less than 15% of autosomal genes have a predominant form. We describe a 'common diplotypic proteome', a set of 4,269 genes encoding two different proteins in over 30% of genomes. We show moreover an abundance of cis configurations of mutations in the 386 genomes with an average cis/trans ratio of 60:40, and distinguishable classes of cis- versus trans-abundant genes. This work identifies key features characterizing the diplotypic nature of human genomes and provides a conceptual and analytical framework, rich resources and novel hypotheses on the functional importance of diploidy.

  12. A candidate transacting modulator of fetal hemoglobin gene expression in the Arab-Indian haplotype of sickle cell anemia.

    PubMed

    Vathipadiekal, Vinod; Farrell, John J; Wang, Shuai; Edward, Heather L; Shappell, Heather; Al-Rubaish, A M; Al-Muhanna, Fahad; Naserullah, Z; Alsuliman, A; Qutub, Hatem Othman; Simkin, Irene; Farrer, Lindsay A; Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Mostoslavsky, Gustavo; Murphy, George J; Patra, Pradeep K; Chui, David H K; Alsultan, Abdulrahman; Al-Ali, Amein K; Sebastiani, Paola; Steinberg, Martin H

    2016-11-01

    Fetal hemoglobin (HbF) levels are higher in the Arab-Indian (AI) β-globin gene haplotype of sickle cell anemia compared with African-origin haplotypes. To study genetic elements that effect HbF expression in the AI haplotype we completed whole genome sequencing in 14 Saudi AI haplotype sickle hemoglobin homozygotes-seven selected for low HbF (8.2% ± 1.3%) and seven selected for high HbF (23.5% ± 2.6%). An intronic single nucleotide polymorphism (SNP) in ANTXR1, an anthrax toxin receptor (chromosome 2p13), was associated with HbF. These results were replicated in two independent Saudi AI haplotype cohorts of 120 and 139 patients, but not in 76 Saudi Benin haplotype, 894 African origin haplotype and 44 AI haplotype patients of Indian origin, suggesting that this association is effective only in the Saudi AI haplotype background. ANTXR1 variants explained 10% of the HbF variability compared with 8% for BCL11A. These two genes had independent, additive effects on HbF and together explained about 15% of HbF variability in Saudi AI sickle cell anemia patients. ANTXR1 was expressed at mRNA and protein levels in erythroid progenitors derived from induced pluripotent stem cells (iPSCs) and CD34(+) cells. As CD34(+) cells matured and their HbF decreased ANTXR1 expression increased; as iPSCs differentiated and their HbF increased, ANTXR1 expression decreased. Along with elements in cis to the HbF genes, ANTXR1 contributes to the variation in HbF in Saudi AI haplotype sickle cell anemia and is the first gene in trans to HBB that is associated with HbF only in carriers of the Saudi AI haplotype. Am. J. Hematol. 91:1118-1122, 2016. © 2016 Wiley Periodicals, Inc.

  13. Investigation of the functional role of human Interleukin-8 gene haplotypes by CRISPR/Cas9 mediated genome editing

    PubMed Central

    Benakanakere, Manjunatha R.; Finoti, Livia S.; Tanaka, Urara; Grant, Gregory R.; Scarel-Caminaga, Raquel M.; Kinane, Denis F.

    2016-01-01

    Interleukin-8 (IL-8) gene polymorphisms have been considered as susceptibility factors in periodontal disease. However, the functional roles of IL-8 gene haplotypes have not been investigated. Here, we demonstrate for the first time the use of the CRISPR/Cas9 system to engineer the IL-8 gene, and tested the functionality of different haplotypes. Two sgRNAs vectors targeting the IL-8 gene and the naked homologous repair DNA carrying different haplotypes were used to successfully generate HEK293T cells carrying the AT genotype at the first SNP - rs4073 (alias -251), TT genotype at the second SNP - rs2227307 (alias +396), TC or CC genotypes at the third SNP - rs2227306 (alias +781) at the IL-8 locus. When stimulated with Poly I:C, ATC/TTC haplotype, cells significantly up-regulated the IL-8 at both transcriptional and translational levels. To test whether ATC/TTC haplotype is functional, we used a trans-well assay to measure the transmigration of primary neutrophils incubated with supernatants from the Poly I:C stimulation experiment. ATC/TTC haplotype cells significantly increased transmigration of neutrophils confirming the functional role for this IL-8 haplotype. Taken together, our data provides evidence that carriage of the ATC/TTC haplotype in itself may increase the influx of neutrophils in inflammatory lesions and influence disease susceptibility. PMID:27499075

  14. IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing.

    PubMed

    Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

    2016-11-28

    Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only.

  15. Parallel evolutionary events in the haptoglobin gene clusters of rhesus monkey and human

    SciTech Connect

    Erickson, L.M.; Maeda, N.

    1994-08-01

    Parallel occurrences of evolutionary events in the haptoglobin gene clusters of rhesus monkeys and humans were studied. We found six different haplotypes among 11 individuals from two rhesus monkey families. The six haplotypes include two types of haptoglobin gene clusters: one type with a single gene and the other with two genes. DNA sequence analysis indicates that the one-gene and the two-gene clusters were both formed by unequal homologous crossovers between two genes of an ancestral three-gene cluster, near exon 5, the longest exon of the gene. This exon is also the location where a separate unequal homologous crossover occured in the human lineage, forming the human two-gene haptoglobin gene cluster from an ancestral three-gene cluster. The occurrence of independent homologous unequal crossovers in rhesus monkey and in human within the same region of DNA suggests that the evolutionary history of the haptoglobin gene cluster in primates is the consequence of frequent homologous pairings facilitated by the longest and most conserved exon of the gene. 27 refs., 7 figs., 1 tab.

  16. Linkage-disequilibrium mapping of disease genes by reconstruction of ancestral haplotypes in founder populations.

    PubMed Central

    Service, S K; Lang, D W; Freimer, N B; Sandkuijl, L A

    1999-01-01

    Linkage disequilibrium (LD) mapping may be a powerful means for genome screening to identify susceptibility loci for common diseases. A new statistical approach for detection of LD around a disease gene is presented here. This method compares the distribution of haplotypes in affected individuals versus that expected for individuals descended from a common ancestor who carried a mutation of the disease gene. Simulations demonstrate that this method, which we term "ancestral haplotype reconstruction" (AHR), should be powerful for genome screening of phenotypes characterized by a high degree of etiologic heterogeneity, even with currently available marker maps. AHR is best suited to application in isolated populations where affected individuals are relatively recently descended (< approximately 25 generations) from a common disease mutation-bearing founder. PMID:10330361

  17. The S haplotype-specific F-box protein gene, SFB, is defective in self-compatible haplotypes of Prunus avium and P. mume.

    PubMed

    Ushijima, Koichiro; Yamane, Hisayo; Watari, Akiko; Kakehi, Eiko; Ikeda, Kazuo; Hauck, Nathanael R; Iezzoni, Amy F; Tao, Ryutaro

    2004-08-01

    Many Prunus species, including sweet cherry and Japanese apricot, of the Rosaceae, display an S-RNase-based gametophytic self-incompatibility (GSI). The specificity of this outcrossing mechanism is determined by a minimum of two genes that are located in a multigene complex, termed the S locus, which controls the pistil and pollen specificities. SFB, a gene located in the S locus region, encodes an F-box protein that has appropriate S haplotype-specific variation to be the pollen determinant in the self-incompatibility reaction. This study characterizes SFBs of two self-compatible (SC) haplotypes, S(4') and S(f), of Prunus. S(4') of sweet cherry is a pollen-part mutant (PPM) that was produced by X-ray irradiation, while S(f) of Japanese apricot is a naturally occurring SC haplotype that is considered to be a PPM. DNA sequence analysis revealed defects in both SFB(4') and SFB(f). A 4 bp deletion upstream from the HVa coding region of SFB(4') causes a frame-shift that produces transcripts of a defective SFB lacking the two hypervariable regions, HVa and HVb. Similarly, the presence of a 6.8 kbp insertion in the middle of the SFB(f) coding region leads to transcripts for a defective SFB lacking the C-terminal half that contains HVa and HVb. As all reported SFBs of functional S haplotypes encode intact SFB, the fact that the partial loss-of-function mutations in SFB are present in SC mutant haplotypes of Prunus provides additional evidence that SFB is the pollen S gene in GSI in Prunus.

  18. Fabry disease: polymorphic haplotypes and a novel missense mutation in the GLA gene.

    PubMed

    Ferri, L; Guido, C; la Marca, G; Malvagia, S; Cavicchi, C; Fiumara, A; Barone, R; Parini, R; Antuzzi, D; Feliciani, C; Zampetti, A; Manna, R; Giglio, S; Della Valle, C M; Wu, X; Valenzano, K J; Benjamin, R; Donati, M A; Guerrini, R; Genuardi, M; Morrone, A

    2012-03-01

    Fabry disease: polymorphic haplotypes and a novel missense mutation in the GLA gene. Fabry disease (FD) is an X-linked lysosomal storage disorder with a heterogeneous spectrum of clinical manifestations that are caused by the deficiency of α-galactosidase A (α-Gal-A) activity. Although useful for diagnosis in males, enzyme activity is not a reliable biochemical marker in heterozygous females due to random X-chromosome inactivation, thus rendering DNA sequencing of the α-Gal-A gene, alpha-galactosidase gene (GLA), the most reliable test for the confirmation of diagnosis in females. The spectrum of GLA mutations is highly heterogeneous. Many polymorphic GLA variants have been described, but it is unclear if haplotypes formed by combinations of such variants correlate with FD, thus complicating molecular diagnosis in females with normal α-Gal-A activity. We tested 67 female probands with clinical manifestations that may be associated with FD and 110 control males with normal α-Gal-A activity. Five different combinations of GLA polymorphic variants were identified in 14 of the 67 females, whereas clearcut pathogenetic alterations, p.Met51Ile and p.Met290Leu, were identified in two cases. The latter has not been reported so far, and both mutant forms were found to be responsive to the pharmacological chaperone deoxygalactonojirimycin (DGJ; migalastat hydrochloride). Analysis of the male control population, as well as male relatives of a suspected FD female proband, permitted the identification of seven different GLA gene haplotypes in strong linkage disequilibrium. The identification of haplotypes in control males provides evidence against their involvement in the development of FD phenotypic manifestations.

  19. Haplotype association analysis of genes within the WNT signalling pathways in diabetic nephropathy

    PubMed Central

    2013-01-01

    Background Renal interstitial fibrosis and glomerular sclerosis are hallmarks of diabetic nephropathy (DN) and several studies have implicated members of the WNT pathways in these pathological processes. This study comprehensively examined common genetic variation within the WNT pathway for association with DN. Methods Genes within the WNT pathways were selected on the basis of nominal significance and consistent direction of effect in the GENIE meta-analysis dataset. Common SNPs and common haplotypes were examined within the selected WNT pathway genes in a white population with type 1 diabetes, discordant for DN (cases: n = 718; controls: n = 749). SNPs were genotyped using Sequenom or Taqman assays. Association analyses were performed using PLINK, to compare allele and haplotype frequencies in cases and controls. Correction for multiple testing was performed by either permutation testing or using false discovery rate. Results A logistic regression model including collection centre, duration of diabetes, and average HbA1c as covariates highlighted three SNPs in GSK3B (rs17810235, rs17471, rs334543), two in DAAM1 (rs1253192, rs1252906) and one in NFAT5 (rs17297207) as being significantly (P < 0.05) associated with DN, however these SNPs did not remain significant after correction for multiple testing. Logistic regression of haplotypes, with ESRD as the outcome, and pairwise interaction analyses did not yield any significant results after correction for multiple testing. Conclusions These results indicate that both common SNPs and common haplotypes of WNT pathway genes are not strongly associated with DN. However, this does not completely exclude these or the WNT pathways from association with DN, as unidentified rare genetic or copy number variants could still contribute towards the genetic architecture of DN. PMID:23777469

  20. ICOS gene haplotypes correlate with IL10 secretion and multiple sclerosis evolution.

    PubMed

    Castelli, Luca; Comi, Cristoforo; Chiocchetti, Annalisa; Nicola, Stefania; Mesturini, Riccardo; Giordano, Mara; D'Alfonso, Sandra; Cerutti, Elisa; Galimberti, Daniela; Fenoglio, Chiara; Tesser, Fabiana; Yagi, Junji; Rojo, José Maria; Perla, Franco; Leone, Maurizio; Scarpini, Elio; Monaco, Francesco; Dianzani, Umberto

    2007-05-01

    Human ICOS is a T cell costimulatory molecule supporting IL10 secretion. A pilot study investigating variations of the ICOS gene 3'UTR detected 8 polymorphisms forming three haplotypes (A, B, C). Haplotype-A and -C displayed the highest difference. Activated T cells from healthy AA homozygotes expressed significantly less ICOS and secreted more IL10 than AC heterozygotes, whereas AB heterozygotes displayed intermediate levels. Analysis of 441 multiple sclerosis patients and 793 controls showed that frequency of AA homozygosity was significantly lower in MS patients with relapsing-remitting onset (N=416) than in controls (OR=0.70). Moreover, AA patients with relapsing-remitting onset had lower relapse rate and multiple sclerosis severity score than non-AA patients.

  1. Molecular characterization and evolution of self-incompatibility genes in Arabidopsis thaliana: the case of the Sc haplotype.

    PubMed

    Dwyer, Kathleen G; Berger, Martin T; Ahmed, Rimsha; Hritzo, Molly K; McCulloch, Amanda A; Price, Michael J; Serniak, Nicholas J; Walsh, Leonard T; Nasrallah, June B; Nasrallah, Mikhail E

    2013-03-01

    The switch from an outcrossing mode of mating enforced by self-incompatibility to self-fertility in the Arabidopsis thaliana lineage was associated with mutations that inactivated one or both of the two genes that comprise the self-incompatibility (SI) specificity-determining S-locus haplotype, the S-locus receptor kinase (SRK) and the S-locus cysteine-rich (SCR) genes, as well as unlinked modifier loci required for SI. All analyzed A. thaliana S-locus haplotypes belong to the SA, SB, or SC haplotypic groups. Of these three, the SC haplotype is the least well characterized. Its SRKC gene can encode a complete open-reading frame, although no functional data are available, while its SCRC sequences have not been isolated. As a result, it is not known what mutations were associated with inactivation of this haplotype. Here, we report on our analysis of the Lz-0 accession and the characterization of its highly rearranged SC haplotype. We describe the isolation of its SCRC gene as well as the subsequent isolation of SCRC sequences from other SC-containing accessions and from the A. lyrata S36 haplotype, which is the functional equivalent of the A. thaliana SC haplotype. By performing transformation experiments using chimeric SRK and SCR genes constructed with SC- and S36-derived sequences, we show that the SRKC and SCRC genes of Lz-0 and at least a few other SC-containing accessions are nonfunctional, despite SCRC encoding a functional full-length protein. We identify the probable mutations that caused the inactivation of these genes and discuss our results in the context of mechanisms of S-locus inactivation in A. thaliana.

  2. Haplotypes in BMP4 and FGF Genes Increase the Risk of Peri-Implantitis.

    PubMed

    Coelho, Renata Barboza; Gonçalves, Roberto; Villas-Boas, Ricardo de Mello; Bonato, Leticia Ladeira; Quinelato, Valquiria; Pinheiro, Aristides da Rosa; Machado, Aldir; Nunes, Carlos Henrique Ramirez; Gonçalves, Rackel; Vieira, Alexandre Resende; Granjeiro, José Mauro; Casado, Priscila Ladeira

    2016-01-01

    Despite the success of osseointegrated implants, failures have increased significantly, associated with development of peri-implantitis. Multiple factors influence the peri-implant bone loss, including environmental and genetic causes. BMPs (Bone morphogenetic proteins) are growth factors that induce bone formation. FGF (fibroblast growth factors) and their receptors (FGFRs) play important roles by controlling the levels of cell proliferation, differentiation and migration. BMP/FGF relationship is responsible for promoting bone regeneration and bone loss. The aim of this study was to analyze the correlation between BMP4, FGF3, FGF10 and FGFR1 genes and peri-implant bone loss. Two hundred and fifteen volunteers, with 754 dental implants, were submitted to oral examination and divided in healthy group (n=129) and peri-implantitis group (n=86). Thirteen polymorphisms in BMP4, FGF3, FGF10 and FGFR1 genes were analyzed individually and in haplotype. The chi-square test correlated genotypes, allelic and haplotype frequencies. Values of p<0.05 were considered significant. Volunteers with peri-implantitis demonstrated high incidence of total edentulism (p<0.0001) and thin peri-implant phenotype (p<0.04). Higher incidence of spontaneous bleeding, plaque and implant mobility was observed in peri-implantitis group (p<0.0001 for all). The TT polymorphic genotype for BMP4 rs2761884 was associated with healthy peri-implant (p=0.01). FGF3 rs4631909 (TT+CT genotype) also showed association with the control group (p=0.04). The frequency of C allele for FGF3 rs4631909 showed a tendency for association with peri-implantitis (p=0.08). FGF10 CCTG (p=0.03), BMP4 GAAA (p=0.05) and GGGA (p=0.02) haplotypes were associated with peri-implantitis (p=0.03). Therefore, it may be concluded that BMP4 and FGF10 haplotypes are associated with peri-implantitis.

  3. Clustering cancer gene expression data by projective clustering ensemble

    PubMed Central

    Yu, Xianxue; Yu, Guoxian

    2017-01-01

    Gene expression data analysis has paramount implications for gene treatments, cancer diagnosis and other domains. Clustering is an important and promising tool to analyze gene expression data. Gene expression data is often characterized by a large amount of genes but with limited samples, thus various projective clustering techniques and ensemble techniques have been suggested to combat with these challenges. However, it is rather challenging to synergy these two kinds of techniques together to avoid the curse of dimensionality problem and to boost the performance of gene expression data clustering. In this paper, we employ a projective clustering ensemble (PCE) to integrate the advantages of projective clustering and ensemble clustering, and to avoid the dilemma of combining multiple projective clusterings. Our experimental results on publicly available cancer gene expression data show PCE can improve the quality of clustering gene expression data by at least 4.5% (on average) than other related techniques, including dimensionality reduction based single clustering and ensemble approaches. The empirical study demonstrates that, to further boost the performance of clustering cancer gene expression data, it is necessary and promising to synergy projective clustering with ensemble clustering. PCE can serve as an effective alternative technique for clustering gene expression data. PMID:28234920

  4. Eda haplotypes in three-spined stickleback are associated with variation in immune gene expression

    PubMed Central

    Robertson, Shaun; Bradley, Janette E.; MacColl, Andrew D. C.

    2017-01-01

    Haplotypes underlying local adaptation and speciation are predicted to have numerous phenotypic effects, but few genes involved have been identified, with much work to date concentrating on visible, morphological, phenotypes. The link between genes controlling these adaptive morphological phenotypes and the immune system has seldom been investigated, even though changes in the immune system could have profound adaptive consequences. The Eda gene in three-spined stickleback is one of the best studied major adaptation genes; it directly controls bony plate architecture and has been associated with additional aspects of adaptation to freshwater. Here, we exposed F2 hybrids, used to separate Eda genotype from genetic background, to contrasting conditions in semi-natural enclosures. We demonstrate an association between the Eda haplotype block and the expression pattern of key immune system genes. Furthermore, low plated fish grew less and experienced higher burdens of a common ectoparasite with fitness consequences. Little is currently known about the role of the immune system in facilitating adaptation to novel environments, but this study provides an indication of its potential importance. PMID:28195171

  5. No association between polymorphisms/haplotypes of the vascular endothelial growth factor gene and preeclampsia

    PubMed Central

    2011-01-01

    Background Preeclampsia (PE) is the first worldwide cause of death in pregnant women, intra-uterine growth retardation, and fetal prematurity. Some vascular endothelial grown factor gene (VEGF) polymorphisms have been associated to PE and other pregnancy disturbances. We evaluated the associations between VEGF genotypes/haplotypes and PE in Mexican women. Methods 164 pregnant women were enrolled in a case-control study (78 cases and 86 normotensive pregnant controls). The rs699947 (-2578C/A), rs1570360 (-1154G/A), rs2010963 (+405G/C), and rs25648 (-7C/T), VEGF variants were discriminated using Polymerase Chain Reaction - Restriction Fragment Length Polymorphism (PCR-RFLP) methods or Taqman single nucleotide polymorphism (SNP) assays. Results The proportions of the minor allele for rs699947, rs1570360, rs2010963, and rs25648 VEGF SNPs were 0.33, 0.2, 0.39, and 0.17 in controls, and 0.39, 0.23, 0.41, and 0.15 in cases, respectively (P values > 0.05). The most frequent haplotypes of rs699947, rs1570360, rs2010963, and rs25648 VEGF SNPs, were C-G-C-C and C-G-G-C with frequencies of 0.39, 0.21 in cases and 0.37, 0.25 in controls, respectively (P values > 0.05) Conclusion There was no evidence of an association between VEGF alleles, genotypes, or haplotypes frequencies and PE in our study. PMID:21575227

  6. Haplotype-Based Analysis of Genes Associated With Risk of Adverse Skin Reactions After Radiotherapy in Breast Cancer Patients

    SciTech Connect

    Suga, Tomo; Ishikawa, Atsuko; Kohda, Masakazu; Otsuka, Yoshimi; Yamada, Shigeru; Yamamoto, Naohito; Shibamoto, Yuta; Ogawa, Yoshihiro; Nomura, Kuninori; Sho, Keizen; Omura, Motoko; Sekiguchi, Kenji; Kikuchi, Yuzo; Michikawa, Yuichi; Noda, Shuhei; Sagara, Masashi; Ohashi, Jun; Yoshinaga, Shinji; Mizoe, Junetsu; Tsujii, Hirohiko

    2007-11-01

    Purpose: To identify haplotypes of single nucleotide polymorphism markers associated with the risk of early adverse skin reactions (EASRs) after radiotherapy in breast cancer patients. Methods and Materials: DNA was sampled from 399 Japanese breast cancer patients who qualified for breast-conserving radiotherapy. Using the National Cancer Institute-Common Toxicity Criteria scoring system, version 2, the patients were grouped according to EASRs, defined as those occurring within 3 months of starting radiotherapy (Grade 1 or less, n = 290; Grade 2 or greater, n = 109). A total of 999 single nucleotide polymorphisms from 137 candidate genes for radiation susceptibility were genotyped, and the haplotype associations between groups were assessed. Results: The global haplotype association analysis (p < 0.05 and false discovery rate < 0.05) indicated that estimated haplotypes in six loci were associated with EASR risk. A comparison of the risk haplotype with the most frequent haplotype in each locus showed haplotype GGTT in CD44 (odds ratio [OR] = 2.17; 95% confidence interval [CI], 1.07-4.43) resulted in a significantly greater EASR risk. Five haplotypes, CG in MAD2L2 (OR = 0.55; 95% CI, 0.35-0.87), GTTG in PTTG1 (OR = 0.48; 95% CI, 0.24-0.96), TCC (OR = 0.48; 95% CI, 0.26-0.89) and CCG (OR = 0.50; 95% CI, 0.27-0.92) in RAD9A, and GCT in LIG3 (OR = 0.46; 95% CI, 0.22-0.93) were associated with a reduced EASR risk. No significant risk haplotype was observed in REV3L. Conclusion: Individual radiosensitivity can be partly determined by these haplotypes in multiple loci. Our findings may lead to a better understanding of the mechanisms underlying the genetic variation in radiation sensitivity and resistance among breast cancer patients.

  7. Evidence that the ancestral haplotype in Australian hemochromatosis patients may be associated with a common mutation in the gene

    SciTech Connect

    Crawford, D.H.G.; Powell, L.W.; Leggett, B.A.

    1995-08-01

    Hemochromatosis (HC) is a common inherited disorder of iron metabolism for which neither the gene nor biochemical defect have yet been identified. The aim of this study was to look for clinical evidence that the predominant ancestral haplotype in Australian patients is associated with a common mutation in the gene. We compared indices of iron metabolism and storage in three groups of HC patients categorized according to the presence of the ancestral haplotype (i.e., patients with two copies, one copy, and no copies of the ancestral haplotype). We also examined iron indices in two groups of HC heterozygotes (those with the ancestral haplotype and those without) and in age-matched controls. These analyses indicate that (i) HC patients with two copies of the ancestral haplotype show significantly more severe expression of the disorder than those with one copy or those without, (ii) HC heterozygotes have partial clinical expression, which may be influenced by the presence of the ancestral haplotype in females but not in males, and (iii) the high population frequency of the HC gene may be the result of the selective advantage conferred by protecting heterozygotes against iron deficiency. 18 refs., 3 tabs.

  8. Serum Interleukin-18 and Its Gene Haplotypes Profile as Predictors in Patients with Diabetic Nephropathy

    PubMed Central

    Elneam, Ahmed I. Abd; Mansour, Nahla M.; Zaki, Nayel A.; Taher, Mohamed A.

    2016-01-01

    BACKGROUND: Diabetic nephropathy (DN) is known as an acute microvascular complexity as a subsequence progression in diabetes mellitus type 1 and 2. Many evidence pointed that the proinflammatory cytokine Interleukin (IL)-18 might be involved in the pathogenesis of DN. AIM: The current study aimed to evaluate the association of serum IL-18 and its promoter gene polymorphisms with diabetic nephropathy. METHODS: This study included 62 diabetic nephropathy patients (DN group) compared to 52 diabetes mellitus patients (DM group). The two groups were subjected to anthropometry assessment, molecular studies including SNP genotyping by RFLP and finally statistical analysis. RESULTS: The assessment of the serum IL-18 level and the frequencies of its allele and haplotype: -137G/C, -607C/A and -656G/T among the DN and DM subjects revealed that -137G allele has significant variation between DN and DM subjects (about 80.8%, P = 0.05) but, no significant variation in -607 or -656 alleles IL-18 gene promoter. CONCLUSION: These data confirm the impact of high serum IL-18 and the haplotype of the polymorphism located in the promoter region of the IL-18 gene with the DN. PMID:27703550

  9. Haplotype analysis of the DQA genes in sheep: evidence supporting recombination between the loci.

    PubMed

    Hickford, J G H; Zhou, H; Fang, Q

    2007-03-01

    The ovine class II major histocompatibility complex mediates specific immune responses to exogenous antigens in sheep. A number of ovine class II loci have been identified, and most of them appear to be polymorphic. In this study we investigated the DQA1 locus of 520 sheep and the DQA2 locus of over 40,000 sheep, finding 12 sequences and 22 sequences, respectively, using DQA1- and DQA2-specific PCR primers. Among the DQA2 sequences, 2 groups of sequences can be found: those that share homology with the DQA2 sequences from closely related species and those that cluster with bovine DQA3 and DQA4 sequences and have been called DQA2-like in sheep. The occurrence of these DQA2-like sequences was once again confirmed to correspond with the absence of detectable DQA1 sequences, suggesting that they are found at the same location as DQA1. Within the sheep studied, 37 haplotypes could be detected, 23 being haplotypes of DQA1 and DQA2 sequences and with frequencies ranging from 0.38 to 9.27%, and 14 being haplotypes of DQA2 and DQA2-like sequences and with frequencies ranging from 0.03 to 14.53%. We discovered 12 DQA1-DQA2 combinations that were derived from 5 DQA1 alleles and 4 DQA2 alleles, and 8 DQA2-DQA2-like combinations from 5 DQA2 alleles and 2 DQA2-like sequences. The frequency of occurrence of recombined DQA1-DQA2 sequences and recombined DQA2-DQA2-like sequences is similar, once again suggesting the DQA2-like sequences are found at the DQA1 locus.

  10. Neandertal origin of genetic variation at the cluster of OAS immunity genes.

    PubMed

    Mendez, Fernando L; Watkins, Joseph C; Hammer, Michael F

    2013-04-01

    Analyses of ancient DNA from extinct humans reveal signals of at least two independent hybridization events in the history of non-African populations. To date, there are very few examples of specific genetic variants that have been rigorously identified as introgressive. Here, we survey DNA sequence variation in the OAS gene cluster on chromosome 12 and provide strong evidence that a haplotype extending for ~185 kb introgressed from Neandertals. This haplotype is nearly restricted to Eurasians and is estimated to have diverged from the Neandertal sequence ~125 kya. Despite the potential for novel functional variation, the observed frequency of this haplotype is consistent with neutral introgression. This is the second locus in the human genome, after STAT2, carrying distinct haplotypes that appear to have introgressed separately from both Neandertals and Denisova.

  11. An anther-specific gene encoded by an S locus haplotype of Brassica produces complementary and differentially regulated transcripts.

    PubMed Central

    Boyes, D C; Nasrallah, J B

    1995-01-01

    The self-incompatibility locus of Brassica consists of a coadapted gene complex that contains at least two genes required for the recognition and inhibition of pollen by the stigma when self-pollinated. Here, we report the identification of a third S locus-linked gene from the S2 haplotype of Brassica oleracea. This gene, which we designated SLA (for S Locus Anther), is a novel gene with an unusual structure. SLA is transcribed from two promoters to produce two complementary anther-specific transcripts, one spliced and the other unspliced, that accumulate in an antiparallel manner in developing microspores and anthers. The sequence of the spliced transcript showed the presence of two open reading frames that predict proteins of 10 and 7.5 kD. Neither transcript was produced in a self-compatible B. napus strain carrying an S2-like haplotype, indicating that the SLA gene in this strain is nonfunctional. Interestingly, sequences related to SLA were not detected in DNA or RNA from plants carrying S haplotypes other than S2. The haplotype specificity of SLA, its anther-specific expression, and its physical linkage to the S locus are properties expected for a gene that encodes a determinant of S2 specificity in pollen. PMID:7549484

  12. Beta s gene in Central Iran is in linkage disequilibrium with the Indian-Arab haplotype.

    PubMed

    Rahgozar, S; Poorfathollah, A A; Moafi, A R; Old, J M

    2000-11-01

    Sickle cell anemia is not considered to be a significant disease in Iran, although the sickle cell trait is estimated to have a high incidence in the Southern provinces. Since 1977, when the presence of a mild sickle cell anemia was reported in this country, there have been no further investigations published giving precise data on the incidence and origins of the sickle cell mutation in Iran. We report here the finding of patients with the sickle cell trait, sickle cell anemia, and sickle-beta thalassemia in Central Iran. A survey of 300 individuals from a village in Southeast Esfahan revealed an incidence of the sickle cell trait of 8.33%. "Cascade screening" enabled 96 relatives in four surrounding villages to be tested, and the at-risk couples were offered counseling as a "sickle cell control program." The hematological indices and HbF levels of the affected patients were determined. The HbF levels were unusually high, ranging from 18% to 41.4%, and SS patients with the highest levels were asymptomatic. Linkage analysis revealed the betaS gene haplotype in this population to be the Indian-Arab haplotype.

  13. Haplotype-tagging single nucleotide polymorphisms in the GSTP1 gene promoter and susceptibility to lung cancer☆

    PubMed Central

    Tan, Xiang-Lin; Moslehi, Roxana; Han, WeiGuo; Spivack, Simon D.

    2013-01-01

    Background Glutathione S-transferase (GST) P1 is a major phase II xenobiotic-metabolizing enzyme in the human lung. Our laboratory had previously identified nine single nucleotide polymorphisms (SNPs) in the GSTP1 gene promoter, which were then grouped into three main haplotypes (Hap1, Hap2, and Hap3) based on statistical inference. Hap3 was found to display a high expression phenotype. The main objective of the current study was to test the association between GSTP1 promoter haplotypes with the risk of lung cancer after determining the promoter haplotypes experimentally through cloning and sequencing. Methods We conducted a case–control analysis of 150 subjects with lung cancer and 329 controls with no personal history of the disease. The three statistically inferred GSTP1 promoter haplotypes were confirmed experimentally through cloning and sequencing. Haplotype-tagging SNPs were selected and GSTP1 haplotypes were tested for genetic association to lung cancer using unconditional logistic regression after adjusting for confounders. Statistical interaction between GSTP1 promoter haplotypes with either cigarette smoking or dietary fruit and vegetable intake were tested using the likelihood ratio test. Results We did not find protective effects of Hap3 against lung cancer, despite an adequately powered design for this main effect. Homozygous variants of tagSNPs –1738 T >A and –354 G > T, which tag Hap2, showed an increased (but statistically non-significant) risk of lung cancer among all subjects as well as among individuals with low fruit and vegetable intake, compared to homozygous wildtypes for these SNPs. We did not find significant interactions between Hap2 and dietary intake of fruits and vegetables. Conclusions Our results do not support significant main and modifying effects for GSTP1 promoter haplotypes on susceptibility to lung cancer in this population, but reinforce the protective effects of dietary intake of fruits and vegetables. PMID:19282111

  14. [Haplotype and linkage analysis of HLA-I classical genes in Chinese Han population].

    PubMed

    Yuan, Fang; Sun, Yu-Ying; Luo, Yuan; Liang, Fei; Liu, Nan; Jin, Li; Liu, Jin-Feng; Liu, Shu-Guang; Xi, Yong-Zhi

    2007-10-01

    The aim of this study was to investigate the parameters of gene frequencies, haplotype frequencies and linkage disequilibrium of HLA-A, -B, -Cw in HLA classical I loci for Chinese Han population. HLA-A, HLA-B and HLA-Cw loci were genotyped in 1014 unrelated China people using low resolution PCR-SSP typing method, and their genetic parameters were analyzed by statistic methods. The results indicated that among all the detected HLA-I genes, A*02 (0.33), A*11 (0.24), B*15 (0.14), B*13 (0.13), Cw*03 (0.25) and Cw*07 (0.18) were the popular gene groups distributing in Chinese Han population, and A*02-B*46 (0.071), A*11-B*15 (0.051), A*02-Cw*01 (0.084), A*11-Cw*03 (0.079), B*46-Cw*01 (0.095) and B*13-Cw*03 (0.071) were the predominant haplotypes in Han population. Additionally, A*02-B*46, A*30-B*13, A*30-Cw*06, A*02-Cw*01, B*46-Cw*01 and B*58-Cw*03 were statistically significant with strong linkage disequilibrium. While A*02-B*15, A*02-B*40, A*24-Cw*03, A*02-Cw*03 and A*31-Cw*03 were in low linkage disequilibrium, among them A*24-Cw*03 appeared frequently in HLA recombination events. In addition, A*02-B*46-Cw*01 (0.075), A*30-B*13-Cw*06 (0.046), A*11-B*13-Cw*03 (0.045), A*33-B*58-Cw*03 (0.044), A*11-B*15-Cw*08 (0.027), A*02-B*38-Cw*07 (0.023) and A*11-B*40-Cw*07 (0.022) were the main extended haplotypes in Han population. In conclusions, this study investigated systematically the genetic polymorphism features of Chinese Han population, which may provide useful genetic parameters for researches in colonial evolution, clinical transplantation and disease susceptibility.

  15. Identification of haplotypes in promoter of prolactin gene and their effect on egg production and quality traits in layer chicken.

    PubMed

    Bhattacharya, T K; Chatterjee, R N; Sharma, R P; Niranjan, M; Rajkumar, U; Reddy, B L N

    2011-04-01

    Expression of prolactin hormone is a crucial event in regulating egg production in chickens for which promoter plays the vital role in expressing the prolactin gene. The objective of the present study was to identify haplotypes in the prolactin promoter and their effects on egg production and egg quality traits in White Leghorn chicken. Single stranded conformation polymorphism followed by sequencing was conducted to explore polymorphism at 561 bp promoter of prolactin gene. The effect of haplotype combinations on egg production and quality traits were estimated following general linear model technique. The expression of prolactin by different haplogroups was quantified by qPCR. Total 28 haplotypes were found in White Leghorn chicken of which h1 haplotype possessed the highest frequency of 0.46 and h8, h14, h16, h25, h26, and h28 haplotypes had the lowest frequency (0.1%). The egg production up to 52 and 64 weeks of age were found to be significantly (p < 0.05) associated with haplotype combinations where the highest 52-w (52 weeks) egg production was found in animals with h1/h22 combination and the lowest production was observed in the birds with h1/h2 haplogroup. The haplotype combinations had the significant effect (p < 0.05) on Haugh Unit, yolk index and albumen weight at 40 weeks of age; Haugh Unit and albumen weight at 52 weeks of age and Haugh unit, yolk weight and yolk percentage at 64 weeks of age. The prolactin expression in h1/h22 birds was found to be the lowest and in h1/h5 birds to be the highest. The prolactin expression showed significant effect on 52-w egg production and albumin weight at 52 weeks age. In conclusion, it may be stated that the prolactin promoter was highly polymorphic and had the significant association with egg production and quality traits in White Leghorn chicken.

  16. Haplotype map of sickle cell anemia in Tunisia.

    PubMed

    Moumni, Imen; Ben Mustapha, Maha; Sassi, Sarra; Zorai, Amine; Ben Mansour, Ikbel; Douzi, Kais; Chouachi, Dorra; Mellouli, Fethi; Bejaoui, Mohamed; Abbes, Salem

    2014-01-01

    β-Globin haplotypes are important to establish the ethnic origin and predict the clinical development of sickle cell disease patients (SCD). To determine the chromosomal background of β (S) Tunisian sickle cell patients, in this first study in Tunisia, we have explored four polymorphic regions of β-globin cluster on chromosome 11. It is the 5' region of β-LCR-HS2 site, the intervening sequence II (IVSII) region of two fetal ((G)γ and (A)γ) genes and the 5' region of β-globin gene. The results reveal a high molecular diversity of a microsatellite configuration describing the sequences haplotypes. The linkage disequilibrium analysis showed various haplotype combinations giving 22 "extended haplotypes". These results confirm the utility of the β-globin haplotypes for population studies and contribute to knowledge of the Tunisian gene pool, as well as establishing the role of genetic markers in physiopathology of SCD.

  17. Haplotypes of the D-Amino Acid Oxidase Gene Are Significantly Associated with Schizophrenia and Its Neurocognitive Deficits

    PubMed Central

    Hwu, Hai-Gwo; Fann, Cathy Shen-Jang; Yang, Ueng-Cheng; Yang, Wei-Chih; Hsu, Pei-Chun; Chang, Chien-Ching; Wen, Chun-Chiang; Tsai-Wu, Jyy-Jih; Hwang, Tzung-Jeng; Hsieh, Ming H.; Liu, Chen-Chung; Chien, Yi-Ling; Fang, Chiu-Ping; Faraone, Stephen V.; Tsuang, Ming T.; Chen, Wei J.; Liu, Chih-Min

    2016-01-01

    D-amino acid oxidase (DAO) has been reported to be associated with schizophrenia. This study aimed to search for genetic variants associated with this gene. The genomic regions of all exons, highly conserved regions of introns, and promoters of this gene were sequenced. Potentially meaningful single-nucleotide polymorphisms (SNPs) obtained from direct sequencing were selected for genotyping in 600 controls and 912 patients with schizophrenia and in a replicated sample consisting of 388 patients with schizophrenia. Genetic associations were examined using single-locus and haplotype association analyses. In single-locus analyses, the frequency of the C allele of a novel SNP rs55944529 located at intron 8 was found to be significantly higher in the original large patient sample (p = 0.016). This allele was associated with a higher level of DAO mRNA expression in the Epstein-Barr virus-transformed lymphocytes. The haplotype distribution of a haplotype block composed of rs11114083-rs2070586-rs2070587-rs55944529 across intron 1 and intron 8 was significantly different between the patients and controls and the haplotype frequencies of AAGC were significantly higher in patients, in both the original (corrected p < 0.0001) and replicated samples (corrected p = 0.0003). The CGTC haplotype was specifically associated with the subgroup with deficits in sustained attention and executive function and the AAGC haplotype was associated with the subgroup without such deficits. The DAO gene was a susceptibility gene for schizophrenia and the genomic region between intron 1 and intron 8 may harbor functional genetic variants, which may influence the mRNA expression of DAO and neurocognitive functions in schizophrenia. PMID:26986737

  18. Interactions between Serotonin Transporter Gene Haplotypes and Quality of Mothers' Parenting Predict the Development of Children's Noncompliance

    ERIC Educational Resources Information Center

    Sulik, Michael J.; Eisenberg, Nancy; Lemery-Chalfant, Kathryn; Spinrad, Tracy L.; Silva, Kassondra M.; Eggum, Natalie D.; Betkowski, Jennifer A.; Kupfer, Anne; Smith, Cynthia L.; Gaertner, Bridget; Stover, Daryn A.; Verrelli, Brian C.

    2012-01-01

    The LPR and STin2 polymorphisms of the serotonin transporter gene (SLC6A4) were combined into haplotypes that, together with quality of maternal parenting, were used to predict initial levels and linear change in children's (N = 138) noncompliance and aggression from age 18-54 months. Quality of mothers' parenting behavior was observed when…

  19. Alternative haplotypes of antigen processing genes in zebrafish diverged early in vertebrate evolution

    PubMed Central

    McConnell, Sean C.; Hernandez, Kyle M.; Wcisel, Dustin J.; Kettleborough, Ross N.; Stemple, Derek L.; Andrade, Jorge; de Jong, Jill L. O.

    2016-01-01

    Antigen processing and presentation genes found within the MHC are among the most highly polymorphic genes of vertebrate genomes, providing populations with diverse immune responses to a wide array of pathogens. Here, we describe transcriptome, exome, and whole-genome sequencing of clonal zebrafish, uncovering the most extensive diversity within the antigen processing and presentation genes of any species yet examined. Our CG2 clonal zebrafish assembly provides genomic context within a remarkably divergent haplotype of the core MHC region on chromosome 19 for six expressed genes not found in the zebrafish reference genome: mhc1uga, proteasome-β 9b (psmb9b), psmb8f, and previously unknown genes psmb13b, tap2d, and tap2e. We identify ancient lineages for Psmb13 within a proteasome branch previously thought to be monomorphic and provide evidence of substantial lineage diversity within each of three major trifurcations of catalytic-type proteasome subunits in vertebrates: Psmb5/Psmb8/Psmb11, Psmb6/Psmb9/Psmb12, and Psmb7/Psmb10/Psmb13. Strikingly, nearby tap2 and MHC class I genes also retain ancient sequence lineages, indicating that alternative lineages may have been preserved throughout the entire MHC pathway since early diversification of the adaptive immune system ∼500 Mya. Furthermore, polymorphisms within the three MHC pathway steps (antigen cleavage, transport, and presentation) are each predicted to alter peptide specificity. Lastly, comparative analysis shows that antigen processing gene diversity is far more extensive than previously realized (with ancient coelacanth psmb8 lineages, shark psmb13, and tap2t and psmb10 outside the teleost MHC), implying distinct immune functions and conserved roles in shaping MHC pathway evolution throughout vertebrates. PMID:27493218

  20. Vitamin D receptor gene polymorphisms and haplotypes (Apa I, Bsm I, Fok I, Taq I) in Turkish psoriasis patients

    PubMed Central

    Acikbas, Ibrahim; Sanlı, Berna; Tepeli, Emre; Ergin, Seniz; Aktan, Sebnem; Bagci, Huseyin

    2012-01-01

    Summary Background Psoriasis is an inflammatory disease characterized by increased squamous cell proliferation and impaired differentiation. Vitamin D, Calcitriol, and its analogues are successfully used for psoriasis therapy. However, it is unknown why some psoriasis patients are resistant to Vitamin D therapy. Vitamin D mediates its activity by a nuclear receptor. It is suggested that polymorphisms and haplotypes in the VDR gene may explain the differences in response to vitamin D therapy. Material/Methods In this study, 102 psoriasis patients and 102 healthy controls were studied for VDR gene polymorphisms. The Fok I, Bsm I, Apa I and Taq I polymorphisms were examined by PCR-RFLP, and 50 subjects received vitamin D therapy to evaluate the association between VDR gene polymorphisms and response to vitamin D therapy. Existence of cutting site is shown by capital letters, and lack was shown by lower case. The haplotypes were analysed by CHAPLIN. Results There was significant difference in allele frequency of T and genotype frequency of Tt between cases and controls (p values 0.038 and 0.04, respectively). The Aa and bb genotypes were significantly higher in early onset than late onset psoriasis (p values 0.008 and 0.04, respectively). The genotypes Ff, ff and TT are significantly different between vitamin D3 therapy responders and non-responders (p values 0.04, 0.0001, 0.009, respectively). To the best of our knowledge, this is the first report showing importance of VDR gene haplotypes in psoriasis, the significance of the Wald and LR (Likelihood Ratio) statistics (p=0,0042) suggest that FfBbAatt is a disease-susceptibility haplotype. Conclusions Haplotype analysis is a recent and commonly used method in genetic association studies. Our results reveal a previously unidentified susceptibility haplotype and indicate that certain haplotypes are important in the resistance to vitamin D3 therapy and the onset of psoriasis. The haplotypes can give valuable data where

  1. Finding approximate gene clusters with Gecko 3

    PubMed Central

    Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian

    2016-01-01

    Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. PMID:27679480

  2. Interactions between serotonin transporter gene haplotypes and quality of mothers' parenting predict the development of children's noncompliance.

    PubMed

    Sulik, Michael J; Eisenberg, Nancy; Lemery-Chalfant, Kathryn; Spinrad, Tracy L; Silva, Kassondra M; Eggum, Natalie D; Betkowski, Jennifer A; Kupfer, Anne; Smith, Cynthia L; Gaertner, Bridget; Stover, Daryn A; Verrelli, Brian C

    2012-05-01

    The LPR and STin2 polymorphisms of the serotonin transporter gene (SLC6A4) were combined into haplotypes that, together with quality of maternal parenting, were used to predict initial levels and linear change in children's (N = 138) noncompliance and aggression from age 18-54 months. Quality of mothers' parenting behavior was observed when children were 18 months old, and nonparental caregivers' reports of noncompliance and aggression were collected annually from 18 to 54 months of age. Quality of early parenting was negatively related to the slope of noncompliance only for children with the LPR-S/STin2-10 haplotype and to 18-month noncompliance only for children with haplotypes that did not include LPR-S. The findings support the notion that SLC6A4 haplotypes index differential susceptibility to variability in parenting quality, with certain haplotypes showing greater reactivity to both supportive and unsupportive environments. These different genetic backgrounds likely reflect an evolutionary response to variation in the parenting environment.

  3. Haplotype combination of the bovine CFL2 gene sequence variants and association with growth traits in Qinchuan cattle.

    PubMed

    Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong

    2015-06-01

    The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.25Haplotype analysis of three SNPs showed that 8 different haplotypes were identified in all, but only 5 haplotypes were listed except for those with a frequency of <0.03. Hap4 (-GTC-) had the highest haplotype frequencies (34.70%). However in the three SNPs there were no significant associations between the 13 combined genotypes of the CFL2 gene and growth traits. LD analysis showed that the SNP T 1694 A and C 2213 G loci had a strong linkage (r(2)>0.33). Association analysis indicated that SNP G 1500 A, T 1694 A and C 2213 G were significantly associated with growth traits in the QC population. The results of our study suggest that the CFL2 gene may be a strong candidate gene that affects growth traits in the QC cattle breeding program.

  4. Analysis of 5' flanking regions of the gamma globin genes from major African haplotype backgrounds associated with sickle cell disease.

    PubMed

    Month, S R; Wood, R W; Trifillis, P T; Orchowski, P J; Sharon, B; Ballas, S K; Surrey, S; Schwartz, E

    1990-02-01

    There are at least three major African haplotype backgrounds on which the beta s mutation arises. Sequence changes in the immediate 5' flanking area of the gamma-globin genes may account for differences in fetal hemoglobin expression among the three haplotypes. We determined the sequence from -350 to 10 bp 5' of the G gamma and A gamma fetal globin genes from one beta s-containing chromosome on each of the three major haplotype backgrounds. The Senegal chromosome had a T at -158 5' to the G gamma gene; the Benin (BEN) chromosome had an A to G change at -309 5' to the G gamma gene; and the Central African Republic (CAR) chromosome had a C to T change at -271 5' to the A gamma gene. Genomic DNA from patients with sickle cell disease was analyzed using the polymerase chain reaction and radiolabeled allele-specific oligonucleotide probes. The -309 G variant 5' to the G gamma gene is associated with BEN chromosomes, and the -271 T variant 5' to A gamma with CAR. The -309 change was also found on beta A-containing chromosomes, while the -271 change was not. The -309 change may have predated the beta s mutation on the BEN chromosome.

  5. Meiotic recombination generates rich diversity in NK cell receptor genes, alleles, and haplotypes

    PubMed Central

    Norman, Paul J.; Abi-Rached, Laurent; Gendzekhadze, Ketevan; Hammond, John A.; Moesta, Achim K.; Sharma, Deepti; Graef, Thorsten; McQueen, Karina L.; Guethlein, Lisbeth A.; Carrington, Christine V.F.; Chandanayingyong, Dasdayanee; Chang, Yih-Hsin; Crespí, Catalina; Saruhan-Direskeneli, Güher; Hameed, Kamran; Kamkamidze, Giorgi; Koram, Kwadwo A.; Layrisse, Zulay; Matamoros, Nuria; Milà, Joan; Park, Myoung Hee; Pitchappan, Ramasamy M.; Ramdath, D. Dan; Shiau, Ming-Yuh; Stephens, Henry A.F.; Struik, Siske; Tyan, Dolly; Verity, David H.; Vaughan, Robert W.; Davis, Ronald W.; Fraser, Patricia A.; Riley, Eleanor M.; Ronaghi, Mostafa; Parham, Peter

    2009-01-01

    Natural killer (NK) cells contribute to the essential functions of innate immunity and reproduction. Various genes encode NK cell receptors that recognize the major histocompatibility complex (MHC) Class I molecules expressed by other cells. For primate NK cells, the killer-cell immunoglobulin-like receptors (KIR) are a variable and rapidly evolving family of MHC Class I receptors. Studied here is KIR3DL1/S1, which encodes receptors for highly polymorphic human HLA-A and -B and comprises three ancient allelic lineages that have been preserved by balancing selection throughout human evolution. While the 3DS1 lineage of activating receptors has been conserved, the two 3DL1 lineages of inhibitory receptors were diversified through inter-lineage recombination with each other and with 3DS1. Prominent targets for recombination were D0-domain polymorphisms, which modulate enhancer function, and dimorphism at position 283 in the D2 domain, which influences inhibitory function. In African populations, unequal crossing over between the 3DL1 and 3DL2 genes produced a deleted KIR haplotype in which the telomeric “half” was reduced to a single fusion gene with functional properties distinct from its 3DL1 and 3DL2 parents. Conversely, in Eurasian populations, duplication of the KIR3DL1/S1 locus by unequal crossing over has enabled individuals to carry and express alleles of all three KIR3DL1/S1 lineages. These results demonstrate how meiotic recombination combines with an ancient, preserved diversity to create new KIR phenotypes upon which natural selection acts. A consequence of such recombination is to blur the distinction between alleles and loci in the rapidly evolving human KIR gene family. PMID:19411600

  6. Haplotype combination of polymorphisms in the ADIPOQ gene promoter is associated with growth traits in Qinchuan cattle.

    PubMed

    Zhang, Liangzhi; Li, Mijie; Lai, Xinsheng; Yang, Mingjuan; Xu, Yao; Hua, Liushuai; Lan, Xianyong; Zhang, Chunlei; Chen, Hong

    2013-07-01

    Adiponectin modulates lipid and glucose metabolism in adipose tissues and is also related to bone metabolism. Polymorphisms in the ADIPOQ gene likely have an impact on growth traits in cattle. In this study, we examined the relationship between ADIPOQ polymorphisms and body measurement parameters in Chinese beef cattle. First, we sequenced ADIPOQ and 1.2 kb of DNA upstream of its promoter, and we found 14 polymorphisms. With the luciferase reporter assay, we showed that the two polymorphisms SNP PR_-135 A>G and PR_-68 G>C, which are located in the core region of promoter, influence promoter activity of ADIPOQ. Second, we identified three haplotypes involved in these two polymorphic sites: A (A-135/C-68), B (A-135/G-68), and C (G-135/G-68). Haplotypes B and C are major haplotypes in five Chinese populations of cattle (Qinchuan, Nanyang, Jiaxian, Hazakh, and Chinese Holstein). We studied the effects of these three haplotypes on body measurements, gene expression, and promoter activity, and we found that the genotypes are associated with body measurement parameters in Qinchuan cattle. Individuals with genotype BC (AG/GG) had significantly higher body height and heart girth than others, and this result may be interpreted by the following two observations. The promoter activity with haplotype B (A/G) is significantly higher than those with A (A/C) and C (G/G) in driving reporter gene transcription; the ADIPOQ mRNA level in cattle with genotype BC (AG/GG) is relatively lower than that in cattle with genotype BB (AA/GG).

  7. A Killer Immunoglobulin - Like Receptor Gene - Content Haplotype and A Cognate Human Leukocyte Antigen Ligand are Associated with Autism

    PubMed Central

    Torres, Anthony; Westover, Jonna; Benson, Michael; Johnson, Randall; Dykes, Annelise

    2016-01-01

    The killing activity of natural killer cells is largely regulated by the binding of class I human leukocyte antigen cognate ligands to killer cell immunoglobulin - like receptor proteins. The killer cell immunoglobulin - like receptor gene - complex contains genes that activate and others that inhibit the killing state of natural killer cells depending on the binding of specific human leukocyte antigen cognate ligands. It has been suggested in previous publications that activating human leukocyte antigen/killer - cell immunoglobulin - like receptor complexes are increased in people with autism. We present data, which suggests that an activating cB01/tA01 killer cell immunoglobulin - like receptor gene - content haplotype and the cognate ligand human leukocyte antigen - C1k that activates this haplotype is significantly increased in autism. This is an important observation suggesting that the interaction between two proteins encoded on different chromosomes increases natural killer cell killing in autism. PMID:27853655

  8. A Killer Immunoglobulin - Like Receptor Gene - Content Haplotype and A Cognate Human Leukocyte Antigen Ligand are Associated with Autism.

    PubMed

    Torres, Anthony; Westover, Jonna; Benson, Michael; Johnson, Randall; Dykes, Annelise

    2016-04-01

    The killing activity of natural killer cells is largely regulated by the binding of class I human leukocyte antigen cognate ligands to killer cell immunoglobulin - like receptor proteins. The killer cell immunoglobulin - like receptor gene - complex contains genes that activate and others that inhibit the killing state of natural killer cells depending on the binding of specific human leukocyte antigen cognate ligands. It has been suggested in previous publications that activating human leukocyte antigen/killer - cell immunoglobulin - like receptor complexes are increased in people with autism. We present data, which suggests that an activating cB01/tA01 killer cell immunoglobulin - like receptor gene - content haplotype and the cognate ligand human leukocyte antigen - C1k that activates this haplotype is significantly increased in autism. This is an important observation suggesting that the interaction between two proteins encoded on different chromosomes increases natural killer cell killing in autism.

  9. Characterization of the MHC class II region in cattle. The number of DQ genes varies between haplotypes.

    PubMed

    Andersson, L; Rask, L

    1988-01-01

    The organization of the major histocompatibility complex (MHC) class II region in cattle was investigated by Southern blot analysis using human probes corresponding to DO, DP, DQ, and DR genes. Exon-specific probes were also employed to facilitate the assessment of the number of different bovine class II genes. The results indicated the presence of single DO beta and DR alpha genes, at least three DR beta genes, while the number of DQ genes was found to vary between MHC haplotypes. Four DQ haplotypes, DQ alpha 1 beta 1 to DQ alpha 2 beta 4, possessed a single DQ alpha and a single DQ beta gene whereas both these genes were duplicated in eight other haplotypes, DQ alpha 3 beta 5 to DQ alpha 9 beta 12. No firm evidence for the presence of bovine DP genes was obtained. The same human probes were also used to investigate the genetic polymorphism of bovine class II genes. DQ alpha, DQ beta, DR alpha, DR beta, and DO beta restriction fragment length polymorphisms (RFLPs) were resolved and in particular the DQ restriction fragment patterns were highly polymorphic. Comparison of the present result with the current knowledge of the class II region in other mammalian species suggested that the DO, DP, DQ, DR, and DZ subdivision of the class II region was established already in the ancestor of mammals. The DP genes appear to be the least conserved class II genes among mammalian species and may have been lost in cattle. The degree of polymorphism of different class II genes, as revealed by RFLP analyses, shows striking similarities between species.

  10. Single-nucleotide polymorphisms and haplotypes of non-coding area in the CP gene are correlated with Parkinson's disease.

    PubMed

    Zhao, Na; Xiao, Jianqiu; Zheng, Zhiyong; Fei, Guoqiang; Zhang, Feng; Jin, Lirong; Zhong, Chunjiu

    2015-04-01

    Our previous studies have demonstrated that ceruloplasmin (CP) dysmetabolism is correlated with Parkinson's disease (PD). However, the causes of decreased serum CP levels in PD patients remain to be clarified. This study aimed to explore the potential association between genetic variants of the CP gene and PD. Clinical features, serum CP levels, and the CP gene (both promoter and coding regions) were analyzed in 60 PD patients and 50 controls. A luciferase reporter system was used to investigate the function of promoter single-nucleotide polymorphisms (SNPs). High-density comparative genomic hybridization microarrays were also used to detect large-scale copy-number variations in CP and an additional 47 genes involved in PD and/or copper/iron metabolism. The frequencies of eight SNPs (one intronic SNP and seven promoter SNPs of the CP gene) and their haplotypes were significantly different between PD patients, especially those with lowered serum CP levels, and controls. However, the luciferase reporter system revealed no significant effect of the risk haplotype on promoter activity of the CP gene. Neither these SNPs nor their haplotypes were correlated with the Hoehn and Yahr staging of PD. The results of this study suggest that common genetic variants of CP are associated with PD and further investigation is needed to explore their functions in PD.

  11. Haplotypes of the IL10 Gene as Potential Protection Factors in Leprosy Patients

    PubMed Central

    Garcia, Patricia; Alencar, Dayse; Pinto, Pablo; Santos, Ney; Salgado, Claudio; Sortica, Vinicius A.; Hutz, Mara H.; Santos, Sidney

    2013-01-01

    Leprosy is an infectious disease caused by Mycobacterium leprae characterized by dermatoneurological signs and symptoms that has a large number of new cases worldwide. Several studies have associated interleukin 10 with susceptibility/resistance to several diseases. We investigated haplotypes formed by three single nucleotide polymorphisms (SNPs) located in the IL10 gene (A-1082G, C-819T, and C-592A) in order to better understand the susceptibility to and severity of leprosy in an admixed northern Brazil population, taking into account estimates of interethnic admixture. We observed the genotypes ACC/ACC (P = 0.021, odds ratio [OR] [95% confidence interval (CI)] = 0.290 [0.085 to 0823]) and ACC/GCC (P = 0.003, OR [95% CI] = 0.220 [0.504 to 0.040]) presenting significant results for protection against leprosy development, framed in the profiles of low and medium interleukin production, respectively. Therefore, we suggest that genotypes A-1082G, C-819T, and C-592A formed by interleukin-10 polymorphisms are closely related to protection of the leprosy development in an admixed northern Brazil population, in particular ACC/ACC and ACC/GCC genotypes. PMID:23966553

  12. Haplotype M2 in the annexin A5 (ANXA5) gene and the occurrence of obstetric complications.

    PubMed

    Tiscia, Giovanni; Colaizzo, Donatella; Chinni, Elena; Pisanelli, Daniela; Sciannamè, Natale; Favuzzi, Giovanni; Margaglione, Maurizio; Grandone, Elvira

    2009-08-01

    Inherited or acquired thrombophilias have been largely explored as a cause of pregnancy complications. However, pathogenesis of obstetric complications, as fetal loss and pregnancy-related hypertensive disorders is still partly unexplained. Recently, a common haplotype (M2) within the annexin A5 (ANXA5) gene has been described as a risk factor in recurrent fetal losses (RFL). It has been demonstrated to reduce the promoter activity of the ANXA5 promoter in luciferase reporter assays. Aim of this study was to investigate the prevalence of M2 haplotype in three different settings of women with previous obstetric complications: RFL, intra-uterine fetal death (IUFD) and pregnancy-related hypertension (gestational hypertension [GH] and pre-eclampsia [PE]). One hundred three patients with previous RFL, 54 with IUFD, 158 with hypertensive disease (67 GH, 91 PE) were investigated. As controls, 195 women from the same ethnic background with uneventful pregnancies were enrolled. Logistic regression, correcting for age, gravidity and parity showed that the ANXA5 haplotype is significantly and independently associated with the occurrence of RFL (3.1; 95%CI: 1.1-9.5; p = 0.047) and pregnancy-related hypertensive disorders (2.1; 95%CI: 1.2-3.5; p = 0.008). The M2 haplotype might be a new and relevant risk factor for obstetric complications.

  13. Incorporating Single-Locus Tests into Haplotype Cladistic Analysis in Case-Control Studies

    PubMed Central

    Liu, Jianfeng; Papasian, Chris; Deng, Hong-Wen

    2007-01-01

    In case-control studies, genetic associations for complex diseases may be probed either with single-locus tests or with haplotype-based tests. Although there are different views on the relative merits and preferences of the two test strategies, haplotype-based analyses are generally believed to be more powerful to detect genes with modest effects. However, a main drawback of haplotype-based association tests is the large number of distinct haplotypes, which increases the degrees of freedom for corresponding test statistics and thus reduces the statistical power. To decrease the degrees of freedom and enhance the efficiency and power of haplotype analysis, we propose an improved haplotype clustering method that is based on the haplotype cladistic analysis developed by Durrant et al. In our method, we attempt to combine the strengths of single-locus analysis and haplotype-based analysis into one single test framework. Novel in our method is that we develop a more informative haplotype similarity measurement by using p-values obtained from single-locus association tests to construct a measure of weight, which to some extent incorporates the information of disease outcomes. The weights are then used in computation of similarity measures to construct distance metrics between haplotype pairs in haplotype cladistic analysis. To assess our proposed new method, we performed simulation analyses to compare the relative performances of (1) conventional haplotype-based analysis using original haplotype, (2) single-locus allele-based analysis, (3) original haplotype cladistic analysis (CLADHC) by Durrant et al., and (4) our weighted haplotype cladistic analysis method, under different scenarios. Our weighted cladistic analysis method shows an increased statistical power and robustness, compared with the methods of haplotype cladistic analysis, single-locus test, and the traditional haplotype-based analyses. The real data analyses also show that our proposed method has practical

  14. Clustering of High Throughput Gene Expression Data

    PubMed Central

    Pirim, Harun; Ekşioğlu, Burak; Perkins, Andy; Yüceer, Çetin

    2012-01-01

    High throughput biological data need to be processed, analyzed, and interpreted to address problems in life sciences. Bioinformatics, computational biology, and systems biology deal with biological problems using computational methods. Clustering is one of the methods used to gain insight into biological processes, particularly at the genomics level. Clearly, clustering can be used in many areas of biological data analysis. However, this paper presents a review of the current clustering algorithms designed especially for analyzing gene expression data. It is also intended to introduce one of the main problems in bioinformatics - clustering gene expression data - to the operations research community. PMID:23144527

  15. Vitamin D Receptor Gene Polymorphisms and Haplotypes in Hungarian Patients with Idiopathic Inflammatory Myopathy

    PubMed Central

    Griger, Zoltán; Dankó, Katalin

    2015-01-01

    Idiopathic inflammatory myopathies are autoimmune diseases characterized by symmetrical proximal muscle weakness. Our aim was to identify a correlation between VDR polymorphisms or haplotypes and myositis. We studied VDR-BsmI, VDR-ApaI, VDR-TaqI, and VDR-FokI polymorphisms and haplotypes in 89 Hungarian poly-/dermatomyositis patients (69 females) and 93 controls (52 females). We did not obtain any significant differences for VDR-FokI, BsmI, ApaI, and TaqI genotypes and allele frequencies between patients with myositis and healthy individuals. There was no association of VDR polymorphisms with clinical manifestations and laboratory profiles in myositis patients. Men with myositis had a significantly different distribution of BB, Bb, and bb genotypes than female patients, control male individuals, and the entire control group. Distribution of TT, Tt, and tt genotypes was significantly different in males than in females in patient group. According to four-marker haplotype prevalence, frequencies of sixteen possible haplotypes showed significant differences between patient and control groups. The three most frequent haplotypes in patients were the fbAt, FBaT, and fbAT. Our findings may reveal that there is a significant association: Bb and Tt genotypes can be associated with myositis in the Hungarian population we studied. We underline the importance of our result in the estimated prevalence of four-marker haplotypes. PMID:25649962

  16. Identification of genetic variation and haplotype structure of the canine ABCA4 gene for retinal disease association studies

    PubMed Central

    Zangerl, B.; Lindauer, S. J.; Acland, G. M.; Aguirre, G. D.

    2010-01-01

    Over 200 mutations in the retina specific member of the ATP-binding cassette transporter super-family (ABCA4) have been associated with a diverse group of human retinal diseases. The disease mechanisms, and genotype–phenotype associations, nonetheless, remain elusive in many cases. As orthologous genes are commonly mutated in canine models of human blinding disorders, canine ABCA4 appears to be an ideal candidate gene to identify and study sequence changes in dogs affected by various forms of inherited retinal degeneration. However, the size of the gene and lack of haplotype assignment significantly limit targeted association and/or linkage approaches. This study assessed the naturally observed sequence diversity of ABCA4 in the dog, identifying 80% of novel variations. While none of the observed polymorphisms have been associated with blinding disorders to date, breed and potentially disease specific haplotypes have been identified. Moreover, a tag SNP map of 17 (15) markers has been established that accurately predicts common ABCA4 haplotypes (frequency > 5%) explaining >85% (>80%) of the observed genetic diversity and will considerably advance future studies. Our sequence analysis of the complete canine ABCA4 coding region will clearly provide a baseline and tools for future association studies and comparative genomics to further delineate the role of ABCA4 in canine blinding disorders. PMID:20661590

  17. Targeted Mutagenesis of a Candidate T Complex Responder Gene in Mouse T Haplotypes Does Not Eliminate Transmission Ratio Distortion

    PubMed Central

    Ewulonu, U. K.; Schimenti, K.; Kuemerle, B.; Magnuson, T.; Schimenti, J.

    1996-01-01

    Transmission ratio distortion (TRD) associated with mouse t haplotypes causes +/t males to transmit the t-bearing chromosome to nearly all their offspring. Of the several genes involved in this phenomenon, the t complex responder (Tcr(t)) locus is absolutely essential for TRD to occur. A candidate Tcr(t) gene called Tcp10b(t) was previously cloned from the genetically defined Tcr(t) region. Its location, restricted expression in testis, and a unique postmeiotic alternative splicing pattern supported the idea that Tcp10b(t) was Tcr(t). To test this hypothesis in a functional assay, ES cells were derived from a viable partial t haplotype, and the Tcp10b(t) gene was mutated by homologous recombination. Mutant mice were mated to appropriate partial t haplotypes to determine whether the targeted chromosome exhibited transmission ratios characteristic of the responder. The results demonstrated that the targeted chromosome retained full responder activity. Hence, Tcp10b(t) does not appear to be Tcr(t). These and other observations necessitate a reevaluation of genetic mapping data and the actual nature of the responder. PMID:8889539

  18. Haplotype of platelet receptor P2RY12 gene is associated with residual clopidogrel on-treatment platelet reactivity*

    PubMed Central

    Nie, Xiao-yan; Li, Jun-lei; Zhang, Yong; Xu, Yang; Yang, Xue-li; Fu, Yu; Liang, Guang-kai; Lu, Yun; Liu, Jian; Shi, Lu-wen

    2017-01-01

    Objective: To investigate a possible association between common variations of the P2RY12 and the residual clopidogrel on-treatment platelet reactivity after adjusting for the influence of CYP2C19 tested by thromboelastography (TEG). Methods: One hundred and eighty patients with acute coronary syndrome (ACS) treated with clopidogrel and aspirin were included and platelet function was assessed by TEG. Five selected P2RY12 single nucleotide polymorphisms (SNPs; rs6798347, rs6787801, rs6801273, rs6785930, and rs2046934), which cover the common variations in the P2RY12 gene and its regulatory regions, and three CYP2C19 SNPs (*2,*3,*17) were genotyped and possible haplotypes were analyzed. Results: The high on-treatment platelet reactivity (HTPR) prevalence defined by a platelet inhibition rate <30% by TEG in adenosine diphosphate (ADP)-channel was 69 (38.33%). Six common haplotypes were inferred from four of the selected P2RY12 SNPs (denoted H0 to H5) according to the linkage disequilibrium R square (except for rs2046934). Haplotype H1 showed a significantly lower incidence of HTPR than the reference haplotype (H0) in the total study population while haplotypes H1 and H2 showed significantly lower incidences of HTPR than H0 in the nonsmoker subgroup after adjusting for CYP2C19 effects and demographic characteristics. rs2046934 (T744C) did not show any significant association with HTPR. Conclusions: The combination of common P2RY12 variations including regulatory regions rather than rs2046934 (T744C) that related to pharmacodynamics of clopidogrel in patients with ACS was independently associated with residual on-clopidogrel platelet reactivity. This is apart from the established association of the CYP2C19. This association seemed more important in the subgroup defined by smoking. PMID:28070995

  19. Genome-wide haplotype association study identify TNFRSF1A, CASP7, LRP1B, CDH1 and TG genes associated with Alzheimer's disease in Caribbean Hispanic individuals.

    PubMed

    Shang, Zhenwei; Lv, Hongchao; Zhang, Mingming; Duan, Lian; Wang, Situo; Li, Jin; Liu, Guiyou; Ruijie, Zhang; Jiang, Yongshuai

    2015-12-15

    Alzheimer's disease (AD) is an acquired disorder of cognitive and behavioral impairment. It is considered to be caused by variety of factors, such as age, environment and genetic factors. In order to identify the genetic affect factors of AD, we carried out a bioinformatic approach which combined genome-wide haplotype-based association study with gene prioritization. The raw SNP genotypes data was downloaded from GEO database (GSE33528). It contains 615 AD patients and 560 controls of Caribbean Hispanic individuals. Firstly, we identified the linkage disequilibrium (LD) haplotype blocks and performed genome-wide haplotype association study to screen significant haplotypes that were associated with AD. Then we mapped these significant haplotypes to genes and obtained candidate genes set for AD. At last, we prioritized AD candidate genes based on their similarity with 36 known AD genes, so as to identify AD related genes. The results showed that 141 haplotypes on 134 LD blocks were significantly associated with AD (P<1E-4), and these significant haplotypes were mapped to 132 AD candidate genes. After prioritizing these candidate genes, we found seven AD related genes: APOE, APOC1, TNFRSF1A, LRP1B, CDH1, TG and CASP7. Among these genes, APOE and APOC1 are known AD risk genes. For the other five genes TNFRSF1A, CDH1, CASP7, LRP1B and TG, this is the first genetic association study which showed the significant association between these five genes and AD susceptibility in Caribbean Hispanic individuals. We believe that our findings can provide a new perspective to understand the genetic affect factors of AD.

  20. Fine localization of the Nijmegen breakage syndrome gene to 8q21: Evidence for a common founder haplotype

    SciTech Connect

    Cerosaletti, K.M.; Lange, E.; Stringham, H.M.

    1998-07-01

    Nijmegen breakage syndrome (NBS) is a rare autosomal recessive disorder characterized by microcephaly, a birdlike face, growth retardation, immunodeficiency, lack of secondary sex characteristics in females, and increased incidence of lymphoid cancers. NBS cells display a phenotype similar to that of cells from ataxia-telangiectasia patients, including chromosomal instability, radiation sensitivity, and aberrant cell-cycle-checkpoint control following exposure to ionizing radiation. A recent study reported genetic linkage of NBs to human chromosome 8q21, with strong linkage disequilibrium detected at marker D8S1811 in eastern European NBS families. The authors collected a geographically diverse group of NBS families and tested them for linkage, using an expanded panel of markers at 8q21. In this article, the authors report linkage of NBS to 8q21 in 6/7 of these families, with a maximum LOD score of 3.58. Significant linkage disequilibrium was detected for 8/13 markers tested in the 8q21 region, including D8S1811. In order to further localize the gene for NBS, the authors generated a radiation-hybrid map of markers at 8q21 and constructed haplotypes based on this map. Examination of disease haplotypes segregating in 11 NBS pedigrees revealed recombination events that place the NBS gene between D8S1757 and D8S270. A common founder haplotype was present on 15/18 disease chromosomes from 9/11 NBS families. Inferred (ancestral) recombination events involving this common haplotype suggest that NBS can be localized further, to an interval flanked by markers D8S273 and D8S88.

  1. Fine localization of the Nijmegen breakage syndrome gene to 8q21: evidence for a common founder haplotype.

    PubMed Central

    Cerosaletti, K M; Lange, E; Stringham, H M; Weemaes, C M; Smeets, D; Sölder, B; Belohradsky, B H; Taylor, A M; Karnes, P; Elliott, A; Komatsu, K; Gatti, R A; Boehnke, M; Concannon, P

    1998-01-01

    Nijmegen breakage syndrome (NBS) is a rare autosomal recessive disorder characterized by microcephaly, a birdlike face, growth retardation, immunodeficiency, lack of secondary sex characteristics in females, and increased incidence of lymphoid cancers. NBS cells display a phenotype similar to that of cells from ataxia-telangiectasia patients, including chromosomal instability, radiation sensitivity, and aberrant cell-cycle-checkpoint control following exposure to ionizing radiation. A recent study reported genetic linkage of NBS to human chromosome 8q21, with strong linkage disequilibrium detected at marker D8S1811 in eastern European NBS families. We collected a geographically diverse group of NBS families and tested them for linkage, using an expanded panel of markers at 8q21. In this article, we report linkage of NBS to 8q21 in 6/7 of these families, with a maximum LOD score of 3.58. Significant linkage disequilibrium was detected for 8/13 markers tested in the 8q21 region, including D8S1811. In order to further localize the gene for NBS, we generated a radiation-hybrid map of markers at 8q21 and constructed haplotypes based on this map. Examination of disease haplotypes segregating in 11 NBS pedigrees revealed recombination events that place the NBS gene between D8S1757 and D8S270. A common founder haplotype was present on 15/18 disease chromosomes from 9/11 NBS families. Inferred (ancestral) recombination events involving this common haplotype suggest that NBS can be localized further, to an interval flanked by markers D8S273 and D8S88. PMID:9634525

  2. Polymorphisms in the interleukin-10 gene cluster are possibly involved in the increased risk for major depressive disorder

    PubMed Central

    Traks, Tanel; Koido, Kati; Eller, Triin; Maron, Eduard; Kingo, Külli; Vasar, Veiko; Vasar, Eero; Kõks, Sulev

    2008-01-01

    Background Innate immune inflammatory response is suggested to have a role in the pathogenesis of major depressive disorder (MDD). Interleukin (IL)-10 family cytokines IL-10, IL-19, IL-20, and IL-24 are all implicated in the inflammatory processes and polymorphisms in respective genes have been associated with various immunopathological conditions. This study was carried out to investigate whether single-nucleotide polymorphisms (SNPs) in these genes are also associated with MDD. Methods Case-control association study was performed with seven SNPs from the IL10 gene cluster. 153 patients with MDD and 277 healthy control individuals were recruited. Results None of the selected SNPs were individually associated with MDD. The linkage disequilibrium (LD) analysis indicated the existence of two recombination sites in the IL10 gene cluster, thus confirming the formerly established LD pattern of this genomic region. This also created two haplotype blocks, both consisting of three SNPs. Additionally, the haplotype analysis detected a significantly higher frequency of block 2 (IL20 and IL24 genes) haplotype TGC in the patients group compared to healthy control individuals (P = 0.0097). Conclusion Our study established increased risk for MDD related to the IL20 and IL24 haplotype and suggests that cytokines may contribute to the pathogenesis of MDD. Since none of the block 2 SNPs were individually associated with MDD, it is possible that other polymorphisms linked to them contribute to the disease susceptibility. Future studies are needed to confirm the results and to find the possible functional explanation. PMID:19087313

  3. The association of very low-density lipoprotein receptor (VLDLR) haplotypes with egg production indicates VLDLR is a candidate gene for modulating egg production

    PubMed Central

    Wang, ZhePeng; Meng, GuoHua; Li, Na; Yu, MingFen; Liang, XiaoWei; Min, YuNa; Liu, FuZhu; Gao, YuPeng

    2016-01-01

    Abstract The very low-density lipoprotein receptor (VLDLR) transports egg yolk precursors into oocytes. However, our knowledge of the distribution patterns of VLDLR variants among breeds and their relationship to egg production is still incomplete. In this study, eight single nucleotide polymorphisms (SNPs) that account for 87% of all VLDLR variants were genotyped in Nick Chick (NC, n=91), Lohmann Brown (LohB, n=50) and Lueyang (LY, n=381) chickens, the latter being an Chinese indigenous breed. Egg production by NC and LY chickens was recorded from 17 to 50 weeks. Only four similar haplotypes were found in NC and LohB, of which two accounted for 100% of all NC haplotypes and 92.5% of LohB haplotypes. In contrast, there was considerable haplotypic diversity in LY. Comparison of egg production in LY showed that hens with NC-like haplotypes had a significantly higher production (p < 0.05) than those without the haplotypes. However, VLDLR expression was not significantly different between the haplotypes. These findings indicate a divergence in the distribution of VLDLR haplotypes between selected and non-selected breeds and suggest that the near fixation of VLDLR variants in NC and LohB is compatible with signature of selection. These data also support VLDLR as a candidate gene for modulating egg production. PMID:27560838

  4. Chicken rRNA Gene Cluster Structure

    PubMed Central

    Dyomin, Alexander G.; Koshel, Elena I.; Kiselev, Artem M.; Saifitdinova, Alsu F.; Galkina, Svetlana A.; Fukagawa, Tatsuo; Kostareva, Anna A.

    2016-01-01

    Ribosomal RNA (rRNA) genes, whose activity results in nucleolus formation, constitute an extremely important part of genome. Despite the extensive exploration into avian genomes, no complete description of avian rRNA gene primary structure has been offered so far. We publish a complete chicken rRNA gene cluster sequence here, including 5’ETS (1836 bp), 18S rRNA gene (1823 bp), ITS1 (2530 bp), 5.8S rRNA gene (157 bp), ITS2 (733 bp), 28S rRNA gene (4441 bp) and 3’ETS (343 bp). The rRNA gene cluster sequence of 11863 bp was assembled from raw reads and deposited to GenBank under KT445934 accession number. The assembly was validated through in situ fluorescent hybridization analysis on chicken metaphase chromosomes using computed and synthesized specific probes, as well as through the reference assembly against de novo assembled rRNA gene cluster sequence using sequenced fragments of BAC-clone containing chicken NOR (nucleolus organizer region). The results have confirmed the chicken rRNA gene cluster validity. PMID:27299357

  5. A novel approach to rapid determination of betaS-globin haplotypes: sequencing of the Agamma-IVS-II region.

    PubMed

    Vinson, Amy E; Walker, Aisha; Elam, Dedrey; Glendenning, Michele; Kutlar, Ferdane; Clair, Betsy; Harbin, Jeanette; Kutlar, Abdullah

    2004-01-01

    beta-Globin gene cluster haplotypes were originally determined by restriction endonuclease mapping with Southern blots of polymorphic sites around the gene cluster. Over the years, haplotyping has been found to be useful, not only in population genetics but also in predicting the severity of hemoglobinopathies such as sickle cell disease. The sickle mutation occurs on five distinct haplotypes. The hitherto used methods are cumbersome and time-consuming, making haplotype determination a tedious procedure. We report our experience with a novel, rapid approach to haplotyping based on sequence polymorphisms in the Agamma-IVS-II region. We provide an algorithm that allows rapid assignment of the four African haplotypes carrying the sickle mutation.

  6. Morbidity, beta S haplotype and alpha-globin gene patterns among sickle cell anemia patients in Kuwait.

    PubMed

    Adekile, A D; Haider, M Z

    1996-01-01

    Admission records of children with sickle cell anemia (SS), in the two main teaching hospitals in Kuwait, were reviewed for a 1-year period. The haplotypes of 92 beta s chromosomes (from 39 SS, 11 AS, 2 S beta-thalassemia [S beta-thal] and 1 SD individuals) were determined using an allele-specific oligonucleotide (ASO) hybridization technique, while the alpha-globin gene status of 27 SS and 33 AS individuals, i.e. 120 chromosomes, was determined with a combination of polymerase chain reaction and AS techniques. A vasooclusive crisis was the most common (60.0%) cause of hospitalization, followed by infections (20%). Hospital admissions were most common during the hottest month of the year (July). Few complications of the disease were seen among patients on follow-up; however, splenomegaly was present in 24.0%, hepatomegaly in 15.2%, gallstones in 15.2% and aseptic necrosis of the femoral head in 6.1%. Haplotype 31 (Saudi Arabia/India) is the most frequent in this community, being present in 80.4% of the chromosomes tested; Benin haplotype 19 was found in 12.0% and Bantu haplotype 20 in 6.5%. Hb F in the haplotype 31 homozygotes and heterozygotes ranged from 11.4 to 35.1% (mean 22.5 +/- 5.2%). The frequency of alpha-thal determinants in the study was 40.0%, the commonest being the -alpha 3.7-kb deletion (27.5%), the alpha 2 polyadenylation signal (AATAAA-> AATAAG) mutation (10.2%) and the IVS-I 5' end GAGGT-GAGG->GAGG pentanucleotide (5 nt) deletion (3.3%). SS patients with coexistent alpha-thal trait did not have severe recurrent infections and none had gallstones. The high frequencies of the Saudi Arabia/India beta s haplotype and alpha-thalassemia trait contribute to the mild nature of SS disease among Kuwaiti Arabs comparable to that in eastern Saudi Arabia.

  7. Clustering gene expression data using graph separators.

    PubMed

    Kaba, Bangaly; Pinet, Nicolas; Lelandais, Gaëlle; Sigayret, Alain; Berry, Anne

    2007-01-01

    Recent work has used graphs to modelize expression data from microarray experiments, in view of partitioning the genes into clusters. In this paper, we introduce the use of a decomposition by clique separators. Our aim is to improve the classical clustering methods in two ways: first we want to allow an overlap between clusters, as this seems biologically sound, and second we want to be guided by the structure of the graph to define the number of clusters. We test this approach with a well-known yeast database (Saccharomyces cerevisiae). Our results are good, as the expression profiles of the clusters we find are very coherent. Moreover, we are able to organize into another graph the clusters we find, and order them in a fashion which turns out to respect the chronological order defined by the the sporulation process.

  8. Population-Specific Haplotype Association of the Postsynaptic Density Gene DLG4 with Schizophrenia, in Family-Based Association Studies

    PubMed Central

    Balan, Shabeesh; Yamada, Kazuo; Hattori, Eiji; Iwayama, Yoshimi; Toyota, Tomoko; Ohnishi, Tetsuo; Maekawa, Motoko; Toyoshima, Manabu; Iwata, Yasuhide; Suzuki, Katsuaki; Kikuchi, Mitsuru; Yoshikawa, Takeo

    2013-01-01

    The post-synaptic density (PSD) of glutamatergic synapses harbors a multitude of proteins critical for maintaining synaptic dynamics. Alteration of protein expression levels in this matrix is a marked phenomenon of neuropsychiatric disorders including schizophrenia, where cognitive functions are impaired. To investigate the genetic relationship of genes expressed in the PSD with schizophrenia, a family-based association analysis of genetic variants in PSD genes such as DLG4, DLG1, PICK1 and MDM2, was performed, using Japanese samples (124 pedigrees, n = 376 subjects). Results showed a significant association of the rs17203281 variant from the DLG4 gene, with preferential transmission of the C allele (p = 0.02), although significance disappeared after correction for multiple testing. Replication analysis of this variant, found no association in a Chinese schizophrenia cohort (293 pedigrees, n = 1163 subjects) or in a Japanese case-control sample (n = 4182 subjects). The DLG4 expression levels between postmortem brain samples from schizophrenia patients showed no significant changes from controls. Interestingly, a five marker haplotype in DLG4, involving rs2242449, rs17203281, rs390200, rs222853 and rs222837, was enriched in a population specific manner, where the sequences A-C-C-C-A and G-C-C-C-A accumulated in Japanese (p = 0.0009) and Chinese (p = 0.0007) schizophrenia pedigree samples, respectively. However, this could not be replicated in case-control samples. None of the variants in other examined candidate genes showed any significant association in these samples. The current study highlights a putative role for DLG4 in schizophrenia pathogenesis, evidenced by haplotype association, and warrants further dense screening for variants within these haplotypes. PMID:23936182

  9. Localization of the familial Mediterranean fever gene (FMF) to a 250-kb interval in non-Ashkenazi Jewish founder haplotypes

    SciTech Connect

    1996-09-01

    Chromosome 16p13.3 harbors a gene (MEF) associated with familial Mediterranean fever (FMF), a recessive disease very common in populations of Mediterranean ancestry. In the course of positional cloning of MEF, we genotyped 26 non-Ashkenazi Jewish FMF pedigrees (310 meioses) with 15 microsatellite markers, most of which were recently developed by Genethon. Identification of recombination events in the haplotypes allowed narrowing of the MEF interval to a region between D16S3124 (telomeric) and D16S475 (centromeric). Two markers, D16S3070 and D16S3275, a microsatellite marker isolated from a YAC that also contains D16S3070, showed no recombination with the disease. Linkage disequilibrium and haplotype analysis high-lighted the existence of a founder haplotype in our population. The core ancestral alleles were present in 71% of MEF-bearing chromosomes at loci D16S3070 and D16S3275. Furthermore, identification of historical crossing-over events in these pedigrees indicated that MEF is located between these two loci, which are both contained in a 250-kb genomic fragment. 24 refs., 4 figs., 3 tabs.

  10. HLA gene and haplotype frequencies in Russians, Bashkirs and Tatars, living in the Chelyabinsk Region (Russian South Urals).

    PubMed

    Suslova, T A; Burmistrova, A L; Chernova, M S; Khromova, E B; Lupar, E I; Timofeeva, S V; Devald, I V; Vavilov, M N; Darke, C

    2012-10-01

    We have characterized the HLA-A, -B, -DRB1, -DQA1 and -DQB1 profiles of three major ethnic groups living in Chelyabinsk Region of Russian South Urals, viz., Russians (n = 207), Bashkirs (n = 146) and Tatars (n = 135). First field level typing was performed by PCR using sequence-specific primers. Estimates included carriage and gene frequencies, linkage disequilibrium and its significance and related values. Population comparisons were made between the allele family frequencies of the three populations and between these populations and 20 others using a dendrogram. Chelyabinsk Region Russians demonstrate all the features typical of a Caucasoid population, but also have some peculiarities. Together with Tatars, Russians have high frequencies of allele families and haplotypes characteristic of Finno-Ugric populations. This presupposes a Finno-Ugric impact on Russian and Tatar ethnogenesis. However, this was not apparent in Bashkirs, the first of the three populations to live in this territory, and implies admixture with populations of a Finno-Ugric origin with precursors of Russians and Tatars before they came to the South Urals. The Bashkirs appear close to Mongoloids in allele and haplotype distribution. However, Bashkirs cannot be labelled either as typical Mongoloids or as Caucasoids. Thus, Bashkirs possess some alleles and haplotypes frequent in Mongoloids, which supports the Turkic impact on Bashkir ethnogenesis, but also possess the AH 8.1 haplotype, which could evidence an ancient Caucasoid population that took part in their ethnic formation or of recent admixture with adjacent populations (Russians and Tatars). Bashkirs showed no features of populations with a substantial Finno-Ugric component, for example Chuvashes or Russian Saami. This disputes the commonly held belief of a Finno-Ugric origin for Bashkirs. Tatars appeared close to many European populations. However, they possessed some characteristics of Asiatic populations possibly reflecting a Mongoloid

  11. Nonrandom association of polymorphic restriction sites in the beta-globin gene cluster.

    PubMed

    Antonarakis, S E; Boehm, C D; Giardina, P J; Kazazian, H H

    1982-01-01

    By using probes for epsilon-, Psibeta(1)-, and beta-globin genes, we found four additional polymorphic restriction sites that have frequencies >0.1 in persons of Mediterranean area origin, Asian Indians, and American Blacks. Three of these (HincII sites) and the two previously described polymorphic HindIII sites [one in intervening sequence (IVS) II of each gamma-globin gene] are distributed over 32 kilobases (kb) of DNA located 5' to the delta-globin gene. This region of DNA comprises two-thirds of the beta-globin gene cluster. Since each of these five polymorphic sites can be present (+) or absent (-), in theory there exist 32 possible combinations of sites (haplotypes). However, in Italians, Greeks, Indians, and Turks, 3 of the 32 haplotypes, (+----), (-+-++), and (-++-+), account for 92% of 89 beta(A) chromosomes examined. The observed frequencies for these haplotypes are 0.64, 0.15, and 0.13 in the populations studied, in contrast to expected frequencies (based on the observed gene frequencies at each of the five sites) of 0.20, 0.006, and 0.005, respectively. In American Blacks, a fourth haplotype, (----+), which is rare in non-Black populations, has a frequency of 0.37 in contrast to its expected frequency of 0.05. These results suggest a nonrandom association of DNA sequences over 32 kb 5' to the delta-globin gene in all populations studied. Two other polymorphic sites 3' to the delta gene (the newly discovered Ava II site in IVS II of the beta-globin gene and the BamHI site 3' to it) are nonrandomly associated with each other but randomly distributed with respect to the above haplotypes. This suggests that randomization of sequences has occurred within 12 kb of DNA between these two nonrandomly associated sequence clusters. Nonrandom association of polymorphic restriction sites has practical consequences in that it limits the usefulness of these additional HincII sites for prenatal diagnosis of hemoglobinopathies by linkage analysis. These sites provide

  12. Phylogenomic analyses of KCNA gene clusters in vertebrates: why do gene clusters stay intact?

    PubMed Central

    Hoegg, Simone; Meyer, Axel

    2007-01-01

    Background Gene clusters are of interest for the understanding of genome evolution since they provide insight in large-scale duplications events as well as patterns of individual gene losses. Vertebrates tend to have multiple copies of gene clusters that typically are only single clusters or are not present at all in genomes of invertebrates. We investigated the genomic architecture and conserved non-coding sequences of vertebrate KCNA gene clusters. KCNA genes encode shaker-related voltage-gated potassium channels and are arranged in two three-gene clusters in tetrapods. Teleost fish are found to possess four clusters. The two tetrapod KNCA clusters are of approximately the same age as the Hox gene clusters that arose through duplications early in vertebrate evolution. For some genes, their conserved retention and arrangement in clusters are thought to be related to regulatory elements in the intergenic regions, which might prevent rearrangements and gene loss. Interestingly, this hypothesis does not appear to apply to the KCNA clusters, as too few conserved putative regulatory elements are retained. Results We obtained KCNA coding sequences from basal ray-finned fishes (sturgeon, gar, bowfin) and confirmed that the duplication of these genes is specific to teleosts and therefore consistent with the fish-specific genome duplication (FSGD). Phylogenetic analyses of the genes suggest a basal position of the only intron containing KCNA gene in vertebrates (KCNA7). Sistergroup relationships of KCNA1/2 and KCNA3/6 support that a large-scale duplication gave rise to the two clusters found in the genome of tetrapods. We analyzed the intergenic regions of KCNA clusters in vertebrates and found that there are only a few conserved sequences shared between tetrapods and teleosts or between paralogous clusters. The orthologous teleost clusters, however, show sequence conservation in these regions. Conclusion The lack of overall conserved sequences in intergenic regions

  13. Tyrosinase-positive oculocutaneous albinism in Southern African blacks: P gene-associated haplotypes suggest a major mutation in the 5{prime} region of the gene

    SciTech Connect

    Ramsay, M.; Stevens, G.; Beukering, J. van

    1994-09-01

    Tyrosinase-positive oculocutaneous albinism (ty-pos OCA) occurs with a prevalence of 1 in 3900 among Southern African (SA) blacks. The major contributors to morbidity and mortality are skin cancer and decreased visual acuity. Two distinct phenotypes occur, namely individuals with ephelides (darkly pigmented patches) and those without. There is complete concordance with regard to ephelus status among siblings. The disorder is linked to markers on chromosome 15q11.2-q12, and no obligatory cross-overs were observed with polymophic markers at the human homolog, P, of the mouse pink eyed dilute gene, p. Contrary to what has been shown for Caucasoid ty-pos OCA, this condition shows locus homogeneity among SA blacks. The P gene is an excellent candidate for ty-pos OCA and mutations in this gene will confirm its role in causing the common form of albinism in SA. Numerous P gene mutations have been described in other populations. In an attempt to detect mutations, the P gene cDNA was used to search for structural rearrangements or polymorphisms. Six polymorphisms (plR10/Scal, 912/Xbal, 912/HincII, 912/TaqI, 1412/TaqI [two systems] and 1412/HindIII) were detected with subclones of the P cDNA and haplotypes were determined in each family. None were clearly associated with an albinism-related rearrangement. However, strong linkage disequilibrium was observed with alleles at loci toward the 5{prime} region of the gene ({triangle}=0.65, 0.57 and 0.80 for the three polymorphisms detected with the 912 subclone), suggesting a major ty-pos OCA mutation in this region. Haplotype analysis provides evidence for a major mutation associated with the same haplotype in individuals with ephelides (8/12 OCA chromosomes) and those without ephelides (24:30). The presence of other ty-pos OCA associated haplotypes indicates several other less common mutations.

  14. Comparison of the power of haplotype-based versus single- and multilocus association methods for gene × environment (gene × sex) interactions and application to gene × smoking and gene × sex interactions in rheumatoid arthritis

    PubMed Central

    Dempfle, Astrid; Hein, Rebecca; Beckmann, Lars; Scherag, André; Nguyen, Thuy Trang; Schäfer, Helmut; Chang-Claude, Jenny

    2007-01-01

    Accounting for interactions with environmental factors in association studies may improve the power to detect genetic effects and may help identifying important environmental effect modifiers. The power of unphased genotype-versus haplotype-based methods in regions with high linkage disequilibrium (LD), as measured by D', for analyzing gene × environment (gene × sex) interactions was compared using the Genetic Analysis Workshop 15 (GAW15) simulated data on rheumatoid arthritis with prior knowledge of the answers. Stepwise and regular conditional logistic regression (CLR) was performed using a matched case-control sample for a HLA region interacting with sex. Haplotype-based analyses were performed using a haplotype-sharing-based Mantel statistic and a test for haplotype-trait association in a general linear model framework. A step-down minP algorithm was applied to derive adjusted p-values and to allow for power comparisons. These methods were also applied to the GAW15 real data set for PTPN22. For markers in strong LD, stepwise CLR performed poorly because of the correlation/collinearity between the predictors in the model. The power was high for detecting genetic main effects using simple CLR models and haplotype-based methods and for detecting joint effects using CLR and Mantel statistics. Only the haplotype-trait association test had high power to detect the gene × sex interaction. In the PTPN22 region with markers characterized by strong LD, all methods indicated a significant genotype × sex interaction in a sample of about 1000 subjects. The previously reported R620W single-nucleotide polymorphism was identified using logistic regression, but the haplotype-based methods did not provide any precise location information. PMID:18466575

  15. A common haplotype associated with the Basque 2362AG --> TCATCT mutation in the muscular calpain-3 gene.

    PubMed

    Cobo, Ana María; Sáenz, Ametz; Poza, Juan José; Urtasun, Miguel; Indakoetxea, Begoña; Urtizberea, Jon Andoni; López de Munain, Adolfo; Calafell, Francesc

    2004-10-01

    Limb-girdle muscular dystrophy type 2A (LGMD2A) is caused by any of over 150 mutations in the calpain-3 (CAPN3) gene. Of those, 2362AG --> TCATCT is particularly prevalent in Basque patients, and this mutation was hypothesized to have arisen in the Basque Country. To explore the natural history of this mutation, we genotyped 65 Basque and non-Basque patients with LGMD2A who carry the 2362AG --> TCATCT mutation for four microsatellites within or flanking the gene. A particular haplotype was found in three-fourths of the patients and was assumed to be ancestral. From the average number of recombinations and mutations accumulated from this ancestral haplotype, the age of the 2362AG ----> TCATCT mutation was estimated to be 50 generations (i.e., 1,250 years), which is more recent than the Paleolithic Basque heritage. The subsequent spread of the 2362AG --> TCATCT mutation can be related to gene flow out of the Basque Country, even across a cultural border.

  16. Genotype variability and haplotype profile of ABCB1 (MDR1) gene polymorphisms in Macedonian population.

    PubMed

    Naumovska, Zorica; Nestorovska, Aleksandra K; Sterjev, Zoran; Filipce, Ana; Dimovski, Aleksandar; Suturkova, Ljubica

    2014-01-01

    The aim of this study was to evaluate the most common ABCB1 (MDR1, P-glycoprotein) polymorphisms in the population of R. Macedonia and compare the allele and haplotype frequencies with the global geographic data reported from different ethnic populations. The total of 107 healthy Macedonian individuals from the general population was included. Genotypes for the ABCB1 for three polymorphisms C1236T [rs1128503], G2677A/T [rs2032582] and C3435T [rs1045642] were analyzed by Real-Time PCR. Obtained allele frequencies for these three SNPs were similar to those observed in other European Caucasians. The detected genotype frequencies were 33.6% for 1236CC, 44.9% for 1236CT and 21.5% for 1236TT in exon 12; 32.7%, 44.9% and 22.4% for 2677GG, 2677GT and 2677GT consecutively in exon 21; and 25.2% for 3435CC, 52.3% for 3435CT and 22.5% for 3435TT in exon 26.Strong LD was observed in our study among all three SNPs with the highest association confirmed for C1236T and G2677T ((D'=0.859, r2=0.711). Eight different haplotypes were identified and the most prominent was the CGC haplotype (45.3%). Our study was the first to have documented the distribution of ABCB1 alleles, genotypes and haplotypes in the population of R. Macedonia. The obtained results can help in the prediction of different response to the drugs that are P-glycoprotein substrates. Additionally, in the era of individualized medicine the determination of the P-glycoprotein genotype might be a good predictive marker for determination of the subpopulations with higher risk to certain diseases.

  17. Polymorphism in the upstream regulatory region of DQA1 genes and DRB1, QAP, DQA1, and DQB1 haplotypes in the German population.

    PubMed

    Haas, J P; Kimura, A; Andreas, A; Hochberger, M; Keller, E; Brünnler, G; Bettinotti, M P; Nevinny-Stickel, C; Hildebrandt, B; Sierp, G

    1994-01-01

    Polymorphism in the URR of the MHC class II DQA1 gene defines ten different alleles named QAP. Oligotyping for the alleles of DRB1, QAP, DQA1, and DQB1 have been performed in 210 unrelated healthy controls from Germany. Moreover, 83 HTCs from the Tenth IHWS have been tested. Four point loci haplotypes (DRB1, QAP, DQA1, and DQB1) have been analyzed in the unrelated healthy population sample. Computer analysis of the linkage disequilibria leads to the conclusion that QAP alleles are in strong linkage disequilibrium with alleles either the DQA1 or the DRB1 locus. One typical ("common") haplotype was found to be associated with each DRB1 allele in the majority (86%) of the tested persons. Apart from that, 25 other less frequent ("unusual") haplotypes, with an overall frequency of 14% have been defined. Some of these "unusual" MHC class II haplotypes were found to differ only in the regulatory alleles of DQA1 (QAP alleles) while they are identical for the alleles coding for structural elements (DRB1, DQA1, and DQB1). Most of the "unusual" haplotypes were found to carry HLA-DQ6. Assuming that "unusual" (= rare) haplotypes have arisen from "common" (= frequent) haplotypes by point mutation and recombination, we propose the existence of three recombination sites in the MHC DR-DQ region: one between DRB1 and QAP, the second between QAP and DQA1, and the third between DQA1 and DQB1.

  18. A Putatively Functional Haplotype in the Gene Encoding Transforming Growth Factor Beta-1 as a Potential Biomarker for Radiosensitivity

    SciTech Connect

    Schirmer, Markus A.; Brockmoeller, Juergen; Rave-Fraenk, Margret; Virsik, Patricia; Wilken, Barbara; Kuehnle, Elna; Campean, Radu; Hoffmann, Arne O.; Mueller, Katarina; Goetze, Robert G.; Neumann, Michael; Janke, Joerg H.; Nasser, Fatima; Wolff, Hendrik A.; Ghadimi, B. Michael; Schmidberger, Heinz; Hess, Clemens F.; Christiansen, Hans; Hille, Andrea

    2011-03-01

    Purpose: To determine whether genetic variability in TGFB1 is related to circulating transforming growth factor-{beta}1 (TGF-{beta}1) plasma concentrations after radiotherapy and to radiosensitivity of lymphoid cells. Patients and Methods: Transforming growth factor-{beta}1 plasma concentrations (n = 79) were measured in patients 1 year after radiotherapy and chromosomal aberrations (n = 71) ex vivo before therapy start. Furthermore, TGF-{beta}1 secretion and apoptosis were measured in isolated peripheral blood mononuclear cells of 55 healthy volunteers. These phenotypes were analyzed in relation to five germline polymorphisms in the 5' region of the TGFB1 gene. Because of high linkage disequilibrium, these five polymorphisms reflect frequent genetic variation in this region. A presumed impact of TGF-{beta}1 on DNA damage or repair was measured as micronucleus formation in 30 lymphoblastoid cell lines. Results: We identified a hypofunctional genetic haplotype termed H3 tagging the 5' region of the TGFB1 gene encoding TGF-{beta}1. H3 was associated with lower TGF-{beta}1 plasma concentrations in patients (p = 0.01) and reduced TGF-{beta}1 secretion in irradiated peripheral blood mononuclear cells (p = 0.003). Furthermore, cells with H3 were less prone to induction of chromosomal aberrations (p = 0.001) and apoptosis (p = 0.003) by irradiation. The hypothesis that high TGF-{beta}1 could sensitize cells to DNA damage was further supported by increased micronuclei formation in 30 lymphoblastoid cell lines when preincubated with TGF-{beta}1 before irradiation (p = 0.04). Conclusions: On the basis of TGF-{beta}1 plasma levels and radiation sensitivity of lymphoid cells, this study revealed a putatively hypofunctional TGFB1 haplotype. The significance of this haplotype and the suggested link between TGF-{beta}1 function and DNA integrity should be further explored in other cell types, as well as other experimental and clinical conditions.

  19. Asian population frequencies and haplotype distribution of killer cell immunoglobulin-like receptor (KIR) genes among Chinese, Malay, and Indian in Singapore.

    PubMed

    Lee, Yi Chuan; Chan, Soh Ha; Ren, Ee Chee

    2008-11-01

    Killer cell immunoglobulin-like receptors (KIR) gene frequencies have been shown to be distinctly different between populations and contribute to functional variation in the immune response. We have investigated KIR gene frequencies in 370 individuals representing three Asian populations in Singapore and report here the distribution of 14 KIR genes (2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 2DS1, 2DS2, 2DS3, 2DS4, 2DS5, 3DL1, 3DL2, 3DL3, 3DS1) with two pseudogenes (2DP1, 3DP1) among Singapore Chinese (n = 210); Singapore Malay (n = 80), and Singapore Indian (n = 80). Four framework genes (KIR3DL3, 3DP1, 2DL4, 3DL2) and a nonframework pseudogene 2DP1 were detected in all samples while KIR2DS2, 2DL2, 2DL5, and 2DS5 had the greatest significant variation across the three populations. Fifteen significant linkage patterns, consistent with associations between genes of A and B haplotypes, were observed. Eighty-four distinct KIR profiles were determined in our populations, 38 of which had not been described in other populations. KIR haplotype studies were performed using nine Singapore Chinese families comprising 34 individuals. All genotypes could be resolved into corresponding pairs of existing haplotypes with eight distinct KIR genotypes and eight different haplotypes. The haplotype A2 with frequency of 63.9% was dominant in Singapore Chinese, comparable to that reported in Korean and Chinese Han. The A haplotypes predominate in Singapore Chinese, with ratio of A to B haplotypes of approximately 3:1. Comparison with KIR frequencies in other populations showed that Singapore Chinese shared similar distributions with Chinese Han, Japanese, and Korean; Singapore Indian was found to be comparable with North Indian Hindus while Singapore Malay resembled the Thai.

  20. A Gene-Oriented Haplotype Comparison Reveals Recently Selected Genomic Regions in Temperate and Tropical Maize Germplasm

    PubMed Central

    Zhang, Jie; Li, Yongxiang; Zheng, Jun; Zhang, Hongwei; Yang, Xiaohong; Wang, Jianhua; Wang, Guoying

    2017-01-01

    The extensive genetic variation present in maize (Zea mays) germplasm makes it possible to detect signatures of positive artificial selection that occurred during temperate and tropical maize improvement. Here we report an analysis of 532,815 polymorphisms from a maize association panel consisting of 368 diverse temperate and tropical inbred lines. We developed a gene-oriented approach adapting exonic polymorphisms to identify recently selected alleles by comparing haplotypes across the maize genome. This analysis revealed evidence of selection for more than 1100 genomic regions during recent improvement, and included regulatory genes and key genes with visible mutant phenotypes. We find that selected candidate target genes in temperate maize are enriched in biosynthetic processes, and further examination of these candidates highlights two cases, sucrose flux and oil storage, in which multiple genes in a common pathway can be cooperatively selected. Finally, based on available parallel gene expression data, we hypothesize that some genes were selected for regulatory variations, resulting in altered gene expression. PMID:28099470

  1. Clustering Genes of Common Evolutionary History.

    PubMed

    Gori, Kevin; Suchan, Tomasz; Alvarez, Nadir; Goldman, Nick; Dessimoz, Christophe

    2016-06-01

    Phylogenetic inference can potentially result in a more accurate tree using data from multiple loci. However, if the loci are incongruent-due to events such as incomplete lineage sorting or horizontal gene transfer-it can be misleading to infer a single tree. To address this, many previous contributions have taken a mechanistic approach, by modeling specific processes. Alternatively, one can cluster loci without assuming how these incongruencies might arise. Such "process-agnostic" approaches typically infer a tree for each locus and cluster these. There are, however, many possible combinations of tree distance and clustering methods; their comparative performance in the context of tree incongruence is largely unknown. Furthermore, because standard model selection criteria such as AIC cannot be applied to problems with a variable number of topologies, the issue of inferring the optimal number of clusters is poorly understood. Here, we perform a large-scale simulation study of phylogenetic distances and clustering methods to infer loci of common evolutionary history. We observe that the best-performing combinations are distances accounting for branch lengths followed by spectral clustering or Ward's method. We also introduce two statistical tests to infer the optimal number of clusters and show that they strongly outperform the silhouette criterion, a general-purpose heuristic. We illustrate the usefulness of the approach by 1) identifying errors in a previous phylogenetic analysis of yeast species and 2) identifying topological incongruence among newly sequenced loci of the globeflower fly genus Chiastocheta We release treeCl, a new program to cluster genes of common evolutionary history (http://git.io/treeCl).

  2. Discovery, evaluation and distribution of haplotypes and new alleles of the Photoperiod-A1 gene in wheat.

    PubMed

    Muterko, Alexandr; Kalendar, Ruslan; Cockram, James; Balashova, Irina

    2015-05-01

    Photoperiod response in wheat is determined to a large extent by the homoeologous series of Photoperiod 1 (Ppd1) genes. In this study, Ppd-A1 genomic sequences from the 5' UTR and promoter region were analysed in 104 accessions of six tetraploid wheat species (Triticum dicoccoides, T. dicoccum, T. turgidum, T. polonicum, T. carthlicum, T. durum) and 102 accessions of six hexaploid wheat species (T. aestivum, T. compactum, T. sphaerococcum, T. spelta, T. macha, T. vavilovii). This data was supplemented with in silico analysis of publicly available sequences from 46 to 193 accessions of diploid and tetraploid wheat, respectively. Analysis of a region of the Ppd-A1 promoter identified thirteen haplotypes, which were divided in two haplogroups. Distribution of the Ppd-A1 haplogroups and haplotypes in wheat species, and their geographical distributions were analysed. Polymerase chain reaction combined with a heteroduplex mobility assay was subsequently used to efficiently discriminate between Ppd-A1 alleles, allowing identification of the Ppd-A1b haplotypes and haplogroups. The causes of anomalous migration of Ppd-A1 heteroduplexes in gels were found to be the localization of mismatches relative to the center of fragment, the cumulative effect of neighbouring polymorphic sites, and the location of mismatches within A/T-tracts. Analysis of the Ppd-A1 5' UTR in hexaploid wheat revealed a novel mutation within the "photoperiod critical" region in a subset of T. compactum accessions. This putative photoperiod insensitive allele (designated Ppd-A1a.4) includes a 684 bp deletion which spans region in common with deletions previously identified in other photoperiod insensitive Ppd1 alleles.

  3. An analysis of linkage disequilibrium in the interleukin-1 gene cluster, using a novel grouping method for multiallelic markers.

    PubMed Central

    Cox, A; Camp, N J; Nicklin, M J; di Giovine, F S; Duff, G W

    1998-01-01

    In population- and family-based association studies, it is useful to have some knowledge of the patterns of linkage disequilibrium that exist between markers in candidate regions. When such studies are carried out with multiallelic markers, it is often convenient to group the alleles into a biallelic system, for analysis. In this study, we specifically examined the interleukin-1 (IL-1) gene cluster on chromosome 2, a region containing candidates for many inflammatory and autoimmune disorders. Data were collected on eight markers, four of which were multiallelic. Using these data, we investigated the effect of three allele-grouping strategies, including a novel method, on the detection of linkage disequilibrium. The novel approach, termed the "delta method," measures the deviation from the expected haplotype frequencies under linkage equilibrium, for each allelic combination. This information is then used to group the alleles, in an attempt to avoid the grouping together of alleles at one locus that are in opposite disequilibrium with the same allele at the second locus. The estimate haplotype frequencies (EH) program was used to estimate haplotype frequencies and the disequilibrium measure. In our data it was found that the delta method compared well with the other two strategies. Using this method, we found that there was a reasonable correlation between disequilibrium and physical distance in the region (r=-.540, P=.001, one-tailed). We also identified a common, eight-locus haplotype of the IL-1 gene cluster. PMID:9545388

  4. A population association study of vitamin D receptor gene polymorphisms and haplotypes with the risk of systemic lupus erythematosus in a Chinese population.

    PubMed

    Chen, Xu-E; Chen, Pu; Chen, Shan-Shan; Lu, Jin; Ma, Ting; Shi, Guang; Zhou, Ya; Li, Ji; Sheng, Liang

    2017-03-11

    The aim of the study is to investigate the association of vitamin D receptor (VDR) gene polymorphism, additional gene-gene interaction, and haplotype combination with systemic lupus erythematosus (SLE) risk. Pairwise linkage disequilibrium (LD) analysis was conducted using SNPstats. The association between four SNPs within VDR gene and SLE risk was investigated by logistic regression. Generalized multifactor dimensionality reduction (GMDR) was used to analyze the interaction among four SNPs. Four SNPs within VDR gene were selected for genotyping in this study, including rs2228570, rs1544410, rs7975232, and rs731236. The T allele of rs2228570 and the G allele of the rs1544410 were associated with increased MM risk, adjusted ORs (95%CI) were 1.61(1.25-2.11) and 1.78 (1.34-2.23), respectively. GMDR analysis suggested a significant two-locus model (P = 0.0010) involving rs1544410 and rs2228570, and in this model, the cross-validation consistency was 10/10, and the testing accuracy was 62.70%. The haplotype analysis indicated that the most common haplotype was rs1544410-A and rs7975232-G haplotype, the frequencies of which were 0.4701 and 0.5467 in case and control group. Haplotype containing the rs1544410-G and rs7975232-T alleles were associated with increased SLE risk, OR (95%CI) = 2.08 (1.47-2.72), P < 0.001. We found that rs2228570 and rs1544410 within VDR gene, their interaction and haplotype containing the rs1544410-G and rs7975232-T alleles were all associated with increased SLE risk.

  5. Analysis of the S-locus structure in Prunus armeniaca L. Identification of S-haplotype specific S-RNase and F-box genes.

    PubMed

    Romero, C; Vilanova, S; Burgos, L; Martínez-Calvo, J; Vicente, M; Llácer, G; Badenes, M L

    2004-09-01

    The gametophytic self-incompatibility (GSI) system in Rosaceae has been proposed to be controlled by two genes located in the S -locusan S-RNase and a recently described pollen expressed S -haplotype specific F-box gene (SFB). However, in apricot (Prunus armeniaca L.) these genes had not been identified yet. We have sequenced 21 kb in total of the S -locus region in 3 different apricot S -haplotypes. These fragments contain genes homologous to the S-RNase and F-box genes found in other Prunus species, preserving their basic gene structure features and defined amino acid domains. The physical distance between the F-box and the S-RNase genes was determined exactly in the S2-haplotype (2.9 kb) and inferred approximately in the S 1-haplotype (< 49 kb) confirming that these genes are linked. Sequence analysis of the 5' flanking regions indicates the presence of a conserved region upstream of the putative TATA box in the S-RNase gene. The three identified S-RNase alleles (S1, S2 and S4) had a high allelic sequence diversity (75.3 amino acid identity), and the apricot F-box allelic variants (SFB1, SFB2 and SFB4) were also highly haplotype-specific (79.4 amino acid identity). Organ specific-expression was also studied, revealing that S1- and S2-RNases are expressed in style tissues, but not in pollen or leaves. In contrast, SFB1 and SFB2 are only expressed in pollen, but not in styles or leaves. Taken together, these results support these genes as candidates for the pistil and pollen S-determinants of GSI in apricot.

  6. Polymorphism at expressed DQ and DR loci in five common equine MHC haplotypes.

    PubMed

    Miller, Donald; Tallmadge, Rebecca L; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A; Antczak, Douglas F

    2017-03-01

    The polymorphism of major histocompatibility complex (MHC) class II DQ and DR genes in five common equine leukocyte antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine bacterial artificial chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next generation sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse.

  7. Decreased calcium pump expression in human erythrocytes is connected to a minor haplotype in the ATP2B4 gene.

    PubMed

    Zámbó, Boglárka; Várady, György; Padányi, Rita; Szabó, Edit; Németh, Adrienn; Langó, Tamás; Enyedi, Ágnes; Sarkadi, Balázs

    2017-02-03

    Plasma membrane Ca(2+)-ATPases are key calcium exporter proteins in most tissues, and PMCA4b is the main calcium transporter in the human red blood cells (RBCs). In order to assess the expression level of PMCA4b, we have developed a flow cytometry and specific antibody binding method to quantitatively detect this protein in the erythrocyte membrane. Interestingly, we found several healthy volunteers showing significantly reduced expression of RBC-PMCA4b. Western blot analysis of isolated RBC membranes confirmed this observation, and indicated that there are no compensatory alterations in other PMCA isoforms. In addition, reduced PMCA4b levels correlated with a lower calcium extrusion capacity in these erythrocytes. When exploring the potential genetic background of the reduced PMCA4b levels, we found no missense mutations in the ATP2B4 coding regions, while a formerly unrecognized minor haplotype in the predicted second promoter region closely correlated with lower erythrocyte PMCA4b protein levels. In recent GWA studies, SNPs in this ATP2B4 haplotype have been linked to reduced mean corpuscular hemoglobin concentrations (MCHC), and to protection against malaria infection. Our data suggest that an altered regulation of gene expression is responsible for the reduced RBC-PMCA4b levels that is probably linked to the development of human disease-related phenotypes.

  8. SNP analyses of growth factor genes EGF, TGF{beta}-1, and HGF reveal haplotypic association of EGF with autism

    SciTech Connect

    Toyoda, Takao; Thanseem, Ismail; Kawai, Masayoshi; Sekine, Yoshimoto; Nakamura, Kazuhiko; Anitha, Ayyappan; Suda, Shiro . E-mail: nakamura@hama-med.ac.jp; Yamada, Kazuo; Tsujii, Masatsugu |; Iwayama, Yoshimi; Hattori, Eiji; Toyota, Tomoko; Yoshikawa, Takeo; Miyachi, Taishi; Tsuchiya, Kenji; Sugihara, Gen-ichi; Matsuzaki, Hideo; Iwata, Yasuhide; Suzuki, Katsuaki; Mori, Norio |; Ouchi, Yasuomi |; Sugiyama, Toshiro; Takei, Nori

    2007-09-07

    Autism is a pervasive neurodevelopmental disorder diagnosed in early childhood. Growth factors have been found to play a key role in the cellular differentiation and proliferation of the central and peripheral nervous systems. Epidermal growth factor (EGF) is detected in several regions of the developing and adult brain, where, it enhances the differentiation, maturation, and survival of a variety of neurons. Transforming growth factor-{beta} (TGF{beta}) isoforms play an important role in neuronal survival, and the hepatocyte growth factor (HGF) has been shown to exhibit neurotrophic activity. We examined the association of EGF, TGF{beta}1, and HGF genes with autism, in a trio association study, using DNA samples from families recruited to the Autism Genetic Resource Exchange; 252 trios with a male offspring scored for autism were selected for the study. Transmission disequilibrium test revealed significant haplotypic association of EGF with autism. No significant SNP or haplotypic associations were observed for TGF{beta}1 or HGF. Given the role of EGF in brain and neuronal development, we suggest a possible role of EGF in the pathogenesis of autism.

  9. Single nucleotide polymorphisms, haplotype association and tumour expression of the vascular endothelial growth factor (VEGF) gene with lung carcinoma.

    PubMed

    Naykoo, Niyaz A; Dil-Afroze; Rasool, Roohi; Shah, Sonaullah; Ahangar, A G; Bhat, Imtiyaz A; Qasim, Iqbal; Siddiqi, Mushtaq A; Shah, Zafar A

    2017-04-15

    VEGF contains several polymorphic sites known to influence its expression. We examined the possible association between+405(-634)C>G,+936C>T,-2578C>A and lung cancer in 199 Kashmiri patients and 401 healthy controls. VEGF+405CG,+936CT+TT and-2578CA genotypes were significantly associated with lung cancer risk compared to VEGF+405CC,+936CC and-2578AA+CC genotypes [OR=0.07 (0.04-0.13), P<0.0001, OR=0.36 (0.25-0.52), P<0.0001 and 0.08 (0.05-0.13), P<0.0001]. Haplotype analysis revealed that CGA and TGA haplotypes of VEGF gene conveys the risk for lung cancer [OR=0.18 (0.10-0.33), P<0.0001 and 0.07 (0.03-0.13), P<0.0001]. VEGF expression revealed non-significant association with the genotypes of the three SNPs. In conclusion, the SNPs examined appear to influence lung cancer susceptibility while as genotypes of the SNPs don't appear to have significant association with VEGF mRNA expression in lung tumours.

  10. Interleukin-6 (IL-6) and receptor (IL6-R) gene haplotypes associate with amniotic fluid protein concentrations in preterm birth.

    PubMed

    Velez, Digna R; Fortunato, Stephen J; Williams, Scott M; Menon, Ramkumar

    2008-06-01

    Spontaneous preterm birth (PTB-gestational age <37 weeks) occurs in approximately 450 000 births annually in the United States and is one of the leading causes of neonatal morbidity and mortality. Risk of PTB is affected by complex gene-environment interactions that are not well understood. We examined the PTB candidate gene, Interleukin 6 (IL-6) and its receptor (IL6-R) in both Caucasian (145 PTB and 194 term maternal; 140 PTB and 179 term fetal) and African-American (76 PTB and 191 term maternal; 66 PTB and 183 term fetal) DNA. Eight single nucleotide polymorphisms (SNPs) in IL-6 and 22 SNPs in IL6R were examined for association with IL-6 amniotic fluid (AF) concentrations, as concentration of IL-6 is a hypothesized risk factor. In addition, IL-6 and IL6-R SNPs were analyzed for associations with PTB. Haplotype associations were tested by sliding windows. No strong single marker effects were observed in Caucasians; however, in African-American maternal IL-6R marker rs4553185 associated with PTB (allele P = 4.49 x 10(-3) and genotype P = 0.01). The strongest haplotype associations were observed in IL-6R with IL-6 cytokine concentration as outcome: Caucasian fetal (rs4601580-rs4845618) P = 1.6 x 10(-3) and African-American maternal (rs4601580-rs4845618-rs6687726-rs7549338) P = 2.30 x 10(-3). Significant results converged on three regions in the two genes: in IL-6 markers rs1800797, rs1800796 and rs1800795; in IL-6R markers rs4075015, rs4601580, rs4645618, rs6687726 and rs7549338 and markers rs4845623, rs4537545 and rs4845625. In conclusion, our results suggest that IL-6 AF concentration, in situations of PTB, result from variation in IL-6 and more importantly IL-6R.

  11. Combined promoter haplotypes of the IL10R genes are associated with protection against severe malaria in Gabonese children.

    PubMed

    Velavan, T P; Büyükyazici, Birgül; Kremsner, Peter G; Kun, Jürgen F J

    2012-02-01

    The critical barrier in control of infections remains the failure of the immune system to clear parasites despite antigen recognition. We examined and validated possible association of regulatory immune gene polymorphisms in a cohort of children with mild and severe malaria. We focussed on two precursors of the Interleukin 10 Receptor (IL10R) gene namely the IL10R alpha and IL10R beta that play a fundamental role in initiation of signal transduction. Initial screening across 40 Gabonese adult individuals revealed two promoter variants for the IL10R alpha and three for the IL10R beta precursor, respectively. Validation of these variants for their allelic gene expression by transient transfection assays exhibited an altered expression in rs56356146 and rs7925112 of the IL10R alpha (P < 0.5); rs8178435 and rs999788 in the IL10R beta constructs (P < 0.0001), respectively. We further investigated the functional role of those SNP variants exhibiting altered expression in a cohort of children with mild and severe malaria. We genotyped 145 children with mild and 185 children with severe malaria for IL10R alpha; for IL10R beta, 102 children with mild and 101 children with severe malaria. We found that none of the SNP variants had any significant association neither in children with mild or severe malaria. The haplotype -185/-116 of IL10R alpha (TT) in combination with the haplotype -754/-750 of IL10R beta (AC) contributed towards mild malaria in comparison to severe malaria [TT + AC odds ratio of 0.73 (95% CI 0.56-0.94) P = 0.01]. This study may provide a better understanding on the role of IL10R promoter allelic variants contribution to a protective effect on the development of severe malaria.

  12. Comprehensive linkage and association analyses identify haplotype, near to the TNFSF15 gene, significantly associated with spondyloarthritis.

    PubMed

    Zinovieva, Elena; Bourgain, Catherine; Kadi, Amir; Letourneur, Franck; Izac, Brigitte; Said-Nahal, Roula; Lebrun, Nicolas; Cagnard, Nicolas; Vigier, Agathe; Jacques, Sébastien; Miceli-Richard, Corinne; Garchon, Henri-Jean; Heath, Simon; Charon, Céline; Bacq, Delphine; Boland, Anne; Zelenika, Diana; Chiocchia, Gilles; Breban, Maxime

    2009-06-01

    Spondyloarthritis (SpA) is a chronic inflammatory disorder with a strong genetic predisposition dominated by the role of HLA-B27. However, the contribution of other genes to the disease susceptibility has been clearly demonstrated. We previously reported significant evidence of linkage of SpA to chromosome 9q31-34. The current study aimed to characterize this locus, named SPA2. First, we performed a fine linkage mapping of SPA2 (24 cM) with 28 microsatellite markers in 149 multiplex families, which allowed us to reduce the area of investigation to an 18 cM (13 Mb) locus delimited by the markers D9S279 and D9S112. Second, we constructed a linkage disequilibrium (LD) map of this region with 1,536 tag single-nucleotide polymorphisms (SNPs) in 136 families (263 patients). The association was assessed using a transmission disequilibrium test. One tag SNP, rs4979459, yielded a significant P-value (4.9 x 10(-5)). Third, we performed an extension association study with rs4979459 and 30 surrounding SNPs in LD with it, in 287 families (668 patients), and in a sample of 139 cases and 163 controls. Strong association was observed in both familial and case/control datasets for several SNPs. In the replication study, carried with 8 SNPs in an independent sample of 232 cases and 149 controls, one SNP, rs6478105, yielded a nominal P-value<3 x 10(-2). Pooled case/control study (371 cases and 312 controls) as well as combined analysis of extension and replication data showed very significant association (P<5 x 10(-4)) for 6 of the 8 latter markers (rs7849556, rs10817669, rs10759734, rs6478105, rs10982396, and rs10733612). Finally, haplotype association investigations identified a strongly associated haplotype (P<8.8 x 10(-5)) consisting of these 6 SNPs and located in the direct vicinity of the TNFSF15 gene. In conclusion, we have identified within the SPA2 locus a haplotype strongly associated with predisposition to SpA which is located near to TNFSF15, one of the major candidate

  13. Comprehensive Linkage and Association Analyses Identify Haplotype, Near to the TNFSF15 Gene, Significantly Associated with Spondyloarthritis

    PubMed Central

    Zinovieva, Elena; Bourgain, Catherine; Kadi, Amir; Letourneur, Franck; Izac, Brigitte; Said-Nahal, Roula; Lebrun, Nicolas; Cagnard, Nicolas; Vigier, Agathe; Jacques, Sébastien; Miceli-Richard, Corinne; Garchon, Henri-Jean; Heath, Simon; Charon, Céline; Bacq, Delphine; Boland, Anne; Zelenika, Diana

    2009-01-01

    Spondyloarthritis (SpA) is a chronic inflammatory disorder with a strong genetic predisposition dominated by the role of HLA-B27. However, the contribution of other genes to the disease susceptibility has been clearly demonstrated. We previously reported significant evidence of linkage of SpA to chromosome 9q31–34. The current study aimed to characterize this locus, named SPA2. First, we performed a fine linkage mapping of SPA2 (24 cM) with 28 microsatellite markers in 149 multiplex families, which allowed us to reduce the area of investigation to an 18 cM (13 Mb) locus delimited by the markers D9S279 and D9S112. Second, we constructed a linkage disequilibrium (LD) map of this region with 1,536 tag single-nucleotide polymorphisms (SNPs) in 136 families (263 patients). The association was assessed using a transmission disequilibrium test. One tag SNP, rs4979459, yielded a significant P-value (4.9×10−5). Third, we performed an extension association study with rs4979459 and 30 surrounding SNPs in LD with it, in 287 families (668 patients), and in a sample of 139 cases and 163 controls. Strong association was observed in both familial and case/control datasets for several SNPs. In the replication study, carried with 8 SNPs in an independent sample of 232 cases and 149 controls, one SNP, rs6478105, yielded a nominal P-value<3×10−2. Pooled case/control study (371 cases and 312 controls) as well as combined analysis of extension and replication data showed very significant association (P<5×10−4) for 6 of the 8 latter markers (rs7849556, rs10817669, rs10759734, rs6478105, rs10982396, and rs10733612). Finally, haplotype association investigations identified a strongly associated haplotype (P<8.8×10−5) consisting of these 6 SNPs and located in the direct vicinity of the TNFSF15 gene. In conclusion, we have identified within the SPA2 locus a haplotype strongly associated with predisposition to SpA which is located near to TNFSF15, one of the major candidate

  14. Combined clustering models for the analysis of gene expression

    SciTech Connect

    Angelova, M. Ellman, J.

    2010-02-15

    Clustering has become one of the fundamental tools for analyzing gene expression and producing gene classifications. Clustering models enable finding patterns of similarity in order to understand gene function, gene regulation, cellular processes and sub-types of cells. The clustering results however have to be combined with sequence data or knowledge about gene functionality in order to make biologically meaningful conclusions. In this work, we explore a new model that integrates gene expression with sequence or text information.

  15. Haplotyping the human T-cell receptor. beta. -chain gene complex by use of restriction fragment length polymorphisms

    SciTech Connect

    Charmley, P.; Chao, A.; Gatti, R.A. ); Concannon, P. ); Hood, L. )

    1990-06-01

    The authors have studied the genetic segregation of human T-cell receptor {beta}-chain (TCR{beta}) genes on chromosome 7q in 40 CEPH (Centre d'Etude du Polymorphisme Humain) families by using restriction fragment length polymorphisms (RFLPs). They constructed haplotypes from eight RFLPs by using variable- and constant-region cDNA probes, which detect polymorphisms that span more than 600 kilobases of the TCR{beta} gene complex. Analysis of allele distributions between TCR{beta} genes revealed significant linkage disequilibrium between only 6 of the 28 different pairs of RFLPs. This linkage disequilibrium strongly influences the most efficient order to proceed for typing of these RFLPs in order to achieve maximum genetic informativeness, which in this study revealed a 97.3% level of heterozygosity within the TCR{beta} gene complex. The results should provide new insight into recent reports of disease associations with the TCR{beta} gene complex and should assist in designing future experiments to detect or confirm the existence of disease-susceptibility loci in this region of the human genome.

  16. Polymorphisms and haplotypes of the CYP2B6 detoxification gene in the predisposition of Acute Myeloid Leukemia (AML) and induction of its cytogenetic abnormalities.

    PubMed

    Daraki, Aggeliki; Kakosaiou, Katerina; Zachaki, Sophia; Sambani, Constantina; Aleporou-Marinou, Vassiliki; Kollia, Panagoula; Manola, Kalliopi N

    2016-11-01

    CYP2B6 is a polymorphic detoxification gene which plays a vital role in the degradation of genotoxic compounds. In this study we hypothesized that inadequate detoxification due to CYP2B6 polymorphisms may contribute to AML. To evaluate the potential impact of CYP2B6 polymorphisms on AML development and induction of its specific chromosomal abnormalities we studied C(777)A and A(785)G polymorphisms for the first time in AML. Furthermore, we investigated the co-existence of the above polymorphisms with G(516)T polymorphism to determine the CYP2B6 high-risk haplotypes in AML susceptibility. Our study included 619 AML patients and 430 healthy donors. Concerning C(777)A CYP2B6 polymorphism, no significant difference was found between patients and controls. However, A(785)G CYP2B6 polymorphism showed a statistically higher frequency of the variant genotypes in patients (48.2%), mainly in secondary AML patients (49.1%) than in controls (26.1%). Moreover, an increased frequency of the variant genotypes was found in those with abnormal karyotypes, especially with -7/del(7q), -5/del(5q), +8, inv(16) and t(8;21). The combination of the three CYP2B6 polymorphisms (G(516)T, C(777)A & A(785)G) revealed seven haplotypes. Four out of six haplotypes with at least one mutant allele were significantly associated with an increased risk for AML. Interestingly, T516A777G785 haplotype, where the three mutant alleles co-existed, had ~3-fold increased risk to be found in patients than controls. The association between haplotypes and cytogenetic aberrations revealed a positive correlation between specific CYP2B6 haplotypes and AML cytogenetic abnormalities. Our data suggest that A(785)G CYP2B6 gene polymorphism and specific CYP2B6 haplotypes may contribute to AML and its specific chromosomal aberrations.

  17. Integrative mutation, haplotype and G × G interaction evidence connects ABGL4, LRP8 and PCSK9 genes to cardiometabolic risk

    PubMed Central

    Guo, Tao; Yin, Rui-Xing; Yao, Li-Mei; Huang, Feng; Pan, Ling; Lin, Wei-Xiong; Yang, De-Zhai; Pan, Shang-Ling

    2016-01-01

    This study is expected to investigate the association of ATP/GTP binding protein-like 4 (AGBL4), LDL receptor related protein 8 (LRP8) and proprotein convertase subtilisin/kexin type 9 (PCSK9) gene single nucleotide variants (SNVs) with lipid metabolism in 2,552 individuals (Jing, 1,272 and Han, 1,280). We identified 12 mutations in this motif. The genotype and allele frequencies of these variants were different between the two populations. Multiple-locus linkage disequilibrium (LD) elucidated the detected sites are not statistically independent. Possible integrative haplotypes and gene-by-gene (G × G) interactions, comprising mutations of the AGBL4, LRP8 and PCSK9 associated with total cholesterol (TC, AGBL4 G-G-A, PCSK9 C-G-A-A and G-G-A-A-C-A-T-T-T-G-G-A), triglyceride (TG, AGBL4 G-G-A, LRP8 G-A-G-C-C, PCSK9 C-A-A-G, A-A-G-G-A-G-C-C-C-A-A-G and A-A-G-G-A-G-C-C-C-G-A-A), HDL cholesterol (HDL-C, AGBL4 A-A-G and A-A-G-A-A-G-T-C-C-A-A-G) and the apolipoprotein(Apo)A1/ApoB ratio (A1/B, PCSK9 C-A-A-G) in Jing minority. However, in the Hans, with TG (AGBL4 G-G-A, LRP8 G-A-G-C-C, PCSK9 C-A-A-G, A-A-G-G-A-G-C-C-C-A-A-G and A-A-G-G-A-G-C-C-C-G-A-A), HDL-C (LRP8 A-A-G-T-C), LDL-C (LRP8 A-A-G-T-C and A-A-G-A-A-G-T-C-C-A-A-G) and A1/B (LRP8 A-C-A-T-T and PCSK9 C-A-A-G). Association analysis based on haplotype clusters and G × G interactions probably increased power over single-locus tests especially for TG. PMID:27853278

  18. Integrative mutation, haplotype and G × G interaction evidence connects ABGL4, LRP8 and PCSK9 genes to cardiometabolic risk.

    PubMed

    Guo, Tao; Yin, Rui-Xing; Yao, Li-Mei; Huang, Feng; Pan, Ling; Lin, Wei-Xiong; Yang, De-Zhai; Pan, Shang-Ling

    2016-11-17

    This study is expected to investigate the association of ATP/GTP binding protein-like 4 (AGBL4), LDL receptor related protein 8 (LRP8) and proprotein convertase subtilisin/kexin type 9 (PCSK9) gene single nucleotide variants (SNVs) with lipid metabolism in 2,552 individuals (Jing, 1,272 and Han, 1,280). We identified 12 mutations in this motif. The genotype and allele frequencies of these variants were different between the two populations. Multiple-locus linkage disequilibrium (LD) elucidated the detected sites are not statistically independent. Possible integrative haplotypes and gene-by-gene (G × G) interactions, comprising mutations of the AGBL4, LRP8 and PCSK9 associated with total cholesterol (TC, AGBL4 G-G-A, PCSK9 C-G-A-A and G-G-A-A-C-A-T-T-T-G-G-A), triglyceride (TG, AGBL4 G-G-A, LRP8 G-A-G-C-C, PCSK9 C-A-A-G, A-A-G-G-A-G-C-C-C-A-A-G and A-A-G-G-A-G-C-C-C-G-A-A), HDL cholesterol (HDL-C, AGBL4 A-A-G and A-A-G-A-A-G-T-C-C-A-A-G) and the apolipoprotein(Apo)A1/ApoB ratio (A1/B, PCSK9 C-A-A-G) in Jing minority. However, in the Hans, with TG (AGBL4 G-G-A, LRP8 G-A-G-C-C, PCSK9 C-A-A-G, A-A-G-G-A-G-C-C-C-A-A-G and A-A-G-G-A-G-C-C-C-G-A-A), HDL-C (LRP8 A-A-G-T-C), LDL-C (LRP8 A-A-G-T-C and A-A-G-A-A-G-T-C-C-A-A-G) and A1/B (LRP8 A-C-A-T-T and PCSK9 C-A-A-G). Association analysis based on haplotype clusters and G × G interactions probably increased power over single-locus tests especially for TG.

  19. Haplotype mapping and sequence analysis of the mouse Nramp gene predict susceptibility to infection with intracellular parasites

    SciTech Connect

    Malo, D.; Hu, Jinxin; Schurr, E.

    1994-09-01

    The mouse chromosome 1 locus Bcg (Ity, Lsh) controls the capacity of the tissue macrophage to restrict the replication of antigenically unrelated intracellular parasites and therefore determines the natural resistance (BCG-R, dominant) or susceptibility (BCG-S, recessive) of inbred mouse strains to infection with diverse pathogens. We have used a positional cloning strategy based on genetic and physical mapping, YAC cloning, and exon trapping to isolate a candidate gene for Beg (Nramp) that encodes a predicted macrophage-specific transport protein. We have analyzed a total of 27 inbred mouse strains of BCG-R and BCG-S phenotypes for the presence of nucleotide sequence variations within the coding portion of Nramp and have carried out haplotype typing of the corresponding chromosome 1 region in these mice, using 11 additional polymorphic markers mapping in the immediate vicinity of Nramp. cDNA cloning and nucleotide sequencing identified 5 nucleotide sequence variations within Nramp in the inbred strains.

  20. Complex haplotype structure of the human GNAS gene identifies a recombination hotspot centred on a single nucleotide polymorphism widely used in association studies.

    PubMed

    Yang, Wanling; White, Brook; Spicer, Eleanor K; Weinstein, Benjamin L; Hildebrandt, John D

    2004-11-01

    The alpha subunit of the heterotrimeric G protein Gs (Gsalpha) is involved in numerous physiological processes and is a primary determinant of cellular responses to extracellular signals. Genetic variations in the Gsalpha gene may play an important role in complex diseases and drug responses. To characterize the genetic diversity in this locus, we resequenced exons and flanking introns of the gene in 44 genomic samples and analysed the haplotype structure of the gene in an additional 50 African-Americans and 50 Caucasians. Significant differences in allele frequency for nearly all the genotyped single nucleotide polymorphism (SNPs) were detected between the two ethnic groups. Linkage disequilibrium (LD) analysis of this locus revealed two haplotype blocks characterized by strong LD and reduced haplotype diversity, especially in Caucasians. Between the two blocks is a narrow (approximately 3 kb) recombination hotspot centred on exons 4 and 5, and a widely used genetic marker in association studies in this region (rs7121) was in linkage equilibrium with the rest of the gene. The haplotype structure of the GNAS locus warrants reevaluation of previous association studies that used marker rs7121 and affects choice of SNP markers to be used in future studies of this locus.

  1. Association of polymorphisms and haplotypes in the cytochrome P450 1B1 gene with uterine leiomyoma: A case control study

    PubMed Central

    SALIMI, SAEEDEH; KHODAMIAN, MARYAM; NAROOIE-NEJAD, MEHRNAZ; HAJIZADEH, AZAM; FAZELI, KIMIA; NAMAZI, LIDA; YAGHMAEI, MINOO

    2015-01-01

    Uterine leiomyoma (UL) is an estrogen-dependent neoplasm of the uterus and estrogen metabolizing enzymes affect its promotion and progression. The aim of the present study was to evaluate the association between four single-nucleotide polymorphisms (SNPs) of the cytochrome P450 1B1 (CYP1B1) gene and UL risk. Four SNPs of the CYP1B1 gene in 105 UL patients and 112 unrelated healthy controls were genotyped using a direct sequencing method. Haplotype analyses were performed with UNPHASED software and linkage disequilibrium (LD) was assessed by Haploview software. There were no associations between Leu432Val (rs1056836), Asp449Asp (rs1056837) and Asn453Ser (rs1800440) polymorphisms of the CYP1B1 gene and UL. Although the genotypic frequencies of the Arg368His (rs79204362) polymorphism did not differ between the two groups, the frequency of A (His) allele was significantly higher in UL females (P=0.02). In addition, the frequency of GTAA haplotype was significantly higher in the controls and played a protective role in UL susceptibility. A strong LD between the three common SNPs (rs1056836, rs1056837 and rs1800440) in the CYP1B1 gene was observed in the population. In conclusion, a higher frequency of the CYP1B1 368His (A) allele was observed in UL females. The frequency of the GTAA haplotype was significantly higher in healthy females and this haplotype played a protective role in UL susceptibility. PMID:26075073

  2. Association of polymorphisms and haplotypes in the cytochrome P450 1B1 gene with uterine leiomyoma: A case control study.

    PubMed

    Salimi, Saeedeh; Khodamian, Maryam; Narooie-Nejad, Mehrnaz; Hajizadeh, Azam; Fazeli, Kimia; Namazi, Lida; Yaghmaei, Minoo

    2015-03-01

    Uterine leiomyoma (UL) is an estrogen-dependent neoplasm of the uterus and estrogen metabolizing enzymes affect its promotion and progression. The aim of the present study was to evaluate the association between four single-nucleotide polymorphisms (SNPs) of the cytochrome P450 1B1 (CYP1B1) gene and UL risk. Four SNPs of the CYP1B1 gene in 105 UL patients and 112 unrelated healthy controls were genotyped using a direct sequencing method. Haplotype analyses were performed with UNPHASED software and linkage disequilibrium (LD) was assessed by Haploview software. There were no associations between Leu432Val (rs1056836), Asp449Asp (rs1056837) and Asn453Ser (rs1800440) polymorphisms of the CYP1B1 gene and UL. Although the genotypic frequencies of the Arg368His (rs79204362) polymorphism did not differ between the two groups, the frequency of A (His) allele was significantly higher in UL females (P=0.02). In addition, the frequency of GTAA haplotype was significantly higher in the controls and played a protective role in UL susceptibility. A strong LD between the three common SNPs (rs1056836, rs1056837 and rs1800440) in the CYP1B1 gene was observed in the population. In conclusion, a higher frequency of the CYP1B1 368His (A) allele was observed in UL females. The frequency of the GTAA haplotype was significantly higher in healthy females and this haplotype played a protective role in UL susceptibility.

  3. Anonymous marker loci within 400 kb of HLA-A generate haplotypes in linkage disequilibrium with the hemochromatosis gene (HFE)

    SciTech Connect

    Yaouanq, J.; Perichon, M.; Treut, A.L.; Kahloun, A.E.; Mauvieux, V.; Blayau, M.; Jouanolle, A.M.; Chauvel, B.; Le Gall, J.Y.; David, V. )

    1994-02-01

    The hemochromatosis gene (HFE) maps to 6p21.3 and is less than 1 cM from the HLA class I gene; however, the precise physical location of the gene has remained elusive and controversial. The unambiguous identification of a crossover event within hemochromatosis families is very difficult; it is particularly hampered by the variability of the phenotypic expression as well as by the sex- and age-related penetrance of the disease. For these considerations, traditional linkage analysis could prove of limited value in further refining the extrapolated physical position of HFE. The authors therefore embarked upon a linkage-disequilibrium analysis of HFE and normal chromosomes for the Brittany population. In this report, 66 hemochromatosis families yielding 151 hemochromatosis chromosomes and 182 normal chromosomes were RFLP-typed with a battery of probes, including two newly derived polymorphic markers from the 6.7 and HLA-F loci located 150 and 250 kb telomeric to HLA-A, respectively. The results suggest a strong peak of existing linkage disequilibrium focused within the i82-to-6.7 interval (approximately 250 kb). The zone of linkage disequilibrium is flanked by the i97 locus, positioned 30 kb proximal to i82, and the HLA-F gene, found 250 kb distal to HLA-A, markers of which display no significant association with HFE. These data support the possibility that HFE resides within the 400-kb expanse of DNA between i97 and HLA-F. Alternatively, the very tight association of HLA-A3 and allele 1 of the 6.7 locus, both of which are comprised by the major ancestral or founder HFE haplotype in Brittany, supports the possibility that the disease gene may reside immediately telomeric to the 6.7 locus within the linkage-disequilibrium zone. Additionally, hemochromatosis haplotypes possessing HLA-A11 and the low-frequency HLA-F polymorphism (allele 2) are supportive of a separate founder chromosome containing a second, independently arising mutant allele. 69 refs., 1 fig., 5 tabs.

  4. Association between ovocalyxin-32 gene haplotypes and eggshell quality traits in an F2 intercross between two chicken lines divergently selected for eggshell strength.

    PubMed

    Takahashi, H; Sasaki, O; Nirasawa, K; Furukawa, T

    2010-10-01

    Broken and cracked eggshells contribute significantly to economic losses in the egg production industry. We previously identified ovocalyxin-32 as a potential gene influencing eggshell traits, by analysing an intercross between two parent lines developed from the same founder population by a two-way selection for eggshell strength with non-destructive deformation (DEF) conducted over 14 generations. We determined the nucleotide sequences of six ovocalyxin-32 exons in the parent individuals and analysed the association between ovocalyxin-32 and eggshell traits in the F2 individuals. We identified three haplotypes (W, M and S) of ovocalyxin-32 in the parent individuals. A mismatch amplification mutation assay was performed to distinguish six diplotype individuals (WW, MM, SS, WM, MS and WS) inthe F2 population. The egg weight (EW) of SS-diplotype individuals was significantly higher than that of WW-, WM- and WS-diplotypes. Short length of the egg (SLE) of SS-diplotype individuals was significantly higher than that of WW-, WM- and MS-diplotypes. Long length of the egg (LLE) of SS-diplotype individuals was significantly higher than that of WM and WS-diplotypes. DEF of WW-diplotype individuals was significantly higher than that ofSS-, WM, MS and WM-diplotypes. Haplotypic effect analyses showed significant differences between the W-haplotype and the S-haplotypes in the EW, SLE, LLE and DEF. The DEF of M-haplotype was significantly lower than that of W- and S-haplotypes. These results suggest that S- and M-haplotypes are critical for high quality of eggshells in the F2 population. In conclusion, ovocalyxin-32 is a useful marker of eggshell traits and can be used to develop strategies for improving eggshell traits in commercial layer houses.

  5. Efficient clustering of identity-by-descent between multiple individuals

    PubMed Central

    Qian, Yu; Browning, Brian L.; Browning, Sharon R.

    2014-01-01

    Motivation: Most existing identity-by-descent (IBD) detection methods only consider haplotype pairs; less attention has been paid to considering multiple haplotypes simultaneously, even though IBD is an equivalence relation on haplotypes that partitions a set of haplotypes into IBD clusters. Multiple-haplotype IBD clusters may have advantages over pairwise IBD in some applications, such as IBD mapping. Existing methods for detecting multiple-haplotype IBD clusters are often computationally expensive and unable to handle large samples with thousands of haplotypes. Results: We present a clustering method, efficient multiple-IBD, which uses pairwise IBD segments to infer multiple-haplotype IBD clusters. It expands clusters from seed haplotypes by adding qualified neighbors and extends clusters across sliding windows in the genome. Our method is an order of magnitude faster than existing methods and has comparable performance with respect to the quality of clusters it uncovers. We further investigate the potential application of multiple-haplotype IBD clusters in association studies by testing for association between multiple-haplotype IBD clusters and low-density lipoprotein cholesterol in the Northern Finland Birth Cohort. Using our multiple-haplotype IBD cluster approach, we found an association with a genomic interval covering the PCSK9 gene in these data that is missed by standard single-marker association tests. Previously published studies confirm association of PCSK9 with low-density lipoprotein. Availability and implementation: Source code is available under the GNU Public License http://cs.au.dk/~qianyuxx/EMI/. Contact: qianyuxx@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24363374

  6. Haplotype variation at Badh2, the gene determining fragrance in rice.

    PubMed

    Shao, Gaoneng; Tang, Shaoqing; Chen, Mingliang; Wei, Xiangjin; He, Jiwai; Luo, Ju; Jiao, Guiai; Hu, Yichao; Xie, Lihong; Hu, Peisong

    2013-02-01

    Fragrance is an important component of end-use quality in rice. A set of 516 fragrant rice accessions were genotyped and over 80% of them carried the badh2.7 allele. A subset of 144 mostly fragrant accessions, including nine of Oryza rufipogon, was then subjected to a detailed diversity and haplotype analysis. The level of linkage disequilibrium in the Badh2 region was higher among the fragrant accessions. Re-sequencing in the Badh2 region showed that badh2.7, badh2.2 and badh2.4-5 all arose in the japonica genepool, and spread later into the indica genepool as a result of deliberate crossing. However, loss-of-function alleles of Badh2 are also found in the indica genepools, and then transferred into japonica. Evidence for three new possible FNPs was obtained from the Badh2 sequence of 62 fragrant accessions. Based on these data, we have elaborated a model for the evolution of Badh2 and its participation in the rice domestication process.

  7. Gene polymorphisms of novel immunotolerant molecule BTLA: distribution of alleles, genotypes and haplotypes in Polish Caucasian population.

    PubMed

    Partyka, Anna; Woszczyk, Dariusz; Strzała, Tomasz; Szczepańska, Anna; Tomkiewicz, Anna; Frydecka, Irena; Karabon, Lidia

    2015-02-01

    B and T lymphocyte attenuator (BTLA) is one of the members of immunoglobulin superfamily which, like CTLA-4 and PD-1, is involved in down regulation of immune response. Despite the important role of BTLA in maintaining immune homeostasis, relatively little studies were devoted to the relationship of polymorphisms in the gene encoding BTLA with susceptibility to autoimmune disease and cancer. Moreover, all published works were done in Asian populations. BTLA gene is located on chromosome 3 in q13.2 and consists of five exons. The aim of this study was to investigate the alleles, genotypes and haplotypes frequency of selected BTLA gene polymorphisms in Caucasian population originating from Poland. For this study, the single-nucleotide polymorphisms (SNPs) were chosen on the basis of literature data. Additionally, the tag dSNP under linkage equilibrium r (2) > 0.8 and available at the National Center for Biotechnology Information (NCBI) for Caucasian population of rare alleles at a frequency greater than 5 % have been chosen using the NCBI database. The ten BTLA SNPs investigated were: rs1844089, rs2705535, rs9288952, rs9288953, rs1982809, rs2633580, rs2705511, rs2705565, rs76844316, rs16859633. For all SNPs selected on the basis of literature data the significantly different distributions of genotypes between Asian and Caucasian populations were observed.

  8. A Case-Control Study of the Relationship Between SLC22A3-LPAL2-LPA Gene Cluster Polymorphism and Coronary Artery Disease in the Han Chinese Population

    PubMed Central

    Song, Zi-Kai; Cao, Hong-Yan; Wu, Hai-Di; Zhou, Li-Ting; Qin, Ling

    2016-01-01

    Background Mutations in the solute carrier family 22 member 3 (SLC22A3), lipoprotein (a)-like 2 (LPAL2), and the lipoprotein (a) (LPA) gene cluster, which encodes apolipoprotein (a) [apo (a)] of the lipoprotein (a) [Lp (a)] lipoprotein particle, have been suggested to contribute to the risk of coronary artery disease (CAD), but the precise variants of this gene cluster have not yet been identified in Chinese populations. Objectives We sought to investigate the association between SLC22A3-LPAL2-LPA gene cluster polymorphisms and the risk of CAD in the Han Chinese population. Patients and Methods We recruited 551 CAD patients and 544 healthy controls for this case-control study. Four SNPs (rs9346816, rs2221750, rs3127596, and rs9364559) were genotyped in real time using the MassARRAY system (Sequenom; USA) in the SLC22A3-LPAL2-LPA gene cluster. All subjects were Chinese and of Han descent, and were recruited from the First Hospital of Jilin University based on convenience sampling from June 2009 to September 2012. Results The frequency of the minor allele G (34.8%) in rs9364559 was significantly higher in the CAD patients than in the healthy controls (29.4%) (P = 0.006). There was genotypic association between rs9364559 and CAD (P = 0.022), and these results still remained significant after adjustment for the conventional CAD risk factors through forward logistic regression analysis (P = 0.020, P = 0.019). Haplotype analyses from different blocks indicated that 11 haplotypes were associated with the risk of CAD. Seven haplotypes were associated with a reduced risk of CAD, whereas four haplotypes were associated with an increased risk of CAD. Conclusions Rs9364559 in the LPA gene may contribute to the risk of CAD in the Han Chinese population; haplotypes which contain rs9346816-G were all associated with an increased risk of CAD in this study. PMID:27621937

  9. Short communication: casein haplotype variability in sicilian dairy goat breeds.

    PubMed

    Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B

    2008-09-01

    In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency >or=0.01 and 27 with frequency >or=0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana and Messinese was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria, and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.

  10. Rainfall-driven sex-ratio genes in African buffalo suggested by correlations between Y-chromosomal haplotype frequencies and foetal sex ratio

    PubMed Central

    2010-01-01

    Background The Y-chromosomal diversity in the African buffalo (Syncerus caffer) population of Kruger National Park (KNP) is characterized by rainfall-driven haplotype frequency shifts between year cohorts. Stable Y-chromosomal polymorphism is difficult to reconcile with haplotype frequency variations without assuming frequency-dependent selection or specific interactions in the population dynamics of X- and Y-chromosomal genes, since otherwise the fittest haplotype would inevitably sweep to fixation. Stable Y-chromosomal polymorphism due one of these factors only seems possible when there are Y-chromosomal distorters of an equal sex ratio, which act by negatively affecting X-gametes, or Y-chromosomal suppressors of a female-biased sex ratio. These sex-ratio (SR) genes modify (suppress) gamete transmission in their own favour at a fitness cost, allowing for stable polymorphism. Results Here we show temporal correlations between Y-chromosomal haplotype frequencies and foetal sex ratios in the KNP buffalo population, suggesting SR genes. Frequencies varied by a factor of five; too high to be alternatively explained by Y-chromosomal effects on pregnancy loss. Sex ratios were male-biased during wet and female-biased during dry periods (male proportion: 0.47-0.53), seasonally and annually. Both wet and dry periods were associated with a specific haplotype indicating a SR distorter and SR suppressor, respectively. Conclusions The distinctive properties suggested for explaining Y-chromosomal polymorphism in African buffalo may not be restricted to this species alone. SR genes may play a broader and largely overlooked role in mammalian sex-ratio variation. PMID:20416038

  11. Chronic inflammatory state in sickle cell anemia patients is associated with HBB(*)S haplotype.

    PubMed

    Bandeira, Izabel C J; Rocha, Lillianne B S; Barbosa, Maritza C; Elias, Darcielle B D; Querioz, José A N; Freitas, Max Vitor Carioca; Gonçalves, Romélia P

    2014-02-01

    The chronic inflammatory state in sickle cell anemia (SCA) is associated with several factors such as the following: endothelial damage; increased production of reactive oxygen species; hemolysis; increased expression of adhesion molecules by leukocytes, erythrocytes, and platelets; and increased production of proinflammatory cytokines. Genetic characteristics affecting the clinical severity of SCA include variations in the hemoglobin F (HbF) level, coexistence of alpha-thalassemia, and the haplotype associated with the HbS gene. The different haplotypes of SCA are Bantu, Benin, Senegal, Cameroon, and Arab-Indian. These haplotypes are associated with ethnic groups and also based on the geographical origin. Studies have shown that the Bantu haplotype is associated with higher incidence of clinical complications than the other haplotypes and is therefore considered to have the worst prognosis. This study aimed to evaluate the profile of the proinflammatory cytokines interleukin-6, tumor necrosis factor-α, and interleukin-17 in patients with SCA and also to assess the haplotypes associated with beta globin cluster S (HBB(*)S). We analyzed a total of 62 patients who had SCA and had been treated with hydroxyurea; they had received a dose ranging between 15 and 25 (20.0±0.6)mg/kg/day for 6-60 (18±3.4)months; their data were compared with those for 30 normal individuals. The presence of HbS was detected and the haplotypes of the beta S gene cluster were analyzed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). Our study demonstrated that SCA patients have increased inflammatory profile when compared to the healthy individuals. Further, analysis of the association between the haplotypes and inflammatory profile showed that the levels of IL-6 and TNF-α were greater in subjects with the Bantu/Bantu haplotype than in subjects with the Benin/Benin haplotype. The Bantu/Benin haplotype individuals had lower levels of cytokines than those with

  12. Haplotypes of beta S chromosomes among patients with sickle cell anemia from Georgia.

    PubMed

    Hattori, Y; Kutlar, F; Kutlar, A; McKie, V C; Huisman, T H

    1986-01-01

    Fetal hemoglobin and G gamma levels have been correlated with the presence or absence of eight restriction sites within the beta globin gene cluster (haplotypes) for numerous sickle cell anemia patients from Georgia. The most common haplotypes were #19 (Benin) and #20 (CAR); all patients with haplotype combinations 19/19, 20/20, and 19/20 were severely affected with low Hb F and low G gamma levels. A modified #19 beta S chromosome with a -G gamma-G gamma- globin gene arrangement, instead of -G gamma-A gamma-, was present in SS and SC newborn babies with G gamma values above 80%. Haplotype #3 (Senegal) was present among 15% of the beta S chromosomes; the two adult patients with the 3/3 combination were mildly affected with high Hb F and G gamma values. The haplotype AT with the variant A gamma T chain was a rarity. A new haplotype was found in one 17-year-old SS patient and five of his Hb S heterozygous relatives. This haplotype is associated with an increased production of Hb F in heterozygous and homozygous Hb S individuals; this Hb F contained primarily A gamma chains. A comparison was made between the different haplotypes among SS patients and normal Black individuals, and a remarkable similarity was noted in the fetal hemoglobin data for subjects with these different chromosomes.

  13. KIR haplotypes defined by segregation analysis in 59 Centre d'Etude Polymorphisme Humain (CEPH) families.

    PubMed

    Martin, M P; Single, R M; Wilson, M J; Trowsdale, J; Carrington, M

    2008-12-01

    The killer cell immunoglobulin-like receptor (KIR) gene cluster exhibits extensive allelic and haplotypic diversity. Variation at the locus is associated with an increasing number of human diseases, reminiscent of the HLA loci. Characterization of diversity at the KIR locus has progressed over the past several years, particularly since the sequence of entire KIR haplotypes have become available. To determine the extent of KIR haplotypic variability among individuals of northern European descent, we genotyped 59 CEPH families for presence/absence of all KIR genes and performed limited allelic subtyping at several KIR loci. A total of 20 unique haplotypes differing in gene content were identified, the most common of which was the previously defined A haplotype (f = 0.52). Several unusual haplotypes that probably arose as a consequence of unequal crossing over events were also identified. Linkage disequilibrium (LD) analysis indicated strong negative and positive LD between several pairs of genes, values that may be useful in determining haplotypic structure when family data are not available. These data provide a resource to aid in the interpretation of disease association data involving individuals of European descent.

  14. Evolution of Hox gene clusters in deuterostomes

    PubMed Central

    2013-01-01

    Hox genes, with their similar roles in animals as evolutionarily distant as humans and flies, have fascinated biologists since their discovery nearly 30 years ago. During the last two decades, reports on Hox genes from a still growing number of eumetazoan species have increased our knowledge on the Hox gene contents of a wide range of animal groups. In this review, we summarize the current Hox inventory among deuterostomes, not only in the well-known teleosts and tetrapods, but also in the earlier vertebrate and invertebrate groups. We draw an updated picture of the ancestral repertoires of the different lineages, a sort of “genome Hox bar-code” for most clades. This scenario allows us to infer differential gene or cluster losses and gains that occurred during deuterostome evolution, which might be causally linked to the morphological changes that led to these widely diverse animal taxa. Finally, we focus on the challenging family of posterior Hox genes, which probably originated through independent tandem duplication events at the origin of each of the ambulacrarian, cephalochordate and vertebrate/urochordate lineages. PMID:23819519

  15. Penicillium roqueforti PR toxin gene cluster characterization.

    PubMed

    Hidalgo, Pedro I; Poirier, Elisabeth; Ullán, Ricardo V; Piqueras, Justine; Meslet-Cladière, Laurence; Coton, Emmanuel; Coton, Monika

    2017-03-01

    PR toxin is a well-known isoprenoid mycotoxin almost solely produced by Penicillium roqueforti after growth on food or animal feed. This mycotoxin has been described as the most toxic produced by this species. In this study, an in silico analysis allowed identifying for the first time a 22.4-kb biosynthetic gene cluster involved in PR toxin biosynthesis in P. roqueforti. The pathway contains 11 open reading frames encoding for ten putative proteins including the major fungal terpene cyclase, aristolochene synthase, involved in the first farnesyl-diphosphate cyclization step as well as an oxidoreductase, an oxidase, two P450 monooxygenases, a transferase, and two dehydrogenase enzymes. Gene silencing was used to study three genes (ORF5, ORF6, and ORF8 encoding for an acetyltransferase and two P450 monooxygenases, respectively) and resulted in 20 to 40% PR toxin production reductions in all transformants proving the involvement of these genes and the corresponding enzyme activities in PR toxin biosynthesis. According to the considered silenced gene target, eremofortin A and B productions were also affected suggesting their involvement as biosynthetic intermediates in this pathway. A PR toxin biosynthesis pathway is proposed based on the most recent and available data.

  16. Genetic variation in the toll-like receptor gene cluster (TLR10-TLR1-TLR6) and prostate cancer risk.

    PubMed

    Stevens, Victoria L; Hsing, Ann W; Talbot, Jeffrey T; Zheng, Siqun Lilly; Sun, Jielin; Chen, Jinbo; Thun, Michael J; Xu, Jianfeng; Calle, Eugenia E; Rodriguez, Carmen

    2008-12-01

    Toll-like receptors (TLRs) are key players in the innate immune system and initiate the inflammatory response to foreign pathogens such as bacteria, fungi and viruses. The proposed role of chronic inflammation in prostate carcinogenesis has prompted investigation into the association of common genetic variation in TLRs with the risk of this cancer. We investigated the role of common SNPs in a gene cluster encoding the TLR10, TLR6 and TLR1 proteins in prostate cancer etiology among 1,414 cancer cases and 1,414 matched controls from the Cancer Prevention Study II Nutrition Cohort. Twenty-eight SNPs, which included the majority of the common nonsynonymous SNPs in the 54-kb gene region and haplotype-tagging SNPs that defined 5 specific haplotype blocks, were genotyped and their association with prostate cancer risk determined. Two SNPs in TLR10 [I369L (rs11096955) and N241H (rs11096957)] and 4 SNPs in TLR1 [N248S (rs4833095), S26L (rs5743596), rs5743595 and rs5743551] were associated with a statistically significant reduced risk of prostate cancer of 29-38% (for the homozygous variant genotype). The association of these SNPs was similar when the analysis was limited to cases with advanced prostate cancer. Haplotype analysis and linkage disequilibrium findings revealed that the 6 associated SNPs were not independent and represent a single association with reduced prostate cancer risk (OR = 0.55, 95% CI: 0.33, 0.90). Our study suggest that a common haplotype in the TLR10-TLR1-TLR6 gene cluster influences prostate cancer risk and clearly supports the need for further investigation of TLR genes in other populations.

  17. Presence of a TA haplotype in the APC gene containing the common 1822 polymorphism and colorectal adenoma.

    PubMed

    Egan, Jan B; Jacobs, Elizabeth T; Martínez, María Elena; Gerner, Eugene W; Jurutka, Peter W; Thompson, Patricia A

    2008-07-15

    Acquired or inherited mutations in the adenomatous polyposis coli (APC) tumor suppressor gene are causally linked to colorectal cancer. Given the significance of APC in colorectal cancer, we investigated the association between common single-nucleotide polymorphisms (SNP) in the APC gene and the odds of developing metachronous colorectal adenomas as a surrogate measure of colorectal cancer risk. Coding SNPs at codons 486, 1678, 1822, 1960, and 2502 were analyzed in a total of 1,399 subjects who participated in two randomized clinical trials for the prevention of colorectal adenomas. No association was found for any single SNP and the odds of metachronous adenoma. In contrast, a TA haplotype (codons 486 and 1822) was associated with a statistically significant 27% and 26% reduction in the odds of any and nonadvanced metachronous adenoma after adjustment for baseline adenoma characteristics [odds ratio (OR), 0.73; 95% confidence interval (95% CI), 0.59-0.91 and OR, 0.74; 95% CI, 0.57-0.94], respectively. No significant reduction in odds was observed for advanced metachronous lesions. Diplotype analysis revealed a strong gene dose effect with carriers of two alleles containing TT-AA (codons 486 and 1822, respectively) having an 89% lower odds for advanced metachronous adenomas (OR, 0.11; 95% CI, 0.01-0.80) when compared with the common CC-AA diplotype (codons 486 and 1822, respectively). Our findings support an important role for germ-line allele sequence in the APC gene and individual risk of metachronous adenomatous polyps.

  18. [Spatial Distribution of Intron 2 of nad1 Gene Haplotypes in Populations of Norway and Siberian Spruce (Picea abies-P. obovata) Species Complex].

    PubMed

    Mudrik, E A; Polyakova, T A; Shatokhina, A V; Bondarenko, G N; Politov, D V

    2015-10-01

    The length and sequence variations among intron 2 haplotypes of the mitochondrial DNA nad1 gene have been studied in the Norway and Siberian spruce (Picea abies (L.) H. Karst.-P. obovata Ledeb.) species complex. Twenty-two native populations and 15 provenances were analyzed. The distribution of the northern European haplogroup (haplotypes 721, 755, 789, 823, 857, 891, and 925) is delimited in the west by the Ural region inclusively. Haplotype 712 is widespread in populations of Siberia, in the Far East and in northeastern Russia. A novel variant of the Siberian haplogroup (780) containing three copies of the first minisatellite motif (34 bp) was found for the first time. The absence of an admixture of the northern European and Siberian haplotypes in the zone of spruce species introgression previously marked by morphological traits and nuclear allozyme loci was demonstrated. This may be evidence of the existence of a sharper geographic boundary between the two haplogroups, as compared to a boundary based on phenotypic and allozyme data. A high proportion of the interpopulation component of variation (65%) estimated by AMOVA indicates a substantial genetic subdivision of European and Siberian populations of the Palearctic spruce complex by mtDNA, which can be putatively explained by natural barriers to gene flow with seeds related, for instance, to the woodless regions of the western Siberian Plain in the Pleistocene and the probable floodplains of large rivers.

  19. Toward localization of the Werner syndrome gene by linkage disequilibrium and ancestral haplotyping: Lessons learned from analysis of 35 chromosome 8p11.1-21.1 markers

    SciTech Connect

    Goddard, K.A.B.; Wijsman, E.M.; Martin, G.M.

    1996-06-01

    Werner syndrome (WS) is an autosomal recessive disorder characterized by premature onset of a number of age-related diseases. The gene for WS, WRN, has been mapped to the 8p11.1-21.1 region with further localization through linkage disequilibrium mapping. Here we present the results of linkage disequilibrium and ancestral haplotype analyses of 35 markers to further refine the location of WRN. We identified an interval in this region in which 14 of 18 markers tested show significant evidence of linkage disequilibrium in at least one of the two populations tested. Analysis of extended and partial haplotypes covering 21 of the markers studied supports the existence of both obligate and probable ancestral recombinant events which localize WRN almost certainly to the interval between DSS2196 and D8S2186, and most likely to the narrower interval between D8S2168 and D8S2186. These haplotype analyses also suggest that there are multiple WRN mutations in each of the two populations under study. We also present a comparison of approaches to performing disequilibrium tests with multiallelic markers, and show that some commonly used approximations for such tests perform poorly in comparison to exact probability tests. Finally, we discuss some of the difficulties introduced by the high mutation rate at microsatellite markers which influence our ability to use ancestral haplotype analysis to localize disease genes. 51 refs., 6 figs., 7 tabs.

  20. Linkage disequilibrium analysis reveals an albuminuria risk haplotype containing three missense mutations in the cubilin gene with striking differences among European and African ancestry populations

    PubMed Central

    2012-01-01

    Background A recent meta-analysis described a variant (p.Ile2984Val) in the cubilin gene (CUBN) that is associated with levels of albuminuria in the general population and in diabetics. Methods We implemented a Linkage Disequilibrium (LD) search with data from the 1000 Genomes Project, on African and European population genomic sequences. Results We found that the p.Ile2984Val variation is part of a larger haplotype in European populations and it is almost absent in west Africans. This haplotype contains 19 single nucleotide polymorphisms (SNPs) in very high LD, three of which are missense mutations (p.Leu2153Phe, p.Ile2984Val, p.Glu3002Gly), and two have not been previously reported. Notably, this European haplotype is absent in west African populations, and the frequency of each individual polymorphism differs significantly in Africans. Conclusions Genotyping of these variants in existing African origin sample sets coupled to measurements of urine albumin excretion levels should reveal which is the most likely functional candidate for albuminuria risk. The unique haplotypic structure of CUBN in different populations may leverage the effort to identify the functional variant and to shed light on evolution of the CUBN gene locus. PMID:23114252

  1. A Haplotype of Two Novel Polymorphisms in δ-Sarcoglycan Gene Increases Risk of Dilated Cardiomyopathy in Mongoloid Population

    PubMed Central

    Wang, Hong; Wei, Sisi; Chen, Dan; Ying, Li; Zhou, Qing; Li, Gang; Li, Joyce; Gao, Jimin; Kato, Naoya; Hu, Wei; Li, Yigang; Wang, Yuepeng

    2015-01-01

    The role of genetic abnormality of δ-sarcoglycan (δ-SG) gene in dilated (DCM) and hypertrophied (HCM) cardiomyopathy patients is still unfolding. In this study we first defined the promoter region and then searched for polymorphisms/mutations among the promoter, 5'-untranslated region, and the encoding exons in δ-SG gene in 104 Chinese patients with DCM, 145 with HCM, and 790 normal controls. Two novel polymorphisms were found, an 11 base-pair (bp) deletion (c.-100~-110; -) in the promoter region and a missense polymorphism of A848G resulting in p.Q283R in the highly conserved C-terminus. The prevalence of homozygous genotype -/- of c.-100~-110 was slightly higher in DCM (14.42%) and HCM patients (14.48%), as compared with normal controls (11.01%). The prevalence of genotype of 848A/G was significantly higher in DCM (6.73%; OR = 9.43; p = 0.0002), but not in HCM patients (1.38%; OR = 1.37; p = 0.62), as compared with controls (0.76%). Haplotype -_G consisting c.-100~-110 and A848G was associated with increased risk of DCM (OR = 17.27; 95%CI = 3.19–93.56; p = 0.001) but not associated with HCM (OR = 1.90; 95%CI = 0.38–9.55; p = 0.44). Co-occurrence of the genotypes -/- of c.-100~-110 and 848A/G was found in 5 patients with DCM (4.81%; OR = 39.85; p = 0.0001), none of HCM patients, and only 1 of the controls (0.13%). Both polymorphisms were also found in the Japanese population, but not in the Africans and Caucasians. C.-100~-110 resulted in a decrease of δ-SG promoter activity to 64±3% of the control level (p<0.01). Both co-immunoprecipitation and in vitro protein pull-down assays demonstrated that δ-SG-283R interacts normally to β- and γ-SG, but significantly decreased localization of β/δ/γ-SG on the plasma membrane. In conclusion, haplotype -_G composed of c.-100~-110 and A848G confers higher susceptibility to DCM in the Mongoloid population. PMID:26720722

  2. CYP1A1 genetic polymorphism and polycyclic aromatic hydrocarbons on pulmonary function in the elderly: haplotype-based approach for gene-environment interaction.

    PubMed

    Choi, Yoon-Hyeong; Kim, Jin Hee; Hong, Yun-Chul

    2013-08-29

    Lung function may be impaired by environmental pollutants not only acting alone, but working with genetic factors as well. Few epidemiologic studies have been conducted to explore the interplay of polycyclic aromatic hydrocarbons (PAHs) exposure and genetic polymorphism on lung function in the elderly. For genetic polymorphism, haplotype is considered a more informative unit than single nucleotide polymorphism markers. Therefore, we examined the role of haplotype based-CYP1A1 polymorphism in the effect of PAHs exposure on lung function in 422 participants from a community-based panel of elderly adults in Seoul, Korea. Linear mixed effect models were fit to evaluate the association of PAH exposure markers (urinary 1-hydroxypyrene and 2-naphthol) with FVC, FEV₁, FEV₁/FVC, and FEF₂₅₋₇₅, and then the interaction with CYP1A1 haplotype constructed from three single nucleotide polymorphisms of the gene (rs4646421/rs4646422/rs1048943). Urinary 1-hydroxypyrene levels were inversely associated with FEV₁/FVC (p<0.05), whereas urinary 2-naphthol levels failed to show associations with lung function. Urinary 1-hydroxypyrene was significantly associated with decrease in FEV₁/FVC among participants with rs4646421 variants (CT+TT), rs4646422 wild-type (GG), and rs1048943 wild-type (AA). At least one TGA haplotype predicted a 0.88% (95% confidence interval, 0.31-1.45%) reduction in FEV₁/FVC with an interquartile range increase in 1-hydroxypyrene, whereas no relationship was observed in participants without TGA haplotype (p for interaction=0.045). Similar patterns were also observed in FEF₂₅₋₇₅. We did not find any main effects of CYP1A1 genetic polymorphisms on lung functions. Our findings suggest that PAH exposure producing 1-hydroxypyrene as a metabolite compromises lung function in the elderly, and that haplotype-based CYP1A1 polymorphism modifies the risk.

  3. Haplotypes of the MTHFR gene are associated with an increased risk of breast cancer in a Han Chinese population in Gansu province.

    PubMed

    Song, Ailing; Zhao, Lei; Li, Yumin; Wu, Li; Li, Yu; Liu, Xiaokang; Lan, Shen

    2016-07-01

    Elevated homocysteine levels are a risk factor for breast cancer, although the mechanism underlying this effect is unknown. Genome-wide association studies were used to systematically identify genetic variants which were significantly associated with the circulating homocysteine concentration. To examine the role of homocysteine-related variants in the occurrence of breast cancer, we investigated the association between these variants and breast cancer in a Han Chinese population. Five variants of genome-wide significant homocysteine-related genes were selected for the analysis in a case-control study, with a total of 487 patients with breast cancer and 605 controls. We found that none of the studied polymorphisms were related to the altered breast cancer risk. In the haplotypic analysis, the 5,10-methylenetetrahydrofolate reductase (MTHFR) haplotypes rs12085006A/rs1999594G/rs1801133C (OR = 3.44, 95% CI = 1.58-7.50, P = 0.0019) and rs12085006A/rs1999594G/rs1801133T (OR = 16.21, 95% CI = 2.19- 120.32, P = 0.0065) were significantly associated with an increased breast cancer risk when compared with the wild-type haplotype. Both of the risky MTHFR haplotypes were correlated with decreased MTHFR gene expression and elevated homocysteine concentrations, indicating a genetic component for hyperhomocysteinemia. The MTHFR haplotypes reconstructed with homocysteine-related variants were associated with the occurrence of breast cancer. This finding further emphasizes the importance of homocysteine metabolism genes in breast carcinogenesis and highlights the interplay of diet, genetics, and human cancers. © 2016 IUBMB Life, 68(7):526-534, 2016.

  4. Polymorphisms in the LPL and CETP Genes and Haplotype in the ESR1 Gene Are Associated with Metabolic Syndrome in Women from Southwestern Mexico

    PubMed Central

    Cahua-Pablo, José Ángel; Cruz, Miguel; Méndez-Palacios, Abigail; Antúnez-Ortiz, Diana Lizzete; Vences-Velázquez, Amalia; del Carmen Alarcón-Romero, Luz; Parra, Esteban Juan; Tello-Flores, Vianet Argelia; Leyva-Vázquez, Marco Antonio; Valladares-Salgado, Adán; Pérez-Macedonio, Claudia Paola; Flores-Alfaro, Eugenia

    2015-01-01

    Metabolic syndrome (MetS) is a combination of metabolic disorders associated with an increased risk for cardiovascular disease (CVD). Studies in women reported associations between polymorphisms in ESR1, LPL and CETP genes and MetS. Our aim was to evaluate the association between variants in ESR1, LPL and CETP genes with MetS and its components. Four hundred and eighty women were analyzed, anthropometric features and biochemical profiles were evaluated, and genotyping was performed by real-time PCR. We found an association with elevated glucose levels (odds ratio (OR) = 2.9; p = 0.013) in carrying the AA genotype of rs1884051 in the ESR1 gene compared with the GG genotype, and the CC genotype of rs328 in the LPL gene was associated with MetS compared to the CG or GG genotype (OR = 2.8; p = 0.04). Moreover, the GA genotype of rs708272 in the CETP gene is associated with MetS compared to the GG or AA genotype (OR = 1.8; p = 0.006). In addition the ACTCCG haplotype in the ESR1 gene is associated with a decrease in the risk of MetS (OR = 0.02; p < 0.001). In conclusion, our results show the involvement of the variants of ESR1, LPL and CETP genes in metabolic events related to MetS or some of its features. PMID:26370976

  5. An association analysis of Alzheimer disease candidate genes detects an ancestral risk haplotype clade in ACE and putative multilocus association between ACE, A2M, and LRRTM3

    PubMed Central

    Edwards, Todd L.; Pericak-Vance, Margaret; Gilbert, Johnny; Haines, Jonathan L.; Martin, Eden; Ritchie, Marylyn D.

    2009-01-01

    Alzheimer’s disease (AD) is the most common form of progressive dementia in the elderly. It is a neurodegenerative disorder characterized by the neuropathologic findings of intracellular neurofibrillary tangles and extracellular amyloid plaques that accumulate in vulnerable brain regions. AD etiology has been studied by many groups, but since the discovery of the APOE ε4 allele, no further genes have been mapped conclusively to the late-onset form of the disease. In this study, we examined genetic association with late-onset Alzheimer’s susceptibility in 738 Caucasian families with 4704 individuals and an independent case-control dataset with 296 unrelated cases and 566 unrelated controls exploring 11 candidate genes with 47 SNPs common to both samples. In addition to tests for main effects and haplotype analyses, the Multifactor Dimensionality Reduction Pedigree Disequilibrium Test (MDR-PDT) was used to search for single-locus effects as well as 2-locus and 3-locus gene-gene interactions associated with AD in the family data. We observed significant haplotype effects in ACE in both family and case-control samples using standard and cladistic haplotype models. ACE was also part of significant 2-locus and 3-locus MDR-PDT joint effects models with Alpha-2-Macroglobulin (A2M), which mediates the clearance of Aβ, and Leucine-Rich Repeat Transmembrane 3 (LRRTM3), a nested gene in Alpha-3 Catenin (CTNNA3) which binds Presenilin 1. This result did not replicate in the case-control sample, and may not be a true positive. These genes are related to amyloid beta clearance; thus this constellation of effects might constitute an axis of susceptibility for late-onset AD. The consistent ACE haplotype result between independent data sets of families and unrelated cases and controls is strong evidence in favor of ACE as a susceptibility locus for AD, and replicates results from several other studies in a very large sample. PMID:19105203

  6. A New SNP Haplotype associated with blue disease resistance gene in cotton (Gossypium hirsutum L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Resistance to cotton blue disease (CBD) was evaluated in 364 F2.3 families of 3 populations derived from resistant variety ‘Delta Opal’. The CBD resistance in ‘Delta Opal’ was controlled by one single dominant gene designated Cbd. Two simple sequence repeat (SSR) markers were identified as linked t...

  7. Haplotypes in the promoter region of the CIDEC gene associated with growth traits in Nanyang cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cell death-inducing DFFA-like effector c (CIDEC, also known as Fsp27) has emerged as an important regulator of metabolism associated with lipodystrophy, diabetes, and hepatic steatosis. It is required for unilocular lipid droplet formation and optimal energy storage. The mechanism between this gene ...

  8. Direct analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypes.

    PubMed

    Morris, Andrew P

    2005-09-01

    We describe a novel method for assessing the strength of disease association with single nucleotide polymorphisms (SNPs) in a candidate gene or small candidate region, and for estimating the corresponding haplotype relative risks of disease, using unphased genotype data directly. We begin by estimating the relative frequencies of haplotypes consistent with observed SNP genotypes. Under the Bayesian partition model, we specify cluster centres from this set of consistent SNP haplotypes. The remaining haplotypes are then assigned to the cluster with the "nearest" centre, where distance is defined in terms of SNP allele matches. Within a logistic regression modelling framework, each haplotype within a cluster is assigned the same disease risk, reducing the number of parameters required. Uncertainty in phase assignment is addressed by considering all possible haplotype configurations consistent with each unphased genotype, weighted in the logistic regression likelihood by their probabilities, calculated according to the estimated relative haplotype frequencies. We develop a Markov chain Monte Carlo algorithm to sample over the space of haplotype clusters and corresponding disease risks, allowing for covariates that might include environmental risk factors or polygenic effects. Application of the algorithm to SNP genotype data in an 890-kb region flanking the CYP2D6 gene illustrates that we can identify clusters of haplotypes with similar risk of poor drug metaboliser (PDM) phenotype, and can distinguish PDM cases carrying different high-risk variants. Further, the results of a detailed simulation study suggest that we can identify positive evidence of association for moderate relative disease risks with a sample of 1,000 cases and 1,000 controls.

  9. Statistical performance of cladistic strategies for haplotype grouping in pharmacogenetics.

    PubMed

    Lunceford, Jared K; Liu, Nancy

    2008-12-10

    Haplotypes comprising multiple single nucleotide polymorphisms (SNPs) are popular covariates for capturing the key genetic variation present over a region of interest in the DNA sequence. Although haplotypes can provide a clearer assessment of genetic variation in a region than their component SNPs considered individually, the multi-allelic nature of haplotypes increases the complexity of the statistical models intended to discover association with outcomes of interest. Cladistic methods cluster haplotypes according to the estimates of their genealogical closeness and have been proposed recently as strategies for reducing model complexity and increasing power. Two examples are methods based on a haplotype nesting algorithm described by Templeton et al. (Genetics 1987; 117:343-351) and hierarchical clustering of haplotypes as described by Durrant et al. (Am. J. Hum. Genet. 2004; 75:35-43). In the context of assessing the pharmacogenetic effects of candidate genes, for which high-density SNP data have been gathered, we have conducted a simulation-based case study of the testing and estimation properties of two strategies based on Templeton's algorithm (TA), one being that described by Seltman et al. (Am. J. Hum. Genet. 2001; 68:1250-1263; Genet. Epidemiol. 2003; 25:48-58), as well as the method of Durrant et al. using data from a diabetes clinical trial. Even after adjusting for multiplicity, improvements in power can be realized using cladistic approaches with treatment group sizes in the range expected for standard trials, although these gains may be sensitive to the cladistic structure used. Differences in the relative performance of the cladistic approaches examined were observed with the clustering approach of Durrant et al. showing statistical properties superior to the methods based on TA.

  10. Haplotype structure enables prioritization of common markers and candidate genes in autism spectrum disorder

    PubMed Central

    Vardarajan, B N; Eran, A; Jung, J-Y; Kunkel, L M; Wall, D P

    2013-01-01

    Autism spectrum disorder (ASD) is a neurodevelopmental condition that results in behavioral, social and communication impairments. ASD has a substantial genetic component, with 88–95% trait concordance among monozygotic twins. Efforts to elucidate the causes of ASD have uncovered hundreds of susceptibility loci and candidate genes. However, owing to its polygenic nature and clinical heterogeneity, only a few of these markers represent clear targets for further analyses. In the present study, we used the linkage structure associated with published genetic markers of ASD to simultaneously improve candidate gene detection while providing a means of prioritizing markers of common genetic variation in ASD. We first mined the literature for linkage and association studies of single-nucleotide polymorphisms, copy-number variations and multi-allelic markers in Autism Genetic Resource Exchange (AGRE) families. From markers that reached genome-wide significance, we calculated male-specific genetic distances, in light of the observed strong male bias in ASD. Four of 67 autism-implicated regions, 3p26.1, 3p26.3, 3q25-27 and 5p15, were enriched with differentially expressed genes in blood and brain from individuals with ASD. Of 30 genes differentially expressed across multiple expression data sets, 21 were within 10 cM of an autism-implicated locus. Among them, CNTN4, CADPS2, SUMF1, SLC9A9, NTRK3 have been previously implicated in autism, whereas others have been implicated in neurological disorders comorbid with ASD. This work leverages the rich multimodal genomic information collected on AGRE families to present an efficient integrative strategy for prioritizing autism candidates and improving our understanding of the relationships among the vast collection of past genetic studies. PMID:23715297

  11. Inferring the Recent Duplication History of a Gene Cluster

    NASA Astrophysics Data System (ADS)

    Song, Giltae; Zhang, Louxin; Vinař, Tomáš; Miller, Webb

    Much important evolutionary activity occurs in gene clusters, where a copy of a gene may be free to evolve new functions. Computational methods to extract evolutionary information from sequence data for such clusters are currently imperfect, in part because accurate sequence data are often lacking in these genomic regions, making the existing methods difficult to apply. We describe a new method for reconstructing the recent evolutionary history of gene clusters. The method’s performance is evaluated on simulated data and on actual human gene clusters.

  12. Haplotype Structure of FSHB, the Beta-Subunit Gene for Fertility-Associated Follicle-Stimulating Hormone: Possible Influence of Balancing Selection

    PubMed Central

    Grigorova, M; Rull, K; Laan, M

    2007-01-01

    Follicle-stimulating hormone (FSH) is essential for human reproduction. The unique functions of this hormone are provided by the FSH receptor-binding beta-subunit encoded by the FSHB gene. Resequencing and genotyping of FSHB in three European, two Asian and one African population, as well as in the great apes (chimpanzee, gorilla, orangutan), revealed low diversity and significant excess of polymorphisms with intermediate frequency alleles. Statistical tests for FSHB showed deviations from neutrality in all populations suggesting a possible effect of balancing selection. Two core haplotypes were identified (carried by 76-96.6% of each population's sample), the sequences of which are clearly separated from each other. As fertility most directly affects an organism's fitness, the carriers of these haplotypes have apparently had more success in human history to contribute to the next generation. There is a preliminary observation suggesting that the second most frequent FSHB haplotype may be associated with rapid conception success in females. Interestingly, the same haplotype is related to an ancestral FSHB variant shared with the ancestor of the great apes. The determination of the functional consequence of the two core FSHB variants may have implications for understanding and regulating human fertility, as well as in assisting infertility treatments. PMID:17227474

  13. Integrative variants, haplotypes and diplotypes of the CAPN3 and FRMD5 genes and several environmental exposures associate with serum lipid variables

    PubMed Central

    Guo, Tao; Yin, Rui-Xing; Pan, Ling; Yang, Shuo; Miao, Liu; Huang, Feng

    2017-01-01

    To determine whether the integrative variants, haplotypes and diplotypes of the calpain 3 (CAPN3) and the FERM domain containing 5 genes (FRMD5) and several environmental exposures are associated with an implication in lipid homeostasis, which are associated with cardiovascular risk. Genotyping of the CAPN3 rs4344713 and FRMD5 rs524908 was performed by Sanger sequencing in 1,640 subjects (Jing, 819 and Han, 821). Multivariate analyses of covariance models that adjusted by age, gender, body mass index (BMI), blood pressure and lifestyle (smoking and drinking), were constructed using variants, haplotypes and diplotypes of the CAPN3 rs4344713 and FRMD5 rs524908 as predictors and changes in lipid variables. Significant associations with low-density lipoprotein cholesterol and apolipoprotein (Apo) B were found. Linkage disequilibrium with each other showed the haplotype-phenotype associations with triglyceride and ApoA1. This study also suggested pleiotropic associations of the CAPN3-FRMD5 diplotypes with lipid variables. As potential confounders, diastolic blood pressure (DBP) and BMI were significantly associated with lipid variables. We conclude that integrative variants, haplotypes and diplotypes of the CAPN3 rs4344713 and FRMD5 rs524908, as well as DBP and BMI are associated with serum lipid variables in the Jing and Han populations. PMID:28332615

  14. Haplotypes in the CRP Gene Associated with Increased BMI and Levels of CRP in Subjects with Type 2 Diabetes or Obesity from Southwestern Mexico

    PubMed Central

    Martínez-Calleja, América; Quiróz-Vargas, Irma; Parra-Rojas, Isela; Muñoz-Valle, José Francisco; Leyva-Vázquez, Marco A.; Fernández-Tilapa, Gloria; Vences-Velázquez, Amalia; Cruz, Miguel; Salazar-Martínez, Eduardo; Flores-Alfaro, Eugenia

    2012-01-01

    Objective. We evaluated the association between four polymorphisms in the CRP gene with circulating levels of C-reactive protein (CRP), type 2 diabetes (T2D), obesity, and risk score of coronary heart disease. Methods. We studied 402 individuals and classified them into four groups: healthy, obese, T2D obese, and T2D without obesity, from Guerrero, Southwestern Mexico. Blood levels of CRP, glucose, cholesterol, triglycerides, and leukocytes were measured. Genotyping was performed by PCR/RFLP, and the risk score for coronary heart disease was determined by the Framingham's methodology. Results. The TT genotype of SNP rs1130864 was associated with increased body mass index and T2D patients with obesity. We found that the haplotype 2 (TGAG) was associated with increased levels of CRP (β = 0.3; 95%CI: 0.1, 0.5; P = 0.005) and haplotype 7 (TGGG) with higher body mass index (BMI) (β = 0.2; 95%CI: 0.1, 0.3; P < 0.001). The risk score for coronary heart disease was associated with increased levels of CRP, but not with any polymorphism or haplotype. Conclusions. The association between the TT genotype of SNP rs1130864 with obesity and the haplotype 7 with BMI may explain how obesity and genetic predisposition increase the risk of diseases such as T2D in the population of Southwestern Mexico. PMID:23049543

  15. Haplotype mapping of the bronchiolitis susceptibility locus near IL8.

    PubMed

    Hull, Jeremy; Rowlands, Kate; Lockhart, Elizabeth; Sharland, Mike; Moore, Catrin; Hanchard, Neil; Kwiatkowski, Dominic P

    2004-02-01

    Susceptibility to viral bronchiolitis, the commonest cause of infant admissions to hospital in the industrialised world, is associated with polymorphism at the IL8 locus. Here we map the genomic boundaries of the disease association by case-control analysis and TDT in 580 affected UK infants. Markers for association mapping were chosen after determining patterns of linkage disequilibrium across the surrounding region of chromosome 4q, a 550-kb segment containing nine genes, extending from AFP to PPBP. The region has three major clusters of high linkage disequilibrium and is notable for its low haplotypic diversity. We exclude adjacent chemokine genes as the cause of the association, and identify a disease-associated haplotype that spans a 250-kb region from AFM to IL8. In between these two genes there is only one structural feature of interest, a novel gene RASSF6, which is predicted to encode a Ras effector protein.

  16. Computing gene expression data with a knowledge-based gene clustering approach.

    PubMed

    Rosa, Bruce A; Oh, Sookyung; Montgomery, Beronda L; Chen, Jin; Qin, Wensheng

    2010-01-01

    Computational analysis methods for gene expression data gathered in microarray experiments can be used to identify the functions of previously unstudied genes. While obtaining the expression data is not a difficult task, interpreting and extracting the information from the datasets is challenging. In this study, a knowledge-based approach which identifies and saves important functional genes before filtering based on variability and fold change differences was utilized to study light regulation. Two clustering methods were used to cluster the filtered datasets, and clusters containing a key light regulatory gene were located. The common genes to both of these clusters were identified, and the genes in the common cluster were ranked based on their coexpression to the key gene. This process was repeated for 11 key genes in 3 treatment combinations. The initial filtering method reduced the dataset size from 22,814 probes to an average of 1134 genes, and the resulting common cluster lists contained an average of only 14 genes. These common cluster lists scored higher gene enrichment scores than two individual clustering methods. In addition, the filtering method increased the proportion of light responsive genes in the dataset from 1.8% to 15.2%, and the cluster lists increased this proportion to 18.4%. The relatively short length of these common cluster lists compared to gene groups generated through typical clustering methods or coexpression networks narrows the search for novel functional genes while increasing the likelihood that they are biologically relevant.

  17. Sequence Variants and Haplotype Analysis of Cat ERBB2 Gene: A Survey on Spontaneous Cat Mammary Neoplastic and Non-Neoplastic Lesions

    PubMed Central

    Santos, Sara; Bastos, Estela; Baptista, Cláudia S.; Sá, Daniela; Caloustian, Christophe; Guedes-Pinto, Henrique; Gärtner, Fátima; Gut, Ivo G.; Chaves, Raquel

    2012-01-01

    The human ERBB2 proto-oncogene is widely considered a key gene involved in human breast cancer onset and progression. Among spontaneous tumors, mammary tumors are the most frequent cause of cancer death in cats and second most frequent in humans. In fact, naturally occurring tumors in domestic animals, more particularly cat mammary tumors, have been proposed as a good model for human breast cancer, but critical genetic and molecular information is still scarce. The aims of this study include the analysis of the cat ERBB2 gene partial sequences (between exon 17 and 20) in order to characterize a normal and a mammary lesion heterogeneous populations. Cat genomic DNA was extracted from normal frozen samples (n = 16) and from frozen and formalin-fixed paraffin-embedded mammary lesion samples (n = 41). We amplified and sequenced two cat ERBB2 DNA fragments comprising exons 17 to 20. It was possible to identify five sequence variants and six haplotypes in the total population. Two sequence variants and two haplotypes show to be specific for cat mammary tumor samples. Bioinformatics analysis predicts that four of the sequence variants can produce alternative transcripts or activate cryptic splicing sites. Also, a possible association was identified between clinicopathological traits and the variant haplotypes. As far as we know, this is the first attempt to examine ERBB2 genetic variations in cat mammary genome and its possible association with the onset and progression of cat mammary tumors. The demonstration of a possible association between primary tumor size (one of the two most important prognostic factors) and the number of masses with the cat ERBB2 variant haplotypes reveal the importance of the analysis of this gene in veterinary medicine. PMID:22489125

  18. Sequence variants and haplotype analysis of cat ERBB2 gene: a survey on spontaneous cat mammary neoplastic and non-neoplastic lesions.

    PubMed

    Santos, Sara; Bastos, Estela; Baptista, Cláudia S; Sá, Daniela; Caloustian, Christophe; Guedes-Pinto, Henrique; Gärtner, Fátima; Gut, Ivo G; Chaves, Raquel

    2012-01-01

    The human ERBB2 proto-oncogene is widely considered a key gene involved in human breast cancer onset and progression. Among spontaneous tumors, mammary tumors are the most frequent cause of cancer death in cats and second most frequent in humans. In fact, naturally occurring tumors in domestic animals, more particularly cat mammary tumors, have been proposed as a good model for human breast cancer, but critical genetic and molecular information is still scarce. The aims of this study include the analysis of the cat ERBB2 gene partial sequences (between exon 17 and 20) in order to characterize a normal and a mammary lesion heterogeneous populations. Cat genomic DNA was extracted from normal frozen samples (n = 16) and from frozen and formalin-fixed paraffin-embedded mammary lesion samples (n = 41). We amplified and sequenced two cat ERBB2 DNA fragments comprising exons 17 to 20. It was possible to identify five sequence variants and six haplotypes in the total population. Two sequence variants and two haplotypes show to be specific for cat mammary tumor samples. Bioinformatics analysis predicts that four of the sequence variants can produce alternative transcripts or activate cryptic splicing sites. Also, a possible association was identified between clinicopathological traits and the variant haplotypes. As far as we know, this is the first attempt to examine ERBB2 genetic variations in cat mammary genome and its possible association with the onset and progression of cat mammary tumors. The demonstration of a possible association between primary tumor size (one of the two most important prognostic factors) and the number of masses with the cat ERBB2 variant haplotypes reveal the importance of the analysis of this gene in veterinary medicine.

  19. HLA class I (A, B) and II (DR, DQ) gene and haplotype frequencies in blood donors from Wales.

    PubMed

    Darke, C; Guttridge, M G; Thompson, J; McNamara, S; Street, J; Thomas, M

    1998-01-01

    Accurate estimates of HLA-A, B, DR and DQ phenotype, gene and haplotype frequencies (HF) in the normal population are of importance in, for example, disease susceptibility studies, platelet transfusion support and transplantation. HLA population genetics studies have been performed on numerous groups, however, no major studies have been carried out on the population of Wales. As part of the validation process for our routine HLA-A and B typing by PCR using sequence-specific primers (PCR-SSP) we examined 1,798 normal, unrelated Caucasoid blood donors living in Wales and recruited onto the Welsh Bone Marrow Donor Registry (WBMDR). Typing was performed by serology (HLA-A, B) and PCR-SSP at low resolution (HLA-A, B, DR, DQ) resulting in a particularly rigorous level of HLA specificity assignment. Four discrepancies were found between the HLA-A and B serological and PCR-SSP specificity assignments: (1) two instances of HLA-A2 by serology were undetected by PCR-SSP and were a new HLA-A2 allele - A*0224; (2) one example of HLA-B*15 by PCR-SSP failed to react by serology, and remained undetectable by serology in subsequent samples, and (3) one example of HLA-B45 by serology was identified as HLA-B*5002 by PCR-SSP. Hardy-Weinberg and homozygosity analysis showed that the goodness-of-fit was excellent (p > 0.05), for both phenotype distribution and the number of homozygotes identified, for all four loci. The phenotype and gene frequencies for the 18 HLA-A, 34 -B, 15 -DR and 8 -DQ specificities identified and two- and three-locus HF, linkage disequilibrium and related values for HLA-A/B, B/DR, DR/DQ and HLA-A/B/DR and B/DR/DQ were essentially typical of a northern European population. HLA-A2, B44, DR4 and DQ2 were the highest frequency phenotypes and HLA-A2403, A34, A74, B42, B75, B2708, B48, B67 and B703 occurred once only. There were no examples of: A36, A43, A69, A80, B46, B54, B59, B73, B76, B77, B7801, B8101 or DR18 specificities. DR17, DQ2 and A1, B8, DR17 were the

  20. A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The B4 resistance (R)-gene cluster, located in subtelomeric region of chromosome 4, is one of the largest clusters known in common bean (Phaseolus vulgaris, Pv). We sequenced 650 kb spanning this locus and annotated 97 genes, 26 of which correspond to Coiled-coil-Nucleotide-Binding-Site-Leucine-Rich...

  1. Detecting local haplotype sharing and haplotype association

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...

  2. Sequence variations in the 5' flanking and IVS-II regions of the G gamma- and A gamma-globin genes of beta S chromosomes with five different haplotypes.

    PubMed

    Lanclos, K D; Oner, C; Dimovski, A J; Gu, Y C; Huisman, T H

    1991-06-01

    We have amplified and sequenced the 5' flanking and the second intervening sequence (IVS-II) regions of both the G gamma- and A gamma-globin genes of the beta S chromosomes from sickle cell anemia (SS) patients with homozygosities for five different haplotypes. The sequencing data, compared with previously published sequences for the normal chromosomes A and B, show many similarities to chromosome B for haplotypes 19, 20, and 17, while haplotypes 3 and 31 are remarkably similar to chromosome A and also similar to each other. Several unique mutations were found in the 5' flanking regions (G gamma and A gamma) of haplotypes 19 and 20 and in the IVS-II segments of the same genes of haplotypes 19, 20, and 17; the IVS-II of haplotypes 3 and 31 were identical to those of chromosome A. Dot-blot analyses of amplified DNA from additional SS patients with specific probes have confirmed that these mutations are unique for each haplotype. The two general patterns that have been observed among the five haplotypes have most probably arisen by gene conversion events between the A and B type chromosomes in the African population. These patterns correlate with high and low fetal hemoglobin expression, and it is speculated that these and other yet unknown gene conversions may contribute to the variations in hemoglobin F and G gamma levels observed among SS patients. In vitro expression experiments involving the approximately 1.3-kb 5' flanking regions of the G gamma- and A gamma-globin genes of the beta S chromosomes with the five different haplotypes failed to detect differences between the levels of expression, suggesting that the sequence variations observed between these segments of DNA are not the primary cause of the differences in hemoglobin F levels among the SS patients.

  3. Efficient Computation of Approximate Gene Clusters Based on Reference Occurrences

    NASA Astrophysics Data System (ADS)

    Jahn, Katharina

    Whole genome comparison based on the analysis of gene cluster conservation has become a popular approach in comparative genomics. While gene order and gene content as a whole randomize over time, it is observed that certain groups of genes which are often functionally related remain co-located across species. However, the conservation is usually not perfect which turns the identification of these structures, often referred to as approximate gene clusters, into a challenging task. In this paper, we present a polynomial time algorithm that computes approximate gene clusters based on reference occurrences. We show that our approach yields highly comparable results to a more general approach and allows for approximate gene cluster detection in parameter ranges currently not feasible for non-reference based approaches.

  4. Association of XRCC1 and XRCC3 gene haplotypes with the development of radiation-induced fibrosis in patients with nasopharyngeal carcinoma.

    PubMed

    Cheuk, Isabella Wai Yin; Yip, Shea Ping; Kwong, Dora Lai Wan; Wu, Vincent Wing Cheung

    2014-07-01

    Radiation-induced fibrosis is one of the late complications of radiotherapy (RT) for nasopharyngeal carcinoma (NPC). The aim of this study was to investigate the association between X-ray repair cross-complementing protein 1 and 3 (XRCC1 and XRCC3, respectively) gene haplotypes and radiation-induced fibrosis in NPC patients. Genomic DNA was extracted from blood samples of 120 NPC patients previously treated with RT. In total, 12 tag single-nucleotide polymorphisms (SNPs) were selected from the XRCC1 and XRCC3 genes and were genotyped using restriction fragment length polymorphism analysis or unlabeled probe melting analysis. Single-marker and haplotype analyses were performed using multivariate logistic regression analysis. The functional variant rs861539 of XRCC3 may be associated with radiation-induced fibrosis [asymptotic P-value (Pasym)<0.05]. No significant association was observed between radiation-induced fibrosis and any of the tag SNPs of XRCC1 and XRCC3 in either single-marker or haplotype analysis after 10,000 permutations [empirical P-value (Pemp)>0.05]. Our preliminary results indicated that the rs861539 variant of XRCC3 may be associated with an increased risk of radiation-induced fibrosis; however, a large-scale study is required to confirm this result.

  5. Atypical beta(s) haplotypes are generated by diverse genetic mechanisms.

    PubMed

    Zago, M A; Silva, W A; Dalle, B; Gualandro, S; Hutz, M H; Lapoumeroulie, C; Tavella, M H; Araujo, A G; Krieger, J E; Elion, J; Krishnamoorthy, R

    2000-02-01

    The majority of the chromosomes with the beta(S) gene have one of the five common haplotypes, designated as Benin, Bantu, Senegal, Cameroon, and Arab-Indian haplotypes. However, in every large series of sickle cell patients, 5-10% of the chromosomes have less common haplotypes, usually referred to as "atypical" haplotypes. In order to explore the genetic mechanisms that could generate these atypical haplotypes, we extended our analysis to other rarely studied polymorphic markers of the beta(S)-gene cluster, in a total of 40 chromosomes with uncommon haplotypes from Brazil and Cameroon. The following polymorphisms were examined: seven restriction site polymorphisms of the epsilongammadeltabeta-cluster, the pre-(G)gamma framework sequence including the 6-bp deletion/insertion pattern, HS-2 LCR (AT)xR(AT)y and pre-beta (AT)xTy repeat motifs, the GC/TT polymorphism at -1105-1106 of (G)gamma-globin gene, the C/T polymorphism at -551 of the beta-globin gene, and the intragenic beta-globin gene framework. Among the Brazilian subjects, the most common atypical structure (7/16) was a Bantu 3'-subhaplotype associated with different 5'-sequences, while in two chromosomes a Benin 3'-subhaplotype was associated with two different 5'-subhaplotypes. A hybrid Benin/Bantu configuration was also observed. In three chromosomes, the atypical haplotype differed from the typical one by the change of a single restriction site. In 2/134 chromosomes identified as having a typical Bantu RFLP-haplotype, a discrepant LCR repeat sequence was observed, probably owing to a crossover 5' to the epsilon-gene. Among 80 beta(S) chromosomes from Cameroon, 22 were associated with an atypical haplotype. The most common structure was represented by a Benin haplotype (from the LCR to the beta-gene) with a non-Benin segment 3' to the beta-globin gene. In two cases a Bantu LCR was associated with a Benin haplotype and a non-Benin segment 3' to the beta-globin gene. In three other cases, a more complex

  6. Prokaryotic Gene Clusters: A Rich Toolbox for Synthetic Biology

    PubMed Central

    Fischbach, Michael; Voigt, Christopher A.

    2014-01-01

    Bacteria construct elaborate nanostructures, obtain nutrients and energy from diverse sources, synthesize complex molecules, and implement signal processing to react to their environment. These complex phenotypes require the coordinated action of multiple genes, which are often encoded in a contiguous region of the genome, referred to as a gene cluster. Gene clusters sometimes contain all of the genes necessary and sufficient for a particular function. As an evolutionary mechanism, gene clusters facilitate the horizontal transfer of the complete function between species. Here, we review recent work on a number of clusters whose functions are relevant to biotechnology. Engineering these clusters has been hindered by their regulatory complexity, the need to balance the expression of many genes, and a lack of tools to design and manipulate DNA at this scale. Advances in synthetic biology will enable the large-scale bottom-up engineering of the clusters to optimize their functions, wake up cryptic clusters, or to transfer them between organisms. Understanding and manipulating gene clusters will move towards an era of genome engineering, where multiple functions can be “mixed-and-matched” to create a designer organism. PMID:21154668

  7. Frequency distribution of HLA-antigens, genes and haplotypes in the migrant population of Magadan as a function of length of residence in the region.

    PubMed

    Pereverzeva, V V; Solovenchuk, L L

    1996-09-01

    Peculiarities of the frequency distribution of antigens, genes, and haplotypes at subloci A, B, and Cw of the HLA system in 1429 Slavic inhabitants of Magadan are presented in dependence on length of residence in extreme conditions. No significant differences were revealed with respect to frequency of genes and antigens in inhabitants with different lengths of residence in northeastern Russian conditions. Analysis of gamete associations shows that the revealed positive or negative associativity in some cases is characteristic for Caucasoids on the whole, but an associativity specific to inhabitants of Magadan was also established. Its character depends on the length of residence in extreme conditions.

  8. A Novel Haplotype with the R345W Mutation in the EFEMP1 Gene Associated with Autosomal Dominant Drusen in a Japanese Family

    PubMed Central

    Takeuchi, Tomokazu; Bedell, Matthew; Zhang, Kang; Yamada, Hisashi; Tsuneoka, Hiroshi

    2010-01-01

    Purpose. To describe ophthalmic and molecular genetic findings in a family of Japanese patients with Malattia leventinese (ML)/Doyne honeycomb retinal dystrophy (DHRD), also known as autosomal dominant drusen. Methods. Four patients with ML/DHRD, including a 42-year-old female proband, were ascertained. The proband underwent complete ophthalmic examinations, including fundus and electrodiagnostic investigations, and Humphrey visual field (VF) perimetry. Mutation screening of the EFEMP1 gene and haplotype analysis were performed in the family, an Indian ML/DHRD family, and a branch of 1 of 39 ML/DHRD families in the United States, in which all affected patients shared a common haplotype. Results. A heterozygous missense mutation (p.R345W) was identified in all four Japanese patients and in affected patients of the other two families. This mutation was the only mutation that has been exclusively found in the gene. The disease haplotype in the Japanese family was different from those of the other two families. Clinically, central retinas were prominently affected in the proband and her mother, and subsequently the proband developed subfoveal choroidal neovascularization in the left eye, whereas her younger sister with the mutation, who was asymptomatic, exhibited only fine macular drusen. Long-term follow-up of Humphrey VF and multifocal-electroretinography (mfERG) in the proband also revealed progressive attenuation of macular function in the right eye. Conclusions. This is the first report to describe a Japanese family with variable expressivity of ML/DHRD, in which a novel disease haplotype was identified. Humphrey VF and mfERG testing may be helpful in determining the long-term outcome of macular function. PMID:19850834

  9. Aspergillus nidulans mutants defective in stc gene cluster regulation.

    PubMed Central

    Butchko, R A; Adams, T H; Keller, N P

    1999-01-01

    The genes involved in the biosynthesis of sterigmatocystin (ST), a toxic secondary metabolite produced by Aspergillus nidulans and an aflatoxin (AF) precursor in other Aspergillus spp., are clustered on chromosome IV of A. nidulans. The sterigmatocystin gene cluster (stc gene cluster) is regulated by the pathway-specific transcription factor aflR. The function of aflR appears to be conserved between ST- and AF-producing aspergilli, as are most of the other genes in the cluster. We describe a novel screen for detecting mutants defective in stc gene cluster activity by use of a genetic block early in the ST biosynthetic pathway that results in the accumulation of the first stable intermediate, norsolorinic acid (NOR), an orange-colored compound visible with the unaided eye. We have mutagenized this NOR-accumulating strain and have isolated 176 Nor(-) mutants, 83 of which appear to be wild type in growth and development. Sixty of these 83 mutations are linked to the stc gene cluster and are likely defects in aflR or known stc biosynthetic genes. Of the 23 mutations not linked to the stc gene cluster, 3 prevent accumulation of NOR due to the loss of aflR expression. PMID:10511551

  10. Mining Bacterial Genomes for Secondary Metabolite Gene Clusters.

    PubMed

    Adamek, Martina; Spohn, Marius; Stegmann, Evi; Ziemert, Nadine

    2017-01-01

    With the emergence of bacterial resistance against frequently used antibiotics, novel antibacterial compounds are urgently needed. Traditional bioactivity-guided drug discovery strategies involve laborious screening efforts and display high rediscovery rates. With the progress in next generation sequencing methods and the knowledge that the majority of antibiotics in clinical use are produced as secondary metabolites by bacteria, mining bacterial genomes for secondary metabolites with antimicrobial activity is a promising approach, which can guide a more time and cost-effective identification of novel compounds. However, what sounds easy to accomplish, comes with several challenges. To date, several tools for the prediction of secondary metabolite gene clusters are available, some of which are based on the detection of signature genes, while others are searching for specific patterns in gene content or regulation.Apart from the mere identification of gene clusters, several other factors such as determining cluster boundaries and assessing the novelty of the detected cluster are important. For this purpose, comparison of the predicted secondary metabolite genes with different cluster and compound databases is necessary. Furthermore, it is advisable to classify detected clusters into gene cluster families. So far, there is no standardized procedure for genome mining; however, different approaches to overcome all of these challenges exist and are addressed in this chapter. We give practical guidance on the workflow for secondary metabolite gene cluster identification, which includes the determination of gene cluster boundaries, addresses problems occurring with the use of draft genomes, and gives an outlook on the different methods for gene cluster classification. Based on comprehensible examples a protocol is set, which should enable the readers to mine their own genome data for interesting secondary metabolites.

  11. Bioinformatics Prediction of Polyketide Synthase Gene Clusters from Mycosphaerella fijiensis

    PubMed Central

    Noar, Roslyn D.; Daub, Margaret E.

    2016-01-01

    Mycosphaerella fijiensis, causal agent of black Sigatoka disease of banana, is a Dothideomycete fungus closely related to fungi that produce polyketides important for plant pathogenicity. We utilized the M. fijiensis genome sequence to predict PKS genes and their gene clusters and make bioinformatics predictions about the types of compounds produced by these clusters. Eight PKS gene clusters were identified in the M. fijiensis genome, placing M. fijiensis into the 23rd percentile for the number of PKS genes compared to other Dothideomycetes. Analysis of the PKS domains identified three of the PKS enzymes as non-reducing and two as highly reducing. Gene clusters contained types of genes frequently found in PKS clusters including genes encoding transporters, oxidoreductases, methyltransferases, and non-ribosomal peptide synthases. Phylogenetic analysis identified a putative PKS cluster encoding melanin biosynthesis. None of the other clusters were closely aligned with genes encoding known polyketides, however three of the PKS genes fell into clades with clusters encoding alternapyrone, fumonisin, and solanapyrone produced by Alternaria and Fusarium species. A search for homologs among available genomic sequences from 103 Dothideomycetes identified close homologs (>80% similarity) for six of the PKS sequences. One of the PKS sequences was not similar (< 60% similarity) to sequences in any of the 103 genomes, suggesting that it encodes a unique compound. Comparison of the M. fijiensis PKS sequences with those of two other banana pathogens, M. musicola and M. eumusae, showed that these two species have close homologs to five of the M. fijiensis PKS sequences, but three others were not found in either species. RT-PCR and RNA-Seq analysis showed that the melanin PKS cluster was down-regulated in infected banana as compared to growth in culture. Three other clusters, however were strongly upregulated during disease development in banana, suggesting that they may encode

  12. Evolution of a Bitter Taste Receptor Gene Cluster in a New World Sparrow

    PubMed Central

    Davis, Jamie K.; Lowman, Josh J.; Thomas, Pamela J.; ten Hallers, Boudewijn F. H.; Koriabine, Maxim; Huynh, Lynn Y.; Maney, Donna L.; de Jong, Pieter J.; Martin, Christa L.; Thomas, James W.

    2010-01-01

    Bitter taste perception likely evolved as a protective mechanism against the ingestion of harmful compounds in food. The evolution of the taste receptor type 2 (TAS2R) gene family, which encodes the chemoreceptors that are directly responsible for the detection of bitter compounds, has therefore been of considerable interest. Though TAS2R repertoires have been characterized for a number of species, to date the complement of TAS2Rs from just one bird, the chicken, which had a notably small number of TAS2Rs, has been established. Here, we used targeted mapping and genomic sequencing in the white-throated sparrow (Zonotrichia albicollis) and sample sequencing in other closely related birds to reconstruct the history of a TAS2R gene cluster physically linked to the break points of an evolutionary chromosomal rearrangement. In the white-throated sparrow, this TAS2R cluster encodes up to 18 functional bitter taste receptors and likely underwent a large expansion that predates and/or coincides with the radiation of the Emberizinae subfamily into the New World. In addition to signatures of gene birth-and-death evolution within this cluster, estimates of Ka/Ks for the songbird TAS2Rs were similar to those previously observed in mammals, including humans. Finally, comparison of the complete genomic sequence of the cluster from two common haplotypes in the white-throated sparrow revealed a number of nonsynonymous variants and differences in functional gene content within this species. These results suggest that interspecies and intraspecies genetic variability does exist in avian TAS2Rs and that these differences could contribute to variation in bitter taste perception in birds. PMID:20624740

  13. Oculocutaneous albinism type 2 (OCA2) with homozygous 2.7-kb deletion of the P gene and sickle cell disease in a Cameroonian family. Identification of a common TAG haplotype in the mutated P gene.

    PubMed

    Aquaron, Robert; Soufir, Nadem; Bergé-Lefranc, Jean-Louis; Badens, Catherine; Austerlitz, Frederic; Grandchamp, Bernard

    2007-01-01

    In this study, we report on a Cameroonian family from the Ewondo ethnic group, presenting with three oculocutaneous albinism type 2 (OCA2) patients homozygous for the 2.7-kb deletion of the P gene. In one of these patients OCA2 was associated with sickle cell anaemia and in two with the sickle cell trait. We took this opportunity to determine single nucleotide polymorphism (SNP) haplotypes within the P gene in this family in comparison with a group of 53 OCA2 patients homozygous for the same mutation and with a matched unrelated full-coloured control group of 49 subjects, originating from seven different ethnic groups of Southern Cameroon including Ewondo. A combination of five exonic and intronic SNPs in the OCA2 gene was genotyped by sequencing PCR products. We found 3 different haplotypes (TAGCT, TAGTT and TAGCC with frequencies of 0.66, 0.28 and 0.06, respectively) associated with the mutation in the 53 OCA2 patients, while 11 different haplotypes were observed in the control group. These observations suggest that the mutation appeared on the relatively frequent haplotype TAGCT, and that the two other haplotypes are derived from two independent recombination events. These haplotypic data, associated with a value of 1/15,000 for the prevalence of the 2.7-kb mutation, a present effective population size of 10,000,000 for Cameroon and a recombination rate of 0.0031, allowed us to estimate that this mutation originated 4,100-5,645 years ago.

  14. Genetic attributes of the YHRD minimal haplotype in 10 provinces of Argentina.

    PubMed

    Marino, Miguel; Sala, Andrea; Corach, Daniel

    2007-06-01

    We investigated nine Y-STRs on 1136 unrelated males from 10 provinces and three aboriginal tribes of Argentina. The urban populations depicted 654 different haplotypes (66%) and a haplotype diversity (HD) value of 0.9967. Meanwhile, the Amerindian groups ranged from 20 to 26 different haplotypes (62.5-38.2%) and the HD values from 0.8635 to 0.9586. By AMOVA it was determined a remarkable homogeneous haplotype distribution, nevertheless a certain degree of genetic substructure was detected in the North region, in particular in Salta population. Genetic distance allowed to identify three clusters one of them included Salta population and the Amerindian tribes. The results presented herein showed the impact of the European male genetic contribution on the aboriginal gene pool that can be, at present, assessed by analyzing the nowadays extant population.

  15. Familial hypertrophic cardiomyopathy. Microsatellite haplotyping and identification of a hot spot for mutations in the beta-myosin heavy chain gene.

    PubMed

    Dausse, E; Komajda, M; Fetler, L; Dubourg, O; Dufour, C; Carrier, L; Wisnewsky, C; Bercovici, J; Hengstenberg, C; al-Mahdawi, S

    1993-12-01

    Familial hypertrophic cardiomyopathy (FHC) is a clinically and genetically heterogeneous disease. The first identified disease gene, located on chromosome 14q11-q12, encodes the beta-myosin heavy chain. We have performed linkage analysis of two French FHC pedigrees, 720 and 730, with two microsatellite markers located in the beta-myosin heavy chain gene (MYO I and MYO II) and with four highly informative markers, recently mapped to chromosome 14q11-q12. Significant linkage was found with MYO I and MYO II in pedigree 720, but results were not conclusive for pedigree 730. Haplotype analysis of the six markers allowed identification of affected individuals and of some unaffected subjects carrying the disease gene. Two novel missense mutations were identified in exon 13 by direct sequencing, 403Arg-->Leu and 403Arg-->Trp in families 720 and 730, respectively. The 403Arg-->Leu mutation was associated with incomplete penetrance, a high incidence of sudden deaths and severe cardiac events, whereas the consequences of the 403Arg-->Trp mutation appeared less severe. Haplotyping of polymorphic markers in close linkage to the beta-myosin heavy chain gene can, thus, provide rapid analysis of non informative pedigrees and rapid detection of carrier status. Our results also indicate that codon 403 of the beta-myosin heavy chain gene is a hot spot for mutations causing FHC.

  16. Nonlinear model-based method for clustering periodically expressed genes.

    PubMed

    Tian, Li-Ping; Liu, Li-Zhi; Zhang, Qian-Wei; Wu, Fang-Xiang

    2011-01-01

    Clustering periodically expressed genes from their time-course expression data could help understand the molecular mechanism of those biological processes. In this paper, we propose a nonlinear model-based clustering method for periodically expressed gene profiles. As periodically expressed genes are associated with periodic biological processes, the proposed method naturally assumes that a periodically expressed gene dataset is generated by a number of periodical processes. Each periodical process is modelled by a linear combination of trigonometric sine and cosine functions in time plus a Gaussian noise term. A two stage method is proposed to estimate the model parameter, and a relocation-iteration algorithm is employed to assign each gene to an appropriate cluster. A bootstrapping method and an average adjusted Rand index (AARI) are employed to measure the quality of clustering. One synthetic dataset and two biological datasets were employed to evaluate the performance of the proposed method. The results show that our method allows the better quality clustering than other clustering methods (e.g., k-means) for periodically expressed gene data, and thus it is an effective cluster analysis method for periodically expressed gene data.

  17. Haplotype C of growth hormone (GH) gene in Japanese Black cattle: structure of GH protein and a novel method for detection of the gene.

    PubMed

    Bahrami, Abolfazl; Ardiyanti, Astrid; Tonosaki, Kaoru; Suzuki, Keiichi; Tanida, Nanako; Hirayama, Takuji; Roh, Sang-Gun; Nishio, Takeshi; Katoh, Kazuo

    2013-12-01

    From a series of studies on single nucleotide polymorphisms (SNPs) in the bovine growth hormone (GH) gene of Japanese Black cattle, the type-C (127(Val) and 172(Met) ) that is specific for this breed has been intensively focused upon because of the economic importance for carcass traits, such as intramuscular oleic acid contents. In the present study, we intended to analyze the 3-D structure of GH of haplotype C, and developed a novel method to detect the type C gene. Three-D analysis of the type C protein showed that the amino acid residues (127(Val) and 172(Met) ), which are present in the third and fourth helixes, respectively, and are important for binding with GH receptors, are shifted to deeper positions in the molecule compared with that for type A (127(Leu) and 172(Thr) ), implying the alteration of binding interaction with receptors. A novel, efficient and cost-effective method (Dot-blot-SNP technique) for type C genotyping was successfully established, of which the basal method was a reported genotyping of SNPs for a large number of plants, reducing the cost to 10% or less of direct sequencing.

  18. Efficient arsenic metabolism--the AS3MT haplotype is associated with DNA methylation and expression of multiple genes around AS3MT.

    PubMed

    Engström, Karin S; Hossain, Mohammad Bakhtiar; Lauss, Martin; Ahmed, Sultan; Raqib, Rubhana; Vahter, Marie; Broberg, Karin

    2013-01-01

    Arsenic is a very potent toxicant. One major susceptibility factor for arsenic-related toxicity is the efficiency of arsenic metabolism. The efficiency, in turn, is associated with non-coding single nucleotide polymorphisms (SNPs) in the arsenic methyltransferase AS3MT on chromosome 10q24. However, the mechanism of action for these SNPs is not yet clarified. Here, we assessed the influence of genetic variation in AS3MT on DNA methylation and gene expression within 10q24, in people exposed to arsenic in drinking water. DNA was extracted from peripheral blood from women in the Argentinean Andes (N = 103) and from cord blood from new-borns in Bangladesh (N = 127). AS3MT SNPs were analyzed with Sequenom or Taqman assays. Whole genome epigenetic analysis with Infinium HumanMethylation450 BeadChip was performed on bisulphite-treated DNA. Whole genome gene expression analysis was performed with Illumina DirectHyb HumanHT-12 v4.0 on RNA from peripheral blood. Arsenic exposure was assessed by HPLC-ICPMS. In the Argentinean women, the major AS3MT haplotype, associated with more efficient arsenic metabolism, showed increased methylation of AS3MT (p = 10(-6)) and also differential methylation of several other genes within about 800 kilobasepairs: CNNM2 (p<10(-16)), NT5C2 (p<10(-16)), C10orf26 (p = 10(-8)), USMG5 (p = 10(-5)), TRIM8 (p = 10(-4)), and CALHM2 (p = 0.038) (adjusted for multiple comparisons). Similar, but weaker, associations between AS3MT haplotype and DNA methylation in 10q24 were observed in cord blood (Bangladesh). The haplotype-associated altered CpG methylation was correlated with reduced expression of AS3MT and CNNM2 (r(s) = -0.22 to -0.54), and with increased expression of NT5C2 and USMG5 (r(s) = 0.25 to 0.58). Taking other possibly influential variables into account in multivariable linear models did only to a minor extent alter the strength of the associations. In conclusion, the AS3MT haplotype status strongly

  19. Identification of β-globin haplotypes linked to sickle hemoglobin (Hb S) alleles in Mazandaran province, Iran.

    PubMed

    Aghajani, Faeghe; Mahdavi, Mohammad Reza; Kosaryan, Mehrnoush; Mahdavi, Mehrad; Hamidi, Mohaddese; Jalali, Hossein

    2016-12-21

    Carrier frequency of the β(S) allele has been reported to be 0.19% in Mazandaran province, northern Iran. Haplotype analysis of the β(S) allele helps trace the origin of its encoded hemoglobin (Hb) variant, Hb S, in a region. The aim of this study was to investigate the haplotypes associated with β(S) alleles in Mazandaran province. Capillary electrophoresis was carried out to detect individuals suspected to have a βS allele(s). DNA analysis (PCR-RFLP) was used for final confirmation. To identify 5\\' to 3\\' β-globin gene cluster haplotypes associated with β(S) alleles, family linkage analysis was applied. Six polymorphic sites (HincII 5' to ε, XmnI 5' to (G)γ, HindIII in (G)γ, HindIII in (A)γ, HincII 3' to ψβ and AvaII in β) were investigated using the PCR-RFLP method. Five different haplotypes were linked to β(S) alleles, while β(A) alleles were associated with nine haplotypes. Among the β(S) alleles, 53.9% were associated with the Benin (----++) haplotype, and the Arab-Indian (+++-++) haplotype had the second-highest frequency (23%). Unlike southern provinces, where the Arab-Indian haplotype is prominent, the Benin haplotype is the most frequent haplotype in northern Iran, and this may represent a founder effect. Since the Benin haplotype does not carry the XmnI polymorphism 5' to the (G)γ gene, which is responsible for high expression of Hb F, a severe form of sickle cell disease can be anticipated in patients that are homozygous for the β(S) allele in the northern region.

  20. Sesterterpene ophiobolin biosynthesis involving multiple gene clusters in Aspergillus ustus

    PubMed Central

    Chai, Hangzhen; Yin, Ru; Liu, Yongfeng; Meng, Huiying; Zhou, Xianqiang; Zhou, Guolin; Bi, Xupeng; Yang, Xue; Zhu, Tonghan; Zhu, Weiming; Deng, Zixin; Hong, Kui

    2016-01-01

    Terpenoids are the most diverse and abundant natural products among which sesterterpenes account for less than 2%, with very few reports on their biosynthesis. Ophiobolins are tricyclic 5–8–5 ring sesterterpenes with potential pharmaceutical application. Aspergillus ustus 094102 from mangrove rizhosphere produces ophiobolin and other terpenes. We obtained five gene cluster knockout mutants, with altered ophiobolin yield using genome sequencing and in silico analysis, combined with in vivo genetic manipulation. Involvement of the five gene clusters in ophiobolin synthesis was confirmed by investigation of the five key terpene synthesis relevant enzymes in each gene cluster, either by gene deletion and complementation or in vitro verification of protein function. The results demonstrate that ophiobolin skeleton biosynthesis involves five gene clusters, which are responsible for C15, C20, C25, and C30 terpenoid biosynthesis. PMID:27273151

  1. Common SNP-Based Haplotype Analysis of the 4p16.3 Huntington Disease Gene Region

    PubMed Central

    Lee, Jong-Min; Gillis, Tammy; Mysore, Jayalakshmi Srinidhi; Ramos, Eliana Marisa; Myers, Richard H.; Hayden, Michael R.; Morrison, Patrick J.; Nance, Martha; Ross, Christopher A.; Margolis, Russell L.; Squitieri, Ferdinando; Griguoli, Annamaria; Di Donato, Stefano; Gomez-Tortosa, Estrella; Ayuso, Carmen; Suchowersky, Oksana; Trent, Ronald J.; McCusker, Elizabeth; Novelletto, Andrea; Frontali, Marina; Jones, Randi; Ashizawa, Tetsuo; Frank, Samuel; Saint-Hilaire, Marie-Helene; Hersch, Steven M.; Rosas, Herminia D.; Lucente, Diane; Harrison, Madaline B.; Zanko, Andrea; Abramson, Ruth K.; Marder, Karen; Sequeiros, Jorge; MacDonald, Marcy E.; Gusella, James F.

    2012-01-01

    Age at the onset of motor symptoms in Huntington disease (HD) is determined largely by the length of a CAG repeat expansion in HTT but is also influenced by other genetic factors. We tested whether common genetic variation near the mutation site is associated with differences in the distribution of expanded CAG alleles or age at the onset of motor symptoms. To define disease-associated single-nucleotide polymorphisms (SNPs), we compared 4p16.3 SNPs in HD subjects with population controls in a case:control strategy, which revealed that the strongest signals occurred at a great distance from the HD mutation as a result of “synthetic association” with SNP alleles that are of low frequency in population controls. Detailed analysis delineated a prominent ancestral haplotype that accounted for ∼50% of HD chromosomes and extended to at least 938 kb on about half of these. Together, the seven most abundant haplotypes accounted for ∼83% of HD chromosomes. Neither the extended shared haplotype nor the individual local HTT haplotypes were associated with altered CAG-repeat length distribution or residual age at the onset of motor symptoms, arguing against modification of these disease features by common cis-regulatory elements. Similarly, the 11 most frequent control haplotypes showed no trans-modifier effect on age at the onset of motor symptoms. Our results argue against common local regulatory variation as a factor influencing HD pathogenesis, suggesting that genetic modifiers be sought elsewhere in the genome. They also indicate that genome-wide association analysis with a small number of cases can be effective for regional localization of genetic defects, even when a founder effect accounts for only a fraction of the disorder. PMID:22387017

  2. Fetal hemoglobin in sickle cell anemia: genetic studies of the Arab-Indian haplotype.

    PubMed

    Ngo, Duyen; Bae, Harold; Steinberg, Martin H; Sebastiani, Paola; Solovieff, Nadia; Baldwin, Clinton T; Melista, Efthymia; Safaya, Surinder; Farrer, Lindsay A; Al-Suliman, Ahmed M; Albuali, Waleed H; Al Bagshi, Muneer H; Naserullah, Zaki; Akinsheye, Idowu; Gallagher, Patrick; Luo, Hong-yuan; Chui, David H K; Farrell, John J; Al-Ali, Amein K; Alsultan, Abdulrahman

    2013-06-01

    Sickle cell anemia is common in the Middle East and India where the HbS gene is sometimes associated with the Arab-Indian (AI) β-globin gene (HBB) cluster haplotype. In this haplotype of sickle cell anemia, fetal hemoglobin (HbF) levels are 3-4 fold higher than those found in patients with HbS haplotypes of African origin. Little is known about the genetic elements that modulate HbF in AI haplotype patients. We therefore studied Saudi HbS homozygotes with the AI haplotype (mean HbF 19.2±7.0%, range 3.6 to 39.6%) and employed targeted genotyping of polymorphic sites to explore cis- and trans- acting elements associated with high HbF expression. We also described sequences which appear to be unique to the AI haplotype for which future functional studies are needed to further define their role in HbF modulation. All cases, regardless of HbF concentration, were homozygous for AI haplotype-specific elements cis to HBB. SNPs in BCL11A and HBS1L-MYB that were associated with HbF in other populations explained only 8.8% of the variation in HbF. KLF1 polymorphisms associated previously with high HbF were not present in the 44 patients tested. More than 90% of the HbF variance in sickle cell patients with the AI haplotype remains unexplained by the genetic loci that we studied. The dispersion of HbF levels among AI haplotype patients suggests that other genetic elements modulate the effects of the known cis- and trans-acting regulators. These regulatory elements, which remain to be discovered, might be specific in the Saudi and some other populations where HbF levels are especially high.

  3. Left-Right Axis Differentiation and Functional Lateralization: a Haplotype in the Methyltransferase Encoding Gene SETDB2 Might Mediate Handedness in Healthy Adults.

    PubMed

    Ocklenburg, Sebastian; Arning, Larissa; Gerding, Wanda M; Hengstler, Jan G; Epplen, Jörg T; Güntürkün, Onur; Beste, Christian; Akkad, Denis A

    2016-11-01

    Handedness is a multifactorial trait, and genes contributing to the differentiation of the left-right axis during embryogenesis have been identified as a major gene group associated with this trait. The methyltransferase SETDB2 (SET domain, bifurcated 2) has been shown to regulate structural left-right asymmetry in the vertebrate central nervous system by suppressing fgf8 expression. Here, we investigated the relation of genetic variation in SETDB2-and its paralogue SETDB1-with different handedness phenotypes in 950 healthy adult participants. We identified a haplotype on SETDB2 for which homozygous individuals showed a significantly lower lateralization quotient for handedness than the rest of the cohort after correction for multiple comparisons. Moreover, direction of handedness was significantly associated with genetic variation in this haplotype. This effect was mainly, but not exclusively, driven by the sequence variation rs4942830, as individuals homozygous for the A allele of this single nucleotide polymorphism had a significantly lower lateralization quotient than individuals with at least one T allele. These findings further confirm a role of genetic pathways relevant for structural left-right axis differentiation for functional lateralization. Moreover, as the protein encoded by SETDB2 regulates gene expression epigenetically by histone H3 methylation, our findings highlight the importance of investigating the role of epigenetic modulations of gene expression in relation to handedness.

  4. Genomic analyses of bacterial porin-cytochrome gene clusters

    DOE PAGES

    Shi, Liang; Fredrickson, James K.; Zachara, John M.

    2014-11-26

    In this study, the porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additionally microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteriamore » from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in the microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular electron transfer reactions with the substrates other than Fe(III) and Mn(IV) oxides.« less

  5. Frequencies of single-nucleotide polymorphisms and haplotypes of the SLCO1B1 gene in selected populations of the western balkans

    PubMed Central

    Grapci, A Daka; Dimovski, AJ; Kapedanovska, A; Vavlukis, M; Eftimov, A; Geshkovska, N Matevska; Labachevski, N; Jakjovski, K; Gorani, D; Kedev, S; Mladenovska, K

    2015-01-01

    As a membrane influx transporter, organic anion-transporting polypeptide 1B1 (OATP1B1) regulates the cellular uptake of a number of endogenous compounds and drugs. The aim of this study was to characterize the diversity of the solute carrier organic anion transporter family member 1B1 (SLCO1B1) gene encoding this transporter in two ethnic groups populating the Western Balkans. The distribution of SCLO1B1 alleles was determined at seven variant sites (c.388A>G, c.521T>C, c.571T>C, c.597C>T, c.1086C>T, c.1463G>C and c.*439T>G) in 266 Macedonians and 94 Albanians using the TaqMan allelic discrimination assay. No significant difference in the frequencies of the single nucleotide polymorphisms (SNPs) was observed between these populations. The frequency of the c.521T>C SNP was the lowest (<13.7 and 12.2%, respectively), while the frequencies of all other SNP alleles were above 40.0%. Variant alleles of c.1463G>C and c.1086 C>T SNPs were not identified in either ethnic group. The haplotype analysis revealed 20 and 21 different haplotypes in the Macedonian and Albanian population, respectively. The most common haplotype in both ethnic groups, *1J/*1K/*1L, had a frequency of 39.0% and 26.6%, respectively. In both populations, the variant alleles of the functionally significant c.521T>C and c.388A>G SNPs existed in one major haplotype (*15/*16/*17), with a frequency of 8.6 and 2.4% in the Macedonian and Albanian subjects, respectively. In conclusion, sequence variations of the SLCO1B1 gene in the studied populations occur at high frequencies, which are similar to that of the Caucasian population. Further studies are needed to evaluate the clinical significance of these SNPs and/ or the major SLCO1B1 haplotypes they form for a large number of substrates and for susceptibility to certain diseases. PMID:26929901

  6. Frequencies of single-nucleotide polymorphisms and haplotypes of the SLCO1B1 gene in selected populations of the western balkans.

    PubMed

    Grapci, A Daka; Dimovski, A J; Kapedanovska, A; Vavlukis, M; Eftimov, A; Geshkovska, N Matevska; Labachevski, N; Jakjovski, K; Gorani, D; Kedev, S; Mladenovska, K

    2015-06-01

    As a membrane influx transporter, organic anion-transporting polypeptide 1B1 (OATP1B1) regulates the cellular uptake of a number of endogenous compounds and drugs. The aim of this study was to characterize the diversity of the solute carrier organic anion transporter family member 1B1 (SLCO1B1) gene encoding this transporter in two ethnic groups populating the Western Balkans. The distribution of SCLO1B1 alleles was determined at seven variant sites (c.388A>G, c.521T>C, c.571T>C, c.597C>T, c.1086C>T, c.1463G>C and c.*439T>G) in 266 Macedonians and 94 Albanians using the TaqMan allelic discrimination assay. No significant difference in the frequencies of the single nucleotide polymorphisms (SNPs) was observed between these populations. The frequency of the c.521T>C SNP was the lowest (<13.7 and 12.2%, respectively), while the frequencies of all other SNP alleles were above 40.0%. Variant alleles of c.1463G>C and c.1086 C>T SNPs were not identified in either ethnic group. The haplotype analysis revealed 20 and 21 different haplotypes in the Macedonian and Albanian population, respectively. The most common haplotype in both ethnic groups, *1J/*1K/*1L, had a frequency of 39.0% and 26.6%, respectively. In both populations, the variant alleles of the functionally significant c.521T>C and c.388A>G SNPs existed in one major haplotype (*15/*16/*17), with a frequency of 8.6 and 2.4% in the Macedonian and Albanian subjects, respectively. In conclusion, sequence variations of the SLCO1B1 gene in the studied populations occur at high frequencies, which are similar to that of the Caucasian population. Further studies are needed to evaluate the clinical significance of these SNPs and/ or the major SLCO1B1 haplotypes they form for a large number of substrates and for susceptibility to certain diseases.

  7. Understanding Y haplotype matching probability.

    PubMed

    Brenner, Charles H

    2014-01-01

    the simple exchangability model on which the kappa method rests. However, to make progress toward forensic calculation of Y haplotype mixture evidence a different tack is needed. The "Laplace distribution" model of Andersen et al. [3] which estimates haplotype frequencies by identifying haplotype clusters in population data looks useful.

  8. Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca.

    PubMed

    Temme, Karsten; Zhao, Dehua; Voigt, Christopher A

    2012-05-01

    Bacterial genes associated with a single trait are often grouped in a contiguous unit of the genome known as a gene cluster. It is difficult to genetically manipulate many gene clusters because of complex, redundant, and integrated host regulation. We have developed a systematic approach to completely specify the genetics of a gene cluster by rebuilding it from the bottom up using only synthetic, well-characterized parts. This process removes all native regulation, including that which is undiscovered. First, all noncoding DNA, regulatory proteins, and nonessential genes are removed. The codons of essential genes are changed to create a DNA sequence as divergent as possible from the wild-type (WT) gene. Recoded genes are computationally scanned to eliminate internal regulation. They are organized into operons and placed under the control of synthetic parts (promoters, ribosome binding sites, and terminators) that are functionally separated by spacer parts. Finally, a controller consisting of genetic sensors and circuits regulates the conditions and dynamics of gene expression. We applied this approach to an agriculturally relevant gene cluster from Klebsiella oxytoca encoding the nitrogen fixation pathway for converting atmospheric N(2) to ammonia. The native gene cluster consists of 20 genes in seven operons and is encoded in 23.5 kb of DNA. We constructed a "refactored" gene cluster that shares little DNA sequence identity with WT and for which the function of every genetic part is defined. This work demonstrates the potential for synthetic biology tools to rewrite the genetics encoding complex biological functions to facilitate access, engineering, and transferability.

  9. Clustering Algorithms: Their Application to Gene Expression Data

    PubMed Central

    Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

    2016-01-01

    Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure. PMID:27932867

  10. The Senegal DNA haplotype is associated with the amelioration of anemia in African-American sickle cell anemia patients.

    PubMed

    Nagel, R L; Erlingsson, S; Fabry, M E; Croizat, H; Susuka, S M; Lachman, H; Sutton, M; Driscoll, C; Bouhassira, E; Billett, H H

    1991-03-15

    We have previously determined that in African sickle cell anemia (SS) patients three different beta-like globin gene cluster haplotypes are associated with different percent G gamma (one of the two types of non-alpha chains comprising hemoglobin F [HbF]), mean percent HbF, and percent dense cells. We report now that in adult New York SS patients, the presence of at least one chromosome with the Senegal haplotype is associated with higher Hb levels (1.2 g/dL higher) than is found for any other non-Senegal haplotype (P less than .004). The percent reticulocytes and the serum bilirubin levels were lower in these patients. When the effect of alpha-gene number was analyzed by examining a sample of SS patients with concomitant alpha-thalassemia, the same results were obtained. Because the HbF level is significantly higher among the Senegal haplotype carriers in this sample, the inhibitory effect on sickling of this Hb variant may be one of the reasons for the haplotype effect. We conclude that the Senegal beta-like globin gene cluster haplotype is associated with an amelioration of the hemolytic anemia that characterizes sickle cell disease.

  11. Entropy-based cluster validation and estimation of the number of clusters in gene expression data.

    PubMed

    Novoselova, Natalia; Tom, Igor

    2012-10-01

    Many external and internal validity measures have been proposed in order to estimate the number of clusters in gene expression data but as a rule they do not consider the analysis of the stability of the groupings produced by a clustering algorithm. Based on the approach assessing the predictive power or stability of a partitioning, we propose the new measure of cluster validation and the selection procedure to determine the suitable number of clusters. The validity measure is based on the estimation of the "clearness" of the consensus matrix, which is the result of a resampling clustering scheme or consensus clustering. According to the proposed selection procedure the stable clustering result is determined with the reference to the validity measure for the null hypothesis encoding for the absence of clusters. The final number of clusters is selected by analyzing the distance between the validity plots for initial and permutated data sets. We applied the selection procedure to estimate the clustering results on several datasets. As a result the proposed procedure produced an accurate and robust estimate of the number of clusters, which are in agreement with the biological knowledge and gold standards of cluster quality.

  12. The human RHOX gene cluster: target genes and functional analysis of gene variants in infertile men.

    PubMed

    Borgmann, Jennifer; Tüttelmann, Frank; Dworniczak, Bernd; Röpke, Albrecht; Song, Hye-Won; Kliesch, Sabine; Wilkinson, Miles F; Laurentino, Sandra; Gromoll, Jörg

    2016-09-15

    The X-linked reproductive homeobox (RHOX) gene cluster encodes transcription factors preferentially expressed in reproductive tissues. This gene cluster has important roles in male fertility based on phenotypic defects of Rhox-mutant mice and the finding that aberrant RHOX promoter methylation is strongly associated with abnormal human sperm parameters. However, little is known about the molecular mechanism of RHOX function in humans. Using gene expression profiling, we identified genes regulated by members of the human RHOX gene cluster. Some genes were uniquely regulated by RHOXF1 or RHOXF2/2B, while others were regulated by both of these transcription factors. Several of these regulated genes encode proteins involved in processes relevant to spermatogenesis; e.g. stress protection and cell survival. One of the target genes of RHOXF2/2B is RHOXF1, suggesting cross-regulation to enhance transcriptional responses. The potential role of RHOX in human infertility was addressed by sequencing all RHOX exons in a group of 250 patients with severe oligozoospermia. This revealed two mutations in RHOXF1 (c.515G > A and c.522C > T) and four in RHOXF2/2B (-73C > G, c.202G > A, c.411C > T and c.679G > A), of which only one (c.202G > A) was found in a control group of men with normal sperm concentration. Functional analysis demonstrated that c.202G > A and c.679G > A significantly impaired the ability of RHOXF2/2B to regulate downstream genes. Molecular modelling suggested that these mutations alter RHOXF2/F2B protein conformation. By combining clinical data with in vitro functional analysis, we demonstrate how the X-linked RHOX gene cluster may function in normal human spermatogenesis and we provide evidence that it is impaired in human male fertility.

  13. Clustering of genes necessary for hydrogen oxidation in Rhodobacter capsulatus.

    PubMed Central

    Xu, H W; Wall, J D

    1991-01-01

    Three cosmids previously shown to contain information necessary for the expression of uptake of hydrogenase in Rhodobacter capsulatus were found to be present in a cluster on the chromosome. Earlier genetic experiments suggested the presence of at least six genes essential for hydrogenase activity that are now shown to be in a region of approximately 18 kb that includes the structural genes for the enzyme. A potential response regulator gene was sequenced as a part of the hup gene region. PMID:2007559

  14. Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis.

    PubMed

    Koh, Esther G L; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V; Brenner, Sydney; Venkatesh, Byrappa

    2003-02-04

    The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes.

  15. Haplotype determination of the upstream regulatory region and the second exon of the BoLA-DRB3 gene in Holstein cattle.

    PubMed

    Goszczynski, D E; Ripoli, M V; Takeshima, S-N; Baltian, L; Aida, Y; Giovambattista, G

    2014-03-01

    Polymorphisms of the BoLA-DRB3 gene are located primarily in the second exon [antigen binding site (ABS)] and, to a lesser extent, in the upstream regulatory region (URR). It can be hypothesised that exon 2 and the URR are under different types of natural selection. The aim of this work was to determine the URR-exon 2 haplotypes; 34 Holstein samples were genotyped by direct sequencing. A total of 7 URR alleles and 23 exon 2 alleles were detected, and 3 of the URR alleles were novel. Our results may suggest that no relationship exists between the URR and exon 2 of the BoLA-DRB3 gene (linkage disequilibrium P value > 0.05), most likely due to recombination over time. Our results also suggest that both regions of class II genes may be included in the development of new genotyping methods based on next-generation DNA sequencing technologies.

  16. 3D visualization of gene clusters and networks

    NASA Astrophysics Data System (ADS)

    Zhang, Leishi; Sheng, Weiguo; Liu, Xiaohui

    2005-03-01

    In this paper, we try to provide a global view of DNA microarray gene expression data analysis and modeling process by combining novel and effective visualization techniques with data mining algorithms. An integrated framework has been proposed to model and visualize short, high-dimensional gene expression data. The framework reduces the dimensionality of variables before applying appropriate temporal modeling method. Prototype has been built using Java3D to visualize the framework. The prototype takes gene expression data as input, clusters the genes, displays the clustering results using a novel graph layout algorithm, models individual gene clusters using Dynamic Bayesian Network and then visualizes the modeling results using simple but effective visualization techniques.

  17. SMART: Unique Splitting-While-Merging Framework for Gene Clustering

    PubMed Central

    Fa, Rui; Roberts, David J.; Nandi, Asoke K.

    2014-01-01

    Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merging clustering framework, named “splitting merging awareness tactics” (SMART), which does not require any a priori knowledge of either the number of clusters or even the possible range of this number. Unlike existing self-splitting algorithms, which over-cluster the dataset to a large number of clusters and then merge some similar clusters, our framework has the ability to split and merge clusters automatically during the process and produces the the most reliable clustering results, by intrinsically integrating many clustering techniques and tasks. The SMART framework is implemented with two distinct clustering paradigms in two algorithms: competitive learning and finite mixture model. Nevertheless, within the proposed SMART framework, many other algorithms can be derived for different clustering paradigms. The minimum message length algorithm is integrated into the framework as the clustering selection criterion. The usefulness of the SMART framework and its algorithms is tested in demonstration datasets and simulated gene expression datasets. Moreover, two real microarray gene expression datasets are studied using this approach. Based on the performance of many metrics, all numerical results show that SMART is superior to compared existing self-splitting algorithms and traditional algorithms. Three main properties of the proposed SMART framework are summarized as: (1) needing no parameters dependent on the respective dataset or a priori knowledge about the datasets, (2) extendible to many different applications, (3) offering superior performance compared with counterpart algorithms. PMID:24714159

  18. Identifying a gene expression signature of cluster headache in blood

    PubMed Central

    Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

    2017-01-01

    Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

  19. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil

    PubMed Central

    2010-01-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5‘ ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3‘ ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405

  20. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil.

    PubMed

    Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe

    2010-07-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.

  1. Gene Cluster Encoding Cholate Catabolism in Rhodococcus spp.

    PubMed Central

    Wilbrink, Maarten H.; Casabon, Israël; Stewart, Gordon R.; Liu, Jie; van der Geize, Robert; Eltis, Lindsay D.

    2012-01-01

    Bile acids are highly abundant steroids with important functions in vertebrate digestion. Their catabolism by bacteria is an important component of the carbon cycle, contributes to gut ecology, and has potential commercial applications. We found that Rhodococcus jostii RHA1 grows well on cholate, as well as on its conjugates, taurocholate and glycocholate. The transcriptome of RHA1 growing on cholate revealed 39 genes upregulated on cholate, occurring in a single gene cluster. Reverse transcriptase quantitative PCR confirmed that selected genes in the cluster were upregulated 10-fold on cholate versus on cholesterol. One of these genes, kshA3, encoding a putative 3-ketosteroid-9α-hydroxylase, was deleted and found essential for growth on cholate. Two coenzyme A (CoA) synthetases encoded in the cluster, CasG and CasI, were heterologously expressed. CasG was shown to transform cholate to cholyl-CoA, thus initiating side chain degradation. CasI was shown to form CoA derivatives of steroids with isopropanoyl side chains, likely occurring as degradation intermediates. Orthologous gene clusters were identified in all available Rhodococcus genomes, as well as that of Thermomonospora curvata. Moreover, Rhodococcus equi 103S, Rhodococcus ruber Chol-4 and Rhodococcus erythropolis SQ1 each grew on cholate. In contrast, several mycolic acid bacteria lacking the gene cluster were unable to grow on cholate. Our results demonstrate that the above-mentioned gene cluster encodes cholate catabolism and is distinct from a more widely occurring gene cluster encoding cholesterol catabolism. PMID:23024343

  2. Minimum spanning trees for gene expression data clustering.

    PubMed

    Xu, Y; Olman, V; Xu, D

    2001-01-01

    This paper describes a new framework for microarray gene-expression data clustering. The foundation of this framework is a minimum spanning tree (MST) representation of a set of multi-dimensional gene expression data. A key property of this representation is that each cluster of the expression data corresponds to one subtree of the MST, which rigorously converts a multi-dimensional clustering problem to a tree partitioning problem. We have demonstrated that though the inter-data relationship is greatly simplified in the MST representation, no essential information is lost for the purpose of clustering. Two key advantages in representing a set of multi-dimensional data as an MST are: (1) the simple structure of a tree facilitates efficient implementations of rigorous clustering algorithms, which otherwise are highly computationally challenging; and (2) as an MST-based clustering does not depend on detailed geometric shape of a cluster, it can overcome many of the problems faced by classical clustering algorithms. Based on the MST representation, we have developed a number of rigorous and efficient clustering algorithms, including two with guaranteed global optimality. We have implemented these algorithms as a computer software EXCAVATOR. To demonstrate its effectiveness, we have tested it on two data sets, i.e., expression data from yeast Saccharomyces cerevisiae, and Arabidopsis expression data in response to chitin elicitation.

  3. Secondary metabolic gene clusters: evolutionary toolkits for chemical innovation.

    PubMed

    Osbourn, Anne

    2010-10-01

    Microbes and plants produce a huge array of secondary metabolites that have important ecological functions. These molecules have long been exploited in medicine as antibiotics, anticancer and anti-infective agents and for a wide range of other applications. Gene clusters for secondary metabolic pathways are common in bacteria and filamentous fungi, and examples have now been discovered in plants. Here, current knowledge of gene clusters across the kingdoms is evaluated with the aim of trying to understand the rules behind cluster existence and evolution. Such knowledge will be crucial in learning how to activate the enormous number of 'silent' gene clusters being revealed by whole-genome sequencing and hence in making available a wealth of novel compounds for evaluation as drug leads and other bioactives. It could also facilitate the development of crop plants with enhanced pest or disease resistance, improved nutritional qualities and/or elevated levels of high-value products.

  4. Identification of a new haplotype within the promoter region of the MSTN gene in horses from five of the most common breeds in Poland.

    PubMed

    Stefaniuk, Monika; Kaczor, Urszula; Augustyn, Romana; Gurgul, Artur; Kulisa, Maria; Podstawski, Zenon

    2014-01-01

    Myostatin (GDF-8) encoded by the MSTN gene is a negative regulator of muscle growth and development and belongs to the TGF-β superfamily of secreted growth and differentiation factors. In Thoroughbred horses, an MSTN sequence polymorphism (g.66493737C>T) is associated with optimum race distance. In the present study, a genetic polymorphism of a predicted promoter of the MSTN gene was investigated in 451 horses belonging to five different breeds: Arabian, Thoroughbred, Polish Konik, Hucul and Polish Heavy Draft. Two SNPs located at g.66495826T>C and g.66495696T>C (chr;18 EquCab 2.0) showed three haplotypes previously described: [g.66495826:T, g.66495696:T], [g.66495826:T, g.66495696:C], [g.66495826:C, g.66495696:T] with frequencies 0.877; 0.101; 0.005; respectively. Analysis performed on Polish Heavy Draft indicated the occurrence of a new haplotype [g.6649582626:C, g.66495696:C] with frequency 0.016.

  5. Clustering gene expression data using a diffraction‐inspired framework

    PubMed Central

    2012-01-01

    Background The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised clustering approach is often used, resulting in the development of a wide variety of algorithms. Typical clustering algorithms require selecting certain parameters to operate, for instance the number of expected clusters, as well as defining a similarity measure to quantify the distance between data points. The diffraction‐based clustering algorithm however is designed to overcome this necessity for user‐defined parameters, as it is able to automatically search the data for any underlying structure. Methods The diffraction‐based clustering algorithm presented in this paper is tested using five well‐known expression datasets pertaining to cancerous tissue samples. The clustering results are then compared to those results obtained from conventional algorithms such as the k‐means, fuzzy c‐means, self‐organising map, hierarchical clustering algorithm, Gaussian mixture model and density‐based spatial clustering of applications with noise (DBSCAN). The performance of each algorithm is measured using an average external criterion and an average validity index. Results The diffraction‐based clustering algorithm is shown to be independent of the number of clusters as the algorithm searches the feature space and requires no form of parameter selection. The results show that the diffraction‐based clustering algorithm performs significantly better on the real biological datasets compared to the other existing algorithms. Conclusion The results of the diffraction‐based clustering algorithm presented in this paper suggest that the method can provide researchers with a new tool for successfully analysing microarray data. PMID:23164195

  6. Heterologous Expression of Novobiocin and Clorobiocin Biosynthetic Gene Clusters

    PubMed Central

    Eustáquio, Alessandra S.; Gust, Bertolt; Galm, Ute; Li, Shu-Ming; Chater, Keith F.; Heide, Lutz

    2005-01-01

    A method was developed for the heterologous expression of biosynthetic gene clusters in different Streptomyces strains and for the modification of these clusters by single or multiple gene replacements or gene deletions with unprecedented speed and versatility. λ-Red-mediated homologous recombination was used for genetic modification of the gene clusters, and the attachment site and integrase of phage φC31 were employed for the integration of these clusters into the heterologous hosts. This method was used to express the gene clusters of the aminocoumarin antibiotics novobiocin and clorobiocin in the well-studied strains Streptomyces coelicolor and Streptomyces lividans, which, in contrast to the natural producers, can be easily genetically manipulated. S. coelicolor M512 derivatives produced the respective antibiotic in yields comparable to those of natural producer strains, whereas S. lividans TK24 derivatives were at least five times less productive. This method could also be used to carry out functional investigations. Shortening of the cosmids' inserts showed which genes are essential for antibiotic production. PMID:15870333

  7. Characterization of the Largest Effector Gene Cluster of Ustilago maydis

    PubMed Central

    Vincon, Volker; Kahmann, Regine

    2014-01-01

    In the genome of the biotrophic plant pathogen Ustilago maydis, many of the genes coding for secreted protein effectors modulating virulence are arranged in gene clusters. The vast majority of these genes encode novel proteins whose expression is coupled to plant colonization. The largest of these gene clusters, cluster 19A, encodes 24 secreted effectors. Deletion of the entire cluster results in severe attenuation of virulence. Here we present the functional analysis of this genomic region. We show that a 19A deletion mutant behaves like an endophyte, i.e. is still able to colonize plants and complete the infection cycle. However, tumors, the most conspicuous symptoms of maize smut disease, are only rarely formed and fungal biomass in infected tissue is significantly reduced. The generation and analysis of strains carrying sub-deletions identified several genes significantly contributing to tumor formation after seedling infection. Another of the effectors could be linked specifically to anthocyanin induction in the infected tissue. As the individual contributions of these genes to tumor formation were small, we studied the response of maize plants to the whole cluster mutant as well as to several individual mutants by array analysis. This revealed distinct plant responses, demonstrating that the respective effectors have discrete plant targets. We propose that the analysis of plant responses to effector mutant strains that lack a strong virulence phenotype may be a general way to visualize differences in effector function. PMID:24992561

  8. Endothelial nitric oxide synthase gene variants and haplotypes associated with an increased risk of idiopathic recurrent miscarriage.

    PubMed

    Almawi, W Y; Guarino, B D; Al-Sulaiti, M A; Al-Busaidi, A S; Racoubian, E; Finan, R R

    2013-09-01

    We investigated the association of endothelial nitric oxide synthase (NOS3) polymorphisms rs2070744 (-786T> C), 27-bp repeat 4b/4a, rs1799983 (Glu298Asp), rs3918188 (-734C> A), and rs743507 (113G> A) with idiopathic recurrent miscarriage (IRM). This was a case-control study involving women with confirmed IRM (n = 296), and 305 age- and ethnically matched control women. NOS3 rs2070744, rs1799983, rs3918188, and rs743507 genotyping was done by TaqMan assays; NOS3 4b/4a genotyping was done by PCR-ASA. A higher frequency of -786C and 298Asp alleles was seen in IRM cases, which remained associated independently with IRM on multivariate analysis. Allele and genotype distribution of 4b/4a, rs3918188 (-734C> A) and rs743507 (113A> G) were comparable between IRM cases and control women. Taking homozygous wild-type genotype as a reference, regression analysis confirmed the association of Glu298Asp and -786T/C, and rs743507 homozygous carriers with IRM risk. Marked linkage disequilibrium was seen between tested NOS3 variants, thus allowing the construction of 5-locus [-786T> C/4b4a/Glu298Asp/-734C> A/113G> A] haplotypes. Taking the common T4bGCA haplotype as a reference, multivariate analysis confirmed the positive association of C4bTCG haplotype with IRM, after controlling for traditional covariates. Genetic variation at the NOS3 locus represents a genetic risk factor for increased susceptibility to IRM.

  9. Haplotype diversity in the equine myostatin gene with focus on variants associated with race distance propensity and muscle fiber type proportions.

    PubMed

    Petersen, Jessica L; Valberg, Stephanie J; Mickelson, James R; McCue, Molly E

    2014-12-01

    Two variants in the equine myostatin gene (MSTN), including a T/C SNP in the first intron and a 227-bp SINE insertion in the promoter, are associated with muscle fiber type proportions in the Quarter Horse (QH) and with the prediction of race distance propensity in the Thoroughbred (TB). Genotypes from these loci, along with 18 additional variants surrounding MSTN, were examined in 301 horses of 14 breeds to evaluate haplotype relationships and diversity. The C allele of intron 1 was found in 12 of 14 breeds at a frequency of 0.27; the SINE was observed in five breeds, but common in only the TB and QH (0.73 and 0.48 respectively). Haplotype data suggest the SINE insertion is contemporary to and arose upon a haplotype containing the intron 1 C allele. Gluteal muscle biopsies of TBs showed a significant association of the intron 1 C allele and SINE with a higher proportion of Type 2B and lower proportion of Type 1 fibers. However, in the Belgian horse, in which the SINE is not present, the intron 1 SNP was not associated with fiber type proportions, and evaluation of fiber type proportions across the Belgian, TB and QH breeds shows the significant effect of breed on fiber type proportions is negated when evaluating horses without the SINE variant. These data suggest the SINE, rather than the intron 1 SNP, is driving the observed muscle fiber type characteristics and is the variant targeted by selection for short-distance racing.

  10. Cluster J Mycobacteriophages: Intron Splicing in Capsid and Tail Genes

    PubMed Central

    Pope, Welkin H.; Jacobs-Sera, Deborah; Best, Aaron A.; Broussard, Gregory W.; Connerly, Pamela L.; Dedrick, Rebekah M.; Kremer, Timothy A.; Offner, Susan; Ogiefo, Amenawon H.; Pizzorno, Marie C.; Rockenbach, Kate; Russell, Daniel A.; Stowe, Emily L.; Stukey, Joseph; Thibault, Sarah A.; Conway, James F.; Hendrix, Roger W.; Hatfull, Graham F.

    2013-01-01

    Bacteriophages isolated on Mycobacterium smegmatis mc2155 represent many distinct genomes sharing little or no DNA sequence similarity. The genomes are architecturally mosaic and are replete with genes of unknown function. A new group of genomes sharing substantial nucleotide sequences constitute Cluster J. The six mycobacteriophages forming Cluster J are morphologically members of the Siphoviridae, but have unusually long genomes ranging from 106.3 to 117 kbp. Reconstruction of the capsid by cryo-electron microscopy of mycobacteriophage BAKA reveals an icosahedral structure with a triangulation number of 13. All six phages are temperate and homoimmune, and prophage establishment involves integration into a tRNA-Leu gene not previously identified as a mycobacterial attB site for phage integration. The Cluster J genomes provide two examples of intron splicing within the virion structural genes, one in a major capsid subunit gene, and one in a tail gene. These genomes also contain numerous free-standing HNH homing endonuclease, and comparative analysis reveals how these could contribute to genome mosaicism. The unusual Cluster J genomes provide new insights into phage genome architecture, gene function, capsid structure, gene mobility, intron splicing, and evolution. PMID:23874930

  11. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    PubMed

    Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  12. Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

    PubMed Central

    Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417

  13. Androgen responsiveness of the murine beta-glucuronidase gene is associated with nuclease hypersensitivity, protein binding, and haplotype-specific sequence diversity within intron 9.

    PubMed Central

    Lund, S D; Gallagher, P M; Wang, B; Porter, S C; Ganschow, R E

    1991-01-01

    The tissue specificity and genetic variability of the murine beta-glucuronidase (GUS) response to androgen provide useful markers for identifying elements which underlie this responsiveness. While GUS is expressed constitutively in all examined cell types, kidney epithelial cells uniquely exhibit a manyfold yet slow rise in GUS mRNA and enzyme levels when stimulated by androgens. Three major phenotypes of this androgen response have been described among inbred strains of mice: (i) a strong response in strains of the Gusa haplotype, (ii) a reduced response in strains of the Gusb and Gush haplotypes, and (iii) no response, as observed in Gusor mice. These response variants define a cis-active element(s) which is tightly linked to the GUS structural gene. Nuclease hypersensitivity scans of kidney chromatin within and surrounding the structural gene revealed an androgen-inducible hypersensitive site in intron 9 of the gene in Gusa but not in Gusor mice. When a radiolabeled fragment of Gusa DNA containing this hypersensitive site was incubated with kidney nuclear extracts and then subjected to gel electrophoresis, two shifted bands were observed whose levels were dramatically higher in extracts of androgen-treated than in those of untreated Gusa mice. The shifted bands reflect binding of a kidney-specific factor(s) to a 57-bp region of complex dyad symmetry in Gusa and Gusor mice which is partially deleted in Gusb and Gush mice. This binding site is located approximately 130 bp downstream of a glucocorticoid response element sequence motif which is totally deleted in [Gus]or mice. Taken together, our results suggest that the androgen responsiveness of GUS in murine kidney epithelial cells is controlled by elements within the proximal end of intron 9 of the GUS structural gene. Images PMID:1922055

  14. Genetic variability of the gene cluster CALHM1–3 in sporadic Creutzfeldt-Jakob disease

    PubMed Central

    Calero, Olga; Bullido, María J.; Clarimón, Jordi; Hortigüela, Rafael; Frank-García, Ana; Martínez-Martín, Pablo; Lleó, Alberto; Rey, María Jesús; Sastre, Isabel; Rábano, Alberto; de Pedro-Cuesta, Jesús; Ferrer, Isidro; Calero, Miguel

    2012-01-01

    Perturbations of calcium homeostasis have been associated with several neurodegenerative disorders. A common polymorphism (rs2986017) in the CALHM1 gene, coding for a regulator of calcium homeostasis, is a genetic risk factor for the development of Alzheimer disease (AD). Although some authors failed to confirm these results, a meta-analysis has shown that this polymorphism modulates the age at disease onset. Furthermore, a recent association study has explored the genetic variability of CALHM1 gene and two adjacent paralog genes (CALHM3 and CALHM2) in an Asian population. Since several lines of evidence suggest that AD and prion diseases share pathophysiologic mechanisms, we investigated for the first time the genetic variability of the gene cluster formed by CALHM1 and its paralogs in a series of 235 sporadic Creutzfeldt-Jakob disease (sCJD) patients, and compared the genotypic and allelic frequencies with those presented in 329 controls from the same ancestry. As such, this work also represents the first association analysis of CALHM genes in sCJD. Sequencing analysis of the complete coding regions of the genes demonstrated the presence of 10 single nucleotide polymorphisms (SNP) within the CALHM genes. We observed that rs4918016-rs2986017-rs2986018 and rs41287502-rs41287500 polymorphic sites at CALHM1 were in linkage disequilibrium. We found marginal associations for sCJD risk at CALHM1 polymorphic sites rs41287502 and rs41287500 [coding for two linked missense mutations (p.(Met323Ile); (Gly282Cys)], and rs2986017 [p.(Leu86Pro)]. Interestingly, a TGG haplotype defined by the rs4918016-rs2986017-rs2986018 block was associated with sCJD. These findings underscore the need of future multinational collaborative initiatives in order to corroborate these seminal data. PMID:22874670

  15. Clustered Genes Involved in Cyclopiazonic Acid Production are Next to the Aflatoxin Biosynthesis Gene Cluster in Aspergillus flavus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cyclopiazonic acid (CPA), an indole-tetramic acid toxin, is produced by many species of Aspergillus and Penicillium. In addition to CPA Aspergillus flavus produces polyketide-derived carcinogenic aflatoxins (AFs). AF biosynthesis genes form a gene cluster in a subtelomeric region. Isolates of A. fla...

  16. Genomic analyses of bacterial porin-cytochrome gene clusters

    SciTech Connect

    Shi, Liang; Fredrickson, James K.; Zachara, John M.

    2014-11-26

    In this study, the porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additionally microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteria from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in the microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular

  17. Transcriptional regulation of the novobiocin biosynthetic gene cluster.

    PubMed

    Dangel, Volker; Härle, Johannes; Goerke, Christiane; Wolz, Christiane; Gust, Bertolt; Pernodet, Jean-Luc; Heide, Lutz

    2009-12-01

    The aminocoumarin antibiotic novobiocin is a gyrase inhibitor formed by a Streptomyces strain. The biosynthetic gene cluster of novobiocin spans 23.4 kb and contains 20 coding sequences, among them the two regulatory genes novE and novG. We investigated the location of transcriptional promoters within this cluster by insertion of transcriptional terminator cassettes and RT-PCR analysis of the resulting mutants. The cluster was found to contain eight DNA regions with promoter activity. The regulatory protein NovG binds to a previously identified binding site within the promoter region located upstream of novH, but apparently not to any of the other seven promoters. Quantitative real-time PCR was used to compare the number of transcripts in a strain carrying an intact novobiocin cluster with strains carrying mutated clusters. Both in-frame deletion of the regulatory gene novG and insertion of a terminator cassette into the biosynthetic gene novH led to a strong reduction of the number of transcripts of the genes located between novH and novW. This suggested that these 16 biosynthetic genes form a single operon. Three internal promoters are located within this operon but appear to be of minor importance, if any, under our experimental conditions. Transcription of novG was found to depend on the presence of NovE, suggesting that the two regulatory genes, novE and novG, act in a cascade-like mechanism. The resistance gene gyrB(R), encoding an aminocoumarin-resistant gyrase B subunit, may initially be co-transcribed with the genes from novH to novW. However, when the gyrase inhibitor novobiocin accumulates in the cultures, gyrB(R) is transcribed from its own promoter. Previous work has suggested that this promoter is controlled by the superhelical density of chromosomal DNA.

  18. An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray.

    PubMed

    Ramos, Juan; Castellanos-Garzón, José A; González-Briones, Alfonso; de Paz, Juan F; Corchado, Juan M

    2017-03-09

    Gene selection is a major research area in microarray analysis, which seeks to discover differentially expressed genes for a particular target annotation. Such genes also often called informative genes are able to differentiate tissue samples belonging to different classes of the studied disease. Despite the fact that there is a wide number of proposals, the complexity imposed by this problem remains a challenge today. This research proposes a gene selection approach by means of a clustering-based multi-agent system. This proposal manages different filter methods and gene clustering through coordinated agents to discover informative gene subsets. To assess the reliability of our approach, we have used four important and public gene expression datasets, two Lung cancer datasets, Colon and Leukemia cancer dataset. The achieved results have been validated through cluster validity measures, visual analytics, a classifier and compared with other gene selection methods, proving the reliability of our proposal.

  19. Homozygosity mapping on homozygosity haplotype analysis to detect recessive disease-causing genes from a small number of unrelated, outbred patients.

    PubMed

    Hagiwara, Koichi; Morino, Hiroyuki; Shiihara, Jun; Tanaka, Tomoaki; Miyazawa, Hitoshi; Suzuki, Tomoko; Kohda, Masakazu; Okazaki, Yasushi; Seyama, Kuniaki; Kawakami, Hideshi

    2011-01-01

    Genes involved in disease that are not common are often difficult to identify; a method that pinpoints them from a small number of unrelated patients will be of great help. In order to establish such a method that detects recessive genes identical-by-descent, we modified homozygosity mapping (HM) so that it is constructed on the basis of homozygosity haplotype (HM on HH) analysis. An analysis using 6 unrelated patients with Siiyama-type α1-antitrypsin deficiency, a disease caused by a founder gene, the correct gene locus was pinpointed from data of any 2 patients (length: 1.2-21.8 centimorgans, median: 1.6 centimorgans). For a test population in which these 6 patients and 54 healthy subjects were scrambled, the approach accurately identified these 6 patients and pinpointed the locus to a 1.4-centimorgan fragment. Analyses using synthetic data revealed that the analysis works well for IBD fragment derived from a most recent common ancestor (MRCA) who existed less than 60 generations ago. The analysis is unsuitable for the genes with a frequency in general population more than 0.1. Thus, HM on HH analysis is a powerful technique, applicable to a small number of patients not known to be related, and will accelerate the identification of disease-causing genes for recessive conditions.

  20. Association of rs7719175, located in the IL13 gene promoter, with Schistosoma haematobium infection levels and identification of a susceptibility haplotype.

    PubMed

    Isnard, A; Kouriba, B; Doumbo, O; Chevillard, C

    2011-01-01

    Urinary schistosomiasis is a parasitic disease caused by Schistosoma haematobium helminths. S. haematobium eggs may remain trapped within the bladder or the ureter walls, causing major pathological disorders in the urogenital system. The polymorphism rs1800925(C/T) of the IL13 gene promoter, which is functional, has previously been associated with susceptibility to S. haematobium infection. The aim of this study was to further our understanding and to determine whether, in the 5q31-q33 region, rs1800925 affects infection levels alone or in synergy with other polymorphisms. After sequencing the IL13 promoter and increasing the single-nucleotide polymorphism density, we performed a linkage disequilibrium analysis between rs1800925 and the other markers in a Malian population. Multivariate linear regression analysis and electrophoretic mobility shift assay (EMSA) were performed to characterized markers in linkage disequilibrium with rs1800925. An additional polymorphism, rs7719175, in the IL13 promoter was associated with controlling infection levels in multivariate analysis. The haplotype rs7719175T-rs1800925C was associated with high infection levels. EMSA indicated that rs7719175 affects the binding of transcriptional factors to the promoter region. Polymorphisms rs7719175 and rs1800925 have a synergistic role in the control of infection levels caused by S. haematobium and using them as a haplotype allows a better discrimination between infected subjects.

  1. Evolutionary conservation of regulatory elements in vertebrate Hox gene clusters.

    PubMed

    Santini, Simona; Boore, Jeffrey L; Meyer, Axel

    2003-06-01

    Comparisons of DNA sequences among evolutionarily distantly related genomes permit identification of conserved functional regions in noncoding DNA. Hox genes are highly conserved in vertebrates, occur in clusters, and are uninterrupted by other genes. We aligned (PipMaker) the nucleotide sequences of the HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human, and mouse, which are separated by approximately 500 million years of evolution. In support of our approach, several identified putative regulatory elements known to regulate the expression of Hox genes were recovered. The majority of the newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac database). The regulatory intergenic regions located between the genes that are expressed most anteriorly in the embryo are longer and apparently more evolutionarily conserved than those at the other end of Hox clusters. Different presumed regulatory sequences are retained in either the Aalpha or Abeta duplicated Hox clusters in the fish lineages. This suggests that the conserved elements are involved in different gene regulatory networks and supports the duplication-deletion-complementation model of functional divergence of duplicated genes.

  2. The hematologic characteristics of sickle cell anemia bearing the Bantu haplotype: the relationship between G gamma and HbF level.

    PubMed

    Nagel, R L; Rao, S K; Dunda-Belkhodja, O; Connolly, M M; Fabry, M E; Georges, A; Krishnamoorthy, R; Labie, D

    1987-04-01

    Previous work has demonstrated that the HbS gene has appeared and expanded three times in Africa in three separate geographic locations and that these three distinct mutational events can be identified by linked DNA polymorphic sites (haplotypes) surrounding the abnormal gene. We have reported that the Senegalese and Beninian haplotypes differ in G gamma expression, mean percentage of HbF, and percentage of dense cells. We now report on the third haplotype, the Bantu, and find that it has intermediate features, namely, the high mean percentage of HbF and low percentage of dense cells associated with the Senegalese haplotype, but with a low percentage of G gamma expression similar to the Beninian haplotype. The distribution of percent HbF is quite different from Senegal haplotype-bearing sickle cell anemia patients since it covers a much wider range. The low G gamma expression is also different from the Beninians since it contains a significant and unique cluster of individuals with lower than 38% G gamma. Interestingly, among the Bantu there is a strong correlation between HbF levels and G gamma expression, which is not seen with the other haplotypes. These findings open the possibility that among the Bantu haplotype-bearing individuals two chromosomal types exist that define different levels of G gamma and HbF expression. Further structural exploration of these two potential subhaplotypes is needed.

  3. Loss of pollen-S function in two self-compatible selections of Prunus avium is associated with deletion/mutation of an S haplotype-specific F-box gene.

    PubMed

    Sonneveld, Tineke; Tobutt, Kenneth R; Vaughan, Simon P; Robbins, Timothy P

    2005-01-01

    Recently, an S haplotype-specific F-box (SFB) gene has been proposed as a candidate for the pollen-S specificity gene of RNase-mediated gametophytic self-incompatibility in Prunus (Rosaceae). We have examined two pollen-part mutant haplotypes of sweet cherry (Prunus avium). Both were found to retain the S-RNase, which determines stylar specificity, but one (S3' in JI 2434) has a deletion including the haplotype-specific SFB gene, and the other (S4' in JI 2420) has a frame-shift mutation of the haplotype-specific SFB gene, causing amino acid substitutions and premature termination of the protein. The loss or significant alteration of this highly polymorphic gene and the concomitant loss of pollen self-incompatibility function provides compelling evidence that the SFB gene encodes the pollen specificity component of self-incompatibility in Prunus. These loss-of-function mutations are inconsistent with SFB being the inactivator of non-self S-RNases and indicate the presence of a general inactivation mechanism, with SFB conferring specificity by protecting self S-RNases from inactivation.

  4. DNA methylation profiling identifies CG methylation clusters in Arabidopsis genes.

    PubMed

    Tran, Robert K; Henikoff, Jorja G; Zilberman, Daniel; Ditt, Renata F; Jacobsen, Steven E; Henikoff, Steven

    2005-01-26

    Cytosine DNA methylation in vertebrates is widespread, but methylation in plants is found almost exclusively at transposable elements and repetitive DNA. Within regions of methylation, methylcytosines are typically found in CG, CNG, and asymmetric contexts. CG sites are maintained by a plant homolog of mammalian Dnmt1 acting on hemi-methylated DNA after replication. Methylation of CNG and asymmetric sites appears to be maintained at each cell cycle by other mechanisms. We report a new type of DNA methylation in Arabidopsis, dense CG methylation clusters found at scattered sites throughout the genome. These clusters lack non-CG methylation and are preferentially found in genes, although they are relatively deficient toward the 5' end. CG methylation clusters are present in lines derived from different accessions and in mutants that eliminate de novo methylation, indicating that CG methylation clusters are stably maintained at specific sites. Because 5-methylcytosine is mutagenic, the appearance of CG methylation clusters over evolutionary time predicts a genome-wide deficiency of CG dinucleotides and an excess of C(A/T)G trinucleotides within transcribed regions. This is exactly what we find, implying that CG methylation clusters have contributed profoundly to plant gene evolution. We suggest that CG methylation clusters silence cryptic promoters that arise sporadically within transcription units.

  5. Phage cluster relationships identified through single gene analysis

    PubMed Central

    2013-01-01

    Background Phylogenetic comparison of bacteriophages requires whole genome approaches such as dotplot analysis, genome pairwise maps, and gene content analysis. Currently mycobacteriophages, a highly studied phage group, are categorized into related clusters based on the comparative analysis of whole genome sequences. With the recent explosion of phage isolation, a simple method for phage cluster prediction would facilitate analysis of crude or complex samples without whole genome isolation and sequencing. The hypothesis of this study was that mycobacteriophage-cluster prediction is possible using comparison of a single, ubiquitous, semi-conserved gene. Tape Measure Protein (TMP) was selected to test the hypothesis because it is typically the longest gene in mycobacteriophage genomes and because regions within the TMP gene are conserved. Results A single gene, TMP, identified the known Mycobacteriophage clusters and subclusters using a Gepard dotplot comparison or a phylogenetic tree constructed from global alignment and maximum likelihood comparisons. Gepard analysis of 247 mycobacteriophage TMP sequences appropriately recovered 98.8% of the subcluster assignments that were made by whole-genome comparison. Subcluster-specific primers within TMP allow for PCR determination of the mycobacteriophage subcluster from DNA samples. Using the single-gene comparison approach for siphovirus coliphages, phage groupings by TMP comparison reflected relationships observed in a whole genome dotplot comparison and confirm the potential utility of this approach to another widely studied group of phages. Conclusions TMP sequence comparison and PCR results support the hypothesis that a single gene can be used for distinguishing phage cluster and subcluster assignments. TMP single-gene analysis can quickly and accurately aid in mycobacteriophage classification. PMID:23777341

  6. Functional evaluation of novel single nucleotide polymorphisms and haplotypes in the promoter regions of CYP1B1 and CYP1A1 genes.

    PubMed

    Han, Weiguo; Pentecost, Brian T; Spivack, Simon D

    2003-07-01

    Interindividual variation in the expression of the carcinogen- and estrogen-metabolizing enzymes cytochrome P4501B1 and 1A1 (CYP1B1 and CYP1A1) has been detected in human lung. To search for polymorphisms with functional consequences for CYP1B1 and CYP1A1 gene expression, we examined 1.5 kb of the promoter region of each gene. Genomic DNA from 21 Caucasian individuals was amplified by polymerase chain reaction (PCR) for direct cycle sequencing. Eight single nucleotide polymorphisms (SNPs) for CYP1B1 and 13 SNPs for CYP1A1 were found. The majority of polymorphisms occurred as multiSNP combinations for individual subjects. The wild-type sequences were cloned into a luciferase reporter construct. The most frequent polymorphisms were then recreated by iterative site-directed mutagenesis, replicating single polymorphisms and multiSNP combinations. These wild-type and variant constructs were functionally evaluated in transient transfection experiments employing exposures to either the index polycyclic aromatic hydrocarbon (PAH) inducer benzo[a]pyrene (B[a]P), a composite mixture of cigarette smoke extract (CSE), or the repressor chemopreventive agent trans-3,4,5-trihydroxystilbene (reseveratrol). Results indicated that all wild-type and variant constructs responded in qualitatively concordant fashion to the inducers and to the repressor. The CYP1B1 haplotypes and the majority of CYP1A1 haplotypes were shown to have no functional consequence, as compared to those of the wild-type promoter sequences. Two constructs of composite polymorphisms of CYP1A1 appeared to result in a statistically significant increase in basal promoter activity (1.38- and 1.50-fold, respectively), but the degree of functional impact was judged unlikely to be biologically important in vivo. We conclude that the observed promoter region polymorphisms in these genes are common, but are of unclear functional consequence.

  7. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering

    PubMed Central

    Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of genes expressions in samples which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severities of disease and provide expression profiles of gene sets for individuals. We developed an application software called IGSA that leverages a powerful analytical capacity in gene sets enrichment and samples clustering. IGSA calculates gene sets expression scores for each sample and takes an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets for the performance of IGSA. We also compared the results of IGSA in KEGG pathways enrichment with David, GSEA, SPIA, ssGSEA and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA is proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides with significant gene sets profile for each sample. PMID:27764138

  8. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    PubMed

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of genes expressions in samples which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severities of disease and provide expression profiles of gene sets for individuals. We developed an application software called IGSA that leverages a powerful analytical capacity in gene sets enrichment and samples clustering. IGSA calculates gene sets expression scores for each sample and takes an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets for the performance of IGSA. We also compared the results of IGSA in KEGG pathways enrichment with David, GSEA, SPIA, ssGSEA and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA is proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides with significant gene sets profile for each sample.

  9. Evolutionary ecology of beta-lactam gene clusters in animals.

    PubMed

    Suring, Wouter; Meusemann, Karen; Blanke, Alexander; Mariën, Janine; Schol, Tim; Agamennone, Valeria; Faddeeva-Vakhrusheva, Anna; Berg, Matty P; Brouwer, Bram; van Straalen, Nico M; Roelofs, Dick

    2017-03-18

    Beta-lactam biosynthesis was thought to occur only in fungi and bacteria, but we recently reported the presence of isopenicillin N synthase in a soil-dwelling animal, Folsomia candida. However, it has remained unclear whether this gene is part of a larger beta-lactam biosynthesis pathway and how widespread the occurrence of penicillin biosynthesis is among animals. Here, we analyzed the distribution of beta-lactam biosynthesis genes throughout the animal kingdom and identified a beta-lactam gene cluster in the genome of F. candida (Collembola), consisting of isopenicillin N synthase (IPNS), δ-(L-α-aminoadipoyl)-L-cysteinyl-D-valine synthetase (ACVS), and two cephamycin C genes (cmcI and cmcJ) on a genomic scaffold of 0.76 Mb. All genes are transcriptionally active and are inducible by stress (heat shock). A beta-lactam compound was detected in vivo using an ELISA beta-lactam assay. The gene cluster also contains an ABC transporter which is co-regulated with IPNS and ACVS after heat shock. Furthermore, we show that different combinations of beta-lactam biosynthesis genes are present in over 60% of springtail families but they are absent from genome- and transcript libraries of other animals including close relatives of springtails (Protura, Diplura, and insects). The presence of beta-lactam genes is strongly correlated with an eudaphic (soil-living) lifestyle. Beta-lactam genes IPNS and ACVS each form a phylogenetic clade in between bacteria and fungi, while cmcI and cmcJ genes cluster within bacteria. This suggests a single horizontal gene transfer event most probably from a bacterial host, followed by differential loss in more recently evolving species. This article is protected by copyright. All rights reserved.

  10. Evolution and dispersal of emmer wheat (Triticum sp.) from novel haplotypes of Ppd-1 (photoperiod response) genes and their surrounding DNA sequences.

    PubMed

    Takenaka, Shotaro; Kawahara, Taihachi

    2012-09-01

    The sequence data from 5' UTR, intronic, coding and 3' UTR regions of Ppd-A1 and Ppd-B1 were investigated for a total of 158 accessions of emmer wheat landraces comprising 19 of wild emmer wheat (Triticum dicoccoides), 45 of hulled emmer wheat (T. dicoccum) and 94 of free-threshing (FT) emmer wheat (T. durum etc.). We detected some novel types of deletions in the coding regions from 22 hulled emmer accessions and 20 FT emmer accessions. Emmer wheat accessions with these deletions could produce predicted proteins likely to lack function. We also observed some novel mutations in Ppd-B1. Sixty-seven and forty-one haplotypes were found in Ppd-A1 and Ppd-B1, respectively. Some mutations found in this study have not been known, so they have potential for useful genetic resources for wheat breeding. On the basis of sequence data from the 5' UTR region, both Ppd-A1 and Ppd-B1 haplotypes were divided into two groups (Type AI/AII and Type BI/BII). Types AI and AII of Ppd-A1 suggested gene flow between wild and hulled emmer. On the other hand, Types BI and BII of Ppd-B1 suggested gene flow between wild and FT emmer. More than half of hulled emmer accessions were Type AII/BI but few FT emmer accessions were of this type. Therefore, over half of the hulled emmer did not contribute to evolution of FT emmer.

  11. Interleukin-17A Gene Haplotypes Are Associated with Risk of Premature Coronary Artery Disease in Mexican Patients from the Genetics of Atherosclerotic Disease (GEA) Study

    PubMed Central

    Vargas-Alarcón, Gilberto; Angeles-Martínez, Javier; Villarreal-Molina, Teresa; Alvarez-León, Edith; Posadas-Sánchez, Rosalinda; Cardoso-Saldaña, Guillermo; Ramírez-Bello, Julian; Pérez-Hernández, Nonanzit; Juárez-Rojas, Juan Gabriel; Rodríguez-Pérez, José Manuel; Fragoso, José Manuel; Posadas-Romero, Carlos

    2015-01-01

    Aim The role of interleukin 17A (IL-17A) in the inflammatory process has caused interest in the potential significance of IL-17A as a biomarker for coronary artery disease (CAD). The aim of the present study was to evaluate the role of IL-17A gene polymorphisms as susceptibility markers for CAD in the Mexican population. Methods Four IL-17A gene polymorphisms (rs8193036, rs3819024, rs2275913 and rs8193037) were genotyped by 5’ exonuclease TaqMan assays in a group of 900 patients with premature CAD and 667 healthy controls (with negative calcium score by computed tomography), seeking associations with CAD and other metabolic and cardiovascular risk factors using logistic regression analyses. Results No single IL-17A polymorphism was associated with premature CAD, however two haplotypes (CAGG and TAGA) were significantly associated with increased risk of premature CAD (OR = 1.35, 95% CI: 1.00–1.84, P = 0.018 and OR = 2.09, 95% CI: 1.16–3.76, P = 0.003, respectively). Moreover, rs3819024 was associated with increased levels of visceral abdominal fat (P = 0.002) and rs8193036 was significantly associated with risk of central obesity (P = 0.020), hypertriglyceridemia (P = 0.027), and metabolic syndrome (P = 0.027) in the premature CAD group, under dominant models adjusted by age, gender, BMI, smoking history, alcohol consumption, and treatment. Conclusion The results suggest that IL-17A haplotypes are involved in the risk of developing premature CAD and some IL-17A polymorphisms are associated with cardiovascular risk factors in Mexican individuals with premature CAD. PMID:25615631

  12. Cloning and Heterologous Expression of the Grecocycline Biosynthetic Gene Cluster

    PubMed Central

    Bilyk, Oksana; Sekurova, Olga N.; Zotchev, Sergey B.; Luzhetskyy, Andriy

    2016-01-01

    Transformation-associated recombination (TAR) in yeast is a rapid and inexpensive method for cloning and assembly of large DNA fragments, which relies on natural homologous recombination. Two vectors, based on p15a and F-factor replicons that can be maintained in yeast, E. coli and streptomycetes have been constructed. These vectors have been successfully employed for assembly of the grecocycline biosynthetic gene cluster from Streptomyces sp. Acta 1362. Fragments of the cluster were obtained by PCR and transformed together with the “capture” vector into the yeast cells, yielding a construct carrying the entire gene cluster. The obtained construct was heterologously expressed in S. albus J1074, yielding several grecocycline congeners. Grecocyclines have unique structural moieties such as a dissacharide side chain, an additional amino sugar at the C-5 position and a thiol group. Enzymes from this pathway may be used for the derivatization of known active angucyclines in order to improve their desired biological properties. PMID:27410036

  13. Generalized gene adjacencies, graph bandwidth, and clusters in yeast evolution.

    PubMed

    Zhu, Qian; Adam, Zaky; Choi, Vicky; Sankoff, David

    2009-01-01

    We present a parameterized definition of gene clusters that allows us to control the emphasis placed on conserved order within a cluster. Though motivated by biological rather than mathematical considerations, this parameter turns out to be closely related to the bandwidth parameter of a graph. Our focus will be on how this parameter affects the characteristics of clusters: how numerous they are, how large they are, how rearranged they are, and to what extent they are preserved from ancestor to descendant in a phylogenetic tree. We infer the latter property by dynamic programming optimization of the presence of individual edges at the ancestral nodes of the phylogeny. We apply our analysis to a set of genomes drawn from the Yeast Gene Order Browser.

  14. Increased risks between Interleukin-10 gene polymorphisms and haplotype and head and neck cancer: a meta-analysis.

    PubMed

    Niu, Yu-Ming; Du, Xin-Ya; Cai, Heng-Xing; Zhang, Chao; Yuan, Rui-Xia; Zeng, Xian-Tao; Luo, Jie

    2015-11-27

    Molecular epidemiological research suggests that interleukin-10 (IL-10) polymorphisms may be associated with an increased risk of head and neck cancer (HNC), but results remain controversial. To derive a more precise evaluation, we performed a meta-analysis focused on genetic polymorphisms of IL-10. PubMed, Embase, CNKI and Wanfang databases were searched for studies that examined the relationship between IL-10 polymorphisms or haplotypes and HNC risk. The odds ratio (OR) and 95% confidence interval (CI) were applied to assess the relationship strength. Publication bias, sensitivity and cumulative analyses were conducted to measure the robustness of our findings. Overall, nine related studies involving 2,258 patients and 2,887 control samples were analyzed. Significant associations between the IL-10-1082A > G polymorphism and HNC risk were observed (G vs. A: OR = 1.56, 95% CI = 1.27-1.92, P < 0.01, I(2) = 69.4%; AG vs. AA: OR = 1.64, 95% CI = 1.32-2.05, P < 0.01, I(2) = 55.6%; GG vs. AA: OR = 2.24, 95% CI = 1.69-2.97, P < 0.01, I(2) = 38.5%; AG + GG vs. AA: OR = 1.70, 95% CI = 1.36-2.14, P = 0.02, I(2) = 61.8%; GG vs. AA + AG: OR = 1.89, 95% CI = 1.23-2.90, P = 0.01, I(2) = 46.3%) in the total population, as well as in subgroup analysis. Moreover, increased HNC risks were also associated with the IL-10 -819T > C polymorphism and the GCC haplotype. In conclusion, our meta-analyses suggest that IL-10 polymorphisms, specifically the -1082A > G polymorphism, may be associated with increased risk of HNC development.

  15. PEACE: Parallel Environment for Assembly and Clustering of Gene Expression.

    PubMed

    Rao, D M; Moler, J C; Ozden, M; Zhang, Y; Liang, C; Karro, J E

    2010-07-01

    We present PEACE, a stand-alone tool for high-throughput ab initio clustering of transcript fragment sequences produced by Next Generation or Sanger Sequencing technologies. It is freely available from www.peace-tools.org. Installed and managed through a downloadable user-friendly graphical user interface (GUI), PEACE can process large data sets of transcript fragments of length 50 bases or greater, grouping the fragments by gene associations with a sensitivity comparable to leading clustering tools. Once clustered, the user can employ the GUI's analysis functions, facilitating the easy collection of statistics and allowing them to single out specific clusters for more comprehensive study or assembly. Using a novel minimum spanning tree-based clustering method, PEACE is the equal of leading tools in the literature, with an interface making it accessible to any user. It produces results of quality virtually identical to those of the WCD tool when applied to Sanger sequences, significantly improved results over WCD and TGICL when applied to the products of Next Generation Sequencing Technology and significantly improved results over Cap3 in both cases. In short, PEACE provides an intuitive GUI and a feature-rich, parallel clustering engine that proves to be a valuable addition to the leading cDNA clustering tools.

  16. Haplotype-based gene-gene interaction of bone morphogenetic protein 4 and interferon regulatory factor 6 in the etiology of non-syndromic cleft lip with or without cleft palate in a Chilean population.

    PubMed

    Blanco, Rafael; Colombo, Alicia; Pardo, Rosa; Suazo, José

    2017-04-01

    Non-syndromic cleft lip with or without cleft palate (NSCL/P) is the most common craniofacial birth defect in humans, the etiology of which can be dependent on the interactions of multiple genes. We previously reported haplotype associations for polymorphic variants of interferon regulatory factor 6 (IRF6), msh homeobox 1 (MSX1), bone morphogenetic protein 4 (BMP4), and transforming growth factor beta 3 (TGFB3) in Chile. Here, we analyzed the haplotype-based gene-gene interaction for markers of these genes and NSCL/P risk in the Chilean population. We genotyped 15 single nucleoptide polymorphisms (SNPs) in 152 Chilean patients and 164 controls. Linkage disequilibrium (LD) blocks were determined using the Haploview software, and phase reconstruction was performed by the Phase program. Haplotype-based interactions were evaluated using the multifactor dimensionality reduction (MDR) method. We detected two LD blocks composed of two SNPs from BMP4 (Block 1) and three SNPs from IRF6 (Block 2). Although MDR showed no statistical significance for the global interaction model involving these blocks, we found four combinations conferring a statistically significantly increased NSCL/P risk (Block 1-Block 2): T-T/T-G C-G-T/G-A-T; T-T/T-G C-G-C/C-G-C; T-T/T-G G-A-T/G-A-T; and T-T/C-G G-A-T/G-A-T. These findings may reflect the presence of a genomic region containing potential causal variants interacting in the etiology of NSCL/P and may contribute to disentangling the complex etiology of this birth defect.

  17. An alanine tRNA gene cluster from Nephila clavipes.

    PubMed

    Luciano, E; Candelas, G C

    1996-06-01

    We report the sequence of a 2.3-kb genomic DNA fragment from the orb-web spider, Nephila clavipes (Nc). The fragment contains four regions of high homology to tRNA(Ala). The members of this irregularly spaced cluster of genes are oriented in the same direction and have the same anticodon (GCA), but their sequence differs at several positions. Initiation and termination signals, as well as consensus intragenic promoter sequences characteristic of tRNA genes, have been identified in all genes. tRNA(Ala) are involved in the regulation of the fibroin synthesis in the large ampullate Nc glands.

  18. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    SciTech Connect

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  19. Detection of a novel mutation in the ryanodine receptor gene in an Irish malignant hyperthermia pedigree: correlation of the IVCT response with the affected and unaffected haplotypes.

    PubMed Central

    Keating, K E; Giblin, L; Lynch, P J; Quane, K A; Lehane, M; Heffron, J J; McCarthy, T V

    1997-01-01

    Defects in the ryanodine receptor (RYR1) gene are associated with malignant hyperthermia (MH), an autosomal dominant disorder of skeletal muscle and one of the main causes of death resulting from anaesthesia. Susceptibility to MH (MHS) is determined by the level of tension generated in an in vitro muscle contracture test (IVCT) in response to caffeine and halothane. To date, mutation screening of the RYR1 gene in MH families has led to the identification of eight mutations. We describe here the identification of a novel mutation, Arg552Trp, in the RYR1 gene, which is clearly linked to the MHS phenotype in a large, well characterised Irish pedigree. Considering that the RYR1 protein functions as a tetramer, correlation of the IVCT with the affected and unaffected haplotypes was performed on the pedigree to investigate if the normal RYR1 allele in affected subjects contributes to the variation in the IVCT. The results show that the normal RYR1 allele is unlikely to play a role in IVCT variation. Images PMID:9138151

  20. Inheritance of the Bantu/Benin haplotype causes less severe hemolytic and oxidative stress in sickle cell anemia patients treated with hydroxycarbamide.

    PubMed

    Okumura, Jéssika V; Silva, Danilo G H; Torres, Lidiane S; Belini-Junior, Edis; Barberino, Willian M; Oliveira, Renan G; Carrocini, Gisele C S; Gelaleti, Gabriela B; Lobo, Clarisse L C; Bonini-Domingos, Claudia R

    2016-07-01

    Beta S-globin gene cluster haplotypes (β(S)-haplotypes) can modulate the response to hydroxycarbamide (HC) treatment in sickle cell anemia (SCA) patients. In Brazil, the most common haplotypes are Bantu and Benin, and both confer a poor prognosis for patients when untreated with HC. We evaluated oxidative and hemolytic biomarkers in 48 SCA patients undergoing HC treatment separated in three subgroups: Bantu/Bantu, Bantu/Benin and Benin/Benin haplotype. On the basis of reduced haptoglobin (HP) levels, patients with Bantu/Bantu haplotypes had 3.0% higher hemolysis degree when compared with those with Bantu/Benin haplotypes (P=0.01). The Benin/Benin patients had 53.6% greater lipid peroxidation index than the Bantu/Bantu patients (P=0.01) because of evaluated thiobarbituric acid reactive species levels. The Bantu/Benin subgroup had intermediate levels of hemolytic and oxidative stress markers compared with the homozygous subgroups. Through strict inclusion criteria adopted, as well as consolidated and well-described hemolytic and the oxidative parameters evaluated, we suggest a haplotype-interaction response to HC treatment mediated by a 'balance' between the genetic factors of each haplotype studied.

  1. Association between higher expression of interleukin-8 (IL-8) and haplotype −353A/−251A/+678T of IL-8 gene with preeclampsia

    PubMed Central

    Sun, Lei; Mao, Dongwei; Cai, Yan; Tan, Wenhua; Hao, Yanlan; Li, Lin; Liu, Wei

    2016-01-01

    Abstract Preeclampsia (PE) is a common pregnancy-specific disorder associated with significant maternal and fetal morbidity and mortality worldwide. The present study was performed to investigate the role of a CXC chemokine interleukin-8 (IL-8), in the pathogenesis of PE. IL-8 expression levels were assessed in placental and serum samples from 160 pregnant women with PE (N = 68 severe, 92 mild) and 140 healthy donors. Results from enzyme-linked immunosorbent assay showed that the concentration of serum IL-8 in PE patients (180.27 ± 5.81 ng/L) was significantly higher than that in healthy controls (41.57 ± 5.67 ng/L). Patients with severe PE had even higher serum IL-8 levels. Similar messenger RNA and protein expression patterns of IL-8 in placental tissues were confirmed by quantitative real-time polymerase chain reaction and immunohistochemical assay (N = 30 each in the mild PE, severe PE, and control groups). In addition, single nucleotide polymorphisms of IL-8 gene were detected with polymerase chain reaction-restricted fragment length polymorphism/SSP. The frequency of IL-8-251A allele was significantly higher than that in controls (58.4% vs 48.9%, P < 0.05). The occurrence frequency of haplotype −353A/−251A/+678T (AAT) in PE subjects was 27.2% as compared to 21.9% in the control participants (P < 0.05). Our study reveals that IL-8 expression is positively associated with the severity of PE. Presence of haplotype AAT in pregnant women appears to be a risk factor for PE. PMID:28033248

  2. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression.

    PubMed

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady

    2017-02-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.

  3. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

    PubMed Central

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya

    2017-01-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390

  4. Expression profile based gene clusters for ischemic stroke detection Whole blood gene clusters for ischemic stroke detection

    PubMed Central

    Adamski, Mateusz G; Li, Yan; Wagner, Erin; Yu, Hua; Seales-Bailey, Chloe; Soper, Steven A; Murphy, Michael; Baird, Alison E

    2014-01-01

    In microarray studies alterations in gene expression in circulating leukocytes have shown utility for ischemic stroke diagnosis. We studied forty candidate markers identified in three gene expression profiles to (1) quantitate individual transcript expression, (2) identify transcript clusters and (3) assess the clinical diagnostic utility of the clusters identified for ischemic stroke detection. Using high throughput next generation qPCR 16 of the 40 transcripts were significantly up-regulated in stroke patients relative to control subjects (p<0.05). Six clusters of between 5 and 7 transcripts discriminated between stroke and control (p values between 1.01e-9 and 0.03). A 7 transcript cluster containing PLBD1, PYGL, BST1, DUSP1, FOS, VCAN and FCGR1A showed high accuracy for stroke classification (AUC=0.854). These results validate and improve upon the diagnostic value of transcripts identified in microarray studies for ischemic stroke. The clusters identified show promise for acute ischemic stroke detection. PMID:25135788

  5. Transcription mediated insulation and interference direct gene cluster expression switches

    PubMed Central

    Nguyen, Tania; Brown, David; Murray, Struan C; Haenni, Simon; Halstead, James M; O'Connor, Leigh; Shipkovenska, Gergana; Steinmetz, Lars M; Mellor, Jane

    2014-01-01

    In yeast, many tandemly arranged genes show peak expression in different phases of the metabolic cycle (YMC) or in different carbon sources, indicative of regulation by a bi-modal switch, but it is not clear how these switches are controlled. Using native elongating transcript analysis (NET-seq), we show that transcription itself is a component of bi-modal switches, facilitating reciprocal expression in gene clusters. HMS2, encoding a growth-regulated transcription factor, switches between sense- or antisense-dominant states that also coordinate up- and down-regulation of transcription at neighbouring genes. Engineering HMS2 reveals alternative mono-, di- or tri-cistronic and antisense transcription units (TUs), using different promoter and terminator combinations, that underlie state-switching. Promoters or terminators are excluded from functional TUs by read-through transcriptional interference, while antisense TUs insulate downstream genes from interference. We propose that the balance of transcriptional insulation and interference at gene clusters facilitates gene expression switches during intracellular and extracellular environmental change. DOI: http://dx.doi.org/10.7554/eLife.03635.001 PMID:25407679

  6. Identification of genes and gene clusters involved in mycotoxin synthesis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Research methods to identify and characterize genes involved in mycotoxin biosynthetic pathways have evolved considerably over the years. Before whole genome sequences were available (e.g. pre-genomics), work focused primarily on chemistry, biosynthetic mutant strains and molecular analysis of sing...

  7. Transcriptional analysis of exopolysaccharides biosynthesis gene clusters in Lactobacillus plantarum.

    PubMed

    Vastano, Valeria; Perrone, Filomena; Marasco, Rosangela; Sacco, Margherita; Muscariello, Lidia

    2016-04-01

    Exopolysaccharides (EPS) from lactic acid bacteria contribute to specific rheology and texture of fermented milk products and find applications also in non-dairy foods and in therapeutics. Recently, four clusters of genes (cps) associated with surface polysaccharide production have been identified in Lactobacillus plantarum WCFS1, a probiotic and food-associated lactobacillus. These clusters are involved in cell surface architecture and probably in release and/or exposure of immunomodulating bacterial molecules. Here we show a transcriptional analysis of these clusters. Indeed, RT-PCR experiments revealed that the cps loci are organized in five operons. Moreover, by reverse transcription-qPCR analysis performed on L. plantarum WCFS1 (wild type) and WCFS1-2 (ΔccpA), we demonstrated that expression of three cps clusters is under the control of the global regulator CcpA. These results, together with the identification of putative CcpA target sequences (catabolite responsive element CRE) in the regulatory region of four out of five transcriptional units, strongly suggest for the first time a role of the master regulator CcpA in EPS gene transcription among lactobacilli.

  8. Reconstructing Histories of Complex Gene Clusters on a Phylogeny

    NASA Astrophysics Data System (ADS)

    Vinař, Tomáš; Brejová, Broňa; Song, Giltae; Siepel, Adam

    Clusters of genes that have evolved by repeated segmental duplication present difficult challenges throughout genomic analysis, from sequence assembly to functional analysis. These clusters are one of the major sources of evolutionary innovation, and they are linked to multiple diseases, including HIV and a variety of cancers. Understanding their evolutionary histories is a key to the application of comparative genomics methods in these regions of the genome. We propose a probabilistic model of gene cluster evolution on a phylogeny, and an MCMC algorithm for reconstruction of duplication histories from genomic sequences in multiple species. Several projects are underway to obtain high quality BAC-based assemblies of duplicated clusters in multiple species, and we anticipate use of our methods in their analysis. Supplementary materials are located at http://compbio.fmph.uniba.sk/suppl/09recombcg/

  9. Conserved 33-kb haplotype in the MHC class III region regulates chronic arthritis

    PubMed Central

    Yau, Anthony C. Y.; Tuncel, Jonatan; Norin, Ulrika; Houtman, Miranda; Padyukov, Leonid; Holmdahl, Rikard

    2016-01-01

    Genome-wide association studies have revealed many genetic loci associated with complex autoimmune diseases. In rheumatoid arthritis (RA), the MHC gene HLA-DRB1 is the strongest candidate predicting disease development. It has been suggested that other immune-regulating genes in the MHC contribute to the disease risk, but this contribution has been difficult to show because of the strong linkage disequilibrium within the MHC. We isolated genomic regions in the form of congenic fragments in rats to test whether there are additional susceptibility loci in the MHC. By both congenic mapping in inbred strains and SNP typing in wild rats, we identified a conserved, 33-kb large haplotype Ltab-Ncr3 in the MHC-III region, which regulates the onset, severity, and chronicity of arthritis. The Ltab-Ncr3 haplotype consists of five polymorphic immunoregulatory genes: Lta (lymphotoxin-α), Tnf, Ltb (lymphotoxin-β), Lst1 (leukocyte-specific transcript 1), and Ncr3 (natural cytotoxicity-triggering receptor 3). Significant correlation in the expression of the Ltab-Ncr3 genes suggests that interaction of these genes may be important in keeping these genes clustered together as a conserved haplotype. We studied the arthritis association and the spliceo-transcriptome of four different Ltab-Ncr3 haplotypes and showed that higher Ltb and Ncr3 expression, lower Lst1 expression, and the expression of a shorter splice variant of Lst1 correlate with reduced arthritis severity in rats. Interestingly, patients with mild RA also showed higher NCR3 expression and lower LST1 expression than patients with severe RA. These data demonstrate the importance of a conserved haplotype in the regulation of complex diseases such as arthritis. PMID:27303036

  10. An analysis of fetal hemoglobin variation in sickle cell disease: the relative contributions of the X-linked factor, beta-globin haplotypes, alpha-globin gene number, gender, and age.

    PubMed

    Chang, Y C; Smith, K D; Moore, R D; Serjeant, G R; Dover, G J

    1995-02-15

    Five factors have been shown to influence the 20-fold variation of fetal hemoglobin (Hb F) levels in sickle cell anemia (SS): age, sex, the alpha-globin gene number, beta-globin haplotypes, and an X-linked locus that regulates the production of Hb F-containing erythrocytes (F cells), ie, the F-cell production (FCP) locus. To determine the relative importance of these factors, we studied 257 Jamaican SS subjects from a Cohort group identified by newborn screening and from a Sib Pair study. Linear regression analyses showed that each variable, when analyzed alone, had a significant association with Hb F levels (P < .05). Multiple regression analysis, including all variables, showed that the FCP locus is the strongest predictor, accounting for 40% of Hb F variation. beta-Globin haplotypes, alpha-globin genes, and age accounted for less than 10% of the variation. The association between the beta-globin haplotypes and Hb F levels becomes apparent if the influence of the FCP locus is removed by analyzing only individuals with the same FCP phenotype. Thus, the FCP locus is the most important factor identified to date in determining Hb F levels. The variation within each FCP phenotype is modulated by factors associated with the three common beta-globin haplotypes and other as yet unidentified factor(s).

  11. Cyclopiazonic acid biosynthesis gene cluster gene cpaM is required for speradine A biosynthesis.

    PubMed

    Tokuoka, Masafumi; Kikuchi, Tomoki; Shinohara, Yasutomo; Koyama, Akifumi; Iio, Shin-Ichiro; Kubota, Takaaki; Kobayashi, Jun'ichi; Koyama, Yasuji; Totsuka, Akira; Shindo, Hitoshi; Sato, Kazuo

    2015-01-01

    Speradine A is a derivative of cyclopiazonic acid (CPA) found in culture of an Aspergillus tamarii isolate. Heterologous expression of a predicted methyltransferase gene, cpaM, in the cpa biosynthesis gene cluster of A. tamarii resulted in the speradine A production in a 2-oxoCPA producing A. oryzae strain, indicating cpaM is involved in the speradine A biosynthesis.

  12. MHC-Linked Olfactory Receptor Loci Exhibit Polymorphism and Contribute to Extended HLA/OR-Haplotypes

    PubMed Central

    Ehlers, Anke; Beck, Stephan; Forbes, Simon A.; Trowsdale, John; Volz, Armin; Younger, Ruth; Ziegler, Andreas

    2000-01-01

    Clusters of olfactory receptor (OR) genes are found on most human chromosomes. They are one of the largest mammalian multigene families. Here, we report a systematic study of polymorphism of OR genes belonging to the largest fully sequenced OR cluster. The cluster contains 36 OR genes, of which two belong to the vomeronasal 1 (V1-OR) family. The cluster is divided into a major and a minor region at the telomeric end of the HLA complex on chromosome 6. These OR genes could be involved in MHC-related mate preferences. The polymorphism screen was carried out with 13 genes from the HLA-linked OR cluster and three genes from chromosomes 7, 17, and 19 as controls. Ten human cell lines, representing 18 different chromosome 6s, were analyzed. They were from various ethnic origins and exhibited different HLA haplotypes. All OR genes tested, including those not linked to the HLA complex, were polymorphic. These polymorphisms were dispersed along the coding region and resulted in up to seven alleles for a given OR gene. Three polymorphisms resulted either in stop codons (genes hs6M1-4P, hs6M1-17) or in a 16–bp deletion (gene hs6M1-19P), possibly leading to lack of ligand recognition by the respective receptors in the cell line donors. In total, 13 HLA-linked OR haplotypes could be defined. Therefore, allelic variation appears to be a general feature of human OR genes. [The sequence data reported in this paper have been submitted to EMBL under accession nos. AC006137, AC004178, AJ132194, AL022727, AL031983, AL035402, AL035542, Z98744, CAB55431, AL050339, AL035402, AL096770, AL133267, AL121944, Z98745, AL021808, and AL021807.] PMID:11116091

  13. Analysis of lamprey clustered Fox genes: insight into Fox gene evolution and expression in vertebrates.

    PubMed

    Wotton, Karl R; Shimeld, Sebastian M

    2011-12-01

    In the human genome, members of the FoxC, FoxF, FoxL1, and FoxQ1 gene families are found in two paralagous clusters. One cluster contains the genes FOXQ1, FOXF2, FOXC1 and the second consists of FOXF1, FOXC2, and FOXL1. In jawed vertebrates these genes are known to be expressed in different pharyngeal tissues and all, except FoxQ1, are involved in patterning the early embryonic mesoderm. We have previously traced the evolution of this cluster in the bony vertebrates, and the gene content is identical in the dogfish, a member of the most basally branching lineage of the jawed vertebrates. Here we extend these analyses to jawless vertebrates. Using genomic searches and molecular approaches we have identified homologues of these genes from lampreys. We identify two FoxC genes, two FoxF genes, two FoxQ1 genes and single FoxL1 gene. We examine the embryonic expression of one predominantly mesodermally expressed gene family, FoxC, and the endodermally expressed member of the cluster, FoxQ1. We identified FoxQ1 transcripts in the pharyngeal endoderm, while the two FoxC genes are differentially expressed in the pharyngeal mesenchyme and ectoderm. Furthermore we identify conserved expression of lamprey FoxC genes in the paraxial and intermediate mesoderms. We interpret our results through a chordate-wide comparison of expression patterns and discuss gene content in the context of theories on the evolution of the vertebrate genome.

  14. Comparative genomic analysis of sixty mycobacteriophage genomes: Genome clustering, gene acquisition and gene size

    PubMed Central

    Hatfull, Graham F.; Jacobs-Sera, Deborah; Lawrence, Jeffrey G.; Pope, Welkin H.; Russell, Daniel A.; Ko, Ching-Chung; Weber, Rebecca J.; Patel, Manisha C.; Germane, Katherine L.; Edgar, Robert H.; Hoyte, Natasha N.; Bowman, Charles A.; Tantoco, Anthony T.; Paladin, Elizabeth C.; Myers, Marlana S.; Smith, Alexis L.; Grace, Molly S.; Pham, Thuy T.; O'Brien, Matthew B.; Vogelsberger, Amy M.; Hryckowian, Andrew J.; Wynalek, Jessica L.; Donis-Keller, Helen; Bogel, Matt W.; Peebles, Craig L.; Cresawn, Steve G.; Hendrix, Roger W.

    2010-01-01

    Mycobacteriophages are viruses that infect mycobacterial hosts. Expansion of a collection of sequenced phage genomes to a total of sixty – all infecting a common bacterial host – provides further insight into their diversity and evolution. Of the sixty phage genomes, 55 can be grouped into nine clusters according to their nucleotide sequence similarities, five of which can be further divided into subclusters; five genomes do not cluster with other phages. The sequence diversity between genomes within a cluster varies greatly; for example, the six genomes in cluster D share more than 97.5% average nucleotide similarity with each other. In contrast, similarity between the two genomes in Cluster I is barely detectable by diagonal plot analysis. The total of 6,858 predicted ORFs have been grouped into 1523 phamilies (phams) of related sequences, 46% of which possess only a single member. Only 18.8% of the phams have sequence similarity to non-mycobacteriophage database entries and fewer than 10% of all phams can be assigned functions based on database searching or synteny. Genome clustering facilitates the identification of genes that are in greatest genetic flux and are more likely to have been exchanged horizontally in relatively recent evolutionary time. Although mycobacteriophage genes exhibit smaller average size than genes of their host (205 residues compared to 315), phage genes in higher flux average only ∼100 amino acids, suggesting that the primary units of genetic exchange correspond to single protein domains. PMID:20064525

  15. Molecular analysis of the beta-globin gene cluster in the Niokholo Mandenka population reveals a recent origin of the beta(S) Senegal mutation.

    PubMed

    Currat, Mathias; Trabuchet, Guy; Rees, David; Perrin, Pascale; Harding, Rosalind M; Clegg, John B; Langaney, André; Excoffier, Laurent

    2002-01-01

    A large and ethnically well-defined Mandenka sample from eastern Senegal was analyzed for the polymorphism of the beta-globin gene cluster on chromosome 11. Five RFLP sites of the 5' region were investigated in 193 individuals revealing the presence of 10 different haplotypes. The frequency of the sickle-cell anemia causing mutation (beta(S)) in the Mandenka estimated from this sample is 11.7%. This mutation was found strictly associated with the single Senegal haplotype. Approximately 600 bp of the upstream region of the beta-globin gene were sequenced for a subset of 94 chromosomes, showing the presence of four transversions, five transitions, and a composite microsatellite polymorphism. The sequence of 22 beta(S) chromosomes was also identical to the previously defined Senegal haplotype, suggesting that this mutation is very recent. Monte Carlo simulations (allowing for a specific balancing selection model, a logistic growth of the population, and variable initial frequencies of the Senegal haplotype) were used to estimate the age of the beta(S) mutation. Resulting maximum-likelihood estimates are 45-70 generations (1,350-2,100 years) for very different demographic scenarios. Smallest confidence intervals (25-690 generations) are obtained under the hypothesis that the Mandenka population is large (N(e) >5,000) and stationary or that it has undergone a rapid demographic expansion to a current size of >5,000 reproducing individuals, which is quite likely in view of the great diversity found on beta(A) chromosomes.

  16. CFTR gene mutations--including three novel nucleotide substitutions--and haplotype background in patients with asthma, disseminated bronchiectasis and chronic obstructive pulmonary disease.

    PubMed

    Tzetis, M; Efthymiadou, A; Strofalis, S; Psychou, P; Dimakou, A; Pouliou, E; Doudounakis, S; Kanavakis, E

    2001-03-01

    In order to investigate the incidence of cystic fibrosis transmembrane conductance regulator (CFTR) gene mutations and unclassified variants in chronic pulmonary disease in children and adults, we studied 20 patients with asthma, 19 with disseminated bronchiectasis (DB) of unknown aetiology, and 12 patients with chronic obstructive pulmonary disease (COPD), and compared the results to 52 subjects from the general Greek population. Analysis of the whole coding region of the CFTR gene and its flanking intronic regions revealed that the proportion of CFTR mutations was 45% in asthma (P<0.05), 26.3% in DB (P>0.05), 16.7% in COPD (P>0.05), compared to 15.4% in the general population. Seventeen different molecular defects involved in disease predisposition were identified in 16 patients. Three potentially disease-causing mutations, T388 M, M1R and V11I, are novel, found so far only in three asthma patients. The hyperactive M470 allele was found more frequently in COPD patients (frequency 70.8%, P<0.01) than in the controls. The study of the TGmTnM470 V polyvariant CFTR allele revealed the presence of CFTR function-modulating haplotypes TG13/T5/M470, TG11/T5/M470, TG12/T5/V470 and TG12/T7, combined with M470 or V470, in six asthma patients, four DB patients (P<0.01), and two COPD patients (P<0.05). These results confirm the involvement of the CFTR gene in asthma, DB and possibly in COPD.

  17. Haplotype Detection from Next-Generation Sequencing in High-Ploidy-Level Species: 45S rDNA Gene Copies in the Hexaploid Spartina maritima

    PubMed Central

    Boutte, Julien; Aliaga, Benoît; Lima, Oscar; Ferreira de Carvalho, Julie; Ainouche, Abdelkader; Macas, Jiri; Rousseau-Gueutin, Mathieu; Coriton, Olivier; Ainouche, Malika; Salmon, Armel

    2015-01-01

    Gene and whole-genome duplications are widespread in plant nuclear genomes, resulting in sequence heterogeneity. Identification of duplicated genes may be particularly challenging in highly redundant genomes, especially when there are no diploid parents as a reference. Here, we developed a pipeline to detect the different copies in the ribosomal RNA gene family in the hexaploid grass Spartina maritima from next-generation sequencing (Roche-454) reads. The heterogeneity of the different domains of the highly repeated 45S unit was explored by identifying single nucleotide polymorphisms (SNPs) and assembling reads based on shared polymorphisms. SNPs were validated using comparisons with Illumina sequence data sets and by cloning and Sanger (re)sequencing. Using this approach, 29 validated polymorphisms and 11 validated haplotypes were reported (out of 34 and 20, respectively, that were initially predicted by our program). The rDNA domains of S. maritima have similar lengths as those found in other Poaceae, apart from the 5′-ETS, which is approximately two-times longer in S. maritima. Sequence homogeneity was encountered in coding regions and both internal transcribed spacers (ITS), whereas high intragenomic variability was detected in the intergenic spacer (IGS) and the external transcribed spacer (ETS). Molecular cytogenetic analysis by fluorescent in situ hybridization (FISH) revealed the presence of one pair of 45S rDNA signals on the chromosomes of S. maritima instead of three expected pairs for a hexaploid genome, indicating loss of duplicated homeologous loci through the diploidization process. The procedure developed here may be used at any ploidy level and using different sequencing technologies. PMID:26530424

  18. Horizontal transfer of a large and highly toxic secondary metabolic gene cluster between fungi.

    PubMed

    Slot, Jason C; Rokas, Antonis

    2011-01-25

    Genes involved in intermediary and secondary metabolism in fungi are frequently physically linked or clustered. For example, in Aspergillus nidulans the entire pathway for the production of sterigmatocystin (ST), a highly toxic secondary metabolite and a precursor to the aflatoxins (AF), is located in a ∼54 kb, 23 gene cluster. We discovered that a complete ST gene cluster in Podospora anserina was horizontally transferred from Aspergillus. Phylogenetic analysis shows that most Podospora cluster genes are adjacent to or nested within Aspergillus cluster genes, although the two genera belong to different taxonomic classes. Furthermore, the Podospora cluster is highly conserved in content, sequence, and microsynteny with the Aspergillus ST/AF clusters and its intergenic regions contain 14 putative binding sites for AflR, the transcription factor required for activation of the ST/AF biosynthetic genes. Examination of ∼52,000 Podospora expressed sequence tags identified transcripts for 14 genes in the cluster, with several expressed at multiple life cycle stages. The presence of putative AflR-binding sites and the expression evidence for several cluster genes, coupled with the recent independent discovery of ST production in Podospora [1], suggest that this HGT event probably resulted in a functional cluster. Given the abundance of metabolic gene clusters in fungi, our finding that one of the largest known metabolic gene clusters moved intact between species suggests that such transfers might have significantly contributed to fungal metabolic diversity. PAPERFLICK:

  19. Transcriptional Analysis of Essential Genes of the Escherichia coli Fatty Acid Biosynthesis Gene Cluster by Functional Replacement with the Analogous Salmonella typhimurium Gene Cluster

    PubMed Central

    Zhang, Yan; Cronan, John E.

    1998-01-01

    The genes encoding several key fatty acid biosynthetic enzymes (called the fab cluster) are clustered in the order plsX-fabH-fabD-fabG-acpP-fabF at min 24 of the Escherichia coli chromosome. A difficulty in analysis of the fab cluster by the polar allele duplication approach (Y. Zhang and J. E. Cronan, Jr., J. Bacteriol. 178:3614–3620, 1996) is that several of these genes are essential for the growth of E. coli. We overcame this complication by use of the fab gene cluster of Salmonella typhimurium, a close relative of E. coli, to provide functions necessary for growth. The S. typhimurium fab cluster was isolated by complementation of an E. coli fabD mutant and was found to encode proteins with >94% homology to those of E. coli. However, the S. typhimurium sequences cannot recombine with the E. coli sequences required to direct polar allele duplication via homologous recombination. Using this approach, we found that although approximately 60% of the plsX transcripts initiate at promoters located far upstream and include the upstream rpmF ribosomal protein gene, a promoter located upstream of the plsX coding sequence (probably within the upstream gene, rpmF) is sufficient for normal growth. We have also found that the fabG gene is obligatorily cotranscribed with upstream genes. Insertion of a transcription terminator cassette (Ω-Cm cassette) between the fabD and fabG genes of the E. coli chromosome abolished fabG transcription and blocked cell growth, thus providing the first indication that fabG is an essential gene. Insertion of the Ω-Cm cassette between fabH and fabD caused greatly decreased transcription of the fabD and fabG genes and slower cellular growth, indicating that fabD has only a weak promoter(s). PMID:9642179

  20. Evolution of killer cell Ig-like receptor (KIR) genes: definition of an orangutan KIR haplotype reveals expansion of lineage III KIR associated with the emergence of MHC-C.

    PubMed

    Guethlein, Lisbeth A; Older Aguilar, Anastazia M; Abi-Rached, Laurent; Parham, Peter

    2007-07-01

    Orangutan (Pongo pygmaeus) MHC-C appears less evolved than human HLA-C: Popy-C is not fixed and its alleles encode only one (C1) of the two motifs for killer cell Ig-like receptor (KIR) ligands. To assess the structure and complexity of the orangutan KIR locus, the complete nucleotide sequence of an orangutan KIR haplotype was determined. The PopyKIR locus is flanked by LILR and FCAR and consists of seven genes and pseudogenes, two novel and five corresponding to known cDNA. Distinguishing all KIRs in this rapidly evolving KIR locus from the KIR3DX1 gene is an LTR33A/MLT1D element in intron 3. These two forms of KIR represent lineages that originated by duplication of a common ancestor. The conserved, framework regions of primate KIR loci comprise the 5' part of a lineage V KIR, the 3' part of a pseudogene, the complete 2DL4 gene, and the 3' part of a lineage II KIR. Although previously defined PopyKIR2DL4 alleles contain premature termination codons, the sequenced haplotype's PopyKIR2DL4 allele encodes a full-length protein. A model for KIR evolution is proposed. Distinguishing the orangutan KIR haplotype from the proposed common ancestor of primate KIR haplotypes is an increased number to give three lineage III KIR genes in the centromeric part of the locus, the site for most human lineage III genes encoding HLA-C specific KIR. Thus, expansion of lineage III KIR is associated with emergence of MHC-C.

  1. Haplotype distribution in the class I sirtuin genes and their associations with ultrasound carcass traits in Qinchuan cattle (Bos taurus).

    PubMed

    Gui, Linsheng; Hao, Ruijie; Zhang, Yaran; Zhao, Xianlin; Zan, Linsen

    2015-06-01

    Class I sirtuin genes including SIRT1, SIRT2 and SIRT3, are members of the nicotinamide adenine dinucleotide (NAD)-dependent family of histone deacetylases, and play essential roles in senescence, metabolism, and apoptosis. This study was conducted to detect potential polymorphisms of the bovine class I sirtuin genes and explore their relationships with ultrasound carcass traits in Qinchuan cattle. Four non-coding mutations in the 3'UTR (SIRT1: g.25751A > C, SIRT1: g.25846A > G, SIRT2: g.19676G > A and SIRT3: g. 25702C > T) and three mutations in exons (SIRT2: g.4062C > T; SIRT2: g.4406C > T and SIRT3: g.25557A > G) were identified in 468 individuals of Qinchuan cattle. Chi-square tests showed that g.25751A > C, g.19676G > A, and g.25702C > T were in Hardy-Weinberg disequilibrium (χ(2) < χ0.05(2)). The statistical analyses indicated that six SNPs were significantly associated with the ultrasound carcass traits (P < 0.05) except g.4062C > T (SIRT2) (P > 0.05). These results indicate that the variations in the class I sirtuin genes and their corresponding genotypes may be considered as molecular markers for economic traits in cattle breeding.

  2. Cytochrome P450 2E1 Gene Polymorphisms/Haplotypes and Anti-Tuberculosis Drug-Induced Hepatitis in a Chinese Cohort

    PubMed Central

    Tang, Shaowen; Lv, Xiaozhen; Zhang, Yuan; Wu, Shanshan; Yang, Zhirong; Xia, Yinyin; Tu, Dehua; Deng, Peiyuan; Ma, Yu; Chen, Dafang; Zhan, Siyan

    2013-01-01

    Objective The pathogenic mechanism of anti-tuberculosis (anti-TB) drug-induced hepatitis is associated with drug metabolizing enzymes. No tagging single-nucleotide polymorphisms (tSNPs) of cytochrome P450 2E1(CYP2E1) in the risk of anti-TB drug-induced hepatitis have been reported. The present study was aimed at exploring the role of tSNPs in CYP2E1 gene in a population-based anti-TB treatment cohort. Methods and Design A nested case-control study was designed. Each hepatitis case was 14 matched with controls by age, gender, treatment history, disease severity and drug dosage. The tSNPs were selected by using Haploview 4.2 based on the HapMap database of Han Chinese in Beijing, and detected by using TaqMan allelic discrimination technology. Results Eighty-nine anti-TB drug-induced hepatitis cases and 356 controls were included in this study. 6 tSNPs (rs2031920, rs2070672, rs915908, rs8192775, rs2515641, rs2515644) were genotyped and minor allele frequencies of these tSNPs were 21.9%, 23.0%, 19.1%, 23.6%, 20.8% and 44.4% in the cases and 20.9%, 22.7%, 18.9%, 23.2%, 18.2% and 43.2% in the controls, respectively. No significant difference was observed in genotypes or allele frequencies of the 6 tSNPs between case group and control group, and neither of haplotypes in block 1 nor in block 2 was significantly associated with the development of hepatitis. Conclusion Based on the Chinese anti-TB treatment cohort, we did not find a statistically significant association between genetic polymorphisms of CYP2E1 and the risk of anti-TB drug-induced hepatitis. None of the haplotypes showed a significant association with the development of hepatitis in Chinese TB population. PMID:23460870

  3. Locus for a human hereditary cataract is closely linked to the. gamma. -crystallin gene family

    SciTech Connect

    Lubsen, N.H.; Renwick, J.H.; Tsui, L.C.; Breitman, M.L.; Schoenmakers, J.G.G.

    1987-01-01

    Within the human ..gamma..-crystallin gene cluster polymorphic Taq I sites are present. These give rise to three sets of allelic fragments from the ..gamma..-crystallin genes. Together these restriction fragment length polymorphisms define eight possible haplotypes, three of which (Q, R, and S) were found in the Dutch and English population. A fourth haplotype (P) was detected within a family in which a hereditary Coppock-like cataract of the embryonic lens nucleus occurs in heterozygotes. Haplotype P was found only in family members who suffered from cataract, and all family members who suffered from cataract had haplotype P. The absolute correlation between the presence of haplotype P and cataract within this family shows that the ..gamma..-crystallin gene cluster and the locus for the Coppock-like cataract are closely linked. This linkage provides genetic evidence that the primary cause of a cataract in humans could possibly be a lesion in a crystallin gene.

  4. HLA Type Inference via Haplotypes Identical by Descent

    NASA Astrophysics Data System (ADS)

    Setty, Manu N.; Gusev, Alexander; Pe'Er, Itsik

    The Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self antigens from non self ones. HLA genes are hyper variable with nearly every locus harboring over a dozen alleles. This variation plays an important role in susceptibility to multiple autoimmune diseases and needs to be matched on for organ transplantation. Unfortunately, HLA typing by serological methods is time consuming and expensive compared to high throughput Single Nucleotide Polymorphism (SNP) data. We present a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from SNP genotype data. IBD information is modeled as graph where shared haplotypes are explored among clusters of individuals with known and unknown HLA types to identify the latter. We analyze performance of the method in a previously typed subset of the HapMap population, achieving accuracy of 96% in HLA-A, 94% in HLA-B, 95% in HLA-C, 77% in HLA-DR1, 93% in HLA-DQA1 and 90% in HLA-DQB1 genes. We compare our method to a tag SNP based approach and demonstrate higher sensitivity and specificity. Our method demonstrates the power of using shared haplotype segments for large-scale imputation at the HLA locus.

  5. Gene clusters reflecting macrodomain structure respond to nucleoid perturbations.

    PubMed

    Scolari, Vittore F; Bassetti, Bruno; Sclavi, Bianca; Lagomarsino, Marco Cosentino

    2011-03-01

    Focusing on the DNA-bridging nucleoid proteins Fis and H-NS, and integrating several independent experimental and bioinformatic data sources, we investigate the links between chromosomal spatial organization and global transcriptional regulation. By means of a novel multi-scale spatial aggregation analysis, we uncover the existence of contiguous clusters of nucleoid-perturbation sensitive genes along the genome, whose expression is affected by a combination of topological DNA state and nucleoid-shaping protein occupancy. The clusters correlate well with the macrodomain structure of the genome. The most significant of them lay symmetrically at the edges of the Ter macrodomain and involve all of the flagellar and chemotaxis machinery, in addition to key regulators of biofilm formation, suggesting that the regulation of the physical state of the chromosome by the nucleoid proteins plays an important role in coordinating the transcriptional response leading to the switch between a motile and a biofilm lifestyle.

  6. Discovery of a widely distributed toxin biosynthetic gene cluster

    PubMed Central

    Lee, Shaun W.; Mitchell, Douglas A.; Markley, Andrew L.; Hensler, Mary E.; Gonzalez, David; Wohlrab, Aaron; Dorrestein, Pieter C.; Nizet, Victor; Dixon, Jack E.

    2008-01-01

    Bacteriocins represent a large family of ribosomally produced peptide antibiotics. Here we describe the discovery of a widely conserved biosynthetic gene cluster for the synthesis of thiazole and oxazole heterocycles on ribosomally produced peptides. These clusters encode a toxin precursor and all necessary proteins for toxin maturation and export. Using the toxin precursor peptide and heterocycle-forming synthetase proteins from the human pathogen Streptococcus pyogenes, we demonstrate the in vitro reconstitution of streptolysin S activity. We provide evidence that the synthetase enzymes, as predicted from our bioinformatics analysis, introduce heterocycles onto precursor peptides, thereby providing molecular insight into the chemical structure of streptolysin S. Furthermore, our studies reveal that the synthetase exhibits relaxed substrate specificity and modifies toxin precursors from both related and distant species. Given our findings, it is likely that the discovery of similar peptidic toxins will rapidly expand to existing and emerging genomes. PMID:18375757

  7. Analysis of HLA genes and haplotypes in Ainu (from Hokkaido, northern Japan) supports the premise that they descent from Upper Paleolithic populations of East Asia.

    PubMed

    Bannai, M; Ohashi, J; Harihara, S; Takahashi, Y; Juji, T; Omoto, K; Tokunaga, K

    2000-02-01

    The Ainu people are assumed to be the descendants of pre-agricultural native populations of northern Japan, while the majority of population of present-day Japan (Hondo-Japanese) is considered to have descended mainly from post-neolithic migrants. Sequence-level polymorphisms of the HLA-class I (HLA-A and HLA-B) genes were investigated in DNA samples of 50 Ainu living in Hidaka district, Hokkaido. HLA-A*2402, A*0201, A*0206, A*2601, A*3101, B*1501, B*5101, B*3901, and B*3501 were observed at frequencies of more than 10% and most of these have previously been found in populations of not only Asians but also North and South American Indians. A*68012, which has not so far been detected in Hondo-Japanese, was found in the Ainu (3%). On the other hand, several alleles common in Hondo-Japanese, including HLA-A*3303, A*1101, B*4403, B*5201, B*5401, B*4601, and B*0702 were infrequent in Ainu (0-1%). Correspondence and neighbor-joining analyses of various populations based on HLA-A, -B and -DRB1 gene frequencies enabled distinction between Asian, Native South American, European, and African populations. The Ainu, as well as Tlingit (Na-Dene), were placed midway between other East Asians, including Hondo Japanese, and Native South Americans (Amerindians) in the correspondence analysis. Furthermore, several HLA-A-B and HLA-B-DR-DQ haplotypes common in the Ainu, are shared with some Native American populations. These observations strongly suggest a unique place for the Ainu as descendants of some Upper Paleolithic populations of East Asia, from whom some Native Americans may have descended.

  8. EasyCluster: a fast and efficient gene-oriented clustering tool for large-scale transcriptome data

    PubMed Central

    Picardi, Ernesto; Mignone, Flavio; Pesole, Graziano

    2009-01-01

    Background ESTs and full-length cDNAs represent an invaluable source of evidence for inferring reliable gene structures and discovering potential alternative splicing events. In newly sequenced genomes, these tasks may not be practicable owing to the lack of appropriate training sets. However, when expression data are available, they can be used to build EST clusters related to specific genomic transcribed loci. Common strategies recently employed to this end are based on sequence similarity between transcripts and can lead, in specific conditions, to inconsistent and erroneous clustering. In order to improve the cluster building and facilitate all downstream annotation analyses, we developed a simple genome-based methodology to generate gene-oriented clusters of ESTs when a genomic sequence and a pool of related expressed sequences are provided. Our procedure has been implemented in the software EasyCluster and takes into account the spliced nature of ESTs after an ad hoc genomic mapping. Methods EasyCluster uses the well-known GMAP program in order to perform a very quick EST-to-genome mapping in addition to the detection of reliable splice sites. Given a genomic sequence and a pool of ESTs/FL-cDNAs, EasyCluster starts building genomic and EST local databases and runs GMAP. Subsequently, it parses results creating an initial collection of pseudo-clusters by grouping ESTs according to the overlap of their genomic coordinates on the same strand. In the final step, EasyCluster refines the clustering by again running GMAP on each pseudo-cluster and groups together ESTs sharing at least one splice site. Results The higher accuracy of EasyCluster with respect to other clustering tools has been verified by means of a manually cured benchmark of human EST clusters. Additional datasets including the Unigene cluster Hs.122986 and ESTs related to the human HOXA gene family have also been used to demonstrate the better clustering capability of EasyCluster over current genome

  9. Diploid/polyploid syntenic shuttle mapping and haplotype-specific chromosome walking toward a rust resistance gene (Bru1) in highly polyploid sugarcane (2n approximately 12x approximately 115).

    PubMed

    Le Cunff, Loïc; Garsmeur, Olivier; Raboin, Louis Marie; Pauquet, Jérome; Telismart, Hugues; Selvi, Athiappan; Grivet, Laurent; Philippe, Romain; Begum, Dilara; Deu, Monique; Costet, Laurent; Wing, Rod; Glaszmann, Jean Christophe; D'Hont, Angélique

    2008-09-01

    The genome of modern sugarcane cultivars is highly polyploid (approximately 12x), aneuploid, of interspecific origin, and contains 10 Gb of DNA. Its size and complexity represent a major challenge for the isolation of agronomically important genes. Here we report on the first attempt to isolate a gene from sugarcane by map-based cloning, targeting a durable major rust resistance gene (Bru1). We describe the genomic strategies that we have developed to overcome constraints associated with high polyploidy in the successive steps of map-based cloning approaches, including diploid/polyploid syntenic shuttle mapping with two model diploid species (sorghum and rice) and haplotype-specific chromosome walking. Their applications allowed us (i) to develop a high-resolution map including markers at 0.28 and 0.14 cM on both sides and 13 markers cosegregating with Bru1 and (ii) to develop a physical map of the target haplotype that still includes two gaps at this stage due to the discovery of an insertion specific to this haplotype. These approaches will pave the way for the development of future map-based cloning approaches for sugarcane and other complex polyploid species.

  10. From hormones to secondary metabolism: the emergence of metabolic gene clusters in plants.

    PubMed

    Chu, Hoi Yee; Wegel, Eva; Osbourn, Anne

    2011-04-01

    Gene clusters for the synthesis of secondary metabolites are a common feature of microbial genomes. Well-known examples include clusters for the synthesis of antibiotics in actinomycetes, and also for the synthesis of antibiotics and toxins in filamentous fungi. Until recently it was thought that genes for plant metabolic pathways were not clustered, and this is certainly true in many cases; however, five plant secondary metabolic gene clusters have now been discovered, all of them implicated in synthesis of defence compounds. An obvious assumption might be that these eukaryotic gene clusters have arisen by horizontal gene transfer from microbes, but there is compelling evidence to indicate that this is not the case. This raises intriguing questions about how widespread such clusters are, what the significance of clustering is, why genes for some metabolic pathways are clustered and those for others are not, and how these clusters form. In answering these questions we may hope to learn more about mechanisms of genome plasticity and adaptive evolution in plants. It is noteworthy that for the five plant secondary metabolic gene clusters reported so far, the enzymes for the first committed steps all appear to have been recruited directly or indirectly from primary metabolic pathways involved in hormone synthesis. This may or may not turn out to be a common feature of plant secondary metabolic gene clusters as new clusters emerge.

  11. Toward Awakening Cryptic Secondary Metabolite Gene Clusters in Filamentous Fungi

    PubMed Central

    Lim, Fang Yun; Sanchez, James F.; Wang, Clay C.C.; Keller, Nancy P.

    2013-01-01

    Mining for novel natural compounds is of eminent importance owing to the continuous need for new pharmaceuticals. Filamentous fungi are historically known to harbor the genetic capacity for an arsenal of natural compounds, both beneficial and detrimental to humans. The majority of these metabolites are still cryptic or silent under standard laboratory culture conditions. Mining for these cryptic natural products can be an excellent source for identifying new compound classes. Capitalizing on the current knowledge on how secondary metabolite gene clusters are regulated has allowed the research community to unlock many hidden fungal treasures, as described in this chapter. PMID:23084945

  12. Vitamin D receptor gene polymorphisms/haplotypes and serum 25(OH)D3 levels in Hashimoto's thyroiditis.

    PubMed

    Giovinazzo, Salvatore; Vicchio, Teresa M; Certo, Rosaria; Alibrandi, Angela; Palmieri, Orazio; Campennì, Alfredo; Cannavò, Salvatore; Trimarchi, Francesco; Ruggeri, Rosaria Maddalena

    2017-02-01

    Vitamin D deficiency and/or reduced function, as per certain polymorphisms of the vitamin D receptor (VDR) gene, have been related to several autoimmune disorders. The present study was aimed to investigate the association of Hashimoto's thyroiditis with vitamin D status and functional polymorphisms (SNPs) of the VDR gene. In this case-control study, 200 euthyroid subjects were enrolled: 100 newly diagnosed HT patients (87 F, 13 M; mean age ± SD 42 ± 15 year) and 100 healthy individuals, matched for age, sex, BMI, and month of blood sampling. Serum 25(OH)D3 was measured by HPLC. The VDR SNPs BsmI, ApaI, and TaqI, in strong linkage disequilibrium with each other, were detected by restriction fragment length polymorphism-PCR. The prevalence of vitamin D deficiency in HT patients was significantly higher than that in the control group (70 vs 18.2 %; p < 0.0001), and median serum 25(OH)D3 level was significantly lower in HT patients than controls (median value: 16.2 vs 37.4 ng/ml; p = 0.026). Moreover, there was a significant inverse correlation between serum 25(OH)D3 and TPOAb concentration (r = -0.669; p = 0.034). Contrarily, the genotype distribution of the studied SNPs was not different in the two groups (BsmI p = 0.783; ApaI p = 0.512; TaqI p = 0.471), as was the allelic frequency [f(B) p = 0.776, f(b) p = 0.887; f(A) p = 0.999, f(a) p = 0.999; f(T) p = 0.617; f(t) p = 0.617]. The present study first investigates newly diagnosed untreated HT and suggests that vitamin D deficiency may contribute to HT development and/or progression, acting as an environmental trigger, while the VDR locus does not appear to be involved in conditioning the genetic susceptibility to the disease, at least in Caucasians.

  13. Founder mitochondrial haplotypes in Amerindian populations.

    PubMed Central

    Bailliet, G.; Rothhammer, F.; Carnese, F. R.; Bravi, C. M.; Bianchi, N. O.

    1994-01-01

    It had been proposed that the colonization of the New World took place by three successive migrations from northeastern Asia. The first one gave rise to Amerindians (Paleo-Indians), the second and third ones to Nadene and Aleut-Eskimo, respectively. Variation in mtDNA has been used to infer the demographic structure of the Amerindian ancestors. The study of RFLP all along the mtDNA and the analysis of nucleotide substitutions in the D-loop region of the mitochondrial genome apparently indicate that most or all full-blooded Amerindians cluster in one of four different mitochondrial haplotypes that are considered to represent the founder maternal lineages of Paleo-Indians. We have studied the mtDNA diversity in 109 Amerindians belonging to 3 different tribes, and we have reanalyzed the published data on 482 individuals from 18 other tribes. Our study confirms the existence of four major Amerindian haplotypes. However, we also found evidence supporting the existence of several other potential founder haplotypes or haplotype subsets in addition to the four ancestral lineages reported. Confirmation of a relatively high number of founder haplotypes would indicate that early migration into America was not accompanied by a severe genetic bottleneck. PMID:7517626

  14. Founder mitochondrial haplotypes in Amerindian populations.

    PubMed

    Bailliet, G; Rothhammer, F; Carnese, F R; Bravi, C M; Bianchi, N O

    1994-07-01

    It had been proposed that the colonization of the New World took place by three successive migrations from northeastern Asia. The first one gave rise to Amerindians (Paleo-Indians), the second and third ones to Nadene and Aleut-Eskimo, respectively. Variation in mtDNA has been used to infer the demographic structure of the Amerindian ancestors. The study of RFLP all along the mtDNA and the analysis of nucleotide substitutions in the D-loop region of the mitochondrial genome apparently indicate that most or all full-blooded Amerindians cluster in one of four different mitochondrial haplotypes that are considered to represent the founder maternal lineages of Paleo-Indians. We have studied the mtDNA diversity in 109 Amerindians belonging to 3 different tribes, and we have reanalyzed the published data on 482 individuals from 18 other tribes. Our study confirms the existence of four major Amerindian haplotypes. However, we also found evidence supporting the existence of several other potential founder haplotypes or haplotype subsets in addition to the four ancestral lineages reported. Confirmation of a relatively high number of founder haplotypes would indicate that early migration into America was not accompanied by a severe genetic bottleneck.

  15. Molecular Characterization of Neurally Expressing Genes in the Para Sodium Channel Gene Cluster of Drosophila

    PubMed Central

    Hong, C. S.; Ganetzky, B.

    1996-01-01

    To elucidate the mechanisms regulating expression of para, which encodes the major class of sodium channels in the Drosophila nervous system, we have tried to locate upstream cis-acting regulatory elements by mapping the transcriptional start site and analyzing the region immediately upstream of para in region 14D of the polytene chromosomes. From these studies, we have discovered that the region contains a cluster of neurally expressing genes. Here we report the molecular characterization of the genomic organization of the 14D region and the genes within this region, which are: calnexin (Cnx), actin related protein 14D (Arp14D), calcineurin A 14D (CnnA14D), and chromosome associated protein (Cap). The tight clustering of these genes, their neuronal expression patterns, and their potential functions related to expression, modulation, or regulation of sodium channels raise the possibility that these genes represent a functionally related group sharing some coordinate regulatory mechanism. PMID:8849894

  16. Evolutionary formation of gene clusters by reorganization: the meleagrin/roquefortine paradigm in different fungi.

    PubMed

    Martín, Juan F; Liras, Paloma

    2016-02-01

    The biosynthesis of secondary metabolites in fungi is catalyzed by enzymes encoded by genes linked in clusters that are frequently co-regulated at the transcriptional level. Formation of gene clusters may take place by de novo assembly of genes recruited from other cellular functions, but also novel gene clusters are formed by reorganization of progenitor clusters and are distributed by horizontal gene transfer. This article reviews (i) the published information on the roquefortine/meleagrin/neoxaline gene clusters of Penicillium chrysogenum (Penicillium rubens) and the short roquefortine cluster of Penicillium roqueforti, and (ii) the correlation of the genes present in those clusters with the enzymes and metabolites derived from these pathways. The P. chrysogenum roq/mel cluster consists of seven genes and includes a gene (roqT) encoding a 12-TMS transporter protein of the MFS family. Interestingly, the orthologous P. roquefortine gene cluster has only four genes and the roqT gene is present as a residual pseudogene that encodes only small peptides. Two of the genes present in the central region of the P. chrysogenum roq/mel cluster have been lost during the evolutionary formation of the short cluster and the order of the structural genes in the cluster has been rearranged. The two lost genes encode a N1 atom hydroxylase (nox) and a roquefortine scaffold-reorganizing oxygenase (sro). As a consequence P. roqueforti has lost the ability to convert the roquefortine-type carbon skeleton to the glandicoline/meleagrin-type scaffold and is unable to produce glandicoline B, meleagrin and neoxaline. The loss of this genetic information is not recent and occurred probably millions of years ago when a progenitor Penicillium strain got adapted to life in a few rich habitats such as cheese, fermented cereal grains or silage. P. roqueforti may be considered as a "domesticated" variant of a progenitor common to contemporary P. chrysogenum and related Penicillia.

  17. Distribution and Genetic Diversity of Bacteriocin Gene Clusters in Rumen Microbial Genomes

    PubMed Central

    Azevedo, Analice C.; Bento, Cláudia B. P.; Ruiz, Jeronimo C.; Queiroz, Marisa V.

    2015-01-01

    Some species of ruminal bacteria are known to produce antimicrobial peptides, but the screening procedures have mostly been based on in vitro assays using standardized methods. Recent sequencing efforts have made available the genome sequences of hundreds of ruminal microorganisms. In this work, we performed genome mining of the complete and partial genome sequences of 224 ruminal bacteria and 5 ruminal archaea to determine the distribution and diversity of bacteriocin gene clusters. A total of 46 bacteriocin gene clusters were identified in 33 strains of ruminal bacteria. Twenty gene clusters were related to lanthipeptide biosynthesis, while 11 gene clusters were associated with sactipeptide production, 7 gene clusters were associated with class II bacteriocin production, and 8 gene clusters were associated with class III bacteriocin production. The frequency of strains whose genomes encode putative antimicrobial peptide precursors was 14.4%. Clusters related to the production of sactipeptides were identified for the first time among ruminal bacteria. BLAST analysis indicated that the majority of the gene clusters (88%) encoding putative lanthipeptides contained all the essential genes required for lanthipeptide biosynthesis. Most strains of Streptococcus (66.6%) harbored complete lanthipeptide gene clusters, in addition to an open reading frame encoding a putative class II bacteriocin. Albusin B-like proteins were found in 100% of the Ruminococcus albus strains screened in this study. The in silico analysis provided evidence of novel biosynthetic gene clusters in bacterial species not previously related to bacteriocin production, suggesting that the rumen microbiota represents an underexplored source of antimicrobial peptides. PMID:26253660

  18. Gene prioritization and clustering by multi-view text mining

    PubMed Central

    2010-01-01

    Background Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. Results We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. Conclusions In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification. PMID:20074336

  19. DNA damage in leukocytes of sickle cell anemia patients is associated with hydroxyurea therapy and with HBB*S haplotype.

    PubMed

    da Silva Rocha, Lilianne Brito; Dias Elias, Darcielle Bruna; Barbosa, Maritza Cavalcante; Bandeira, Izabel Cristina Justino; Gonçalves, Romélia Pinheiro

    2012-12-12

    Hydroxyurea (HU) is the primary pharmacologic agent for preventing the complications and improving the quality of life of sickle cell anemia (SCA) patients. Although HU has been associated with an increased risk of leukemia in some patients with myeloproliferative disorders, the mutagenic and carcinogenic potential of HU has not been established. This study used the alkaline comet assay to investigate DNA damage in peripheral blood leukocytes from 41 individuals with SCA treated with HU (SCAHU) and from 26 normal individuals. The presence of HbS and the analysis of the haplotypes of the beta S gene cluster were done by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). The damage index (DI) in the SCAHU group was significantly higher than in controls (p<0.001). Neither gender nor age was associated with DNA damage in controls or SCAHU individuals. Among the SCAHU individuals, DI was significantly influenced by length of HU treatment (p=0.0039) and BMI (p=0.001). Individuals with length of HU treatment≥20 months and BMI≤20kg/m(2) had a significantly greater DI than those with length of HU treatment<20 months and BMI>20kg/m(2). No significant influence of mean HU dose was observed on DI (p=0.950). However, individuals who received a mean HU dose≥20mg/kg showed a higher DI than those who received less. Furthermore, an association was observed between DI damage and HBB*S gene haplotypes. DI values for the Bantu/Bantu haplotype was greater when compared to the Benin/Benin haplotype; and the Bantu/Benin haplotype had a DI lower than the Bantu/Bantu haplotype and greater than the Benin/Benin haplotype. Our results show that DNA damage in sickle cell anemia is associated not only with treatment with HU but also with genotype.

  20. Gravitation field algorithm and its application in gene cluster

    PubMed Central

    2010-01-01

    Background Searching optima is one of the most challenging tasks in clustering genes from available experimental data or given functions. SA, GA, PSO and other similar efficient global optimization methods are used by biotechnologists. All these algorithms are based on the imitation of natural phenomena. Results This paper proposes a novel searching optimization algorithm called Gravitation Field Algorithm (GFA) which is derived from the famous astronomy theory Solar Nebular Disk Model (SNDM) of planetary formation. GFA simulates the Gravitation field and outperforms GA and SA in some multimodal functions optimization problem. And GFA also can be used in the forms of unimodal functions. GFA clusters the dataset well from the Gene Expression Omnibus. Conclusions The mathematical proof demonstrates that GFA could be convergent in the global optimum by probability 1 in three conditions for one independent variable mass functions. In addition to these results, the fundamental optimization concept in this paper is used to analyze how SA and GA affect the global search and the inherent defects in SA and GA. Some results and source code (in Matlab) are publicly available at http://ccst.jlu.edu.cn/CSBG/GFA. PMID:20854683

  1. Genomic sequence of 'Candidatus Liberibacter solanacearum' haplotype C and its comparison with haplotype A and B genomes

    PubMed Central

    Haapalainen, Minna; Schott, Thomas; Thompson, Sarah M.; Smith, Grant R.; Nissinen, Anne I.; Pirhonen, Minna

    2017-01-01

    Haplotypes A and B of ‘Candidatus Liberibacter solanacearum’ (CLso) are associated with diseases of solanaceous plants, especially Zebra chip disease of potato, and haplotypes C, D and E are associated with symptoms on apiaceous plants. To date, one complete genome of haplotype B and two high quality draft genomes of haplotype A have been obtained for these unculturable bacteria using metagenomics from the psyllid vector Bactericera cockerelli. Here, we present the first genomic sequences obtained for the carrot-associated CLso. These two genomic sequences of haplotype C, FIN114 (1.24 Mbp) and FIN111 (1.20 Mbp), were obtained from carrot psyllids (Trioza apicalis) harboring CLso. Genomic comparisons between the haplotypes A, B and C revealed that the genome organization differs between these haplotypes, due to large inversions and other recombinations. Comparison of protein-coding genes indicated that the core genome of CLso consists of 885 ortholog groups, with the pan-genome consisting of 1327 ortholog groups. Twenty-seven ortholog groups are unique to CLso haplotype C, whilst 11 ortholog groups shared by the haplotypes A and B, are not found in the haplotype C. Some of these ortholog groups that are not part of the core genome may encode functions related to interactions with the different host plant and psyllid species. PMID:28158295

  2. Arrangement of the Clostridium baratii F7 Toxin Gene Cluster with Identification of a σ Factor That Recognizes the Botulinum Toxin Gene Cluster Promoters

    SciTech Connect

    Dover, Nir; Barash, Jason R.; Burke, Julianne N.; Hill, Karen K.; Detter, John C.; Arnon, Stephen S.

    2014-05-22

    Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.

  3. Arrangement of the Clostridium baratii F7 toxin gene cluster with identification of a σ factor that recognizes the botulinum toxin gene cluster promoters.

    PubMed

    Dover, Nir; Barash, Jason R; Burke, Julianne N; Hill, Karen K; Detter, John C; Arnon, Stephen S

    2014-01-01

    Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. We sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. This TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.

  4. An algorithm for haplotype analysis

    SciTech Connect

    Lin, Shili; Speed, T.P.

    1997-12-01

    This paper proposes an algorithm for haplotype analysis based on a Monte Carlo method. Haplotype configurations are generated according to the distribution of joint haplotypes of individuals in a pedigree given their phenotype data, via a Markov chain Monte Carlo algorithm. The haplotype configuration which maximizes this conditional probability distribution can thus be estimated. In addition, the set of haplotype configurations with relatively high probabilities can also be estimated as possible alternatives to the most probable one. This flexibility enables geneticists to choose the haplotype configurations which are most reasonable to them, allowing them to include their knowledge of the data under analysis. 18 refs., 2 figs., 1 tab.

  5. Serum Total Bilirubin, not Cholelithiasis, is Influenced by UGT1A1 Polymorphism, Alpha Thalassemia and βs Haplotype: First Report on Comparison between Arab-Indian and African βs Genes

    PubMed Central

    Alkindi, Said Y.; Pathare, Anil; Al Zadjali, Shoaib; Panjwani, Vinodhkumar; Wasim, Fauzia; Khan, Hammad; Chopra, Pradeep; Krishnamoorthy, Rajagopal; Alkindi, Salam

    2015-01-01

    Background We explored the potential relationship between steady state serum bilirubin levels and the incidence of cholelithiasis in the context of UGT1A1 gene A(TA)nTAA promoter polymorphism in Omani sickle cell anemia (SCA) patients, homozygotes for African (Benin and Bantu) and Arab-Indian βS haplotypes, but sharing the same microgeographical environment and comparable life style factors. Methods 136 SCA patients were retrospectively studied in whom imaging data including abdominal CT scan, MRI or Ultrasonography were routinely available. Available data on the mean steady state hematological/biochemical parameters (n=136), βs haplotypes(n=136), α globin gene status (n=105) and UGT1A1 genotypes (n=133) were reviewed from the respective medical records. Results The mean serum total bilirubin level was significantly higher in the homozygous UGT1A1(AT)7 group as compared to UGT1A1(AT)6 group. Thus, not cholelithiasis but total serum bilirubin was influenced by UGT1A1 polymorphism in this SCA cohort. Conclusion As observed in other population groups, the UGT1A1 (AT)7 homozygosity was significantly associated with raised serum total bilirubin level, but the prevalence of gallstones in the Omani SCA patients was not associated with α thalassaemia, UGT1A1 polymorphism, or βs haplotypes. PMID:26543529

  6. Identification of the cluster control region for the protocadherin-beta genes located beyond the protocadherin-gamma cluster.

    PubMed

    Yokota, Shinnichi; Hirayama, Teruyoshi; Hirano, Keizo; Kaneko, Ryosuke; Toyoda, Shunsuke; Kawamura, Yoshimi; Hirabayashi, Masumi; Hirabayashi, Takahiro; Yagi, Takeshi

    2011-09-09

    The clustered protocadherins (Pcdhs), Pcdh-α, -β, and -γ, are transmembrane proteins constituting a subgroup of the cadherin superfamily. Each Pcdh cluster is arranged in tandem on the same chromosome. Each of the three Pcdh clusters shows stochastic and combinatorial expression in individual neurons, thus generating a hugely diverse set of possible cell surface molecules. Therefore, the clustered Pcdhs are candidates for determining neuronal molecular diversity. Here, we showed that the targeted deletion of DNase I hypersensitive (HS) site HS5-1, previously identified as a Pcdh-α regulatory element in vitro, affects especially the expression of specific Pcdh-α isoforms in vivo. We also identified a Pcdh-β cluster control region (CCR) containing six HS sites (HS16, 17, 17', 18, 19, and 20) downstream of the Pcdh-γ cluster. This CCR comprehensively activates the expression of the Pcdh-β gene cluster in cis, and its deletion dramatically decreases their expression levels. Deleting the CCR nonuniformly down-regulates some Pcdh-γ isoforms and does not affect Pcdh-α expression. Thus, the CCR effect extends beyond the 320-kb region containing the Pcdh-γ cluster to activate the upstream Pcdh-β genes. Thus, we concluded that the CCR is a highly specific regulatory unit for Pcdh-β expression on the clustered Pcdh genomic locus. These findings suggest that each Pcdh cluster is controlled by distinct regulatory elements that activate their expression and that the stochastic gene regulation of the clustered Pcdhs is controlled by the complex chromatin architecture of the clustered Pcdh locus.

  7. Nucleotide polymorphism in colicin E2 gene clusters: evidence for nonneutral evolution.

    PubMed

    Tan, Y; Riley, M A

    1997-06-01

    To explore the molecular mechanisms behind the diversification of colicin gene clusters, we examined DNA sequence polymorphism for the colicin gene clusters of 14 colicin E2 (ColE2) plasmids obtained from natural isolates of Escherichia coli. Two types of ColE2 plasmids are revealed, with type II gene clusters generated by recombination between type I ColE2 and ColE7 gene clusters. The levels and patterns of DNA polymorphism are different between the two types. Type I polymorphism is distributed evenly along the gene cluster, while type II accumulates polymorphism at an elevated rate in the 5' end of the colicin gene. These differences may be explained by recombinational origins of type II gene clusters. The pattern of divergence between the ColE2 gene cluster and its close relative ColE9 is not correlated with the pattern of polymorphism within ColE2, suggesting that this gene cluster is not evolving in a neutral fashion. A statistical test confirms significant departures from the predictions of neutrality. These data lend further support to the hypothesis that colicin gene clusters may evolve under the influence of nonneutral forces.

  8. High diversity of {alpha}-globin haplotypes in a senegalese population, including many previously unreported variants

    SciTech Connect

    Martinson, J.J.; Swinburn, C.; Clegg, J.B.

    1995-11-01

    RFLP haplotypes at the {alpha}-globin gene complex have been examined in 190 individuals from the Niokolo Mandenka population of Senegal: haplotypes were assigned unambiguously for 210 chromosomes. The Mandenka share with other African populations a sample size-independent haplotype diversity that is much greater than that in any non-African population: the number of haplotypes observed in the Mandenka is typically twice that seen in the non-African populations sampled to date. Of these haplotypes, 17.3% had not been observed in any previous surveys, and a further 19.1% have previously been reported only in African populations. The haplotype distribution shows clear differences between African and non-African peoples, but this is on the basis of population-specific haplotypes combined with haplotypes common to all. The relationship of the newly reported haplotypes to those previously recorded suggests that several mutation processes, particularly recombination as homologous exchange or gene conversion, have been involved in their production. A computer program based on the expectation-maximization (EM) algorithm was used to obtain maximum-likelihood estimates of haplotype frequencies for the entire data set: good concordance between the unambiguous and EM-derived sets was seen for the overall haplotype frequencies. Some of the low-frequency haplotypes reported by the estimation algorithm differ greatly, in structure, from those haplotypes known to be present in human populations, and they may not represent haplotypes actually present in the sample. 43 refs., 4 figs., 4 tabs.

  9. High diversity of alpha-globin haplotypes in a Senegalese population, including many previously unreported variants.

    PubMed Central

    Martinson, J J; Excoffier, L; Swinburn, C; Boyce, A J; Harding, R M; Langaney, A; Clegg, J B

    1995-01-01

    RFLP haplotypes at the alpha-globin gene complex have been examined in 190 individuals from the Niokolo Mandenka population of Senegal: haplotypes were assigned unambiguously for 210 chromosomes. The Mandenka share with other African populations a sample size-independent haplotype diversity that is much greater than that in any non-African population: the number of haplotypes observed in the Mandenka is typically twice that seen in the non-African populations sampled to date. Of these haplotypes, 17.3% had not been observed in any previous surveys, and a further 19.1% have previously been reported only in African populations. The haplotype distribution shows clear differences between African and non-African peoples, but this is on the basis of population-specific haplotypes combined with haplotypes common to all. The relationship of the newly reported haplotypes to those previously recorded suggests that several mutation processes, particularly recombination as homologous exchange or gene conversion, have been involved in their production. A computer program based on the expectation-maximization (EM) algorithm was used to obtain maximum-likelihood estimates of haplotype frequencies for the entire data set: good concordance between the unambiguous and EM-derived sets was seen for the overall haplotype frequencies. Some of the low-frequency haplotypes reported by the estimation algorithm differ greatly, in structure, from those haplotypes known to be present in human populations, and they may not represent haplotypes actually present in the sample. PMID:7485171

  10. Polyketide and nonribosomal peptide retro-biosynthesis and global gene cluster matching.

    PubMed

    Dejong, Chris A; Chen, Gregory M; Li, Haoxin; Johnston, Chad W; Edwards, Mclean R; Rees, Philip N; Skinnider, Michael A; Webster, Andrew L H; Magarvey, Nathan A

    2016-12-01

    Polyketides (PKs) and nonribosomal peptides (NRPs) are profoundly important natural products, forming the foundations of many therapeutic regimes. Decades of research have revealed over 11,000 PK and NRP structures, and genome sequencing is uncovering new PK and NRP gene clusters at an unprecedented rate. However, only ∼10% of PK and NRPs are currently associated with gene clusters, and it is unclear how many of these orphan gene clusters encode previously isolated molecules. Therefore, to efficiently guide the discovery of new molecules, we must first systematically de-orphan emergent gene clusters from genomes. Here we provide to our knowledge the first comprehensive retro-biosynthetic program, generalized retro-biosynthetic assembly prediction engine (GRAPE), for PK and NRP families and introduce a computational pipeline, global alignment for natural products cheminformatics (GARLIC), to uncover how observed biosynthetic gene clusters relate to known molecules, leading to the identification of gene clusters that encode new molecules.

  11. Haplotyping using a combination of polymerase chain reaction-single-strand conformational polymorphism analysis and haplotype-specific PCR amplification.

    PubMed

    Zhou, Huitong; Li, Shaobin; Liu, Xiu; Wang, Jiqing; Luo, Yuzhu; Hickford, Jon G H

    2014-12-01

    A single nucleotide polymorphism (SNP) may have an impact on phenotype, but it may also be influenced by multiple SNPs within a gene; hence, the haplotype or phase of multiple SNPs needs to be known. Various methods for haplotyping SNPs have been proposed, but a simple and cost-effective method is currently unavailable. Here we describe a haplotyping approach using two simple techniques: polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and haplotype-specific PCR. In this approach, individual regions of a gene are analyzed by PCR-SSCP to identify variation that defines sub-haplotypes, and then extended haplotypes are assembled from the sub-haplotypes either directly or with the additional use of haplotype-specific PCR amplification. We demonstrate the utility of this approach by haplotyping ovine FABP4 across two variable regions that contain seven SNPs and one indel. The simplicity of this approach makes it suitable for large-scale studies and/or diagnostic screening.

  12. Rapid Selection of Plasmodium falciparum Chloroquine Resistance Transporter Gene and Multidrug Resistance Gene-1 Haplotypes Associated with Past Chloroquine and Present Artemether-Lumefantrine Use in Inhambane District, Southern Mozambique

    PubMed Central

    Thomsen, Thomas T.; Madsen, Laura B.; Hansson, Helle H.; Tomás, Elsa V. E.; Charlwood, Derek; Bygbjerg, Ib C.; Alifrangis, Michael

    2013-01-01

    Chloroquine (CQ) use in Mozambique was stopped in 2002 and artemether-lumefantrine (AL) was implemented in 2008. In light of no use of CQ and extensive use of AL, we determined the frequency of molecular markers of Plasmodium falciparum drug resistance/tolerance to CQ and AL in persons living in Linga-Linga, an isolated peninsula and in Furvela village, which is located 8 km inland. The P. falciparum chloroquine resistance transporter gene CVMNK wild type increased in frequency from 43.9% in 2009 to 66.4% in 2010 (P ≤ 0.001), and combined P. falciparum multidrug resistance gene 1 N86-184F-D1246 haplotype increased significantly between years (P = 0.039). The combination of P. falciparum chloroquine resistance transporter gene CVMNK and P. falciparum multidrug resistance gene NFD increased from 24.3% (2009) to 45.3% in (2010, P = 0.017). The rapid changes observed may largely be caused by decreased use of CQ and large-scale use of AL. In the absence of a clear AL-resistance marker and the (almost) continent-wide use of AL in sub-Saharan Africa, and when considering CQ reintroduction, continued monitoring of these markers is needed. PMID:23382159

  13. Ancient haplotypes of the HLA Class II region.

    PubMed

    Raymond, Christopher K; Kas, Arnold; Paddock, Marcia; Qiu, Ruolan; Zhou, Yang; Subramanian, Sandhya; Chang, Jean; Palmieri, Anthony; Haugen, Eric; Kaul, Rajinder; Olson, Maynard V

    2005-09-01

    Allelic variation in codons that specify amino acids that line the peptide-binding pockets of HLA's Class II antigen-presenting proteins is superimposed on strikingly few deeply diverged haplotypes. These haplotypes appear to have been evolving almost independently for tens of millions of years. By complete resequencing of 20 haplotypes across the approximately 100-kbp region that spans the HLA-DQA1, -DQB1, and -DRB1 genes, we provide a detailed view of the way in which the genome structure at this locus has been shaped by the interplay of selection, gene-gene interaction, and recombination.

  14. Lampreys have a single gene cluster for the fast skeletal myosin heavy chain gene family.

    PubMed

    Ikeda, Daisuke; Ono, Yosuke; Hirano, Shigeki; Kan-no, Nobuhiro; Watabe, Shugo

    2013-01-01

    Muscle tissues contain the most classic sarcomeric myosin, called myosin II, which consists of 2 heavy chains (MYHs) and 4 light chains. In the case of humans (tetrapod), a total of 6 fast skeletal-type MYH genes (MYHs) are clustered on a single chromosome. In contrast, torafugu (teleost) contains at least 13 fast skeletal MYHs, which are distributed in 5 genomic regions; the MYHs are clustered in 3 of these regions. In the present study, the evolutionary relationship among fast skeletal MYHs is elucidated by comparing the MYHs of teleosts and tetrapods with those of cyclostome lampreys, one of two groups of extant jawless vertebrates (agnathans). We found that lampreys contain at least 3 fast skeletal MYHs, which are clustered in a head-to-tail manner in a single genomic region. Although there was apparent synteny in the corresponding MYH cluster regions between lampreys and tetrapods, phylogenetic analysis indicated that lamprey and tetrapod MYHs have independently duplicated and diversified. Subsequent transgenic approaches showed that the 5'-flanking sequences of Japanese lamprey fast skeletal MYHs function as a regulatory sequence to drive specific reporter gene expression in the fast skeletal muscle of zebrafish embryos. Although zebrafish MYH promoters showed apparent activity to direct reporter gene expression in myogenic cells derived from mice, promoters from Japanese lamprey MYHs had no activity. These results suggest that the muscle-specific regulatory mechanisms are partially conserved between teleosts and tetrapods but not between cyclostomes and tetrapods, despite the conserved synteny.

  15. Identification of lethal cluster of genes in the yeast transcription network

    NASA Astrophysics Data System (ADS)

    Rho, K.; Jeong, H.; Kahng, B.

    2006-05-01

    Identification of essential or lethal genes would be one of the ultimate goals in drug designs. Here we introduce an in silico method to select the cluster with a high population of lethal genes, called lethal cluster, through microarray assay. We construct a gene transcription network based on the microarray expression level. Links are added one by one in the descending order of the Pearson correlation coefficients between two genes. As the link density p increases, two meaningful link densities pm and ps are observed. At pm, which is smaller than the percolation threshold, the number of disconnected clusters is maximum, and the lethal genes are highly concentrated in a certain cluster that needs to be identified. Thus the deletion of all genes in that cluster could efficiently lead to a lethal inviable mutant. This lethal cluster can be identified by an in silico method. As p increases further beyond the percolation threshold, the power law behavior in the degree distribution of a giant cluster appears at ps. We measure the degree of each gene at ps. With the information pertaining to the degrees of each gene at ps, we return to the point pm and calculate the mean degree of genes of each cluster. We find that the lethal cluster has the largest mean degree.

  16. Nucleotide sequence and transcriptional analysis of the type A2 neurotoxin gene cluster in Clostridium botulinum.

    PubMed

    Dineen, Sean S; Bradshaw, Marite; Karasek, Charles E; Johnson, Eric A

    2004-06-01

    The nucleotide sequences of the upstream regions of the botulinum neurotoxin type A1 (BoNT/A1) cluster of Clostridium botulinum strain NCTC 2916 and the BoNT/A2 cluster of strain Kyoto-F were determined. A novel gene, designated orfx3, was identified following the orfx2 gene in both clusters. ORF-X2 and ORF-X3 exhibit similarity to the BoNT cluster associated P-47 protein. The BoNT/A1 and BoNT/A2 clusters share a similar gene arrangement, but exhibit differences in the spacing between certain genes. Sequences with similarity to transposases were identified in these intergenic regions, suggesting that these differences arose from an ancestral insertion event. Transcriptional analysis of the BoNT/A2 cluster revealed that the genes of the cluster are primarily synthesized as three polycistronic transcripts. Two divergent polycistronic transcripts, one encoding the orfx1, orfx2, and orfx3 genes, the second encoding the p47, ntnh, and bont/a2 genes, are transcribed from conserved BoNT cluster promoters. The third polycistronic transcript, expressed at low levels, encodes the positive regulatory botR gene and the orfx genes. This is the first complete analysis of a botulinum toxin A2 cluster.

  17. Interactions of Environmental Factors and APOA1-APOC3-APOA4-APOA5 Gene Cluster Gene Polymorphisms with Metabolic Syndrome

    PubMed Central

    Wu, Yanhua; Yu, Yaqin; Zhao, Tiancheng; Wang, Shibin; Fu, Yingli; Qi, Yue; Yang, Guang; Yao, Wenwang; Su, Yingying; Ma, Yue; Shi, Jieping; Jiang, Jing; Kou, Changgui

    2016-01-01

    than 2 servings a week were positively associated with MetS (P<0.05). A family history of diabetes (OR = 1.630, 95%CI = 1.484–1.791), cardiovascular disease or cerebral diseases (OR = 1.297, 95%CI = 1.211–1.389) was associated with MetS. APOA1 rs670, APOA5 rs662799 and rs651821 revealed significant differences in genotype distributions between the MetS patients and control subjects. The minor alleles of APOA1 rs670, APOA5 rs662799 and rs651821, and APOA5 rs2075291 were associated with MetS (P<0.0016). APOA1 rs5072 and APOC3 rs5128, APOA5 rs651821 and rs662799 were in strong linkage disequilibrium to each other with r2 greater than 0.8. Five haplotypes were associated with an increased risk of MetS (OR = 1.23, 1.58, 1.80, 1.90, and 1.98). When we investigated the interactions of environmental factors and APOA1/C3/A4/A5 gene cluster gene polymorphisms, we found that APOA5 rs662799 had interactions with tobacco use and alcohol consumption (PGE<0.05). Conclusions There was a high prevalence of MetS in the northeast of China. Male gender, increasing age, mental labor, family history of diabetes, cardiovascular disease or cerebral diseases, current smoking, excess salt intake, fruit and dairy intake less than 2 servings a week, and drinking were associated with MetS. The APOA1/C3/A4/A5 gene cluster was associated with MetS in the Han Chinese. APOA5 rs662799 had interactions with the environmental factors associated with MetS. PMID:26824674

  18. Mutational and Phylogenetic Analyses of the Mycobacterial mbt Gene Cluster ▿§

    PubMed Central

    Chavadi, Sivagami Sundaram; Stirrett, Karen L.; Edupuganti, Uthamaphani R.; Vergnolle, Olivia; Sadhanandan, Gigani; Marchiano, Emily; Martin, Che; Qiu, Wei-Gang; Soll, Clifford E.; Quadri, Luis E. N.

    2011-01-01

    The mycobactin siderophore system is present in many Mycobacterium species, including M. tuberculosis and other clinically relevant mycobacteria. This siderophore system is believed to be utilized by both pathogenic and nonpathogenic mycobacteria for iron acquisition in both in vivo and ex vivo iron-limiting environments, respectively. Several M. tuberculosis genes located in a so-called mbt gene cluster have been predicted to be required for the biosynthesis of the core scaffold of mycobactin based on sequence analysis. A systematic and controlled mutational analysis probing the hypothesized essential nature of each of these genes for mycobactin production has been lacking. The degree of conservation of mbt gene cluster orthologs remains to be investigated as well. In this study, we sought to conclusively establish whether each of nine mbt genes was required for mycobactin production and to examine the conservation of gene clusters orthologous to the M. tuberculosis mbt gene cluster in other bacteria. We report a systematic mutational analysis of the mbt gene cluster ortholog found in Mycobacterium smegmatis. This mutational analysis demonstrates that eight of the nine mbt genes investigated are essential for mycobactin production. Our genome mining and phylogenetic analyses reveal the presence of orthologous mbt gene clusters in several bacterial species. These gene clusters display significant organizational differences originating from an intricate evolutionary path that might have included horizontal gene transfers. Altogether, the findings reported herein advance our understanding of the genetic requirements for the biosynthesis of an important mycobacterial secondary metabolite with relevance to virulence. PMID:21873494

  19. A Special Local Clustering Algorithm for Identifying the Genes Associated With Alzheimer’s Disease

    PubMed Central

    Pang, Chao-Yang; Hu, Wei; Hu, Ben-Qiong; Shi, Ying; Vanderburg, Charles R.; Rogers, Jack T.

    2010-01-01

    Clustering is the grouping of similar objects into a class. Local clustering feature refers to the phenomenon whereby one group of data is separated from another, and the data from these different groups are clustered locally. A compact class is defined as one cluster in which all similar elements cluster tightly within the cluster. Herein, the essence of the local clustering feature, revealed by mathematical manipulation, results in a novel clustering algorithm termed as the special local clustering (SLC) algorithm that was used to process gene microarray data related to Alzheimer’s disease (AD). SLC algorithm was able to group together genes with similar expression patterns and identify significantly varied gene expression values as isolated points. If a gene belongs to a compact class in control data and appears as an isolated point in incipient, moderate and/or severe AD gene microarray data, this gene is possibly associated with AD. Application of a clustering algorithm in disease-associated gene identification such as in AD is rarely reported. PMID:20089478

  20. Translating biosynthetic gene clusters into fungal armor and weaponry

    PubMed Central

    Keller, Nancy P

    2015-01-01

    Filamentous fungi are renowned for the production of a diverse array of secondary metabolites (SMs) where the genetic material required for synthesis of a SM is typically arrayed in a biosynthetic gene cluster (BGC). These natural products are valued for their bioactive properties stemming from their functions in fungal biology, key among those protection from abiotic and biotic stress and establishment of a secure niche. The producing fungus must not only avoid self-harm from endogenous SMs but also deliver specific SMs at the right time to the right tissue requiring biochemical aid. This review highlights functions of BGCs beyond the enzymatic assembly of SMs, considering the timing and location of SM production and other proteins in the clusters that control SM activity. Specifically, self-protection is provided by both BGC-encoded mechanisms and non-BGC subcellular containment of toxic SM precursors; delivery and timing is orchestrated through cellular trafficking patterns and stress- and developmental-responsive transcriptional programs. PMID:26284674

  1. Enzymology of aminoglycoside biosynthesis-deduction from gene clusters.

    PubMed

    Wehmeier, Udo F; Piepersberg, Wolfgang

    2009-01-01

    The classical aminoglycosides are, with very few exceptions, typically actinobacterial secondary metabolites with antimicrobial activities all mediated by inhibiting translation on the 30S subunit of the bacterial ribosome. Some chemically related natural products inhibit glucosidases by mimicking oligo-alpha-1,4-glucosides. The biochemistry of the aminoglycoside biosynthetic pathways is still a developing field since none of the pathways has been analyzed to completeness as yet. In this chapter we treat the enzymology of aminoglycoside biosyntheses as far as it becomes apparent from recent investigations based on the availability of DNA sequence data of biosynthetic gene clusters for all major structural classes of these bacterial metabolites. We give a more general overview of the field, including descriptions of some key enzymes in various aminoglycoside pathways, whereas in Chapter 20 provides a detailed account of the better-studied enzymology thus far known for the neomycin and butirosin pathways.

  2. Prevalence of β(S)-globin gene haplotypes, α-thalassemia (3.7 kb deletion) and redox status in patients with sickle cell anemia in the state of Paraná, Brazil.

    PubMed

    Shimauti, Eliana LitsukoTomimatsu; Silva, Danilo Grunig Humberto; de Souza, Eniuce Menezes; de Almeida, Eduardo Alves; Leal, Francismar Prestes; Bonini-Domingos, Claudia Regina

    2015-01-01

    The aim of this study was to determine the frequency of beta S-globin gene (β(S) globin) haplotypes and alpha thalassemia with 3.7 kb deletion (-α(3.7kb) thalassemia) in the northwest region of Paraná state, and to investigate the oxidative and clinical-hematological profile of β(S) globin carriers in this population. Of the 77 samples analyzed, 17 were Hb SS, 30 were Hb AS and 30 were Hb AA. The β(S)globin haplotypes and -α(3.7kb) thalassemia were identified using polymerase chain reaction.Trolox equivalent antioxidant capacity (TEAC) and lipid peroxidation (LPO) were assessed spectophotometrically. Serum melatonin levels were determined using high-performance liquid chromatography coupled to coulometric electrochemical detection. The haplotype frequencies in the SS individuals were as follows: Bantu- 21 (62%), Benin - 11 (32%) and Atypical- 2 (6%). Bantu/Benin was the most frequent genotype. Of the 47 SS and AS individuals assessed, 17% (n = 8) had the -α(3.7kb) mutation. Clinical manifestations, as well as serum melatonin, TEAC and LPO levels did not differ between Bantu/Bantu and Bantu/Benin individuals (p > 0.05). Both genotypes were associated with high LPO and TEAC levels and decreased melatonin concentration. These data suggest that the level of oxidative stress in patients with Bantu/Bantu and Bantu/Benin genotypes may overload the antioxidant capacity.

  3. Prevalence of βS-globin gene haplotypes, α-thalassemia (3.7 kb deletion) and redox status in patients with sickle cell anemia in the state of Paraná, Brazil

    PubMed Central

    Shimauti, Eliana LitsukoTomimatsu; Silva, Danilo Grunig Humberto; de Souza, Eniuce Menezes; de Almeida, Eduardo Alves; Leal, Francismar Prestes; Bonini-Domingos, Claudia Regina

    2015-01-01

    The aim of this study was to determine the frequency of beta S-globin gene (βS globin) haplotypes and alpha thalassemia with 3.7 kb deletion (−α3.7kb thalassemia) in the northwest region of Paraná state, and to investigate the oxidative and clinical-hematological profile of βS globin carriers in this population. Of the 77 samples analyzed, 17 were Hb SS, 30 were Hb AS and 30 were Hb AA. The βSglobin haplotypes and −α3.7kb thalassemia were identified using polymerase chain reaction.Trolox equivalent antioxidant capacity (TEAC) and lipid peroxidation (LPO) were assessed spectophotometrically. Serum melatonin levels were determined using high-performance liquid chromatography coupled to coulometric electrochemical detection. The haplotype frequencies in the SS individuals were as follows: Bantu- 21 (62%), Benin - 11 (32%) and Atypical- 2 (6%). Bantu/Benin was the most frequent genotype. Of the 47 SS and AS individuals assessed, 17% (n = 8) had the −α3.7kb mutation. Clinical manifestations, as well as serum melatonin, TEAC and LPO levels did not differ between Bantu/Bantu and Bantu/Benin individuals (p > 0.05). Both genotypes were associated with high LPO and TEAC levels and decreased melatonin concentration. These data suggest that the level of oxidative stress in patients with Bantu/Bantu and Bantu/Benin genotypes may overload the antioxidant capacity. PMID:26500435

  4. Association of the AChRalpha-subunit gene (CHRNA), DQA1*0101, and the DR3 haplotype in myasthenia gravis. Evidence for a three-gene disease model in a subgroup of patients.

    PubMed

    Djabiri, F; Caillat-Zucman, S; Gajdos, P; Jaïs, J P; Gomez, L; Khalil, I; Charron, D; Bach, J F; Garchon, H J

    1997-08-01

    Myasthenia gravis (MG) is an autoimmune disease of the neuromuscular junction having multigene control. HLA-linked loci and the HB*14 micro-satellite marker located within the CHRNA gene which encodes the muscular acetylcholine receptor (AChR) alpha-subunit, the target self-antigen, were previously associated with MG. Combined analysis of these loci revealed a significant increase of DQA1*0101 alleles in HB*14+ vs. HB*14- patients and of DQA1*0501 alleles in HB*14/DQA1*0101 patients. Importantly, the effect of DQA1*0101 was independent of allelically associated DQB1 and DRB1 genes. In contrast, the effect of DQA1*0501 could not be dissociated from that of DRB1*03 and DQB1*0201 on the extended DR3 haplotype. These results indicate that a combination of three genes, of which two are linked to HLA, contributes to disease susceptibility in a subgroup of MG patients.

  5. Clustered Genes Encoding the Methyltransferases of Methanogenesis from Monomethylamine

    PubMed Central

    Burke, Stephen A.; Lo, Sam L.; Krzycki, Joseph A.

    1998-01-01

    Coenzyme M (CoM) is methylated during methanogenesis from monomethyamine in a reaction catalyzed by three proteins. Using monomethylamine, a 52-kDa polypeptide termed monomethylamine methyltransferase (MMAMT) methylates the corrinoid cofactor bound to a second polypeptide, monomethylamine corrinoid protein (MMCP). Methylated MMCP then serves as a substrate for MT2-A, which methylates CoM. The genes for these proteins are clustered on 6.8 kb of DNA in Methanosarcina barkeri MS. The gene encoding MMCP (mtmC) is located directly upstream of the gene encoding MMAMT (mtmB). The gene encoding MT2-A (mtbA) was found 1.1 kb upstream of mtmC, but no obvious open reading frame was found in the intergenic region between mtbA and mtmC. A single monocistronic transcript was found for mtbA that initiated 76 bp from the translational start. Separate transcripts of 2.4 and 4.7 kb were detected, both of which carried mtmCB. The larger transcript also encoded mtmP, which is homologous to the APC family of cationic amine permeases and may therefore encode a methylamine permease. A single transcriptional start site was found 447 bp upstream of the translational start of mtmC. MtmC possesses the corrinoid binding motif found in corrinoid proteins involved in dimethylsulfide- and methanol-dependent methanogenesis, as well as in methionine synthase. The open reading frame of mtmB was interrupted by a single in-frame, midframe, UAG codon which was also found in mtmB from M. barkeri NIH. A mechanism that circumvents UAG-directed termination of translation must operate during expression of mtmB in this methanogen. PMID:9642198

  6. Clustered genes encoding the methyltransferases of methanogenesis from monomethylamine.

    PubMed

    Burke, S A; Lo, S L; Krzycki, J A

    1998-07-01

    Coenzyme M (CoM) is methylated during methanogenesis from monomethyamine in a reaction catalyzed by three proteins. Using monomethylamine, a 52-kDa polypeptide termed monomethylamine methyltransferase (MMAMT) methylates the corrinoid cofactor bound to a second polypeptide, monomethylamine corrinoid protein (MMCP). Methylated MMCP then serves as a substrate for MT2-A, which methylates CoM. The genes for these proteins are clustered on 6.8 kb of DNA in Methanosarcina barkeri MS. The gene encoding MMCP (mtmC) is located directly upstream of the gene encoding MMAMT (mtmB). The gene encoding MT2-A (mtbA) was found 1.1 kb upstream of mtmC, but no obvious open reading frame was found in the intergenic region between mtbA and mtmC. A single monocistronic transcript was found for mtbA that initiated 76 bp from the translational start. Separate transcripts of 2.4 and 4.7 kb were detected, both of which carried mtmCB. The larger transcript also encoded mtmP, which is homologous to the APC family of cationic amine permeases and may therefore encode a methylamine permease. A single transcriptional start site was found 447 bp upstream of the translational start of mtmC. MtmC possesses the corrinoid binding motif found in corrinoid proteins involved in dimethylsulfide- and methanol-dependent methanogenesis, as well as in methionine synthase. The open reading frame of mtmB was interrupted by a single in-frame, midframe, UAG codon which was also found in mtmB from M. barkeri NIH. A mechanism that circumvents UAG-directed termination of translation must operate during expression of mtmB in this methanogen.

  7. Polymorphic haplotypes and recombination rates at the LDL receptor gene locus in subjects with and without familial hypercholesterolemia who are from different populations.

    PubMed

    Miserez, A R; Schuster, H; Chiodetti, N; Keller, U

    1993-04-01

    RFLPs at the low-density lipoprotein (LDL) receptor locus for TaqI, StuI, HincII, AvaII, ApaLI (5' and 3'), PvuII, and NcoI were studied in Swiss and German families with familial hypercholesterolemia (FH). A total of 1,104 LDL receptor alleles were analyzed using Southern blotting and new PCR-based techniques for detection of the TaqI, StuI, HincII, AvaII, NcoI RFLPs. Two hundred fifty-six independent haplotypes from 368 individuals of 61 unrelated Swiss families, as well as 114 independent haplotypes from 184 subjects of 25 unrelated German families, were constructed. In 76 families, clinical diagnosis of FH was confirmed by cosegregation analysis. Of the 43 unique haplotypes consisting of seven RFLPs detected in the Swiss and Germans, only 9 were common in both population samples. Analysis of linkage disequilibrium revealed nonrandom associations between several of the investigated RFLPs. ApaLI (5'), NcoI, PvuII, TaqI, and AvaII or HincII were particularly informative (cumulative informativeness .85). Relative frequencies, heterozygosity indexes, and PICs of the RFLPs from the Swiss and Germans were compared with values calculated from reported haplotype data for Italians, Icelanders, North American Caucasians, South African Caucasians, and Japanese. Pairwise comparisons of population samples by common RFLPs demonstrated unexpected differences even between geographically adjacent populations (e.g., the Swiss and Germans). Furthermore, genetic distances from the Germans to the other Caucasians were larger than to the Japanese. An unexpected lack of correlation between linkage disequilibria and physical distances was detected for the German and Japanese data, possibly because of nonuniform recombination with excessively high rates between exon 13 and intron 15. Hence, the present study revealed a striking variety of polymorphic haplotypes and heterogeneity of RFLP frequencies and recombination rates among the seven population samples.

  8. Polymorphic haplotypes and recombination rates at the LDL receptor gene locus in subjects with and without familial hypercholesterolemia who are from different populations.

    PubMed Central

    Miserez, A R; Schuster, H; Chiodetti, N; Keller, U

    1993-01-01

    RFLPs at the low-density lipoprotein (LDL) receptor locus for TaqI, StuI, HincII, AvaII, ApaLI (5' and 3'), PvuII, and NcoI were studied in Swiss and German families with familial hypercholesterolemia (FH). A total of 1,104 LDL receptor alleles were analyzed using Southern blotting and new PCR-based techniques for detection of the TaqI, StuI, HincII, AvaII, NcoI RFLPs. Two hundred fifty-six independent haplotypes from 368 individuals of 61 unrelated Swiss families, as well as 114 independent haplotypes from 184 subjects of 25 unrelated German families, were constructed. In 76 families, clinical diagnosis of FH was confirmed by cosegregation analysis. Of the 43 unique haplotypes consisting of seven RFLPs detected in the Swiss and Germans, only 9 were common in both population samples. Analysis of linkage disequilibrium revealed nonrandom associations between several of the investigated RFLPs. ApaLI (5'), NcoI, PvuII, TaqI, and AvaII or HincII were particularly informative (cumulative informativeness .85). Relative frequencies, heterozygosity indexes, and PICs of the RFLPs from the Swiss and Germans were compared with values calculated from reported haplotype data for Italians, Icelanders, North American Caucasians, South African Caucasians, and Japanese. Pairwise comparisons of population samples by common RFLPs demonstrated unexpected differences even between geographically adjacent populations (e.g., the Swiss and Germans). Furthermore, genetic distances from the Germans to the other Caucasians were larger than to the Japanese. An unexpected lack of correlation between linkage disequilibria and physical distances was detected for the German and Japanese data, possibly because of nonuniform recombination with excessively high rates between exon 13 and intron 15. Hence, the present study revealed a striking variety of polymorphic haplotypes and heterogeneity of RFLP frequencies and recombination rates among the seven population samples. PMID:8096361

  9. Comparative analysis of magnetosome gene clusters in magnetotactic bacteria provides further evidence for horizontal gene transfer.

    PubMed

    Jogler, Christian; Kube, Michael; Schübbe, Sabrina; Ullrich, Susanne; Teeling, Hanno; Bazylinski, Dennis A; Reinhardt, Richard; Schüler, Dirk

    2009-05-01

    The organization of magnetosome genes was analysed in all available complete or partial genomic sequences of magnetotactic bacteria (MTB), including the magnetosome island (MAI) of the magnetotactic marine vibrio strain MV-1 determined in this study. The MAI was found to differ in gene content and organization between Magnetospirillum species and strains MV-1 or MC-1. Although a similar organization of magnetosome genes was found in all MTB, distinct variations in gene order and sequence similarity were uncovered that may account for the observed diversity of biomineralization, cell biology and magnetotaxis found in various MTB. While several magnetosome genes were present in all MTB, others were confined to Magnetospirillum species, indicating that the minimal set of genes required for magnetosome biomineralization might be smaller than previously suggested. A number of novel candidate genes were implicated in magnetosome formation by gene cluster comparison. Based on phylogenetic and compositional evidence we present a model for the evolution of magnetotaxis within the Alphaproteobacteria, which suggests the independent horizontal transfer of magnetosome genes from an unknown ancestor of magnetospirilla into strains MC-1 and MV-1.

  10. CPm gene diversity in field isolates of Citrus tristeza virus from Colombia.

    PubMed

    Oliveros-Garay, Oscar Arturo; Martinez-Salazar, Natalhie; Torres-Ruiz, Yanneth; Acosta, Orlando

    2009-01-01

    The nucleotide sequence diversity of the CPm gene from 28 field isolates of Citrus tristeza virus (CTV) was assessed by SSCP and sequence analyses. These isolates showed two major shared haplotypes, which differed in distribution: A1 was the major haplotype in 23 isolates from different geographic regions, whereas R1 was found in isolates from a discrete region. Phylogenetic reconstruction clustered A1 within an independent group, while R1 was grouped with mild isolates T30 from Florida and T385 from Spain. Some isolates contained several minor haplotypes, which were very similar to, and associated with, the major haplotype.

  11. Characterization and expression analysis of the exopolysaccharide gene cluster in Lactobacillus fermentum TDS030603.

    PubMed

    Dan, Tong; Fukuda, Kenji; Sugai-Bannai, Michiko; Takakuwa, Naoya; Motoshima, Hidemasa; Urashima, Tadasu

    2009-12-01

    Part of the exopolysaccharide gene cluster of Lactobacillus fermentum TDS030603 was characterized. It consists of 11,890 base pairs and is located in the chromosomal DNA, 13 open reading frames of which were encoded. Out of the 13 open reading frames, six were found to be involved in exopolysaccharide synthesis; however, five were similar to transposase genes of other lactobacilli, and two were functionally unrelated. Expression analysis revealed that the exopolysaccharide synthesis-related genes were expressed during cultivation. Southern analysis using specific primers for the exopolysaccharide genes indicated that duplication of the gene cluster did not occur. The plasmid-cured strain maintained its capacity for exopolysaccharide production, confirming that the exopolysaccharide gene cluster of this strain is located in the chromosomal DNA, similarly to thermophilic lactic acid bacteria. Our results indicate that this exopolysaccharide gene cluster is likely to be functional, although extensive gene rearrangement occurs.

  12. The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

    PubMed

    Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

    2011-02-01

    display a different cellular localization compared to that of the gsdf gene indicating that the later gene is not co-regulated. Interestingly, our study identifies new clustered genes that are specifically expressed in previtellogenic oocytes (nup54, aff1, klhl8, sdad1).

  13. Epigenetic regulation of the RHOX homeobox gene cluster and its association with human male infertility.

    PubMed

    Richardson, Marcy E; Bleiziffer, Andreas; Tüttelmann, Frank; Gromoll, Jörg; Wilkinson, Miles F

    2014-01-01

    The X-linked RHOX cluster encodes a set of homeobox genes that are selectively expressed in the reproductive tract. Members of the RHOX cluster regulate target genes important for spermatogenesis promote male fertility in mice. Studies show that demethylating agents strongly upregulate the expression of mouse Rhox genes, suggesting that they are regulated by DNA methylation. However, whether this extends to human RHOX genes, whether DNA methylation directly regulates RHOX gene transcription and how this relates to human male infertility are unknown. To address these issues, we first defined the promoter regions of human RHOX genes and performed gain- and loss-of-function experiments to determine whether human RHOX gene transcription is regulated by DNA methylation. Our results indicated that DNA methylation is necessary and sufficient to silence human RHOX gene expression. To determine whether RHOX cluster methylation associates with male infertility, we evaluated the methylation status of RHOX genes in sperm from a large cohort of infertility patients. Linear regression analysis revealed a strong association between RHOX gene cluster hypermethylation and three independent types of semen abnormalities. Hypermethylation was restricted specifically to the RHOX cluster; we did not observe it in genes immediately adjacent to it on the X chromosome. Our results strongly suggest that human RHOX homeobox genes are under an epigenetic control mechanism that is aberrantly regulated in infertility patients. We propose that hypermethylation of the RHOX gene cluster serves as a marker for idiopathic infertility and that it is a candidate to exert a causal role in male infertility.

  14. Activation and Characterization of a Cryptic Polycyclic Tetramate Macrolactam Biosynthetic Gene Cluster

    PubMed Central

    Luo, Yunzi; Huang, Hua; Liang, Jing; Wang, Meng; Lu, Lu; Shao, Zengyi; Cobb, Ryan E.; Zhao, Huimin

    2014-01-01

    Polycyclic tetramate macrolactams (PTMs) are a widely distributed class of natural products with important biological activities. However, many of them have not been characterized. Here we apply a plug and play synthetic biology strategy to activate a cryptic PTM biosynthetic gene cluster SGR810-815 from Streptomyces griseus and discover three potential PTMs. This gene cluster is highly conserved in phylogenetically diverse bacterial strains and contains an unusual hybrid polyketide synthase-nonribosomal peptide synthetase (PKS-NRPS) which resembles iterative PKSs known in fungi. To further characterize this gene cluster, we use the same synthetic biology approach to create a series of gene deletion constructs and elucidate the biosynthetic steps for the formation of the polycyclic system. The strategy we employ bypasses the traditional laborious processes to elicit gene cluster expression and should be generally applicable to many other silent or cryptic gene clusters for discovery and characterization of new natural products. PMID:24305602

  15. A Bayesian clustering approach for detecting gene-gene interactions in high-dimensional genotype data.

    PubMed

    Chen, Sui-Pi; Huang, Guan-Hua

    2014-06-01

    This paper uses a Bayesian formulation of a clustering procedure to identify gene-gene interactions under case-control studies, called the Algorithm via Bayesian Clustering to Detect Epistasis (ABCDE). The ABCDE uses Dirichlet process mixtures to model SNP marker partitions, and uses the Gibbs weighted Chinese restaurant sampling to simulate posterior distributions of these partitions. Unlike the representative Bayesian epistasis detection algorithm BEAM, which partitions markers into three groups, the ABCDE can be evaluated at any given partition, regardless of the number of groups. This study also develops permutation tests to validate the disease association for SNP subsets identified by the ABCDE, which can yield results that are more robust to model specification and prior assumptions. This study examines the performance of the ABCDE and compares it with the BEAM using various simulated data and a schizophrenia SNP dataset.

  16. Haplotype assembly from aligned weighted SNP fragments.

    PubMed

    Zhao, Yu-Ying; Wu, Ling-Yun; Zhang, Ji-Hong; Wang, Rui-Sheng; Zhang, Xiang-Sun

    2005-08-01

    Given an assembled genome of a diploid organism the haplotype assembly problem can be formulated as retrieval of a pair of haplotypes from a set of aligned weighted SNP fragments. Known computational formulations (models) of this problem are minimum letter flips (MLF) and the weighted minimum letter flips (WMLF; Greenberg et al. (INFORMS J. Comput. 2004, 14, 211-213)). In this paper we show that the general WMLF model is NP-hard even for the gapless case. However the algorithmic solutions for selected variants of WMFL can exist and we propose a heuristic algorithm based on a dynamic clustering technique. We also introduce a new formulation of the haplotype assembly problem that we call COMPLETE WMLF (CWMLF). This model and algorithms for its implementation take into account a simultaneous presence of multiple kinds of data errors. Extensive computational experiments indicate that the algorithmic implementations of the CWMLF model achieve higher accuracy of haplotype reconstruction than the WMLF-based algorithms, which in turn appear to be more accurate than those based on MLF.

  17. Unusual mutation clusters provide insight into class I gene conversion mechanisms.

    PubMed Central

    Pease, L R; Horton, R M; Pullen, J K; Yun, T J

    1993-01-01

    Genetic diversity among the K and D alleles of the mouse major histocompatibility complex is generated by gene conversion among members of the class I multigene family. The majority of known class I mutants contain clusters of nucleotide changes that can be traced to linked family members. However, the details of the gene conversion mechanism are not known. The bm3 and bm23 mutations represent exceptions to the usual pattern and provide insight into intermediates generated during the gene conversion process. Both of these variants contain clusters of five nucleotide substitutions, but they differ from the classic conversion mutants in the important respect that no donor gene for either mutation could be identified in the parental genome. Nevertheless, both mutation clusters are composed of individual mutations that do exist within the parent. Therefore, they are not random and appear to be templated. Significantly, the bm3 and bm23 mutation clusters are divided into overlapping regions that match class I genes which have functioned as donor genes in other characterized gene conversion events. The unusual structure of the mutation clusters indicates an underlying gene conversion mechanism that can generate mutation clusters as a result of the interaction of three genes in a single genetic event. The unusual mutation clusters are consistent with a hypothetical gene conversion model involving extrachromosomal intermediates. Images PMID:8321237

  18. Remarkable variation in maize genome structure inferred from haplotype diversity at the bz locus

    PubMed Central

    Wang, Qinghua; Dooner, Hugo K.

    2006-01-01

    Maize is probably the most diverse of all crop species. Unexpectedly large differences among haplotypes were first revealed in a comparison of the bz genomic regions of two different inbred lines, McC and B73. Retrotransposon clusters, which comprise most of the repetitive DNA in maize, varied markedly in makeup, and location relative to the genes in the region and genic sequences, later shown to be carried by two helitron transposons, also differed between the inbreds. Thus, the allelic bz regions of these Corn Belt inbreds shared only a minority of the total sequence. To investigate further the variation caused by retrotransposons, helitrons, and other insertions, we have analyzed the organization of the bz genomic region in five additional cultivars selected because of their geographic and genetic diversity: the inbreds A188, CML258, and I137TN, and the land races Coroico and NalTel. This vertical comparison has revealed the existence of several new helitrons, new retrotransposons, members of every superfamily of DNA transposons, numerous miniature elements, and novel insertions flanked at either end by TA repeats, which we call TAFTs (TA-flanked transposons). The extent of variation in the region is remarkable. In pairwise comparisons of eight bz haplotypes, the percentage of shared sequences ranges from 25% to 84%. Chimeric haplotypes were identified that combine retrotransposon clusters found in different haplotypes. We propose that recombination in the common gene space greatly amplifies the variability produced by the retrotransposition explosion in the maize ancestry, creating the heterogeneity in genome organization found in modern maize. PMID:17101975

  19. Remarkable variation in maize genome structure inferred from haplotype diversity at the bz locus.

    PubMed

    Wang, Qinghua; Dooner, Hugo K

    2006-11-21

    Maize is probably the most diverse of all crop species. Unexpectedly large differences among haplotypes were first revealed in a comparison of the bz genomic regions of two different inbred lines, McC and B73. Retrotransposon clusters, which comprise most of the repetitive DNA in maize, varied markedly in makeup, and location relative to the genes in the region and genic sequences, later shown to be carried by two helitron transposons, also differed between the inbreds. Thus, the allelic bz regions of these Corn Belt inbreds shared only a minority of the total sequence. To investigate further the variation caused by retrotransposons, helitrons, and other insertions, we have analyzed the organization of the bz genomic region in five additional cultivars selected because of their geographic and genetic diversity: the inbreds A188, CML258, and I137TN, and the land races Coroico and NalTel. This vertical comparison has revealed the existence of several new helitrons, new retrotransposons, members of every superfamily of DNA transposons, numerous miniature elements, and novel insertions flanked at either end by TA repeats, which we call TAFTs (TA-flanked transposons). The extent of variation in the region is remarkable. In pairwise comparisons of eight bz haplotypes, the percentage of shared sequences ranges from 25% to 84%. Chimeric haplotypes were identified that combine retrotransposon clusters found in different haplotypes. We propose that recombination in the common gene space greatly amplifies the variability produced by the retrotransposition explosion in the maize ancestry, creating the heterogeneity in genome organization found in modern maize.

  20. Elite Haplotypes of a Protein Kinase Gene TaSnRK2.3 Associated with Important Agronomic Traits in Common Wheat

    PubMed Central

    Miao, Lili; Mao, Xinguo; Wang, Jingyi; Liu, Zicheng; Zhang, Bin; Li, Weiyu; Chang, Xiaoping; Reynolds, Matthew; Wang, Zhenhua; Jing, Ruilian

    2017-01-01

    Plant-specific protein kinase SnRK2s play crucial roles in response to various environmental stimuli. TaSnRK2.3, a SnRK2 member, was involved in the response to multiple abiotic stresses in wheat. To facilitate the use of TaSnRK2.3 in wheat breeding, the three genomic sequences of TaSnRK2.3, originating from the A, B, and D genomes of hexaploid wheat, were obtained. Sequence polymorphism assays showing 4 and 10 variations were detected at TaSnRK2.3-1A and at TaSnRK2.3-1B, respectively, yet no variation was identified at TaSnRK2.3-1D. Three haplotypes for A genome, and two main haplotypes for B genome of TaSnRK2.3 were identified in 32 genotypes. Functional markers (2.3AM1, 2.3AM2, 2.3BM1, 2.3BM2) were successfully developed to distinguish different haplotypes. Association analysis was performed with the general linear model in TASSEL 2.1. The results showed that both TaSnRK2.3-1A and TaSnRK2.3-1B were significantly associated with plant height (PH), length of peduncle and penultimate node, as well as 1,000-grain weight (TGW) under different environments. Additionally, TaSnRK2.3-1B was significantly associated with stem water-soluble carbohydrates at flowering and mid-grain filling stages. Hap-1A-1 had higher TGW and lower PH; Hap-1B-1 had higher TGW and stem water-soluble carbohydrates, as well as lower PH, thus the two haplotypes were considered as elite haplotypes. Geographic distribution and allelic frequencies indicated that the two preferred haplotypes Hap-1A-1 and Hap-1B-1 were positively selected in the process of Chinese wheat breeding. These results could be valuable for genetic improvement and germplasm enhancement using molecular marker assisted selection in wheat breeding.

  1. A hypothesis to explain how laeA specifically regulates certain secondary metabolite biosynthesis gene clusters

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Biosynthesis of mycotoxins involves transcriptional co-regulation of sets of clustered genes. We hypothesize that specific control of transcription of genes in these clusters by LaeA, a global regulator of secondary metabolite production and development in aspergilli and other filamentous fungi, re...

  2. Fragmentation of an aflatoxin-like gene cluster in a forest pathogen

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Secondary metabolic pathway genes are typically clustered in fungi. An exception to this paradigm is seen for genes required for the production of dothistromin, an aflatoxin-like virulence factor produced by the pine needle pathogen Dothistroma septosporum. In contrast to the tight clustering of gen...

  3. Rough-fuzzy clustering for grouping functionally similar genes from microarray data.

    PubMed

    Maji, Pradipta; Paul, Sushmita

    2013-01-01

    Gene expression data clustering is one of the important tasks of functional genomics as it provides a powerful tool for studying functional relationships of genes in a biological process. Identifying coexpressed groups of genes represents the basic challenge in gene clustering problem. In this regard, a gene clustering algorithm, termed as robust rough-fuzzy c-means, is proposed judiciously integrating the merits of rough sets and fuzzy sets. While the concept of lower and upper approximations of rough sets deals with uncertainty, vagueness, and incompleteness in cluster definition, the integration of probabilistic and possibilistic memberships of fuzzy sets enables efficient handling of overlapping partitions in noisy environment. The concept of possibilistic lower bound and probabilistic boundary of a cluster, introduced in robust rough-fuzzy c-means, enables efficient selection of gene clusters. An efficient method is proposed to select initial prototypes of different gene clusters, which enables the proposed c-means algorithm to converge to an optimum or near optimum solutions and helps to discover coexpressed gene clusters. The effectiveness of the algorithm, along with a comparison with other algorithms, is demonstrated both qualitatively and quantitatively on 14 yeast microarray data sets.

  4. Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants1[OPEN

    PubMed Central

    Zhang, Peifen; Kim, Taehyong; Banf, Michael; Chavali, Arvind K.; Nilo-Poyanco, Ricardo; Bernard, Thomas

    2017-01-01

    Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters. PMID:28228535

  5. A phylogenomic gene cluster resource: The phylogeneticallyinferred groups (PhlGs) database

    SciTech Connect

    Dehal, Paramvir S.; Boore, Jeffrey L.

    2005-08-25

    We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community.

  6. Identification and analysis of a highly conserved chemotaxis gene cluster in Shewanella species.

    SciTech Connect

    Li, J.; Romine, Margaret F.; Ward, M.

    2007-08-01

    A conserved cluster of chemotaxis genes was identified from the genome sequences of fifteen Shewanella species. An in-frame deletion of the cheA-3 gene, which is located in this cluster, was created in S. oneidensis MR-1 and the gene shown to be essential for chemotactic responses to anaerobic electron acceptors. The CheA-3 protein showed strong similarity to Vibrio cholerae CheA-2 and P. aeruginosa CheA-1, two proteins that are also essential for chemotaxis. The genes encoding these proteins were shown to be located in chemotaxis gene clusters closely related to the cheA-3-containing cluster in Shewanella species. The results of this study suggest that a combination of gene neighborhood and homology analyses may be used to predict which cheA genes are essential for chemotaxis in groups of closely related microorganisms.

  7. Base J represses genes at the end of polycistronic gene clusters in Leishmania major by promoting RNAP II termination

    PubMed Central

    Reynolds, David L.; Hofmeister, Brigitte T.; Cliffe, Laura; Siegel, T. Nicolai; Anderson, Britta A.; Beverley, Stephen M.; Schmitz, Robert J.; Sabatini, Robert

    2016-01-01

    Summary The genomes of kinetoplastids are organized into polycistronic gene clusters that are flanked by the modified DNA base J. Previous work has established a role of base J in promoting RNA polymerase II termination in Leishmania spp. where the loss of J leads to termination defects and transcription into adjacent gene clusters. It remains unclear whether these termination defects affect gene expression and whether read through transcription is detrimental to cell growth, thus explaining the essential nature of J. We now demonstrate that reduction of base J at specific sites within polycistronic gene clusters in L. major leads to read through transcription and increased expression of downstream genes in the cluster. Interestingly, subsequent transcription into the opposing polycistronic gene cluster does not lead to downregulation of sense mRNAs. These findings indicate a conserved role for J regulating transcription termination and expression of genes within polycistronic gene clusters in trypanosomatids. In contrast to the expectations often attributed to opposing transcription, the essential nature of J in Leishmania spp. is related to its role in gene repression rather than preventing transcriptional interference resulting from read through and dual strand transcription. PMID:27125778

  8. Recent advances in awakening silent biosynthetic gene clusters and linking orphan clusters to natural products in microorganisms.

    PubMed

    Chiang, Yi-Ming; Chang, Shu-Lin; Oakley, Berl R; Wang, Clay C C

    2011-02-01

    Secondary metabolites from microorganisms have a broad spectrum of applications, particularly in therapeutics. The growing number of sequenced microbial genomes has revealed a remarkably large number of natural product biosynthetic clusters for which the products are still unknown. These cryptic clusters are potentially a treasure house of medically useful compounds. The recent development of new methodologies has made it possible to begin unlock this treasure house, to discover new natural products and to determine their biosynthesis pathways. This review will highlight some of the most recent strategies to activate silent biosynthetic gene clusters and to elucidate their corresponding products and pathways.

  9. Horizontal Transfer and Death of a Fungal Secondary Metabolic Gene Cluster

    PubMed Central

    Campbell, Matthew A.; Rokas, Antonis; Slot, Jason C.

    2012-01-01

    A cluster composed of four structural and two regulatory genes found in several species of the fungal genus Fusarium (class Sordariomycetes) is responsible for the production of the red pigment bikaverin. We discovered that the unrelated fungus Botrytis cinerea (class Leotiomycetes) contains a cluster of five genes that is highly similar in sequence and gene order to the Fusarium bikaverin cluster. Synteny conservation, nucleotide composition, and phylogenetic analyses of the cluster genes indicate that the B. cinerea cluster was acquired via horizontal transfer from a Fusarium donor. Upon or subsequent to the transfer, the B. cinerea gene cluster became inactivated; one of the four structural genes is missing, two others are pseudogenes, and the fourth structural gene shows an accelerated rate of nonsynonymous substitutions along the B. cinerea lineage, consistent with relaxation of selective constraints. Interestingly, the bik4 regulatory gene is still intact and presumably functional, whereas bik5, which is a pathway-specific regulator, also shows a mild but significant acceleration of evolutionary rate along the B. cinerea lineage. This selective preservation of the bik4 regulator suggests that its conservation is due to its likely involvement in other non–bikaverin-related biological processes in B. cinerea. Thus, in addition to novel metabolism, horizontal transfer of wholesale metabolic gene clusters might also be contributing novel regulation. PMID:22294497

  10. A modified recombineering protocol for the genetic manipulation of gene clusters in Aspergillus fumigatus.

    PubMed

    Alcazar-Fuoli, Laura; Cairns, Timothy; Lopez, Jordi F; Zonja, Bozo; Pérez, Sandra; Barceló, Damià; Igarashi, Yasuhiro; Bowyer, Paul; Bignell, Elaine

    2014-01-01

    Genomic analyses of fungal genome structure have revealed the presence of physically-linked groups of genes, termed gene clusters, where collective functionality of encoded gene products serves a common biosynthetic purpose. In multiple fungal pathogens of humans and plants gene clusters have been shown to encode pathways for biosynthesis of secondary metabolites including metabolites required for pathogenicity. In the major mould pathogen of humans Aspergillus fumigatus, multiple clusters of co-ordinately upregulated genes were identified as having heightened transcript abundances, relative to laboratory cultured equivalents, during the early stages of murine infection. The aim of this study was to develop and optimise a methodology for manipulation of gene cluster architecture, thereby providing the means to assess their relevance to fungal pathogenicity. To this end we adapted a recombineering methodology which exploits lambda phage-mediated recombination of DNA in bacteria, for the generation of gene cluster deletion cassettes. By exploiting a pre-existing bacterial artificial chromosome (BAC) library of A. fumigatus genomic clones we were able to implement single or multiple intra-cluster gene replacement events at both subtelomeric and telomere distal chromosomal locations, in both wild type and highly recombinogenic A. fumigatus isolates. We then applied the methodology to address the boundaries of a gene cluster producing a nematocidal secondary metabolite, pseurotin A, and to address the role of this secondary metabolite in insect and mammalian responses to A. fumigatus challenge.

  11. A Modified Recombineering Protocol for the Genetic Manipulation of Gene Clusters in Aspergillus fumigatus

    PubMed Central

    Alcazar-Fuoli, Laura; Cairns, Timothy; Lopez, Jordi F.; Zonja, Bozo; Pérez, Sandra; Barceló, Damià; Igarashi, Yasuhiro; Bowyer, Paul; Bignell, Elaine

    2014-01-01

    Genomic analyses of fungal genome structure have revealed the presence of physically-linked groups of genes, termed gene clusters, where collective functionality of encoded gene products serves a common biosynthetic purpose. In multiple fungal pathogens of humans and plants gene clusters have been shown to encode pathways for biosynthesis of secondary metabolites including metabolites required for pathogenicity. In the major mould pathogen of humans Aspergillus fumigatus, multiple clusters of co-ordinately upregulated genes were identified as having heightened transcript abundances, relative to laboratory cultured equivalents, during the early stages of murine infection. The aim of this study was to develop and optimise a methodology for manipulation of gene cluster architecture, thereby providing the means to assess their relevance to fungal pathogenicity. To this end we adapted a recombineering methodology which exploits lambda phage-mediated recombination of DNA in bacteria, for the generation of gene cluster deletion cassettes. By exploiting a pre-existing bacterial artificial chromosome (BAC) library of A. fumigatus genomic clones we were able to implement single or multiple intra-cluster gene replacement events at both subtelomeric and telomere distal chromosomal locations, in both wild type and highly recombinogenic A. fumigatus isolates. We then applied the methodology to address the boundaries of a gene cluster producing a nematocidal secondary metabolite, pseurotin A, and to address the role of this secondary metabolite in insect and mammalian responses to A. fumigatus challenge. PMID:25372385

  12. Associations of serotonin receptor gene HTR3A, HTR3B, and HTR3A haplotypes with bipolar disorder in Chinese patients.

    PubMed

    Jian, J; Li, C; Xu, J; Qiao, D; Mi, G; Chen, X; Tang, M

    2016-09-16

    Single nucleotide polymorphisms (SNPs) in HTR3A and HTR3B have been reported to be associated with bipolar disorder in European and Japanese populations. We explored the roles of 21 tag SNPs in HTR3A and HTR3B in susceptibility to bipolar disorder in a Chinese cohort. Twenty-one Tag SNPs were genotyped in a study consisting of 130 patients with bipolar disorder, who visited Shandong Mental Health Center between June 2013 and May 2014, and 109 healthy individuals as controls. All of the tag SNPs were genotyped using Sequenom MassArray matrix-assisted laser desorption/ionization time of flight spectrometry. Plink 1.07, Haploview 4.2, and SPSS 20.0 were used for the analysis of the genotypes and the associations of the haplotypes with bipolar disorder. Association analyses of tag SNPs detected significant associations with the A allele in HTR3A rs1176719 (P = 0.030) and the C allele in HTR3A rs1176713 (P = 0.048). Haplotype-based association analyses indicated a statistically significant (P = 0.035) five-SNP haplotype (rs1062613:C, rs11604247:C, rs1176722:G, rs2276302:A, rs1176719:G) of linkage disequilibrium in block 3. Analysis of our small Chinese sample revealed a significant association of HTR3A with bipolar disorder, but yielded no evidence of an association between HTR3B and bipolar disorder. Furthermore, evidence for an association was found for a haplotype of HTR3A. Studies with larger Chinese samples are needed to verify our findings.

  13. Nonlinear biosynthetic gene cluster dose effect on penicillin production by Penicillium chrysogenum.

    PubMed

    Nijland, Jeroen G; Ebbendorf, Bjorg; Woszczynska, Marta; Boer, Rémon; Bovenberg, Roel A L; Driessen, Arnold J M

    2010-11-01

    Industrial penicillin production levels by the filamentous fungus Penicillium chrysogenum increased dramatically by classical strain improvement. High-yielding strains contain multiple copies of the penicillin biosynthetic gene cluster that encodes three key enzymes of the β-lactam biosynthetic pathway. We have analyzed the gene cluster dose effect on penicillin production using the high-yielding P. chrysogenum strain DS17690 that was cured from its native clusters. The amount of penicillin V produced increased with the penicillin biosynthetic gene cluster number but was saturated at high copy numbers. Likewise, transcript levels of the biosynthetic genes pcbAB [δ-(l-α-aminoadipyl)-l-cysteinyl-d-valine synthetase], pcbC (isopenicillin N synthase), and penDE (acyltransferase) correlated with the cluster copy number. Remarkably, the protein level of acyltransferase, which localizes to peroxisomes, was saturated already at low cluster copy numbers. At higher copy numbers, intracellular levels of isopenicillin N increased, suggesting that the acyltransferase reaction presents a limiting step at a high gene dose. Since the number and appearance of the peroxisomes did not change significantly with the gene cluster copy number, we conclude that the acyltransferase activity is limiting for penicillin biosynthesis at high biosynthetic gene cluster copy numbers. These results suggest that at a high penicillin production level, productivity is limited by the peroxisomal acyltransferase import activity and/or the availability of coenzyme A (CoA)-activated side chains.

  14. Improving the computational efficiency of recursive cluster elimination for gene selection.

    PubMed

    Luo, Lin-Kai; Huang, Deng-Feng; Ye, Ling-Jun; Zhou, Qi-Feng; Shao, Gui-Fang; Peng, Hong

    2011-01-01

    The gene expression data are usually provided with a large number of genes and a relatively small number of samples, which brings a lot of new challenges. Selecting those informative genes becomes the main issue in microarray data analysis. Recursive cluster elimination based on support vector machine (SVM-RCE) has shown the better classification accuracy on some microarray data sets than recursive feature elimination based on support vector machine (SVM-RFE). However, SVM-RCE is extremely time-consuming. In this paper, we propose an improved method of SVM-RCE called ISVM-RCE. ISVM-RCE first trains a SVM model with all clusters, then applies the infinite norm of weight coefficient vector in each cluster to score the cluster, finally eliminates the gene clusters with the lowest score. In addition, ISVM-RCE eliminates genes within the clusters instead of removing a cluster of genes when the number of clusters is small. We have tested ISVM-RCE on six gene expression data sets and compared their performances with SVM-RCE and linear-discriminant-analysis-based RFE (LDA-RFE). The experiment results on these data sets show that ISVM-RCE greatly reduces the time cost of SVM-RCE, meanwhile obtains comparable classification performance as SVM-RCE, while LDA-RFE is not stable.

  15. Genes for iron-sulphur cluster assembly are targets of abiotic stress in rice, Oryza sativa.

    PubMed

    Liang, Xuejiao; Qin, Lu; Liu, Peiwei; Wang, Meihuan; Ye, Hong

    2014-03-01

    Iron-sulphur (Fe-S) cluster assembly occurs in chloroplasts, mitochondria and cytosol, involving dozens of genes in higher plants. In this study, we have identified 41 putative Fe-S cluster assembly genes in rice (Oryza sativa) genome, and the expression of all genes was verified. To investigate the role of Fe-S cluster assembly as a metabolic pathway, we applied abiotic stresses to rice seedlings and analysed Fe-S cluster assembly gene expression by qRT-PCR. Our data showed that genes for Fe-S cluster assembly in chloroplasts of leaves are particularly sensitive to heavy metal treatments, and that Fe-S cluster assembly genes in roots were up-regulated in response to iron toxicity, oxidative stress and some heavy metal assault. The effect of each stress treatment on the Fe-S cluster assembly machinery demonstrated an unexpected tissue or organelle specificity, suggesting that the physiological relevance of the Fe-S cluster assembly is more complex than thought. Furthermore, our results may reveal potential candidate genes for molecular breeding of rice.

  16. Two four-marker haplotypes on 7q36.1 region indicate that the potassium channel gene HERG1 (KCNH2, Kv11.1) is related to schizophrenia: a case control study

    PubMed Central

    2010-01-01

    Background The pathobiology of schizophrenia is still unclear. Its current treatment mainly depends on antipsychotic drugs. A leading adverse effect of these medications is the acquired long QT syndrome, which results from the blockade of cardiac HERG1 channels (human ether-a-go-go-related gene potassium channels 1) by antipsychotic agents. The HERG1 channel is encoded by HERG1 (KCNH2, Kv11.1) gene and is most highly expressed in heart and brain. Genetic variations in HERG1 predispose to acquired long QT syndrome. We hypothesized that the blockade of HERG1 channels by antipsychotics might also be significant for their therapeutic mode of action, indicating a novel mechanism in the pathogenesis of schizophrenia. Methods We genotyped four single nucleotide polymorphisms (SNPs) in 7q36.1 region (two SNPs, rs1805123 and rs3800779, located on HERG1, and two SNPs, rs885684 and rs956642, at the 3'-downstream intergenic region) and then performed single SNP and haplotype association analyses in 84 patients with schizophrenia and 74 healthy controls after the exclusion of individuals having prolonged or shortened QT interval on electrocardiogram. Results Our analyses revealed that both genotype and allele frequencies of rs3800779 (c.307+585G>T) were significantly different between populations (P = 0.023 and P = 0.018, respectively). We also identified that two previously undescribed four-marker haplotypes which are nearly allelic opposite of each other and located in chr7:150225599-150302147bp position encompassing HERG1 were either overrepresented (A-A-A-T, the at-risk haplotype, P = 0.0007) or underrepresented (C-A-C-G, the protective haplotype, P = 0.005) in patients compared to controls. Conclusions Our results indicate that the potassium channel gene HERG1 is related to schizophrenia. Our findings may also implicate the whole family of HERG channels (HERG1, HERG2 and HERG3) in the pathogenesis of psychosis and its treatment. PMID:20507645

  17. Suggestive evidence for association between L-type voltage-gated calcium channel (CACNA1C) gene haplotypes and bipolar disorder in Latinos: a family-based association study

    PubMed Central

    Gonzalez, Suzanne; Xu, Chun; Ramirez, Mercedes; Zavala, Juan; Armas, Regina; Contreras, Salvador A; Contreras, Javier; Dassori, Albana; Leach, Robin J; Flores, Deborah; Jerez, Alvaro; Raventós, Henriette; Ontiveros, Alfonso; Nicolini, Humberto; Escamilla, Michael

    2013-01-01

    Objectives Through recent genome-wide association studies (GWAS), several groups have reported significant association between variants in the alpha 1C subunit of the L-type voltage-gated calcium channel (CACNA1C) and bipolar disorder (BP) in European and European-American cohorts. We performed a family-based association study to determine whether CACNA1C is associated with BP in the Latino population. Methods This study consisted of 913 individuals from 215 Latino pedigrees recruited from the United States, Mexico, Guatemala, and Costa Rica. The Illumina GoldenGate Genotyping Assay was used to genotype 58 single-nucleotide polymorphisms (SNPs) that spanned a 602.9 kb region encompassing the CACNA1C gene including two SNPs (rs7297582 and rs1006737) previously shown to associate with BP. Individual SNP and haplotype association analyses were performed using Family-Based Association Test (version 2.0.3) and Haploview (version 4.2) software. Results An eight-locus haplotype block that included these two markers showed significant association with BP (global marker permuted p = 0.0018) in the Latino population. For individual SNPs, this sample had insufficient power (10%) to detect associations with SNPs with minor effect (odds ratio = 1.15). Conclusions Although we were not able to replicate findings of association between individual CACNA1C SNPs rs7297582 and rs1006737 and BP, we were able to replicate the GWAS signal reported for CACNA1C through a haplotype analysis that encompassed these previously reported significant SNPs. These results provide additional evidence that CACNA1C is associated with BP and provides the first evidence that variations in this gene might play a role in the pathogenesis of this disorder in the Latino population. PMID:23437964

  18. Distribution of HLA-A, -B and -DRB1 Genes and Haplotypes in the Tujia Population Living in the Wufeng Region of Hubei Province, China

    PubMed Central

    Zhang, Li; Cheng, Dangxiao; Tao, Ning; Zhao, Min; Zhang, Fan; Yuan, Yulin; Qiu, Xiaoping

    2012-01-01

    Background The distribution of HLA alleles and haplotypes varies widely between different ethnic populations and geographic areas. Before any genetic marker can be used in a disease-associated study it is therefore essential to investigate allelic frequencies and establish a genetic database. Methodology/Principal Findings This is the first report of HLA typing in the Tujia group using the Luminex HLA-SSO method HLA–A, –B and -DRB1 allelic distributions were determined in 124 unrelated healthy Tujia individuals, and haplotypic frequencies and linkage disequilibrium parameters were estimated using the maximum-likelihood method. In total 10 alleles were detected at the HLA–A locus, 21 alleles at the HLA–B locus and 14 alleles at the HLA-DRB1 locus. The most frequently observed alleles in the HLA-I group were HLA–A*02 (35.48%), A*11 (28.23%), A*24 (15.73%); HLA–B*40 (25.00%), B*46 (16.13%), and B*15 (15.73%). Among HLA-DRB1 alleles, high frequencies of HLA-DRB1*09 (25.81%) were observed, followed by HLA-DRB1*15 (12.9%), and DRB1*12 (10.89%). The two-locus haplotypes at the highest frequency were A*02–B*46A (8.47%), followed by A*11–B*40 (7.66%), A*02–B*40 (8.87%), A*11–B*15 (6.45%), A*02–B*15 (6.05%), B*40–DRB1*09 (9.27%) and B*46–DRB1*09 (6.45%). The most common three-locus haplotypes found in the Tujia population were A*02–B*46–DRB1*09 (4.84%) and A*02–B*40–DRB1*09 (4.03%). Fourteen two-loci haplotypes had significant linkage disequilibrium. Construction of a neighbor-joining phylogenetic tree and principal component analysis using the allelic frequencies at HLA-A was performed to compare the Tujia group and twelve other previously reported populations. The Tujia population in the Wufeng of Hubei Province had the closest genetic relationship with the central Han population, and then to the Shui, the Miao, the southern Han and the northern Han ethnic groups. Conclusions/Significance These results will become a valuable source of data

  19. Arrangement of the Clostridium baratii F7 Toxin Gene Cluster with Identification of a σ Factor That Recognizes the Botulinum Toxin Gene Cluster Promoters

    DOE PAGES

    Dover, Nir; Barash, Jason R.; Burke, Julianne N.; ...

    2014-05-22

    Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bontmore » gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.« less

  20. Synergistic effect and VEGF/HSP70-hom haplotype analysis: relationship to prostate cancer risk and clinical outcome.

    PubMed

    Sfar, Sana; Saad, Hamadi; Mosbah, Faouzi; Chouchane, Lotfi

    2010-04-01

    Prostate cancer (PCa) is a complex disorder resulting from the combined effects of multiple environmental and genetic factors. Our previous single-locus analysis showed that VEGF and HSP70-hom polymorphisms were significantly associated with PCa susceptibility and prognosis. Both genes encoding these proteins were located on chromosome 6p21, and combining the neighboring single nucleotide polymorphisms (SNPs) into haplotypes may increase the association with the disease. Three tagging polymorphisms, the HSP70-hom 2437 T/C, the VEGF-1154 G/A, and the VEGF-634 G/C SNPs were genotyped in 101 cases and 80 controls. For the combined analysis of VEGF and HSP70-hom, we found a positive gradient in the odds ratios (ORs) related to the number of high-risk genotypes with a 3.53-fold increase of prostate carcinoma risk (OR = 3.53; p = 0.015). Furthermore, the TAG and CAG haplotypes at positions HSP70-hom, VEGF -1154 and VEGF -634 exhibited a two-fold (OR = 0.46; p = 0.014) and a seven-fold (OR = 0.14; p = 0.00005) reduction in PCa risk, respectively. Regarding PCa prognosis, the TAG haplotype had a negative association with the aggressive phenotype as defined by the histopathological grade (OR = 0.28; p = 0.006). Our findings confirm the role of at-risk haplotype across the HSP70-hom/VEGF gene cluster in determining susceptibility to PCa.

  1. Using Multi-Instance Hierarchical Clustering Learning System to Predict Yeast Gene Function

    PubMed Central

    Liao, Bo; Li, Yun; Jiang, Yan; Cai, Lijun

    2014-01-01

    Time-course gene expression datasets, which record continuous biological processes of genes, have recently been used to predict gene function. However, only few positive genes can be obtained from annotation databases, such as gene ontology (GO). To obtain more useful information and effectively predict gene function, gene annotations are clustered together to form a learnable and effective learning system. In this paper, we propose a novel multi-instance hierarchical clustering (MIHC) method to establish a learning system by clustering GO and compare this method with other learning system establishment methods. Multi-label support vector machine classifier and multi-label K-nearest neighbor classifier are used to verify these methods in four yeast time-course gene expression datasets. The MIHC method shows good performance, which serves as a guide to annotators or refines the annotation in detail. PMID:24621610

  2. The urease gene cluster of Vibrio parahaemolyticus does not influence the expression of the thermostable direct hemolysin (TDH) gene or the TDH-related hemolysin gene.

    PubMed

    Nakaguchi, Yoshitsugu; Okuda, Jun; Iida, Tetsuya; Nishibuchi, Mitsuaki

    2003-01-01

    In order to investigate why the thermostable direct hemolysin (TDH) and the TDH-related hemolysin (TRH) of Vibrio parahaemolyticus are produced at low levels from urease-positive strains, the effect of the functional urease gene cluster of V. parahaemolyticus on the expression of the tdh and trh genes was examined. Transcriptional lacZ fusions with the tdh1, tdh2, trh1 and trh2 genes representing variants of the tdh and trh genes were integrated into the chromosome of an Escherichia coli strain and a urease-negative V. parahaemolyticus strain. The plasmid-borne urease gene cluster introduced and expressed in these constructs did not affect expression of any of the fusion genes. The amount of TDH produced from a Kanagawa phenomenon-positive V. parahaemolyticus did not change by introduction of the urease gene cluster either. It was concluded therefore that the urease gene cluster is not involved in the regulation of tdh and trh expression.

  3. High-throughput platform for the discovery of elicitors of silent bacterial gene clusters

    PubMed Central

    Seyedsayamdost, Mohammad R.

    2014-01-01

    Over the past decade, bacterial genome sequences have revealed an immense reservoir of biosynthetic gene clusters, sets of contiguous genes that have the potential to produce drugs or drug-like molecules. However, the majority of these gene clusters appear to be inactive for unknown reasons prompting terms such as “cryptic” or “silent” to describe them. Because natural products have been a major source of therapeutic molecules, methods that rationally activate these silent clusters would have a profound impact on drug discovery. Herein, a new strategy is outlined for awakening silent gene clusters using small molecule elicitors. In this method, a genetic reporter construct affords a facile read-out for activation of the silent cluster of interest, while high-throughput screening of small molecule libraries provides potential inducers. This approach was applied to two cryptic gene clusters in the pathogenic model Burkholderia thailandensis. The results not only demonstrate a prominent activation of these two clusters, but also reveal that the majority of elicitors are themselves antibiotics, most in common clinical use. Antibiotics, which kill B. thailandensis at high concentrations, act as inducers of secondary metabolism at low concentrations. One of these antibiotics, trimethoprim, served as a global activator of secondary metabolism by inducing at least five biosynthetic pathways. Further application of this strategy promises to uncover the regulatory networks that activate silent gene clusters while at the same time providing access to the vast array of cryptic molecules found in bacteria. PMID:24808135

  4. A recently transferred cluster of bacterial genes in Trichomonas vaginalis - lateral gene transfer and the fate of acquired genes

    PubMed Central

    2014-01-01

    Background Lateral Gene Transfer (LGT) has recently gained recognition as an important contributor to some eukaryote proteomes, but the mechanisms of acquisition and fixation in eukaryotic genomes are still uncertain. A previously defined norm for LGTs in microbial eukaryotes states that the majority are genes involved in metabolism, the LGTs are typically localized one by one, surrounded by vertically inherited genes on the chromosome, and phylogenetics shows that a broad collection of bacterial lineages have contributed to the transferome. Results A unique 34 kbp long fragment with 27 clustered genes (TvLF) of prokaryote origin was identified in the sequenced genome of the protozoan parasite Trichomonas vaginalis. Using a PCR based approach we confirmed the presence of the orthologous fragment in four additional T. vaginalis strains. Detailed sequence analyses unambiguously suggest that TvLF is the result of one single, recent LGT event. The proposed donor is a close relative to the firmicute bacterium Peptoniphilus harei. High nucleotide sequence similarity between T. vaginalis strains, as well as to P. harei, and the absence of homologs in other Trichomonas species, suggests that the transfer event took place after the radiation of the genus Trichomonas. Some genes have undergone pseudogenization and degradation, indicating that they may not be retained in the future. Functional annotations reveal that genes involved in informational processes are particularly prone to degradation. Conclusions We conclude that, although the majority of eukaryote LGTs are single gene occurrences, they may be acquired in clusters of several genes that are subsequently cleansed of evolutionarily less advantageous genes. PMID:24898731

  5. Comparative sequence analysis of the potato cyst nematode resistance locus H1 reveals a major lack of co-linearity between three haplotypes in potato (Solanum tuberosum ssp.).

    PubMed

    Finkers-Tomczak, Anna; Bakker, Erin; de Boer, Jan; van der Vossen, Edwin; Achenbach, Ute; Golas, Tomasz; Suryaningrat, Suwardi; Smant, Geert; Bakker, Jaap; Goverse, Aska

    2011-02-01

    The H1 locus confers resistance to the potato cyst nematode Globodera rostochiensis pathotypes 1 and 4. It is positioned at the distal end of chromosome V of the diploid Solanum tuberosum genotype SH83-92-488 (SH) on an introgression segment derived from S. tuberosum ssp. andigena. Markers from a high-resolution genetic map of the H1 locus (Bakker et al. in Theor Appl Genet 109:146-152, 2004) were used to screen a BAC library to construct a physical map covering a 341-kb region of the resistant haplotype coming from SH. For comparison, physical maps were also generated of the two haplotypes from the diploid susceptible genotype RH89-039-16 (S. tuberosum ssp. tuberosum/S. phureja), spanning syntenic regions of 700 and 319 kb. Gene predictions on the genomic segments resulted in the identification of a large cluster consisting of variable numbers of the CC-NB-LRR type of R genes for each haplotype. Furthermore, the regions were interspersed with numerous transposable elements and genes coding for an extensin-like protein and an amino acid transporter. Comparative analysis revealed a major lack of gene order conservation in the sequences of the three closely related haplotypes. Our data provide insight in the evolutionary mechanisms shaping the H1 locus and will facilitate the map-based cloning of the H1 resistance gene.

  6. Engineered Streptomyces avermitilis host for heterologous expression of biosynthetic gene cluster for secondary metabolites

    PubMed Central

    KOMATSU, MAMORU; KOMATSU, KYOKO; KOIWAI, HANAE; YAMADA, YUUKI; KOZONE, IKUKO; IZUMIKAWA, MIHO; HASHIMOTO, JUNKO; TAKAGI, MOTOKI; OMURA, SATOSHI; SHIN-YA, KAZUO; CANE, DAVID E.; IKEDA, HARUO

    2014-01-01

    An industrial microorganism Streptomyces avermitilis, which is a producer of anthelmintic macrocyclic lactones, avermectins, has been constructed as a versatile model host for heterologous expression of genes encoding secondary metabolite biosynthesis. Twenty of the entire biosynthetic gene clusters for secondary metabolites were successively cloned and introduced into a versatile model host S. avermitilis SUKA17 or 22. Almost all S. avermitilis transformants carrying the entire gene cluster produced metabolites as a result of the expression of biosynthetic gene clusters introduced. A few transformants were unable to produce metabolites but their production was restored by the expression of biosynthetic genes using an alternative promoter or the expression of a regulatory gene in the gene cluster that controls the expression of biosynthetic genes in the cluster using an alternative promoter. Production of metabolites in some transformants of the versatile host was higher than that of the original producers and cryptic biosynthetic gene clusters in the original producer were also expressed in a versatile host. PMID:23654282

  7. Discordance, in a malignant hyperthermia pedigree, between in vitro contracture-test phenotypes and haplotypes for the MHS1 region on chromosome 19q12-13.2, comprising the C1840T transition in the RYR1 gene

    SciTech Connect

    Deufel, T.; Sudbrak, R.; Chesne, I.D.

    1995-06-01

    A point mutation in the gene encoding the skeletal muscle calcium release channel (RYR1) has been proposed as the probable cause of malignant hyperthermia (MH) in swine, where it segregates with the disease in all MH-prone strains investigated. The same C-to-T exchange in nucleotide position 1840 of the human RYR1 cDNA sequence was found in a few human MH pedigrees. We report a German MH pedigree where in vitro contracture test (IVCT) results and haplotypes of markers for the MHS1/RYR1 region including this base transition have yielded several discrepancies. The MH-susceptible phenotype was defined by IVCT performed according to the European standard protocol. Haplotypes were constructed for markers for the MHS1/RYR1 region on chromosome 19 and include the C1840T base exchange. Discussing the probabilities for a number of hypotheses to explain these data, we suggest that our results may challenge the causative role of this mutation - and possibly the role of the RYR1 gene itself - in human MH susceptibility, at least in some cases. 33 refs., 3 figs., 2 tabs.

  8. Improvement of gougerotin and nikkomycin production by engineering their biosynthetic gene clusters.

    PubMed

    Du, Deyao; Zhu, Yu; Wei, Junhong; Tian, Yuqing; Niu, Guoqing; Tan, Huarong

    2013-07-01

    Nikkomycins and gougerotin are peptidyl nucleoside antibiotics with broad biological activities. The nikkomycin biosynthetic gene cluster comprises one pathway-specific regulatory gene (sanG) and 21 structural genes, whereas the gene cluster for gougerotin biosynthesis includes one putative regulatory gene, one major facilitator superfamily transporter gene, and 13 structural genes. In the present study, we introduced sanG driven by six different promoters into Streptomyces ansochromogenes TH322. Nikkomycin production was increased significantly with the highest increase in engineered strain harboring hrdB promoter-driven sanG. In the meantime, we replaced the native promoter of key structural genes in the gougerotin (gou) gene cluster with the hrdB promoters. The heterologous producer Streptomyces coelicolor M1146 harboring the modified gene cluster produced gougerotin up to 10-fold more than strains carrying the unmodified cluster. Therefore, genetic manipulations of genes involved in antibiotics biosynthesis with the constitutive hrdB promoter present a robust, easy-to-use system generally useful for the improvement of antibiotics production in Streptomyces.

  9. Sphingolipids regulate telomere clustering by affecting the transcription of genes involved in telomere homeostasis.

    PubMed

    Ikeda, Atsuko; Muneoka, Tetsuya; Murakami, Suguru; Hirota, Ayaka; Yabuki, Yukari; Karashima, Takefumi; Nakazono, Kota; Tsuruno, Masahiro; Pichler, Harald; Shirahige, Katsuhiko; Kodama, Yukiko; Shimamoto, Toshi; Mizuta, Keiko; Funato, Kouichi

    2015-07-15

    In eukaryotic organisms, including mammals, nematodes and yeasts, the ends of chromosomes, telomeres are clustered at the nuclear periphery. Telomere clustering is assumed to be functionally important because proper organization of chromosomes is necessary for proper genome function and stability. However, the mechanisms and physiological roles of telomere clustering remain poorly understood. In this study, we demonstrate a role for sphingolipids in telomere clustering in the budding yeast Saccharomyces cerevisiae. Because abnormal sphingolipid metabolism causes downregulation of expression levels of genes involved in telomere organization, sphingolipids appear to control telomere clustering at the transcriptional level. In addition, the data presented here provide evidence that telomere clustering is required to protect chromosome ends from DNA-damage checkpoint signaling. As sphingolipids are found in all eukaryotes, we speculate that sphingolipid-based regulation of telomere clustering and the protective role of telomere clusters in maintaining genome stability might be conserved in eukaryotes.

  10. The B-type lamin is required for somatic repression of testis-specific gene clusters

    PubMed Central

    Shevelyov, Y. Y.; Lavrov, S. A.; Mikhaylova, L. M.; Nurminsky, I. D.; Kulathinal, R. J.; Egorova, K. S.; Rozovsky, Y. M.; Nurminsky, D. I.

    2009-01-01

    Large clusters of coexpressed tissue-specific genes are abundant on chromosomes of diverse species. The genes coordinately misexpressed in diverse diseases are also found in similar clusters, suggesting that evolutionarily conserved mechanisms regulate expression of large multigenic regions both in normal development and in its pathological disruptions. Studies on individual loci suggest that silent clusters of coregulated genes are embedded in repressed chromatin domains, often localized to the nuclear periphery. To test this model at the genome-wide scale, we studied transcriptional regulation of large testis-specific gene clusters in somatic tissues of Drosophila. These gene clusters showed a drastic paucity of known expressed transgene insertions, indicating that they indeed are embedded in repressed chromatin. Bioinformatics analysis suggested the major role for the B-type lamin, LamDmo, in repression of large testis-specific gene clusters, showing that in somatic cells as many as three-quarters of these clusters interact with LamDmo. Ablation of LamDmo by using mutants and RNAi led to detachment of testis-specific clusters from nuclear envelope and to their selective transcriptional up-regulation in somatic cells, thus providing the first direct evidence for involvement of the B-type lamin in tissue-specific gene repression. Finally, we found that transcriptional activation of the lamina-bound testis-specific gene cluster in male germ line is coupled with its translocation away from the nuclear envelope. Our studies, which directly link nuclear architecture with coordinated regulation of tissue-specific genes, advance understanding of the mechanisms underlying both normal cell differentiation and developmental disorders caused by lesions in the B-type lamins and interacting proteins. PMID:19218438

  11. Birth of Four Chimeric Plastid Gene Clusters in Japanese Umbrella Pine

    PubMed Central

    Hsu, Chih-Yao; Wu, Chung-Shien; Chaw, Shu-Miaw

    2016-01-01

    Many genes in the plastid genomes (plastomes) of plants are organized as gene clusters, in which genes are co-transcribed, resembling bacterial operons. These plastid operons are highly conserved, even among conifers, whose plastomes are highly rearranged relative to other seed plants. We have determined the complete plastome sequence of Sciadopitys verticillata (Japanese umbrella pine), the sole member of Sciadopityaceae. The Sciadopitys plastome is characterized by extensive inversions, pseudogenization of four tRNA genes after tandem duplications, and a unique pair of 370-bp inverted repeats involved in the formation of isomeric plastomes. We showed that plastomic inversions in Sciadopitys have led to shuffling of the remote conserved operons, resulting in the birth of four chimeric gene clusters. Our data also demonstrated that the relocated genes can be co-transcribed in these chimeric gene clusters. The plastome of Sciadopitys advances our current understanding of how the conifer plastomes have evolved toward increased diversity and complexity. PMID:27269365

  12. Improved efficiency in amplification of Escherichia coli o-antigen gene clusters using genome-wide sequence comparison

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...

  13. Comparative and Genetic Analyses of the Putative Vibrio cholerae Lipopolysaccharide Core Oligosaccharide Biosynthesis (wav) Gene Cluster

    PubMed Central

    Nesper, Jutta; Kraiß, Anita; Schild, Stefan; Blaβ, Julia; Klose, Karl E.; Bockemühl, Jochen; Reidl, Joachim

    2002-01-01

    We identified five different putative wav gene cluster types, which are responsible for the synthesis of the core oligosaccharide (OS) region of Vibrio cholerae lipopolysaccharide. Preliminary evidence that the genes encoded by this cluster are involved in core OS biosynthesis came from analysis of the recently released O1 El Tor V. cholerae genome sequence and sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis of O1 El Tor mutant strains defective in three genes (waaF, waaL, and wavB). Investigations of 38 different V. cholerae strains by Southern blotting, PCR, and sequencing analyses showed that the O1 El Tor wav gene cluster type is prevalent among clinical isolates of different serogroups associated with cholera and environmental O1 strains. In contrast, we found differences in the wav gene contents of 19 unrelated non-O1, non-O139 environmental and human isolates not associated with cholera. These strains contained four new wav gene cluster types that differ from each other in distinct gene loci, providing evidence for horizontal transfer of wav genes and for limited structural diversity of the core OS among V. cholerae isolates. Our results show genetic diversity in the core OS biosynthesis gene cluster and predominance of the type 1 wav gene locus in strains associated with clinical cholera, suggesting that a specific core OS structure could contribute to V. cholerae virulence. PMID:11953379

  14. Isolation and Characterization of the Gibberellin Biosynthetic Gene Cluster in Sphaceloma manihoticola▿ †

    PubMed Central

    Bömke, Christiane; Rojas, Maria Cecilia; Gong, Fan; Hedden, Peter; Tudzynski, Bettina

    2008-01-01

    Gibberellins (GAs) are tetracyclic diterpenoid phytohormones that were first identified as secondary metabolites of the fungus Fusarium fujikuroi (teleomorph, Gibberella fujikuroi). GAs were also found in the cassava pathogen Sphaceloma manihoticola, but the spectrum of GAs differed from that in F. fujikuroi. In contrast to F. fujikuroi, the GA biosynthetic pathway has not been studied in detail in S. manihoticola, and none of the GA biosynthetic genes have been cloned from the species. Here, we present the identification of the GA biosynthetic gene cluster from S. manihoticola consisting of five genes encoding a bifunctional ent-copalyl/ent-kaurene synthase (CPS/KS), a pathway-specific geranylgeranyl diphosphate synthase (GGS2), and three cytochrome P450 monooxygenases. The functions of all of the genes were analyzed either by a gene replacement approach or by complementing the corresponding F. fujikuroi mutants. The cluster organization and gene functions are similar to those in F. fujikuroi. However, the two border genes in the Fusarium cluster encoding the GA4 desaturase (DES) and the 13-hydroxylase (P450-3) are absent in the S. manihoticola GA gene cluster, consistent with the spectrum of GAs produced by this fungus. The close similarity between the two GA gene clusters, the identical gene functions, and the conserved intron positions suggest a common evolutionary origin despite the distant relatedness of the two fungi. PMID:18567680

  15. A rough set based rational clustering framework for determining correlated genes.

    PubMed

    Jeyaswamidoss, Jeba Emilyn; Thangaraj, Kesavan; Ramar, Kadarkarai; Chitra, Muthusamy

    2016-06-01

    Cluster analysis plays a foremost role in identifying groups of genes that show similar behavior under a set of experimental conditions. Several clustering algorithms have been proposed for identifying gene behaviors and to understand their significance. The principal aim of this work is to develop an intelligent rough clustering technique, which will efficiently remove the irrelevant dimensions in a high-dimensional space and obtain appropriate meaningful clusters. This paper proposes a novel biclustering technique that is based on rough set theory. The proposed algorithm uses correlation coefficient as a similarity measure to simultaneously cluster both the rows and columns of a gene expression data matrix and mean squared residue to generate the initial biclusters. Furthermore, the biclusters are refined to form the lower and upper boundaries by determining the membership of the genes in the clusters using mean squared residue. The algorithm is illustrated with yeast gene expression data and the experiment proves the effectiveness of the method. The main advantage is that it overcomes the problem of selection of initial clusters and also the restriction of one object belonging to only one cluster by allowing overlapping of biclusters.

  16. Modeling and visualizing uncertainty in gene expression clusters using dirichlet process mixtures.

    PubMed

    Rasmussen, Carl Edward; de la Cruz, Bernard J; Ghahramani, Zoubin; Wild, David L

    2009-01-01

    Although the use of clustering methods has rapidly become one of the standard computational approaches in the literature of microarray gene expression data, little attention has been paid to uncertainty in the results obtained. Dirichlet process mixture (DPM) models provide a nonparametric Bayesian alternative to the bootstrap approach to modeling uncertainty in gene expression clustering. Most previously published applications of Bayesian model-based clustering methods have been to short time series data. In this paper, we present a case study of the application of nonparametric Bayesian clustering methods to the clustering of high-dimensional nontime series gene expression data using full Gaussian covariances. We use the probability that two genes belong to the same cluster in a DPM model as a measure of the similarity of these gene expression profiles. Conversely, this probability can be used to define a dissimilarity measure, which, for the purposes of visualization, can be input to one of the standard linkage algorithms used for hierarchical clustering. Biologically plausible results are obtained from the Rosetta compendium of expression profiles which extend previously published cluster analyses of this data.

  17. Leveraging long sequencing reads to investigate R-gene clustering and variation in sugar beet

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Host-pathogen interactions are of prime importance to modern agriculture. Plants utilize various types of resistance genes to mitigate pathogen damage. Identification of the specific gene responsible for a specific resistance can be difficult due to duplication and clustering within R-gene families....

  18. Fine mapping analysis of HLA-DP/DQ gene clusters on chromosome 6 reveals multiple susceptibility loci for HBV infection.

    PubMed

    Tao, Jingjing; Su, Kunkai; Yu, Chengbo; Liu, Xiaoli; Wu, Wei; Xu, Wei; Jiang, Bingxun; Luo, Rui; Yao, Jian; Zhou, Jiawei; Zhan, Yan; Ye, Chao; Yuan, Wenji; Jiang, Xianzhong; Cui, Wenyan; Li, Ming D; Li, Lianjuan

    2015-12-01

    Recent genome-wide association studies have revealed the HLA region on chromosome 6p21 as a susceptibility locus for hepatitis B virus (HBV) infection, a finding subsequently replicated in independent samples. However, only limited single nucleotide polymorphisms (SNPs) were analyzed in most of these studies, and it remains to be determined which SNPs contribute to the detected association. After genotyping 140 SNPs within this genomic region in a total of 1657 HBV-positive patients and 1456 HBV-negative controls, we conducted a series of genetic epidemiological and bioinformatics analysis, including individual SNP-based association analysis, haplotype-based association analysis, and conditional analysis. We identified 76 SNPs and 5 LD blocks in HLA-DP/DQ clusters that are significantly associated with HBV infection, with the smallest P value being 3.88 × 10(-18) for rs9277535 in HLA-DPB1. With conditional analysis, we further revealed that the genes contributing to the effects of variants in HLA-DP/DQ on infection are independent of each other, and the LD block 5 in the 3'-UTR region of HLA-DPB1 had a predominant effect in the association of HLA-DP with HBV infection. We also found that the SNPs in the 3'-UTR region of HLA-DPB1 were significant between the subgroups of inactive HBV carrier, chronic hepatitis B, or hepatic cirrhosis from the case group and the spontaneous HBV-clearance subgroup from the control group. Finally, we did further association analysis of SNPs in this region with different subgroups from the case group, which revealed no association of these SNPs with the progression of HBV-related diseases. In sum, we showed, for the first time, that the HLA-DP/DQ clusters contribute independently to HBV infection, and the 3'-UTR region of HLA-DPB1 represents an important functional region involved in HBV infection.

  19. Significant interactions between maternal PAH exposure and haplotypes in candidate genes on B[a]P-DNA adducts in a NYC cohort of non-smoking African-American and Dominican mothers and newborns.

    PubMed

    Iyer, Shoba; Perera, Frederica; Zhang, Bingzhi; Chanock, Stephen; Wang, Shuang; Tang, Deliang

    2014-01-01

    Polycyclic aromatic hydrocarbons (PAH) are a class of chemicals common in the environment. Certain PAH are carcinogenic, although the degree to which genetic variation influences susceptibility to carcinogenic PAH remains unclear. Also unknown is the influence of genetic variation on the procarcinogenic effect of in utero exposures to PAH. Benzo[a]pyrene (B[a]P) is a well-studied PAH that is classified as a probable human carcinogen. Within our New York City-based cohort, we explored interactions between maternal exposure to airborne PAH during pregnancy and maternal and newborn haplotypes (and in one case, a single-nucleotide polymorphism) in key B[a]P metabolism genes on B[a]P-DNA adducts in paired cord blood samples. The study subjects included non-smoking African-American (n = 132) and Dominican (n = 235) women with available data on maternal PAH exposure, paired cord adducts and genetic data who resided in the Washington Heights, Central Harlem and South Bronx neighborhoods of New York City. We selected seven maternal and newborn genes related to B[a]P metabolism, detoxification and repair for our analyses: CYP1A1, CYP1A2, CYP1B1, GSTM3, GSTT2, NQO1 and XRCC1. We found significant interactions between maternal PAH exposure and haplotype on cord B[a]P-DNA adducts in the following genes: maternal CYP1B1, XRCC1 and GSTM3, and newborn CYP1A2 and XRCC1 in African-Americans; and maternal XRCC1 and newborn NQO1 in Dominicans. These novel findings highlight differences in maternal and newborn genetic contributions to B[a]P-DNA adduct formation, as well as ethnic differences in gene-environment interactions, and have the potential to identify at-risk subpopulations who are susceptible to the carcinogenic potential of B[a]P.

  20. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    PubMed

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E<10(-5)) are included in 27 clusters. Five clusters are associated with metabolism, containing P450 genes restricted to the Brassica family and predicted to be involved in secondary metabolism. Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary

  1. A cross-species bi-clustering approach to identifying conserved co-regulated genes

    PubMed Central

    Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

    2016-01-01

    Motivation: A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regularized genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective to deal with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, thus difficult to detect conserved co-regulated genes. Results: We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages that the clustered genes co-regulate. Efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on

  2. Picocyanobacteria containing a novel pigment gene cluster dominate the brackish water Baltic Sea.

    PubMed

    Larsson, John; Celepli, Narin; Ininbergs, Karolina; Dupont, Christopher L; Yooseph, Shibu; Bergman, Bigitta; Ekman, Martin

    2014-09-01

    Photoautotrophic picocyanobacteria harvest light via phycobilisomes (PBS) consisting of the pigments phycocyanin (PC) and phycoerythrin (PE), encoded by genes in conserved gene clusters. The presence and arrangement of these gene clusters give picocyanobacteria characteristic light absorption properties and allow the colonization of specific ecological niches. To date, a full understanding of the evolution and distribution of the PBS gene cluster in picocyanobacteria has been hampered by the scarcity of genome sequences from fresh- and brackish water-adapted strains. To remediate this, we analysed genomes assembled from metagenomic samples collected along a natural salinity gradient, and over the course of a growth season, in the Baltic Sea. We found that while PBS gene clusters in picocyanobacteria sampled in marine habitats were highly similar to known references, brackish-adapted genotypes harboured a novel type not seen in previously sequenced genomes. Phylogenetic analyses showed that the novel gene cluster belonged to a clade of uncultivated picocyanobacteria that dominate the brackish Baltic Sea throughout the summer season, but are uncommon in other examined aquatic ecosystems. Further, our data suggest that the PE genes were lost in the ancestor of PC-containing coastal picocyanobacteria and that multiple horizontal gene transfer events have re-introduced PE genes into brackish-adapted strains, including the novel clade discovered here.

  3. Extended major histocompatibility complex haplotypes in type I diabetes mellitus.

    PubMed Central

    Raum, D; Awdeh, Z; Yunis, E J; Alper, C A; Gabbay, K H

    1984-01-01

    We have studied major histocompatibility complex markers in Caucasian patients with type I diabetes mellitus and their families. The frequencies of extended haplotypes that were composed of specific HLA-B, HLA-DR, BF, C2, C4A, and C4B allelic combinations, which occurred more commonly than expected, were compared on random diabetic and normal chromosomes in the study families. We demonstrated that all of the previously recognized increases in HLA-B8, B18, B15, DR3, and perhaps DR4 could be ascribed to the increase among diabetic haplotypes of a few extended haplotypes: [HLA B8, DR3, SC01, GLO2]; [HLA-B18, DR3, F1C30]; [HLA-B15, DR4, SC33]; and [HLA-BW38, DR4, SC21]. In fact, HLA-DR3 on nonextended haplotypes was "protective", with a relative risk considerably less than 1.0. There was a paucity or absence among diabetic patients of several extended haplotypes of normal chromosomes, notably [HLA-B7, DR2, SC31] and [HLA-BW44, DR4, SC30]. The extended haplotype [HLA-BW38, DR4, SC21] is found only in Ashkenazi Jewish patients, which suggests that extended haplotypes mark specific mutations that arise in defined ethnic groups. The data show that no known MHC allele, including HLA-DR3 and possibly HLA-DR4, is per se a marker for or itself a susceptibility gene for type I diabetes. Rather, extended haplotypes, with relatively fixed alleles, are either carriers or noncarriers of susceptibility genes for this disease. Thus, the increased frequency (association) or the decreased frequency (protection) of individual MHC alleles is largely explainable by these extended haplotypes. PMID:6746903

  4. High or low correlation between co-occuring gene clusters and 16S rRNA gene phylogeny.

    PubMed

    Rudi, Knut; Sekelja, Monika

    2013-02-01

    Ribosomal RNA (rRNA) genes are universal for all living organisms. Yet, the correspondence between genome composition and rRNA phylogeny remains poorly known. The aim of this study was to use the information from genome sequence databases to address the correlation between rRNA gene phylogeny and total gene composition in bacteria. This was done by analysing 327 genomes with TIGRFAM functional gene annotations. Our approach consisted of two steps. First, we searched for discriminatory clusters of co-occurring genes. Using a multivariate statistical approach, we identified 11 such clusters which contain genes that were co-occurring only in a subset of genomes and contributed to explain the gene content differences between genome subsets. Second, we mapped the discovered clusters to 16S rRNA-based phylogeny and calculated the correlation between co-occuring genes and phylogeny. Six of the 11 clusters exhibited significant correlation with 16S rRNA gene phylogeny. The most distinct phylogenetic finding was a high correlation between iron-sulfur oxidoreductases in combination with carbon nitrogen ligases and Chlorobium. The other correlations identified covered relatively large phylogroups: Actinobacteria were positively associated with kinases, while Gammaproteobacteria were positively associated with methylases and acyltransferases. The suggested functional differences between higher phylogroups, however, need experimental verification.

  5. Efficient transfer of two large secondary metabolite pathway gene clusters into heterologous hosts by transposition

    PubMed Central

    Fu, Jun; Wenzel, Silke C.; Perlova, Olena; Wang, Junping; Gross, Frank; Tang, Zhiru; Yin, Yulong; Stewart, A. Francis; Zhang, Youming

    2008-01-01

    Horizontal gene transfer by transposition has been widely used for transgenesis in prokaryotes. However, conjugation has been preferred for transfer of large transgenes, despite greater restrictions of host range. We examine the possibility that transposons can be used to deliver large transgenes to heterologous hosts. This possibility is particularly relevant to the expression of large secondary metabolite gene clusters in various heterologous hosts. Recently, we showed that the engineering of large gene clusters like type I polyketide/nonribosomal peptide pathways for heterologous expression is no longer a bottleneck. Here, we apply recombineering to engineer either the epothilone (epo) or myxochromide S (mchS) gene cluster for transpositional delivery and expression in heterologous hosts. The 58-kb epo gene cluster was fully reconstituted from two clones by stitching. Then, the epo promoter was exchanged for a promoter active in the heterologous host, followed by engineering into the MycoMar transposon. A similar process was applied to the mchS gene cluster. The engineered gene clusters were transferred and expressed in the heterologous hosts Myxococcus xanthus and Pseudomonas putida. We achieved the largest transposition yet reported for any system and suggest that delivery by transposon will become the method of choice for delivery of large transgenes, particularly not only for metabolic engineering but also for general transgenesis in prokaryotes and eukaryotes. PMID:18701643

  6. Identification of Nocobactin NA Biosynthetic Gene Clusters in Nocardia farcinica▿ §

    PubMed Central

    Hoshino, Yasutaka; Chiba, Kazuhiro; Ishino, Keiko; Fukai, Toshio; Igarashi, Yasuhiro; Yazawa, Katsukiyo; Mikami, Yuzuru; Ishikawa, Jun

    2011-01-01

    We identified the biosynthetic gene clusters of the siderophore nocobactin NA. The nbt clusters, which were discovered as genes highly homologous to the mycobactin biosynthesis genes by the genomic sequencing of Nocardia farcinica IFM 10152, consist of 10 genes separately located at two genomic regions. The gene organization of the nbt clusters and the predicted functions of the nbt genes, particularly the cyclization and epimerization domains, were in good agreement with the chemical structure of nocobactin NA. Disruptions of the nbtA and nbtE genes, respectively, reduced and abolished the productivity of nocobactin NA. The heterologous expression of the nbtS gene revealed that this gene encoded a salicylate synthase. These results indicate that the nbt clusters are responsible for the biosynthesis of nocobactin NA. We also found putative IdeR-binding sequences upstream of the nbtA, -G, -H, -S, and -T genes, whose expression was more than 10-fold higher in the low-iron condition than in the high-iron condition. These results suggest that nbt genes are regulated coordinately by IdeR protein in an iron-dependent manner. The ΔnbtE mutant was found to be impaired in cytotoxicity against J774A.1 cells, suggesting that nocobactin NA production is required for virulence of N. farcinica. PMID:21097631

  7. Characterization of the fumonisin B2 biosynthetic gene cluster in Aspergillus niger and A. awamori.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Aspergillus niger and A. awamori strains isolated from grapes cultivated in Mediterranean basin were examined for fumonisin B2 (FB2) production and presence/absence of sequences within the fumonisin biosynthetic gene (fum) cluster. Presence of 13 regions in the fum cluster was evaluated by PCR assay...

  8. Comparative genomic analysis of secondary metabolite biosynthetic gene clusters in 207 isolates of Fusarium

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fusarium species are known for their ability to produce secondary metabolites (SMs), including plant hormones, pigments, mycotoxins, and other compounds with potential agricultural, pharmaceutical, and biotechnological impact. Understanding the distribution of SM biosynthetic gene clusters across th...

  9. Clusters of antibiotic resistance genes enriched together stay together in swine agriculture

    DOE PAGES

    Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong; ...

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundancemore » of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk.Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance

  10. Clusters of antibiotic resistance genes enriched together stay together in swine agriculture

    SciTech Connect

    Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong; Cole, James R.; Hashsham, Syed A.; Looft, Torey; Zhu, Yong -Guan; Tiedje, James M.

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk.Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of

  11. Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

    SciTech Connect

    Data Analysis and Visualization and the Department of Computer Science, University of California, Davis, One Shields Avenue, Davis CA 95616, USA,; nternational Research Training Group ``Visualization of Large and Unstructured Data Sets,'' University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA; Genomics Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA; Life Sciences Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley CA 94720, USA,; Computer Science Division,University of California, Berkeley, CA, USA,; Computer Science Department, University of California, Irvine, CA, USA,; All authors are with the Berkeley Drosophila Transcription Network Project, Lawrence Berkeley National Laboratory,; Rubel, Oliver; Weber, Gunther H.; Huang, Min-Yu; Bethel, E. Wes; Biggin, Mark D.; Fowlkes, Charless C.; Hendriks, Cris L. Luengo; Keranen, Soile V. E.; Eisen, Michael B.; Knowles, David W.; Malik, Jitendra; Hagen, Hans; Hamann, Bernd

    2008-05-12

    The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.

  12. Regulation of transcription of cell division genes in the Escherichia coli dcw cluster.

    PubMed

    Vicente, M; Gomez, M J; Ayala, J A

    1998-04-01

    The Escherichia coli dcw cluster contains cell division genes, such as the phylogenetically ubiquitous ftsZ, and genes involved in peptidoglycan synthesis. Transcription in the cluster proceeds in the same direction as the progress of the replication fork along the chromosome. Regulation is exerted at the transcriptional and post-transcriptional levels. The absence of transcriptional termination signals may, in principle, allow extension of the transcripts initiated at the up-stream promoter (mraZ1p) even to the furthest down-stream gene (envA). Complementation tests suggest that they extend into ftsW in the central part of the cluster. In addition, the cluster contains other promoters individually regulated by cis- and trans-acting signals. Dissociation of the expression of the ftsZ gene, located after ftsQ and A near the 3' end of the cluster, from its natural regulatory signals leads to an alteration in the physiology of cell division. The complexities observed in the regulation of gene expression in the cluster may then have an important biological role. Among them, LexA-binding SOS boxes have been found at the 5' end of the cluster, preceding promoters which direct the expression of ftsI (coding for PBP3, the penicillin-binding protein involved in septum formation). A gearbox promoter, ftsQ1p, forms part of the signals regulating the transcription of ftsQ, A and Z. It is an inversely growth-dependent mechanism driven by RNA polymerase containing sigma s, the factor involved in the expression of stationary phase-specific genes. Although the dcw cluster is conserved to a different extent in a variety of bacteria, the regulation of gene expression, the presence or absence of individual genes, and even the essentiality of some of them, show variations in the phylogenetic scale which may reflect adaptation to specific life cycles.

  13. A stationary wavelet entropy-based clustering approach accurately predicts gene expression.

    PubMed

    Nguyen, Nha; Vo, An; Choi, Inchan; Won, Kyoung-Jae

    2015-03-01

    Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation.

  14. Characterization of the ars gene cluster from extremely arsenic-resistant Microbacterium sp. strain A33.

    PubMed

    Achour-Rokbani, Asma; Cordi, Audrey; Poupin, Pascal; Bauda, Pascale; Billard, Patrick

    2010-02-01

    The arsenic resistance gene cluster of Microbacterium sp. A33 contains a novel pair of genes (arsTX) encoding a thioredoxin system that are cotranscribed with an unusual arsRC2 fusion gene, ACR3, and arsC1 in an operon divergent from arsC3. The whole ars gene cluster is required to complement an Escherichia coli ars mutant. ArsRC2 negatively regulates the expression of the pentacistronic operon. ArsC1 and ArsC3 are related to thioredoxin-dependent arsenate reductases; however, ArsC3 lacks the two distal catalytic cysteine residues of this class of enzymes.

  15. Significant interactions between maternal PAH exposure and haplotypes in candidate genes on B[a]P-DNA adducts in a NYC cohort of non-smoking African-American and Dominican mothers and newborns

    PubMed Central

    Tang, Deliang

    2014-01-01

    Polycyclic aromatic hydrocarbons (PAH) are a class of chemicals common in the environment. Certain PAH are carcinogenic, although the degree to which genetic variation influences susceptibility to carcinogenic PAH remains unclear. Also unknown is the influence of genetic variation on the procarcinogenic effect of in utero exposures to PAH. Benzo[a]pyrene (B[a]P) is a well-studied PAH that is classified as a probable human carcinogen. Within our New York City-based cohort, we explored interactions between maternal exposure to airborne PAH during pregnancy and maternal and newborn haplotypes (and in one case, a single-nucleotide polymorphism) in key B[a]P metabolism genes on B[a]P-DNA adducts in paired cord blood samples. The study subjects included non-smoking African-American (n = 132) and Dominican (n = 235) women with available data on maternal PAH exposure, paired cord adducts and genetic data who resided in the Washington Heights, Central Harlem and South Bronx neighborhoods of New York City. We selected seven maternal and newborn genes related to B[a]P metabolism, detoxification and repair for our analyses: CYP1A1, CYP1A2, CYP1B1, GSTM3, GSTT2, NQO1 and XRCC1. We found significant interactions between maternal PAH exposure and haplotype on cord B[a]P-DNA adducts in the following genes: maternal CYP1B1, XRCC1 and GSTM3, and newborn CYP1A2 and XRCC1 in African-Americans; and maternal XRCC1 and newborn NQO1 in Dominicans. These novel findings highlight differences in maternal and newborn genetic contributions to B[a]P-DNA adduct formation, as well as ethnic differences in gene–environment interactions, and have the potential to identify at-risk subpopulations who are susceptible to the carcinogenic potential of B[a]P. PMID:24177223

  16. Yeast homologous recombination-based promoter engineering for the activation of silent natural product biosynthetic gene clusters.

    PubMed

    Montiel, Daniel; Kang, Hahk-Soo; Chang, Fang-Yuan; Charlop-Powers, Zachary; Brady, Sean F

    2015-07-21

    Large-scale sequencing of prokaryotic (meta)genomic DNA suggests that most bacterial natural product gene clusters are not expressed under common laboratory culture conditions. Silent gene clusters represent a promising resource for natural product discovery and the development of a new generation of therapeutics. Unfortunately, the characterization of molecules encoded by these clusters is hampered owing to our inability to express these gene clusters in the laboratory. To address this bottleneck, we have developed a promoter-engineering platform to transcriptionally activate silent gene clusters in a model heterologous host. Our approach uses yeast homologous recombination, an auxotrophy complementation-based yeast selection system and sequence orthogonal promoter cassettes to exchange all native promoters in silent gene clusters with constitutively active promoters. As part of this platform, we constructed and validated a set of bidirectional promoter cassettes consisting of orthogonal promoter sequences, Streptomyces ribosome binding sites, and yeast selectable marker genes. Using these tools we demonstrate the ability to simultaneously insert multiple promoter cassettes into a gene cluster, thereby expediting the reengineering process. We apply this method to model active and silent gene clusters (rebeccamycin and tetarimycin) and to the silent, cryptic pseudogene-containing, environmental DNA-derived Lzr gene cluster. Complete promoter refactoring and targeted gene exchange in this "dead" cluster led to the discovery of potent indolotryptoline antiproliferative agents, lazarimides A and B. This potentia