adh gene cluster: Topics by Science.gov

Sample records for adh gene cluster

Genome-Wide Significant Association between Alcohol Dependence and a Variant in the ADH Gene Cluster

PubMed Central

Frank, Josef; Cichon, Sven; Treutlein, Jens; Ridinger, Monika; Mattheisen, Manuel; Hoffmann, Per; Herms, Stefan; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Mössner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Scherbaum, Norbert; Schmäl, Christine; Steffens, Michael; Lucae, Susanne; Ising, Marcus; Müller-Myhsok, Bertram; Nöthen, Markus M; Mann, Karl; Kiefer, Falk; Rietschel, Marcella

2011-01-01

Alcohol dependence (AD) is an important contributory factor to the global burden of disease. The etiology of AD involves both environmental and genetic factors, and the disorder has a heritability of around 50%. The aim of the present study was to identify susceptibility genes for AD by performing a genome-wide association study (GWAS). The sample comprised 1,333 male in-patients with severe DSM-IV AD and 2,168 controls. These included 487 patients and 1,358 controls from a previous GWAS study by our group. All individuals were of German descent. Single marker tests and a polygenic score based analysis to assess the combined contribution of multiple markers with small effects were performed. The SNP rs1789891, which is located between the ADH1B and ADH1C genes, achieved genome-wide significance (p=1.27E–8; OR=1.46). Other markers from this region were also associated with AD, and conditional analyses indicated that these made a partially independent contribution. The SNP rs1789891 is in complete linkage disequilibrium with the functional Arg272Gln variant (p=1.24E–7, OR=1.31) of the ADH1C gene, which has been reported to modify the rate of ethanol oxidation to acetaldehyde in vitro. A polygenic score based approach produced a significant result (p=9.66E–9). This is the first GWAS of AD to provide genome-wide significant support for the role of the ADH gene cluster and to suggest a polygenic component to the etiology of AD. The latter result suggests that many more AD susceptibility genes still await identification. PMID:22004471
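The abstract above does not say how its polygenic score was built. One common construction, sketched here as an assumption rather than the authors' method, sums each individual's risk-allele counts weighted by the log odds ratio estimated in a discovery sample. The SNP names and odds ratios below are hypothetical placeholders, not values from the study.

```python
import math

def polygenic_score(allele_counts, odds_ratios):
    """Weighted sum of risk-allele counts; each weight is a log odds ratio.

    allele_counts: dict mapping SNP id -> 0, 1, or 2 risk alleles.
    odds_ratios:   dict mapping SNP id -> odds ratio from a discovery sample.
    SNPs missing from odds_ratios are skipped.
    """
    score = 0.0
    for snp, count in allele_counts.items():
        if snp in odds_ratios:
            score += count * math.log(odds_ratios[snp])
    return score

# Hypothetical discovery-sample odds ratios (illustrative only).
odds_ratios = {"snp_a": 1.20, "snp_b": 0.90, "snp_c": 1.05}
# Hypothetical risk-allele counts for one individual.
person = {"snp_a": 2, "snp_b": 1, "snp_c": 0}

print(round(polygenic_score(person, odds_ratios), 4))
```

In practice such scores are computed per individual in an independent target sample and then tested for association with case status.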
Ascidian and amphioxus Adh genes correlate functional and molecular features of the ADH family expansion during vertebrate evolution.

PubMed

Cañestro, Cristian; Albalat, Ricard; Hjelmqvist, Lars; Godoy, Laura; Jörnvall, Hans; Gonzàlez-Duarte, Roser

2002-01-01

The alcohol dehydrogenase (ADH) family has evolved into at least eight ADH classes during vertebrate evolution. We have characterized three prevertebrate forms of the parent enzyme of this family, including one from an urochordate (Ciona intestinalis) and two from cephalochordates (Branchiostoma floridae and Branchiostoma lanceolatum). An evolutionary analysis of the family was performed gathering data from protein and gene structures, exon-intron distribution, and functional features through chordate lines. Our data strongly support that the ADH family expansion occurred 500 million years ago, after the cephalochordate/vertebrate split, probably in the gnathostome subphylum line of the vertebrates. Evolutionary rates differ between the ancestral, ADH3 (glutathione-dependent formaldehyde dehydrogenase), and the emerging forms, including the classical alcohol dehydrogenase, ADH1, which has an evolutionary rate 3.6-fold that of the ADH3 form. Phylogenetic analysis and chromosomal mapping of the vertebrate Adh gene cluster suggest that family expansion took place by tandem duplications, probably concurrent with the extensive isoform burst observed before the fish/tetrapod split, rather than through the large-scale genome duplications also postulated in early vertebrate evolution. The absence of multifunctionality in lower chordate ADHs and the structures compared argue in favor of the acquisition of new functions in vertebrate ADH classes. Finally, comparison between B. floridae and B. lanceolatum Adhs provides the first estimate for a cephalochordate speciation, 190 million years ago, probably concomitant with the beginning of the drifting of major land masses from Pangea.
Rare ADH Variant Constellations are Specific for Alcohol Dependence

PubMed Central

Zuo, Lingjun; Zhang, Heping; Malison, Robert T.; Li, Chiang-Shan R.; Zhang, Xiang-Yang; Wang, Fei; Lu, Lingeng; Lu, Lin; Wang, Xiaoping; Krystal, John H.; Zhang, Fengyu; Deng, Hong-Wen; Luo, Xingguang

2013-01-01

Aims: Some of the well-known functional alcohol dehydrogenase (ADH) gene variants (e.g. ADH1B*2, ADH1B*3 and ADH1C*2) that significantly affect the risk of alcohol dependence are rare variants in most populations. In the present study, we comprehensively examined the associations between rare ADH variants [minor allele frequency (MAF) <0.05] and alcohol dependence, with several other neuropsychiatric and neurological disorders as reference. Methods: A total of 49,358 subjects in 22 independent cohorts with 11 different neuropsychiatric and neurological disorders were analyzed, including 3 cohorts with alcohol dependence. The entire ADH gene cluster (ADH7–ADH1C–ADH1B–ADH1A–ADH6–ADH4–ADH5 at Chr4) was imputed in all samples using the same reference panels that included whole-genome sequencing data. We stringently cleaned the phenotype and genotype data to obtain a total of 870 single nucleotide polymorphisms with 0< MAF <0.05 for association analysis. Results: We found that a rare variant constellation across the entire ADH gene cluster was significantly associated with alcohol dependence in European-Americans (Fp1: simulated global P = 0.045), European-Australians (Fp5: global P = 0.027; collapsing: P = 0.038) and African-Americans (Fp5: global P = 0.050; collapsing: P = 0.038), but not with any other neuropsychiatric disease. Association signals in this region came principally from ADH6, ADH7, ADH1B and ADH1C. In particular, a rare ADH6 variant constellation showed a replicable association with alcohol dependence across these three independent cohorts. No individual rare variants were statistically significantly associated with any disease examined after group- and region-wide correction for multiple comparisons. Conclusion: We conclude that rare ADH variants are specific for alcohol dependence. The ADH gene cluster may harbor a causal variant(s) for alcohol dependence. PMID:23019235
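The rare-variant filter (0 < MAF < 0.05) and the "collapsing" analysis mentioned in this abstract can be illustrated with a minimal sketch. It assumes genotypes are already coded as minor-allele counts (0, 1, or 2) and uses a simple carrier indicator as the collapsed variable. The toy genotype matrix is invented, and the study's actual imputation, cleaning, and test statistics (Fp1, Fp5) are not reproduced here.

```python
def minor_allele_frequency(minor_counts):
    """MAF for one variant, given per-individual minor-allele counts (0/1/2)."""
    return sum(minor_counts) / (2 * len(minor_counts))

def rare_variants(genotypes, threshold=0.05):
    """Return column indices of variants with 0 < MAF < threshold.

    genotypes[i][j] is the minor-allele count of individual i at variant j.
    """
    n_variants = len(genotypes[0])
    return [
        j for j in range(n_variants)
        if 0 < minor_allele_frequency([row[j] for row in genotypes]) < threshold
    ]

def collapse(genotypes, variants):
    """1 if an individual carries a minor allele at any selected variant, else 0."""
    return [int(any(row[j] > 0 for j in variants)) for row in genotypes]

# Toy data (hypothetical): 20 individuals, 4 variants.
genotypes = [[0, 0, 0, 0] for _ in range(20)]
genotypes[0][0] = 1            # variant 0: one carrier, MAF = 1/40
for i in range(10):
    genotypes[i][1] = 1        # variant 1: common, MAF = 10/40
genotypes[5][3] = 1            # variant 3: one carrier; variant 2 is monomorphic

rare = rare_variants(genotypes)
carriers = collapse(genotypes, rare)
print(rare, sum(carriers))
```

The collapsed indicator would then be compared between cases and controls, which pools the signal from variants that are individually too rare to test.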
Multiway real-time PCR gene expression profiling in yeast Saccharomyces cerevisiae reveals altered transcriptional response of ADH-genes to glucose stimuli.

PubMed

Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael

2008-04-16

The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. We measured the expression of 18 genes as function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains.
Multiway real-time PCR gene expression profiling in yeast Saccharomyces cerevisiae reveals altered transcriptional response of ADH-genes to glucose stimuli

PubMed Central

Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael

2008-01-01

Background The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. Results We measured the expression of 18 genes as function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Conclusion Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains. PMID:18412983
The bifunctional alcohol and aldehyde dehydrogenase gene, adhE, is necessary for ethanol production in Clostridium thermocellum and Thermoanaerobacterium saccharolyticum.

PubMed

Lo, Jonathan; Zheng, Tianyong; Hon, Shuen; Olson, Daniel G; Lynd, Lee R

2015-04-01

Thermoanaerobacterium saccharolyticum and Clostridium thermocellum are anaerobic thermophilic bacteria being investigated for their ability to produce biofuels from plant biomass. The bifunctional alcohol and aldehyde dehydrogenase gene, adhE, is present in these bacteria and has been known to be important for ethanol formation in other anaerobic alcohol producers. This study explores the inactivation of the adhE gene in C. thermocellum and T. saccharolyticum. Deletion of adhE reduced ethanol production by >95% in both T. saccharolyticum and C. thermocellum, confirming that adhE is necessary for ethanol formation in both organisms. In both adhE deletion strains, fermentation products shifted from ethanol to lactate production and resulted in lower cell density and longer time to reach maximal cell density. In T. saccharolyticum, the adhE deletion strain lost >85% of alcohol dehydrogenase (ADH) activity. Aldehyde dehydrogenase (ALDH) activity did not appear to be affected, although ALDH activity was low in cell extracts. Adding ubiquinone-0 to the ALDH assay increased activity in the T. saccharolyticum parent strain but did not increase activity in the adhE deletion strain, suggesting that ALDH activity was inhibited. In C. thermocellum, the adhE deletion strain lost >90% of ALDH and ADH activity in cell extracts. The C. thermocellum adhE deletion strain contained a point mutation in the lactate dehydrogenase gene, which appears to deregulate its activation by fructose 1,6-bisphosphate, leading to constitutive activation of lactate dehydrogenase. Thermoanaerobacterium saccharolyticum and Clostridium thermocellum are bacteria that have been investigated for their ability to produce biofuels from plant biomass. They have been engineered to produce higher yields of ethanol, yet questions remain about the enzymes responsible for ethanol formation in these bacteria. The genomes of these bacteria encode multiple predicted aldehyde and alcohol dehydrogenases which could be
Molecular phylogeny and evolution of alcohol dehydrogenase (Adh) genes in legumes

PubMed Central

Fukuda, Tatsuya; Yokoyama, Jun; Nakamura, Toru; Song, In-Ja; Ito, Takuro; Ochiai, Toshinori; Kanno, Akira; Kameya, Toshiaki; Maki, Masayuki

2005-01-01

Background Nuclear genes determine the vast range of phenotypes that are responsible for the adaptive abilities of organisms in nature. Nevertheless, the evolutionary processes that generate the structures and functions of nuclear genes are only now becoming understood. The aim of our study is to isolate the alcohol dehydrogenase (Adh) genes in two distantly related legumes, and use these sequences to examine the molecular evolutionary history of this nuclear gene. Results We isolated the expressed Adh genes from two species of legumes, Sophora flavescens Ait. and Wisteria floribunda DC., by a RT-PCR based approach and found a new Adh locus in addition to homologues of the Adh genes found previously in legumes. To examine the evolution of these genes, we compared the species and gene trees and found gene duplication of the Adh loci in the legumes occurred as an ancient event. Conclusion This is the first report revealing that some legume species have at least two Adh gene loci belonging to separate clades. Phylogenetic analyses suggest that these genes resulted from relatively ancient duplication events. PMID:15836788
Screening of Two ADH4 Variations in a Swedish Cluster Headache Case–Control Material

PubMed Central

Fourier, Carmen; Ran, Caroline; Steinberg, Anna; Sjöstrand, Christina; Waldenlind, Elisabet

2016-01-01

Background Cluster headache (CH) is a severe neurovascular disorder and an increasing amount of evidence points to a genetic contribution to this disease. When CH was first described, it was observed that alcohol may precipitate an attack during the active phase of the disease. The alcohol dehydrogenase 4 (ADH4) gene encodes an enzyme which contributes to the metabolization of alcohol and is, therefore, an interesting candidate gene for CH. Two Italian groups have reported association of the single nucleotide polymorphism (SNP) rs1126671 located in the ADH4 gene with an increased risk of CH in Italy. In addition, one of the groups found an association between the ADH4 SNP rs1800759 and CH. Objective To perform a replication study on the ADH4 SNPs rs1126671 and rs1800759 in a large homogeneous Swedish case–control cohort in order to further investigate the possible contribution of ADH4 to CH. Methods A total of 390 unrelated patients diagnosed with CH and 389 controls representing a general Swedish population were recruited to the study. DNA samples from patients and controls were genotyped for the two ADH4 SNPs rs1126671 and rs1800759 using quantitative real‐time polymerase chain reaction. Statistical analyses of genotype, allele and haplotype frequencies for the two SNPs were performed and compared between patients and controls. Results For rs1126671, the minor allele frequency (A allele) was 32.8% (n = 254) in controls compared with 31.9% (n = 249) in CH patients. The minor allele frequency (A allele) of rs1800759 was 42.3% (n = 324) in controls and 41.9% (n = 327) in CH patients. Statistical analysis showed no significant differences in allele as well as in genotype or haplotype frequencies between the patient and control group for either SNP. This was also seen after stratifying the patient group for experiencing alcohol as a trigger factor. Conclusions The data did not support an association of the ADH4 SNPs rs1126671 and rs1800759 with CH
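The allele-frequency comparison reported above can be approximated with a Pearson chi-square test on a 2x2 table of allele counts. This is a sketch under stated assumptions: the abstract does not name the test used, and it reports only minor-allele counts, so the total allele numbers below (2 × 390 patients, 2 × 389 controls) assume every participant was successfully genotyped, which the reported percentages suggest is not exactly the case.

```python
import math

def allele_chi_square(case_minor, case_total, control_minor, control_total):
    """Pearson chi-square (1 df, no continuity correction) on allele counts.

    Returns (chi2, p_value). For 1 degree of freedom the upper-tail
    probability equals erfc(sqrt(chi2 / 2)).
    """
    a, b = case_minor, case_total - case_minor
    c, d = control_minor, control_total - control_minor
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# rs1126671 A-allele counts from the abstract; totals are an assumption
# (complete genotyping of 390 patients and 389 controls).
chi2, p = allele_chi_square(249, 2 * 390, 254, 2 * 389)
print(f"chi2 = {chi2:.3f}, p = {p:.3f}")
```

Under these assumed totals the test is far from significant, in line with the abstract's conclusion of no allelic association.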
Structure, Expression, Chromosomal Location and Product of the Gene Encoding Adh2 in Petunia

PubMed Central

Gregerson, R. G.; Cameron, L.; McLean, M.; Dennis, P.; Strommer, J.

1993-01-01

In most higher plants the genes encoding alcohol dehydrogenase comprise a small gene family, usually with two members. The Adh1 gene of Petunia has been cloned and analyzed, but a second identifiable gene was not recovered from any of three genomic libraries. We have therefore employed the polymerase chain reaction to obtain the major portion of a second Adh gene. From sequence, mapping and northern data we conclude this gene encodes ADH2, the major anaerobically inducible Adh gene of Petunia. The availability of both Adh1 and Adh2 from Petunia has permitted us to compare their structures and patterns of expression to those of the well-studied Adh genes of maize, of which one is highly expressed developmentally, while both are induced in response to hypoxia. Despite their evolutionary distance, evidenced by deduced amino acid sequence as well as taxonomic classification, the pairs of genes are regulated in strikingly similar ways in maize and Petunia. Our findings suggest a significant biological basis for the regulatory strategy employed by these distant species for differential expression of multiple Adh genes. PMID:8096485
Temperature and water loss affect ADH activity and gene expression in grape berry during postharvest dehydration.

PubMed

Cirilli, Marco; Bellincontro, Andrea; De Santis, Diana; Botondi, Rinaldo; Colao, Maria Chiara; Muleo, Rosario; Mencarelli, Fabio

2012-05-01

Clusters of Aleatico wine grape were picked at 18°Brix and placed at 10, 20, or 30°C, 45% relative humidity (RH) and 1.5m/s of air flow to dehydrate the berries up to 40% of loss of initial fresh weight. Sampling was done at 0%, 10%, 20%, 30%, and 40% weight loss (wl). ADH (alcohol dehydrogenase) gene expression, enzyme activity, and related metabolites were analysed. At 10°C, acetaldehyde increased rapidly and then declined, while ethanol continued to rise. At 20°C, acetaldehyde and ethanol increased significantly with the same pattern and declined at 40%wl. At 30°C, acetaldehyde did not increase but ethanol increased rapidly already at 10%wl. At the latter temperature, a significant increase in acetic acid and ethyl acetate occurred, while at 10°C their values were low. At 30°C, the ADH activity (ethanol to acetaldehyde direction), increased rapidly but acetaldehyde did not rise because of its oxidation to acetic acid, which increased together with ethyl acetate. At 10°C, the ADH activity increased at 20%wl and continued to rise even at 40%wl, meaning that ethanol oxidation was delayed. At 20°C, the behaviour was intermediate to the other temperatures. The relative expression of the VvAdh2 gene was the highest at 10°C already at 10%wl in a synchrony with the ADH activity, indicating a rapid response likely due to low temperature. The expression subsequently declined. At 20 and 30°C, the expression was lower and increased slightly during dehydration in combination with the ADH activity. This imbalance between gene expression and ADH activity at 10°C, as well as the unexpected expression of the carotenoid cleavage dioxygenase 1 (CCD1) gene, opens the discussion on the stress sensitivity and transcription event during postharvest dehydration, and the importance of carefully monitoring temperature during dehydration. Copyright © 2011 Elsevier Ltd. All rights reserved.
Effects of glucose, ethanol and acetic acid on regulation of ADH2 gene from Lachancea fermentati.

PubMed

Yaacob, Norhayati; Mohamad Ali, Mohd Shukuri; Salleh, Abu Bakar; Abdul Rahman, Nor Aini

2016-01-01

Background. Not all yeast alcohol dehydrogenase 2 (ADH2) are repressed by glucose, as reported in Saccharomyces cerevisiae. Pichia stipitis ADH2 is regulated by oxygen instead of glucose, whereas Kluyveromyces marxianus ADH2 is regulated by neither glucose nor ethanol. For this reason, ADH2 regulation of yeasts may be species dependent, leading to a different type of expression and fermentation efficiency. Lachancea fermentati is a highly efficient, fast-growing ethanol producer adapted to fermentation-related stresses such as ethanol and organic acid, but the metabolic information regarding the regulation of glucose and ethanol production is still lacking. Methods. Our investigation started with the stimulation of ADH2 activity from S. cerevisiae and L. fermentati by glucose and ethanol induction in a glucose-repressed medium. The study also embarked on the retrospective analysis of ADH2 genomic and protein level through direct sequencing and sites identification. Based on the sequence generated, we demonstrated ADH2 gene expression highlighting the conserved NAD(P)-binding domain in the context of glucose fermentation and ethanol production. Results. An increase of ADH2 activity was observed in starved L. fermentati (LfeADH2) and S. cerevisiae (SceADH2) in response to 2% (w/v) glucose induction. These results suggest that in the presence of glucose, ADH2 activity was activated instead of being repressed. An induction of 0.5% (v/v) ethanol also increased LfeADH2 activity, promoting ethanol resistance, whereas accumulating acetic acid at a later stage of fermentation stimulated ADH2 activity and enhanced glucose consumption rates. The lack of upstream activating sequence (UAS) and TATA elements hindered the possibility of Adr1 binding to LfeADH2. Transcription factors such as SP1 and RAP1 observed in LfeADH2 sequence have been implicated in the regulation of many genes including ADH2. In glucose fermentation, L. fermentati exhibited a bell-shaped ADH2
Cloning of the Arabidopsis and Rice Formaldehyde Dehydrogenase Genes: Implications for the Origin of Plant Adh Enzymes

PubMed Central

Dolferus, R.; Osterman, J. C.; Peacock, W. J.; Dennis, E. S.

1997-01-01

This article reports the cloning of the genes encoding the Arabidopsis and rice class III ADH enzymes, members of the alcohol dehydrogenase or medium chain reductase/dehydrogenase superfamily of proteins with glutathione-dependent formaldehyde dehydrogenase activity (GSH-FDH). Both genes contain eight introns in exactly the same positions, and these positions are conserved in plant ethanol-active Adh genes (class P). These data provide further evidence that plant class P genes have evolved from class III genes by gene duplication and acquisition of new substrate specificities. The position of introns and similarities in the nucleic acid and amino acid sequences of the different classes of ADH enzymes in plants and humans suggest that plant and animal class III enzymes diverged before they duplicated to give rise to plant and animal ethanol-active ADH enzymes. Plant class P ADH enzymes have gained substrate specificities and evolved promoters with different expression properties, in keeping with their metabolic function as part of the alcohol fermentation pathway. PMID:9215914
Ethnic Related Selection for an ADH Class I Variant within East Asia

PubMed Central

Li, Hui; Gu, Sheng; Cai, Xiaoyun; Speed, William C.; Pakstis, Andrew J.; Golub, Efim I.; Kidd, Judith R.; Kidd, Kenneth K.

2008-01-01

Background The alcohol dehydrogenases (ADH) are widely studied enzymes and the evolution of the mammalian gene cluster encoding these enzymes is also well studied. Previous studies have shown that the ADH1B*47His allele at one of the seven genes in humans is associated with a decrease in the risk of alcoholism and the core molecular region with this allele has been selected for in some East Asian populations. As the frequency of ADH1B*47His is highest in East Asia, and very low in most of the rest of the world, we have undertaken more detailed investigation in this geographic region. Methodology/Principal Findings Here we report new data on 30 SNPs in the ADH7 and Class I ADH region in samples of 24 populations from China and Laos. These populations cover a wide geographic region and diverse ethnicities. Combined with our previously published East Asian data for these SNPs in 8 populations, we have typed populations from all of the 6 major linguistic phyla (Altaic including Korean-Japanese and inland Altaic, Sino-Tibetan, Hmong-Mien, Austro-Asiatic, Daic, and Austronesian). The ADH1B genotyping data are strongly related to ethnicity. Only some eastern ethnic phyla or subphyla (Korean-Japanese, Han Chinese, Hmong-Mien, Daic, and Austronesian) have a high frequency of ADH1B*47His. ADH1B haplotype data clustered the populations into linguistic subphyla, and divided the subphyla into eastern and western parts. In the Hmong-Mien and Altaic populations, the extended haplotype homozygosity (EHH) and relative EHH (REHH) tests for the ADH1B core were consistent with selection for the haplotype with derived SNP alleles. In the other ethnic phyla, the core showed only a weak signal of selection at best. Conclusions/Significance The selection distribution is more significantly correlated with the frequency of the derived ADH1B regulatory region polymorphism than the derived amino-acid altering allele ADH1B*47His. Thus, the real focus of selection may be the regulatory region
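Extended haplotype homozygosity (EHH) is conventionally defined as the probability that two randomly chosen chromosomes carrying the same core haplotype are identical across all markers from the core out to a given distance. The sketch below implements that definition for a simplified case: a single-marker core, extension in one direction only, and invented phased haplotypes. The study's multi-SNP core, its data, and the REHH normalisation against other cores are not reproduced.

```python
from collections import Counter
from math import comb

def ehh(haplotypes, core_index, core_allele, extent):
    """EHH among chromosomes carrying core_allele at core_index.

    haplotypes: list of equal-length strings, one phased chromosome each.
    extent:     number of markers to extend to the right of the core.
    Returns the fraction of chromosome pairs identical over the segment.
    """
    carriers = [h for h in haplotypes if h[core_index] == core_allele]
    if len(carriers) < 2:
        raise ValueError("need at least two chromosomes carrying the core allele")
    segments = Counter(h[core_index:core_index + extent + 1] for h in carriers)
    identical_pairs = sum(comb(n, 2) for n in segments.values())
    return identical_pairs / comb(len(carriers), 2)

# Hypothetical phased haplotypes; the first marker is the core.
haps = ["1000", "1000", "1001", "1010", "0000", "0110"]

for extent in range(4):
    print(extent, ehh(haps, 0, "1", extent))
```

EHH starts at 1 at the core and decays with distance; a core haplotype whose EHH stays unusually high for its frequency is the pattern such tests read as a signal of recent positive selection.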
Ethnic related selection for an ADH Class I variant within East Asia.

PubMed

Li, Hui; Gu, Sheng; Cai, Xiaoyun; Speed, William C; Pakstis, Andrew J; Golub, Efim I; Kidd, Judith R; Kidd, Kenneth K

2008-04-02

The alcohol dehydrogenases (ADH) are widely studied enzymes and the evolution of the mammalian gene cluster encoding these enzymes is also well studied. Previous studies have shown that the ADH1B*47His allele at one of the seven genes in humans is associated with a decrease in the risk of alcoholism and the core molecular region with this allele has been selected for in some East Asian populations. As the frequency of ADH1B*47His is highest in East Asia, and very low in most of the rest of the world, we have undertaken more detailed investigation in this geographic region. Here we report new data on 30 SNPs in the ADH7 and Class I ADH region in samples of 24 populations from China and Laos. These populations cover a wide geographic region and diverse ethnicities. Combined with our previously published East Asian data for these SNPs in 8 populations, we have typed populations from all of the 6 major linguistic phyla (Altaic including Korean-Japanese and inland Altaic, Sino-Tibetan, Hmong-Mien, Austro-Asiatic, Daic, and Austronesian). The ADH1B genotyping data are strongly related to ethnicity. Only some eastern ethnic phyla or subphyla (Korean-Japanese, Han Chinese, Hmong-Mien, Daic, and Austronesian) have a high frequency of ADH1B*47His. ADH1B haplotype data clustered the populations into linguistic subphyla, and divided the subphyla into eastern and western parts. In the Hmong-Mien and Altaic populations, the extended haplotype homozygosity (EHH) and relative EHH (REHH) tests for the ADH1B core were consistent with selection for the haplotype with derived SNP alleles. In the other ethnic phyla, the core showed only a weak signal of selection at best. The selection distribution is more significantly correlated with the frequency of the derived ADH1B regulatory region polymorphism than the derived amino-acid altering allele ADH1B*47His. Thus, the real focus of selection may be the regulatory region. The obvious ethnicity-related distributions of ADH1B diversities
Isolation and Identification of Genes Activating UAS2-Dependent ADH2 Expression in Saccharomyces cerevisiae

PubMed Central

Donoviel, M. S.; Young, E. T.

1996-01-01

Two cis-acting elements have been identified that act synergistically to regulate expression of the glucose-repressed alcohol dehydrogenase 2 (ADH2) gene. UAS1 is bound by the trans-activator Adr1p. UAS2 is thought to be the binding site for an unidentified regulatory protein. A genetic selection based on a UAS2-dependent ADH2 reporter was devised to isolate genes capable of activating UAS2-dependent transcription. One set of UAS2-dependent genes contained SPT6/CRE2/SSN20. Multicopy SPT6 caused improper expression of chromosomal ADH2. A second set of UAS2-dependent clones contained a previously uncharacterized open reading frame designated MEU1 (Multicopy Enhancer of UAS2). A frame shift mutation in MEU1 abolished its ability to activate UAS2-dependent gene expression. Multicopy MEU1 expression suppressed the constitutive ADH2 expression caused by cre2-1. Disruption of MEU1 reduced endogenous ADH2 expression about twofold but had no effect on cell viability or growth. No homologues of MEU1 were identified by low-stringency Southern hybridization of yeast genomic DNA, and no significant homologues were found in the sequence data bases. A MEU1/β-gal fusion protein was not localized to a particular region of the cell. MEU1 is linked to PPR1 on chromosome XII. PMID:8807288
Expression of adhA from different organisms in Clostridium thermocellum.

PubMed

Zheng, Tianyong; Cui, Jingxuan; Bae, Hye Ri; Lynd, Lee R; Olson, Daniel G

2017-01-01

Clostridium thermocellum is a cellulolytic anaerobic thermophile that is a promising candidate for consolidated bioprocessing of lignocellulosic biomass into biofuels such as ethanol. It was previously shown that expressing Thermoanaerobacterium saccharolyticum adhA in C. thermocellum increases ethanol yield. In this study, we investigated expression of adhA genes from different organisms in Clostridium thermocellum. Based on sequence identity to T. saccharolyticum adhA, we chose adhA genes from 10 other organisms: Clostridium botulinum, Methanocaldococcus bathoardescens, Thermoanaerobacterium ethanolicus, Thermoanaerobacter mathranii, Thermococcus strain AN1, Thermoanaerobacterium thermosaccharolyticum, Caldicellulosiruptor saccharolyticus, Fervidobacterium nodosum, Marinitoga piezophila, and Thermotoga petrophila. All 11 adhA genes (including T. saccharolyticum adhA) were expressed in C. thermocellum and fermentation end products were analyzed. All 11 adhA genes increased C. thermocellum ethanol yield compared to the empty-vector control. C. botulinum and T. ethanolicus adhA genes generated significantly higher ethanol yield than T. saccharolyticum adhA. Our results indicated that expressing adhA is an effective method of increasing ethanol yield in wild-type C. thermocellum, and that this appears to be a general property of adhA genes.
A Phylogenetic Analysis of the Genus Fragaria (Strawberry) Using Intron-Containing Sequence from the ADH-1 Gene

PubMed Central

DiMeglio, Laura M.; Yu, Hongrun; Davis, Thomas M.

2014-01-01

The genus Fragaria encompasses species at ploidy levels ranging from diploid to decaploid. The cultivated strawberry, Fragaria×ananassa, and its two immediate progenitors, F. chiloensis and F. virginiana, are octoploids. To elucidate the ancestries of these octoploid species, we performed a phylogenetic analysis using intron-containing sequences of the nuclear ADH-1 gene from 39 germplasm accessions representing nineteen Fragaria species and one outgroup species, Dasiphora fruticosa. All trees from Maximum Parsimony and Maximum Likelihood analyses showed two major clades, Clade A and Clade B. Each of the sampled octoploids contributed alleles to both major clades. All octoploid-derived alleles in Clade A clustered with alleles of diploid F. vesca, with the exception of one octoploid allele that clustered with the alleles of diploid F. mandshurica. All octoploid-derived alleles in clade B clustered with the alleles of only one diploid species, F. iinumae. When gaps encoded as binary characters were included in the Maximum Parsimony analysis, tree resolution was improved with the addition of six nodes, and the bootstrap support was generally higher, rising above the 50% threshold for an additional nine branches. These results, coupled with the congruence of the sequence data and the coded gap data, validate and encourage the employment of sequence sets containing gaps for phylogenetic analysis. Our phylogenetic conclusions, based upon sequence data from the ADH-1 gene located on F. vesca linkage group II, complement and generally agree with those obtained from analyses of protein-encoding genes GBSSI-2 and DHAR located on F. vesca linkage groups V and VII, respectively, but differ from a previous study that utilized rDNA sequences and did not detect the ancestral role of F. iinumae. PMID:25078607
Characterization of polymorphisms of genes ADH2, ADH3, ALDH2 and CYP2E1 and relationship to the alcoholism in a Colombian population.

PubMed

Méndez, Claudia; Rey, Mauricio

2015-12-30

The aim was to identify and characterize polymorphisms of the ADH2, ADH3, ALDH2 and CYP2E1 genes in a Colombian population residing in the city of Bogotá and to determine their possible relationship to alcoholism. ADH2, ADH3, ALDH2 and CYP2E1 genotypes were determined with TaqMan probes and PCR-RFLP in a population of 148 individuals with non-problematic alcohol use and 65 individuals with alcoholism. DNA was obtained from peripheral blood white cells. Significant differences in family history of alcoholism and in use of other psychoactive substances were found when alcoholics were compared with controls. When allelic frequencies were considered by gender, the frequency of A2 allele carriers in ADH2 was higher in male patients than in controls. In women, the relative frequency of the c1 allele in CYP2E1 was lower in controls than in alcoholics. The ALDH2 locus was monomorphic. No significant differences in allele distributions of the loci examined were observed between the two populations; however, after stratification, the differences tended toward significance. This study supports a positive association between family history of alcoholism and alcoholism, suggesting a hereditary predisposition. Since substance dependence requires the interaction of multiple genes, the combination of the ADH2*2 and CYP2E1*1 genotypes with the homozygous ALDH2*1 genotype found in this study could confer a potential risk of alcoholism in this population.
Development of a plasmid-based expression system in Clostridium thermocellum and its use to screen heterologous expression of bifunctional alcohol dehydrogenases (adhEs)

DOE PAGES

Hon, Shuen; Lanahan, Anthony; Tian, Liang; ...

2016-04-22

Clostridium thermocellum is a promising candidate for ethanol production from cellulosic biomass, but requires metabolic engineering to improve ethanol yield. A key gene in the ethanol production pathway is the bifunctional aldehyde and alcohol dehydrogenase, adhE. To explore the effects of overexpressing wild-type, mutant, and exogenous adhEs, we developed a new expression plasmid, pDGO144, that exhibited improved transformation efficiency and better gene expression than its predecessor, pDGO-66. This new expression plasmid will allow for many other metabolic engineering and basic research efforts in C. thermocellum. As proof of concept, we used this plasmid to express 12 different adhE genes (both wild type and mutant) from several organisms. Ethanol production varied between clones immediately after transformation, but tended to converge to a single value after several rounds of serial transfer. The previously described mutant C. thermocellum D494G adhE gave the best ethanol production, which is consistent with previously published results.
Development of a plasmid-based expression system in Clostridium thermocellum and its use to screen heterologous expression of bifunctional alcohol dehydrogenases (adhEs).

PubMed

Hon, Shuen; Lanahan, Anthony A; Tian, Liang; Giannone, Richard J; Hettich, Robert L; Olson, Daniel G; Lynd, Lee R

2016-12-01

Clostridium thermocellum is a promising candidate for ethanol production from cellulosic biomass, but requires metabolic engineering to improve ethanol yield. A key gene in the ethanol production pathway is the bifunctional aldehyde and alcohol dehydrogenase, adhE. To explore the effects of overexpressing wild-type, mutant, and exogenous adhEs, we developed a new expression plasmid, pDGO144, that exhibited improved transformation efficiency and better gene expression than its predecessor, pDGO-66. This new expression plasmid will allow for many other metabolic engineering and basic research efforts in C. thermocellum. As proof of concept, we used this plasmid to express 12 different adhE genes (both wild type and mutant) from several organisms. Ethanol production varied between clones immediately after transformation, but tended to converge to a single value after several rounds of serial transfer. The previously described mutant C. thermocellum D494G adhE gave the best ethanol production, which is consistent with previously published results.
Determining the roles of the three alcohol dehydrogenases (AdhA, AdhB and AdhE) in Thermoanaerobacter ethanolicus during ethanol formation.

PubMed

Zhou, Jilai; Shao, Xiongjun; Olson, Daniel G; Murphy, Sean Jean-Loup; Tian, Liang; Lynd, Lee R

2017-05-01

Thermoanaerobacter ethanolicus is a promising candidate for biofuel production due to the broad range of substrates it can utilize and its high ethanol yield compared to other thermophilic bacteria, such as Clostridium thermocellum. Three alcohol dehydrogenases, AdhA, AdhB and AdhE, play key roles in ethanol formation. To study their physiological roles during ethanol formation, we deleted them separately and in combination. Previously, it has been thought that both AdhB and AdhE were bifunctional alcohol dehydrogenases. Here we show that AdhE has primarily acetyl-CoA reduction activity (ALDH) and almost no acetaldehyde reduction (ADH) activity, whereas AdhB has no ALDH activity but high ADH activity. We found that AdhA and AdhB have similar patterns of activity. Interestingly, although deletion of both adhA and adhB reduced ethanol production, a single deletion of either one actually increased ethanol yields by 60-70%.
Genetic polymorphisms of ADH1B, ADH1C and ALDH2 in Turkish alcoholics: lack of association with alcoholism and alcoholic cirrhosis.

PubMed

Vatansever, Sezgin; Tekin, Fatih; Salman, Esin; Altintoprak, Ender; Coskunol, Hakan; Akarca, Ulus Salih

2015-05-17

No data exist regarding the alcohol dehydrogenase (ADH) and aldehyde dehydrogenase (ALDH) gene polymorphisms in Turkish alcoholic cirrhotics. We studied the polymorphisms of ADH1B, ADH1C and ALDH2 genes in alcoholic cirrhotics and compared the results with non-cirrhotic alcoholics and healthy volunteers. Overall, 237 subjects were included in the study: 156 alcoholic patients (78 cirrhotics, 78 non-cirrhotic alcoholics) and 81 healthy volunteers. Three different single-nucleotide-polymorphism genotyping methods were used. ADH1C genotyping was performed using a polymerase chain reaction-restriction fragment length polymorphism method. The identified ADH1C genotypes were named according to the presence or absence of the enzyme restriction sites. ADH1B (Arg47His) genotyping was performed using the allele-specific primer extension method, and ALDH2 (Glu487Lys) genotyping was performed by a multiplex polymerase chain reaction using two allele-specific primer pairs. For ADH1B, the frequency of allele *1 in the cirrhotics, non-cirrhotic alcoholics and healthy volunteers was 97.4%, 94.9% and 99.4%, respectively. For ADH1C, the frequency of allele *1 in the cirrhotics, non-cirrhotic alcoholics and healthy volunteers was 47%, 36.3% and 45%, respectively. There was no statistical difference between the groups for ADH1B and ADH1C (p>0.05). All alcoholic and non-alcoholic subjects (100%) had the allele *1 for ALDH2. The obtained results for ADH1B, ADH1C, and ALDH gene polymorphisms in the present study are similar to the results of Caucasian studies. ADH1B and ADH1C genetic variations are not related to the development of alcoholism or susceptibility to alcoholic cirrhosis. ALDH2 gene has no genetic variation in the Turkish population.
[Polymorphism of alcohol dehydrogenase gene ADH1B in eastern Slavic and Iranian-speaking populations].

PubMed

2005-11-01

Frequencies of alleles and genotypes for alcohol dehydrogenase gene ADH1B (arg47his polymorphism), associated with alcohol tolerance/sensitivity, were determined. It was demonstrated that the frequency of allele ADH1B*47his, corresponding to atypical alcohol dehydrogenase variant in Russians, Ukrainians, Iranians, and mountain-dwellers of the Pamirs constituted 3, 7, 24, and 22%, respectively. The frequencies established were consistent with the allele frequency distribution pattern among the populations of Eurasia. Russians and Ukrainians were indistinguishable from other European populations relative to the frequency of allele ADH1B*47his, and consequently, relative to specific features of ethanol metabolic pathways. The data obtained provide refinement of the geographic pattern of ADH1B*47his frequency distribution in Eurasia.
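As a rough illustration of what the allele frequencies in the preceding abstract would imply at the genotype level, the following Python sketch converts each reported ADH1B*47his frequency into expected genotype proportions. It assumes Hardy-Weinberg equilibrium and a biallelic locus, neither of which is stated in the abstract; the function name hardy_weinberg and the dictionary reported are hypothetical names chosen for this example.

```python
def hardy_weinberg(q):
    """Expected genotype proportions (ref/ref, ref/var, var/var) for a
    biallelic locus with variant-allele frequency q.
    Assumption: the population is in Hardy-Weinberg equilibrium."""
    if not 0.0 <= q <= 1.0:
        raise ValueError("allele frequency must be between 0 and 1")
    p = 1.0 - q
    return p * p, 2.0 * p * q, q * q


# ADH1B*47his allele frequencies as reported in the abstract (3, 7, 24, 22%).
reported = {
    "Russians": 0.03,
    "Ukrainians": 0.07,
    "Iranians": 0.24,
    "Pamir mountain-dwellers": 0.22,
}

for population, q in reported.items():
    hom_ref, het, hom_var = hardy_weinberg(q)
    carriers = het + hom_var  # individuals with at least one 47his allele
    print(f"{population}: expected 47his carriers = {carriers:.1%}")
```

Under these assumptions the three proportions always sum to one, and the carrier fraction equals 1 - (1 - q)^2, so a population with a low allele frequency still has roughly twice that fraction of carriers.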
adhA in Aspergillus parasiticus Is Involved in Conversion of 5′-Hydroxyaverantin to Averufin

PubMed Central

Chang, Perng-Kuang; Yu, Jiujiang; Ehrlich, Kenneth C.; Boue, Stephen M.; Montalbano, Beverly G.; Bhatnagar, Deepak; Cleveland, Thomas E.

2000-01-01

Two routes for the conversion of 5′-hydroxyaverantin (HAVN) to averufin (AVF) in the synthesis of aflatoxin have been proposed. One involves the dehydration of HAVN to the lactone averufanin (AVNN), which is then oxidized to AVF. Another requires dehydrogenation of HAVN to 5′-ketoaverantin, the open-chain form of AVF, which then cyclizes spontaneously to AVF. We isolated a gene, adhA, from the aflatoxin gene cluster of Aspergillus parasiticus SU-1. The deduced ADHA amino acid sequence contained two conserved motifs found in short-chain alcohol dehydrogenases—a glycine-rich loop (GXXXGXG) that is necessary for interaction with NAD+-NADP+, and the motif YXXXK, which is found at the active site. A. parasiticus SU-1, which produces aflatoxins, has two copies of adhA (adhA1), whereas A. parasiticus SRRC 2043, a strain that accumulates O-methylsterigmatocystin (OMST), has only one copy. Disruption of adhA in SRRC 2043 resulted in a strain that accumulates predominantly HAVN. This result suggests that ADHA is involved in the dehydrogenation of HAVN to AVF. Those adhA disruptants that still made small amounts of OMST also accumulated other metabolites, including AVNN, after prolonged culture. PMID:11055914
Trends in gastrectomy and ADH1B and ALDH2 genotypes in Japanese alcoholic men and their gene-gastrectomy, gene-gene and gene-age interactions for risk of alcoholism.

PubMed

Yokoyama, Akira; Yokoyama, Tetsuji; Matsui, Toshifumi; Mizukami, Takeshi; Kimura, Mitsuru; Matsushita, Sachio; Higuchi, Susumu; Maruyama, Katsuya

2013-01-01

The life-time drinking profiles of Japanese alcoholics have shown that gastrectomy increases susceptibility to alcoholism. We investigated the trends in gastrectomy and alcohol dehydrogenase-1B (ADH1B) and aldehyde dehydrogenase-2 (ALDH2) genotypes and their interactions in alcoholics. This survey was conducted on 4879 Japanese alcoholic men 40 years of age or older who underwent routine gastrointestinal endoscopic screening during the period 1996-2010. ADH1B/ALDH2 genotyping was performed in 3702 patients. A history of gastrectomy was found in 508 (10.4%) patients. The reason for the gastrectomy was peptic ulcer in 317 patients and gastric cancer in 187 patients. The frequency of gastrectomy had gradually decreased from 13.3% in 1996-2000 to 10.5% in 2001-2005 and to 7.8% in 2006-2010 (P < 0.0001). ADH1B*1/*1 was less frequent in the gastrectomy group than in the non-gastrectomy group (age-adjusted prevalence: 20.4 vs. 27.6%, P = 0.006). ALDH2 genotype distribution did not differ between the two groups. The frequency of inactive ALDH2*1/*2 heterozygotes increased slightly from 13.0% in 1996-2000 to 14.0% in 2001-2005 and to 15.4% in 2006-2010 (P < 0.08). Two alcoholism-susceptibility genotypes, ADH1B*1/*1 and ALDH2*1/*1, modestly but significantly tended not to occur in the same individual (P = 0.026). The frequency of ADH1B*1/*1 decreased with ascending age groups. The high frequency of history of gastrectomy suggested that gastrectomy is still a risk factor for alcoholism, although the percentage decreased during the period. The alcoholism-susceptibility genotype ADH1B*1/*1 was less frequent in the gastrectomy group, suggesting a competitive gene-gastrectomy interaction for alcoholism. A gene-gene interaction and gene-age interactions regarding the ADH1B genotype were observed.
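The headline proportions in the preceding abstract can be re-derived from the reported counts. The short Python sketch below does only that arithmetic; the variable names and the percent helper are hypothetical, and the remainder it computes (gastrectomies attributed to neither peptic ulcer nor gastric cancer) is an inference from the counts, not a figure given in the abstract.

```python
# Counts taken from the abstract above.
total_patients = 4879      # alcoholic men screened, 1996-2010
gastrectomy = 508          # patients with a history of gastrectomy
peptic_ulcer = 317         # gastrectomies attributed to peptic ulcer
gastric_cancer = 187       # gastrectomies attributed to gastric cancer


def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


gastrectomy_pct = percent(gastrectomy, total_patients)
ulcer_share = percent(peptic_ulcer, gastrectomy)
cancer_share = percent(gastric_cancer, gastrectomy)
# Gastrectomies not accounted for by the two stated reasons (inferred).
other_or_unspecified = gastrectomy - peptic_ulcer - gastric_cancer

print(f"History of gastrectomy: {gastrectomy_pct:.1f}% of {total_patients}")
print(f"Peptic ulcer: {ulcer_share:.1f}% of gastrectomies")
print(f"Gastric cancer: {cancer_share:.1f}% of gastrectomies")
print(f"Other or unspecified reason: {other_or_unspecified} patients")
```

The first figure rounds to the 10.4% stated in the abstract; the two stated reasons together cover all but a handful of the gastrectomy cases.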
High diversity and no significant selection signal of human ADH1B gene in Tibet

PubMed Central

2012-01-01

Background: ADH1B is one of the most studied human genes with many polymorphic sites. One of the single nucleotide polymorphisms (SNPs), rs1229984, coding for the Arg48His substitution, has been associated with many serious diseases including alcoholism and cancers of the digestive system. The derived allele, ADH1B*48His, reaches high frequency only in East Asia and Southwest Asia, and is highly associated with agriculture. Micro-evolutionary study has defined seven haplogroups for ADH1B based on seven SNPs encompassing the gene. Three of those haplogroups, H5, H6, and H7, contain the ADH1B*48His allele. H5 occurs in Southwest Asia and the other two are found in East Asia. H7 is derived from H6 by the derived allele of rs3811801. The H7 haplotype has been shown to have undergone significant positive selection in Han Chinese, Hmong, Koreans, Japanese, Kazakh, Mongols, and so on. Methods: In the present study, we tested whether Tibetans also showed evidence for selection by typing 23 SNPs in the region covering the ADH1B gene in 1,175 individuals from 12 Tibetan populations representing all districts of the Tibet Autonomous Region. Multiple statistics were estimated to examine the gene diversities and positive selection signals among the Tibetans and other populations in East Asia. Results: The larger Tibetan populations (Qamdo, Lhasa, Nagqu, Nyingchi, Shannan, and Shigatse), comprising mostly farmers, have around 12% of H7 and 2% of H6. The smaller populations, living on hunting or recently switched to farming, have lower H7 frequencies (Tingri 9%, Gongbo 8%, Monba and Sherpa 6%). Luoba (2%) and Deng (0%) have even lower frequencies. Long-range haplotype analyses revealed very weak signals of positive selection for H7 among Tibetans. Interestingly, the haplotype diversity of H7 is higher in Tibetans than in any other populations studied, indicating a longer diversification history for that haplogroup in Tibetans. Network analysis on the long-range haplotypes revealed
Ethanol-induced alcohol dehydrogenase E (AdhE) potentiates pneumolysin in Streptococcus pneumoniae.

PubMed

Luong, Truc Thanh; Kim, Eun-Hye; Bak, Jong Phil; Nguyen, Cuong Thach; Choi, Sangdun; Briles, David E; Pyo, Suhkneung; Rhee, Dong-Kwon

2015-01-01

Alcohol impairs the host immune system, rendering the host more vulnerable to infection. Therefore, alcoholics are at increased risk of acquiring serious bacterial infections caused by Streptococcus pneumoniae, including pneumonia. Nevertheless, how alcohol affects pneumococcal virulence remains unclear. Here, we showed that the S. pneumoniae type 2 D39 strain is ethanol tolerant and that alcohol upregulates alcohol dehydrogenase E (AdhE) and potentiates pneumolysin (Ply). Hemolytic activity, colonization, and virulence of S. pneumoniae, as well as host cell myeloperoxidase activity, proinflammatory cytokine secretion, and inflammation, were significantly attenuated in adhE mutant bacteria (ΔadhE strain) compared to D39 wild-type bacteria. Therefore, AdhE might act as a pneumococcal virulence factor. Moreover, in the presence of ethanol, S. pneumoniae AdhE produced acetaldehyde and NADH, which subsequently led Rex (redox-sensing transcriptional repressor) to dissociate from the adhE promoter. An increase in AdhE level under the ethanol condition conferred an increase in Ply and H2O2 levels. Consistently, S. pneumoniae D39 caused higher cytotoxicity to RAW 264.7 cells than the ΔadhE strain under the ethanol stress condition, and ethanol-fed mice (alcoholic mice) were more susceptible to infection with the D39 wild-type bacteria than with the ΔadhE strain. Taken together, these data indicate that AdhE increases Ply under the ethanol stress condition, thus potentiating pneumococcal virulence. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
A novel RNase G mutant that is defective in degradation of adhE mRNA but proficient in the processing of 16S rRNA precursor.

PubMed

Wachi, M; Kaga, N; Umitsuki, G; Clark, D P; Nagai, K

2001-12-21

Escherichia coli RNase G, encoded by the rng gene, is involved in both the processing of 16S rRNA precursor and the degradation of adhE mRNA. Consequently, defects in RNase G result in elevation of AdhE levels. Furthermore, the adhR430 mutant strain, DC430, is reported to overproduce the AdhE protein in a manner dependent on the adhC81 mutation. We found that overproduction of AdhE by DC430 was reversed to wild-type levels by introduction of a plasmid carrying the wild-type allele of rng. Mapping by P1-phage-mediated transduction also indicated that a mutation involved in AdhE overproduction was located around the rng region in DC430. DNA sequencing of the rng region revealed that DC430 indeed had a mutation in the rng gene: a G1022 to A transition that caused substitution of Gly341 with Ser and which was named rng430. This lies in the highly conserved region of the RNase E/RNase G family, called high similarity region 2 (HSR2). However, very interestingly, rng430 mutant strains did not accumulate the 16.3S precursor of 16S rRNA unlike rng::cat mutants. We also found that the Rng1 mutant protein, which is truncated in its C-terminal domain encompassing HSR2, exhibited a residual processing activity against the 16S rRNA precursor, when overproduced. These results indicate that the HSR2 of RNase G plays an important role in substrate recognition and/or ribonucleolytic action.
Biosynthetic burden and plasmid burden limit expression of chromosomally integrated heterologous genes (pdc, adhB) in Escherichia coli

DOE Office of Scientific and Technical Information (OSTI.GOV)

Martinez, A.; York, S.W.; Yomano, L.P.

1999-10-01

Previous studies have shown an unexpectedly high nutrient requirement for efficient ethanol production by ethanologenic recombinants of Escherichia coli B such as LY01 which contain chromosomally integrated Zymomonas mobilis genes (pdc, adhB) encoding the ethanol pathway. The basis for this requirement has been identified as a media-dependent effect on the expression of the Z. mobilis genes rather than a nutritional limitation. Ethanol production was substantially increased without additional nutrients simply by increasing the level of pyruvate decarboxylase activity. This was accomplished by adding a multicopy plasmid containing pdc alone (but not adhB alone) to strain LY01, and by adding multicopy plasmids which express pdc and adhB from strong promoters. New strong promoters were isolated from random fragments of Z. mobilis DNA and characterized but were not used to construct integrated biocatalysts. These promoters contained regions resembling recognition sites for 3 different E. coli sigma factors: σ70, σ38, and σ28. The most effective plasmid-based promoters for fermentation were recognized by multiple sigma factors, expressed both pdc and adhB at high levels, and produced ethanol efficiently while allowing up to 80% reduction in complex nutrients as compared to LY01. The ability to utilize multiple sigma factors may be advantageous to maintain the high levels of PDC and ADH needed for efficient ethanol production throughout batch fermentation.
Molecular analysis of UAS(E), a cis element containing stress response elements responsible for ethanol induction of the KlADH4 gene of Kluyveromyces lactis.

PubMed

Mazzoni, C; Santori, F; Saliola, M; Falcone, C

2000-01-01

KlADH4 is a gene of Kluyveromyces lactis encoding a mitochondrial alcohol dehydrogenase activity, which is specifically induced by ethanol and insensitive to glucose repression. In this work, we report the molecular analysis of UAS(E), an element of the KlADH4 promoter which is essential for the induction of KlADH4 in the presence of ethanol. UAS(E) contains five stress response elements (STREs), which have been found in many genes of Saccharomyces cerevisiae involved in the response of cells to conditions of stress. Whereas KlADH4 is not responsive to stress conditions, the STREs present in UAS(E) seem to play a key role in the induction of the gene by ethanol, a situation that has not been observed in the related yeast S. cerevisiae. Gel retardation experiments showed that STREs in the KlADH4 promoter can bind factor(s) under non-inducing conditions. Moreover, we observed that the RAP1 binding site present in UAS(E) binds KlRap1p.
Overexpression of ADH1 and HXT1 genes in the yeast Saccharomyces cerevisiae improves the fermentative efficiency during tequila elaboration.

PubMed

Gutiérrez-Lomelí, Melesio; Torres-Guzmán, Juan Carlos; González-Hernández, Gloria Angélica; Cira-Chávez, Luis Alberto; Pelayo-Ortiz, Carlos; Ramírez-Córdova, Jose de Jesús

2008-05-01

This work assessed the effect of the overexpression of ADH1 and HXT1 genes in the Saccharomyces cerevisiae AR5 strain during fermentation of Agave tequilana Weber blue variety must. Both genes were cloned individually and simultaneously into a yeast centromere plasmid. Two transformant strains overexpressing ADH1 and HXT1 individually and one strain overexpressing both genes were randomly selected and named A1, A3 and A5 respectively. Overexpression effect on growth and ethanol production of the A1, A3 and A5 strains was evaluated in fermentative conditions in A. tequilana Weber blue variety must and YPD medium. During growth in YPD and Agave media, all the recombinant strains showed lower cell mass formation than the wild type AR5 strain. Adh enzymatic activity in the recombinant strains A1 and A5 cultivated in A. tequilana and YPD medium was higher than in the wild type. The overexpression of both genes individually and simultaneously had no significant effect on ethanol formation; however, the fermentative efficiency of the A5 strain increased from 80.33% to 84.57% and 89.40% to 94.29% in YPD and Agave medium respectively.
First description and evaluation of SNPs in the ADH and ALDH genes in a population of alcoholics in Central-West Brazil.

PubMed

Teixeira, Thallita Monteiro; da Silva, Hugo Delleon; Goveia, Rebeca Mota; Ribolla, Paulo Eduardo Martins; Alonso, Diego Peres; Alves, Alessandro Arruda; Melo E Silva, Daniela; Collevatti, Rosane Garcia; Bicudo, Lucilene Arilho; Bérgamo, Nádia Aparecida; de Paula Silveira-Lacerda, Elisângela

2017-12-01

Worldwide, different studies have reported an association of alcohol-use disorder (AUD) with different types of Single Nucleotide Polymorphisms (SNPs) in the genes for aldehyde dehydrogenase (ALDH) and alcohol dehydrogenase (ADH). In Brazil, there is little information about the occurrence of these SNPs in the AUD population and an absence of studies characterizing the population in the Central-West Region of Brazil. Actually, in Brazil, there are more than 4 million people with AUD. Despite the major health hazards of AUD, information on alcohol consumption and its consequences are not well understood. Therefore, it is extremely important to characterize these SNPs for the better understanding of AUD as a genetic disease in the Brazilian population. The present study, unlike other studies in other countries, is done with a subject population that shows a significant amount of racial homogenization. We evaluated the presence of SNPs in the ADH (ADH1B, ADH1C, and ADH4) and ALDH (ALDH2) genes in alcohol users of Goiânia, State of Goiás - Brazil, and then we established a possible relationship with AUD by allelic and genotypic study. This study was conducted with a population of people with AUD (n = 99) from Goiás Alcohol Dependence Recovery Center (GO CEREA) and Psychosocial Care Center for Alcohol and Drugs (CAPS AD), and with a population of people without AUD as controls (n = 100). DNA was extracted from whole-blood samples and the genotyping was performed using TaqMan® SNP genotyping assays. For characterization and evaluation of SNPs in the population, genotype frequency, allele frequency, haplotype frequency, Hardy-Weinberg equilibrium, and linkage disequilibrium were analyzed. Statistical analyses were calculated by GENEPOP 4.5 and Haploview software. The allele 1 was considered as "wild" (or *1) and allele 2 as mutant (or *2). Significant differences were found for ADH1B*, ADH4*2, and ALDH2*2 SNPs when the genotype and allele frequencies were
ADH1B promotes mesothelial clearance and ovarian cancer infiltration.

PubMed

Gharpure, Kshipra M; Lara, Olivia D; Wen, Yunfei; Pradeep, Sunila; LaFargue, Chris; Ivan, Cristina; Rupaimoole, Rajesha; Hu, Wei; Mangala, Lingegowda S; Wu, Sherry Y; Nagaraja, Archana S; Baggerly, Keith; Sood, Anil K

2018-05-18

Primary debulking surgery followed by adjuvant chemotherapy is the standard treatment for ovarian cancer. Residual disease after primary surgery is associated with poor patient outcome. Previously, we discovered ADH1B to be a molecular biomarker of residual disease. In the current study, we investigated the functional role of ADH1B in promoting ovarian cancer cell invasiveness and contributing to residual disease. We discovered that ADH1B overexpression leads to a more infiltrative cancer cell phenotype, promotes metastasis, increases the adhesion of cancer cells to mesothelial cells, and increases extracellular matrix degradation. Live cell imaging revealed that ADH1B-overexpressing cancer cells efficiently cleared the mesothelial cell layer compared to control cells. Moreover, gene array analysis revealed that ADH1B affects several pathways related to the migration and invasion of cancer cells. We also discovered that hypoxia increases ADH1B expression in ovarian cancer cells. Collectively, these findings indicate that ADH1B plays an important role in the pathways that promote ovarian cancer cell infiltration and may increase the likelihood of residual disease following surgery.
INACTIVATION OF E. COLI PYRUVATE FORMATE-LYASE: ROLE OF AdhE AND SMALL MOLECULES

PubMed Central

Nnyepi, Mbako R.; Peng, Yi; Broderick, Joan B.

2007-01-01

E. coli AdhE has been reported to harbor three distinct enzymatic activities: alcohol dehydrogenase, acetaldehyde-CoA dehydrogenase, and pyruvate formate-lyase (PFL) deactivase. Herein we report on the cloning, expression, and purification of E. coli AdhE, and the re-investigation of its purported enzymatic activities. While both the alcohol dehydrogenase and acetaldehyde-CoA dehydrogenase activities were readily detectable, we were unable to obtain any evidence for catalytic deactivation of PFL by AdhE, regardless of whether the reported cofactors for deactivation (Fe(II), NAD, and CoA) were present. Our results demonstrate that AdhE is not a PFL deactivating enzyme. We have also examined the potential for deactivation of active PFL by small-molecule thiols. Both β-mercaptoethanol and dithiothreitol deactivate PFL efficiently, with the former providing quite rapid deactivation. PFL deactivated by these thiols can be reactivated, suggesting that this deactivation is non-destructive transfer of an H atom equivalent to quench the glycyl radical. PMID:17280641
The yeast ADH7 promoter enables gene expression under pronounced translation repression caused by the combined stress of vanillin, furfural, and 5-hydroxymethylfurfural.

PubMed

Ishida, Yoko; Nguyen, Trinh Thi My; Izawa, Shingo

2017-06-20

Lignocellulosic biomass conversion inhibitors such as vanillin, furfural, and 5-hydroxymethylfurfural (HMF) inhibit the growth of and fermentation by Saccharomyces cerevisiae. A high concentration of each fermentation inhibitor represses translation and increases non-translated mRNAs. We previously reported that the mRNAs of ADH7 and BDH2, which encode putative NADPH- and NADH-dependent alcohol dehydrogenases, respectively, were efficiently translated even with translation repression in response to severe vanillin stress. However, the combined effects of these fermentation inhibitors on the expression of ADH7 and BDH2 remain unclear. We herein demonstrated that exposure to a combined stress of vanillin, furfural, and HMF repressed translation. The protein synthesis of Adh7, but not Bdh2 was significantly induced under combined stress conditions, even though the mRNA levels of ADH7 and BDH2 were up-regulated. Additionally, adh7Δ cells were more sensitive to the combined stress than wild-type and bdh2Δ cells. These results suggest that induction of the ADH7 expression plays a role in the tolerance to the combined stress of vanillin, furfural, and HMF. Furthermore, we succeeded in improving yeast tolerance to the combined stress by controlling the expression of ALD6 with the ADH7 promoter. Our results demonstrate that the ADH7 promoter can overcome the pronounced translation repression caused by the combined stress of vanillin, furfural, and HMF, and also suggest a new gene engineering strategy to breed robust and optimized yeasts for bioethanol production from a lignocellulosic biomass. Copyright © 2017 Elsevier B.V. All rights reserved.
Molecular Variation of Adh and P6 Genes in an African Population of Drosophila Melanogaster and Its Relation to Chromosomal Inversions

PubMed Central

Benassi, V.; Aulard, S.; Mazeau, S.; Veuille, M.

1993-01-01

Four-cutter molecular polymorphism of Adh and P6, and chromosome inversion polymorphism of chromosome II were investigated in 95 isogenic lines of an Ivory Coast population of Drosophila melanogaster, a species assumed to have recently spread throughout the world from a West African origin. The P6 gene showed little linkage disequilibrium with the In(2L)t inversion, although it is located within this inversion. This suggests that the inversion and the P6 locus have extensively exchanged genetic information through either double crossover or gene conversion. Allozymic variation in ADH was in linkage disequilibrium with In(2L)t and In(2R)NS inversions. Evidence suggests either that inversion linkage with the Fast allele is selectively maintained, or that this allele only recently appeared. Molecular polymorphism at the Adh locus in the Ivory Coast is not higher than in North American populations. New haplotypes specific to the African population were found, some of which connect the "Wa(s)-like" haplotypes found at high frequencies in the United States to the other slow haplotypes. Their relation with In(2L)t supports the hypothesis that Wa(s) recently recombined away from an In(2L)t chromosome which may be the cause of its divergence from the other haplotypes. PMID:8349110
Allelic variants of ADH, ALDH and the five factor model of personality in alcohol dependence syndrome

PubMed Central

Salujha, S. K.; Chaudhury, S.; Menon, P. K.; Srivastava, K.; Gupta, A.

2014-01-01

Background: The etiology of alcohol dependence is a complex interplay of biopsychosocial factors. The genes for the alcohol-metabolizing enzymes alcohol dehydrogenase (ADH2 and ADH3) and aldehyde dehydrogenase (ALDH2) exhibit functional polymorphisms. Vulnerability to alcohol dependence may also be in part due to heritable personality traits. Aim: To determine whether any association exists between polymorphisms of ADH2, ADH3 and ALDH2 and alcohol dependence syndrome in a group of Asian Indians. In addition, the personality of these patients was assessed to identify traits predisposing to alcoholism. Materials and Methods: In this study, 100 consecutive males with alcohol dependence syndrome attending the psychiatric outpatient department of a tertiary care service hospital and an equal number of matched healthy controls were included with their consent. Blood samples of all the study cases and controls were collected and genotyped for the ADH2, ADH3 and ALDH2 loci. Personality was evaluated using the neuroticism, extraversion, openness (NEO) personality inventory and sensation seeking scale. Results: Allele frequencies of ADH2*2 (0.50), ADH3*1 (0.67) and ALDH2*2 (0.09) were significantly low in the alcohol-dependent subjects. Personality traits of the NEO personality inventory and sensation seeking were significantly higher when compared to controls. Conclusions: The functional polymorphisms of genes coding for alcohol-metabolizing enzymes and personality traits of NEO and sensation seeking may affect the propensity to develop dependence. PMID:25535445
The Genetics of a Small Autosomal Region of DROSOPHILA MELANOGASTER Containing the Structural Gene for Alcohol Dehydrogenase. I. Characterization of Deficiencies and Mapping of ADH and Visible Mutations

PubMed Central

Woodruff, R. C.; Ashburner, M.

1979-01-01

The position of the structural gene coding for alcohol dehydrogenase (ADH) in Drosophila melanogaster has been shown to be within polytene chromosome bands 35B1 and 35B3, most probably within 35B2. The genetic and cytological properties of twelve deficiencies in polytene chromosome region 34–35 have been characterized, eleven of which include Adh. Also mapped cytogenetically are seven other recessive visible mutant loci. Flies heterozygous for overlapping deficiencies that include both the Adh locus and that for the outspread mutant (osp: a recessive wing phenotype) are homozygous viable and show a complete ADH negative phenotype and strong osp phenotype. These deficiencies probably include two polytene chromosome bands, 35B2 and 35B3. PMID:115743
CvADH1, a member of short-chain alcohol dehydrogenase family, is inducible by gibberellin and sucrose in developing watermelon seeds.

PubMed

Kim, Joonyul; Kang, Hong-Gyu; Jun, Sung-Hoon; Lee, Jinwon; Yim, Jieun; An, Gynheung

2003-01-01

To understand the molecular mechanisms that control seed formation, we selected a seed-preferential gene (CvADH1) from the ESTs of developing watermelon seeds. RNA blot analysis and in situ localization showed that CvADH1 was preferentially expressed in the nucellar tissue. The CvADH1 protein shared about 50% homology with short-chain alcohol dehydrogenases, including ABA2 in Arabidopsis thaliana, stem secoisolariciresinol dehydrogenase in Forsythia intermedia, and 3beta-hydroxysterol dehydrogenase in Digitalis lanata. We investigated gene-expression levels in seeds from both normally pollinated fruits and those made parthenocarpic via N-(2-chloro-4-pyridyl)-N'-phenylurea treatment, the latter of which lack zygotic tissues. Whereas the transcripts of CvADH1 rapidly started to accumulate from about the pre-heart stage in normal seeds, they were not detectable in the parthenocarpic seeds. Treating the parthenocarpic fruit with GA(3) strongly induced gene expression, up to the level accumulated in pollinated seeds. These results suggest that the CvADH1 gene is induced in maternal tissues by signals made in the zygotic tissues, and that gibberellin might be one of those signals. We also observed that CvADH1 expression was induced by sucrose in the parthenocarpic seeds. Therefore, we propose that the CvADH1 gene is inducible by gibberellin, and that sucrose plays an important role in the maternal tissues of watermelon during early seed development.

Enhanced robustness in acetone-butanol-ethanol fermentation with engineered Clostridium beijerinckii overexpressing adhE2 and ctfAB.

PubMed

Lu, Congcong; Yu, Le; Varghese, Saju; Yu, Mingrui; Yang, Shang-Tian

2017-11-01

Clostridium beijerinckii CC101 was engineered to overexpress aldehyde/alcohol dehydrogenase (adhE2) and CoA-transferase (ctfAB). Solvent production and acid assimilation were compared between the parental and engineered strains expressing only adhE2 (CC101-SV4) and expressing adhE2, ald and ctfAB (CC101-SV6). CC101-SV4 showed early butanol production from glucose but stopped prematurely at a low butanol concentration of ∼6 g/L. Compared to CC101, CC101-SV6 produced more butanol (∼12 g/L) from glucose and was able to re-assimilate more acids, which prevented "acid crash" and increased butanol production, under all conditions studied. CC101-SV6 also showed a better ability to use the glucose and xylose present in sugarcane bagasse hydrolysate, and produced 9.4 g/L solvents (acetone, butanol and ethanol) compared to only 2.6 g/L by CC101, confirming its robustness and better tolerance to hydrolysate inhibitors. The engineered strain of C. beijerinckii overexpressing adhE2 and ctfAB should have good potential for producing butanol from lignocellulosic biomass hydrolysates.
DkPK Genes Promote Natural Deastringency in C-PCNA Persimmon by Up-regulating DkPDC and DkADH Expression

PubMed Central

Guan, Changfei; Du, Xiaoyun; Zhang, Qinglin; Ma, Fengwang; Luo, Zhengrong; Yang, Yong

2017-01-01

The astringency of Chinese pollination-constant non-astringent (C-PCNA) persimmon (Diospyros kaki Thunb.) can be naturally removed on the tree. This process is controlled by a single locus and is dominant against other types of persimmons; therefore, this variant is an important candidate for commercial cultivation and the breeding of PCNA cultivars. In our previous study, six full-length coding sequences (CDS) for pyruvate kinase genes (DkPK1-6) were isolated, and DkPK1 is thought to be involved in the natural deastringency of C-PCNA persimmon fruit. Here, we characterize the eight other DkPK genes (DkPK7-14) from C-PCNA persimmon fruit based on transcriptome data. The transcript changes in DkPK7-14 genes and correlations with the proanthocyanidin (PA) content were investigated during different fruit development stages in C-PCNA, J-PCNA, and non-PCNA persimmon; DkPK7 and DkPK8 exhibited up-regulation patterns during the last developmental stage in C-PCNA persimmon that was negatively correlated with the decrease in soluble PAs. Phylogenetic analysis and subcellular localization analysis revealed that DkPK7 and DkPK8 are cytosolic proteins. Notably, DkPK7 and DkPK8 were ubiquitously expressed in various persimmon organs and abundantly up-regulated in seeds. Furthermore, transient over-expression of DkPK7 and DkPK8 in persimmon leaves led to a significant decrease in the content of soluble PAs but a significant increase in the expression levels of the pyruvate decarboxylase (DkPDC) and alcohol dehydrogenase genes (DkADH), which are closely related to acetaldehyde metabolism. The accumulated acetaldehyde that results from the up-regulation of the DkPDC and DkADH genes can combine with soluble PAs to form insoluble PAs, resulting in the removal of astringency from persimmon fruit. Thus, we suggest that both DkPK7 and DkPK8 are likely to be involved in natural deastringency via the up-regulation of DkPDC and DkADH expression during the last developmental stage in C
Finding genes discriminating smokers from non-smokers by applying a growing self-organizing clustering method to large airway epithelium cell microarray data.

PubMed

Shahdoust, Maryam; Hajizadeh, Ebrahim; Mozdarani, Hossein; Chehrei, Ali

2013-01-01

Cigarette smoking is the major risk factor for development of lung cancer. Identification of effects of tobacco on airway gene expression may provide insight into the causes. This research aimed to compare gene expression of large airway epithelium cells in normal smokers (n=13) and non-smokers (n=9) in order to find genes which discriminate the two groups and assess cigarette smoking effects on large airway epithelium cells. Genes discriminating smokers from non-smokers were identified by applying a neural network clustering method, growing self-organizing maps (GSOM), to microarray data according to class discrimination scores. An index was computed based on the difference between the mean expression of each gene in the two groups. This clustering approach provided the possibility of comparing thousands of genes simultaneously. The applied approach compared the mean of 7,129 genes in smokers and non-smokers simultaneously and classified the genes of large airway epithelium cells which were differentially expressed in smokers compared with non-smokers. Seven genes were identified whose expression differed most between smokers and non-smokers: NQO1, H19, ALDH3A1, AKR1C1, ABHD2, GPX2 and ADH7. Most (NQO1, ALDH3A1, AKR1C1, H19 and GPX2) are known to be clinically notable in lung cancer studies. Furthermore, statistical discriminant analysis showed that these genes could classify samples as smokers or non-smokers correctly with 100% accuracy. In the resulting GSOM map, other nodes with high average discrimination scores included genes with alterations strongly related to lung cancer, such as AKR1C3, CYP1B1, UCHL1 and AKR1B10. This clustering, by comparing expression of thousands of genes at the same time, revealed alterations in normal smokers. Most of the identified genes were strongly relevant to lung cancer in the existing literature. The genes may be utilized to identify smokers with increased risk for lung cancer. A large sample study is now
Optical mapping and sequencing of the Escherichia coli KO11 genome reveal extensive chromosomal rearrangements, and multiple tandem copies of the Zymomonas mobilis pdc and adhB genes.

PubMed

Turner, Peter C; Yomano, Lorraine P; Jarboe, Laura R; York, Sean W; Baggett, Christy L; Moritz, Brélan E; Zentz, Emily B; Shanmugam, K T; Ingram, Lonnie O

2012-04-01

Escherichia coli KO11 (ATCC 55124) was engineered in 1990 to produce ethanol by chromosomal insertion of the Zymomonas mobilis pdc and adhB genes into E. coli W (ATCC 9637). KO11FL, our current laboratory version of KO11, and its parent E. coli W were sequenced, and contigs assembled into genomic sequences using optical NcoI restriction maps as templates. E. coli W contained plasmids pRK1 (102.5 kb) and pRK2 (5.4 kb), but KO11FL only contained pRK2. KO11FL optical maps made with AflII and with BamHI showed a tandem repeat region, consisting of at least 20 copies of a 10-kb unit. The repeat region was located at the insertion site for the pdc, adhB, and chloramphenicol-resistance genes. Sequence coverage of these genes was about 25-fold higher than average, consistent with amplification of the foreign genes that were inserted as circularized DNA. Selection for higher levels of chloramphenicol resistance originally produced strains with higher pdc and adhB expression, and hence improved fermentation performance, by increasing the gene copy number. Sequence data for an earlier version of KO11, ATCC 55124, indicated that multiple copies of pdc adhB were present. Comparison of the W and KO11FL genomes showed large inversions and deletions in KO11FL, mostly enabled by IS10, which is absent from W but present at 30 sites in KO11FL. The early KO11 strain ATCC 55124 had no rearrangements, contained only one IS10, and lacked most accumulated single nucleotide polymorphisms (SNPs) present in KO11FL. Despite rearrangements and SNPs in KO11FL, fermentation performance was equal to that of ATCC 55124.
Transient Overexpression of adh8a Increases Allyl Alcohol Toxicity in Zebrafish Embryos

PubMed Central

Klüver, Nils; Ortmann, Julia; Paschke, Heidrun; Renner, Patrick; Ritter, Axel P.; Scholz, Stefan

2014-01-01

Fish embryos are widely used as an alternative model to study toxicity in vertebrates. Due to their complexity, embryos are believed to resemble an adult organism more closely than in vitro cellular models. However, concerns have been raised with respect to the embryo's metabolic capacity. We recently identified allyl alcohol, an industrial chemical, to be several orders of magnitude less toxic to zebrafish embryos than to adult zebrafish (embryo LC50 = 478 mg/L vs. fish LC50 = 0.28 mg/L). Reports on mammals have indicated that allyl alcohol requires activation by alcohol dehydrogenases (Adh) to form the highly reactive and toxic metabolite acrolein, which shows similar toxicity in zebrafish embryos and adults. To determine whether a limited metabolic capacity of embryos can indeed explain the low allyl alcohol sensitivity of zebrafish embryos, we compared the mRNA expression levels of Adh isoenzymes (adh5, adh8a, adh8b and adhfe1) during embryo development to that in adult fish. The greatest difference between embryo and adult fish was found for adh8a and adh8b expression. Therefore, we hypothesized that these genes might be required for allyl alcohol activation. Microinjection of adh8a, but not adh8b, mRNA led to a significant increase of allyl alcohol toxicity in embryos similar to levels reported for adults (LC50 = 0.42 mg/L in adh8a mRNA-injected embryos). Furthermore, GC/MS analysis of adh8a-injected embryos indicated a significant decline of internal allyl alcohol concentrations from 0.23-58 ng/embryo to levels below the limit of detection (< 4.6 µg/L). Injection of neither adh8b nor gfp mRNA had an impact on internal allyl alcohol levels, supporting the conclusion that the increased allyl alcohol toxicity was mediated by an increase in its metabolism. These results underline the necessity to critically consider metabolic activation in the zebrafish embryo. As demonstrated here, mRNA injection is one useful approach to study the role of candidate enzymes involved in
Ethylene-responsive transcription factors interact with promoters of ADH and PDC involved in persimmon (Diospyros kaki) fruit de-astringency

PubMed Central

Min, Ting; Yin, Xue-ren; Chen, Kun-song

2012-01-01

The persimmon fruit is a particularly good model for studying fruit response to hypoxia, in particular, the hypoxia-response ERF (HRE) genes. An anaerobic environment reduces fruit astringency by converting soluble condensed tannins (SCTs) into an insoluble form. Although the physiology of de-astringency has been widely studied, its molecular control is poorly understood. Both CO2 and ethylene treatments efficiently removed the astringency from ‘Mopan’ persimmon fruit, as indicated by a decrease in SCTs. Acetaldehyde, the putative agent for causing de-astringency, accumulated during these treatments, as did activities of the key enzymes of acetaldehyde synthesis, alcohol dehydrogenase (ADH), and pyruvate decarboxylase (PDC). Eight DkADH and DkPDC genes were isolated, and three candidates for a role in de-astringency, DkADH1, DkPDC1, and DkPDC2, were characterized by transcriptional analysis in different tissues. The significance of these specific isoforms was confirmed by principal component analysis. Transient expression in leaf tissue showed that DkPDC2 decreased SCTs. Interactions of six hypoxia-responsive ERF genes and target promoters were tested in transient assays. The results indicated that two hypoxia-responsive ERF genes, DkERF9 and DkERF10, were involved in separately regulating the DkPDC2 and DkADH1 promoters. It is proposed that a DkERF–DkADH/DkPDC cascade is involved in regulating persimmon de-astringency. PMID:23095993
Efficient transcription of the glycolytic gene ADH1 and three translational component genes requires the GCR1 product, which can act through TUF/GRF/RAP binding sites.

PubMed Central

Santangelo, G M; Tornow, J

1990-01-01

Glycolytic gene expression in Saccharomyces cerevisiae is thought to be activated by the GCR and TUF proteins. We tested the hypothesis that GCR function is mediated by TUF/GRF/RAP binding sites (UASRPG elements). We found that UASRPG-dependent activation of a heterologous gene and transcription of ADH1, TEF1, TEF2, and RP59 were sensitive to GCR1 disruption. GCR is not required for TUF/GRF/RAP expression or in vitro DNA-binding activity. PMID:2405258
Expression pattern, ethanol-metabolizing activities, and cellular localization of alcohol and aldehyde dehydrogenases in human large bowel: association of the functional polymorphisms of ADH and ALDH genes with hemorrhoids and colorectal cancer.

PubMed

Chiang, Chien-Ping; Jao, Shu-Wen; Lee, Shiao-Pieng; Chen, Pei-Chi; Chung, Chia-Chi; Lee, Shou-Lun; Nieh, Shin; Yin, Shih-Jiun

2012-02-01

Alcohol dehydrogenase (ADH) and aldehyde dehydrogenase (ALDH) are principal enzymes responsible for metabolism of ethanol. Functional polymorphisms of ADH1B, ADH1C, and ALDH2 genes occur among racial populations. The goal of this study was to systematically determine the functional expressions and cellular localization of ADHs and ALDHs in human rectal mucosa, the lesions of adenocarcinoma and hemorrhoid, and the genetic association of allelic variations of ADH and ALDH with large bowel disorders. Twenty-one surgical specimens of rectal adenocarcinoma and the adjacent normal mucosa, including 16 paired tissues of rectal tumor, normal mucosae of rectum and sigmoid colon from the same individuals, and 18 surgical mixed hemorrhoid specimens and leukocyte DNA samples from 103 colorectal cancer patients, 67 hemorrhoid patients, and 545 control subjects recruited in a previous study, were investigated. The isozyme/allozyme expression patterns of ADH and ALDH were identified by isoelectric focusing and the activities were assayed spectrophotometrically. The protein contents of ADH/ALDH isozymes were determined by immunoblotting using the corresponding purified class-specific antibodies; the cellular activity and protein localizations were detected by immunohistochemistry and histochemistry, respectively. Genotypes of ADH1B, ADH1C, and ALDH2 were determined by polymerase chain reaction-restriction fragment length polymorphisms. At 33 mM ethanol, pH 7.5, the activity of the ADH1C*1/*1 phenotypes was 87% higher than that of the ADH1C*1/*2 phenotypes in normal rectal mucosa. The activity of ALDH2-active phenotypes of rectal mucosa was 33% greater than that of ALDH2-inactive phenotypes at 200 μM acetaldehyde. The protein contents in normal rectal mucosa were in the following order: ADH1>ALDH2>ADH3≈ALDH1A1, whereas those of ADH2, ADH4, and ALDH3A1 were fairly low. Both activity and content of ADH1 were significantly decreased in rectal tumors, whereas the ALDH activity remained
Effect of the allelic variant of alcohol dehydrogenase ADH1B*2 on ethanol metabolism.

PubMed

Kang, Gaeun; Bae, Kyung-Yeol; Kim, Sung-Wan; Kim, Jin; Shin, Hee-Young; Kim, Jae-Min; Shin, Il-Seon; Yoon, Jin-Sang; Kim, Jong-Keun

2014-06-01

The ADH1B*2 allele is known to have a protective effect against the development of alcohol dependence. However, the protection mechanism is still unknown. We investigated whether ADH1B gene polymorphism affects ethanol (EtOH) metabolism. In a parent study, we conducted a randomized crossover trial on 24 healthy male subjects who were selected by genotyping: 12 with ALDH2*1/*1 (active form) and 12 with ALDH2*1/*2 (inactive form). In the present study, the 24 subjects were reclassified into 2 groups of 11 with ADH1B*1/*2 and 13 with ADH1B*2/*2 according to the ADH1B genotypes. Each subject was administered 1 of 3 doses of EtOH (0.25, 0.5, 0.75 g/kg) or a placebo in 4 trials. After the administration of alcohol, blood EtOH and acetaldehyde concentrations were measured 9 times over 4 hours. In the case of EtOH, the area under the concentration-time curve from 0 to 4 hours (AUC0-4) and the peak blood concentration of EtOH (Cmax) in subjects with ADH1B*2/*2 were significantly higher than those in subjects with ADH1B*1/*2 at all 3 dosages before stratifying by ALDH2 genotype. However, after stratifying by ALDH2 genotype, a statistically significant difference between ADH1B*2/*2 and ADH1B*1/*2 was found only at the 0.5 g/kg dosage regardless of ALDH2 genotype. In the case of acetaldehyde, the AUC0-4 and Cmax of acetaldehyde of ADH1B*2/*2 after administration of 0.25 g/kg alcohol and the AUC0-4 of acetaldehyde of ADH1B*2/*2 at 0.5 g/kg were significantly higher than corresponding values of ADH1B*1/*2 only in the group of ALDH2*1/*2. Our findings indicate that the blood EtOH concentrations of the ADH1B*2/*2 group are higher than those of the ADH1B*1/*2 group regardless of ALDH2 genotype, and the blood acetaldehyde concentrations of ADH1B*2/*2 are also higher than those of ADH1B*1/*2 only in the ALDH2*1/*2 group. To our knowledge, this is the first report to demonstrate the association of ADH1B*2 allele with blood EtOH and acetaldehyde levels in humans, and these results
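The two pharmacokinetic summaries named in this abstract, AUC0-4 and Cmax, can be illustrated with a minimal Python sketch. It assumes the linear trapezoidal rule, which the abstract does not confirm as the authors' method. The sampling times and concentration values are hypothetical placeholders (the abstract states only that there were 9 measurements over 4 hours), and the function names are illustrative.

```python
def auc_trapezoid(times_h, conc):
    """Area under a concentration-time curve by the linear trapezoidal rule."""
    if len(times_h) != len(conc) or len(times_h) < 2:
        raise ValueError("need two or more paired time/concentration points")
    area = 0.0
    for i in range(1, len(times_h)):
        dt = times_h[i] - times_h[i - 1]
        if dt <= 0:
            raise ValueError("sampling times must be strictly increasing")
        # Each interval contributes the area of one trapezoid.
        area += dt * (conc[i] + conc[i - 1]) / 2.0
    return area


def c_max(conc):
    """Peak observed concentration."""
    return max(conc)


# Hypothetical schedule: 9 draws over 4 hours (actual times are not given in the abstract).
times_h = [0.0, 0.25, 0.5, 0.75, 1.0, 1.5, 2.0, 3.0, 4.0]
# Hypothetical blood ethanol values in arbitrary units, for illustration only.
ethanol = [0.0, 0.20, 0.42, 0.55, 0.60, 0.52, 0.40, 0.22, 0.08]

print("AUC0-4:", auc_trapezoid(times_h, ethanol))
print("Cmax:", c_max(ethanol))
```

Comparing genotype groups would then mean computing these two numbers per subject and dose and testing the group difference, which is outside this sketch.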
Meta-analysis of ADH1B and ALDH2 polymorphisms and esophageal cancer risk in China.

PubMed

Zhang, Guo-Hong; Mai, Rui-Qin; Huang, Bo

2010-12-21

To evaluate whether the alcohol dehydrogenase-1B (ADH1B) His47Arg and aldehyde dehydrogenase-2 (ALDH2) Glu487Lys polymorphisms are involved in esophageal squamous cell carcinoma (ESCC) risk in the Chinese Han population. Seven studies of ADH1B and ALDH2 genotypes in the Chinese Han population, comprising 1450 cases and 2459 controls, were included for meta-analysis. Stratified analyses were carried out to determine the gene-alcohol and gene-gene interaction with ESCC risk. Potential sources of heterogeneity between studies were explored, and publication bias was also evaluated. Individuals with the ADH1B arginine (Arg)/Arg genotype showed a 3.95-fold increased ESCC risk in the recessive genetic model [Arg/Arg vs Arg/histidine (His) + His/His: odds ratio (OR) = 3.95, 95% confidence interval (CI): 2.76-5.67]. A significant association was found in the dominant model for the ALDH2 lysine (Lys) allele [glutamate (Glu)/Lys + Lys/Lys vs Glu/Glu: OR = 2.00, 95% CI: 1.54-2.61]. Compared with the non-alcoholics, Arg/Arg (OR = 25.20, 95% CI: 10.87-53.44) and Glu/Lys + Lys/Lys (OR = 21.47, 95% CI: 6.44-71.59) were found to interact with alcohol drinking to increase the ESCC risk. ADH1B Arg+ and ALDH2 Lys+ had a higher risk for ESCC (OR = 7.09, 95% CI: 2.16-23.33). The genetic variations of ADH1B His47Arg and ALDH2 Glu487Lys are susceptibility loci for ESCC in the Chinese Han population and interact substantially with alcohol consumption. Individuals carrying both risk genotypes have a higher baseline risk of ESCC.
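The odds ratios and 95% confidence intervals reported above can be illustrated with a minimal Python sketch. It assumes the standard log (Woolf) interval for a single 2x2 table and a simple inverse-variance fixed-effect pooling of log odds ratios. The abstract does not say which pooling model the authors used, so this is an illustration rather than their method. The counts are hypothetical and the function names are illustrative.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """OR and CI for one 2x2 table.

    a, b: cases with / without the risk genotype
    c, d: controls with / without the risk genotype
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the log method")
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return math.exp(log_or), math.exp(log_or - z * se), math.exp(log_or + z * se)


def pooled_odds_ratio(tables, z=1.96):
    """Inverse-variance fixed-effect pooling of log odds ratios (an assumed model)."""
    weighted_sum = 0.0
    weight_total = 0.0
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        weight = 1.0 / (1 / a + 1 / b + 1 / c + 1 / d)  # 1 / variance of log OR
        weighted_sum += weight * log_or
        weight_total += weight
    pooled = weighted_sum / weight_total
    se = math.sqrt(1.0 / weight_total)
    return math.exp(pooled), math.exp(pooled - z * se), math.exp(pooled + z * se)


# Hypothetical study tables (cases+, cases-, controls+, controls-); not data from the paper.
studies = [(20, 80, 10, 90), (35, 115, 18, 132)]
print(odds_ratio_ci(*studies[0]))
print(pooled_odds_ratio(studies))
```

Pooling narrows the interval because the weights add, which is why a meta-analysis of seven studies can give tighter bounds than any single study.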
Clustering cancer gene expression data by projective clustering ensemble

PubMed Central

Yu, Xianxue; Yu, Guoxian

2017-01-01

Gene expression data analysis has paramount implications for gene treatments, cancer diagnosis and other domains. Clustering is an important and promising tool to analyze gene expression data. Gene expression data is often characterized by a large number of genes but limited samples, thus various projective clustering techniques and ensemble techniques have been suggested to address these challenges. However, it is rather challenging to combine these two kinds of techniques to avoid the curse of dimensionality problem and to boost the performance of gene expression data clustering. In this paper, we employ a projective clustering ensemble (PCE) to integrate the advantages of projective clustering and ensemble clustering, and to avoid the dilemma of combining multiple projective clusterings. Our experimental results on publicly available cancer gene expression data show that PCE can improve the quality of clustering gene expression data by at least 4.5% (on average) over other related techniques, including dimensionality reduction based single clustering and ensemble approaches. The empirical study demonstrates that, to further boost the performance of clustering cancer gene expression data, it is necessary and promising to combine projective clustering with ensemble clustering. PCE can serve as an effective alternative technique for clustering gene expression data. PMID:28234920
Diametrical clustering for identifying anti-correlated gene clusters.

PubMed

Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

2003-09-01

Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive: genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster, each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteasome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
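The two alternating steps described in this abstract can be sketched in Python with NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation. It assumes each gene profile is mean-centered and scaled to unit length, and that a gene is assigned to the prototype with which it has the largest squared dot product, so that strongly correlated and strongly anti-correlated genes land in the same cluster. The function name, the `init_labels` parameter and the empty-cluster handling are illustrative choices.

```python
import numpy as np


def diametrical_clustering(X, k, init_labels=None, n_iter=50, seed=0):
    """Cluster rows of X (genes x conditions) so that anti-correlated genes co-cluster."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    # Assumed preprocessing: center each gene profile and scale it to unit length.
    X = X - X.mean(axis=1, keepdims=True)
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    X = X / np.where(norms == 0, 1.0, norms)

    if init_labels is None:
        labels = rng.integers(0, k, size=X.shape[0])
    else:
        labels = np.asarray(init_labels)
    prototypes = np.zeros((k, X.shape[1]))

    for _ in range(n_iter):
        # Step (ii): the prototype is the dominant right singular vector of each cluster.
        for j in range(k):
            members = X[labels == j]
            if len(members) == 0:
                # Illustrative fallback: re-seed an empty cluster with a random gene.
                prototypes[j] = X[rng.integers(0, X.shape[0])]
                continue
            _, _, vt = np.linalg.svd(members, full_matrices=False)
            prototypes[j] = vt[0]
        # Step (i): re-partition by squared similarity, which ignores the sign.
        new_labels = np.argmax((X @ prototypes.T) ** 2, axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels, prototypes


# Hypothetical toy data: two expression patterns plus their mirror images.
p1 = np.array([1.0, -1.0, 1.0, -1.0])
p2 = np.array([1.0, 1.0, -1.0, -1.0])
toy = np.array([3 * p1 + 5, -2 * p1 + 1, p1, -p1, 2 * p2 + 4, -p2, 4 * p2, -3 * p2 - 2])
labels, prototypes = diametrical_clustering(toy, k=2, init_labels=[0, 0, 0, 1, 1, 1, 1, 0])
print(labels)
```

Squaring the similarity is what separates this from correlation-based k-means: a profile and its mirror image score identically against every prototype, so they always receive the same label.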
Calcilytic Ameliorates Abnormalities of Mutant Calcium-Sensing Receptor (CaSR) Knock-In Mice Mimicking Autosomal Dominant Hypocalcemia (ADH).

PubMed

Dong, Bingzi; Endo, Itsuro; Ohnishi, Yukiyo; Kondo, Takeshi; Hasegawa, Tomoka; Amizuka, Norio; Kiyonari, Hiroshi; Shioi, Go; Abe, Masahiro; Fukumoto, Seiji; Matsumoto, Toshio

2015-11-01

Activating mutations of calcium-sensing receptor (CaSR) cause autosomal dominant hypocalcemia (ADH). ADH patients develop hypocalcemia, hyperphosphatemia, and hypercalciuria, similar to the clinical features of hypoparathyroidism. The current treatment of ADH is similar to that of other forms of hypoparathyroidism, using active vitamin D3 or parathyroid hormone (PTH). However, these treatments aggravate hypercalciuria and renal calcification. Thus, new therapeutic strategies for ADH are needed. Calcilytics are allosteric antagonists of CaSR, and may be effective for the treatment of ADH caused by activating mutations of CaSR. In order to examine the effect of calcilytic JTT-305/MK-5442 on CaSR harboring activating mutations in the extracellular and transmembrane domains in vitro, we first transfected a mutated CaSR gene into HEK cells. JTT-305/MK-5442 suppressed the hypersensitivity to extracellular Ca(2+) of HEK cells transfected with the CaSR gene with activating mutations in the extracellular and transmembrane domains. We then selected two activating mutations located in the extracellular (C129S) and transmembrane (A843E) domains, and generated two strains of CaSR knock-in mice to build an ADH mouse model. Both mutant mice mimicked almost all the clinical features of human ADH. JTT-305/MK-5442 treatment in vivo increased urinary cAMP excretion, improved serum and urinary calcium and phosphate levels by stimulating endogenous PTH secretion, and prevented renal calcification. In contrast, PTH(1-34) treatment normalized serum calcium and phosphate but could not reduce hypercalciuria or renal calcification. CaSR knock-in mice exhibited low bone turnover due to the deficiency of PTH, and JTT-305/MK-5442 as well as PTH(1-34) increased bone turnover and bone mineral density (BMD) in these mice. These results demonstrate that calcilytics can reverse almost all the phenotypes of ADH including hypercalciuria and renal calcification, and suggest that calcilytics can become a
Alcohol dehydrogenase ADH2-1 and ADH2-2 allelic isoforms in the Russian population correlate with type of alcoholic disease.

PubMed

Ogurtsov, Pavel P.; Garmash, Irina V.; Miandina, Galina I.; Guschin, Alexander E.; Itkes, Alexander V.; Moiseev, Valentin S.

2001-09-01

The frequency of the ADH2-2 allele in the Moscow urban population and the correlation between the ADH2-2 allele, alcoholic dependence without cirrhosis, symptomatic alcoholic cirrhosis and hepatitis B and C infection status have been studied. One hundred and twenty-three inhabitants of Moscow (50 healthy donors, 36 patients with alcoholic cirrhosis (subdivided into infected and uninfected by HBV and/or HCV) and 37 patients with alcoholic dependence) of a similar age/sex and drinking pattern have been analysed. A frequency of 41% for the ADH2-2 allele is characteristic for the urban Moscow population. This value is intermediate between that found for Asian peoples and for Central and Western Europe. There is a negative correlation between the ADH2-2 allele and alcohol misuse (both alcoholic dependence and alcoholic cirrhosis). This correlation is more pronounced in alcoholic dependence. In spite of the possession of the ADH2-2 allele (or genotype ADH2-1/2), alcohol misuse increases the risk of cirrhosis. At the same time, positive status for active hepatitis B, C or combined infection B + C (replication markers HBV-DNA or HCV-RNA) increases the risk for symptomatic alcoholic cirrhosis in alcohol abusing patients, independently of ADH2 genotype.
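An allele frequency such as the 41% reported above is obtained by counting alleles across genotypes. The short Python sketch below shows that arithmetic for a biallelic locus. The genotype counts are hypothetical, since the abstract does not give them, and the function name is illustrative.

```python
def allele_frequency(n_11, n_12, n_22):
    """Frequency of allele 2 at a biallelic locus from genotype counts.

    n_11, n_12, n_22: numbers of 1/1 homozygotes, 1/2 heterozygotes, 2/2 homozygotes.
    """
    total_alleles = 2 * (n_11 + n_12 + n_22)
    if total_alleles == 0:
        raise ValueError("no genotyped individuals")
    # Each heterozygote carries one copy of allele 2, each 2/2 homozygote carries two.
    return (n_12 + 2 * n_22) / total_alleles


# Hypothetical genotype counts for illustration only; not data from the study.
print(allele_frequency(n_11=36, n_12=48, n_22=16))
```

Note that allele frequency differs from carrier frequency: in the hypothetical counts above, 64 of 100 individuals carry at least one copy, while the allele frequency is lower.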
[ADH/D and impulsiveness: Prevalence of impulse control disorders and other comorbidities, in 81 adults with attention deficit/hyperactivity disorder (ADH/D)].

PubMed

Porteret, R; Bouchez, J; Baylé, F J; Varescon, I

2016-04-01

Attention deficit hyperactivity disorder (ADH/D) is a neuropsychological developmental disorder characterized by pervasive and impairing symptoms of inattention, hyperactivity, and impulsivity. Whereas it is well known in children, there is still little information about ADH/D in adults, including prevalence. Indeed, there are currently no epidemiological studies in France, despite the considerable impact of this disorder on a patient's professional and affective life. Moreover, ADH/D rarely occurs in isolation, and many comorbidities often complicate the diagnostic investigation. It is well known that the so-called ADH/D is composed of two main categories of symptoms (Attentional Disorder/Hyperactivity Disorder), but impulsiveness also remains a major symptom. The aim of this study was to evaluate not only the prevalence of Impulse Control Disorders (ICD) but also psychological and addictive comorbidities among adult patients with ADH/D. A total of 100 patients from specialized consultations for adult ADH/D were evaluated in this study, but only 81 were included after presenting all the clinical criteria of ADH/D. We used the DSM-IV-TR for ADH/D, the Minnesota Impulsive Disorders Interview, a semi-structured clinical interview assessing impulse control disorders (ICD) (compulsive buying, trichotillomania, compulsive sexual behaviour, kleptomania, pyromania and intermittent explosive disorder), and the Mini International Neuropsychiatric Interview in order to evaluate psychiatric and addictive comorbidities. More than 90% of the patients met the early-onset criterion of ADH/D (before 7 years of age). More than half of the patients presented a mixed type of ADH/D (both inattentive and hyperactive-impulsive forms): 55.6% vs 44.4% for the inattentive type. The vast majority of patients showed a complete form (with a total of 6 or more symptoms out of 9, of inattentive and/or impulsive-hyperactivity category): 93.8% and only 6.2% presented a sub-syndromic form of ADH/D (with
Molecular evolution of Adh and LEAFY and the phylogenetic utility of their introns in Pyrus (Rosaceae)

PubMed Central

2011-01-01

Background The genus Pyrus belongs to the tribe Pyreae (the former subfamily Maloideae) of the family Rosaceae, and includes one of the most important commercial fruit crops, pear. The phylogeny of Pyrus has not been definitively reconstructed. In our previous efforts, the internal transcribed spacer region (ITS) revealed a poorly resolved phylogeny due to non-concerted evolution of nrDNA arrays. Therefore, introns of low copy nuclear genes (LCNG) are explored here for improved resolution. However, paralogs and lineage sorting are still two challenges for applying LCNGs in phylogenetic studies, and at least two independent nuclear loci should be compared. In this work the second intron of LEAFY and the alcohol dehydrogenase gene (Adh) were selected to investigate their molecular evolution and phylogenetic utility. Results DNA sequence analyses revealed a complex ortholog and paralog structure of Adh genes in Pyrus and Malus, the pears and apples. Comparisons between sequences from RT-PCR and genomic PCR indicate that some Adh homologs are putatively nonfunctional. A partial region of Adh1 was sequenced for 18 Pyrus species and three subparalogs representing Adh1-1 were identified. These led to poorly resolved phylogenies due to low sequence divergence and the inclusion of putative recombinants. For the second intron of LEAFY, multiple inparalogs were discovered for both LFY1int2 and LFY2int2. LFY1int2 is inadequate for phylogenetic analysis due to lineage sorting of two inparalogs. LFY2int2-N, however, showed a relatively high sequence divergence and led to the best-resolved phylogeny. This study documents the coexistence of outparalogs and inparalogs, and lineage sorting of these paralogs and orthologous copies. It reveals putative recombinants that can lead to incorrect phylogenetic inferences, and presents an improved phylogenetic resolution of Pyrus using LFY2int2-N. Conclusions Our study represents the first phylogenetic analyses based on LCNGs in Pyrus
Transcriptomic Identification of ADH1B as a Novel Candidate Gene for Obesity and Insulin Resistance in Human Adipose Tissue in Mexican Americans from the Veterans Administration Genetic Epidemiology Study (VAGES)

PubMed Central

Winnier, Deidre A.; Fourcaudot, Marcel; Norton, Luke; Abdul-Ghani, Muhammad A.; Hu, Shirley L.; Farook, Vidya S.; Coletta, Dawn K.; Kumar, Satish; Puppala, Sobha; Chittoor, Geetha; Dyer, Thomas D.; Arya, Rector; Carless, Melanie; Lehman, Donna M.; Curran, Joanne E.; Cromack, Douglas T.; Tripathy, Devjit; Blangero, John; Duggirala, Ravindranath; Göring, Harald H. H.; DeFronzo, Ralph A.; Jenkinson, Christopher P.

2015-01-01

Type 2 diabetes (T2D) is a complex metabolic disease that is more prevalent in ethnic groups such as Mexican Americans, and is strongly associated with the risk factors obesity and insulin resistance. The goal of this study was to perform whole genome gene expression profiling in adipose tissue to detect common patterns of gene regulation associated with obesity and insulin resistance. We used phenotypic and genotypic data from 308 Mexican American participants from the Veterans Administration Genetic Epidemiology Study (VAGES). Basal fasting RNA was extracted from adipose tissue biopsies from a subset of 75 unrelated individuals, and gene expression data generated on the Illumina BeadArray platform. The number of gene probes with significant expression above baseline was approximately 31,000. We performed multiple regression analysis of all probes with 15 metabolic traits. Adipose tissue had 3,012 genes significantly associated with the traits of interest (false discovery rate, FDR ≤ 0.05). The significance of gene expression changes was used to select 52 genes with significant (FDR ≤ 10⁻⁴) gene expression changes across multiple traits. Gene sets/Pathways analysis identified one gene, alcohol dehydrogenase 1B (ADH1B) that was significantly enriched (P < 10⁻⁶⁰) as a prime candidate for involvement in multiple relevant metabolic pathways. Illumina BeadChip derived ADH1B expression data was consistent with quantitative real time PCR data. We observed significant inverse correlations with waist circumference (2.8 × 10⁻⁹), BMI (5.4 × 10⁻⁶), and fasting plasma insulin (P < 0.001). These findings are consistent with a central role for ADH1B in obesity and insulin resistance and provide evidence for a novel genetic regulatory mechanism for human metabolic diseases related to these traits. PMID:25830378
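
The FDR cutoffs quoted in this abstract can be illustrated with a minimal sketch of the Benjamini-Hochberg step-up procedure. This assumes Benjamini-Hochberg was the FDR method, which the abstract does not state; the function name `benjamini_hochberg` and the p-values are hypothetical and are not taken from the VAGES data.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return one boolean per p-value: True if rejected at FDR level alpha.

    Assumption: Benjamini-Hochberg step-up procedure (the abstract does not
    name the FDR method that was actually used).
    """
    m = len(p_values)
    # Indices sorted by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Largest rank k such that p_(k) <= (k / m) * alpha.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_k = rank
    # Reject every hypothesis ranked at or below max_k.
    rejected = [False] * m
    for idx in order[:max_k]:
        rejected[idx] = True
    return rejected


if __name__ == "__main__":
    toy_p = [0.001, 0.008, 0.039, 0.041, 0.20, 0.74]  # hypothetical p-values
    print(benjamini_hochberg(toy_p, alpha=0.05))
```

In a probe-level analysis like the one described, each probe-trait regression would contribute one p-value to the input list.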
The Natural History of Class I Primate Alcohol Dehydrogenases Includes Gene Duplication, Gene Loss, and Gene Conversion

PubMed Central

Carrigan, Matthew A.; Uryasev, Oleg; Davis, Ross P.; Zhai, LanMin; Hurley, Thomas D.; Benner, Steven A.

2012-01-01

Background Gene duplication is a source of molecular innovation throughout evolution. However, even with massive amounts of genome sequence data, correlating gene duplication with speciation and other events in natural history can be difficult. This is especially true in its most interesting cases, where rapid and multiple duplications are likely to reflect adaptation to rapidly changing environments and lifestyles. This may be so for Class I of alcohol dehydrogenases (ADH1s), where multiple duplications occurred in primate lineages in Old and New World monkeys (OWMs and NWMs) and hominoids. Methodology/Principal Findings To build a preferred model for the natural history of ADH1s, we determined the sequences of nine new ADH1 genes, finding for the first time multiple paralogs in various prosimians (lemurs, strepsirhines). Database mining then identified novel ADH1 paralogs in both macaque (an OWM) and marmoset (a NWM). These were used with the previously identified human paralogs to resolve controversies relating to dates of duplication and gene conversion in the ADH1 family. Central to these controversies are differences in the topologies of trees generated from exonic (coding) sequences and intronic sequences. Conclusions/Significance We provide evidence that gene conversions are the primary source of difference, using molecular clock dating of duplications and analyses of microinsertions and deletions (micro-indels). The tree topology inferred from intron sequences appears to more correctly represent the natural history of ADH1s, with the ADH1 paralogs in platyrrhines (NWMs) and catarrhines (OWMs and hominoids) having arisen by duplications shortly predating the divergence of OWMs and NWMs. We also conclude that paralogs in lemurs arose independently. Finally, we identify errors in database interpretation as the source of controversies concerning gene conversion. These analyses provide a model for the natural history of ADH1s that posits four ADH1 paralogs in
Meta-Analyses of ALDH2 and ADH1B with Alcohol Dependence in Asians

ERIC Educational Resources Information Center

Luczak, Susan E.; Glatt, Stephen J.; Wall, Tamara J.

2006-01-01

Meta-analyses were conducted to determine the magnitude of relationships between polymorphisms in 2 genes, ALDH2 and ADH1B, with alcohol dependence in Asians. For each gene, possession of 1 variant *2 allele was protective against alcohol dependence, and possession of a 2nd *2 allele did not offer significant additional…
Association between ADH1C and ALDH2 polymorphisms and alcoholism in a Turkish sample.

PubMed

Ayhan, Yavuz; Gürel, Şeref Can; Karaca, Özgür; Zoto, Teuta; Hayran, Mutlu; Babaoğlu, Melih; Yaşar, Ümit; Bozkurt, Atilla; Dilbaz, Nesrin; Uluğ, Berna Diclenur; Demir, Başaran

2015-04-01

Polymorphisms in the genes encoding alcohol-metabolizing enzymes are associated with alcohol dependence. The aim was to evaluate the association between the alcohol dehydrogenase 1C (ADH1C) Ile350Val and aldehyde dehydrogenase 2 (ALDH2) Glu504Lys polymorphisms and alcohol dependence in a Turkish sample. A total of 235 individuals (115 alcohol-dependent patients and 120 controls) were genotyped for ADH1C and ALDH2 with PCR-RFLP (polymerase chain reaction-restriction fragment length polymorphism). The association between the polymorphisms and family history, and the daily and maximum amounts of alcohol consumed, was investigated. The associations between alcohol dependence, severity of consumption and family history and the polymorphisms were analyzed by chi-square or Fisher's exact test where necessary. The relationship between genotypes and dependence-related features was evaluated using analysis of variance (ANOVA). The -350Val allele for ADH1C (ADH1C*2) was increased in alcohol-dependent patients (P = 0.05). In individuals with a positive family history, the genotype distribution differed significantly (P = 0.031) and more patients carried the Val allele compared with controls (P = 0.025). Genotyping of 162 participants did not reveal the -504Lys allele in ALDH2. These findings suggest that ADH1C*2 is associated with alcohol dependence in the Turkish population, displaying a dominant inheritance model. The ADH1C*2 allele may contribute to the variance in heritability of alcohol dependence. The ALDH2 -504Lys/Lys or Glu/Lys genotypes were not present in alcohol-dependent patients, similar to that seen in European populations and in contrast to the findings in the Asian populations.
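
As an illustration of the Fisher's exact test mentioned in this abstract, the following is a minimal standard-library sketch of the two-sided test on a 2x2 table (for example, allele carriers vs non-carriers among patients and controls). The function name `fisher_exact_two_sided` and the counts are hypothetical; the study's genotype counts are not given in the abstract.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same margins
    that are no more likely than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


if __name__ == "__main__":
    # Hypothetical counts: rows = patients/controls, columns = carrier/non-carrier.
    print(fisher_exact_two_sided(12, 8, 5, 15))
```

The exact test is usually preferred over chi-square when expected cell counts are small, which is presumably what "where necessary" refers to.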

Functional characterization of SlscADH1, a fruit-ripening-associated short-chain alcohol dehydrogenase of tomato.

PubMed

Moummou, Hanane; Tonfack, Libert Brice; Chervin, Christian; Benichou, Mohamed; Youmbi, Emmanuel; Ginies, Christian; Latché, Alain; Pech, Jean-Claude; van der Rest, Benoît

2012-10-15

A tomato short-chain dehydrogenase-reductase (SlscADH1) is preferentially expressed in fruit with a maximum expression at the breaker stage while expression in roots, stems, leaves and flowers is very weak. It represents a potential candidate for the formation of aroma volatiles by interconverting alcohols and aldehydes. The SlscADH1 recombinant protein produced in Escherichia coli exhibited dehydrogenase-reductase activity towards several volatile compounds present in tomato flavour with a strong preference for the NAD/NADH co-factors. The strongest activity was observed for the reduction of hexanal (K(m)=0.175mM) and phenylacetaldehyde (K(m)=0.375mM) in the presence of NADH. The oxidation process of hexanol and 1-phenylethanol was much less efficient (K(m)s of 2.9 and 23.0mM, respectively), indicating that the enzyme preferentially acts as a reductase. However, activity was observed only for hexanal, phenylacetaldehyde, (E)-2-hexenal and acetaldehyde and the corresponding alcohols. No activity could be detected for other aroma volatiles important for tomato flavour, such as methyl-butanol/methyl-butanal, 5-methyl-6-hepten-2-one/5-methyl-6-hepten-2-ol, citronellal/citronellol, neral/nerol, geraniol. In order to assess the function of the SlscADH1 gene, transgenic plants have been generated using the technique of RNA interference (RNAi). Constitutive down-regulation using the 35S promoter resulted in the generation of dwarf plants, indicating that the SlscADH1 gene, although weakly expressed in vegetative tissues, had a function in regulating plant development. Fruit-specific down-regulation using the 2A11 promoter had no morphogenetic effect and did not alter the aldehyde/alcohol balance of the volatile compounds produced by the fruit. Nevertheless, SlscADH1-inhibited fruit unexpectedly accumulated higher concentrations of C5 and C6 volatile compounds of the lipoxygenase pathway, possibly as an indirect effect of the suppression of SlscADH1 on the catabolism of
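
To make the reported K(m) values concrete, here is a minimal sketch assuming simple Michaelis-Menten kinetics, v/Vmax = [S] / (Km + [S]). Only the four K(m) values come from the abstract; the function name `fractional_rate` and the substrate concentration are hypothetical. Vmax values are not reported in the abstract, so the sketch compares only how close each reaction is to saturation, not absolute rates.

```python
def fractional_rate(substrate_mM, km_mM):
    """Michaelis-Menten rate as a fraction of Vmax: [S] / (Km + [S])."""
    if substrate_mM < 0 or km_mM <= 0:
        raise ValueError("substrate must be >= 0 and Km must be > 0")
    return substrate_mM / (km_mM + substrate_mM)


# K(m) values (mM) as quoted in the abstract.
KM_MM = {
    "hexanal (reduction)": 0.175,
    "phenylacetaldehyde (reduction)": 0.375,
    "hexanol (oxidation)": 2.9,
    "1-phenylethanol (oxidation)": 23.0,
}

if __name__ == "__main__":
    substrate = 0.175  # hypothetical substrate concentration in mM
    for name, km in KM_MM.items():
        print(f"{name}: v/Vmax = {fractional_rate(substrate, km):.3f}")
```

At a substrate concentration equal to K(m) the enzyme runs at half of Vmax, so a lower K(m) means the reaction approaches saturation at lower substrate levels.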
The Alcohol Dehydrogenase Gene Family in Melon (Cucumis melo L.): Bioinformatic Analysis and Expression Patterns

PubMed Central

Jin, Yazhong; Zhang, Chong; Liu, Wei; Tang, Yufan; Qi, Hongyan; Chen, Hao; Cao, Songxiao

2016-01-01

Alcohol dehydrogenases (ADH), encoded by a multigene family in plants, play a critical role in plant growth, development, adaptation, fruit ripening and aroma production. Thirteen ADH genes were identified in the melon genome, including 12 ADHs and one formaldehyde dehydrogenase (FDH), designated CmADH1-12 and CmFDH1, of which CmADH1 and CmADH2 have been isolated in Cantaloupe. The ADH genes shared a low identity with each other at the protein level and had different intron-exon structures at the nucleotide level. No typical signal peptides were found in any of the CmADHs, and CmADH proteins might be located in the cytoplasm. The phylogenetic tree revealed that the 13 ADH genes were divided into three groups, namely the long-, medium-, and short-chain ADH subfamilies, and CmADH1,3-11, which belong to the medium-chain ADH subfamily, fell into six medium-chain ADH subgroups. CmADH12 may belong to the long-chain ADH subfamily, while CmFDH1 may be a Class III ADH and serve as an ancestral ADH in melon. Expression profiling revealed that CmADH1, CmADH2, CmADH10 and CmFDH1 were moderately or strongly expressed in different vegetative tissues and fruit at medium and late developmental stages, while CmADH8 and CmADH12 were highly expressed in fruit after 20 days. CmADH3 showed preferential expression in young tissues. CmADH4 only had slight expression in root. Promoter analysis revealed several motifs of CmADH genes involved in the gene expression modulated by various hormones, and the response patterns of CmADH genes to ABA, IAA and ethylene were different. These CmADHs were divided into ethylene-sensitive and -insensitive groups, and the functions of CmADHs were discussed. PMID:27242871
Metabolite Profiling of adh1 Mutant Response to Cold Stress in Arabidopsis

PubMed Central

Song, Yuan; Liu, Lijun; Wei, Yunzhu; Li, Gaopeng; Yue, Xiule; An, Lizhe

2017-01-01

As a result of global warming, vegetation suffers from repeated freeze-thaw cycles caused by more frequent short-term low temperatures induced by hail, snow, or night frost. Therefore, short-term freezing stress of plants should be investigated particularly in light of the current climatic conditions. Alcohol dehydrogenase (ADH) plays a central role in the metabolism of alcohols and aldehydes and it is a key enzyme in anaerobic fermentation. ADH1 responds to plant growth and environmental stress; however, the function of ADH1 in the response to short-term freezing stress remains unknown. Using real-time quantitative fluorescence PCR, the expression level of ADH1 was analyzed at low temperature (4°C). The lethal temperature was calculated based on the electrolyte leakage tests for both ADH1 deletion mutants (adh1) and wild type (WT) plants. To further investigate the relationship between ADH1 and cold tolerance in plants, low-Mr polar metabolite analyses of Arabidopsis adh1 and WT were performed at cold temperatures using gas chromatography-mass spectrometry. This investigation focused on freezing treatments (cold acclimation group: −6°C for 2 h with prior 4°C for 7 d, cold shock group: −6°C for 2 h without cold acclimation) and recovery (23°C for 24 h) with respect to seedling growth at optimum temperature. The experimental results revealed a significant increase in ADH1 expression during low temperature treatment (4°C) and at a higher lethal temperature in adh1 compared to that in the WT. Retention time indices and specific mass fragments were used to monitor 263 variables and annotate 78 identified metabolites. From these analyses, differences in the degree of metabolite accumulation between adh1 and WT were detected, including soluble sugars (e.g., sucrose) and amino acids (e.g., asparagine). In addition, the correlation-based network analysis highlighted some metabolites, e.g., melibiose, fumaric acid, succinic acid, glycolic acid, and xylose, which
Three alcohol dehydrogenase genes and one acetyl-CoA synthetase gene are responsible for ethanol utilization in Yarrowia lipolytica.

PubMed

Gatter, Michael; Ottlik, Stephanie; Kövesi, Zsolt; Bauer, Benjamin; Matthäus, Falk; Barth, Gerold

2016-10-01

The non-conventional yeast Yarrowia lipolytica is able to utilize a wide range of different substrates like glucose, glycerol, ethanol, acetate, proteins and various hydrophobic molecules. Although most metabolic pathways for the utilization of these substrates have been clarified by now, it was not clear whether ethanol is oxidized by alcohol dehydrogenases or by an alternative oxidation system inside the cell. In order to detect the genes that are required for ethanol utilization in Y. lipolytica, eight alcohol dehydrogenase (ADH) genes and one alcohol oxidase gene (FAO1) have been identified and respective deletion strains were tested for their ability to metabolize ethanol. As a result of this, we found that the availability of ADH1, ADH2 or ADH3 is required for ethanol utilization in Y. lipolytica. A strain with deletions in all three genes is lacking the ability to utilize ethanol as sole carbon source. Although Adh2p showed by far the highest enzyme activity in an in vitro assay, the availability of any of the three genes was sufficient to enable a decent growth. In addition to ADH1, ADH2 and ADH3, an acetyl-CoA synthetase encoding gene (ACS1) was found to be essential for ethanol utilization. As Y. lipolytica is a non-fermenting yeast, it is neither able to grow under anaerobic conditions nor to produce ethanol. To investigate whether Y. lipolytica may produce ethanol, the key genes of alcoholic fermentation in S. cerevisiae, ScADH1 and ScPDC1, were overexpressed in an ADH and an ACS1 deletion strain. However, instead of producing ethanol, the respective strains regained the ability to use ethanol as single carbon source and were still not able to grow under anaerobic conditions. Copyright © 2016 Elsevier Inc. All rights reserved.
Endogenous Methanol Regulates Mammalian Gene Activity

PubMed Central

Komarova, Tatiana V.; Petrunia, Igor V.; Shindyapina, Anastasia V.; Silachev, Denis N.; Sheshukova, Ekaterina V.; Kiryanov, Gleb I.; Dorokhov, Yuri L.

2014-01-01

We recently showed that methanol emitted by wounded plants might function as a signaling molecule for plant-to-plant and plant-to-animal communications. In mammals, methanol is considered a poison because the enzyme alcohol dehydrogenase (ADH) converts methanol into toxic formaldehyde. However, the detection of methanol in the blood and exhaled air of healthy volunteers suggests that methanol may be a chemical with specific functions rather than a metabolic waste product. Using a genome-wide analysis of the mouse brain, we demonstrated that an increase in blood methanol concentration led to a change in the accumulation of mRNAs from genes primarily involved in detoxification processes and regulation of the alcohol/aldehyde dehydrogenases gene cluster. To test the role of ADH in the maintenance of low methanol concentration in the plasma, we used the specific ADH inhibitor 4-methylpyrazole (4-MP) and showed that intraperitoneal administration of 4-MP resulted in a significant increase in the plasma methanol, ethanol and formaldehyde concentrations. Removal of the intestine significantly decreased the rate of methanol addition to the plasma and suggested that the gut flora may be involved in the endogenous production of methanol. ADH in the liver was identified as the main enzyme for metabolizing methanol because an increase in the methanol and ethanol contents in the liver homogenate was observed after 4-MP administration into the portal vein. Liver mRNA quantification showed changes in the accumulation of mRNAs from genes involved in cell signalling and detoxification processes. We hypothesized that endogenous methanol acts as a regulator of homeostasis by controlling the mRNA synthesis. PMID:24587296
Pichia stipitis Genes for Alcohol Dehydrogenase with Fermentative and Respiratory Functions

PubMed Central

Cho, Jae-yong; Jeffries, Thomas W.

1998-01-01

Two genes coding for isozymes of alcohol dehydrogenase (ADH), designated PsADH1 and PsADH2, have been identified and isolated from Pichia stipitis CBS 6054 genomic DNA by Southern hybridization to Saccharomyces cerevisiae ADH genes, and their physiological roles have been characterized through disruption. The amino acid sequences of the PsADH1 and PsADH2 isozymes are 80.5% identical to one another and are 71.9 and 74.7% identical to the S. cerevisiae ADH1 protein. They also show a high level of identity with the group I ADH proteins from Kluyveromyces lactis. The PsADH isozymes are presumably localized in the cytoplasm, as they do not possess the amino-terminal extension of mitochondrion-targeted ADHs. Gene disruption studies suggest that PsADH1 plays a major role in xylose fermentation because PsADH1 disruption results in a lower growth rate and profoundly greater accumulation of xylitol. Disruption of PsADH2 does not significantly affect ethanol production or aerobic growth on ethanol as long as PsADH1 is present. The PsADH1 and PsADH2 isozymes appear to be equivalent in the ability to convert ethanol to acetaldehyde, and either is sufficient to allow cell growth on ethanol. However, disruption of both genes blocks growth on ethanol. P. stipitis strains disrupted in either PsADH1 or PsADH2 still accumulate ethanol, although in different amounts, when grown on xylose under oxygen-limited conditions. The PsADH double disruptant, which is unable to grow on ethanol, still produces ethanol from xylose at about 13% of the rate seen in the parental strain. Thus, deletion of both PsADH1 and PsADH2 blocks ethanol respiration but not production, implying a separate path for fermentation. PMID:9546172
Polymorphisms in the promoter region of the human class II alcohol dehydrogenase (ADH4) gene affect both transcriptional activity and ethanol metabolism in Japanese subjects.

PubMed

Kimura, Yukiko; Nishimura, Fusae T; Abe, Shuntaro; Fukunaga, Tatsushige; Tanii, Hideji; Saijoh, Kiyofumi

2009-02-01

Class II alcohol dehydrogenase (pi-ADH), encoded by alcohol dehydrogenase (ADH4), is considered to contribute to ethanol (EtOH) oxidation in the liver at high concentration. Four single nucleotide polymorphisms (SNPs) were found in the promoter region of this gene. Analysis of genotype distribution in 102 unrelated Japanese subjects revealed that four loci were in strong linkage disequilibrium and could be classified into three haplotypes. The effects of these polymorphisms on transcriptional activity were investigated in HepG2 cells. Transcriptional activity was significantly higher in cells with the -136A allele than in those with the -136C allele. To investigate whether this difference in transcriptional activity caused a difference in EtOH elimination, previous data on blood EtOH changes after 0.4 g/kg body weight alcohol ingestion were analyzed. When analyzed based on aldehyde dehydrogenase-2 gene (ALDH2) (487)Glu/Lys genotype, the significantly lower level of EtOH at peak in subjects with -136C/A and -136A/A genotype compared with subjects with -136C/C genotype indicated that -136 bp was a suggestive locus for differences in EtOH oxidation. This effect was observed only in subjects with ALDH2 (487)Glu/Glu. These results suggested that the SNP at -136bp in the ADH4 promoter had an effect on transcriptional regulation, and that the higher activity of the -136A allele compared with the -136C allele caused a lower level of blood EtOH after alcohol ingestion; that is, individuals with the -136A allele may consume more EtOH and might have a higher risk for development of alcohol dependence than those without the -136A allele.
Deep analysis of N-cadherin/ADH-1 interaction: a computational survey.

PubMed

Eslami, Mahboobeh; Nezafat, Navid; Khajeh, Sahar; Mostafavi-Pour, Zohreh; Bagheri Novir, Samaneh; Negahdaripour, Manica; Ghasemi, Younes; Razban, Vahid

2018-01-19

Due to the considerable role of N-cadherin in cancer metastasis, tumor growth, and progression, inhibition of this protein has been highly regarded in recent years. Although ADH-1 has been known as an appropriate inhibitor of N-cadherin in clinical trials, its chemical nature and binding mode with N-cadherin have not been precisely specified yet. Accordingly, in this study, quantum mechanics calculations were used to investigate the chemical nature of ADH-1. These calculations clarify the molecular properties of ADH-1 and determine its reactive sites. Based on the results, the oxygen atoms are suitable for electrophilic reactivity, while the hydrogen atoms that are connected to nitrogen atoms are the favored sites for nucleophilic reactivity. The higher electronegativity of the oxygen atoms makes them the most reactive portions in this molecule. Molecular docking and molecular dynamics (MD) simulation have also been applied to specify the binding mode of ADH-1 with N-cadherin and determine the important residues of N-cadherin involved in the interaction with ADH-1. Moreover, the model verified by MD simulation has been studied to extract the free energy value and find driving forces. These calculations and the molecular electrostatic potential map of ADH-1 indicated that hydrophobic and electrostatic interactions are almost equally involved in the implantation of ADH-1 in the N-cadherin binding site. The presented results not only enable a closer examination of N-cadherin in complex with the ADH-1 molecule, but also are very beneficial in designing new inhibitors for N-cadherin and can help to save time and cost in this field.
Combined effect of ADH1B RS1229984, RS2066702 and ADH1C RS1693482/ RS698 alleles on alcoholism and chronic liver diseases.

PubMed

Tóth, Réka; Fiatal, Szilvia; Petrovski, Beáta; McKee, Martin; Adány, Róza

2011-01-01

The aim of this study was to analyze the combined effect of the most frequent alcohol dehydrogenase polymorphisms (Arg48His and Arg370Cys in ADH1B, Arg272Gln and Ile350Val in ADH1C) on alcohol use habits, alcohol dependence and chronic liver diseases in Hungary. The study included men aged 45-64 years. Altogether, 241 cases with chronic liver disease (CLD) and 666 randomly selected controls without CLD were analysed for all four polymorphisms. Associations between the polymorphisms, individually and in combination, and excessive and problem drinking and CLD were assessed using logistic regression. In this study we have identified a novel mutation, called ADH1B Arg370His. The ADH1C Arg272Gln and Ile350Val showed almost complete linkage. The 272Gln/350Val allele increased the risk of excessive and problem drinking in homozygous form (OR=1.582, p=0.035, CI=1.034-2.421, OR=1.780, p=0.016, CI=1.113-2.848, respectively). The joint analysis showed that when combined with the wild type ADH1C Arg272/Ile350 allele, the ADH1B 48His is protective against CLD (OR=0.368, p=0.019, CI=0.159-0.851). The results obtained in the study help not only to clarify the effects of different ADH SNPs but to better understand how these polymorphisms modify each other's effects in the development of alcoholism and related diseases.
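
The odds ratios and confidence intervals reported above come from logistic regression. As a simpler stand-in, the sketch below computes an unadjusted odds ratio with a Wald (Woolf) 95% confidence interval from a 2x2 table. This is a swapped-in technique, not the authors' model; the function name `odds_ratio_ci` and the counts are hypothetical and do not reproduce the study's figures.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval for a 2x2 table.

    a: exposed cases      b: exposed controls
    c: unexposed cases    d: unexposed controls
    All four counts must be positive for the log-scale standard error.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of ln(OR) (Woolf's method).
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    or_, lo, hi = odds_ratio_ci(10, 20, 5, 40)
    print(f"OR={or_:.3f}, 95% CI={lo:.3f}-{hi:.3f}")
```

An interval that excludes 1, as in the three results quoted in the abstract, corresponds to a significant association at the chosen level.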
Multiconstrained gene clustering based on generalized projections

PubMed Central

2010-01-01

Background Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple constraints for an optimal clustering solution still remains an unsolved problem. Results We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints of different natures without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence in more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions The POCS-based MGC method can successfully combine multiple constraints of different natures for gene clustering. Also, the proposed GLL is an effective performance measure for the soft clustering solutions. PMID:20356386
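
The iterative projection idea described in this abstract can be sketched with classic POCS on two toy convex sets. This is not the authors' MGC method or their constraint sets: the box [0, 1]^n and the hyperplane sum(x) = 1 are assumptions chosen because together they describe a soft cluster-membership vector, and all function names are hypothetical.

```python
def project_box(x, lo=0.0, hi=1.0):
    """Project onto the box [lo, hi]^n by clipping each coordinate."""
    return [min(max(v, lo), hi) for v in x]


def project_sum_to_one(x):
    """Project onto the hyperplane sum(x) = 1 by shifting all coordinates equally."""
    shift = (1.0 - sum(x)) / len(x)
    return [v + shift for v in x]


def pocs(x, projections, iterations=200):
    """Cyclically apply each projection; for closed convex sets with a
    non-empty intersection the iterates approach a point in that intersection."""
    for _ in range(iterations):
        for project in projections:
            x = project(x)
    return x


if __name__ == "__main__":
    start = [2.0, -1.0, 0.5]  # hypothetical, infeasible membership vector
    result = pocs(start, [project_sum_to_one, project_box])
    print([round(v, 4) for v in result], "sum =", round(sum(result), 6))
```

Each constraint only needs its own projection operator, which is why the framework can combine constraints of different kinds without rewriting them into a single objective.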
Finding approximate gene clusters with Gecko 3.

PubMed

Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian

2016-11-16

Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
[Studies on the role of high pressure baroreceptors in vasopressin (ADH) secretion. Effect of occlusion of common carotid and vertebral arteries on blood ADH level (author's transl)].

PubMed

Matsuzaki, M

1977-08-20

The role of baroreceptors in common carotid and vertebral arteries and arteries in the thoracic cavity in vasopressin secretion was investigated in this study. Effects of bilateral occlusion of common carotid and vertebral arteries on blood ADH level as well as mean arterial pressure were studied in common carotid arterial plexus-denervated dogs, cervically vagotomized dogs and intact dogs. Blood ADH titers were determined by a bioassay technique before and 5 minutes after the occlusion of the arteries and were compared with the changes of mean arterial pressure (MAP). The following results were obtained. (1) Blood ADH titers and MAP were elevated by the occlusion of the common carotid arteries in both intact and vagotomized dogs, while they were not significantly affected in denervated dogs. Elevation of blood ADH titers was more pronounced in vagotomized dogs than in intact dogs. (2) Blood ADH titers and MAP were elevated by the occlusion of vertebral arteries in all groups of dogs. However, the elevation of blood ADH titers in denervated dogs was more pronounced than in intact dogs, but less than in vagotomized dogs. (3) The effects of the occlusion of common carotid arteries on blood ADH titers and MAP were more pronounced than those of the occlusion of vertebral arteries. These results may suggest that: a. baroreceptors involved in vasopressin secretion are present in vertebral arteries as well, and that b. the intrathoracic baroreceptors are dominant in controlling vasopressin secretion, while those in the common carotid arteries rank second and those in the vertebral arteries third.
The multifunctional isopropyl alcohol dehydrogenase of Phytomonas sp. could be the result of a horizontal gene transfer from a bacterium to the trypanosomatid lineage.

PubMed

Molinas, Sara M; Altabe, Silvia G; Opperdoes, Fred R; Rider, Mark H; Michels, Paul A M; Uttaro, Antonio D

2003-09-19

Isopropyl alcohol dehydrogenase (iPDH) is a dimeric mitochondrial alcohol dehydrogenase (ADH), so far detected within the Trypanosomatidae only in the genus Phytomonas. The cloning, sequencing, and heterologous expression of the two gene alleles of the enzyme revealed that it is a zinc-dependent medium-chain ADH. Both polypeptides have 361 amino acids. A mitochondrial targeting sequence was identified. The mature proteins each have 348 amino acids and a calculated molecular mass of 37 kDa. They differ only in one amino acid, which can explain the three isoenzymes and their respective isoelectric points previously found. A phylogenetic analysis locates iPDH within a cluster with fermentative ADHs from bacteria, sharing 74% similarity and 60% identity with Ralstonia eutropha ADH. The characterization of the two bacterially expressed Phytomonas enzymes and the comparison of their kinetic properties with those of the wild-type iPDH and of the R. eutropha ADH strongly support the idea of a horizontal gene transfer event from a bacterium to a trypanosomatid to explain the origin of the iPDH in Phytomonas. Phytomonas iPDH and R. eutropha ADH are able to use a wide range of substrates with similar Km values such as primary and secondary alcohols, diols, and aldehydes, as well as ketones such as acetone, diacetyl, and acetoin. We speculate that, as for R. eutropha ADH, Phytomonas iPDH acts as a safety valve for the release of excess reducing power.
Conversion events in gene clusters

PubMed Central

2011-01-01

Background: Gene clusters containing multiple similar genomic regions in close proximity are of great interest for biomedical studies because of their associations with inherited diseases. However, such regions are difficult to analyze due to their structural complexity and their complicated evolutionary histories, reflecting a variety of large-scale mutational events. In particular, conversion events can mislead inferences about the relationships among these regions, as traced by traditional methods such as construction of phylogenetic trees or multi-species alignments. Results: To correct the distorted information generated by such methods, we have developed an automated pipeline called CHAP (Cluster History Analysis Package) for detecting conversion events. We used this pipeline to analyze the conversion events that affected two well-studied gene clusters (α-globin and β-globin) and three gene clusters for which comparative sequence data were generated from seven primate species: CCL (chemokine ligand), IFN (interferon), and CYP2abf (part of cytochrome P450 family 2). CHAP is freely available at http://www.bx.psu.edu/miller_lab. Conclusions: These studies reveal the value of characterizing conversion events in the context of studying gene clusters in complex genomes. PMID:21798034
Mutational Analysis of the Adaptor Protein 2 Sigma Subunit (AP2S1) Gene: Search for Autosomal Dominant Hypocalcemia Type 3 (ADH3)

PubMed Central

Rogers, Angela; Nesbit, M. Andrew; Hannan, Fadil M.; Howles, Sarah A.; Gorvin, Caroline M.; Cranston, Treena; Allgrove, Jeremy; Bevan, John S.; Bano, Gul; Brain, Caroline; Datta, Vipan; Grossman, Ashley B.; Hodgson, Shirley V.; Izatt, Louise; Millar-Jones, Lynne; Pearce, Simon H.; Robertson, Lisa; Selby, Peter L.; Shine, Brian; Snape, Katie; Warner, Justin

2014-01-01

Context: Autosomal dominant hypocalcemia (ADH) types 1 and 2 are due to calcium-sensing receptor (CASR) and G-protein subunit-α11 (GNA11) gain-of-function mutations, respectively, whereas CASR and GNA11 loss-of-function mutations result in familial hypocalciuric hypercalcemia (FHH) types 1 and 2, respectively. Loss-of-function mutations of adaptor protein-2 sigma subunit (AP2σ2), encoded by AP2S1, cause FHH3, and we therefore sought gain-of-function AP2S1 mutations that may cause an additional form of ADH, which we designated ADH3. Objective: The objective of the study was to investigate the hypothesis that gain-of-function AP2S1 mutations may cause ADH3. Design: The sample size required for the detection of at least one mutation with a greater than 95% likelihood was determined by binomial probability analysis. Nineteen patients (including six familial cases) with hypocalcemia in association with low or normal serum PTH concentrations, consistent with ADH, but who did not have CASR or GNA11 mutations, were ascertained. Leukocyte DNA was used for sequence and copy number variation analysis of AP2S1. Results: Binomial probability analysis, using the assumption that AP2S1 mutations would occur in hypocalcemic patients at a prevalence of 20%, which is observed in FHH patients without CASR or GNA11 mutations, indicated that the likelihood of detecting at least one AP2S1 mutation was greater than 95% and greater than 98% in sample sizes of 14 and 19 hypocalcemic patients, respectively. AP2S1 mutations and copy number variations were not detected in the 19 hypocalcemic patients. Conclusion: The absence of AP2S1 abnormalities in hypocalcemic patients suggests that ADH3 may not occur or otherwise represents a rare hypocalcemic disorder. PMID:24708097
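The binomial calculation described in the Design and Results sections above can be reproduced with a short sketch. It assumes, as the abstract states, a 20% mutation prevalence, and additionally assumes that patients are independent. The function name `prob_at_least_one` is illustrative and not taken from the study.

```python
def prob_at_least_one(prevalence: float, n: int) -> float:
    """Probability that at least one of n independent patients carries a mutation."""
    # Complement of "no patient carries a mutation", i.e. 1 - (1 - p)^n.
    return 1.0 - (1.0 - prevalence) ** n


# Sample sizes quoted in the abstract, plus 13 to show where the 95% threshold is crossed.
for n in (13, 14, 19):
    print(n, round(prob_at_least_one(0.20, n), 3))
```

Under these assumptions the probability first exceeds 95% at 14 patients and exceeds 98% at 19, which is consistent with the figures quoted in the abstract.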
A multiple mediator analysis approach to quantify the effects of the ADH1B and ALDH2 genes on hepatocellular carcinoma risk.

PubMed

Shih, Stephannie; Huang, Yen-Tsung; Yang, Hwai-I

2018-06-01

Previous work suggested a genetic component affecting the risk of hepatocellular carcinoma (HCC) and mediation analyses have elucidated potential indirect pathways of these genetic effects. Specifically, the effects of alcohol dehydrogenase (ADH1B) and aldehyde dehydrogenase (ALDH2) genes on HCC risk vary based on alcohol consumption habits. However, alcohol consumption may not be the only mediator in the identified pathway: factors related to alcohol consumption may contribute to the same indirect pathway. Thus, we developed a multimediator model to quantify the genetic effects on HCC risk through sequential dichotomous mediators under the counterfactual framework. Our method provided a closed form formula for the mediation effects through different indirect paths, which requires no assumption for the rarity of outcome. In simulation studies of a finite sample, we presented the utility of the method with the variance of the effects estimated using the delta method and bootstrapping. We applied our method to data from participants in Taiwan (580 cases and 3,207 controls) and quantified the mediation effects of single nucleotide polymorphisms (SNPs) in the ADH1B and ALDH2 genes on HCC through alcohol consumption (yes/no) and high alanine transaminase (ALT) levels (greater than or equal to 45 U/L or below 45 U/L). Assuming a dominant risk model, we identified that the SNPs' effects on HCC risk through alcohol consumption are more significant than those through ALT levels. This new method provides insight into the magnitude of various causal mechanisms as a closed form solution and can be readily applied in other genomic studies. © 2018 WILEY PERIODICALS, INC.
Co-expression of TAL1 and ADH1 in recombinant xylose-fermenting Saccharomyces cerevisiae improves ethanol production from lignocellulosic hydrolysates in the presence of furfural.

PubMed

Hasunuma, Tomohisa; Ismail, Ku Syahidah Ku; Nambu, Yumiko; Kondo, Akihiko

2014-02-01

Lignocellulosic biomass dedicated to bioethanol production usually contains pentoses and inhibitory compounds such as furfural that are not well tolerated by Saccharomyces cerevisiae. Thus, S. cerevisiae strains with the capability of utilizing both glucose and xylose in the presence of inhibitors such as furfural are very important in industrial ethanol production. Under the synergistic conditions of transaldolase (TAL) and alcohol dehydrogenase (ADH) overexpression, S. cerevisiae MT8-1X/TAL-ADH was able to produce 1.3-fold and 2.3-fold more ethanol in the presence of 70 mM furfural than a TAL-expressing strain and a control strain, respectively. We also tested the strains' ability by mimicking industrial ethanol production from hemicellulosic hydrolysate containing fermentation inhibitors, and ethanol production was further improved by 16% when using MT8-1X/TAL-ADH compared to the control strain. Transcript analysis further revealed that besides the pentose phosphate pathway genes TKL1 and TAL1, ADH7 was also upregulated in response to furfural stress, which resulted in higher ethanol production compared to the TAL-expressing strain. The improved capability of our modified strain was based on its capacity to more quickly reduce furfural in situ resulting in higher ethanol production. The co-expression of TAL/ADH genes is one crucial strategy to fully utilize undetoxified lignocellulosic hydrolysate, leading to cost-competitive ethanol production. Copyright © 2013 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Suppression of ADH during water immersion in normal man. [antidiuretic hormone]

NASA Technical Reports Server (NTRS)

Epstein, M.; Pins, D. S.; Miller, M.

1975-01-01

A study was undertaken to ascertain whether diuresis induced by immersion is mediated by an inhibition of ADH. Immersion resulted in a progressive decrease in ADH excretion from 80.1 ± 7 (SEM) to 37.3 ± 6.3 microU/min (P < 0.025). Cessation of immersion was associated with a marked increase in ADH from 37.3 ± 6.3 microU/min to 176.6 ± 72.6 microU/min during the recovery hour (P < 0.05). Concomitant with these changes, urine osmolality decreased significantly beginning as early as the initial hour of immersion from 1044 ± 36 to 542 ± 66 mosmol/kg H2O during the final hour of immersion (P < 0.001). These findings are consistent with the earlier suggestion that suppression of ADH release contributes to enhanced free water clearance in hydrated subjects undergoing immersion.
ADH1B Arg47His polymorphism is associated with esophageal cancer risk in high-incidence Asian population: evidence from a meta-analysis.

PubMed

Zhang, Guohong; Mai, Ruiqin; Huang, Bo

2010-10-27

Esophageal squamous cell carcinoma (ESCC) is prevalent in Asian populations, especially in those from the "Asian esophageal cancer belt" along the Silk Road and those from East Asia (including Japan). Silk Road and Eastern Asia population genetics are relevant to the ancient population migration from central China. The frequency of the Arg47His (rs1229984) polymorphism of ADH1B is highest in East Asians, and ancient migrations along the Silk Road are thought to have contributed to the high frequency of the ADH1B*47His allele in Central Asians. This polymorphism was identified as responsible for susceptibility in the first large-scale genome-wide association study of ESCC, which is explained by its modulation of alcohol oxidization capability. To investigate the association of ADH1B Arg47His with ESCC in Asian populations under a common ancestry scenario of the susceptibility loci, we combined all available studies into a meta-analysis. A dataset composed of 4,220 cases and 8,946 controls from twelve studies of Asian populations was analyzed for ADH1B Arg47His association with ESCC and its interactions with alcohol drinking and ALDH2 Glu504Lys. Heterogeneity among studies and their publication bias were also tested. The ADH1B*47Arg allele was found to be associated with increased risk of ESCC, with the odds ratios (OR) being 1.62 (95% CI: 1.49-1.76) and 3.86 (2.96-5.03) for the His/Arg and the Arg/Arg genotypes, respectively. When compared with the His/His genotype of non-drinkers, the Arg/Arg genotype can interact with alcohol drinking and greatly increase the risk of ESCC (OR = 20.69, 95% CI: 5.09-84.13). Statistical tests also showed that gene-gene interaction of ADH1B Arg+ with ALDH2 Lys+ further increases the risk of ESCC (OR = 13.46, 95% CI: 2.32-78.07). As revealed by this meta-analysis, ADH1B*47Arg as a common ancestral allele can significantly increase the risk of ESCC in Asians, especially when coupled with alcohol drinking or the ALDH2*504Lys allele.
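As an illustration of how odds ratios with 95% confidence intervals, such as those reported above, can be pooled across studies, here is a minimal sketch. The abstract does not say which pooling model was used; fixed-effect inverse-variance weighting of log odds ratios is an assumption made here, and the 2x2 counts are hypothetical, not data from the twelve studies.

```python
import math


def pooled_odds_ratio(tables, z=1.96):
    """Fixed-effect inverse-variance pooling of log odds ratios.

    Each table is (exposed_cases, exposed_controls, unexposed_cases, unexposed_controls).
    Returns (pooled OR, lower 95% bound, upper 95% bound).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        variance = 1 / a + 1 / b + 1 / c + 1 / d  # Woolf variance of the log OR
        weight = 1 / variance
        weighted_sum += weight * log_or
        weight_total += weight
    pooled_log_or = weighted_sum / weight_total
    se = math.sqrt(1 / weight_total)
    return (
        math.exp(pooled_log_or),
        math.exp(pooled_log_or - z * se),
        math.exp(pooled_log_or + z * se),
    )


# Hypothetical counts for two studies (illustration only).
studies = [(30, 10, 20, 40), (45, 20, 25, 50)]
print(pooled_odds_ratio(studies))
```

With a single table the function reduces to the usual Woolf interval for one study. A random-effects model would be the natural alternative when heterogeneity among studies is substantial.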
Finding gene clusters for a replicated time course study

PubMed Central

2014-01-01

Background: Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. Findings: In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. Conclusions: The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism. PMID:24460656

Constrained clusters of gene expression profiles with pathological features.

PubMed

Sese, Jun; Kurokawa, Yukinori; Monden, Morito; Kato, Kikuya; Morishita, Shinichi

2004-11-22

Gene expression profiles should be useful in distinguishing variations in disease, since they reflect accurately the status of cells. The primary clustering of gene expression reveals the genotypes that are responsible for the proximity of members within each cluster, while further clustering elucidates the pathological features of the individual members of each cluster. However, since the first clustering process and the second classification step, in which the features are associated with clusters, are performed independently, the initial set of clusters may omit genes that are associated with pathologically meaningful features. Therefore, it is important to devise a way of identifying gene expression clusters that are associated with pathological features. We present the novel technique of 'itemset constrained clustering' (IC-Clustering), which computes the optimal cluster that maximizes the interclass variance of gene expression between groups, which are divided according to the restriction that only divisions that can be expressed using common features are allowed. This constraint automatically labels each cluster with a set of pathological features which characterize that cluster. When applied to liver cancer datasets, IC-Clustering revealed informative gene expression clusters, which could be annotated with various pathological features, such as 'tumor' and 'man', or 'except tumor' and 'normal liver function'. In contrast, the k-means method overlooked these clusters.
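The idea of restricting splits to those expressible by shared features, and scoring each split by interclass variance, can be sketched as follows. This is a brute-force illustration, not the IC-Clustering algorithm itself: the interclass-variance definition (group size times squared distance of the group mean from the overall mean), the two-way split, the itemset size limit, and the toy samples are all assumptions made here.

```python
from itertools import combinations


def interclass_variance(groups):
    """Sum over groups of size * squared distance between group mean and overall mean."""
    all_rows = [row for group in groups for row in group]
    dim = len(all_rows[0])
    overall = [sum(r[d] for r in all_rows) / len(all_rows) for d in range(dim)]
    total = 0.0
    for group in groups:
        mean = [sum(r[d] for r in group) / len(group) for d in range(dim)]
        total += len(group) * sum((mean[d] - overall[d]) ** 2 for d in range(dim))
    return total


def best_itemset_split(samples, max_items=2):
    """samples: list of (feature_set, expression_vector) pairs.

    Tries every itemset of up to max_items features, splits the samples into
    those having all features in the itemset and the rest, and returns the
    (itemset, score) pair with the highest interclass variance.
    """
    features = sorted(set().union(*(f for f, _ in samples)))
    best = (None, -1.0)
    for size in range(1, max_items + 1):
        for itemset in combinations(features, size):
            required = set(itemset)
            inside = [x for f, x in samples if required <= f]
            outside = [x for f, x in samples if not required <= f]
            if not inside or not outside:
                continue  # skip splits that leave one side empty
            score = interclass_variance([inside, outside])
            if score > best[1]:
                best = (itemset, score)
    return best


# Hypothetical samples: clinical features paired with a two-gene expression vector.
toy_samples = [
    ({"tumor", "male"}, [5.0, 1.0]),
    ({"tumor"}, [5.2, 0.8]),
    ({"male"}, [1.0, 1.1]),
    (set(), [0.9, 0.9]),
]
best_itemset, best_score = best_itemset_split(toy_samples)
print(best_itemset, round(best_score, 2))
```

Because each candidate cluster is defined by an itemset, the winning split carries its own label, which mirrors the automatic annotation described in the abstract.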
In vitro expression of Candida albicans alcohol dehydrogenase genes involved in acetaldehyde metabolism.

PubMed

Bakri, M M; Rich, A M; Cannon, R D; Holmes, A R

2015-02-01

Alcohol consumption is a risk factor for oral cancer, possibly via its conversion to acetaldehyde, a known carcinogen. The oral commensal yeast Candida albicans may be one of the agents responsible for this conversion intra-orally. The alcohol dehydrogenase (Adh) family of enzymes is involved in acetaldehyde metabolism in yeast but for C. albicans it is not known which family member is responsible for the conversion of ethanol to acetaldehyde. In this study we determined the expression of mRNAs from three C. albicans Adh genes (CaADH1, CaADH2 and CaADH3) for cells grown in different culture media at different growth phases by Northern blot analysis and quantitative reverse transcription polymerase chain reaction. CaADH1 was constitutively expressed under all growth conditions but there was differential expression of CaADH2. CaADH3 expression was not detected. To investigate whether CaAdh1p or CaAdh2p can contribute to alcohol catabolism in C. albicans, each gene from the reference strain C. albicans SC5314 was expressed in Saccharomyces cerevisiae. Cell extracts from a CaAdh1p-expressing S. cerevisiae recombinant, but not a CaAdh2p-expressing recombinant or an empty vector control strain, possessed ethanol-utilizing Adh activity above endogenous S. cerevisiae activity. Furthermore, expression of C. albicans Adh1p in a recombinant S. cerevisiae strain in which the endogenous ScADH2 gene (known to convert ethanol to acetaldehyde in this yeast) had been deleted conferred an NAD-dependent ethanol-utilizing, and so acetaldehyde-producing, Adh activity. We conclude that CaAdh1p is the enzyme responsible for ethanol use under in vitro growth conditions, and may contribute to the intra-oral production of acetaldehyde. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Low ergosterol content in yeast adh1 mutant enhances chitin maldistribution and sensitivity to paraquat-induced oxidative stress.

PubMed

Marisco, G; Saito, S T; Ganda, I S; Brendel, M; Pungartnik, C

2011-05-01

Alcohol dehydrogenases catalyse the reversible oxidation of alcohols to aldehydes or ketones, with concomitant reduction of NAD+ or NADP+. Adh1p is responsible for the reduction of acetaldehyde to ethanol, while Adh2p catalyses the reverse reaction, the oxidation of ethanol to acetaldehyde. Lack of Adh1p shifts the cellular redox balance towards excess NADH/NADPH and acetaldehyde, while absence of Adh2p does the opposite. Yeast mutant adh1Δ had a slow growth rate, whereas adh2Δ grew like the isogenic wild-type (WT) during prediauxic shift fermentative metabolism. After 48 h WT and mutants reached the same number of viable cells. When exponentially growing (LOG) cells were exposed to calcofluor white, only mutant adh1Δ displayed an irregular deposition of chitin. Quantitative analyses of both LOG and stationary-phase cells showed that adh1Δ mutant contained significantly less ergosterol than cells of WT and adh2Δ mutant, whereas the erg3Δ mutant contained extremely low ergosterol pools. Both adh1Δ and adh2Δ mutants showed higher-than-WT resistance to heat shock and to H2O2 but had WT resistance when exposed to ultraviolet (UV) light and the DNA cross-linking agent diepoxyoctane, indicating normal DNA repair capacity. Mutant adh1Δ was specifically sensitive to acetaldehyde and to membrane peroxidizing paraquat. Our results link the pleiotropic phenotype of adh1Δ mutants to low pools of ergosterol and to reductive stress, and introduce the two new phenotypes, resistance to heat shock and to H2O2, for the adh2Δ mutant, most probably related to increased ROS production in mitochondria, which leads to the induction of oxidative stress protection. Copyright © 2011 John Wiley & Sons, Ltd.
Bioethanol production by heterologous expression of Pdc and AdhII in Streptomyces lividans.

PubMed

Lee, Jae Sun; Chi, Won-Jae; Hong, Soon-Kwang; Yang, Ji-Won; Chang, Yong Keun

2013-07-01

Two genes from Zymomonas mobilis that are responsible for ethanol production, pyruvate decarboxylase (pdc) and alcohol dehydrogenase II (adhII), were heterologously expressed in the Gram-positive bacterium Streptomyces lividans TK24. An examination of carbon distribution revealed that a significant portion of carbon metabolism was switched from biomass and organic acid biosynthesis to ethanol production upon the expression of pdc and adhII. The recombinant S. lividans TK24 produced ethanol from glucose with a yield of 23.7% based on the carbohydrate consumed. The recombinant was able to produce ethanol from xylose, L-arabinose, mannose, L-rhamnose, galactose, ribose, and cellobiose with yields of 16.0, 25.6, 21.5, 33.6, 30.6, 14.6, and 33.3%, respectively. Polymeric substances such as starch and xylan were directly converted to ethanol by the recombinant with ethanol yields of 18.9 and 8.8%, respectively. The recombinant S. lividans TK24/Tpet developed in this study is potentially a useful microbial resource for ethanol production from various sources of biomasses, especially microalgae.
Genomic analyses of bacterial porin-cytochrome gene clusters

DOE PAGES

Shi, Liang; Fredrickson, James K.; Zachara, John M.

2014-11-26

The porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of the genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additional microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteria from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular electron transfer reactions with substrates other than Fe(III) and Mn(IV) oxides.
Alcohol dehydrogenase AdhA plays a role in ethanol tolerance in model cyanobacterium Synechocystis sp. PCC 6803.

PubMed

Vidal, Rebeca

2017-04-01

The protein AdhA from the cyanobacterium Synechocystis sp. PCC 6803 (hereafter Synechocystis) has been previously reported to show alcohol dehydrogenase activity towards ethanol and both NAD and NADP. This protein is currently being used in genetically modified strains of Synechocystis capable of synthesizing ethanol showing the highest ethanol productivities. In the present work, mutant strains of Synechocystis lacking AdhA have been constructed and tested for tolerance to ethanol. The lack of AdhA in the wild-type strain reduces survival to externally added ethanol at lethal concentration of 4% (v/v). On the other hand, the lack of AdhA in an ethanologenic strain diminishes tolerance of cells to internally produced ethanol. It is also shown that light-activated heterotrophic growth (LAHG) of the wild-type strain is impaired in the mutant strain lacking AdhA (∆adhA strain). Photoautotrophic, mixotrophic, and photoheterotrophic growth are not affected in the mutant strain. Based on phenotypic characterization of ∆adhA mutants, the possible physiological function of AdhA in Synechocystis is discussed.
Prokaryotic Gene Clusters: A Rich Toolbox for Synthetic Biology

PubMed Central

Fischbach, Michael; Voigt, Christopher A.

2014-01-01

Bacteria construct elaborate nanostructures, obtain nutrients and energy from diverse sources, synthesize complex molecules, and implement signal processing to react to their environment. These complex phenotypes require the coordinated action of multiple genes, which are often encoded in a contiguous region of the genome, referred to as a gene cluster. Gene clusters sometimes contain all of the genes necessary and sufficient for a particular function. As an evolutionary mechanism, gene clusters facilitate the horizontal transfer of the complete function between species. Here, we review recent work on a number of clusters whose functions are relevant to biotechnology. Engineering these clusters has been hindered by their regulatory complexity, the need to balance the expression of many genes, and a lack of tools to design and manipulate DNA at this scale. Advances in synthetic biology will enable the large-scale bottom-up engineering of the clusters to optimize their functions, wake up cryptic clusters, or to transfer them between organisms. Understanding and manipulating gene clusters will move towards an era of genome engineering, where multiple functions can be “mixed-and-matched” to create a designer organism. PMID:21154668
The joint effects of ADH1B variants and childhood adversity on alcohol related phenotypes in African-American and European-American women and men.

PubMed

Sartor, Carolyn E; Wang, Zuoheng; Xu, Ke; Kranzler, Henry R; Gelernter, Joel

2014-12-01

The ADH1B gene has consistently been implicated in problem drinking, but rarely incorporated into gene by environment investigations of alcohol phenotypes. This study examined the joint effects of variation in ADH1B and childhood adversity (a well-documented risk factor for alcohol problems and moderator of genetic liability to psychiatric outcomes) on maximum drinks consumed in a 24-hour period (maxdrinks) and alcohol use disorder (AUD) symptoms. Data were drawn from 2,617 African-American (AA) and 1,436 European-American (EA) participants (42% female) in a multisite genetic study of substance dependence. We tested the most significant ADH1B single nucleotide polymorphisms for alcohol dependence from a genomewide association study with this sample, ADH1B-rs1229984 (Arg48His) and ADH1B-rs2066702 (Arg370Cys), in EA and AA subsamples, respectively. Ordinal regression analyses conducted separately by sex and population revealed significant main effects for childhood adversity for both alcohol phenotypes in AA women and men and for maxdrinks in EA women. A significant rs1229984 by childhood adversity interaction was observed for AUD symptoms in EA men. Unexposed His-allele carriers reported a mean of 3.6 AUD criteria, but adversity-exposed His-allele carriers endorsed approximately the same number (6.3) as those without the protective allele (6.3 and 7.0 for adversity-exposed and -unexposed groups, respectively). Results suggest that under conditions of childhood adversity, the His allele does not exert its protective effects in EA men (OR = 0.57, CI: 0.32 to 1.01; p = 0.056). Findings highlight the robust risk effect conferred by childhood adversity and the importance of considering population and sex in genetically informative investigations of its association with alcohol outcomes. Copyright © 2014 by the Research Society on Alcoholism.
Gene Cluster Encoding Cholate Catabolism in Rhodococcus spp.

PubMed Central

Wilbrink, Maarten H.; Casabon, Israël; Stewart, Gordon R.; Liu, Jie; van der Geize, Robert; Eltis, Lindsay D.

2012-01-01

Bile acids are highly abundant steroids with important functions in vertebrate digestion. Their catabolism by bacteria is an important component of the carbon cycle, contributes to gut ecology, and has potential commercial applications. We found that Rhodococcus jostii RHA1 grows well on cholate, as well as on its conjugates, taurocholate and glycocholate. The transcriptome of RHA1 growing on cholate revealed 39 genes upregulated on cholate, occurring in a single gene cluster. Reverse transcriptase quantitative PCR confirmed that selected genes in the cluster were upregulated 10-fold on cholate versus on cholesterol. One of these genes, kshA3, encoding a putative 3-ketosteroid-9α-hydroxylase, was deleted and found essential for growth on cholate. Two coenzyme A (CoA) synthetases encoded in the cluster, CasG and CasI, were heterologously expressed. CasG was shown to transform cholate to cholyl-CoA, thus initiating side chain degradation. CasI was shown to form CoA derivatives of steroids with isopropanoyl side chains, likely occurring as degradation intermediates. Orthologous gene clusters were identified in all available Rhodococcus genomes, as well as that of Thermomonospora curvata. Moreover, Rhodococcus equi 103S, Rhodococcus ruber Chol-4 and Rhodococcus erythropolis SQ1 each grew on cholate. In contrast, several mycolic acid bacteria lacking the gene cluster were unable to grow on cholate. Our results demonstrate that the above-mentioned gene cluster encodes cholate catabolism and is distinct from a more widely occurring gene cluster encoding cholesterol catabolism. PMID:23024343
Evidence for the bacterial origin of genes encoding fermentation enzymes of the amitochondriate protozoan parasite Entamoeba histolytica.

PubMed

Rosenthal, B; Mai, Z; Caplivski, D; Ghosh, S; de la Vega, H; Graf, T; Samuelson, J

1997-06-01

Entamoeba histolytica is an amitochondriate protozoan parasite with numerous bacterium-like fermentation enzymes including the pyruvate:ferredoxin oxidoreductase (POR), ferredoxin (FD), and alcohol dehydrogenase E (ADHE). The goal of this study was to determine whether the genes encoding these cytosolic E. histolytica fermentation enzymes might derive from a bacterium by horizontal transfer, as has previously been suggested for E. histolytica genes encoding heat shock protein 60, nicotinamide nucleotide transhydrogenase, and superoxide dismutase. In this study, the E. histolytica por gene and the adhE gene of a second amitochondriate protozoan parasite, Giardia lamblia, were sequenced, and their phylogenetic positions were estimated in relation to POR, ADHE, and FD cloned from eukaryotic and eubacterial organisms. The E. histolytica por gene encodes a 1,620-amino-acid peptide that contained conserved iron-sulfur- and thiamine pyrophosphate-binding sites. The predicted E. histolytica POR showed fewer positional identities to the POR of G. lamblia (34%) than to the POR of the enterobacterium Klebsiella pneumoniae (49%), the cyanobacterium Anabaena sp. (44%), and the protozoan Trichomonas vaginalis (46%), which targets its POR to anaerobic organelles called hydrogenosomes. Maximum-likelihood, neighbor-joining, and parsimony analyses also suggested as less likely E. histolytica POR sharing more recent common ancestry with G. lamblia POR than with POR of bacteria and the T. vaginalis hydrogenosome. The G. lamblia adhE encodes an 888-amino-acid fusion peptide with an aldehyde dehydrogenase at its amino half and an iron-dependent (class 3) ADH at its carboxy half. The predicted G. lamblia ADHE showed extensive positional identities to ADHE of Escherichia coli (49%), Clostridium acetobutylicum (44%), and E. histolytica (43%) and lesser identities to the class 3 ADH of eubacteria and yeast (19 to 36%). Phylogenetic analyses inferred a closer relationship of the E
Clustering Algorithms: Their Application to Gene Expression Data

PubMed Central

Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

2016-01-01

Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data offers a substantial opportunity to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpreting the resulting mass of data, which consists of millions of measurements; these data also exhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and a high degree of accuracy in its analysis procedure. PMID:27932867
CORM: An R Package Implementing the Clustering of Regression Models Method for Gene Clustering

PubMed Central

Shi, Jiejun; Qin, Li-Xuan

2014-01-01

We report a new R package implementing the clustering of regression models (CORM) method for clustering genes using gene expression data and provide data examples illustrating each clustering function in the package. The CORM package is freely available at CRAN from http://cran.r-project.org. PMID:25452684
Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters.

PubMed

Lukashin, A V; Fuchs, R

2001-05-01

Cluster analysis of genome-wide expression data from DNA microarray hybridization studies has proved to be a useful tool for identifying biologically relevant groupings of genes and samples. In the present paper, we focus on several important issues related to clustering algorithms that have not yet been fully studied. We describe a simple and robust algorithm for the clustering of temporal gene expression profiles that is based on the simulated annealing procedure. In general, this algorithm is guaranteed to eventually find the globally optimal distribution of genes over clusters. We introduce an iterative scheme that serves to evaluate quantitatively the optimal number of clusters for each specific data set. The scheme is based on standard approaches used in regular statistical tests. The basic idea is to organize the search for the optimal number of clusters simultaneously with the optimization of the distribution of genes over clusters. The efficiency of the proposed algorithm has been evaluated by means of a reverse engineering experiment, that is, a situation in which the correct distribution of genes over clusters is known a priori. This statistically rigorous test has shown that our algorithm places more than 90% of genes into the correct clusters. Finally, the algorithm has been tested on real gene expression data (expression changes during yeast cell cycle) for which the fundamental patterns of gene expression and the assignment of genes to clusters are well understood from numerous previous studies.
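The annealing idea in the abstract above can be illustrated with a small sketch. Everything specific here is an assumption made for illustration and is not taken from Lukashin and Fuchs: the cost function (within-cluster sum of squared distances to the centroid), the geometric cooling schedule, the parameter values, and the function names. The number of clusters is fixed by hand, so the paper's scheme for choosing it is not covered.

```python
import math
import random


def within_cluster_cost(profiles, labels, k):
    """Sum of squared distances from each profile to its cluster centroid."""
    dim = len(profiles[0])
    total = 0.0
    for c in range(k):
        members = [p for p, lab in zip(profiles, labels) if lab == c]
        if not members:
            continue
        centroid = [sum(m[d] for m in members) / len(members) for d in range(dim)]
        total += sum((m[d] - centroid[d]) ** 2 for m in members for d in range(dim))
    return total


def anneal_clusters(profiles, k, t_start=1.0, t_end=1e-3, cooling=0.95,
                    moves_per_temp=200, seed=0):
    """Assign profiles to k clusters by simulated annealing (illustrative only)."""
    rng = random.Random(seed)
    labels = [rng.randrange(k) for _ in profiles]
    cost = within_cluster_cost(profiles, labels, k)
    best_labels, best_cost = labels[:], cost
    t = t_start
    while t > t_end:
        for _ in range(moves_per_temp):
            i = rng.randrange(len(profiles))
            old, new = labels[i], rng.randrange(k)
            if new == old:
                continue
            labels[i] = new
            new_cost = within_cluster_cost(profiles, labels, k)
            delta = new_cost - cost
            # Metropolis rule: always accept improvements, sometimes accept worse moves.
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                cost = new_cost
                if cost < best_cost:
                    best_labels, best_cost = labels[:], cost
            else:
                labels[i] = old      # reject the move
        t *= cooling                 # geometric cooling (assumed schedule)
    return best_labels, best_cost


# Toy temporal profiles: three rising and three falling genes (made-up numbers).
profiles = [
    [0.0, 1.0, 2.0], [0.1, 1.1, 2.1], [0.0, 0.9, 1.9],
    [2.0, 1.0, 0.0], [2.1, 1.0, 0.1], [1.9, 1.1, 0.0],
]
labels, cost = anneal_clusters(profiles, k=2)
```

On this toy input the rising and falling profiles end up in separate clusters. With a finite cooling schedule like this one, the search is a heuristic; reaching the global optimum is only assured in the limit of sufficiently slow cooling.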
Fast gene ontology based clustering for microarray experiments.

PubMed

Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

2008-11-21

Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
Comparative genomics of ParaHox clusters of teleost fishes: gene cluster breakup and the retention of gene sets following whole genome duplications

PubMed Central

Siegel, Nicol; Hoegg, Simone; Salzburger, Walter; Braasch, Ingo; Meyer, Axel

2007-01-01

Background The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes in extant teleost fishes do not differ in number from the genomic situation in mammals, despite the additional genome duplication, but they are distributed over twice as many paralogous regions in fish genomes. Results We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seem to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motifs of the genes belonging to the ParaHox paralogons. Conclusion There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular – but possibly clusters of genes more generally – might be linked to the presence of promoter, enhancer or inhibitor
Fractal Clustering and Knowledge-driven Validation Assessment for Gene Expression Profiling.

PubMed

Wang, Lu-Yong; Balasubramanian, Ammaiappan; Chakraborty, Amit; Comaniciu, Dorin

2005-01-01

DNA microarray experiments generate a substantial amount of information about global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. A number of clustering techniques have been introduced. However, these traditional methods mainly rely on shape-based assumptions or some distance metric to cluster the points in multi-dimensional linear Euclidean space. Their results show poor consistency with the functional annotation of genes in a previous validation study. From a different perspective, we propose a fractal clustering method that clusters genes using the intrinsic (fractal) dimension from modern geometry. This method clusters points in such a way that points in the same cluster are more self-affine among themselves than to the points in other clusters. We assess this method using annotation-based validation assessment for gene clusters. The assessment shows that this method is superior to other traditional methods in identifying functionally related gene groups.
Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis

PubMed Central

Koh, Esther G. L.; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V.; Brenner, Sydney; Venkatesh, Byrappa

2003-01-01

The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes. PMID:12547909
Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis.

PubMed

Koh, Esther G L; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V; Brenner, Sydney; Venkatesh, Byrappa

2003-02-04

The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes.
Conditions for the Evolution of Gene Clusters in Bacterial Genomes

PubMed Central

Ballouz, Sara; Francis, Andrew R.; Lan, Ruiting; Tanaka, Mark M.

2010-01-01

Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model), genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters. PMID:20168992
Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

PubMed Central

Poole, William; Leinonen, Kalle; Shmulevich, Ilya

2017-01-01

Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390

Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression.

PubMed

Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady

2017-02-01

Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.
A tripartite clustering analysis on microRNA, gene and disease model.

PubMed

Shen, Chengcheng; Liu, Ying

2012-02-01

Alteration of gene expression in response to regulatory molecules or mutations could lead to different diseases. MicroRNAs (miRNAs) have been discovered to be involved in regulation of gene expression and a wide variety of diseases. In a tripartite biological network of human miRNAs, their predicted target genes and the diseases caused by altered expressions of these genes, valuable knowledge about the pathogenicity of miRNAs, involved genes and related disease classes can be revealed by co-clustering miRNAs, target genes and diseases simultaneously. Tripartite co-clustering can lead to more informative results than traditional co-clustering with only two kinds of members and pass the hidden relational information along the relation chain by considering multi-type members. Here we report a spectral co-clustering algorithm for k-partite graph to find clusters with heterogeneous members. We use the method to explore the potential relationships among miRNAs, genes and diseases. The clusters obtained from the algorithm have significantly higher density than randomly selected clusters, which means members in the same cluster are more likely to have common connections. Results also show that miRNAs in the same family based on the hairpin sequences tend to belong to the same cluster. We also validate the clustering results by checking the correlation of enriched gene functions and disease classes in the same cluster. Finally, widely studied miR-17-92 and its paralogs are analyzed as a case study to reveal that genes and diseases co-clustered with the miRNAs are in accordance with current research findings.
An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region.

PubMed Central

Ashburner, M; Misra, S; Roote, J; Lewis, S E; Blazej, R; Davis, T; Doyle, C; Galle, R; George, R; Harris, N; Hartzell, G; Harvey, D; Hong, L; Houston, K; Hoskins, R; Johnson, G; Martin, C; Moshrefi, A; Palazzolo, M; Reese, M G; Spradling, A; Tsang, G; Wan, K; Whitelaw, K; Celniker, S

1999-01-01

A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanogaster has been sequenced from a series of overlapping P1 and BAC clones. This region covers 69 chromosome polytene bands on chromosome arm 2L, including the genetically well-characterized "Adh region." A computational analysis of the sequence predicts 218 protein-coding genes, 11 tRNAs, and 17 transposable element sequences. At least 38 of the protein-coding genes are arranged in clusters of from 2 to 6 closely related genes, suggesting extensive tandem duplication. The gene density is one protein-coding gene every 13 kb; the transposable element density is one element every 171 kb. Of 73 genes in this region identified by genetic analysis, 49 have been located on the sequence; P-element insertions have been mapped to 43 genes. Ninety-five (44%) of the known and predicted genes match a Drosophila EST, and 144 (66%) have clear similarities to proteins in other organisms. Genes known to have mutant phenotypes are more likely to be represented in cDNA libraries, and far more likely to have products similar to proteins of other organisms, than are genes with no known mutant phenotype. Over 650 chromosome aberration breakpoints map to this chromosome region, and their nonrandom distribution on the genetic map reflects variation in gene spacing on the DNA. This is the first large-scale analysis of the genome of D. melanogaster at the sequence level. In addition to the direct results obtained, this analysis has allowed us to develop and test methods that will be needed to interpret the complete sequence of the genome of this species. "Before beginning a Hunt, it is wise to ask someone what you are looking for before you begin looking for it." (Milne 1926) PMID:10471707
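The densities and percentages quoted in this abstract can be checked with a few lines of arithmetic. The sketch below assumes a region length of 2.9 Mb, taken from the record title (the abstract itself says "nearly 3 Mb"), and that the percentages are relative to the 218 predicted protein-coding genes; the variable names are illustrative.

```python
region_kb = 2900               # 2.9 Mb from the record title, in kilobases (assumed exact)
protein_coding_genes = 218
transposable_elements = 17

kb_per_gene = region_kb / protein_coding_genes        # about 13.3 kb per gene
kb_per_element = region_kb / transposable_elements    # about 170.6 kb per element

est_match_pct = 100 * 95 / protein_coding_genes       # genes matching a Drosophila EST
similar_pct = 100 * 144 / protein_coding_genes        # genes with clear protein similarities

print(f"{kb_per_gene:.1f} kb per gene, {kb_per_element:.1f} kb per element")
print(f"{est_match_pct:.1f}% EST matches, {similar_pct:.1f}% with similarities")
```

Rounded to whole numbers, these values agree with the figures in the abstract (13 kb, 171 kb, 44%, and 66%).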
Hemodynamic and ADH responses to central blood volume shifts in cardiac-denervated humans

NASA Technical Reports Server (NTRS)

Convertino, V. A.; Thompson, C. A.; Benjamin, B. A.; Keil, L. C.; Savin, W. M.; Gordon, E. P.; Haskell, W. L.; Schroeder, J. S.; Sandler, H.

1990-01-01

Hemodynamic responses and antidiuretic hormone (ADH) were measured during body position changes designed to induce blood volume shifts in ten cardiac transplant recipients to assess the contribution of cardiac and vascular volume receptors in the control of ADH secretion. Each subject underwent 15 min of a control period in the seated posture, then assumed a lying posture for 30 min at 6 deg head down tilt (HDT) followed by 20 min of seated recovery. Venous blood samples and cardiac dimensions (echocardiography) were taken at 0 and 15 min before HDT, 5, 15, and 30 min of HDT, and 5, 15, and 30 min of seated recovery. Blood samples were analyzed for hematocrit, plasma osmolality, plasma renin activity (PRA), and ADH. Resting plasma volume (PV) was measured by Evans blue dye and percent changes in PV during posture changes were calculated from changes in hematocrit. Heart rate (HR) and blood pressure (BP) were recorded every 2 min. Results indicate that cardiac volume receptors are not the only mechanism for the control of ADH release during acute blood volume shifts in man.
Genes encoding cuticular proteins are components of the Nimrod gene cluster in Drosophila.

PubMed

Cinege, Gyöngyi; Zsámboki, János; Vidal-Quadras, Maite; Uv, Anne; Csordás, Gábor; Honti, Viktor; Gábor, Erika; Hegedűs, Zoltán; Varga, Gergely I B; Kovács, Attila L; Juhász, Gábor; Williams, Michael J; Andó, István; Kurucz, Éva

2017-08-01

The Nimrod gene cluster, located on the second chromosome of Drosophila melanogaster, is the largest syntenic unit of the Drosophila genome. Nimrod genes show blood cell specific expression and code for phagocytosis receptors that play a major role in fruit fly innate immune functions. We previously identified three homologous genes (vajk-1, vajk-2 and vajk-3) located within the Nimrod cluster, which are unrelated to the Nimrod genes, but are homologous to a fourth gene (vajk-4) located outside the cluster. Here we show that, unlike the Nimrod candidates, the Vajk proteins are expressed in cuticular structures of the late embryo and the late pupa, indicating that they contribute to cuticular barrier functions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Identification of lethal cluster of genes in the yeast transcription network

NASA Astrophysics Data System (ADS)

Rho, K.; Jeong, H.; Kahng, B.

2006-05-01

Identification of essential or lethal genes would be one of the ultimate goals in drug designs. Here we introduce an in silico method to select the cluster with a high population of lethal genes, called lethal cluster, through microarray assay. We construct a gene transcription network based on the microarray expression level. Links are added one by one in the descending order of the Pearson correlation coefficients between two genes. As the link density p increases, two meaningful link densities pm and ps are observed. At pm, which is smaller than the percolation threshold, the number of disconnected clusters is maximum, and the lethal genes are highly concentrated in a certain cluster that needs to be identified. Thus the deletion of all genes in that cluster could efficiently lead to a lethal inviable mutant. This lethal cluster can be identified by an in silico method. As p increases further beyond the percolation threshold, the power law behavior in the degree distribution of a giant cluster appears at ps. We measure the degree of each gene at ps. With the information pertaining to the degrees of each gene at ps, we return to the point pm and calculate the mean degree of genes of each cluster. We find that the lethal cluster has the largest mean degree.
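The link-adding procedure described in this abstract can be sketched with a union-find structure. This is only an illustration under stated assumptions: a "cluster" is counted only when it contains at least two genes, the expression values are invented, and the function names are hypothetical. The sketch stops at locating the link count where the number of clusters peaks (the analogue of pm); it does not compute ps or the mean degrees used in the paper.

```python
import itertools
import math


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def find(parent, i):
    """Union-find root lookup with path halving."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def cluster_counts(expression):
    """Add links in descending Pearson order; record the multi-gene cluster count."""
    genes = list(expression)
    pairs = sorted(
        itertools.combinations(range(len(genes)), 2),
        key=lambda ij: pearson(expression[genes[ij[0]]], expression[genes[ij[1]]]),
        reverse=True,
    )
    parent = list(range(len(genes)))
    size = [1] * len(genes)
    n_clusters = 0              # clusters with at least two genes (assumed definition)
    counts = []
    for i, j in pairs:
        ri, rj = find(parent, i), find(parent, j)
        if ri != rj:
            if size[ri] == 1 and size[rj] == 1:
                n_clusters += 1     # two isolated genes form a new cluster
            elif size[ri] > 1 and size[rj] > 1:
                n_clusters -= 1     # two existing clusters merge
            parent[ri] = rj
            size[rj] += size[ri]
        counts.append(n_clusters)
    return counts


# Invented expression levels: g1/g2 co-vary, g3/g4 co-vary.
expression = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [1.1, 2.1, 2.9, 4.2],
    "g3": [4.0, 1.0, 3.0, 2.0],
    "g4": [4.1, 0.9, 3.2, 2.1],
}
counts = cluster_counts(expression)
links_at_peak = counts.index(max(counts)) + 1   # links added when the count peaks
```

Dividing `links_at_peak` by the total number of gene pairs gives the corresponding link density for this toy network.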
Association of polymorphisms in nicotinic acetylcholine receptor alpha 4 subunit gene (CHRNA4), mu-opioid receptor gene (OPRM1), and ethanol-metabolizing enzyme genes with alcoholism in Korean patients.

PubMed

Kim, Soon Ae; Kim, Jong-Woo; Song, Ji-Young; Park, Sunny; Lee, Hee Jae; Chung, Joo-Ho

2004-01-01

Findings obtained from several studies indicate that ethanol enhances the activity of alpha4beta2 neuronal nicotinic acetylcholine receptor and support the possibility that a polymorphism of the nicotinic acetylcholine receptor alpha4 subunit gene (CHRNA4) modulates enhancement of nicotinic receptor function by ethanol. To identify the association between the CfoI polymorphism of the CHRNA4 and alcoholism, we examined distribution of genotypes and allele frequencies in Korean patients diagnosed with alcoholism (n = 127) and Korean control subjects without alcoholism (n = 185) with polymerase chain reaction-restriction fragment length polymorphism methods. We were able to detect the association between the CfoI polymorphism of the CHRNA4 and alcoholism in Korean patients (genotype P = .023; allele frequency P = .047). The genotypes and allele frequencies of known polymorphisms in other alcoholism candidate genes, such as alcohol metabolism-related genes [alcohol dehydrogenase 2 (ADH2), aldehyde dehydrogenase 2 (ALDH2), alcohol dehydrogenase 3 (ADH3), and cytochrome P450 2E1 (CYP2E1)] and mu-opioid receptor gene (OPRM1), were studied. The polymorphisms of ADH2, ALDH2, and CYP2E1 were significantly different in Korean patients with alcoholism and Korean control subjects without alcoholism, but ADH3 and OPRM1 did not differ between the two groups.
The ergot alkaloid gene cluster in Claviceps purpurea: extension of the cluster sequence and intra species evolution.

PubMed

Haarmann, Thomas; Machado, Caroline; Lübbe, Yvonne; Correia, Telmo; Schardl, Christopher L; Panaccione, Daniel G; Tudzynski, Paul

2005-06-01

The genomic region of Claviceps purpurea strain P1 containing the ergot alkaloid gene cluster [Tudzynski, P., Hölter, K., Correia, T., Arntz, C., Grammel, N., Keller, U., 1999. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol. Gen. Genet. 261, 133-141] was explored by chromosome walking, and additional genes probably involved in the ergot alkaloid biosynthesis have been identified. The putative cluster sequence (extending over 68.5kb) contains 4 different nonribosomal peptide synthetase (NRPS) genes and several putative oxidases. Northern analysis showed that most of the genes were co-regulated (repressed by high phosphate), and identified probable flanking genes by lack of co-regulation. Comparison of the cluster sequences of strain P1, an ergotamine producer, with that of strain ECC93, an ergocristine producer, showed high conservation of most of the cluster genes, but significant variation in the NRPS modules, strongly suggesting that evolution of these chemical races of C. purpurea is determined by evolution of NRPS module specificity.
Large clusters of co-expressed genes in the Drosophila genome.

PubMed

Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

2002-12-12

Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.
Structure of Escherichia coli AdhP (ethanol-inducible dehydrogenase) with bound NAD.

PubMed

Thomas, Leonard M; Harper, Angelica R; Miner, Whitney A; Ajufo, Helen O; Branscum, Katie M; Kao, Lydia; Sims, Paul A

2013-07-01

The crystal structure of AdhP, a recombinantly expressed alcohol dehydrogenase from Escherichia coli K-12 (substrain MG1655), was determined to 2.01 Å resolution. The structure, which was solved using molecular replacement, also included the structural and catalytic zinc ions and the cofactor nicotinamide adenine dinucleotide (NAD). The crystals belonged to space group P21, with unit-cell parameters a = 68.18, b = 118.92, c = 97.87 Å, β = 106.41°. The final R factor and Rfree were 0.138 and 0.184, respectively. The structure of the active site of AdhP suggested a number of residues that may participate in a proton relay, and the overall structure of AdhP, including the coordination to structural and active-site zinc ions, is similar to those of other tetrameric alcohol dehydrogenase enzymes.
Arrangement of the Clostridium baratii F7 Toxin Gene Cluster with Identification of a σ Factor That Recognizes the Botulinum Toxin Gene Cluster Promoters

DOE PAGES

Dover, Nir; Barash, Jason R.; Burke, Julianne N.; ...

2014-05-22

Botulinum neurotoxin (BoNT) is the most poisonous substance known, and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bont gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.
Differential Retention of Gene Functions in a Secondary Metabolite Cluster.

PubMed

Reynolds, Hannah T; Slot, Jason C; Divon, Hege H; Lysøe, Erik; Proctor, Robert H; Brown, Daren W

2017-08-01

In fungi, distribution of secondary metabolite (SM) gene clusters is often associated with host- or environment-specific benefits provided by SMs. In the plant pathogen Alternaria brassicicola (Dothideomycetes), the DEP cluster confers an ability to synthesize the SM depudecin, a histone deacetylase inhibitor that contributes weakly to virulence. The DEP cluster includes genes encoding enzymes, a transporter, and a transcription regulator. We investigated the distribution and evolution of the DEP cluster in 585 fungal genomes and found a wide but sporadic distribution among Dothideomycetes, Sordariomycetes, and Eurotiomycetes. We confirmed DEP gene expression and depudecin production in one fungus, Fusarium langsethiae. Phylogenetic analyses suggested 6-10 horizontal gene transfers (HGTs) of the cluster, including a transfer that led to the presence of closely related cluster homologs in Alternaria and Fusarium. The analyses also indicated that HGTs were frequently followed by loss/pseudogenization of one or more DEP genes. Independent cluster inactivation was inferred in at least four fungal classes. Analyses of transitions among functional, pseudogenized, and absent states of DEP genes among Fusarium species suggest enzyme-encoding genes are lost at higher rates than the transporter (DEP3) and regulatory (DEP6) genes. The phenotype of an experimentally-induced DEP3 mutant of Fusarium did not support the hypothesis that selective retention of DEP3 and DEP6 protects fungi from exogenous depudecin. Together, the results suggest that HGT and gene loss have contributed significantly to DEP cluster distribution, and that some DEP genes provide a greater fitness benefit possibly due to a differential tendency to form network connections. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2017. This work is written by US Government employees and is in the public domain in the US.
Functional clustering of time series gene expression data by Granger causality

PubMed Central

2012-01-01

Background: A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results: In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions: This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425
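
The Granger idea in the record above can be illustrated with a minimal sketch. It is not the authors' method: it assumes a single lag, two series, and ordinary least squares, and the names solve, rss and granger_f are hypothetical helpers introduced here. The statistic compares how well y is predicted from its own past with and without the past of x.

```python
import random


def solve(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def rss(rows, y):
    """Residual sum of squares of an OLS fit of y on the columns in rows."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(k)]
    beta = solve(xtx, xty)
    return sum((t - sum(b * v for b, v in zip(beta, r))) ** 2
               for r, t in zip(rows, y))


def granger_f(x, y):
    """F statistic for 'x Granger-causes y' using one lag (an assumption)."""
    target = y[1:]
    restricted = [[1.0, y[t - 1]] for t in range(1, len(y))]        # own past only
    full = [[1.0, y[t - 1], x[t - 1]] for t in range(1, len(y))]    # plus past of x
    rss_r, rss_u = rss(restricted, target), rss(full, target)
    dof = len(target) - 3  # observations minus parameters in the full model
    return (rss_r - rss_u) / (rss_u / dof)


# Toy data (synthetic, not from the study): y follows x with a one-step delay.
rng = random.Random(0)
x = [rng.gauss(0, 1) for _ in range(200)]
y = [0.0] + [0.8 * x[t - 1] + 0.1 * rng.gauss(0, 1) for t in range(1, 200)]
print(granger_f(x, y), granger_f(y, x))
```

On this toy data the statistic for x to y should be far larger than for y to x. A clustering built on such pairwise scores would treat them as edge weights in a gene network, which is the sense of "topological proximity" in the abstract.
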
Transcription factor clusters regulate genes in eukaryotic cells

PubMed Central

Hedlund, Erik G; Friemann, Rosmarie; Hohmann, Stefan

2017-01-01

Transcription is regulated through the binding of factors to gene promoters to activate or repress expression; however, the mechanisms by which factors find their targets remain unclear. Using single-molecule fluorescence microscopy, we determined in vivo stoichiometry and spatiotemporal dynamics of a GFP-tagged repressor, Mig1, from a paradigm signaling pathway of Saccharomyces cerevisiae. We find the repressor operates in clusters, which upon extracellular signal detection, translocate from the cytoplasm, bind to nuclear targets and turn over. Simulations of Mig1 configuration within a 3D yeast genome model combined with a promoter-specific, fluorescent translation reporter confirmed clusters are the functional unit of gene regulation. In vitro and structural analysis on reconstituted Mig1 suggests that clusters are stabilized by depletion forces between intrinsically disordered sequences. We observed similar clusters of a co-regulatory activator from a different pathway, supporting a generalized cluster model for transcription factors that reduces promoter search times through intersegment transfer while stabilizing gene expression. PMID:28841133
Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

PubMed

Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and the structure of the nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. The nifQ gene falls into the same transcription unit as the fixABCX genes, an arrangement that has not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.
Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

PubMed Central

Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and the structure of the nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. The nifQ gene falls into the same transcription unit as the fixABCX genes, an arrangement that has not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417
Hierarchical Dirichlet process model for gene expression clustering

PubMed Central

2013-01-01

Clustering is an important data processing tool for interpreting microarray data and genomic network inference. In this article, we propose a clustering algorithm based on the hierarchical Dirichlet process (HDP). The HDP clustering introduces a hierarchical structure in the statistical model which captures the hierarchical features prevalent in biological data such as gene expression data. We develop a Gibbs sampling algorithm based on the Chinese restaurant metaphor for the HDP clustering. We apply the proposed HDP algorithm to both regulatory network segmentation and gene expression clustering. The HDP algorithm is shown to outperform several popular clustering algorithms by revealing the underlying hierarchical structure of the data. For the yeast cell cycle data, we compare the HDP result to the standard result and show that the HDP algorithm provides more information and reduces the unnecessary clustering fragments. PMID:23587447
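
The Chinese restaurant metaphor mentioned in the record above can be sketched as follows. This is only a draw from a single-level Chinese restaurant process prior, not the HDP Gibbs sampler the article describes. The function name crp_assignments and the concentration parameter alpha are assumptions introduced for illustration.

```python
import random


def crp_assignments(n_items, alpha, seed=0):
    """Seat n_items customers one by one under a Chinese restaurant process.

    Each customer joins an existing table with probability proportional to
    its current size, or opens a new table with probability proportional
    to alpha (assumed > 0).
    """
    rng = random.Random(seed)
    table_sizes = []   # number of customers at each table (cluster sizes)
    labels = []        # table index chosen by each customer
    for _ in range(n_items):
        weights = table_sizes + [alpha]  # last slot stands for a new table
        table = rng.choices(range(len(weights)), weights=weights)[0]
        if table == len(table_sizes):
            table_sizes.append(1)
        else:
            table_sizes[table] += 1
        labels.append(table)
    return labels, table_sizes


labels, sizes = crp_assignments(n_items=50, alpha=1.0)
print(len(sizes), sizes)
```

The number of tables is not fixed in advance, which is why such priors suit clustering problems where the number of gene groups is unknown. In a full sampler, the seating weights would also be multiplied by the likelihood of each gene's expression profile under each table.
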
Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

PubMed

Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

2012-07-15

Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within the immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes is therefore a crucial step toward revealing the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects of acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E < 10^-5) are included in 27 clusters. Five clusters are associated with metabolism, containing P450 genes restricted to the Brassica family and predicted to be involved in secondary metabolism. Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary
Heterologous expression of pikromycin biosynthetic gene cluster using Streptomyces artificial chromosome system.

PubMed

Pyeon, Hye-Rim; Nah, Hee-Ju; Kang, Seung-Hoon; Choi, Si-Sun; Kim, Eung-Soo

2017-05-31

Heterologous expression of biosynthetic gene clusters of natural microbial products has become an essential strategy for titer improvement and pathway engineering of various potentially-valuable natural products. A Streptomyces artificial chromosomal conjugation vector, pSBAC, was previously successfully applied for precise cloning and tandem integration of a large polyketide tautomycetin (TMC) biosynthetic gene cluster (Nah et al. in Microb Cell Fact 14(1):1, 2015), implying that this strategy could be employed to develop a custom overexpression scheme of natural product pathway clusters present in actinomycetes. To validate the pSBAC system as a generally-applicable heterologous overexpression system for a large-sized polyketide biosynthetic gene cluster in Streptomyces, another model polyketide compound, the pikromycin biosynthetic gene cluster, was precisely cloned and heterologously expressed using the pSBAC system. A unique HindIII restriction site was precisely inserted at one of the border regions of the pikromycin biosynthetic gene cluster within the chromosome of Streptomyces venezuelae, followed by site-specific recombination of pSBAC into the flanking region of the pikromycin gene cluster. Unlike the previous cloning process, one HindIII site integration step was skipped through pSBAC modification. pPik001, a pSBAC containing the pikromycin biosynthetic gene cluster, was directly introduced into two heterologous hosts, Streptomyces lividans and Streptomyces coelicolor, resulting in the production of 10-deoxymethynolide, a major pikromycin derivative. When two entire pikromycin biosynthetic gene clusters were tandemly introduced into the S. lividans chromosome, overproduction of 10-deoxymethynolide and the presence of pikromycin, which was previously not detected, were both confirmed. Moreover, comparative qRT-PCR results confirmed that the transcription of pikromycin biosynthetic genes was significantly upregulated in S. lividans containing tandem
Clustering approaches to identifying gene expression patterns from DNA microarray data.

PubMed

Do, Jin Hwan; Choi, Dong-Kug

2008-04-30

Careful analysis is essential for handling the large amounts of gene expression data that microarrays produce. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
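
As a rough illustration of one crisp method named in the record above, here is a minimal K-means sketch on toy expression profiles. The data, the function names, and the choice of squared Euclidean distance are assumptions made for this example. The initial centers are passed in explicitly because, as the review notes, results can depend on initial values.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two expression profiles."""
    return sum((u - v) ** 2 for u, v in zip(a, b))


def mean_profile(profiles):
    """Component-wise mean of a non-empty list of profiles."""
    return [sum(col) / len(profiles) for col in zip(*profiles)]


def kmeans(profiles, centers, n_iter=20):
    """Plain K-means: assign each profile to its nearest center, then update."""
    for _ in range(n_iter):
        labels = [min(range(len(centers)), key=lambda c: sq_dist(p, centers[c]))
                  for p in profiles]
        new_centers = []
        for c in range(len(centers)):
            members = [p for p, lab in zip(profiles, labels) if lab == c]
            # Keep the old center if a cluster ends up empty.
            new_centers.append(mean_profile(members) if members else centers[c])
        if new_centers == centers:  # converged
            break
        centers = new_centers
    return labels, centers


# Toy data: two genes rising over three time points, two genes falling.
genes = [[1.0, 2.0, 3.0], [1.1, 2.1, 2.9], [3.0, 2.0, 1.0], [2.9, 2.2, 1.1]]
labels, centers = kmeans(genes, centers=[genes[0], genes[2]])
print(labels)
```

Rerunning with different starting centers or a different number of clusters is a quick way to see how sensitive a grouping is to those user-selected parameters.
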

A Stationary Wavelet Entropy-Based Clustering Approach Accurately Predicts Gene Expression

PubMed Central

Nguyen, Nha; Vo, An; Choi, Inchan

2015-01-01

Studying epigenetic landscapes is important to understand the conditions for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation. PMID:25383910
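
The general notion of a wavelet entropy can be sketched as below. This is not the Dewer implementation, whose details the abstract does not give. The sketch assumes an undecimated (stationary) Haar transform with periodic boundaries and a Shannon entropy over the share of detail energy at each level; the function names and the toy signal are hypothetical.

```python
import math


def haar_swt_details(signal, level):
    """Undecimated Haar detail coefficients at one level (periodic boundary)."""
    n = len(signal)
    approx = list(signal)
    # Smooth repeatedly to reach the approximation at the previous level.
    for lev in range(1, level):
        step = 2 ** (lev - 1)
        approx = [(approx[i] + approx[(i + step) % n]) / math.sqrt(2)
                  for i in range(n)]
    step = 2 ** (level - 1)
    return [(approx[i] - approx[(i + step) % n]) / math.sqrt(2)
            for i in range(n)]


def wavelet_entropy(signal, levels=3):
    """Shannon entropy of the relative detail energy across wavelet levels."""
    energies = [sum(d * d for d in haar_swt_details(signal, lev))
                for lev in range(1, levels + 1)]
    total = sum(energies)
    if total == 0:
        return 0.0  # a flat signal carries no detail energy
    probs = [e / total for e in energies if e > 0]
    return -sum(p * math.log(p) for p in probs)


# Toy signal standing in for read coverage across an enriched region.
coverage = [0, 1, 3, 2, 5, 4, 4, 1, 0, 0, 2, 6, 3, 1, 0, 2]
print(wavelet_entropy(coverage))
```

A flat signal gives an entropy of zero, while a signal whose variation is spread over several scales gives a higher value, which is the kind of shape information a single window average discards.
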
Engineering substrate promiscuity in halophilic alcohol dehydrogenase (HvADH2) by in silico design.

PubMed

Cassidy, Jennifer; Bruen, Larah; Rosini, Elena; Molla, Gianluca; Pollegioni, Loredano; Paradisi, Francesca

2017-01-01

An alcohol dehydrogenase from the halophilic archaeon Haloferax volcanii (HvADH2) has been engineered by rational design to broaden its substrate scope towards the conversion of a range of aromatic substrates, including flurbiprofenol, an intermediate of the non-steroidal anti-inflammatory drug flurbiprofen. Wild-type HvADH2 showed minimal activity with flurbiprofenol (11.1 mU/mg). A homology model of HvADH2 was built and docking experiments with this substrate revealed that the biphenyl rings of flurbiprofenol formed strong interactions with residues F85 and F108, preventing its optimal binding in the active site. Mutations at position 85, however, did not increase activity. Site-directed mutagenesis at position F108 allowed the identification of three variants showing a significant (up to 2.3-fold) enhancement of activity towards flurbiprofenol when compared to wild-type HvADH2. Interestingly, the F108G variant did not show the classic inhibition in the presence of the (R)-enantiomer when tested with rac-1-phenylethanol, underlining its potential in racemic resolution of secondary alcohols.
An effective fuzzy kernel clustering analysis approach for gene expression data.

PubMed

Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao

2015-01-01

Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering methods to microarray gene expression data is the choice of parameters such as cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies the desired cluster number and obtains more stable results for gene expression data. First of all, to optimize characteristic differences and estimate the optimal cluster number, a Gaussian kernel function is introduced to improve the spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, a maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than that of other related clustering algorithms.
Clustering Genes of Common Evolutionary History

PubMed Central

Gori, Kevin; Suchan, Tomasz; Alvarez, Nadir; Goldman, Nick; Dessimoz, Christophe

2016-01-01

Phylogenetic inference can potentially result in a more accurate tree using data from multiple loci. However, if the loci are incongruent—due to events such as incomplete lineage sorting or horizontal gene transfer—it can be misleading to infer a single tree. To address this, many previous contributions have taken a mechanistic approach, by modeling specific processes. Alternatively, one can cluster loci without assuming how these incongruencies might arise. Such “process-agnostic” approaches typically infer a tree for each locus and cluster these. There are, however, many possible combinations of tree distance and clustering methods; their comparative performance in the context of tree incongruence is largely unknown. Furthermore, because standard model selection criteria such as AIC cannot be applied to problems with a variable number of topologies, the issue of inferring the optimal number of clusters is poorly understood. Here, we perform a large-scale simulation study of phylogenetic distances and clustering methods to infer loci of common evolutionary history. We observe that the best-performing combinations are distances accounting for branch lengths followed by spectral clustering or Ward’s method. We also introduce two statistical tests to infer the optimal number of clusters and show that they strongly outperform the silhouette criterion, a general-purpose heuristic. We illustrate the usefulness of the approach by 1) identifying errors in a previous phylogenetic analysis of yeast species and 2) identifying topological incongruence among newly sequenced loci of the globeflower fly genus Chiastocheta. We release treeCl, a new program to cluster genes of common evolutionary history (http://git.io/treeCl). PMID:26893301
Distribution and Genetic Diversity of Bacteriocin Gene Clusters in Rumen Microbial Genomes.

PubMed

Azevedo, Analice C; Bento, Cláudia B P; Ruiz, Jeronimo C; Queiroz, Marisa V; Mantovani, Hilário C

2015-10-01

Some species of ruminal bacteria are known to produce antimicrobial peptides, but the screening procedures have mostly been based on in vitro assays using standardized methods. Recent sequencing efforts have made available the genome sequences of hundreds of ruminal microorganisms. In this work, we performed genome mining of the complete and partial genome sequences of 224 ruminal bacteria and 5 ruminal archaea to determine the distribution and diversity of bacteriocin gene clusters. A total of 46 bacteriocin gene clusters were identified in 33 strains of ruminal bacteria. Twenty gene clusters were related to lanthipeptide biosynthesis, while 11 gene clusters were associated with sactipeptide production, 7 gene clusters were associated with class II bacteriocin production, and 8 gene clusters were associated with class III bacteriocin production. The frequency of strains whose genomes encode putative antimicrobial peptide precursors was 14.4%. Clusters related to the production of sactipeptides were identified for the first time among ruminal bacteria. BLAST analysis indicated that the majority of the gene clusters (88%) encoding putative lanthipeptides contained all the essential genes required for lanthipeptide biosynthesis. Most strains of Streptococcus (66.6%) harbored complete lanthipeptide gene clusters, in addition to an open reading frame encoding a putative class II bacteriocin. Albusin B-like proteins were found in 100% of the Ruminococcus albus strains screened in this study. The in silico analysis provided evidence of novel biosynthetic gene clusters in bacterial species not previously related to bacteriocin production, suggesting that the rumen microbiota represents an underexplored source of antimicrobial peptides. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
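
The counts in the record above can be cross-checked with a few lines of arithmetic. This check assumes that the reported 14.4% frequency is the 33 strains with clusters divided by all 229 genomes screened (224 bacterial plus 5 archaeal), which the abstract does not state explicitly.

```python
genomes_screened = 224 + 5            # ruminal bacteria + ruminal archaea
strains_with_clusters = 33
clusters_by_type = {
    "lanthipeptide": 20,
    "sactipeptide": 11,
    "class II bacteriocin": 7,
    "class III bacteriocin": 8,
}

frequency_pct = 100 * strains_with_clusters / genomes_screened
total_clusters = sum(clusters_by_type.values())
print(f"{frequency_pct:.1f}% of genomes; {total_clusters} clusters")
```

Under that assumption the figures are mutually consistent: the four categories sum to the 46 clusters reported, and 33 of 229 rounds to 14.4%.
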
Assembly and features of secondary metabolite biosynthetic gene clusters in Streptomyces ansochromogenes.

PubMed

Zhong, Xingyu; Tian, Yuqing; Niu, Guoqing; Tan, Huarong

2013-07-01

A draft genome sequence of Streptomyces ansochromogenes 7100 was generated using 454 sequencing technology. In combination with local BLAST searches and gap filling techniques, a comprehensive antiSMASH-based method was adopted to assemble the secondary metabolite biosynthetic gene clusters in the draft genome of S. ansochromogenes. A total of at least 35 putative gene clusters were identified and assembled. Transcriptional analysis showed that 20 of the 35 gene clusters were expressed in either or all of the three different media tested, whereas the other 15 gene clusters were silent in all three different media. This study provides a comprehensive method to identify and assemble secondary metabolite biosynthetic gene clusters in draft genomes of Streptomyces, and will significantly promote functional studies of these secondary metabolite biosynthetic gene clusters.
Transcriptome Analysis of Aspergillus flavus Reveals veA-Dependent Regulation of Secondary Metabolite Gene Clusters, Including the Novel Aflavarin Cluster

PubMed Central

Cary, J. W.; Han, Z.; Yin, Y.; Lohmar, J. M.; Shantappa, S.; Harris-Coward, P. Y.; Mack, B.; Ehrlich, K. C.; Wei, Q.; Arroyo-Manzanares, N.; Uka, V.; Vanhaecke, L.; Bhatnagar, D.; Yu, J.; Nierman, W. C.; Johns, M. A.; Sorensen, D.; Shen, H.; De Saeger, S.; Diana Di Mavungu, J.

2015-01-01

The global regulatory veA gene governs development and secondary metabolism in numerous fungal species, including Aspergillus flavus. This is especially relevant since A. flavus infects crops of agricultural importance worldwide, contaminating them with potent mycotoxins. The most well-known are aflatoxins, which are cytotoxic and carcinogenic polyketide compounds. The production of aflatoxins and the expression of genes implicated in the production of these mycotoxins are veA dependent. The genes responsible for the synthesis of aflatoxins are clustered, a signature common for genes involved in fungal secondary metabolism. Studies of the A. flavus genome revealed many gene clusters possibly connected to the synthesis of secondary metabolites. Many of these metabolites are still unknown, or the association between a known metabolite and a particular gene cluster has not yet been established. In the present transcriptome study, we show that veA is necessary for the expression of a large number of genes. Twenty-eight out of the predicted 56 secondary metabolite gene clusters include at least one gene that is differentially expressed depending on presence or absence of veA. One of the clusters under the influence of veA is cluster 39. The absence of veA results in a downregulation of the five genes found within this cluster. Interestingly, our results indicate that the cluster is expressed mainly in sclerotia. Chemical analysis of sclerotial extracts revealed that cluster 39 is responsible for the production of aflavarin. PMID:26209694
Identifying a gene expression signature of cluster headache in blood

PubMed Central

Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

2017-01-01

Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859
The human RHOX gene cluster: target genes and functional analysis of gene variants in infertile men.

PubMed

Borgmann, Jennifer; Tüttelmann, Frank; Dworniczak, Bernd; Röpke, Albrecht; Song, Hye-Won; Kliesch, Sabine; Wilkinson, Miles F; Laurentino, Sandra; Gromoll, Jörg

2016-11-15

The X-linked reproductive homeobox (RHOX) gene cluster encodes transcription factors preferentially expressed in reproductive tissues. This gene cluster has important roles in male fertility based on phenotypic defects of Rhox-mutant mice and the finding that aberrant RHOX promoter methylation is strongly associated with abnormal human sperm parameters. However, little is known about the molecular mechanism of RHOX function in humans. Using gene expression profiling, we identified genes regulated by members of the human RHOX gene cluster. Some genes were uniquely regulated by RHOXF1 or RHOXF2/2B, while others were regulated by both of these transcription factors. Several of these regulated genes encode proteins involved in processes relevant to spermatogenesis; e.g. stress protection and cell survival. One of the target genes of RHOXF2/2B is RHOXF1, suggesting cross-regulation to enhance transcriptional responses. The potential role of RHOX in human infertility was addressed by sequencing all RHOX exons in a group of 250 patients with severe oligozoospermia. This revealed two mutations in RHOXF1 (c.515G > A and c.522C > T) and four in RHOXF2/2B (-73C > G, c.202G > A, c.411C > T and c.679G > A), of which only one (c.202G > A) was found in a control group of men with normal sperm concentration. Functional analysis demonstrated that c.202G > A and c.679G > A significantly impaired the ability of RHOXF2/2B to regulate downstream genes. Molecular modelling suggested that these mutations alter RHOXF2/F2B protein conformation. By combining clinical data with in vitro functional analysis, we demonstrate how the X-linked RHOX gene cluster may function in normal human spermatogenesis and we provide evidence that it is impaired in human male fertility.
Many nonuniversal archaeal ribosomal proteins are found in conserved gene clusters

PubMed Central

WANG, JIACHEN; DASGUPTA, INDRANI; FOX, GEORGE E.

2009-01-01

The genomic associations of the archaeal ribosomal proteins (r-proteins) were examined in detail. The archaeal versions of the universal r-protein genes are typically in clusters similar or identical to those found in bacteria. Of the 35 nonuniversal archaeal r-protein genes examined, the gene encoding L18e was found to be associated with the conserved L13 cluster, whereas the genes for S4e, L32e and L19e were found in the archaeal version of the spc operon. Eleven nonuniversal protein genes were not associated with any common genomic context. Of the remaining 19 protein genes, 17 were convincingly assigned to one of 10 previously unrecognized gene clusters. Examination of the gene content of these clusters revealed multiple associations with genes involved in the initiation of protein synthesis, transcription or other cellular processes. The lack of such associations in the universal clusters suggests that initially the ribosome evolved largely independently of other processes. More recently it likely has evolved in concert with other cellular systems. It was also verified that a second copy of the gene encoding L7ae found in some bacteria is actually a homolog of the gene encoding L30e and should be annotated as such. PMID:19478915
Conditioning to ethanol in the fruit fly-a study using an inhibitor of ADH.

PubMed

Cadieu, N; Cadieu, J -C.; El Ghadraoui, L; Grimal, A; Lamboeuf, Y

1999-06-01

To identify processes involved in the choice of ethanol by adult Drosophila, flies homozygous for Adh(F), reared in the absence of alcohol, were placed in contact with: a) an ethanol-free medium, b) a medium containing ethanol, c) a medium supplemented with 4-methylpyrazole (4-MP, an inhibitor of the ADH pathway), d) a medium containing ethanol and 4-MP. The choice of ethanol over a medium without ethanol was evaluated by measuring the duration of extension of the proboscis of the flies in each of the media. A slight preference for the ethanol-supplemented medium was observed in the naive flies, which was enhanced by previous exposure to ethanol. Exposure to ethanol and 4-MP, however, led to an avoidance of ethanol. There was a reduction in ADH activity on treatment of the flies with 4-MP, and signs of malaise (reduced locomotor activity, loss of balance) were observed in the flies that ingested both ethanol and inhibitor. We concluded that the preference for ethanol stems from associative learning related to ethanol utilization. Inhibition of enzymes of the ADH pathway led to a conditioned aversion due to disturbance of ethanol metabolism giving rise to malaise.
Hox gene cluster of the ascidian, Halocynthia roretzi, reveals multiple ancient steps of cluster disintegration during ascidian evolution.

PubMed

Sekigami, Yuka; Kobayashi, Takuya; Omi, Ai; Nishitsuji, Koki; Ikuta, Tetsuro; Fujiyama, Asao; Satoh, Noriyuki; Saiga, Hidetoshi

2017-01-01

Hox gene clusters with at least 13 paralog group (PG) members are common in vertebrate genomes and in that of amphioxus. Ascidians, which belong to the subphylum Tunicata (Urochordata), are phylogenetically positioned between vertebrates and amphioxus, and traditionally divided into two groups: the Pleurogona and the Enterogona. An enterogonan ascidian, Ciona intestinalis (Ci), possesses nine Hox genes localized on two chromosomes; thus, the Hox gene cluster is disintegrated. We examined the Hox gene cluster of a pleurogonan ascidian, Halocynthia roretzi (Hr), to investigate whether Hox gene cluster disintegration is common among ascidians, and if so, how such disintegration occurred during ascidian or tunicate evolution. Our phylogenetic analysis reveals that the Hr Hox gene complement comprises nine members, including one with a relatively divergent Hox homeodomain sequence. Eight of nine Hr Hox genes were orthologous to Ci-Hox1, 2, 3, 4, 5, 10, 12 and 13. Following the phylogenetic classification into 13 PGs, we designated Hr Hox genes as Hox1, 2, 3, 4, 5, 10, 11/12/13.a, 11/12/13.b and HoxX. To address the chromosomal arrangement of the nine Hox genes, we performed two-color chromosomal fluorescent in situ hybridization, which revealed that the nine Hox genes are localized on a single chromosome in Hr, distinct from their arrangement in Ci. We further examined the order of the nine Hox genes on the chromosome by chromosome/scaffold walking. This analysis suggested a gene order of Hox1, 11/12/13.b, 11/12/13.a, 10, 5, X, followed by either Hox4, 3, 2 or Hox2, 3, 4 on the chromosome. Based on the present results and those previously reported in Ci, we discuss the establishment of the Hox gene complement and disintegration of Hox gene clusters during the course of ascidian or tunicate evolution. The Hox gene cluster and the genome must have experienced extensive reorganization during the course of evolution from the ancestral tunicate to Hr and Ci
The ergot alkaloid gene cluster: functional analyses and evolutionary aspects.

PubMed

Lorenz, Nicole; Haarmann, Thomas; Pazoutová, Sylvie; Jung, Manfred; Tudzynski, Paul

2009-01-01

Ergot alkaloids and their derivatives have traditionally been used as therapeutic agents in migraine and blood pressure regulation, and as aids in childbirth and abortion. Their production in submerged culture is a long-established biotechnological process. Ergot alkaloids are produced mainly by members of the genus Claviceps, with Claviceps purpurea as the best-investigated species concerning the biochemistry of ergot alkaloid synthesis (EAS). Genes encoding enzymes involved in EAS have been shown to be clustered; functional analyses of EAS cluster genes have allowed specific functions to be assigned to several gene products. Various Claviceps species differ with respect to their host specificity and their alkaloid content; comparison of the ergot alkaloid clusters in these species (and of clavine alkaloid clusters in other genera) yields interesting insights into the evolution of cluster structure. This review focuses on recently published and also yet unpublished data on the structure and evolution of the EAS gene cluster and on the function and regulation of cluster genes. These analyses also have significant biotechnological implications: the characterization of non-ribosomal peptide synthetases (NRPS) involved in the synthesis of the peptide moiety of ergopeptines opened interesting perspectives for the synthesis of ergot alkaloids; on the other hand, defined mutants could be generated producing interesting intermediates or only single peptide alkaloids (instead of the alkaloid mixtures usually produced by industrial strains).
Temporal expression of the human alcohol dehydrogenase gene family during liver development correlates with differential promoter activation by hepatocyte nuclear factor 1, CCAAT/enhancer-binding protein alpha, liver activator protein, and D-element-binding protein.

PubMed Central

van Ooij, C; Snyder, R C; Paeper, B W; Duester, G

1992-01-01

The human class I alcohol dehydrogenase (ADH) gene family consists of ADH1, ADH2, and ADH3, which are sequentially activated in early fetal, late fetal, and postnatal liver, respectively. Analysis of ADH promoters revealed differential activation by several factors previously shown to control liver transcription. In cotransfection assays, the ADH1 promoter, but not the ADH2 or ADH3 promoter, was shown to respond to hepatocyte nuclear factor 1 (HNF-1), which has previously been shown to regulate transcription in early liver development. The ADH2 promoter, but not the ADH1 or ADH3 promoter, was shown to respond to CCAAT/enhancer-binding protein alpha (C/EBP alpha), a transcription factor particularly active during late fetal liver and early postnatal liver development. The ADH1, ADH2, and ADH3 promoters all responded to the liver transcription factors liver activator protein (LAP) and D-element-binding protein (DBP), which are most active in postnatal liver. For all three promoters, the activation by LAP or DBP was higher than that seen by HNF-1 or C/EBP alpha, and a significant synergism between C/EBP alpha and LAP was noticed for the ADH2 and ADH3 promoters when both factors were simultaneously cotransfected. A hierarchy of ADH promoter responsiveness to C/EBP alpha and LAP homo- and heterodimers is suggested. In all three ADH genes, LAP bound to the same four sites previously reported for C/EBP alpha (i.e., -160, -120, -40, and -20 bp), but DBP bound strongly only to the site located at -40 bp relative to the transcriptional start. Mutational analysis of ADH2 indicated that the -40 bp element accounts for most of the promoter regulation by the bZIP factors analyzed. These studies suggest that HNF-1 and C/EBP alpha help establish ADH gene family transcription in fetal liver and that LAP and DBP help maintain high-level ADH gene family transcription in postnatal liver. PMID:1620113
OptSSeq: High-throughput sequencing readout of growth enrichment defines optimal gene expression elements for homoethanologenesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ghosh, Indro Neil; Landick, Robert

The optimization of synthetic pathways is a central challenge in metabolic engineering. OptSSeq (Optimization by Selection and Sequencing) is one approach to this challenge. OptSSeq couples selection of optimal enzyme expression levels linked to cell growth rate with high-throughput sequencing to track enrichment of gene expression elements (promoters and ribosome-binding sites) from a combinatorial library. OptSSeq yields information on both optimal and suboptimal enzyme levels, and helps identify constraints that limit maximal product formation. Here we report a proof-of-concept implementation of OptSSeq using homoethanologenesis, a two-step pathway consisting of pyruvate decarboxylase (Pdc) and alcohol dehydrogenase (Adh) that converts pyruvate to ethanol and is naturally optimized in the bacterium Zymomonas mobilis. We used OptSSeq to determine optimal gene expression elements and enzyme levels for Z. mobilis Pdc, AdhA, and AdhB expressed in Escherichia coli. By varying both expression signals and gene order, we identified an optimal solution using only Pdc and AdhB. We resolved current uncertainty about the functions of the Fe2+-dependent AdhB and Zn2+-dependent AdhA by showing that AdhB is preferred over AdhA for rapid growth in both E. coli and Z. mobilis. Finally, by comparing predictions of growth-linked metabolic flux to enzyme synthesis costs, we established that optimal E. coli homoethanologenesis was achieved by our best pdc-adhB expression cassette and that the remaining constraints lie in the E. coli metabolic network or inefficient Pdc or AdhB function in E. coli. Furthermore, OptSSeq is a general tool for synthetic biology to tune enzyme levels in any pathway whose optimal function can be linked to cell growth or survival.
OptSSeq: High-throughput sequencing readout of growth enrichment defines optimal gene expression elements for homoethanologenesis

DOE PAGES

Ghosh, Indro Neil; Landick, Robert

2016-07-16

The optimization of synthetic pathways is a central challenge in metabolic engineering. OptSSeq (Optimization by Selection and Sequencing) is one approach to this challenge. OptSSeq couples selection of optimal enzyme expression levels linked to cell growth rate with high-throughput sequencing to track enrichment of gene expression elements (promoters and ribosome-binding sites) from a combinatorial library. OptSSeq yields information on both optimal and suboptimal enzyme levels, and helps identify constraints that limit maximal product formation. Here we report a proof-of-concept implementation of OptSSeq using homoethanologenesis, a two-step pathway consisting of pyruvate decarboxylase (Pdc) and alcohol dehydrogenase (Adh) that converts pyruvate to ethanol and is naturally optimized in the bacterium Zymomonas mobilis. We used OptSSeq to determine optimal gene expression elements and enzyme levels for Z. mobilis Pdc, AdhA, and AdhB expressed in Escherichia coli. By varying both expression signals and gene order, we identified an optimal solution using only Pdc and AdhB. We resolved current uncertainty about the functions of the Fe2+-dependent AdhB and Zn2+-dependent AdhA by showing that AdhB is preferred over AdhA for rapid growth in both E. coli and Z. mobilis. Finally, by comparing predictions of growth-linked metabolic flux to enzyme synthesis costs, we established that optimal E. coli homoethanologenesis was achieved by our best pdc-adhB expression cassette and that the remaining constraints lie in the E. coli metabolic network or inefficient Pdc or AdhB function in E. coli. Furthermore, OptSSeq is a general tool for synthetic biology to tune enzyme levels in any pathway whose optimal function can be linked to cell growth or survival.
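The idea of tracking enrichment of library elements by sequencing can be illustrated with a small sketch. The abstract does not give the scoring used by the authors, so the log2 ratio of relative read frequencies below is only one plausible choice; the function name, the pseudocount, the element labels, and all read counts are hypothetical.

```python
import math

def log2_enrichment(before, after, pseudocount=1.0):
    """Log2 change in relative read frequency for each library element.

    `before` and `after` map element names to read counts sampled
    before and after growth selection (hypothetical inputs).
    """
    n = len(before)
    total_before = sum(before.values()) + pseudocount * n
    total_after = sum(after.get(e, 0) for e in before) + pseudocount * n
    scores = {}
    for element, count in before.items():
        f_before = (count + pseudocount) / total_before
        f_after = (after.get(element, 0) + pseudocount) / total_after
        scores[element] = math.log2(f_after / f_before)
    return scores

# Hypothetical promoter/RBS combinations and read counts, for illustration only.
before = {"P1-RBS1": 1000, "P1-RBS2": 1000, "P2-RBS1": 1000, "P2-RBS2": 1000}
after = {"P1-RBS1": 4000, "P1-RBS2": 500, "P2-RBS1": 1500, "P2-RBS2": 0}

scores = log2_enrichment(before, after)
ranked = sorted(scores, key=scores.get, reverse=True)
```

Elements with positive scores became more frequent under selection, and elements with negative scores were depleted; the pseudocount keeps elements that drop out entirely from producing an undefined logarithm.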
Gender differences in the effects of ADH1B and ALDH2 polymorphisms on alcoholism.

PubMed

Kimura, Mitsuru; Miyakawa, Tomohiro; Matsushita, Sachio; So, Mirai; Higuchi, Susumu

2011-11-01

Gender differences are known to exist in the prevalence, characteristics, and course of alcohol dependence. Elucidating gender differences in the characteristics of alcohol dependence is important in gender-based medicine and may improve treatment outcomes. Many studies have shown that genetic factors are associated with the risk of alcohol dependence in both genders. Polymorphisms of alcohol dehydrogenase-1B (ADH1B) and aldehyde dehydrogenase-2 (ALDH2) are strong genetic determinants of alcohol dependence. This study aimed to clarify gender differences in the effects of ADH1B and ALDH2 polymorphism on the development of alcohol dependence. Subjects were 200 female alcoholics and 415 male alcoholics hospitalized in Kurihama Alcoholism Center. Clinical information and background data were obtained by chart review. ALDH2 and ADH1B genotyping was performed by the polymerase chain reaction-restriction fragment length polymorphism method. The onset age of female alcoholics with inactive ALDH2 genotype was significantly lower than those with active ALDH2 genotype, but the onset age did not differ between the inactive and active ALDH2 group in male alcoholics. The difference in onset age between the ADH1B genotype groups did not reach significant levels. The prevalence of comorbid psychiatric disorders, including major depression, eating disorder, panic disorder, and borderline personality disorder, was significantly higher in female alcoholics with inactive ALDH2 or superactive ADH1B than in those with active ALDH2 or normal ADH1B. ALDH2 polymorphism appears to have contrasting effects on the development of alcoholism in women and men. One possible reason for this gender difference may be the high prevalence of psychiatric comorbidities in female alcoholics with inactive ALDH2. Copyright © 2011 by the Research Society on Alcoholism.
Strong Magnetic Field Induced Changes of Gene Expression in Arabidopsis

NASA Astrophysics Data System (ADS)

Paul, A.-L.; Ferl, R. J.; Klingenberg, B.; Brooks, J. S.; Morgan, A. N.; Yowtak, J.; Meisel, M. W.

2005-07-01

We review our studies of the biological impact of magnetic field strengths of up to 30 T on transgenic Arabidopsis plants engineered with a stress response gene consisting of the alcohol dehydrogenase (Adh) gene promoter driving the β-glucuronidase (GUS) gene reporter. Field strengths in excess of 15 T induce expression of the Adh/GUS transgene in the roots and leaves. Microarray analyses indicate that such field strengths have a far-reaching effect on the genome. Widespread induction of stress-related genes and transcription factors, and a depression of genes associated with cell wall metabolism, are prominent examples.
Interactions Between Alcohol Metabolism Genes and Religious Involvement in Association With Maximum Drinks and Alcohol Dependence Symptoms

PubMed Central

Chartier, Karen G.; Dick, Danielle M.; Almasy, Laura; Chan, Grace; Aliev, Fazil; Schuckit, Marc A.; Scott, Denise M.; Kramer, John; Bucholz, Kathleen K.; Bierut, Laura J.; Nurnberger, John; Porjesz, Bernice; Hesselbrock, Victor M.

2016-01-01

Objective: Variations in the genes encoding alcohol dehydrogenase (ADH) enzymes are associated with both alcohol consumption and dependence in multiple populations. Additionally, some environmental factors have been recognized as modifiers of these relationships. This study examined the modifying effect of religious involvement on relationships between ADH gene variants and alcohol consumption–related phenotypes. Method: Subjects were African American, European American, and Hispanic American adults with lifetime exposure to alcohol (N = 7,716; 53% female) from the Collaborative Study on the Genetics of Alcoholism. Genetic markers included ADH1B-rs1229984, ADH1B-rs2066702, ADH1C-rs698, ADH4-rs1042364, and ADH4-rs1800759. Phenotypes were maximum drinks consumed in a 24-hour period and total number of alcohol dependence symptoms according to the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition. Religious involvement was defined by self-reported religious services attendance. Results: Both religious involvement and ADH1B-rs1229984 were negatively associated with the number of maximum drinks consumed and the number of lifetime alcohol dependence symptoms endorsed. The interactions of religious involvement with ADH1B-rs2066702, ADH1C-rs698, and ADH4-rs1042364 were significantly associated with maximum drinks and alcohol dependence symptoms. Risk variants had weaker associations with maximum drinks and alcohol dependence symptoms as a function of increasing religious involvement. Conclusions: This study provided initial evidence of a modifying effect for religious involvement on relationships between ADH variants and maximum drinks and alcohol dependence symptoms. PMID:27172571
Clustered Genes Involved in Cyclopiazonic Acid Production are Next to the Aflatoxin Biosynthesis Gene Cluster in Aspergillus flavus

USDA-ARS's Scientific Manuscript database

Cyclopiazonic acid (CPA), an indole-tetramic acid toxin, is produced by many species of Aspergillus and Penicillium. In addition to CPA, Aspergillus flavus produces polyketide-derived carcinogenic aflatoxins (AFs). AF biosynthesis genes form a gene cluster in a subtelomeric region. Isolates of A. fla...

Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cameron, R A; Rowen, L; Nesbitt, R

2005-10-11

The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is: 5'-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5-3'.) The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.
Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

DOE Office of Scientific and Technical Information (OSTI.GOV)

Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew

2005-05-10

The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is: 5'-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5-3'.) The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.
clusterProfiler: an R package for comparing biological themes among gene clusters.

PubMed

Yu, Guangchuang; Wang, Li-Gen; Han, Yanyan; He, Qing-Yu

2012-05-01

Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler, that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species: humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under the Artistic-2.0 License within the Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.
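Enrichment analysis of a gene cluster is commonly framed as an over-representation test. The abstract does not say which statistic the package uses, so the one-sided hypergeometric test sketched here is an assumption chosen for illustration, written in plain Python rather than R; the function name and the example numbers are hypothetical and are not part of the clusterProfiler API.

```python
from math import comb

def hypergeom_enrichment_p(background, annotated, cluster_size, overlap):
    """One-sided over-representation p-value, P(X >= overlap).

    background   : number of genes in the background set
    annotated    : background genes annotated to the term of interest
    cluster_size : number of genes in the cluster being tested
    overlap      : cluster genes annotated to the term
    """
    upper = min(annotated, cluster_size)
    total = comb(background, cluster_size)
    tail = sum(
        comb(annotated, i) * comb(background - annotated, cluster_size - i)
        for i in range(overlap, upper + 1)
    )
    return tail / total

# Hypothetical numbers: 20 background genes, 5 annotated to a term,
# a cluster of 4 genes of which 3 carry the annotation.
p_value = hypergeom_enrichment_p(20, 5, 4, 3)
```

In practice one p-value is computed per term and the set of p-values is corrected for multiple testing before any term is reported as enriched.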
From hormones to secondary metabolism: the emergence of metabolic gene clusters in plants.

PubMed

Chu, Hoi Yee; Wegel, Eva; Osbourn, Anne

2011-04-01

Gene clusters for the synthesis of secondary metabolites are a common feature of microbial genomes. Well-known examples include clusters for the synthesis of antibiotics in actinomycetes, and also for the synthesis of antibiotics and toxins in filamentous fungi. Until recently it was thought that genes for plant metabolic pathways were not clustered, and this is certainly true in many cases; however, five plant secondary metabolic gene clusters have now been discovered, all of them implicated in synthesis of defence compounds. An obvious assumption might be that these eukaryotic gene clusters have arisen by horizontal gene transfer from microbes, but there is compelling evidence to indicate that this is not the case. This raises intriguing questions about how widespread such clusters are, what the significance of clustering is, why genes for some metabolic pathways are clustered and those for others are not, and how these clusters form. In answering these questions we may hope to learn more about mechanisms of genome plasticity and adaptive evolution in plants. It is noteworthy that for the five plant secondary metabolic gene clusters reported so far, the enzymes for the first committed steps all appear to have been recruited directly or indirectly from primary metabolic pathways involved in hormone synthesis. This may or may not turn out to be a common feature of plant secondary metabolic gene clusters as new clusters emerge. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
Drivers of genetic diversity in secondary metabolic gene clusters within a fungal species

PubMed Central

Lind, Abigail L.; Wisecaver, Jennifer H.; Lameiras, Catarina; Wiemann, Philipp; Palmer, Jonathan M.; Keller, Nancy P.; Rodrigues, Fernando; Goldman, Gustavo H.

2017-01-01

Filamentous fungi produce a diverse array of secondary metabolites (SMs) critical for defense, virulence, and communication. The metabolic pathways that produce SMs are found in contiguous gene clusters in fungal genomes, an atypical arrangement for metabolic pathways in other eukaryotes. Comparative studies of filamentous fungal species have shown that SM gene clusters are often either highly divergent or uniquely present in one or a handful of species, hampering efforts to determine the genetic basis and evolutionary drivers of SM gene cluster divergence. Here, we examined SM variation in 66 cosmopolitan strains of a single species, the opportunistic human pathogen Aspergillus fumigatus. Investigation of genome-wide within-species variation revealed 5 general types of variation in SM gene clusters: nonfunctional gene polymorphisms; gene gain and loss polymorphisms; whole cluster gain and loss polymorphisms; allelic polymorphisms, in which different alleles corresponded to distinct, nonhomologous clusters; and location polymorphisms, in which a cluster was found to differ in its genomic location across strains. These polymorphisms affect the function of representative A. fumigatus SM gene clusters, such as those involved in the production of gliotoxin, fumigaclavine, and helvolic acid as well as the function of clusters with undefined products. In addition to enabling the identification of polymorphisms, the detection of which requires extensive genome-wide synteny conservation (e.g., mobile gene clusters and nonhomologous cluster alleles), our approach also implicated multiple underlying genetic drivers, including point mutations, recombination, and genomic deletion and insertion events as well as horizontal gene transfer from distant fungi. Finally, most of the variants that we uncover within A. fumigatus have been previously hypothesized to contribute to SM gene cluster diversity across entire fungal classes and phyla. We suggest that the drivers of genetic
Remote sensing of gene expression in Planta: transgenic plants as monitors of exogenous stress perception in extraterrestrial environments

NASA Technical Reports Server (NTRS)

Manak, Michael S.; Paul, Anna-Lisa; Sehnke, Paul C.; Ferl, Robert J.

2002-01-01

Transgenic Arabidopsis plants containing the alcohol dehydrogenase (Adh) gene promoter fused to the green fluorescent protein (GFP) reporter gene were developed as biological sensors for monitoring physiological responses to unique environments. Plants were monitored in vivo during exposure to hypoxia, high salt, cold, and abscisic acid in experiments designed to characterize the utility and responses of the Adh/GFP biosensors. Plants in the presence of environmental stimuli that induced the Adh promoter responded by expressing GFP, which in turn generated a detectable fluorescent signal. The GFP signal degraded when the inducing stimulus was removed. Digital imaging of the Adh/GFP plants exposed to each of the exogenous stresses demonstrated that the stress-induced gene expression could be followed in real time. The experimental results established the feasibility of using a digital monitoring system for collecting gene expression data in real time from Transgenic Arabidopsis Gene Expression System (TAGES) biosensor plants during space exploration experiments.
Analysis of lamprey clustered Fox genes: insight into Fox gene evolution and expression in vertebrates.

PubMed

Wotton, Karl R; Shimeld, Sebastian M

2011-12-01

In the human genome, members of the FoxC, FoxF, FoxL1, and FoxQ1 gene families are found in two paralogous clusters. One cluster contains the genes FOXQ1, FOXF2, and FOXC1, and the second consists of FOXF1, FOXC2, and FOXL1. In jawed vertebrates these genes are known to be expressed in different pharyngeal tissues and all, except FoxQ1, are involved in patterning the early embryonic mesoderm. We have previously traced the evolution of this cluster in the bony vertebrates, and the gene content is identical in the dogfish, a member of the most basally branching lineage of the jawed vertebrates. Here we extend these analyses to jawless vertebrates. Using genomic searches and molecular approaches we have identified homologues of these genes from lampreys. We identify two FoxC genes, two FoxF genes, two FoxQ1 genes and a single FoxL1 gene. We examine the embryonic expression of one predominantly mesodermally expressed gene family, FoxC, and the endodermally expressed member of the cluster, FoxQ1. We identified FoxQ1 transcripts in the pharyngeal endoderm, while the two FoxC genes are differentially expressed in the pharyngeal mesenchyme and ectoderm. Furthermore, we identify conserved expression of lamprey FoxC genes in the paraxial and intermediate mesoderms. We interpret our results through a chordate-wide comparison of expression patterns and discuss gene content in the context of theories on the evolution of the vertebrate genome. Copyright © 2011 Elsevier B.V. All rights reserved.
Identification and Functional Analysis of the Nocardithiocin Gene Cluster in Nocardia pseudobrasiliensis

PubMed Central

Sakai, Kanae; Komaki, Hisayuki; Gonoi, Tohru

2015-01-01

Nocardithiocin is a thiopeptide compound isolated from the opportunistic pathogen Nocardia pseudobrasiliensis. It shows strong activity against acid-fast bacteria and is also active against rifampicin-resistant Mycobacterium tuberculosis. Here, we report the identification of the nocardithiocin gene cluster in N. pseudobrasiliensis IFM 0761 based on conserved thiopeptide biosynthesis gene sequences and the whole genome sequence. The predicted gene cluster was confirmed by gene disruption and complementation. As expected, strains containing the disrupted gene did not produce nocardithiocin, while gene complementation restored nocardithiocin production in these strains. The predicted cluster was further analyzed using RNA-seq, which showed that the nocardithiocin gene cluster contains 12 genes within a 15.2-kb region. This finding will promote improvements in nocardithiocin productivity and in the production of its derivatives. PMID:26588225
Lampreys, the jawless vertebrates, contain only two ParaHox gene clusters.

PubMed

Zhang, Huixian; Ravi, Vydianathan; Tay, Boon-Hui; Tohari, Sumanty; Pillai, Nisha E; Prasad, Aravind; Lin, Qiang; Brenner, Sydney; Venkatesh, Byrappa

2017-08-22

ParaHox genes (Gsx, Pdx, and Cdx) are an ancient family of developmental genes closely related to the Hox genes. They play critical roles in the patterning of brain and gut. The basal chordate, amphioxus, contains a single ParaHox cluster comprising one member of each family, whereas nonteleost jawed vertebrates contain four ParaHox genomic loci with six or seven ParaHox genes. Teleosts, which have experienced an additional whole-genome duplication, contain six ParaHox genomic loci with six ParaHox genes. Jawless vertebrates, represented by lampreys and hagfish, are the most ancient group of vertebrates and are crucial for understanding the origin and evolution of vertebrate gene families. We have previously shown that lampreys contain six Hox gene loci. Here we report that lampreys contain only two ParaHox gene clusters (designated as α- and β-clusters) bearing five ParaHox genes (Gsxα, Pdxα, Cdxα, Gsxβ, and Cdxβ). The order and orientation of the three genes in the α-cluster are identical to that of the single cluster in amphioxus. However, the orientation of Gsxβ in the β-cluster is inverted. Interestingly, Gsxβ is expressed in the eye, unlike its homologs in jawed vertebrates, which are expressed mainly in the brain. The lamprey Pdxα is expressed in the pancreas similar to jawed vertebrate Pdx genes, indicating that the pancreatic expression of Pdx was acquired before the divergence of jawless and jawed vertebrate lineages. It is likely that the lamprey Pdxα plays a crucial role in pancreas specification and insulin production similar to the Pdx of jawed vertebrates.
Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants

DOE PAGES

Schläpfer, Pascal; Zhang, Peifen; Wang, Chuan; ...

2017-04-01

Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we will need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters.
Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schläpfer, Pascal; Zhang, Peifen; Wang, Chuan

Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we will need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters.
The influence of Adh function on ethanol preference and tolerance in adult Drosophila melanogaster.

PubMed

Ogueta, Maite; Cibik, Osman; Eltrop, Rouven; Schneider, Andrea; Scholz, Henrike

2010-11-01

Preference determines behavioral choices such as choosing among food sources and mates. One preference-affecting chemical is ethanol, which guides insects to fermenting fruits or leaves. Here, we show that adult Drosophila melanogaster prefer food containing up to 5% ethanol over food without ethanol and avoid food with high levels (23%) of ethanol. Although female and male flies behaved differently at ethanol-containing food sources, there was no sexual dimorphism in the preference for food containing modest ethanol levels. We also investigated whether Drosophila preference, sensitivity and tolerance to ethanol was related to the activity of alcohol dehydrogenase (Adh), the primary ethanol-metabolizing enzyme in D. melanogaster. Impaired Adh function reduced ethanol preference in both D. melanogaster and a related species, D. sechellia. Adh-impaired flies also displayed reduced aversion to high ethanol concentrations, increased sensitivity to the effects of ethanol on postural control, and negative tolerance/sensitization (i.e., a reduction of the increased resistance to ethanol's effects that normally occurs upon repeated exposure). These data strongly indicate a linkage between ethanol-induced behavior and ethanol metabolism in adult fruit flies: Adh deficiency resulted in reduced preference to low ethanol concentrations and reduced aversion to high ones, despite recovery from ethanol being strongly impaired.
Horizontal transfer of a large and highly toxic secondary metabolic gene cluster between fungi.

PubMed

Slot, Jason C; Rokas, Antonis

2011-01-25

Genes involved in intermediary and secondary metabolism in fungi are frequently physically linked or clustered. For example, in Aspergillus nidulans the entire pathway for the production of sterigmatocystin (ST), a highly toxic secondary metabolite and a precursor to the aflatoxins (AF), is located in a ∼54 kb, 23 gene cluster. We discovered that a complete ST gene cluster in Podospora anserina was horizontally transferred from Aspergillus. Phylogenetic analysis shows that most Podospora cluster genes are adjacent to or nested within Aspergillus cluster genes, although the two genera belong to different taxonomic classes. Furthermore, the Podospora cluster is highly conserved in content, sequence, and microsynteny with the Aspergillus ST/AF clusters and its intergenic regions contain 14 putative binding sites for AflR, the transcription factor required for activation of the ST/AF biosynthetic genes. Examination of ∼52,000 Podospora expressed sequence tags identified transcripts for 14 genes in the cluster, with several expressed at multiple life cycle stages. The presence of putative AflR-binding sites and the expression evidence for several cluster genes, coupled with the recent independent discovery of ST production in Podospora [1], suggest that this HGT event probably resulted in a functional cluster. Given the abundance of metabolic gene clusters in fungi, our finding that one of the largest known metabolic gene clusters moved intact between species suggests that such transfers might have significantly contributed to fungal metabolic diversity. Copyright © 2011 Elsevier Ltd. All rights reserved.
ADH1B*2 allele is protective against alcoholism but not chronic liver disease in the Hungarian population.

PubMed

Toth, Reka; Pocsai, Zsuzsa; Fiatal, Szilvia; Szeles, Gyorgy; Kardos, Laszlo; Petrovski, Beata; McKee, Martin; Adany, Roza

2010-05-01

Standardized death rates from chronic liver diseases (CLDs) in Hungary are much higher than the European Union average. Carrying the alcohol dehydrogenase 1B 48His allele (rs1229984 or ADH1B*2) could decrease the risk of alcoholism, but with persistent drinking may confer a greater risk of CLDs. The aim of this study was to assess the prevalence of this polymorphism in the Hungarian population and its association with alcohol consumption and with CLDs. A total of 278 cases with diagnosed CLDs and 752 controls without any alterations in liver function, all males aged 45-64, were screened for ADH1B Arg48His polymorphism. ADH1B*2 allele frequencies in controls and cases were 8.31% and 4.50%, respectively (χ² = 9.2; P = 0.01). Carrying the ADH1B*2 allele was associated with significantly lower odds ratio (OR) for drinking frequency (OR = 0.63; P = 0.003), the number of positive answers on CAGE (Cut-down, Annoyed, Guilt, Eye-opener) assessment (OR = 0.58; P = 0.005) and a positive CAGE status (OR = 0.55; P = 0.007). There was a significant association between ADH1B*2 and CLDs (OR = 0.50; P = 0.003), but it disappeared after adjusting for CAGE status and scores (OR = 0.67, P = 0.134; OR = 0.67, P = 0.148, respectively) and weakened after adjusting for drinking frequency (OR = 0.61; P = 0.045). Among heavy drinkers the presence of ADH1B*2 did not increase the risk of cirrhosis but there was a significant interaction between genotype and CAGE status (P = 0.003, P = 0.042), with ADH1B*2 conferring reduced risk of CLDs in CAGE negatives. In Hungarians, the ADH1B 48His allele reduces the risk of alcoholism, but not the risk of chronic liver disease among heavy drinkers.
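As a rough check on the direction and size of the association, a crude allelic odds ratio can be recomputed from the two allele frequencies reported in the abstract above. This is only an illustrative sketch: the function names are hypothetical, and the result is an unadjusted allelic odds ratio, which is not the same quantity as the carrier-based and adjusted odds ratios the authors report.

```python
def odds(p):
    """Convert a probability (here an allele frequency) into odds."""
    return p / (1.0 - p)


def allelic_odds_ratio(freq_cases, freq_controls):
    """Crude odds ratio of carrying the allele on a chromosome: cases vs. controls."""
    return odds(freq_cases) / odds(freq_controls)


# Allele frequencies taken from the abstract: 4.50% in cases, 8.31% in controls.
or_allelic = allelic_odds_ratio(0.0450, 0.0831)
print(round(or_allelic, 2))
```

A value below 1 means the allele is less common among cases than among controls, which matches the direction of the association described in the abstract.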
The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans.

PubMed

Gardiner, Donald M; Cozijnsen, Anton J; Wilson, Leanne M; Pedras, M Soledade C; Howlett, Barbara J

2004-09-01

Sirodesmin PL is a phytotoxin produced by the fungus Leptosphaeria maculans, which causes blackleg disease of canola (Brassica napus). This phytotoxin belongs to the epipolythiodioxopiperazine (ETP) class of toxins produced by fungi including mammalian and plant pathogens. We report the cloning of a cluster of genes with predicted roles in the biosynthesis of sirodesmin PL and show via gene disruption that one of these genes (encoding a two-module non-ribosomal peptide synthetase) is essential for sirodesmin PL biosynthesis. Of the nine genes in the cluster tested, all are co-regulated with the production of sirodesmin PL in culture. A similar cluster is present in the genome of the opportunistic human pathogen Aspergillus fumigatus and is most likely responsible for the production of gliotoxin, which is also an ETP. Homologues of the genes in the cluster were also identified in expressed sequence tags of the ETP producing fungus Chaetomium globosum. Two other fungi with publicly available genome sequences, Magnaporthe grisea and Fusarium graminearum, had similar gene clusters. A comparative analysis of all four clusters is presented. This is the first report of the genes responsible for the biosynthesis of an ETP. Copyright 2004 Blackwell Publishing Ltd
Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants.

PubMed

Schläpfer, Pascal; Zhang, Peifen; Wang, Chuan; Kim, Taehyong; Banf, Michael; Chae, Lee; Dreher, Kate; Chavali, Arvind K; Nilo-Poyanco, Ricardo; Bernard, Thomas; Kahn, Daniel; Rhee, Seung Y

2017-04-01

Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters. © 2017 American Society of Plant Biologists. All Rights Reserved.
Fungal secondary metabolites - strategies to activate silent gene clusters.

PubMed

Brakhage, Axel A; Schroeckh, Volker

2011-01-01

Filamentous fungi produce a multitude of low molecular weight bioactive compounds. The increasing number of fungal genome sequences impressively demonstrated that their biosynthetic potential is far from being exploited. In fungi, the genes required for the biosynthesis of a secondary metabolite are clustered. Many of these bioinformatically newly discovered secondary metabolism gene clusters are silent under standard laboratory conditions. Consequently, no product can be found. This review summarizes the current strategies that have been successfully applied during the last years to activate these silent gene clusters in filamentous fungi, especially in the genus Aspergillus. The techniques take advantage of genome mining, vary from the simple search for compounds with bioinformatically predicted physicochemical properties up to methods that exploit a probable interaction of microorganisms. Until now, the majority of successful approaches have been based on molecular biology like the generation of gene "knock outs", promoter exchange, overexpression of transcription factors or other pleiotropic regulators. Moreover, strategies based on epigenetics opened a new avenue for the elucidation of the regulation of secondary metabolite formation and will certainly continue to play a significant role for the elucidation of cryptic natural products. The conditions under which a given gene cluster is naturally expressed are largely unknown. One technique is to attempt to simulate the natural habitat by co-cultivation of microorganisms from the same ecosystem. This has already led to the activation of silent gene clusters and the identification of novel compounds in Aspergillus nidulans. These simulation strategies will help discover new natural products in the future, and may also provide fundamental new insights into microbial communication. Copyright © 2010 Elsevier Inc. All rights reserved.
The development of bactericidal yeast strains by expressing the Pediococcus acidilactici pediocin gene (pedA) in Saccharomyces cerevisiae.

PubMed

Schoeman, H; Vivier, M A; Du Toit, M; Dicks, L M; Pretorius, I S

1999-06-15

The excessive use of sulphur dioxide and other chemical preservatives in wine, beer and other fermented food and beverage products to prevent the growth of unwanted microbes holds various disadvantages for the quality of the end-products and is confronted by mounting consumer resistance. The objective of this study was to investigate the feasibility of controlling spoilage bacteria during yeast-based fermentations by engineering bactericidal strains of Saccharomyces cerevisiae. To test this novel concept, we have successfully expressed a bacteriocin gene in yeast. The pediocin operon of Pediococcus acidilactici PAC1.0 consists of four clustered genes, namely pedA (encoding a 62 amino acid precursor of the PA-1 pediocin), pedB (encoding an immunity factor), pedC (encoding a PA-1 transport protein) and pedD (encoding a protein involved in the transport and processing of PA-1). The pedA gene was inserted into a yeast expression/secretion cassette and introduced as a multicopy episomal plasmid into a laboratory strain (Y294) of S. cerevisiae. Northern blot analysis confirmed that the pedA structural gene in this construct (ADH1P-MFa1S-pedA-ADH1T, designated PED1), was efficiently expressed under the control of the yeast alcohol dehydrogenase I gene promoter (ADH1P) and terminator (ADH1T). Secretion of the PED1-encoded pediocin PA-1 was directed by the yeast mating pheromone alpha-factor's secretion signal (MFa1S). The presence of biologically active antimicrobial peptides produced by the yeast transformants was indicated by agar diffusion assays against sensitive indicator bacteria (e.g. Listeria monocytogenes B73). Protein analysis indicated the secreted heterologous peptide to be approximately 4.6 kDa, which conforms to the expected size. The heterologous peptide was present at relatively low levels in the yeast supernatant but pediocin activity was readily detected when intact yeast colonies were used in sensitive strain overlays. This study could lead to the
Bacillus cereus-type polyhydroxyalkanoate biosynthetic gene cluster contains R-specific enoyl-CoA hydratase gene.

PubMed

Kihara, Takahiro; Hiroe, Ayaka; Ishii-Hyakutake, Manami; Mizuno, Kouhei; Tsuge, Takeharu

2017-08-01

Bacillus cereus and Bacillus megaterium both accumulate polyhydroxyalkanoate (PHA) but their PHA biosynthetic gene (pha) clusters that code for proteins involved in PHA biosynthesis are different. Namely, a gene encoding MaoC-like protein exists in the B. cereus-type pha cluster but not in the B. megaterium-type pha cluster. MaoC-like protein has an R-specific enoyl-CoA hydratase (R-hydratase) activity and is referred to as PhaJ when involved in PHA metabolism. In this study, the pha cluster of B. cereus YB-4 was characterized in terms of PhaJ's function. In an in vitro assay, PhaJ from B. cereus YB-4 (PhaJYB4) exhibited hydration activity toward crotonyl-CoA. In an in vivo assay using Escherichia coli as a host for PHA accumulation, the recombinant strain expressing PhaJYB4 and PHA synthase led to increased PHA accumulation, suggesting that PhaJYB4 functioned as a monomer supplier. The monomer composition of the accumulated PHA reflected the substrate specificity of PhaJYB4, which appeared to prefer short chain-length substrates. The pha cluster from B. cereus YB-4 functioned to accumulate PHA in E. coli; however, it did not function when the phaJYB4 gene was deleted. The B. cereus-type pha cluster represents a new example of a pha cluster that contains the gene encoding PhaJ.
Angiotensin II inhibits ADH-stimulated cAMP: role on O2- and transport-related oxygen consumption in the loop of Henle.

PubMed

Silva, G B; Juncos, L I; Baigorria, S T; Garcia, N H

2013-01-01

Dehydration and acute reductions in blood pressure increase ADH and Ang II levels. These hormones increase transport along the distal nephron. In the thick ascending limb (TAL), ADH increases transport via cAMP, while Ang II acts via superoxide (O2-). However, the mechanism of interaction of these hormones in this segment remains unclear. The aim of this study was to explore ADH/Ang II interactions on TAL transport. For this, we measured the effects of ADH and Ang II, added sequentially to TAL suspensions from Wistar rats, on oxygen consumption (QO2, used as a transport index), cAMP and O2-. Basal QO2 was 112 ± 5 nmol O2/min/mg protein. Addition of ADH (1 nM) increased QO2 by 227 percent. In the presence of ADH, Ang II (1 nM) elicited a transient QO2 response. During an initial 3.1 ± 0.7 minutes after adding Ang II, QO2 decreased 58 percent (p < 0.03, initial vs. ADH) and then rose by 188 percent (p < 0.03, late vs. initial Ang II). We found that losartan blocked the initial effects of Ang II, and that Ang II blocked ADH- and forskolin-stimulated cAMP. The NOS inhibitor L-NAME and the AT2 receptor antagonist PD123319 showed no effect on transport-related oxygen consumption. Then, we assessed the late period after adding Ang II. The O2- scavenger tempol blocked the late Ang II effects on QO2, while Ang II increased O2- production during this period. We conclude that 1) Ang II has a transient effect on ADH-stimulated transport; 2) this effect is mediated by AT1 receptors; 3) the initial period is mediated by decreased cAMP and 4) the late period is mediated by O2-.

Hox cluster polarity in early transcriptional availability: a high order regulatory level of clustered Hox genes in the mouse.

PubMed

Roelen, Bernard A J; de Graaff, Wim; Forlani, Sylvie; Deschamps, Jacqueline

2002-11-01

The molecular mechanism underlying the 3' to 5' polarity of induction of mouse Hox genes is still elusive. While relief from a cluster-encompassing repression was shown to lead to all Hoxd genes being expressed like the 3'most of them, Hoxd1 (Kondo and Duboule, 1999), the molecular basis of initial activation of this 3'most gene is not yet understood. We show that, already before primitive streak formation, prior to initial expression of the first Hox gene, a dramatic transcriptional stimulation of the 3'most genes, Hoxb1 and Hoxb2, is observed upon a short pulse of exogenous retinoic acid (RA), whereas this is not the case for the more 5', cluster-internal, RA-responsive Hoxb genes. In contrast, the RA-responding Hoxb1lacZ transgene that faithfully mimics the endogenous gene (Marshall et al., 1994) did not exhibit the sensitivity of Hoxb1 to precocious activation. We conclude that polarity in initial activation of Hoxb genes reflects a greater availability of 3'Hox genes for transcription, suggesting a pre-existing (susceptibility to) opening of the chromatin structure at the 3' extremity of the cluster. We discuss the data in the context of prevailing models involving differential chromatin opening in the directionality of clustered Hox gene transcription, and regarding the importance of the cluster context for correct timing of initial Hox gene expression. Interestingly, Cdx1 manifested the same early transcriptional availability as Hoxb1. Copyright 2002 Elsevier Science Ireland Ltd.
A cross-species bi-clustering approach to identifying conserved co-regulated genes.

PubMed

Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

2016-06-15

A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regulated genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective in dealing with the noise in the expression data introduced by the complicated procedures used to quantify gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, making it difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes, which are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector, where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages in which the clustered genes are co-regulated. An efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on synthetic data and compared
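The idea of a joint sparse rank-one factorization described in the abstract above can be illustrated with a small sketch. The abstract does not spell out the optimization algorithm, so the alternating soft-thresholded least-squares updates below are a stand-in chosen for simplicity, not the authors' method; the function names, penalty values, and toy matrices are all hypothetical. Each species matrix X_s (genes by stages) is approximated by u v_s^T, where the gene vector u is shared across species and each v_s marks the stages for that species.

```python
import math


def soft(x, lam):
    """Soft-threshold: shrink x toward zero by lam, producing exact zeros."""
    return math.copysign(max(abs(x) - lam, 0.0), x)


def joint_rank_one(mats, lam_u=0.1, lam_v=0.5, n_iter=20):
    """Approximate every X_s by u * v_s^T with one shared, sparse gene vector u."""
    n_genes = len(mats[0])
    u = [1.0 / math.sqrt(n_genes)] * n_genes
    vs = []
    for _ in range(n_iter):
        # Per-species stage vectors: v_s = soft(X_s^T u), valid because ||u|| = 1.
        vs = [[soft(sum(X[g][t] * u[g] for g in range(n_genes)), lam_v)
               for t in range(len(X[0]))] for X in mats]
        denom = sum(v * v for vec in vs for v in vec)
        if denom == 0.0:
            break
        # Shared gene vector: pooled least-squares update over all species.
        u = [soft(sum(sum(X[g][t] * vec[t] for t in range(len(vec)))
                      for X, vec in zip(mats, vs)) / denom, lam_u)
             for g in range(n_genes)]
        norm = math.sqrt(sum(x * x for x in u))
        if norm == 0.0:
            break
        u = [x / norm for x in u]
    return u, vs


# Hypothetical toy data: genes 0 and 1 are co-expressed in both species,
# at stages 0-1 in species A and at stages 1-2 in species B.
X_A = [[2.0, 2.0, 0.1], [2.0, 2.0, 0.0], [0.1, 0.0, 0.1], [0.0, 0.1, 0.0]]
X_B = [[0.0, 3.0, 3.0], [0.1, 3.0, 3.0], [0.1, 0.0, 0.1], [0.0, 0.1, 0.0]]

u, (v_A, v_B) = joint_rank_one([X_A, X_B])
cluster_genes = [g for g, x in enumerate(u) if x != 0.0]
stages_A = [t for t, x in enumerate(v_A) if x != 0.0]
stages_B = [t for t, x in enumerate(v_B) if x != 0.0]
print(cluster_genes, stages_A, stages_B)
```

The nonzero entries of u pick out the gene cluster common to both species, while the nonzero entries of each v_s pick out the stages at which that cluster is active in that species. The sketch extracts a single bi-cluster and omits the denoising step mentioned in the abstract.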
Pichia stipitis genomics, transcriptomics, and gene clusters

Treesearch

Thomas W. Jeffries; Jennifer R. Headman Van Vleet

2009-01-01

Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...
Childhood adversity moderates the effect of ADH1B on risk for alcohol-related phenotypes in Jewish Israeli drinkers.

PubMed

Meyers, Jacquelyn L; Shmulewitz, Dvora; Wall, Melanie M; Keyes, Katherine M; Aharonovich, Efrat; Spivak, Baruch; Weizman, Abraham; Frisch, Amos; Edenberg, Howard J; Gelernter, Joel; Grant, Bridget F; Hasin, Deborah

2015-01-01

Childhood adversity and genetic variant ADH1B-rs1229984 have each been shown to influence heavy alcohol consumption and disorders. However, little is known about how these factors jointly influence these outcomes. We assessed the main and additive interactive effects of childhood adversity (abuse, neglect and parental divorce) and the ADH1B-rs1229984 on the quantitative phenotypes 'maximum drinks in a day' (Maxdrinks) and DSM-Alcohol Use Disorder (AUD) severity, adjusting for demographic variables, in an Israeli sample of adult household residents (n = 1143) evaluated between 2007 and 2009. Childhood adversity and absence of the protective ADH1B-rs1229984 A allele were associated with greater mean Maxdrinks (mean differences: 1.50; 1.13, respectively) and AUD severity (mean ratios: 0.71; 0.27, respectively). In addition, childhood adversity moderated the ADH1B-rs1229984 effect on Maxdrinks (P < 0.01) and AUD severity (P < 0.05), in that there was a stronger effect of ADH1B-rs1229984 genotype on Maxdrinks and AUD severity among those who had experienced childhood adversity compared with those who had not. ADH1B-rs1229984 impacts alcohol metabolism. Therefore, among those at risk for greater consumption, e.g. those who experienced childhood adversity, ADH1B-rs1229984 appears to have a stronger effect on alcohol consumption and consequently on risk for AUD symptom severity. Evidence for the interaction of genetic vulnerability and early life adversity on alcohol-related phenotypes provides further insight into the complex relationships between genetic and environmental risk factors. © 2013 Society for the Study of Addiction.
Clustering change patterns using Fourier transformation with time-course gene expression data.

PubMed

Kim, Jaehee

2011-01-01

To understand the behavior of genes, it is important to explore how the patterns of gene expression change over a period of time because biologically related gene groups can share the same change patterns. In this study, the problem of finding similar change patterns is reduced to clustering on the derivative Fourier coefficients. This work is aimed at discovering gene groups with similar change patterns which share similar biological properties. We developed a statistical model using derivative Fourier coefficients to identify similar change patterns of gene expression. We used a model-based method to cluster the Fourier series estimation of derivatives. We applied our model to cluster change patterns of yeast cell cycle microarray expression data with alpha-factor synchronization. The results showed that, as the method clusters with the probability-neighboring data, the model-based clustering with our proposed model yielded biologically interpretable results. We expect that our proposed Fourier analysis with suitably chosen smoothing parameters could serve as a useful tool in classifying genes and interpreting possible biological change patterns.
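A minimal sketch may help show why derivative Fourier coefficients capture change patterns rather than expression levels. It assumes equally spaced samples over one full period and uses plain Euclidean distance between coefficient vectors in place of the model-based clustering used in the study; the function names and toy profiles are hypothetical. For a term a_k cos(wt) + b_k sin(wt), the derivative is (w b_k) cos(wt) - (w a_k) sin(wt), so the derivative coefficients follow directly from the fitted ones and the constant term drops out.

```python
import math


def fourier_coeffs(y, n_harmonics):
    """Estimate (a_k, b_k), k = 1..K, from equally spaced samples over one period."""
    n = len(y)
    coeffs = []
    for k in range(1, n_harmonics + 1):
        a = 2.0 / n * sum(v * math.cos(2 * math.pi * k * i / n) for i, v in enumerate(y))
        b = 2.0 / n * sum(v * math.sin(2 * math.pi * k * i / n) for i, v in enumerate(y))
        coeffs.append((a, b))
    return coeffs


def derivative_coeffs(coeffs, period=1.0):
    """Coefficients of the derivative series: cosine part w*b_k, sine part -w*a_k."""
    out = []
    for k, (a, b) in enumerate(coeffs, start=1):
        w = 2 * math.pi * k / period
        out.extend([w * b, -w * a])
    return out


def dist(p, q):
    """Euclidean distance between two coefficient vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)))


# Hypothetical expression profiles sampled at 16 time points.
n = 16
g1 = [math.sin(2 * math.pi * i / n) for i in range(n)]
g2 = [v + 5.0 for v in g1]                               # same shape, higher baseline
g3 = [math.cos(2 * math.pi * i / n) for i in range(n)]   # shifted phase

d1, d2, d3 = (derivative_coeffs(fourier_coeffs(g, 2)) for g in (g1, g2, g3))
print(dist(d1, d2), dist(d1, d3))
```

Profiles g1 and g2 differ only in baseline, so their derivative coefficients coincide up to rounding, while g3 changes at different times and ends up far from both.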
Role of cardiac volume receptors in the control of ADH release during acute simulated weightlessness in man

NASA Technical Reports Server (NTRS)

Convertino, V. A.; Benjamin, B. A.; Keil, L. C.; Sandler, H.

1984-01-01

Hemodynamic responses and antidiuretic hormone (ADH) were measured during body position changes, designed to induce central blood volume shifts in ten cardiac and one heart-lung transplant recipients, to assess the contribution of cardiac volume receptors in the control of ADH release during the initial acute phase of exposure to weightlessness. Each subject underwent 15 min of a sitting-control period (C) followed by 30 min of 6° head-down tilt (T) and 30 min of resumed sitting (S). Venous blood samples and cardiac dimensions were taken at 0 and 15 min of C; 5, 15, and 30 min of T; and 5, 15, and 30 min of S. Blood samples were analyzed for hematocrit, plasma osmolality, plasma renin activity (PRA), and ADH. Heart rate and blood pressure were recorded every 2 min. Plasma osmolality was not altered by posture changes. Mean left ventricular end-diastolic volume increased (P < 0.05) from 90 ml in C to 106 ml in T and returned to 87 ml in S. Plasma ADH was reduced by 20 percent (P < 0.05) with T, and returned to control levels with S. These responses were similar in six normal cardiac-innervated control subjects. These data may suggest that cardiac volume receptors are not the primary mechanism for the control of ADH release during acute central volume shifts in man.
An ergot alkaloid biosynthesis gene and clustered hypothetical genes from Aspergillus fumigatus.

PubMed

Coyle, Christine M; Panaccione, Daniel G

2005-06-01

The ergot alkaloids are a family of indole-derived mycotoxins with a variety of significant biological activities. Aspergillus fumigatus, a common airborne fungus and opportunistic human pathogen, and several fungi in the relatively distant taxon Clavicipitaceae (clavicipitaceous fungi) produce different sets of ergot alkaloids. The ergot alkaloids of these divergent fungi share a four-member ergoline ring but differ in the number, type, and position of the side chains. Several genes required for ergot alkaloid production are known in the clavicipitaceous fungi, and these genes are clustered in the genome of the ergot fungus Claviceps purpurea. We investigated whether the ergot alkaloids of A. fumigatus have a common biosynthetic and genetic origin with those of the clavicipitaceous fungi. A homolog of dmaW, the gene controlling the determinant step in the ergot alkaloid pathway of clavicipitaceous fungi, was identified in the A. fumigatus genome. Knockout of dmaW eliminated all known ergot alkaloids from A. fumigatus, and complementation of the mutation restored ergot alkaloid production. Clustered with dmaW in the A. fumigatus genome are sequences corresponding to five genes previously proposed to encode steps in the ergot alkaloid pathway of C. purpurea, as well as additional sequences whose deduced protein products are consistent with their involvement in the ergot alkaloid pathway. The corresponding genes have similarities in their nucleotide sequences, but the orientations and positions within the cluster of several of these genes differ. The data indicate that the ergot alkaloid biosynthetic capabilities in A. fumigatus and the clavicipitaceous fungi had a common origin.
An Ergot Alkaloid Biosynthesis Gene and Clustered Hypothetical Genes from Aspergillus fumigatus

PubMed Central

Coyle, Christine M.; Panaccione, Daniel G.

2005-01-01

The ergot alkaloids are a family of indole-derived mycotoxins with a variety of significant biological activities. Aspergillus fumigatus, a common airborne fungus and opportunistic human pathogen, and several fungi in the relatively distant taxon Clavicipitaceae (clavicipitaceous fungi) produce different sets of ergot alkaloids. The ergot alkaloids of these divergent fungi share a four-member ergoline ring but differ in the number, type, and position of the side chains. Several genes required for ergot alkaloid production are known in the clavicipitaceous fungi, and these genes are clustered in the genome of the ergot fungus Claviceps purpurea. We investigated whether the ergot alkaloids of A. fumigatus have a common biosynthetic and genetic origin with those of the clavicipitaceous fungi. A homolog of dmaW, the gene controlling the determinant step in the ergot alkaloid pathway of clavicipitaceous fungi, was identified in the A. fumigatus genome. Knockout of dmaW eliminated all known ergot alkaloids from A. fumigatus, and complementation of the mutation restored ergot alkaloid production. Clustered with dmaW in the A. fumigatus genome are sequences corresponding to five genes previously proposed to encode steps in the ergot alkaloid pathway of C. purpurea, as well as additional sequences whose deduced protein products are consistent with their involvement in the ergot alkaloid pathway. The corresponding genes have similarities in their nucleotide sequences, but the orientations and positions within the cluster of several of these genes differ. The data indicate that the ergot alkaloid biosynthetic capabilities in A. fumigatus and the clavicipitaceous fungi had a common origin. PMID:15933009
Comparison of two schemes for automatic keyword extraction from MEDLINE for functional gene clustering.

PubMed

Liu, Ying; Ciliax, Brian J; Borges, Karin; Dasigi, Venu; Ram, Ashwin; Navathe, Shamkant B; Dingledine, Ray

2004-01-01

One of the key challenges of microarray studies is to derive biological insights from the unprecedented quantities of data on gene-expression patterns. Clustering genes by functional keyword association can provide direct information about the nature of the functional links among genes within the derived clusters. However, the quality of the keyword lists extracted from biomedical literature for each gene significantly affects the clustering results. We extracted keywords from MEDLINE that describe the most prominent functions of the genes, and used the resulting weights of the keywords as feature vectors for gene clustering. By analyzing the resulting cluster quality, we compared two keyword weighting schemes: normalized z-score and term frequency-inverse document frequency (TFIDF). The best combination of background comparison set, stop list and stemming algorithm was selected based on precision and recall metrics. In a test set of four known gene groups, a hierarchical algorithm correctly assigned 25 of 26 genes to the appropriate clusters based on keywords extracted by the TFIDF weighting scheme, but only 23 of 26 with the z-score method. To evaluate the effectiveness of the weighting schemes for keyword extraction for gene clusters from microarray profiles, 44 yeast genes that are differentially expressed during the cell cycle were used as a second test set. Using established measures of cluster quality, the results produced from TFIDF-weighted keywords had higher purity, lower entropy, and higher mutual information than those produced from normalized z-score weighted keywords. The optimized algorithms should be useful for sorting genes from microarray lists into functionally discrete clusters.
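The TFIDF weighting scheme named in the abstract above can be sketched in a few lines. This is a generic textbook form of TF-IDF (relative term frequency times the log of inverse document frequency), not necessarily the exact variant, stop list, or stemming used in the study; the gene names and keyword lists are made up for illustration. Each gene is treated as one "document" made of the keywords extracted for it, and the resulting weight vectors are compared with cosine similarity.

```python
import math
from collections import Counter


def tfidf_weights(keywords_by_gene):
    """Return {gene: {keyword: weight}} with weight = tf * log(N / df)."""
    n_genes = len(keywords_by_gene)
    # Document frequency: number of genes whose keyword list contains the term.
    df = Counter()
    for words in keywords_by_gene.values():
        df.update(set(words))
    weights = {}
    for gene, words in keywords_by_gene.items():
        tf = Counter(words)
        total = len(words)
        weights[gene] = {w: (c / total) * math.log(n_genes / df[w])
                         for w, c in tf.items()}
    return weights


def cosine(u, v):
    """Cosine similarity between two sparse keyword-weight vectors."""
    dot = sum(u[k] * v.get(k, 0.0) for k in u)
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


# Hypothetical keyword lists for three genes.
toy = {
    "geneA": ["kinase", "cell", "cycle", "kinase"],
    "geneB": ["kinase", "cell", "mitosis"],
    "geneC": ["transport", "cell", "membrane"],
}
w = tfidf_weights(toy)
print(cosine(w["geneA"], w["geneB"]), cosine(w["geneA"], w["geneC"]))
```

The keyword "cell" occurs for every gene and therefore receives zero weight, so geneA and geneC share no informative keyword, whereas geneA and geneB remain similar through "kinase". Vectors like these could then be passed to a hierarchical clustering routine.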
GraphTeams: a method for discovering spatial gene clusters in Hi-C sequencing data.

PubMed

Schulz, Tizian; Stoye, Jens; Doerr, Daniel

2018-05-08

Hi-C sequencing offers novel, cost-effective means to study the spatial conformation of chromosomes. We use data obtained from Hi-C experiments to provide new evidence for the existence of spatial gene clusters. These are sets of genes with associated functionality that exhibit close proximity to each other in the spatial conformation of chromosomes across several related species. We present the first gene cluster model capable of handling spatial data. Our model generalizes a popular computational model for gene cluster prediction, called δ-teams, from sequences to graphs. Following previous lines of research, we subsequently extend our model to allow for several vertices being associated with the same label. The model, called δ-teams with families, is particularly suitable for our application as it enables handling of gene duplicates. We develop algorithmic solutions for both models. We implemented the algorithm for discovering δ-teams with families and integrated it into a fully automated workflow for discovering gene clusters in Hi-C data, called GraphTeams. We applied it to human and mouse data to find intra- and interchromosomal gene cluster candidates. The results include intrachromosomal clusters that seem to exhibit a closer proximity in space than on their chromosomal DNA sequence. We further discovered interchromosomal gene clusters that contain genes from different chromosomes within the human genome, but are located on a single chromosome in mouse. By identifying δ-teams with families, we provide a flexible model to discover gene cluster candidates in Hi-C data. Our analysis of Hi-C data from human and mouse reveals several known gene clusters (thus validating our approach), but also a few sparsely studied or possibly unknown gene cluster candidates that could be the source of further experimental investigations.
Clusters of antibiotic resistance genes enriched together stay together in swine agriculture

DOE PAGES

Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong; ...

2016-04-12

Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance
Clusters of antibiotic resistance genes enriched together stay together in swine agriculture

DOE Office of Scientific and Technical Information (OSTI.GOV)

Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong

Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance
Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

PubMed

Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

2016-04-12

Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if
Is ADH1C genotype relevant for the cardioprotective effect of alcohol?

PubMed

Høiseth, Gudrun; Magnus, Per; Knudsen, Gun Peggy; Jansen, Mona Dverdal; Næss, Oyvind; Tambs, Kristian; Mørland, Jørg

2013-03-01

The cardioprotective effect of ethanol has been suggested to be linked to one of the ethanol metabolizing enzymes (ADH1C), which occurs as a high V(max) and a low V(max) variant. This has been demonstrated in some studies, while others have not been able to replicate the findings. The aim of the present study was to investigate the relation between the different ADH1C genotypes, death from coronary heart disease (CHD) and alcohol in a sample larger than those of the previously published studies. Eight hundred CHD deaths as well as 1303 controls were genotyped for the high V(max) (γ1) and the low V(max) (γ2) ADH1C variant. Information on alcohol use was available for all subjects. Multiple logistic regression analysis was used to study whether the decreased risk of death from CHD in alcohol consuming subjects was more pronounced in subjects homozygous for the γ2 allele (γ2γ2 subjects) compared to γ1γ1 and γ1γ2 subjects. The odds ratio (OR) for death from CHD in alcohol consumers compared to abstainers was similar in the genotype groups, i.e., 0.62 (95% CI: 0.43-0.88) in γ1γ1 subjects and 0.62 (95% CI: 0.42-0.91) in γ2γ2 subjects. Also when stratifying the results by gender and when dividing alcohol consumers into different alcohol consumption groups, there was no difference in the OR between the different genotype groups. This study, which included the largest study group published so far, failed to find any link between the ADH1C genotype and the cardioprotective effects of alcohol. Copyright © 2013 Elsevier Inc. All rights reserved.
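To make the reported statistic concrete, the following is a minimal sketch of how a crude odds ratio and its Wald 95% confidence interval can be computed from a 2x2 table. It is a simplification: the study above used multiple logistic regression, which adjusts for covariates, and the abstract gives no cell counts. The function name `odds_ratio_ci` and all counts in the example are hypothetical.

```python
from math import exp, log, sqrt


def odds_ratio_ci(exposed_cases, unexposed_cases,
                  exposed_controls, unexposed_controls, z=1.96):
    """Crude odds ratio with a Wald confidence interval (z=1.96 gives ~95%).

    Hypothetical helper for illustration; not the adjusted logistic
    regression model used in the study.
    """
    cells = (exposed_cases, unexposed_cases, exposed_controls, unexposed_controls)
    if min(cells) <= 0:
        raise ValueError("all four cell counts must be positive")
    # Odds of exposure among cases divided by odds of exposure among controls.
    odds_ratio = (exposed_cases * unexposed_controls) / (unexposed_cases * exposed_controls)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = sqrt(sum(1.0 / c for c in cells))
    lower = exp(log(odds_ratio) - z * se_log_or)
    upper = exp(log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: alcohol consumers vs. abstainers among CHD deaths and controls.
or_value, ci_low, ci_high = odds_ratio_ci(100, 50, 200, 60)
print(f"OR = {or_value:.2f} (95% CI: {ci_low:.2f}-{ci_high:.2f})")
```

An interval that lies entirely below 1, as in both genotype groups reported above, indicates lower odds of CHD death among consumers than among abstainers; comparing the intervals across genotype groups is what shows whether genotype modifies that association.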
Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

PubMed Central

2010-01-01

Background: Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involves decisions on how to handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Results: We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieve the highest adjusted Rand index. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is
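The evaluation criterion named above, the adjusted Rand index, can be sketched in a few lines. This is an illustrative standard-library implementation of the usual Hubert-Arabie formula, not the authors' code; the function name and the toy labels are assumptions made for the example.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two partitions of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("label lists must have the same length")
    n = len(labels_true)
    if n < 2:
        # No pairs to compare; treat the partitions as trivially identical.
        return 1.0
    # Contingency counts: items in (known class i, predicted cluster j).
    cell_counts = Counter(zip(labels_true, labels_pred))
    true_counts = Counter(labels_true)
    pred_counts = Counter(labels_pred)

    # Number of item pairs placed together in both, in truth, and in prediction.
    together_both = sum(comb(c, 2) for c in cell_counts.values())
    together_true = sum(comb(c, 2) for c in true_counts.values())
    together_pred = sum(comb(c, 2) for c in pred_counts.values())

    expected = together_true * together_pred / comb(n, 2)
    maximum = (together_true + together_pred) / 2
    if maximum == expected:
        # Degenerate case, e.g. both partitions consist of a single cluster.
        return 1.0
    return (together_both - expected) / (maximum - expected)


# Toy example: six samples of two known types, one sample mis-clustered.
known_classes = ["A", "A", "A", "B", "B", "B"]
clusters = [0, 0, 1, 1, 1, 1]
print(round(adjusted_rand_index(known_classes, clusters), 4))
```

The index is 1 for identical partitions regardless of how the clusters are labeled and is close to 0 for chance-level agreement, which is what makes it suitable for comparing many clustering pipelines against known classes.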
Chromatin organization and global regulation of Hox gene clusters

PubMed Central

Montavon, Thomas; Duboule, Denis

2013-01-01

During development, a properly coordinated expression of Hox genes within their different genomic clusters is critical for patterning the body plans of many animals with a bilateral symmetry. The fascinating correspondence between the topological organization of Hox clusters and their transcriptional activation in space and time has served as a paradigm for understanding the relationships between genome structure and function. Here, we review some recent observations, which revealed highly dynamic changes in the structure of chromatin at Hox clusters, in parallel with their activation during embryonic development. We discuss the relevance of these findings for our understanding of large-scale gene regulation. PMID:23650639
Single gene insertion drives bioalcohol production by a thermophilic archaeon

PubMed Central

Basen, Mirko; Schut, Gerrit J.; Nguyen, Diep M.; Lipscomb, Gina L.; Benn, Robert A.; Prybol, Cameron J.; Vaccaro, Brian J.; Poole, Farris L.; Kelly, Robert M.; Adams, Michael W. W.

2014-01-01

Bioethanol production is achieved by only two metabolic pathways and only at moderate temperatures. Herein a fundamentally different synthetic pathway for bioalcohol production at 70 °C was constructed by insertion of the gene for bacterial alcohol dehydrogenase (AdhA) into the archaeon Pyrococcus furiosus. The engineered strain converted glucose to ethanol via acetate and acetaldehyde, catalyzed by the host-encoded aldehyde ferredoxin oxidoreductase (AOR) and heterologously expressed AdhA, in an energy-conserving, redox-balanced pathway. Furthermore, the AOR/AdhA pathway also converted exogenously added aliphatic and aromatic carboxylic acids to the corresponding alcohol using glucose, pyruvate, and/or hydrogen as the source of reductant. By heterologous coexpression of a membrane-bound carbon monoxide dehydrogenase, CO was used as a reductant for converting carboxylic acids to alcohols. Redirecting the fermentative metabolism of P. furiosus through strategic insertion of foreign genes creates unprecedented opportunities for thermophilic bioalcohol production. Moreover, the AOR/AdhA pathway is a potentially game-changing strategy for syngas fermentation, especially in combination with carbon chain elongation pathways. PMID:25368184
Single gene insertion drives bioalcohol production by a thermophilic archaeon

DOE Office of Scientific and Technical Information (OSTI.GOV)

Basen, M; Schut, GJ; Nguyen, DM

2014-12-09

Bioethanol production is achieved by only two metabolic pathways and only at moderate temperatures. Herein a fundamentally different synthetic pathway for bioalcohol production at 70 degrees C was constructed by insertion of the gene for bacterial alcohol dehydrogenase (AdhA) into the archaeon Pyrococcus furiosus. The engineered strain converted glucose to ethanol via acetate and acetaldehyde, catalyzed by the host-encoded aldehyde ferredoxin oxidoreductase (AOR) and heterologously expressed AdhA, in an energy-conserving, redox-balanced pathway. Furthermore, the AOR/AdhA pathway also converted exogenously added aliphatic and aromatic carboxylic acids to the corresponding alcohol using glucose, pyruvate, and/or hydrogen as the source of reductant. By heterologous coexpression of a membrane-bound carbon monoxide dehydrogenase, CO was used as a reductant for converting carboxylic acids to alcohols. Redirecting the fermentative metabolism of P. furiosus through strategic insertion of foreign genes creates unprecedented opportunities for thermophilic bioalcohol production. Moreover, the AOR/AdhA pathway is a potentially game-changing strategy for syngas fermentation, especially in combination with carbon chain elongation pathways.
Fragmentation of an aflatoxin-like gene cluster in a forest pathogen

USDA-ARS's Scientific Manuscript database

Secondary metabolic pathway genes are typically clustered in fungi. An exception to this paradigm is seen for genes required for the production of dothistromin, an aflatoxin-like virulence factor produced by the pine needle pathogen Dothistroma septosporum. In contrast to the tight clustering of gen...
Molecular Population Genetics of the Alcohol Dehydrogenase Gene Region of DROSOPHILA MELANOGASTER

PubMed Central

Aquadro, Charles F.; Desse, Susan F.; Bland, Molly M.; Langley, Charles H.; Laurie-Ahlberg, Cathy C.

1986-01-01

Variation in the DNA restriction map of a 13-kb region of chromosome II including the alcohol dehydrogenase structural gene (Adh) was examined in Drosophila melanogaster from natural populations. Detailed analysis of 48 D. melanogaster lines representing four eastern United States populations revealed extensive DNA sequence variation due to base substitutions, insertions and deletions. Cloning of this region from several lines allowed characterization of length variation as due to unique sequence insertions or deletions [nine sizes; 21–200 base pairs (bp)] or transposable element insertions (several sizes, 340 bp to 10.2 kb, representing four different elements). Despite this extensive variation in sequences flanking the Adh gene, only one length polymorphism is clearly associated with altered Adh expression (a copia element approximately 250 bp 5' to the distal transcript start site). Nonetheless, the frequency spectra of transposable elements within and between Drosophila species suggest they are slightly deleterious. Strong nonrandom associations are observed among Adh region sequence variants, ADH allozyme (Fast vs. Slow), ADH enzyme activity and the chromosome inversion In(2L)t. Phylogenetic analysis of restriction map haplotypes suggests that the major twofold component of ADH activity variation (high vs. low, typical of Fast and Slow allozymes, respectively) is due to sequence variation tightly linked to and possibly distinct from that underlying the allozyme difference. The patterns of nucleotide and haplotype variation for Fast and Slow allozyme lines are consistent with the recent increase in frequency and spread of the Fast haplotype associated with high ADH activity. These data emphasize the important role of evolutionary history and strong nonrandom associations among tightly linked sequence variation as determinants of the patterns of variation observed in natural populations. PMID:3026893

Predictive factors of medication adherence in patients with chronic heart failure: the Moroccan experience

PubMed Central

Ragbaoui, Yassine; Nouamou, Imad; Hammiri, Ayoub El; Habbal, Rachida

2017-01-01

Medication adherence in patients with chronic heart failure is recognized as one of the major problems in the management of this condition. The demographic and socioeconomic situation of African countries may affect adherence to treatment for chronic heart failure. We conducted a cross-sectional study from September 2014 to January 2015 of patients with chronic heart failure followed at the heart failure center of the cardiology department of the IBN ROCHD University Hospital Center in Casablanca, Morocco. Medication adherence was measured with a questionnaire, the CARDIA questionnaire. Information on the predictive factors of medication adherence was derived from the multidimensional adherence model. We included 147 patients with chronic heart failure in this study. The rate of medication adherence was 83.6% according to the CARDIA-Questionary. The predictive factors that significantly influenced medication adherence were depression (p=0.034), the level of social support (p=0.03) and whether patients took their medication themselves (p=0.0001). As in many regions of the world, medication adherence in patients with chronic heart failure remains a health problem in Morocco. Strategies that act on the predictive factors could improve medication adherence. PMID:28533838
High-throughput platform for the discovery of elicitors of silent bacterial gene clusters.

PubMed

Seyedsayamdost, Mohammad R

2014-05-20

Over the past decade, bacterial genome sequences have revealed an immense reservoir of biosynthetic gene clusters, sets of contiguous genes that have the potential to produce drugs or drug-like molecules. However, the majority of these gene clusters appear to be inactive for unknown reasons prompting terms such as "cryptic" or "silent" to describe them. Because natural products have been a major source of therapeutic molecules, methods that rationally activate these silent clusters would have a profound impact on drug discovery. Herein, a new strategy is outlined for awakening silent gene clusters using small molecule elicitors. In this method, a genetic reporter construct affords a facile read-out for activation of the silent cluster of interest, while high-throughput screening of small molecule libraries provides potential inducers. This approach was applied to two cryptic gene clusters in the pathogenic model Burkholderia thailandensis. The results not only demonstrate a prominent activation of these two clusters, but also reveal that the majority of elicitors are themselves antibiotics, most in common clinical use. Antibiotics, which kill B. thailandensis at high concentrations, act as inducers of secondary metabolism at low concentrations. One of these antibiotics, trimethoprim, served as a global activator of secondary metabolism by inducing at least five biosynthetic pathways. Further application of this strategy promises to uncover the regulatory networks that activate silent gene clusters while at the same time providing access to the vast array of cryptic molecules found in bacteria.
Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants

PubMed Central

Zhang, Peifen; Kim, Taehyong; Banf, Michael; Chavali, Arvind K.; Nilo-Poyanco, Ricardo; Bernard, Thomas

2017-01-01

Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters. PMID:28228535
Conservation of gene linkage in dispersed vertebrate NK homeobox clusters.

PubMed

Wotton, Karl R; Weierud, Frida K; Juárez-Morales, José L; Alvares, Lúcia E; Dietrich, Susanne; Lewis, Katharine E

2009-10-01

Nk homeobox genes are important regulators of many different developmental processes including muscle, heart, central nervous system and sensory organ development. They are thought to have arisen as part of the ANTP megacluster, which also gave rise to Hox and ParaHox genes, and at least some NK genes remain tightly linked in all animals examined so far. The protostome-deuterostome ancestor probably contained a cluster of nine Nk genes: (Msx)-(Nk4/tinman)-(Nk3/bagpipe)-(Lbx/ladybird)-(Tlx/c15)-(Nk7)-(Nk6/hgtx)-(Nk1/slouch)-(Nk5/Hmx). Of these genes, only NKX2.6-NKX3.1, LBX1-TLX1 and LBX2-TLX2 remain tightly linked in humans. However, it is currently unclear whether this is unique to the human genome as we do not know which of these Nk genes are clustered in other vertebrates. This makes it difficult to assess whether the remaining linkages are due to selective pressures or because chance rearrangements have "missed" certain genes. In this paper, we identify all of the paralogs of these ancestrally clustered NK genes in several distinct vertebrates. We demonstrate that tight linkages of Lbx1-Tlx1, Lbx2-Tlx2 and Nkx3.1-Nkx2.6 have been widely maintained in both the ray-finned and lobe-finned fish lineages. Moreover, the recently duplicated Hmx2-Hmx3 genes are also tightly linked. Finally, we show that Lbx1-Tlx1 and Hmx2-Hmx3 are flanked by highly conserved noncoding elements, suggesting that shared regulatory regions may have resulted in evolutionary pressure to maintain these linkages. Consistent with this, these pairs of genes have overlapping expression domains. In contrast, Lbx2-Tlx2 and Nkx3.1-Nkx2.6, which do not seem to be coexpressed, are also not associated with conserved noncoding sequences, suggesting that an alternative mechanism may be responsible for the continued clustering of these genes.
A Cluster of Cuticle Protein Genes of Drosophila Melanogaster at 65a: Sequence, Structure and Evolution

PubMed Central

Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.

1997-01-01

A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064
Modularity of Plant Metabolic Gene Clusters: A Trio of Linked Genes That Are Collectively Required for Acylation of Triterpenes in Oat

PubMed Central

Mugford, Sam T.; Louveau, Thomas; Melton, Rachel; Qi, Xiaoquan; Bakht, Saleha; Hill, Lionel; Tsurushima, Tetsu; Honkanen, Suvi; Rosser, Susan J.; Lomonossoff, George P.; Osbourn, Anne

2013-01-01

Operon-like gene clusters are an emerging phenomenon in the field of plant natural products. The genes encoding some of the best-characterized plant secondary metabolite biosynthetic pathways are scattered across plant genomes. However, an increasing number of gene clusters encoding the synthesis of diverse natural products have recently been reported in plant genomes. These clusters have arisen through the neo-functionalization and relocation of existing genes within the genome, and not by horizontal gene transfer from microbes. The reasons for clustering are not yet clear, although this form of gene organization is likely to facilitate co-inheritance and co-regulation. Oats (Avena spp) synthesize antimicrobial triterpenoids (avenacins) that provide protection against disease. The synthesis of these compounds is encoded by a gene cluster. Here we show that a module of three adjacent genes within the wider biosynthetic gene cluster is required for avenacin acylation. Through the characterization of these genes and their encoded proteins we present a model of the subcellular organization of triterpenoid biosynthesis. PMID:23532069
Accurate prediction of secondary metabolite gene clusters in filamentous fungi.

PubMed

Andersen, Mikael R; Nielsen, Jakob B; Klitgaard, Andreas; Petersen, Lene M; Zachariasen, Mia; Hansen, Tilde J; Blicher, Lene H; Gotfredsen, Charlotte H; Larsen, Thomas O; Nielsen, Kristian F; Mortensen, Uffe H

2013-01-02

Biosynthetic pathways of secondary metabolites from fungi are currently subject to an intense effort to elucidate the genetic basis for these compounds due to their large potential within pharmaceutics and synthetic biochemistry. The preferred method is methodical gene deletions to identify supporting enzymes for key synthases one cluster at a time. In this study, we design and apply a DNA expression array for Aspergillus nidulans in combination with legacy data to form a comprehensive gene expression compendium. We apply a guilt-by-association-based analysis to predict the extent of the biosynthetic clusters for the 58 synthases active in our set of experimental conditions. A comparison with legacy data shows the method to be accurate in 13 of 16 known clusters and nearly accurate for the remaining 3 clusters. Furthermore, we apply a data clustering approach, which identifies cross-chemistry between physically separate gene clusters (superclusters), and validate this both with legacy data and experimentally by prediction and verification of a supercluster consisting of the synthase AN1242 and the prenyltransferase AN11080, as well as identification of the product compound nidulanin A. We have used A. nidulans for our method development and validation due to the wealth of available biochemical data, but the method can be applied to any fungus with a sequenced and assembled genome, thus supporting further secondary metabolite pathway elucidation in the fungal kingdom.
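The guilt-by-association idea described above can be illustrated with a deliberately simplified sketch: starting from a key synthase gene, neighboring genes are added to the predicted cluster for as long as their expression profiles across conditions correlate strongly with the synthase. The scoring rule, the 0.8 threshold, the function names and the gene identifiers are all assumptions for illustration and are not the authors' published method.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def predict_cluster_extent(gene_order, expression, seed, threshold=0.8):
    """Extend outward from the seed gene while neighbors stay co-expressed with it.

    gene_order: gene names in chromosomal order (hypothetical identifiers).
    expression: dict mapping gene name to expression values across conditions.
    """
    seed_index = gene_order.index(seed)
    seed_profile = expression[seed]

    def coexpressed(index):
        return pearson(seed_profile, expression[gene_order[index]]) >= threshold

    left = seed_index
    while left > 0 and coexpressed(left - 1):
        left -= 1
    right = seed_index
    while right < len(gene_order) - 1 and coexpressed(right + 1):
        right += 1
    return gene_order[left:right + 1]


# Toy data: five adjacent genes measured under four conditions.
order = ["geneA", "geneB", "synthase", "geneD", "geneE"]
profiles = {
    "geneA": [4.0, 3.0, 2.0, 1.0],      # anti-correlated with the synthase
    "geneB": [2.0, 4.0, 6.0, 8.5],      # co-expressed
    "synthase": [1.0, 2.0, 3.0, 4.0],
    "geneD": [1.1, 2.1, 2.9, 4.2],      # co-expressed
    "geneE": [2.0, 2.0, 2.0, 2.0],      # constant, uninformative
}
print(predict_cluster_extent(order, profiles, "synthase"))
```

Extending only through contiguous neighbors reflects the physical clustering of biosynthetic genes; detecting co-regulated but physically separate clusters, such as the superclusters mentioned above, would instead require comparing profiles across the whole genome.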
Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

DOE Office of Scientific and Technical Information (OSTI.GOV)

Data Analysis and Visualization; International Research Training Group "Visualization of Large and Unstructured Data Sets," University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA

2008-05-12

The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii) evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.
Variation in the fumonisin biosynthetic gene cluster in fumonisin-producing and nonproducing black aspergilli.

PubMed

Susca, Antonia; Proctor, Robert H; Butchko, Robert A E; Haidukowski, Miriam; Stea, Gaetano; Logrieco, Antonio; Moretti, Antonio

2014-12-01

The ability to produce fumonisin mycotoxins varies among members of the black aspergilli. Previously, analyses of selected genes in the fumonisin biosynthetic gene (fum) cluster in black aspergilli from California grapes indicated that fumonisin-nonproducing isolates of Aspergillus welwitschiae lack six fum genes, but nonproducing isolates of Aspergillus niger do not. In the current study, analyses of black aspergilli from grapes from the Mediterranean Basin indicate that the genomic context of the fum cluster is the same in isolates of A. niger and A. welwitschiae regardless of fumonisin-production ability and that full-length clusters occur in producing isolates of both species and nonproducing isolates of A. niger. In contrast, the cluster has undergone an eight-gene deletion in fumonisin-nonproducing isolates of A. welwitschiae. Phylogenetic analyses suggest each species consists of a mixed population of fumonisin-producing and nonproducing individuals, and that existence of both production phenotypes may provide a selective advantage to these species. Differences in gene content of fum cluster homologues and phylogenetic relationships of fum genes suggest that the mutation(s) responsible for the nonproduction phenotype differs, and therefore arose independently, in the two species. Partial fum cluster homologues were also identified in genome sequences of four other black Aspergillus species. Gene content of these partial clusters and phylogenetic relationships of fum sequences indicate that non-random partial deletion of the cluster has occurred multiple times among the species. This in turn suggests that an intact cluster and fumonisin production were once more widespread among black aspergilli. Copyright © 2014 Elsevier Inc. All rights reserved.
Wide distribution of O157-antigen biosynthesis gene clusters in Escherichia coli.

PubMed

Iguchi, Atsushi; Shirai, Hiroki; Seto, Kazuko; Ooka, Tadasuke; Ogura, Yoshitoshi; Hayashi, Tetsuya; Osawa, Kayo; Osawa, Ro

2011-01-01

Most Escherichia coli O157-serogroup strains are classified as enterohemorrhagic E. coli (EHEC), which is known as an important food-borne pathogen for humans. They usually produce Shiga toxin (Stx) 1 and/or Stx2, and express H7-flagella antigen (or are nonmotile). However, O157 strains that do not produce Stxs and express H antigens different from H7 are sometimes isolated from clinical and other sources. Multilocus sequence analysis revealed that the 21 O157:non-H7 strains tested in this study belong to multiple evolutionary lineages different from that of EHEC O157:H7 strains, suggesting a wide distribution of the gene set encoding the O157-antigen biosynthesis in multiple lineages. To gain insight into the gene organization and the sequence similarity of the O157-antigen biosynthesis gene clusters, we conducted genomic comparisons of the chromosomal regions (about 59 kb in each strain) covering the O-antigen gene cluster and its flanking regions between six O157:H7/non-H7 strains. Gene organization of the O157-antigen gene cluster was identical among O157:H7/non-H7 strains, but was divided into two distinct types at the nucleotide sequence level. Interestingly, distribution of the two types did not clearly follow the evolutionary lineages of the strains, suggesting that horizontal gene transfer of both types of O157-antigen gene clusters has occurred independently among E. coli strains. Additionally, detailed sequence comparison revealed that some positions of the repetitive extragenic palindromic (REP) sequences in the regions flanking the O-antigen gene clusters were coincident with possible recombination points. From these results, we conclude that the horizontal transfer of the O157-antigen gene clusters induced the emergence of multiple O157 lineages within E. coli and speculate that REP sequences may be one of the driving forces for exchange and evolution of O-antigen loci.
Delineation of metabolic gene clusters in plant genomes by chromatin signatures

PubMed Central

Yu, Nan; Nützmann, Hans-Wilhelm; MacDonald, James T.; Moore, Ben; Field, Ben; Berriri, Souha; Trick, Martin; Rosser, Susan J.; Kumar, S. Vinod; Freemont, Paul S.; Osbourn, Anne

2016-01-01

Plants are a tremendous source of diverse chemicals, including many natural product-derived drugs. It has recently become apparent that the genes for the biosynthesis of numerous different types of plant natural products are organized as metabolic gene clusters, thereby unveiling a highly unusual form of plant genome architecture and offering novel avenues for discovery and exploitation of plant specialized metabolism. Here we show that these clustered pathways are characterized by distinct chromatin signatures of histone H3 lysine 27 trimethylation (H3K27me3) and the histone H2A variant H2A.Z, associated with cluster repression and activation, respectively, and represent discrete windows of co-regulation in the genome. We further demonstrate that knowledge of these chromatin signatures along with chromatin mutants can be used to mine genomes for cluster discovery. The roles of H3K27me3 and H2A.Z in repression and activation of single genes in plants are well known. However, our discovery of highly localized operon-like co-regulated regions of chromatin modification is unprecedented in plants. Our findings raise intriguing parallels with groups of physically linked multi-gene complexes in animals and with clustered pathways for specialized metabolism in filamentous fungi. PMID:26895889
A conserved gene cluster as a putative functional unit in insect innate immunity.

PubMed

Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Andó, István

2010-11-05

The Nimrod gene superfamily is an important component of the innate immune response. The majority of its member genes are located in close proximity within the Drosophila melanogaster genome and they lie in a larger conserved cluster ("Nimrod cluster"), made up of non-related groups (families, superfamilies) of genes. This cluster has been a part of the Arthropod genomes for about 300-350 million years. The available data suggest that the Nimrod cluster is a functional module of the insect innate immune response. Copyright © 2010 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture

PubMed Central

Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong; Cole, James R.; Hashsham, Syed A.; Looft, Torey; Zhu, Yong-Guan

2016-01-01

Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that co-occur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. PMID:27073098
Cytoplasmic involvement in ADH-mediated osmosis across toad urinary bladder.

PubMed

DiBona, D R

1983-11-01

Several lines of investigation have suggested that antidiuretic hormone (ADH) may have direct effects on the cytoskeletal organization of granular epithelial cells in the toad urinary bladder. To some extent, these effects are in concert with the well-established action of ADH on the hydraulic permeability of the mucosal plasma membrane, but it appears that other conformational adjustments (largely cytoplasmic) may be of comparable importance. The thrust of this review is that the hormone brings about a general restructuring of the granular cells so that the epithelium as a whole may function efficiently as an osmotic pathway. Details of cytoskeletal changes are far from clear as yet, but the consequences of interfering with or modulating these particular effects imply that cytoplasmic organization is the seat of feedback control of osmotic flow rate, the basis for viability in the presence of dramatic cytosolic dilution, and a major factor in the observed disparity between osmotic and diffusional permeability coefficients. In the interest of stimulating new thoughts and experiments in this area, a number of preliminary findings have been freely cited.
β-globin gene cluster haplotypes in ethnic minority populations of southwest China

PubMed Central

Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

2017-01-01

The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and a recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, the frequency of haplotype 9 was significantly high in the southwest populations, indicating their difference from other Chinese populations. The interpopulation differentiation of southwest Chinese minority populations is less than that of populations of northern China and other continents. Phylogenetic analysis shows that populations sharing the same ethnic origin or language clustered with each other, indicating that current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625
The evolutionary life cycle of the polysaccharide biosynthetic gene cluster based on the Sphingomonadaceae.

PubMed

Wu, Mengmeng; Huang, Haidong; Li, Guoqiang; Ren, Yi; Shi, Zhong; Li, Xiaoyan; Dai, Xiaohui; Gao, Ge; Ren, Mengnan; Ma, Ting

2017-04-21

Although clustering of genes from the same metabolic pathway is a widespread phenomenon, the evolution of the polysaccharide biosynthetic gene cluster remains poorly understood. To determine the evolution of this pathway, we identified a scattered production pathway of the polysaccharide sanxan by Sphingomonas sanxanigenens NX02, and compared the distribution of genes between sphingan-producing and other Sphingomonadaceae strains. This allowed us to determine how the scattered sanxan pathway developed, and how the polysaccharide gene cluster evolved. Our findings suggested that the evolution of microbial polysaccharide biosynthesis gene clusters is a lengthy cyclic process comprising cluster 1 → scatter → cluster 2. The sanxan biosynthetic pathway proved the existence of a dispersive process. We also report the complete genome sequence of NX02, in which we identified many unstable genetic elements and powerful secretion systems. Furthermore, nine enzymes for the formation of activated precursors, four glycosyltransferases, four acyltransferases, and four polymerization and export proteins were identified. These genes were scattered in the NX02 genome, and the positive regulator SpnA of sphingans synthesis could not regulate sanxan production. Finally, we concluded that the evolution of the sanxan pathway was independent. NX02 evolved naturally as a polysaccharide producing strain over a long-time evolution involving gene acquisitions and adaptive mutations.
Co-clustering phenome–genome for phenotype classification and disease gene discovery

PubMed Central

Hwang, TaeHyun; Atluri, Gowtham; Xie, MaoQiang; Dey, Sanjoy; Hong, Changjin; Kumar, Vipin; Kuang, Rui

2012-01-01

Understanding the categorization of human diseases is critical for reliably identifying disease causal genes. Recently, genome-wide studies of abnormal chromosomal locations related to diseases have mapped >2000 phenotype–gene relations, which provide valuable information for classifying diseases and identifying candidate genes as drug targets. In this article, a regularized non-negative matrix tri-factorization (R-NMTF) algorithm is introduced to co-cluster phenotypes and genes, and simultaneously detect associations between the detected phenotype clusters and gene clusters. The R-NMTF algorithm factorizes the phenotype–gene association matrix under the prior knowledge from phenotype similarity network and protein–protein interaction network, supervised by the label information from known disease classes and biological pathways. In the experiments on disease phenotype–gene associations in OMIM and KEGG disease pathways, R-NMTF significantly improved the classification of disease phenotypes and disease pathway genes compared with support vector machines and Label Propagation in cross-validation on the annotated phenotypes and genes. The newly predicted phenotypes in each disease class are highly consistent with human phenotype ontology annotations. The roles of the new member genes in the disease pathways are examined and validated in the protein–protein interaction subnetworks. Extensive literature review also confirmed many new members of the disease classes and pathways as well as the predicted associations between disease phenotype classes and pathways. PMID:22735708
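The tri-factorization idea in the abstract above can be illustrated with a short Python sketch. It factorizes a non-negative phenotype-by-gene matrix X into F S G^T using standard multiplicative updates for the squared reconstruction error. This is a simplified, unregularized variant and not the authors' R-NMTF: the phenotype-similarity and protein-interaction regularizers and the label supervision are omitted, and the function names, parameters, and toy matrix are assumptions made purely for illustration.

```python
import numpy as np

def nmtf(X, k_rows, k_cols, n_iter=200, seed=0, eps=1e-9):
    """Plain non-negative matrix tri-factorization X ~ F @ S @ G.T.

    F (rows x k_rows) holds row-cluster memberships, G (cols x k_cols) holds
    column-cluster memberships, and S links row clusters to column clusters.
    Hypothetical helper: no regularization or supervision is applied.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    F = rng.random((n, k_rows)) + eps
    S = rng.random((k_rows, k_cols)) + eps
    G = rng.random((m, k_cols)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep every factor non-negative.
        F *= (X @ G @ S.T) / (F @ S @ G.T @ G @ S.T + eps)
        G *= (X.T @ F @ S) / (G @ S.T @ F.T @ F @ S + eps)
        S *= (F.T @ X @ G) / (F.T @ F @ S @ G.T @ G + eps)
    return F, S, G

def reconstruction_error(X, F, S, G):
    """Frobenius norm of the residual X - F S G^T."""
    return float(np.linalg.norm(X - F @ S @ G.T))

if __name__ == "__main__":
    # Toy association matrix: two phenotype groups, two gene groups.
    X = np.kron(np.eye(2), np.ones((3, 4)))
    F, S, G = nmtf(X, k_rows=2, k_cols=2)
    print("row clusters:", F.argmax(axis=1))
    print("column clusters:", G.argmax(axis=1))
    print("residual:", reconstruction_error(X, F, S, G))
```

In this sketch the largest entry in each row of F or G is read as a cluster assignment, and large entries of S indicate which phenotype clusters are associated with which gene clusters.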
Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.

PubMed

Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa

2017-06-15

Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.
Clustering Gene Expression Regulators: New Approach to Disease Subtyping

PubMed Central

Pyatnitskiy, Mikhail; Mazo, Ilya; Shkrob, Maria; Schwartz, Elena; Kotelnikova, Ekaterina

2014-01-01

One of the main challenges in modern medicine is to stratify patient groups by underlying molecular disease mechanisms so as to develop a more personalized approach to therapy. Here we propose a novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on the Sub-Network Enrichment Analysis (SNEA) algorithm, which identifies gene subnetworks with significant concordant changes in expression between two conditions. Each subnetwork consists of a central regulator and downstream genes connected by relations extracted from a global literature-derived regulation database. Regulators found in each patient separately are clustered together and assigned activity scores, which are used for the final grouping of patients. We show that our approach performs well compared to other related methods and at the same time provides researchers with a complementary level of understanding of the pathway-level biology behind a disease by identifying significant expression regulators. We observed a reasonable grouping of neuromuscular disorders (triggered by structural damage vs. triggered by unknown mechanisms) that was not revealed using standard expression profile clustering. In another experiment we were able to suggest clusters of regulators responsible for colorectal carcinoma vs. adenoma discrimination and to identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. The proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes to dozens of clusters of regulators. The resulting clusters of regulators make it possible to generate valuable biological hypotheses about molecular mechanisms related to clinical outcome for an individual patient. PMID:24416320
Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma

PubMed Central

Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

2007-01-01

Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric, density-estimation-based approach called iterative local Gaussian clustering (ILGC) was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, ILGC finds three gene clusters, two large and one small, similar to the results of Muro and others, which were obtained with Gaussian mixture clustering. The correlation between each gene cluster and clinical properties of malignancy in human colorectal cancer was analysed with respect to tumor versus normal tissue, the existence of distant metastasis, and the existence of lymph node metastasis. PMID:18305825
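The abstract above does not give the ILGC update rule, so the following Python sketch shows only the general family it belongs to: a generic Gaussian-kernel mean shift, in which each point climbs toward a local maximum of a kernel density estimate and points that reach the same mode form one cluster. It is not ILGC itself; the function names, the bandwidth, the merge tolerance, and the toy data are all assumptions for illustration.

```python
import numpy as np

def gaussian_mean_shift(points, bandwidth=1.0, n_iter=50):
    """Move every point uphill on a Gaussian kernel density estimate."""
    points = np.asarray(points, dtype=float)
    modes = points.copy()
    for _ in range(n_iter):
        # Squared distances between current mode estimates and all data points.
        d2 = ((modes[:, None, :] - points[None, :, :]) ** 2).sum(axis=2)
        w = np.exp(-d2 / (2.0 * bandwidth ** 2))
        # Each estimate moves to the kernel-weighted mean of the data.
        modes = (w @ points) / w.sum(axis=1, keepdims=True)
    return modes

def label_modes(modes, tol=0.5):
    """Give the same label to points whose modes lie within tol of each other."""
    labels = -np.ones(len(modes), dtype=int)
    centers = []
    for i, m in enumerate(modes):
        for j, c in enumerate(centers):
            if np.linalg.norm(m - c) < tol:
                labels[i] = j
                break
        else:
            centers.append(m)
            labels[i] = len(centers) - 1
    return labels

if __name__ == "__main__":
    # Hypothetical two-dimensional expression profiles forming two groups.
    profiles = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
                         [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]])
    print(label_modes(gaussian_mean_shift(profiles)))
```

The number of clusters is not fixed in advance here; it follows from the bandwidth, which is the main design choice in density-based approaches of this kind.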
Replication of Genome Wide Association Studies of Alcohol Dependence: Support for Association with Variation in ADH1C

PubMed Central

Biernacka, Joanna M.; Geske, Jennifer R.; Schneekloth, Terry D.; Frye, Mark A.; Cunningham, Julie M.; Choi, Doo-Sup; Tapp, Courtney L.; Lewis, Bradley R.; Drews, Maureen S.; Pietrzak, Tracy L.; Colby, Colin L.; Hall-Flavin, Daniel K.; Loukianova, Larissa L.; Heit, John A.; Mrazek, David A.; Karpyak, Victor M.

2013-01-01

Genome-wide association studies (GWAS) have revealed many single nucleotide polymorphisms (SNPs) associated with complex traits. Although these studies frequently fail to identify statistically significant associations, the top association signals from GWAS may be enriched for true associations. We therefore investigated the association of alcohol dependence with 43 SNPs selected from association signals in the first two published GWAS of alcoholism. Our analysis of 808 alcohol-dependent cases and 1,248 controls provided evidence of association of alcohol dependence with SNP rs1614972 in the ADH1C gene (unadjusted p = 0.0017). Because the GWAS study that originally reported association of alcohol dependence with this SNP [1] included only men, we also performed analyses in sex-specific strata. The results suggest that this SNP has a similar effect in both sexes (men: OR (95%CI) = 0.80 (0.66, 0.95); women: OR (95%CI) = 0.83 (0.66, 1.03)). We also observed marginal evidence of association of the rs1614972 minor allele with lower alcohol consumption in the non-alcoholic controls (p = 0.081), and independently in the alcohol-dependent cases (p = 0.046). Despite a number of potential differences between the samples investigated by the prior GWAS and the current study, data presented here provide additional support for the association of SNP rs1614972 in ADH1C with alcohol dependence and extend this finding by demonstrating association with consumption levels in both non-alcoholic and alcohol-dependent populations. Further studies should investigate the association of other polymorphisms in this gene with alcohol dependence and related alcohol-use phenotypes. PMID:23516558
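For readers unfamiliar with the OR (95%CI) notation used in the abstract above, the Python sketch below shows one common way such figures can be obtained from a 2x2 table of counts: the cross-product odds ratio with a Wald interval on the log scale. The counts are hypothetical and are not taken from the study, the function name is an assumption, and the study itself may well have used regression-based estimates instead.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a: cases carrying the allele      b: cases not carrying it
    c: controls carrying the allele   d: controls not carrying it
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the arithmetic.
    or_, lo, hi = odds_ratio_ci(20, 80, 10, 90)
    print(f"OR = {or_:.2f} (95% CI {lo:.2f}, {hi:.2f})")
```

An interval that spans 1, as for the women-only stratum reported in the abstract, means the data are compatible with no effect at the chosen confidence level.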
Patterning C. elegans: homeotic cluster genes, cell fates and cell migrations.

PubMed

Salser, S J; Kenyon, C

1994-05-01

Despite its simple body form, the nematode C. elegans expresses homeotic cluster genes similar to those of insects and vertebrates in the patterning of many cell types and tissues along the anteroposterior axis. In the ventral nerve cord, these genes program spatial patterns of cell death, fusion, division and neurotransmitter production; in migrating cells they regulate the direction and extent of movement. Nematode development permits an analysis at the cellular level of how homeotic cluster genes interact to specify cell fates, and how cell behavior can be regulated to assemble an organism.
A Genomics Based Discovery of Secondary Metabolite Biosynthetic Gene Clusters in Aspergillus ustus

PubMed Central

Pi, Borui; Yu, Dongliang; Dai, Fangwei; Song, Xiaoming; Zhu, Congyi; Li, Hongye; Yu, Yunsong

2015-01-01

Secondary metabolites (SMs) produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is derived almost exclusively from a few model organisms, including A. nidulans and A. fumigatus, and little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, including the sterigmatocystin biosynthesis pathway that is commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters that were possibly acquired by horizontal gene transfer were identified in Aspergillus for the first time, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but far fewer with other Aspergilli such as A. niger and A. oryzae. These findings should help in understanding the diversity and evolution of SM biosynthesis pathways in the genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in the clinic. PMID:25706180
Opioid-induced hyponatremia in a patient with central diabetes insipidus: independence from ADH.

PubMed

Bhat, Nandini; Balliu, Erjola; Osipoff, Jennifer; Lane, Andrew; Wilson, Thomas

2017-05-24

Hyponatremia can be a complication of opioid therapy, which has been postulated to occur secondary to inappropriate antidiuretic hormone secretion (syndrome of inappropriate antidiuretic hormone secretion [SIADH]). We report severe hyponatremia following wisdom teeth extraction with opioid analgesia in a 19-year-old female with diabetes insipidus (DI) and acquired panhypopituitarism that challenges this theory. As this patient has DI, we believe opioid treatment caused severe hyponatremia by the following mechanisms: (1) Opioids have a direct antidiuretic effect independent of changes in ADH, as demonstrated in Brattleboro rats with central DI. (2) Hydrocodone may have stimulated this patient's thirst center contributing to hyponatremia, as demonstrated in animal studies. Opioid use can cause hyponatremia in patients independent of ADH. It is important for clinicians to be aware of this so that patients can be appropriately counseled.
[Vasopressin (ADH)].

PubMed

Hirai, A; Uchida, D; Yoshida, S

1992-12-01

Vasopressin is thought to play an important role, not only in the metabolism of water and electrolytes, but also in the regulation of renal hemodynamics. This year, great progress has been achieved in molecular biology of vasopressin receptors. First, the cloning of a complementary DNA, encoding the rat liver V1a arginine vasopressin receptor, was reported. The liver cDNA encodes a protein with seven putative transmembrane domains, which binds arginine vasopressin and related compounds with affinities similar to the native rat V1a receptor. The messenger RNA, corresponding to the cDNA, is distributed in rat tissues, known to contain V1a receptors. Second, the cloning of a complementary DNA encoding the rat kidney V2 arginine vasopressin receptor was also successful. The kidney cDNA encodes a protein with a transmembrane topography characteristic of G protein-coupled receptors. The receptor messenger RNA is detected only in the kidney. Last year, an orally active and specific vasopressin V1 receptor antagonist, OPC-21268 was first reported. The i.v. or p.o. administration of OPC-21268 dose-dependently inhibited vasopressin-induced vasoconstriction, while that induced by angiotensin II was not affected. OPC-21268 may have clinical potentials in certain hypertensive cardiovascular disorders. In addition, an orally active and specific vasopressin V2 receptor antagonist, OPC-31260 was also reported. Oral administration of OPC-31260 inhibited antidiuretic action of arginine vasopressin. OPC-31260 is thought to be useful in the treatment of certain disorders, such as the syndrome of inappropriate secretion of ADH (SIADH).
Origins and Domestication of Cultivated Banana Inferred from Chloroplast and Nuclear Genes

PubMed Central

Zhang, Cui; Wang, Xin-Feng; Shi, Feng-Xue; Chen, Wen-Na; Ge, Xue-Jun

2013-01-01

Background: Cultivated bananas are large, vegetatively-propagated members of the genus Musa. More than 1,000 cultivars are grown worldwide and they are major economic and food resources in numerous developing countries. It has been suggested that cultivated bananas originated from the islands of Southeast Asia (ISEA) and have been developed through complex geodomestication pathways. However, the maternal and parental donors of most cultivars are unknown, and the pattern of nucleotide diversity in domesticated banana has not been fully resolved. Methodology/Principal Findings: We studied the genetics of 16 cultivated and 18 wild Musa accessions using two single-copy nuclear (granule-bound starch synthase I, GBSS I, also known as Waxy, and alcohol dehydrogenase 1, Adh1) and two chloroplast (maturase K, matK, and the trnL-F gene cluster) genes. The results of phylogenetic analyses showed that all A-genome haplotypes of cultivated bananas were grouped together with those of ISEA subspecies of M. acuminata (A-genome). Similarly, the B- and S-genome haplotypes of cultivated bananas clustered with the wild species M. balbisiana (B-genome) and M. schizocarpa (S-genome), respectively. Notably, distinct haplotypes of each cultivar (A-genome group) were nested with different ISEA subspecies of M. acuminata. Analyses of nucleotide polymorphism in the Waxy and Adh1 genes revealed that, in comparison to the wild relatives, cultivated banana exhibited slightly lower nucleotide diversity both across all sites and specifically at silent sites. However, dramatically reduced nucleotide diversity was found at nonsynonymous sites for cultivated bananas. Conclusions/Significance: Our study not only confirmed the origin of cultivated banana as arising from multiple intra- and inter-specific hybridization events, but also showed that cultivated banana may not have suffered a severe genetic bottleneck during the domestication process. Importantly, our findings
Genome mining-directed activation of a silent angucycline biosynthetic gene cluster in Streptomyces chattanoogensis.

PubMed

Zhou, Zhenxing; Xu, Qingqing; Bu, Qingting; Guo, Yuanyang; Liu, Shuiping; Liu, Yu; Du, Yiling; Li, Yongquan

2015-02-09

Genomic sequencing of actinomycetes has revealed the presence of numerous gene clusters seemingly capable of natural product biosynthesis, yet most clusters are cryptic under laboratory conditions. Bioinformatics analysis of the completely sequenced genome of Streptomyces chattanoogensis L10 (CGMCC 2644) revealed a silent angucycline biosynthetic gene cluster. The overexpression of a pathway-specific activator gene under the constitutive ermE* promoter successfully triggered the expression of the angucycline biosynthetic genes. Two novel members of the angucycline antibiotic family, chattamycins A and B, were further isolated and elucidated. Biological activity assays demonstrated that chattamycin B possesses good antitumor activities against human cancer cell lines and moderate antibacterial activities. The results presented here provide a feasible method to activate silent angucycline biosynthetic gene clusters to discover potential new drug leads. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Global Identification of Genes Affecting Iron-Sulfur Cluster Biogenesis and Iron Homeostasis

PubMed Central

Hidese, Ryota; Kurihara, Tatsuo; Esaki, Nobuyoshi

2014-01-01

Iron-sulfur (Fe-S) clusters are ubiquitous cofactors that are crucial for many physiological processes in all organisms. In Escherichia coli, assembly of Fe-S clusters depends on the activity of the iron-sulfur cluster (ISC) assembly and sulfur mobilization (SUF) apparatus. However, the underlying molecular mechanisms and the mechanisms that control Fe-S cluster biogenesis and iron homeostasis are still poorly defined. In this study, we performed a global screen to identify the factors affecting Fe-S cluster biogenesis and iron homeostasis using the Keio collection, which is a library of 3,815 single-gene E. coli knockout mutants. The approach was based on radiolabeling of the cells with [2-14C]dihydrouracil, which entirely depends on the activity of an Fe-S enzyme, dihydropyrimidine dehydrogenase. We identified 49 genes affecting Fe-S cluster biogenesis and/or iron homeostasis, including 23 genes important only under microaerobic/anaerobic conditions. This study defines key proteins associated with Fe-S cluster biogenesis and iron homeostasis, which will aid further understanding of the cellular mechanisms that coordinate the processes. In addition, we applied the [2-14C]dihydrouracil-labeling method to analyze the role of amino acid residues of an Fe-S cluster assembly scaffold (IscU) as a model of the Fe-S cluster assembly apparatus. The analysis showed that Cys37, Cys63, His105, and Cys106 are essential for the function of IscU in vivo, demonstrating the potential of the method to investigate in vivo function of proteins involved in Fe-S cluster assembly. PMID:24415728
Genomics-driven discovery of the pneumocandin biosynthetic gene cluster in the fungus Glarea lozoyensis

PubMed Central

2013-01-01

Background The antifungal therapy caspofungin is a semi-synthetic derivative of pneumocandin B0, a lipohexapeptide produced by the fungus Glarea lozoyensis, and was the first member of the echinocandin class approved for human therapy. The nonribosomal peptide synthetase (NRPS)-polyketide synthases (PKS) gene cluster responsible for pneumocandin biosynthesis from G. lozoyensis has not been elucidated to date. In this study, we report the elucidation of the pneumocandin biosynthetic gene cluster by whole genome sequencing of the G. lozoyensis wild-type strain ATCC 20868. Results The pneumocandin biosynthetic gene cluster contains a NRPS (GLNRPS4) and a PKS (GLPKS4) arranged in tandem, two cytochrome P450 monooxygenases, seven other modifying enzymes, and genes for L-homotyrosine biosynthesis, a component of the peptide core. Thus, the pneumocandin biosynthetic gene cluster is significantly more autonomous and organized than that of the recently characterized echinocandin B gene cluster. Disruption mutants of GLNRPS4 and GLPKS4 no longer produced the pneumocandins (A0 and B0), and the Δglnrps4 and Δglpks4 mutants lost antifungal activity against the human pathogenic fungus Candida albicans. In addition to pneumocandins, the G. lozoyensis genome encodes a rich repertoire of natural product-encoding genes including 24 PKSs, six NRPSs, five PKS-NRPS hybrids, two dimethylallyl tryptophan synthases, and 14 terpene synthases. Conclusions Characterization of the gene cluster provides a blueprint for engineering new pneumocandin derivatives with improved pharmacological properties. Whole genome estimation of the secondary metabolite-encoding genes from G. lozoyensis provides yet another example of the huge potential for drug discovery from natural products from the fungal kingdom. PMID:23688303
A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean

PubMed Central

David, Perrine; Chen, Nicolas W.G.; Pedrosa-Harand, Andrea; Thareau, Vincent; Sévignac, Mireille; Cannon, Steven B.; Debouck, Daniel; Langin, Thierry; Geffroy, Valérie

2009-01-01

The B4 resistance (R) gene cluster is one of the largest clusters known in common bean (Phaseolus vulgaris [Pv]). It is located in a peculiar genomic environment in the subtelomeric region of the short arm of chromosome 4, adjacent to two heterochromatic blocks (knobs). We sequenced 650 kb spanning this locus and annotated 97 genes, 26 of which correspond to Coiled-Coil-Nucleotide-Binding-Site-Leucine-Rich-Repeat (CNL). Conserved microsynteny was observed between the Pv B4 locus and corresponding regions of Medicago truncatula and Lotus japonicus in chromosomes Mt6 and Lj2, respectively. The notable exception was the CNL sequences, which were completely absent in these regions. The origin of the Pv B4-CNL sequences was investigated through phylogenetic analysis, which reveals that, in the Pv genome, paralogous CNL genes are shared among nonhomologous chromosomes (4 and 11). Together, our results suggest that Pv B4-CNL was derived from CNL sequences from another cluster, the Co-2 cluster, through an ectopic recombination event. Integration of the soybean (Glycine max) genome data enables us to date more precisely this event and also to infer that a single CNL moved from the Co-2 to the B4 cluster. Moreover, we identified a new 528-bp satellite repeat, referred to as khipu, specific to the Phaseolus genus, present both between B4-CNL sequences and in the two knobs identified at the B4 R gene cluster. The khipu repeat is present on most chromosomal termini, indicating the existence of frequent ectopic recombination events in Pv subtelomeric regions. Our results highlight the importance of ectopic recombination in R gene evolution. PMID:19776165
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

DOE Office of Scientific and Technical Information (OSTI.GOV)

Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

2003-12-31

Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly related genomes permit the identification of functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either Aα or Aβ clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.
Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11

PubMed Central

Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.

2009-01-01

Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120
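As a rough illustration of the linkage disequilibrium measures that underlie a haplotype analysis of this kind, the sketch below computes D, D' and r² for two biallelic SNPs from phased haplotype counts. The function name `ld_stats` and the example counts are hypothetical and are not taken from the study.

```python
def ld_stats(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, D_prime, r2) for two biallelic loci.

    Inputs are counts of the four phased haplotypes (hypothetical data).
    """
    total = n_AB + n_Ab + n_aB + n_ab
    if total == 0:
        raise ValueError("no haplotypes supplied")
    p_AB = n_AB / total
    p_A = (n_AB + n_Ab) / total   # frequency of allele A at locus 1
    p_B = (n_AB + n_aB) / total   # frequency of allele B at locus 2
    D = p_AB - p_A * p_B          # raw disequilibrium coefficient
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("at least one locus is monomorphic")
    r2 = D * D / denom
    # D' rescales D by the largest value it could take given the allele frequencies
    if D >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(D) / d_max if d_max > 0 else 0.0
    return D, d_prime, r2


# Hypothetical counts: only AB and ab haplotypes observed (complete LD)
D, d_prime, r2 = ld_stats(50, 0, 0, 50)
```

A pair of SNPs separated by a region of increased recombination would be expected to show low r² in such a calculation, which is the kind of pattern the abstract describes between APOA5 and the neighbouring genes.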
Evolutionary change in the structure of the regulatory region that drives tissue and temporally regulated expression of alcohol dehydrogenase gene in Drosophila funebris.

PubMed

Amador, A; Papaceit, M; Juan, E

2001-06-01

The Adh locus of Drosophilidae is organized as a single gene transcribed from two spatially and temporally regulated promoters except in species of the repleta group, which have two single promoter genes. Here we show that in Drosophila funebris the Adh gene is transcribed from a single promoter, in both larva and adult, with qualitative and quantitative species-specific differences in tissue distribution. The gene is expressed in larval fat body but in other tissues such as gastric caeca, midgut and Malpighian tubules its expression is reduced compared to most Drosophilidae species, and in adults it is almost limited to the fat body. The comparative analysis of gene expression of two strains, which differ by a duplication, indicates that the cis elements necessary for this pattern of expression in larvae are included in the region of 1.55 kb upstream of the transcription initiation site. This new organization reveals the evolution of a different regulatory strategy to express the Adh gene in the subgenus Drosophila.
Improved efficiency in amplification of Escherichia coli o-antigen gene clusters using genome-wide sequence comparison

USDA-ARS's Scientific Manuscript database

Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...
Isolation and characterization of full-length putative alcohol dehydrogenase genes from Polygonum minus

NASA Astrophysics Data System (ADS)

Hamid, Nur Athirah Abd; Ismail, Ismanizan

2013-11-01

Polygonum minus, locally known as kesum, is an aromatic herb with a high secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohols to aldehydes, with NAD(P)(H) as cofactor. The main focus of this research is to identify the ADH gene. Total RNA was extracted from leaves of P. minus treated with 150 μM jasmonic acid. The full-length cDNA sequence of ADH was isolated via rapid amplification of cDNA ends (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence, and PCR was performed on genomic DNA to determine the exon and intron organization. Two ADH sequences, designated PmADH1 and PmADH2, were successfully isolated. Both sequences have an ORF of 801 bp, which encodes 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that the two sequences are highly similar in the ORF region but divergent in the 3' untranslated regions (UTRs). The amino acid sequences differ at residue 107: PmADH1 contains a Gly (G) residue, while PmADH2 contains a Cys (C) residue. The intron-exon organization of both sequences is also the same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain the "classical" short-chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are members of the short-chain alcohol dehydrogenase family.
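The relationship between the reported ORF length and protein length can be checked with simple arithmetic. The helper below is a hypothetical sketch that assumes the 801 bp figure includes the stop codon, which is consistent with the 266 residues reported.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acid residues encoded by an ORF of orf_bp nucleotides."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # The stop codon occupies one codon but adds no residue
    return codons - 1 if includes_stop else codons


residues = orf_protein_length(801)  # 267 codons, one of which is the stop
```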
An ADH1B variant and peer drinking in progression to adolescent drinking milestones: evidence of a gene-by-environment interaction.

PubMed

Olfson, Emily; Edenberg, Howard J; Nurnberger, John; Agrawal, Arpana; Bucholz, Kathleen K; Almasy, Laura A; Chorlian, David; Dick, Danielle M; Hesselbrock, Victor M; Kramer, John R; Kuperman, Samuel; Porjesz, Bernice; Schuckit, Marc A; Tischfield, Jay A; Wang, Jen-Chyong; Wetherill, Leah; Foroud, Tatiana M; Rice, John; Goate, Alison; Bierut, Laura J

2014-10-01

Adolescent drinking is an important public health concern, one that is influenced by both genetic and environmental factors. The functional variant rs1229984 in alcohol dehydrogenase 1B (ADH1B) has been associated at a genome-wide level with alcohol use disorders in diverse adult populations. However, few data are available regarding whether this variant influences early drinking behaviors and whether social context moderates this effect. This study examines the interplay between rs1229984 and peer drinking in the development of adolescent drinking milestones. One thousand five hundred and fifty European and African American individuals who had a full drink of alcohol before age 18 were selected from a longitudinal study of youth as part of the Collaborative Study on the Genetics of Alcoholism (COGA). Cox proportional hazards regression, with G × E product terms in the final models, was used to study 2 primary outcomes during adolescence: age of first intoxication and age of first DSM-5 alcohol use disorder symptom. The minor A allele of rs1229984 was associated with a protective effect for first intoxication (HR = 0.56, 95% CI 0.41 to 0.76) and first DSM-5 symptom (HR = 0.45, 95% CI 0.26 to 0.77) in the final models. Reporting that most or all best friends drink was associated with a hazardous effect for first intoxication (HR = 1.81, 95% CI 1.62 to 2.01) and first DSM-5 symptom (HR = 2.17, 95% CI 1.88 to 2.50) in the final models. Furthermore, there was a significant G × E interaction for first intoxication (p = 0.002) and first DSM-5 symptom (p = 0.01). Among individuals reporting none or few best friends drinking, the ADH1B variant had a protective effect for adolescent drinking milestones, but for those reporting most or all best friends drinking, this effect was greatly reduced. Our results suggest that the risk factor of best friends drinking attenuates the protective effect of a well-established ADH1B variant for 2 adolescent drinking

A ground truth based comparative study on clustering of gene expression data.

PubMed

Zhu, Yitan; Wang, Zuyi; Miller, David J; Clarke, Robert; Xuan, Jianhua; Hoffman, Eric P; Wang, Yue

2008-05-01

Given the variety of available clustering methods for gene expression data analysis, it is important to develop an appropriate and rigorous validation scheme to assess the performance and limitations of the most widely used clustering algorithms. In this paper, we present a ground truth based comparative study on the functionality, accuracy, and stability of five data clustering methods, namely hierarchical clustering, K-means clustering, self-organizing maps, standard finite normal mixture fitting, and a caBIG toolkit (VIsual Statistical Data Analyzer--VISDA), tested on sample clustering of seven published microarray gene expression datasets and one synthetic dataset. We examined the performance of these algorithms in both data-sufficient and data-insufficient cases using quantitative performance measures, including cluster number detection accuracy and mean and standard deviation of partition accuracy. The experimental results showed that VISDA, an interactive coarse-to-fine maximum likelihood fitting algorithm, is a solid performer on most of the datasets, while K-means clustering and self-organizing maps optimized by the mean squared compactness criterion generally produce more stable solutions than the other methods.
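One way to make a ground-truth comparison of this kind concrete is to score a clustering by the fraction of samples that land in the right class under the best one-to-one matching of cluster ids to class labels. The sketch below is only an assumed, brute-force version of such a "partition accuracy"; the abstract does not give the exact definition used in the study, and the function name and example labels are hypothetical.

```python
from itertools import permutations


def partition_accuracy(true_labels, cluster_labels):
    """Best-match accuracy of a clustering against ground-truth classes.

    Tries every one-to-one assignment of clusters to classes, so it is only
    practical for a small number of clusters.
    """
    if len(true_labels) != len(cluster_labels):
        raise ValueError("label lists must have equal length")
    if not true_labels:
        raise ValueError("no samples supplied")
    classes = list(dict.fromkeys(true_labels))      # unique, order-preserving
    clusters = list(dict.fromkeys(cluster_labels))
    # Pad the shorter side with placeholders that never equal a real label
    k = max(len(classes), len(clusters))
    classes += [object() for _ in range(k - len(classes))]
    clusters += [object() for _ in range(k - len(clusters))]
    best = 0
    for perm in permutations(classes):
        mapping = dict(zip(clusters, perm))
        hits = sum(1 for t, c in zip(true_labels, cluster_labels)
                   if mapping[c] == t)
        best = max(best, hits)
    return best / len(true_labels)


# Hypothetical example: one of four samples is placed in the wrong cluster
acc = partition_accuracy(["a", "a", "b", "b"], [0, 0, 0, 1])
```

Repeating such a score over many runs of a stochastic method would give the mean and standard deviation of partition accuracy that the abstract refers to.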
The Chloroplast atpA Gene Cluster in Chlamydomonas reinhardtii

PubMed Central

Drapier, Dominique; Suzuki, Hideki; Levy, Haim; Rimbault, Blandine; Kindle, Karen L.; Stern, David B.; Wollman, Francis-André

1998-01-01

Most chloroplast genes in vascular plants are organized into polycistronic transcription units, which generate a complex pattern of mono-, di-, and polycistronic transcripts. In contrast, most Chlamydomonas reinhardtii chloroplast transcripts characterized to date have been monocistronic. This paper describes the atpA gene cluster in the C. reinhardtii chloroplast genome, which includes the atpA, psbI, cemA, and atpH genes, encoding the α-subunit of the coupling-factor-1 (CF1) ATP synthase, a small photosystem II polypeptide, a chloroplast envelope membrane protein, and subunit III of the CF0 ATP synthase, respectively. We show that promoters precede the atpA, psbI, and atpH genes, but not the cemA gene, and that cemA mRNA is present only as part of di-, tri-, or tetracistronic transcripts. Deletions introduced into the gene cluster reveal, first, that CF1-α can be translated from di- or polycistronic transcripts, and, second, that substantial reductions in mRNA quantity have minimal effects on protein synthesis rates. We suggest that posttranscriptional mRNA processing is common in C. reinhardtii chloroplasts, permitting the expression of multiple genes from a single promoter. PMID:9625716
Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

PubMed

Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

1998-08-01

The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
Two Gene Clusters Coordinate Galactose and Lactose Metabolism in Streptococcus gordonii

PubMed Central

Zeng, Lin; Martino, Nicole C.

2012-01-01

Streptococcus gordonii is an early colonizer of the human oral cavity and an abundant constituent of oral biofilms. Two tandemly arranged gene clusters, designated lac and gal, were identified in the S. gordonii DL1 genome, which encode genes of the tagatose pathway (lacABCD) and sugar phosphotransferase system (PTS) enzyme II permeases. Genes encoding a predicted phospho-β-galactosidase (LacG), a DeoR family transcriptional regulator (LacR), and a transcriptional antiterminator (LacT) were also present in the clusters. Growth and PTS assays supported that the permease designated EIILac transports lactose and galactose, whereas EIIGal transports galactose. The expression of the gene for EIIGal was markedly upregulated in cells growing on galactose. Using promoter-cat fusions, a role for LacR in the regulation of the expressions of both gene clusters was demonstrated, and the gal cluster was also shown to be sensitive to repression by CcpA. The deletion of lacT caused an inability to grow on lactose, apparently because of its role in the regulation of the expression of the genes for EIILac, but had little effect on galactose utilization. S. gordonii maintained a selective advantage over Streptococcus mutans in a mixed-species competition assay, associated with its possession of a high-affinity galactose PTS, although S. mutans could persist better at low pHs. Collectively, these results support the concept that the galactose and lactose systems of S. gordonii are subject to complex regulation and that a high-affinity galactose PTS may be advantageous when S. gordonii is competing against the caries pathogen S. mutans in oral biofilms. PMID:22660715
The Genetic and Molecular Organization of the Dopa Decarboxylase Gene Cluster of Drosophila Melanogaster

PubMed Central

Stathakis, D. G.; Pentz, E. S.; Freeman, M. E.; Kullman, J.; Hankins, G. R.; Pearlson, N. J.; Wright, TRF.

1995-01-01

We report the complete molecular organization of the Dopa decarboxylase gene cluster. Mutagenesis screens recovered 77 new Df(2L)TW130 recessive lethal mutations. These new alleles combined with 263 previously isolated mutations in the cluster to define 18 essential genes. In addition, seven new deficiencies were isolated and characterized. Deficiency mapping, restriction fragment length polymorphism (RFLP) analysis and P-element-mediated germline transformation experiments determined the gene order for all 18 loci. Genomic and cDNA restriction endonuclease mapping, Northern blot analysis and DNA sequencing provided information on exact gene location, mRNA size and transcriptional direction for most of these loci. In addition, this analysis identified two transcription units that had not previously been identified by extensive mutagenesis screening. Most of the loci are contained within two dense subclusters. We discuss the effectiveness of mutagens and strategies used in our screens, the variable mutability of loci within the genome of Drosophila melanogaster, the cytological and molecular organization of the Ddc gene cluster, the validity of the one band-one gene hypothesis and a possible purpose for the clustering of genes in the Ddc region. PMID:8647399
Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters.

PubMed

Schorn, Michelle A; Alanjary, Mohammad M; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R; Ziemert, Nadine; Moore, Bradley S

2016-12-01

Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites.
Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters

PubMed Central

Schorn, Michelle A.; Alanjary, Mohammad M.; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R.; Ziemert, Nadine

2016-01-01

Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites. PMID:27902408
A Putative Gene Cluster from a Lyngbya wollei Bloom that Encodes Paralytic Shellfish Toxin Biosynthesis

PubMed Central

Mihali, Troco K.; Carmichael, Wayne W.; Neilan, Brett A.

2011-01-01

Saxitoxin and its analogs cause the paralytic shellfish-poisoning syndrome, adversely affecting human health and coastal shellfish industries worldwide. Here we report the isolation, sequencing, annotation, and predicted pathway of the saxitoxin biosynthetic gene cluster in the cyanobacterium Lyngbya wollei. The gene cluster spans 36 kb and encodes enzymes for the biosynthesis and export of the toxins. The Lyngbya wollei saxitoxin gene cluster differs from previously identified saxitoxin clusters as it contains genes that are unique to this cluster, whereby the carbamoyltransferase is truncated and replaced by an acyltransferase, explaining the unique toxin profile presented by Lyngbya wollei. These findings will enable the creation of toxin probes, for water monitoring purposes, as well as proof-of-concept for the combinatorial biosynthesis of these natural occurring alkaloids for the production of novel, biologically active compounds. PMID:21347365
Expression of alcoholism-relevant genes in the liver are differently correlated to different parts of the brain.

PubMed

Wang, Lishi; Huang, Yue; Jiao, Yan; Chen, Hong; Cao, Yanhong; Bennett, Beth; Wang, Yongjun; Gu, Weikuan

2013-01-01

The purpose of this study is to investigate whether expression profiles of alcoholism-relevant genes in different parts of the brain are correlated differently with those in the liver. Four experiments were conducted. First, we used gene expression profiles from five parts of the brain (striatum, prefrontal cortex, nucleus accumbens, hippocampus, and cerebellum) and from liver in a population of recombinant inbred mouse strains to examine the expression association of 10 alcoholism-relevant genes. Second, we conducted the same association analysis between brain structures and the lung. Third, using five randomly selected, nonalcoholism-relevant genes, we conducted the association analysis between brain and liver. Finally, we compared the expression of 10 alcoholism-relevant genes in hippocampus and cerebellum between an alcohol preference strain and a wild-type control. We observed a difference in correlation patterns in expression levels of 10 alcoholism-relevant genes between different parts of the brain with those of liver. We then examined the association of gene expression between alcohol dehydrogenases (Adh1, Adh2, Adh5, and Adh7) and different parts of the brain. The results were similar to those of the 10 genes. Then, we found that the association of those genes between brain structures and lung was different from that of liver. Next, we found that the association patterns of five alcoholism-nonrelevant genes were different from those of 10 alcoholism-relevant genes. Finally, we found that the expression level of 10 alcohol-relevant genes is influenced more in hippocampus than in cerebellum in the alcohol preference strain. Our results show that the expression of alcoholism-relevant genes in liver is differently associated with the expression of genes in different parts of the brain. Because different structural changes in different parts of the brain in alcoholism have been reported, it is important to investigate whether those structural differences in
Statistical indicators of collective behavior and functional clusters in gene networks of yeast

NASA Astrophysics Data System (ADS)

Živković, J.; Tadić, B.; Wick, N.; Thurner, S.

2006-03-01

We analyze gene expression time-series data of yeast (S. cerevisiae) measured along two full cell-cycles. We quantify these data by using q-exponentials, gene expression ranking and a temporal mean-variance analysis. We construct gene interaction networks based on correlation coefficients and study the formation of the corresponding giant components and minimum spanning trees. By coloring genes according to their cell function we find functional clusters in the correlation networks and functional branches in the associated trees. Our results suggest that a percolation point of functional clusters can be identified on these gene expression correlation networks.
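The network construction described above can be sketched in a few lines: compute pairwise correlations between expression time series, link genes whose correlation exceeds a threshold, and track the size of the largest connected component as the threshold changes. The code below is a minimal illustration under assumptions of our own (Pearson correlation, absolute value thresholding, toy data); the abstract does not specify these details.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length series (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def giant_component_size(series, threshold):
    """Size of the largest connected component of the correlation network.

    series: dict mapping gene name -> list of expression values.
    An edge joins two genes when |r| >= threshold (an assumed rule).
    """
    genes = list(series)
    parent = {g: g for g in genes}

    def find(g):
        # Union-find root lookup with path halving
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for i, a in enumerate(genes):
        for b in genes[i + 1:]:
            if abs(pearson(series[a], series[b])) >= threshold:
                parent[find(a)] = find(b)

    sizes = {}
    for g in genes:
        root = find(g)
        sizes[root] = sizes.get(root, 0) + 1
    return max(sizes.values(), default=0)


# Toy time series (hypothetical values)
toy = {
    "g1": [1, 2, 3, 4],
    "g2": [2, 4, 6, 8],
    "g3": [4, 3, 2, 1],
    "g4": [1, -1, 1, -1],
}
size_strict = giant_component_size(toy, 0.9)
size_loose = giant_component_size(toy, 0.4)
```

Scanning the threshold and watching where the largest component grows abruptly is one simple way to look for the percolation-like behavior the abstract mentions.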
Genome-wide identification of physically clustered genes suggests chromatin-level co-regulation in male reproductive development in Arabidopsis thaliana

PubMed Central

Reimegård, Johan; Kundu, Snehangshu; Pendle, Ali; Irish, Vivian F.; Shaw, Peter

2017-01-01

Co-expression of physically linked genes occurs surprisingly frequently in eukaryotes. Such chromosomal clustering may confer a selective advantage as it enables coordinated gene regulation at the chromatin level. We studied the chromosomal organization of genes involved in male reproductive development in Arabidopsis thaliana. We developed an in-silico tool to identify physical clusters of co-regulated genes from gene expression data. We identified 17 clusters (96 genes) involved in stamen development and acting downstream of the transcriptional activator MS1 (MALE STERILITY 1), which contains a PHD domain associated with chromatin re-organization. The clusters exhibited little gene homology or promoter element similarity, and largely overlapped with reported repressive histone marks. Experiments on a subset of the clusters suggested a link between expression activation and chromatin conformation: qRT-PCR and mRNA in situ hybridization showed that the clustered genes were up-regulated within 48 h after MS1 induction; out of 14 chromatin-remodeling mutants studied, expression of clustered genes was consistently down-regulated only in hta9/hta11, previously associated with metabolic cluster activation; DNA fluorescence in situ hybridization confirmed that transcriptional activation of the clustered genes was correlated with open chromatin conformation. Stamen development thus appears to involve transcriptional activation of physically clustered genes through chromatin de-condensation. PMID:28175342
Identifying conserved gene clusters in the presence of homology families.

PubMed

He, Xin; Goldwasser, Michael H

2005-01-01

The study of conserved gene clusters is important for understanding the forces behind genome organization and evolution, as well as the function of individual genes or gene groups. In this paper, we present a new model and algorithm for identifying conserved gene clusters from pairwise genome comparison. This generalizes a recent model called "gene teams." A gene team is a set of genes that appear homologously in two or more species, possibly in a different order yet with the distance of adjacent genes in the team for each chromosome always no more than a certain threshold. We remove the constraint in the original model that each gene must have a unique occurrence in each chromosome and thus allow the analysis on complex prokaryotic or eukaryotic genomes with extensive paralogs. Our algorithm analyzes a pair of chromosomes in O(mn) time and uses O(m+n) space, where m and n are the number of genes in the respective chromosomes. We demonstrate the utility of our methods by studying two bacterial genomes, E. coli K-12 and B. subtilis. Many of the teams identified by our algorithm correlate with documented E. coli operons, while several others match predicted operons, previously suggested by computational techniques. Our implementation and data are publicly available at euler.slu.edu/~goldwasser/homologyteams/.
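
To make the "gene team" definition concrete, here is a minimal sketch of the original, restricted model in which every gene occurs exactly once per chromosome. It is a naive refinement procedure, not the O(mn) homology-family algorithm the abstract describes; the function names, the `min_size` filter, and the toy positions are all assumptions.

```python
def split_by_gaps(genes, position, delta):
    """Split genes into runs whose neighbouring positions differ by at most delta."""
    ordered = sorted(genes, key=lambda gene: position[gene])
    runs, current = [], [ordered[0]]
    for gene in ordered[1:]:
        if position[gene] - position[current[-1]] <= delta:
            current.append(gene)
        else:
            runs.append(current)
            current = [gene]
    runs.append(current)
    return runs


def gene_teams(pos_a, pos_b, delta, min_size=2):
    """Maximal gene sets whose adjacent members lie within delta on both chromosomes.

    pos_a and pos_b map gene name -> position; each gene is assumed to occur
    once per chromosome (the unique-occurrence restriction).
    """
    common = set(pos_a) & set(pos_b)
    pending = [common] if common else []
    teams = []
    while pending:
        candidate = pending.pop()
        # Refine on chromosome A first, then B; re-queue the pieces whenever a split occurs.
        for position in (pos_a, pos_b):
            runs = split_by_gaps(candidate, position, delta)
            if len(runs) > 1:
                pending.extend(set(run) for run in runs)
                break
        else:
            # No split on either chromosome: the candidate is a team.
            if len(candidate) >= min_size:
                teams.append(sorted(candidate))
    return sorted(teams)


# Hypothetical gene positions on two chromosomes.
toy_pos_a = {"a": 1, "b": 2, "c": 3, "d": 10, "e": 11}
toy_pos_b = {"a": 5, "b": 4, "c": 20, "d": 21, "e": 22}
```

With `delta=2`, genes a and b stay together on both chromosomes, c is separated from them on the second chromosome, and d and e form a second team even though c sits next to them on the second chromosome, because c is far from them on the first.
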
Molecular analysis of SCARECROW genes expressed in white lupin cluster roots

PubMed Central

Sbabou, Laila; Bucciarelli, Bruna; Miller, Susan; Liu, Junqi; Berhada, Fatiha; Filali-Maltouf, Abdelkarim; Allan, Deborah; Vance, Carroll

2010-01-01

The Scarecrow (SCR) transcription factor plays a crucial role in root cell radial patterning and is required for maintenance of the quiescent centre and differentiation of the endodermis. In response to phosphorus (P) deficiency, white lupin (Lupinus albus L.) root surface area increases some 50-fold to 70-fold due to the development of cluster (proteoid) roots. Previously it was reported that SCR-like expressed sequence tags (ESTs) were expressed during early cluster root development. Here the cloning of two white lupin SCR genes, LaSCR1 and LaSCR2, is reported. The predicted amino acid sequences of both LaSCR gene products are highly similar to AtSCR and contain C-terminal conserved GRAS family domains. LaSCR1 and LaSCR2 transcript accumulation localized to the endodermis of both normal and cluster roots as shown by in situ hybridization and gene promoter::reporter staining. Transcript analysis as evaluated by quantitative real-time-PCR (qRT-PCR) and RNA gel hybridization indicated that the two LaSCR genes are expressed predominantly in roots. Expression of LaSCR genes was not directly responsive to the P status of the plant but was a function of cluster root development. Suppression of LaSCR1 in transformed roots of lupin and Medicago via RNAi (RNA interference) delivered through Agrobacterium rhizogenes resulted in decreased root numbers, reflecting the potential role of LaSCR1 in maintaining root growth in these species. The results suggest that the functional orthologues of AtSCR have been characterized. PMID:20167612
A homeotic gene cluster patterns the anteroposterior body axis of C. elegans.

PubMed

Wang, B B; Müller-Immergluck, M M; Austin, J; Robinson, N T; Chisholm, A; Kenyon, C

1993-07-16

In insects and vertebrates, clusters of Antennapedia class homeobox (HOM-C) genes specify anteroposterior body pattern. The nematode C. elegans also contains a small cluster of HOM-C genes, one of which has been shown to specify positional identity. Here we show that two additional C. elegans HOM-C genes also specify positional identity and that together these three HOM-C genes function along the anteroposterior axis in the same order as their homologs in other organisms. Thus, HOM-C-based pattern formation has been conserved in nematodes despite the many differences in morphology and embryology that distinguish them from other phyla. Each C. elegans HOM-C gene is responsible for a distinct body region; however, where their domains overlap, two HOM-C genes can act together to specify the fates of individual cells.
Clustering gene expression data based on predicted differential effects of GV interaction.

PubMed

Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

2005-02-01

Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, so the raw measurements from microarray experiments carry inherent "noise". Currently, logarithmic ratios are usually analyzed directly by various clustering methods, which may bias the interpretation when identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches is proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.
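
The idea of clustering on interaction effects rather than raw log ratios can be sketched with a much simpler stand-in for the method in this abstract. The example below assumes one observation per gene-by-variety cell and uses the ordinary least-squares interaction residual y_ij - (gene mean) - (variety mean) + (grand mean). It is not the AUP mixed-model predictor, and the function name and toy table are hypothetical.

```python
def interaction_effects(table):
    """Gene-by-variety interaction residuals for a complete two-way table.

    table[i][j] is the expression of gene i in variety j (one value per cell).
    Each returned row could serve as the input vector for clustering gene i.
    """
    n_genes, n_varieties = len(table), len(table[0])
    grand_mean = sum(sum(row) for row in table) / (n_genes * n_varieties)
    gene_means = [sum(row) / n_varieties for row in table]
    variety_means = [
        sum(row[j] for row in table) / n_genes for j in range(n_varieties)
    ]
    return [
        [
            table[i][j] - gene_means[i] - variety_means[j] + grand_mean
            for j in range(n_varieties)
        ]
        for i in range(n_genes)
    ]


# Hypothetical tables: the first is purely additive, the second has a crossover.
additive_table = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0]]
crossover_table = [[1.0, 2.0], [2.0, 1.0]]
```

In the additive table every residual is zero, so the two genes would be indistinguishable after removing main effects. In the crossover table the residuals are ±0.5, which is the kind of gene-specific response to variety that the abstract proposes to cluster on.
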
Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury

PubMed Central

2010-01-01

Background: Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from the brain correlates with the development of spasticity. Results: Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time expression profiles. Analysis of these gene clusters identifies early immunological/inflammatory and late developmental responses as well as a regulation of genes relating to neuron excitability that support the development of motor neuron hyper-excitability and the reappearance of plateau potentials in the late phase of the injury response. Transcription factor motif analysis identifies differentially expressed transcription factors involved in the regulation of each gene cluster, shaping the expression of the identified biological processes and their associated genes underlying the changes in motor neuron excitability. Conclusions: This analysis provides important clues to the underlying mechanisms of transcriptional regulation responsible for the increased excitability observed in motor neurons in the late chronic phase of spinal cord injury suggesting alternative targets for treatment of spinal cord injury. Several transcription factors were identified as potential regulators of gene clusters containing elements related to motor neuron hyper-excitability, the manipulation
Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury.

PubMed

Ryge, Jesper; Winther, Ole; Wienecke, Jacob; Sandelin, Albin; Westerdahl, Ann-Charlotte; Hultborn, Hans; Kiehn, Ole

2010-06-09

Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from the brain correlates with the development of spasticity. Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time expression profiles. Analysis of these gene clusters identifies early immunological/inflammatory and late developmental responses as well as a regulation of genes relating to neuron excitability that support the development of motor neuron hyper-excitability and the reappearance of plateau potentials in the late phase of the injury response. Transcription factor motif analysis identifies differentially expressed transcription factors involved in the regulation of each gene cluster, shaping the expression of the identified biological processes and their associated genes underlying the changes in motor neuron excitability. This analysis provides important clues to the underlying mechanisms of transcriptional regulation responsible for the increased excitability observed in motor neurons in the late chronic phase of spinal cord injury suggesting alternative targets for treatment of spinal cord injury. Several transcription factors were identified as potential regulators of gene clusters containing elements related to motor neuron hyper-excitability, the manipulation of which potentially could be
Transcriptional analysis of exopolysaccharides biosynthesis gene clusters in Lactobacillus plantarum.

PubMed

Vastano, Valeria; Perrone, Filomena; Marasco, Rosangela; Sacco, Margherita; Muscariello, Lidia

2016-04-01

Exopolysaccharides (EPS) from lactic acid bacteria contribute to specific rheology and texture of fermented milk products and find applications also in non-dairy foods and in therapeutics. Recently, four clusters of genes (cps) associated with surface polysaccharide production have been identified in Lactobacillus plantarum WCFS1, a probiotic and food-associated lactobacillus. These clusters are involved in cell surface architecture and probably in release and/or exposure of immunomodulating bacterial molecules. Here we show a transcriptional analysis of these clusters. Indeed, RT-PCR experiments revealed that the cps loci are organized in five operons. Moreover, by reverse transcription-qPCR analysis performed on L. plantarum WCFS1 (wild type) and WCFS1-2 (ΔccpA), we demonstrated that expression of three cps clusters is under the control of the global regulator CcpA. These results, together with the identification of putative CcpA target sequences (catabolite responsive element CRE) in the regulatory region of four out of five transcriptional units, strongly suggest for the first time a role of the master regulator CcpA in EPS gene transcription among lactobacilli.
RubisCO Gene Clusters Found in a Metagenome Microarray from Acid Mine Drainage

PubMed Central

Guo, Xue; Yin, Huaqun; Cong, Jing; Dai, Zhimin; Liang, Yili

2013-01-01

The enzyme responsible for carbon dioxide fixation in the Calvin cycle, ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO), is widely used as a phylogenetic marker to analyze the distribution and activity of autotrophic bacteria. However, such an approach provides no indication as to the significance of genomic content and organization. Horizontal transfers of RubisCO genes occurring in eubacteria and plastids may seriously affect the credibility of this approach. Here, we present a new method to analyze the diversity and genomic content of RubisCO genes in acid mine drainage (AMD). A metagenome microarray containing 7,776 large-insertion fosmids was constructed to quickly screen genome fragments containing RubisCO form I large-subunit genes (cbbL). Forty-six cbbL-containing fosmids were detected, and six fosmids were fully sequenced. To evaluate the reliability of the metagenome microarray and understand the microbial community in AMD, the diversities of cbbL and the 16S rRNA gene were analyzed. Fosmid sequences revealed that the form I RubisCO gene cluster could be subdivided into form IA and IB RubisCO gene clusters in AMD, because of significant divergences in molecular phylogenetics and conserved genomic organization. Interestingly, the form I RubisCO gene cluster coexisted with the form II RubisCO gene cluster in one fosmid genomic fragment. Phylogenetic analyses revealed that horizontal transfers of RubisCO genes may occur widely in AMD, which makes the evolutionary history of RubisCO difficult to reconcile with organismal phylogeny. PMID:23335778
Identification and characterization of the ergochrome gene cluster in the plant pathogenic fungus Claviceps purpurea.

PubMed

Neubauer, Lisa; Dopstadt, Julian; Humpf, Hans-Ulrich; Tudzynski, Paul

2016-01-01

Claviceps purpurea is a phytopathogenic fungus infecting a broad range of grasses including economically important cereal crop plants. The infection cycle ends with the formation of the typical purple-black pigmented sclerotia containing the toxic ergot alkaloids. Besides these ergot alkaloids little is known about the secondary metabolism of the fungus. Red anthraquinone derivatives and yellow xanthone dimers (ergochromes) have been isolated from sclerotia and described as ergot pigments, but the corresponding gene cluster has remained unknown. Fungal pigments are attracting increasing interest, for example as environmentally friendly alternatives to existing dyes. Furthermore, several pigments show biological activities and may have some pharmaceutical value. This study identified the gene cluster responsible for the synthesis of the ergot pigments. Overexpression of the cluster-specific transcription factor led to activation of the gene cluster and to the production of several known ergot pigments. Knockout of the cluster key enzyme, a nonreducing polyketide synthase, clearly showed that this cluster is responsible for the production of red anthraquinones as well as yellow ergochromes. Furthermore, a tentative biosynthetic pathway for the ergot pigments is proposed. By changing the culture conditions, pigment production was activated in axenic culture: a high concentration of phosphate and a low concentration of sucrose induced pigment synthesis. This is the first functional analysis of a secondary metabolite gene cluster in the ergot fungus besides that for the classical ergot alkaloids. We demonstrated that this gene cluster is responsible for the typical purple-black color of the ergot sclerotia and showed that the red and yellow ergot pigments are products of the same biosynthetic pathway. Activation of the gene cluster in axenic culture opened up new possibilities for biotechnological applications such as dye production or the development of new pharmaceuticals.

Clustering of time-course gene expression profiles using normal mixture models with autoregressive random effects

PubMed Central

2012-01-01

Background: Time-course gene expression data such as yeast cell cycle data may be periodically expressed. To cluster such data, currently used Fourier series approximations of periodic gene expressions have been found inadequate to model the complexity of the time-course data, partly because they ignore the dependence between the expression measurements over time and the correlation among gene expression profiles. We further investigate the advantages and limitations of available models in the literature and propose a new mixture model with autoregressive random effects of the first order for the clustering of time-course gene-expression profiles. Some simulations and real examples are given to demonstrate the usefulness of the proposed models. Results: We illustrate the applicability of our new model using synthetic and real time-course datasets. We show that our model outperforms existing models to provide more reliable and robust clustering of time-course data. Our model provides superior results when genetic profiles are correlated. It also gives comparable results when the correlation between the gene profiles is weak. In the applications to real time-course data, relevant clusters of coregulated genes are obtained, which are supported by gene-function annotation databases. Conclusions: Our new model under our extension of the EMMIX-WIRE procedure is more reliable and robust for clustering time-course data because it adopts a random effects model that allows for the correlation among observations at different time points. It postulates gene-specific random effects with an autocorrelation variance structure that models coregulation within the clusters. The developed R package is flexible in its specification of the random effects through user-input parameters that enable improved modelling and consequent clustering of time-course data. PMID:23151154
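
A first-order autoregressive random effect implies that measurements closer together in time are more strongly correlated. The sketch below shows the covariance structure of a stationary AR(1) process over equally spaced time points. The abstract does not state the parameterization, so the stationary form, equal spacing, and the names `rho` and `innovation_var` are assumptions, and this is not the EMMIX-WIRE implementation.

```python
def ar1_covariance(n_times, rho, innovation_var=1.0):
    """Covariance matrix of a stationary AR(1) process at n_times equally spaced points.

    Entry (t, s) equals innovation_var / (1 - rho**2) * rho**|t - s|.
    """
    if not -1.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between -1 and 1 for stationarity")
    marginal_var = innovation_var / (1.0 - rho ** 2)
    return [
        [marginal_var * rho ** abs(t - s) for s in range(n_times)]
        for t in range(n_times)
    ]


# Hypothetical example: three time points with autocorrelation 0.5.
toy_cov = ar1_covariance(3, 0.5)
```

Setting `rho` to zero reduces the matrix to independent errors, which corresponds to the situation the abstract criticizes: ignoring dependence between expression measurements over time.
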
The Genome of Tolypocladium inflatum: Evolution, Organization, and Expression of the Cyclosporin Biosynthetic Gene Cluster

PubMed Central

Bushley, Kathryn E.; Raja, Rajani; Jaiswal, Pankaj; Cumbie, Jason S.; Nonogaki, Mariko; Boyd, Alexander E.; Owensby, C. Alisha; Knaus, Brian J.; Elser, Justin; Miller, Daniel; Di, Yanming; McPhail, Kerry L.; Spatafora, Joseph W.

2013-01-01

The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921), the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS) that encodes for cyclosporin synthetase (simA) and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc.), and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further investigations of the role
Gene cluster conservation provides insight into cercosporin biosynthesis and extends production to the genus Colletotrichum.

PubMed

de Jonge, Ronnie; Ebert, Malaika K; Huitt-Roehl, Callie R; Pal, Paramita; Suttle, Jeffrey C; Spanner, Rebecca E; Neubauer, Jonathan D; Jurick, Wayne M; Stott, Karina A; Secor, Gary A; Thomma, Bart P H J; Van de Peer, Yves; Townsend, Craig A; Bolton, Melvin D

2018-06-12

Species in the genus Cercospora cause economically devastating diseases in sugar beet, maize, rice, soy bean, and other major food crops. Here, we sequenced the genome of the sugar beet pathogen Cercospora beticola and found it encodes 63 putative secondary metabolite gene clusters, including the cercosporin toxin biosynthesis (CTB) cluster. We show that the CTB gene cluster has experienced multiple duplications and horizontal transfers across a spectrum of plant pathogenic fungi, including the wide-host range Colletotrichum genus as well as the rice pathogen Magnaporthe oryzae. Although cercosporin biosynthesis has been thought to rely on an eight-gene CTB cluster, our phylogenomic analysis revealed gene collinearity adjacent to the established cluster in all CTB cluster-harboring species. We demonstrate that the CTB cluster is larger than previously recognized and includes cercosporin facilitator protein, previously shown to be involved with cercosporin autoresistance, and four additional genes required for cercosporin biosynthesis, including the final pathway enzymes that install the unusual cercosporin methylenedioxy bridge. Lastly, we demonstrate production of cercosporin by Colletotrichum fioriniae, the first known cercosporin producer within this agriculturally important genus. Thus, our results provide insight into the intricate evolution and biology of a toxin critical to agriculture and broaden the production of cercosporin to another fungal genus containing many plant pathogens of important crops worldwide. Copyright © 2018 the Author(s). Published by PNAS.
Combination of ALDH2 and ADH1B polymorphisms is associated with smoking initiation: A large-scale cross-sectional study in a Japanese population.

PubMed

Masaoka, Hiroyuki; Ito, Hidemi; Gallus, Silvano; Watanabe, Miki; Yokomizo, Akira; Eto, Masatoshi; Matsuo, Keitaro

2017-04-01

Aldehyde dehydrogenase 2 (ALDH2; rs671, Glu504Lys) and alcohol dehydrogenase 1B (ADH1B; rs1229984, His47Arg) polymorphisms are known to strongly influence alcohol drinking behavior. Given evidence of an association between smoking and drinking behaviors, we hypothesized that ALDH2/ADH1B polymorphisms might also be associated with smoking initiation, and conducted a cross-sectional study to examine this hypothesis. Study subjects were first-visit outpatients diagnosed not to have cancer at Aichi Cancer Center Hospital between 2001 and 2005, including 4141 never smokers and 2912 ever smokers. Unconditional logistic regression models were applied to estimate odds ratios (OR) and 95% confidence intervals (CI) for smoking initiation by comparing ever smokers with never smokers. Excessive alcohol drinking was associated with a higher likelihood of ever smoking. After adjustment for drinking behaviors, compared to individuals with ALDH2 Glu/Glu, the ORs of ever smoking were 1.71 (95% CI, 1.49-1.95) and 2.28 (1.81-2.87) among those with ALDH2 Glu/Lys and Lys/Lys, respectively. Combination of ALDH2 Lys/Lys and ADH1B His/His (i.e., the most alcohol-intolerant subpopulation) showed the highest OR [2.44 (1.84-3.23)], whereas combination of ALDH2 Glu/Glu and ADH1B Arg/Arg (i.e., the most alcohol-tolerant subpopulation) showed the lowest OR [0.83 (0.57-1.21)] compared with ALDH2 Glu/Glu and ADH1B His/His. Besides the amount and frequency of alcohol drinking, the combination of ALDH2 and ADH1B polymorphisms predicts smoking initiation. This study suggests that alcohol tolerance regulated by ALDH2 and ADH1B polymorphisms is associated with smoking initiation, and facilitates the development of targeted interventions to reduce smoking prevalence. Copyright © 2017 Elsevier B.V. All rights reserved.
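
For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained, the following sketch computes a crude (unadjusted) OR from a 2x2 table using the standard log-OR normal approximation. The study itself reports ORs adjusted through logistic regression, which this does not reproduce. The genotype counts below are invented for illustration, since the abstract gives only the totals of ever and never smokers, and the function name is hypothetical.

```python
from math import exp, log, sqrt


def odds_ratio_ci(exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls, z=1.96):
    """Crude odds ratio with a Wald confidence interval on the log scale.

    Returns (odds_ratio, lower, upper); z=1.96 gives an approximate 95% interval.
    All four counts must be positive.
    """
    odds_ratio = (exposed_cases * unexposed_controls) / (
        exposed_controls * unexposed_cases
    )
    # Standard error of log(OR): square root of the summed reciprocal cell counts.
    std_err = sqrt(
        1 / exposed_cases + 1 / exposed_controls
        + 1 / unexposed_cases + 1 / unexposed_controls
    )
    lower = exp(log(odds_ratio) - z * std_err)
    upper = exp(log(odds_ratio) + z * std_err)
    return odds_ratio, lower, upper


# Hypothetical counts (NOT from the study): carriers vs. non-carriers of a
# variant among ever smokers (cases) and never smokers (controls).
toy_or, toy_lower, toy_upper = odds_ratio_ci(200, 100, 300, 300)
```

An interval that excludes 1, as in this toy table, corresponds to a nominally significant association. The reported interval of 0.57-1.21 for the most alcohol-tolerant group includes 1, so that contrast is not significant on its own.
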
A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data.

PubMed

Nishiyama, Takeshi; Takahashi, Kunihiko; Tango, Toshiro; Pinto, Dalila; Scherer, Stephen W; Takami, Satoshi; Kishino, Hirohisa

2011-05-26

Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.
Identification of the Coumermycin A1 Biosynthetic Gene Cluster of Streptomyces rishiriensis DSM 40489

PubMed Central

Wang, Zhao-Xin; Li, Shu-Ming; Heide, Lutz

2000-01-01

The biosynthetic gene cluster of the aminocoumarin antibiotic coumermycin A1 was cloned by screening of a cosmid library of Streptomyces rishiriensis DSM 40489 with heterologous probes from a dTDP-glucose 4,6-dehydratase gene, involved in deoxysugar biosynthesis, and from the aminocoumarin resistance gyrase gene gyrBr. Sequence analysis of a 30.8-kb region upstream of gyrBr revealed the presence of 28 complete open reading frames (ORFs). Fifteen of the identified ORFs showed, on average, 84% identity to corresponding ORFs in the biosynthetic gene cluster of novobiocin, another aminocoumarin antibiotic. Possible functions of 17 ORFs in the biosynthesis of coumermycin A1 could be assigned by comparison with sequences in GenBank. Experimental proof for the function of the identified gene cluster was provided by an insertional gene inactivation experiment, which resulted in an abolishment of coumermycin A1 production. PMID:11036020
Gene structure and expression characteristic of a novel odorant receptor gene cluster in the parasitoid wasp Microplitis mediator (Hymenoptera: Braconidae).

PubMed

Wang, S-N; Shan, S; Zheng, Y; Peng, Y; Lu, Z-Y; Yang, Y-Q; Li, R-J; Zhang, Y-J; Guo, Y-Y

2017-08-01

Odorant receptors (ORs) expressed in the antennae of parasitoid wasps are responsible for detection of various lipophilic airborne molecules. In the present study, 107 novel OR genes were identified from Microplitis mediator antennal transcriptome data. Phylogenetic analysis of the set of OR genes from M. mediator and Microplitis demolitor revealed that M. mediator OR (MmedOR) genes can be classified into different subfamilies, and the majority of MmedORs in each subfamily shared high sequence identities and clear orthologous relationships to M. demolitor ORs. Within a subfamily, six MmedOR genes, MmedOR98, 124, 125, 126, 131 and 155, shared a similar gene structure and were tightly linked in the genome. To evaluate whether the clustered MmedOR genes share common regulatory features, the transcription profile and expression characteristics of the six closely related OR genes were investigated in M. mediator. Rapid amplification of cDNA ends-PCR experiments revealed that the OR genes within the cluster were transcribed as single mRNAs, and a bicistronic mRNA for two adjacent genes (MmedOR124 and MmedOR98) was also detected in female antennae by reverse transcription PCR. In situ hybridization experiments indicated that each OR gene within the cluster was expressed in a different number of cells. Moreover, there was no co-expression of the two highly related OR genes, MmedOR124 and MmedOR98, which appeared to be individually expressed in a distinct population of neurons. Overall, there were distinct expression profiles of closely related MmedOR genes from the same cluster in M. mediator. These data provide a basic understanding of the olfactory coding in parasitoid wasps. © 2017 The Royal Entomological Society.
Polymorphisms and linkage analysis for ICAM-1 and the selectin gene cluster

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vora, D.K.; Rosenbloom, C.L.; Cottingham, R.W.

1994-06-01

Genetic polymorphisms in leukocyte and endothelial cell adhesion molecules may be important variables with regard to susceptibility to multifactorial disease processes that include an inflammatory component. For this reason, polymorphisms were sought for intercellular adhesion molecule-1 (ICAM-1; gene symbol ICAM1) and for the three genes in the selectin cluster, P-selectin, L-selectin, and E-selectin (gene symbols SELP, SELL, and SELE, respectively). Two amino acid polymorphisms were identified for ICAM-1; Gly or Arg at codon 241 and Lys or Glu at codon 469. Dinucleotide repeat polymorphisms were identified in the 3′-untranslated region for ICAM-1 and in intron 9 for P-selectin. Restriction fragment length polymorphisms were found using cDNAs for each of the three selectin genes as probes; E-selectin with BglII, P-selectin with ScaI, and L-selectin with HincII. Linkage analysis was performed for the selectin gene cluster and for ICAM-1 using the CEPH families; ICAM-1 is very tightly linked to the LDL receptor on chromosome 19, and the selectin cluster is linked to markers at chromosome 1q23. 41 refs., 2 tabs.
Spatial enhancer clustering and regulation of enhancer-proximal genes by cohesin

PubMed Central

Ing-Simmons, Elizabeth; Seitan, Vlad C.; Faure, Andre J.; Flicek, Paul; Carroll, Thomas; Dekker, Job; Fisher, Amanda G.; Lenhard, Boris

2015-01-01

In addition to mediating sister chromatid cohesion during the cell cycle, the cohesin complex associates with CTCF and with active gene regulatory elements to form long-range interactions between its binding sites. Genome-wide chromosome conformation capture has shown that cohesin's main role in interphase genome organization is in mediating interactions within architectural chromosome compartments, rather than specifying compartments per se. However, it remains unclear how cohesin-mediated interactions contribute to the regulation of gene expression. We have found that the binding of CTCF and cohesin is highly enriched at enhancers and in particular at enhancer arrays or “super-enhancers” in mouse thymocytes. Using local and global chromosome conformation capture, we demonstrate that enhancer elements associate not just in linear sequence, but also in 3D, and that spatial enhancer clustering is facilitated by cohesin. The conditional deletion of cohesin from noncycling thymocytes preserved enhancer position, H3K27ac, H3K4me1, and enhancer transcription, but weakened interactions between enhancers. Interestingly, ∼50% of deregulated genes reside in the vicinity of enhancer elements, suggesting that cohesin regulates gene expression through spatial clustering of enhancer elements. We propose a model for cohesin-dependent gene regulation in which spatial clustering of enhancer elements acts as a unified mechanism for both enhancer-promoter “connections” and “insulation.” PMID:25677180
Two Horizontally Transferred Xenobiotic Resistance Gene Clusters Associated with Detoxification of Benzoxazolinones by Fusarium Species

PubMed Central

Glenn, Anthony E.; Davis, C. Britton; Gao, Minglu; Gold, Scott E.; Mitchell, Trevor R.; Proctor, Robert H.; Stewart, Jane E.; Snook, Maurice E.

2016-01-01

Microbes encounter a broad spectrum of antimicrobial compounds in their environments and often possess metabolic strategies to detoxify such xenobiotics. We have previously shown that Fusarium verticillioides, a fungal pathogen of maize known for its production of fumonisin mycotoxins, possesses two unlinked loci, FDB1 and FDB2, necessary for detoxification of antimicrobial compounds produced by maize, including the γ-lactam 2-benzoxazolinone (BOA). In support of these earlier studies, microarray analysis of F. verticillioides exposed to BOA identified the induction of multiple genes at FDB1 and FDB2, indicating the loci consist of gene clusters. One of the FDB1 cluster genes encoded a protein having domain homology to the metallo-β-lactamase (MBL) superfamily. Deletion of this gene (MBL1) rendered F. verticillioides incapable of metabolizing BOA and thus unable to grow on BOA-amended media. Deletion of other FDB1 cluster genes, in particular AMD1 and DLH1, did not affect BOA degradation. Phylogenetic analyses and topology testing of the FDB1 and FDB2 cluster genes suggested two horizontal transfer events among fungi, one being transfer of FDB1 from Fusarium to Colletotrichum, and the second being transfer of the FDB2 cluster from Fusarium to Aspergillus. Together, the results suggest that plant-derived xenobiotics have exerted evolutionary pressure on these fungi, leading to horizontal transfer of genes that enhance fitness or virulence. PMID:26808652
Genome-wide DNA methylation analysis reveals estrogen-mediated epigenetic repression of metallothionein-1 gene cluster in breast cancer.

PubMed

Jadhav, Rohit R; Ye, Zhenqing; Huang, Rui-Lan; Liu, Joseph; Hsu, Pei-Yin; Huang, Yi-Wen; Rangel, Leticia B; Lai, Hung-Cheng; Roa, Juan Carlos; Kirma, Nameer B; Huang, Tim Hui-Ming; Jin, Victor X

2015-01-01

Recent genome-wide analysis has shown that DNA methylation spans long stretches of chromosome regions consisting of clusters of contiguous CpG islands or gene families. Hypermethylation of various gene clusters has been reported in many types of cancer. In this study, we conducted methyl-binding domain capture (MBDCap) sequencing (MBD-seq) analysis on a breast cancer cohort consisting of 77 patients and 10 normal controls, as well as a panel of 38 breast cancer cell lines. Bioinformatics analysis determined seven gene clusters with a significant difference in overall survival (OS) and further revealed a distinct feature that the conservation of a large gene cluster (approximately 70 kb), metallothionein-1 (MT1), among 45 species is much lower than the average of all RefSeq genes. Furthermore, we found that DNA methylation is an important epigenetic regulator contributing to gene repression of the MT1 gene cluster in both ERα positive (ERα+) and ERα negative (ERα-) breast tumors. In silico analysis revealed much lower gene expression of this cluster in The Cancer Genome Atlas (TCGA) cohort for ERα+ tumors. To further investigate the role of estrogen, we conducted 17β-estradiol (E2) and demethylating agent 5-aza-2'-deoxycytidine (DAC) treatment in various breast cancer cell types. Cell proliferation and invasion assays suggested MT1F and MT1M may play an anti-oncogenic role in breast cancer. Our data suggest that DNA methylation in large contiguous gene clusters can be a potential prognostic marker of breast cancer. Further investigation of these clusters revealed that estrogen mediates epigenetic repression of the MT1 cluster in ERα+ breast cancer cell lines. In all, our studies identify thousands of breast tumor hypermethylated regions for the first time, in particular, discovering seven large contiguous hypermethylated gene clusters.
Regulatory Feedback Loop of Two phz Gene Clusters through 5′-Untranslated Regions in Pseudomonas sp. M18

PubMed Central

Li, Yaqian; Du, Xilin; Lu, Zhi John; Wu, Daqiang; Zhao, Yilei; Ren, Bin; Huang, Jiaofang; Huang, Xianqing; Xu, Yuhong; Xu, Yuquan

2011-01-01

Background: Phenazines are important compounds produced by pseudomonads and other bacteria. Two phz gene clusters, called phzA1-G1 and phzA2-G2, were found in the genome of Pseudomonas sp. M18, an effective biocontrol agent that is highly homologous to the opportunistic human pathogen P. aeruginosa PAO1. However, little is known about the correlation between the expression of the two phz gene clusters. Methodology/Principal Findings: Two chromosomal insertion-inactivated mutants, one for each gene cluster, were constructed, and the correlation between the expression of the two phz gene clusters was investigated in strain M18. Phenazine-1-carboxylic acid (PCA) molecules produced from the phzA2-G2 gene cluster are able to auto-regulate expression of that cluster and activate the expression of the phzA1-G1 gene cluster in a circulated amplification pattern. However, the post-transcriptional expression of the phzA1-G1 transcript was blocked principally through the 5′-untranslated region (UTR). In contrast, the phzA2-G2 gene cluster was transcribed to a lesser extent and translated efficiently and was negatively regulated by the GacA signal transduction pathway, mainly at a post-transcriptional level. Conclusions/Significance: A single molecule, PCA, produced in different quantities by the two phz gene clusters acted as the functional mediator, and the two phz gene clusters developed a specific regulatory mechanism which acts through the 5′-UTR to transfer a single, but complex bacterial signaling event in Pseudomonas sp. strain M18. PMID:21559370
Investigation of miR-136-5p key target genes and pathways in lung squamous cell cancer based on TCGA database and bioinformatics analysis.

PubMed

Xie, Zu-Cheng; Li, Tian-Tian; Gan, Bin-Liang; Gao, Xiang; Gao, Li; Chen, Gang; Hu, Xiao-Hua

2018-05-01

Lung squamous cell cancer (LUSC) is a common but challenging malignancy. It is important to illuminate the molecular mechanism of LUSC. Thus, we aim to explore the molecular mechanism of miR-136-5p in relation to LUSC. We used the Cancer Genome Atlas (TCGA) database to investigate the expression of miR-136-5p in relation to LUSC. Then, we identified the possible miR-136-5p target genes through intersection of the predicted miR-136-5p target genes and LUSC upregulated genes from TCGA. Bioinformatics analysis was performed to determine the key miR-136-5p targets and pathways associated with LUSC. Finally, the expression of hub genes, correlation between miR-136-5p and hub genes, and expected significance of hub genes were evaluated via the TCGA and Genotype-Tissue Expression (GTEx) project. MiR-136-5p was significantly downregulated in LUSC patients. Glucuronidation, glucuronosyltransferase, and the retinoic acid metabolic process were the most enriched metabolic interactions in LUSC patients. Ascorbate and aldarate metabolism, pentose and glucuronate interconversions, and retinol metabolism were identified as crucial pathways. Seven hub genes (UGT1A1, UGT1A3, UGT1A6, UGT1A7, UGT1A10, SRD5A1, and ADH7) were found to be upregulated, and UGT1A1, UGT1A3, UGT1A6, UGT1A7, and ADH7 were negatively correlated with miR-136-5p. UGT1A7 and ADH7 were the most significantly involved miR-136-5p target genes, and high expression of these genes was correlated with better overall survival and disease-free survival of LUSC patients. Downregulated miR-136-5p may target UGT1A7 and ADH7 and participate in ascorbate and aldarate metabolism, pentose and glucuronate interconversions, and retinol metabolism. High expression of UGT1A7 and ADH7 may indicate better prognosis of LUSC patients. Copyright © 2018. Published by Elsevier GmbH.
Performance Assessment of Kernel Density Clustering for Gene Expression Profile Data

PubMed Central

Zeng, Beiyan; Chen, Yiping P.; Smith, Oscar H.

2003-01-01

Kernel density smoothing techniques have been used in classification or supervised learning of gene expression profile (GEP) data, but their applications to clustering or unsupervised learning of those data have not been explored and assessed. Here we report a kernel density clustering method for analysing GEP data and compare its performance with the three most widely-used clustering methods: hierarchical clustering, K-means clustering, and multivariate mixture model-based clustering. Using several methods to measure agreement, between-cluster isolation, and within-cluster coherence, such as the Adjusted Rand Index, the Pseudo F test, the r2 test, and the profile plot, we have assessed the effectiveness of kernel density clustering for recovering clusters, and its robustness against noise on clustering both simulated and real GEP data. Our results show that the kernel density clustering method has excellent performance in recovering clusters from simulated data and in grouping large real expression profile data sets into compact and well-isolated clusters, and that it is the most robust clustering method for analysing noisy expression profile data compared to the other three methods assessed. PMID:18629292
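The abstract above names the Adjusted Rand Index as one of its agreement measures. As a minimal illustrative sketch (not the authors' code), the index can be computed from two label vectors in the standard Hubert-Arabie form; the function name `adjusted_rand_index` and the toy labels are assumptions made for this example only.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Chance-corrected agreement between two partitions of the same items.

    Hypothetical helper for illustration; returns 1.0 for identical
    partitions (up to label renaming) and about 0 for random agreement.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("label vectors must have the same length")
    n = len(labels_a)
    if n < 2:
        return 1.0  # no pairs to disagree on

    # Contingency counts: cluster pairs, and each partition's cluster sizes.
    joint = Counter(zip(labels_a, labels_b))
    sizes_a = Counter(labels_a)
    sizes_b = Counter(labels_b)

    # Number of item pairs placed together in both / in each partition.
    together_both = sum(comb(c, 2) for c in joint.values())
    together_a = sum(comb(c, 2) for c in sizes_a.values())
    together_b = sum(comb(c, 2) for c in sizes_b.values())

    expected = together_a * together_b / comb(n, 2)
    maximum = (together_a + together_b) / 2
    if maximum == expected:
        return 1.0  # degenerate case, e.g. both partitions are one cluster
    return (together_both - expected) / (maximum - expected)


# Toy comparison of a clustering result against known groups.
truth = [0, 0, 1, 1]
renamed = [1, 1, 0, 0]   # same grouping, different label names
crossed = [0, 1, 0, 1]   # every true pair is split
print(adjusted_rand_index(truth, renamed))
print(adjusted_rand_index(truth, crossed))
```

Because the index depends only on which items are grouped together, renaming the cluster labels leaves it unchanged, which is why it suits comparisons between different clustering methods.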
Arabidopsis gene expression patterns during spaceflight

NASA Astrophysics Data System (ADS)

Paul, A.-L.; Ferl, R. J.

The exposure of Arabidopsis thaliana (Arabidopsis) plants to spaceflight environments resulted in the differential expression of hundreds of genes. A 5 day mission on orbiter Columbia in 1999 (STS-93) carried transgenic Arabidopsis plants engineered with a transgene composed of the alcohol dehydrogenase (Adh) gene promoter linked to the β-glucuronidase (GUS) reporter gene. The plants were used to evaluate the effects of spaceflight on two fronts. First, expression patterns visualized with the Adh/GUS transgene were used to address specifically the possibility that spaceflight induces a hypoxic stress response, and to assess whether any spaceflight response was similar to control terrestrial hypoxia-induced gene expression patterns (Paul et al., Plant Physiol. 2001, 126:613). Second, genome-wide patterns of native gene expression were evaluated utilizing the Affymetrix ATH1 GeneChip® array of 8,000 Arabidopsis genes. As a control for the veracity of the array analyses, a selection of genes identified with the arrays was further characterized with quantitative real-time RT-PCR (ABI TaqMan™). Comparison of the patterns of expression for arrays hybridized with RNA isolated from plants exposed to spaceflight with those of the control arrays revealed hundreds of genes that were differentially expressed in response to spaceflight, yet most genes that are hallmarks of hypoxic stress were unaffected. These results will be discussed in light of current models for plant responses to the spaceflight environment, and with regard to potential future flight opportunities.
Evolution of coding and non-coding genes in HOX clusters of a marsupial.

PubMed

Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

2012-06-18

The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial

PubMed Central

2012-01-01

Background: The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results: Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions: This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
Identification of the Regulator Gene Responsible for the Acetone-Responsive Expression of the Binuclear Iron Monooxygenase Gene Cluster in Mycobacteria

PubMed Central

Furuya, Toshiki; Hirose, Satomi; Semba, Hisashi; Kino, Kuniki

2011-01-01

The mimABCD gene cluster encodes the binuclear iron monooxygenase that oxidizes propane and phenol in Mycobacterium smegmatis strain MC2 155 and Mycobacterium goodii strain 12523. Interestingly, expression of the mimABCD gene cluster is induced by acetone. In this study, we investigated the regulator gene responsible for this acetone-responsive expression. In the genome sequence of M. smegmatis strain MC2 155, the mimABCD gene cluster is preceded by a gene designated mimR, which is divergently transcribed. Sequence analysis revealed that MimR exhibits amino acid similarity with the NtrC family of transcriptional activators, including AcxR and AcoR, which are involved in acetone and acetoin metabolism, respectively. Unexpectedly, many homologs of the mimR gene were also found in the sequenced genomes of actinomycetes. A plasmid carrying a transcriptional fusion of the intergenic region between the mimR and mimA genes with a promoterless green fluorescent protein (GFP) gene was constructed and introduced into M. smegmatis strain MC2 155. Using a GFP reporter system, we confirmed by deletion and complementation analyses that the mimR gene product is the positive regulator of the mimABCD gene cluster expression that is responsive to acetone. M. goodii strain 12523 also utilized the same regulatory system as M. smegmatis strain MC2 155. Although transcriptional activators of the NtrC family generally control transcription using the σ54 factor, a gene encoding the σ54 factor was absent from the genome sequence of M. smegmatis strain MC2 155. These results suggest the presence of a novel regulatory system in actinomycetes, including mycobacteria. PMID:21856847
The Glucuronic Acid Utilization Gene Cluster from Bacillus stearothermophilus T-6

PubMed Central

Shulami, Smadar; Gat, Orit; Sonenshein, Abraham L.; Shoham, Yuval

1999-01-01

A λ-EMBL3 genomic library of Bacillus stearothermophilus T-6 was screened for hemicellulolytic activities, and five independent clones exhibiting β-xylosidase activity were isolated. The clones overlap each other and together represent a 23.5-kb chromosomal segment. The segment contains a cluster of xylan utilization genes, which are organized in at least three transcriptional units. These include the gene for the extracellular xylanase, xylanase T-6; part of an operon coding for an intracellular xylanase and a β-xylosidase; and a putative 15.5-kb-long transcriptional unit, consisting of 12 genes involved in the utilization of α-d-glucuronic acid (GlcUA). The first four genes in the potential GlcUA operon (orf1, -2, -3, and -4) code for a putative sugar transport system with characteristic components of the binding-protein-dependent transport systems. The most likely natural substrate for this transport system is aldotetraouronic acid [2-O-α-(4-O-methyl-α-d-glucuronosyl)-xylotriose] (MeGlcUAXyl3). The following two genes code for an intracellular α-glucuronidase (aguA) and a β-xylosidase (xynB). Five more genes (kdgK, kdgA, uxaC, uxuA, and uxuB) encode proteins that are homologous to enzymes involved in galacturonate and glucuronate catabolism. The gene cluster also includes a potential regulatory gene, uxuR, the product of which resembles repressors of the GntR family. The apparent transcriptional start point of the cluster was determined by primer extension analysis and is located 349 bp from the initial ATG codon. The potential operator site is a perfect 12-bp inverted repeat located downstream from the promoter between nucleotides +170 and +181. Gel retardation assays indicated that UxuR binds specifically to this sequence and that this binding is efficiently prevented in vitro by MeGlcUAXyl3, the most likely molecular inducer. PMID:10368143
Distribution of Suicin Gene Clusters in Streptococcus suis Serotype 2 Belonging to Sequence Types 25 and 28.

PubMed

Athey, Taryn B T; Vaillancourt, Katy; Frenette, Michel; Fittipaldi, Nahuel; Gottschalk, Marcelo; Grenier, Daniel

2016-01-01

Recently, we reported the purification and characterization of three distinct lantibiotics (named suicin 90-1330, suicin 3908, and suicin 65) produced by Streptococcus suis. In this study, we investigated the distribution of the three suicin lantibiotic gene clusters among serotype 2 S. suis strains belonging to sequence type (ST) 25 and ST28, the two dominant STs identified in North America. The genomes of 102 strains were interrogated for the presence of suicin gene clusters encoding suicins 90-1330, 3908, and 65. The gene cluster encoding suicin 65 was the most prevalent and mainly found among ST25 strains. In contrast, none of the genes related to suicin 90-1330 production were identified in 51 ST25 strains nor in 35/51 ST28 strains. However, the complete suicin 90-1330 gene cluster was found in ten ST28 strains, although some genes in the cluster were truncated in three of these isolates. The vast majority (101/102) of S. suis strains did not possess any of the genes encoding suicin 3908. In conclusion, this study indicates heterogeneous distribution of suicin genes in S. suis.

Form gene clustering method about pan-ethnic-group products based on emotional semantic

NASA Astrophysics Data System (ADS)

Chen, Dengkai; Ding, Jingjing; Gao, Minzhuo; Ma, Danping; Liu, Donghui

2016-09-01

The use of form knowledge for pan-ethnic-group products depends primarily on a designer's subjective experience, without user participation. Most studies focus on detecting the perceptual demands of consumers for the target product category. This paper constructs a form gene clustering method for pan-ethnic-group products based on emotional semantics. Consumers' perceptual images of pan-ethnic-group products are obtained by means of product form gene extraction and coding and computer-aided product form clustering technology. A case study of form gene clustering for typical pan-ethnic-group products indicates that the method is feasible. The paper opens up a new direction for product form design that improves the agility of the product design process in the era of Industry 4.0.
Intact cluster and chordate-like expression of ParaHox genes in a sea star

PubMed Central

2013-01-01

Background: The ParaHox genes are thought to be major players in patterning the gut of several bilaterian taxa. Though this is a fundamental role that these transcription factors play, their activities are not limited to the endoderm and extend to both ectodermal and mesodermal tissues. Three genes compose the ParaHox group: Gsx, Xlox and Cdx. In some taxa (mostly chordates but to some degree also in protostomes) the three genes are arranged into a genomic cluster, in a similar fashion to what has been shown for the better-known Hox genes. Sea urchins possess the full complement of ParaHox genes but they are all dispersed throughout the genome, an arrangement that, perhaps, represented the primitive condition for all echinoderms. In order to understand the evolutionary history of this group of genes we cloned and characterized all ParaHox genes, studied their expression patterns and identified their genomic loci in a member of an earlier branching group of echinoderms, the asteroid Patiria miniata. Results: We identified the three ParaHox orthologs in the genome of P. miniata. While one of them, PmGsx, is provided as maternal message, with no zygotic activation afterwards, the other two, PmLox and PmCdx, are expressed during embryogenesis, within restricted domains of both endoderm and ectoderm. Screening of a Patiria bacterial artificial chromosome (BAC) library led to the identification of a clone containing the three genes. The transcriptional directions of PmGsx and PmLox are opposed to that of the PmCdx gene within the cluster. Conclusions: The identification of P. miniata ParaHox genes has revealed the fact that these genes are clustered in the genome, in contrast to what has been reported for echinoids. Since the presence of an intact cluster, or at least a partial cluster, has been reported in chordates and polychaetes respectively, it becomes clear that within echinoderms, sea urchins have modified the original bilaterian arrangement. Moreover, the sea star
The p.Leu167del Mutation in APOE Gene Causes Autosomal Dominant Hypercholesterolemia by Down-regulation of LDL Receptor Expression in Hepatocytes.

PubMed

Cenarro, Ana; Etxebarria, Aitor; de Castro-Orós, Isabel; Stef, Marianne; Bea, Ana M; Palacios, Lourdes; Mateo-Gallego, Rocío; Benito-Vicente, Asier; Ostolaza, Helena; Tejedor, Teresa; Martín, César; Civeira, Fernando

2016-05-01

The p.Leu167del mutation in the APOE gene has been associated with hyperlipidemia. Our objective was to determine the frequency of the p.Leu167del mutation in the APOE gene in subjects with autosomal dominant hypercholesterolemia (ADH) in whom LDLR, APOB, and PCSK9 mutations had been excluded and to identify the mechanisms by which this mutant apo E causes hypercholesterolemia. The APOE gene was analyzed in a case-control study conducted at a University Hospital Lipid Clinic. Two groups (ADH, 288 patients; control, 220 normolipidemic subjects) were included. We performed sequencing of the APOE gene and proteomic and cellular experiments. The main outcome measures were the frequency of the p.Leu167del mutation and the mechanism by which it causes hypercholesterolemia. In the ADH group, nine subjects (3.1%) were carriers of the APOE c.500_502delTCC, p.Leu167del mutation, cosegregating with hypercholesterolemia in studied families. Proteomic quantification of wild-type and mutant apo E in very low-density lipoprotein (VLDL) from carrier subjects revealed that apo E3 was almost 5-fold more abundant than mutant apo E. Cultured cell studies revealed that VLDL from mutation carriers had a significantly higher uptake by HepG2 and THP-1 cells compared to VLDL from subjects with E3/E3 or E2/E2 genotypes. Transcriptional down-regulation of LDLR was also confirmed. The p.Leu167del mutation in the APOE gene is the cause of hypercholesterolemia in 3.1% of our ADH subjects without LDLR, APOB, and PCSK9 mutations. The mechanism by which this mutation is associated with ADH is that VLDL carrying the mutant apo E produces LDLR down-regulation, thereby raising plasma low-density lipoprotein cholesterol levels.
Comparison of expression of secondary metabolite biosynthesis cluster genes in Aspergillus flavus, A. parasiticus, and A. oryzae.

PubMed

Ehrlich, Kenneth C; Mack, Brian M

2014-06-23

Fifty-six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity.
Comparison of Expression of Secondary Metabolite Biosynthesis Cluster Genes in Aspergillus flavus, A. parasiticus, and A. oryzae

PubMed Central

Ehrlich, Kenneth C.; Mack, Brian M.

2014-01-01

Fifty-six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity. PMID:24960201
A recently transferred cluster of bacterial genes in Trichomonas vaginalis - lateral gene transfer and the fate of acquired genes

PubMed Central

2014-01-01

Background: Lateral Gene Transfer (LGT) has recently gained recognition as an important contributor to some eukaryote proteomes, but the mechanisms of acquisition and fixation in eukaryotic genomes are still uncertain. A previously defined norm for LGTs in microbial eukaryotes states that the majority are genes involved in metabolism, the LGTs are typically localized one by one, surrounded by vertically inherited genes on the chromosome, and phylogenetics shows that a broad collection of bacterial lineages have contributed to the transferome. Results: A unique 34 kbp long fragment with 27 clustered genes (TvLF) of prokaryote origin was identified in the sequenced genome of the protozoan parasite Trichomonas vaginalis. Using a PCR based approach we confirmed the presence of the orthologous fragment in four additional T. vaginalis strains. Detailed sequence analyses unambiguously suggest that TvLF is the result of one single, recent LGT event. The proposed donor is a close relative to the firmicute bacterium Peptoniphilus harei. High nucleotide sequence similarity between T. vaginalis strains, as well as to P. harei, and the absence of homologs in other Trichomonas species, suggests that the transfer event took place after the radiation of the genus Trichomonas. Some genes have undergone pseudogenization and degradation, indicating that they may not be retained in the future. Functional annotations reveal that genes involved in informational processes are particularly prone to degradation. Conclusions: We conclude that, although the majority of eukaryote LGTs are single gene occurrences, they may be acquired in clusters of several genes that are subsequently cleansed of evolutionarily less advantageous genes. PMID:24898731
Clustering by soft-constraint affinity propagation: applications to gene-expression data.

PubMed

Leone, Michele; Sumedha; Weigt, Martin

2007-10-15

Similarity-measure-based clustering is a crucial problem appearing throughout scientific data analysis. Recently, a powerful new algorithm called Affinity Propagation (AP) based on message-passing techniques was proposed by Frey and Dueck (2007a). In AP, each cluster is identified by a common exemplar to which all other data points of the same cluster refer, and exemplars have to refer to themselves. Despite its proven power, AP in its present form suffers from a number of drawbacks. The hard constraint of having exactly one exemplar per cluster restricts AP to classes of regularly shaped clusters, and leads to suboptimal performance, e.g. in analyzing gene expression data. This limitation can be overcome by relaxing the AP hard constraints. A new parameter controls the importance of the constraints compared to the aim of maximizing the overall similarity, and allows interpolation between the simple case where each data point selects its closest neighbor as an exemplar and the original AP. The resulting soft-constraint affinity propagation (SCAP) becomes more informative and accurate, and leads to more stable clustering. Even though a new a priori free parameter is introduced, the overall dependence of the algorithm on external tuning is reduced, as robustness is increased and an optimal strategy for parameter selection emerges more naturally. SCAP is tested on biological benchmark data, including in particular microarray data related to various cancer types. We show that the algorithm efficiently unveils the hierarchical cluster structure present in the data sets. Furthermore, it allows sparse gene expression signatures to be extracted for each cluster.
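To make the exemplar idea in the abstract above concrete, the following is a rough sketch of the original hard-constraint Affinity Propagation message passing (responsibility and availability updates with damping), not the soft-constraint SCAP variant the record describes. The function names, the negative squared distance similarity, the preference value, the damping factor, and the toy points are all assumptions chosen for illustration.

```python
def similarity_matrix(points, preference):
    """Negative squared distance between 1-D points; the diagonal holds
    the preference (how willing each point is to be an exemplar)."""
    n = len(points)
    return [[preference if i == k else -(points[i] - points[k]) ** 2
             for k in range(n)] for i in range(n)]


def affinity_propagation(S, damping=0.7, iterations=300):
    """Return, for each point, the index of the exemplar it refers to."""
    n = len(S)
    if n < 2:
        return list(range(n))
    R = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)
    for _ in range(iterations):
        # r(i,k) <- s(i,k) - max over k' != k of [a(i,k') + s(i,k')]
        for i in range(n):
            scores = [A[i][k] + S[i][k] for k in range(n)]
            best = max(range(n), key=scores.__getitem__)
            first = scores[best]
            second = max(s for k, s in enumerate(scores) if k != best)
            for k in range(n):
                rival = second if k == best else first
                R[i][k] = damping * R[i][k] + (1 - damping) * (S[i][k] - rival)
        # a(i,k) <- min(0, r(k,k) + sum of positive r(i',k), i' not in {i,k})
        # a(k,k) <- sum of positive r(i',k), i' != k
        for k in range(n):
            support = [max(0.0, R[i][k]) for i in range(n)]
            support[k] = R[k][k]  # self-responsibility is not clipped
            total = sum(support)
            for i in range(n):
                if i == k:
                    new = total - R[k][k]
                else:
                    new = min(0.0, total - support[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new
    # Each point refers to the candidate maximizing a(i,k) + r(i,k).
    return [max(range(n), key=lambda k, i=i: A[i][k] + R[i][k])
            for i in range(n)]


# Two well-separated toy groups (assumed data, not from the paper).
points = [0.0, 1.0, 2.5, 10.0, 11.0, 12.5]
exemplars = affinity_propagation(similarity_matrix(points, preference=-10.0))
print(exemplars)
```

In this hard-constraint form every point ends up referring to exactly one exemplar, which is the restriction the abstract says SCAP relaxes through an additional parameter.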
Identification of new genes in a cell envelope-cell division gene cluster of Escherichia coli: cell envelope gene murG.

PubMed Central

Salmond, G P; Lutkenhaus, J F; Donachie, W D

1980-01-01

We report the identification, cloning, and mapping of a new cell envelope gene, murG. This lies in a group of five genes of similar phenotype (in the order murE murF murG murC ddl) all concerned with peptidoglycan biosynthesis. This group is in a larger cluster of at least 10 genes, all of which are involved in some way with cell envelope growth. PMID:6998962
antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters.

PubMed

Weber, Tilmann; Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko; Medema, Marnix H

2015-07-01

Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Organization of the Escherichia coli K-12 gene cluster responsible for production of the extracellular polysaccharide colanic acid.

PubMed Central

Stevenson, G; Andrianopoulos, K; Hobbs, M; Reeves, P R

1996-01-01

Colanic acid (CA) is an extracellular polysaccharide produced by most Escherichia coli strains as well as by other species of the family Enterobacteriaceae. We have determined the sequence of a 23-kb segment of the E. coli K-12 chromosome which includes the cluster of genes necessary for production of CA. The CA cluster comprises 19 genes. Two other sequenced genes (orf1.3 and galF), which are situated between the CA cluster and the O-antigen cluster, were shown to be unnecessary for CA production. The CA cluster includes genes for synthesis of GDP-L-fucose, one of the precursors of CA, and the gene for one of the enzymes in this pathway (GDP-D-mannose 4,6-dehydratase) was identified by biochemical assay. Six of the inferred proteins show sequence similarity to glycosyl transferases, and two others have sequence similarity to acetyl transferases. Another gene (wzx) is predicted to encode a protein with multiple transmembrane segments and may function in export of the CA repeat unit from the cytoplasm into the periplasm in a process analogous to O-unit export. The first three genes of the cluster are predicted to encode an outer membrane lipoprotein, a phosphatase, and an inner membrane protein with an ATP-binding domain. Since homologs of these genes are found in other extracellular polysaccharide gene clusters, they may have a common function, such as export of polysaccharide from the cell. PMID:8759852
Characterization of a Major Cluster of nif, fix, and Associated Genes in a Sugarcane Endophyte, Acetobacter diazotrophicus

PubMed Central

Lee, Sunhee; Reth, Alexander; Meletzus, Dietmar; Sevilla, Myrna; Kennedy, Christina

2000-01-01

A major 30.5-kb cluster of nif and associated genes of Acetobacter diazotrophicus (syn. Gluconacetobacter diazotrophicus), a nitrogen-fixing endophyte of sugarcane, was sequenced and analyzed. This cluster represents the largest assembly of contiguous nif-fix and associated genes so far characterized in any diazotrophic bacterial species. Northern blots and promoter sequence analysis indicated that the genes are organized into eight transcriptional units. The overall arrangement of genes is most like that of the nif-fix cluster in Azospirillum brasilense, while the individual gene products are more similar to those in species of Rhizobiaceae or in Rhodobacter capsulatus. PMID:11092875
Activation and comparative analysis of cryptic xiamycin gene cluster from marine-derived Streptomyces sp. FXJ 7.388.

PubMed

Lü, Yuhong; Liu, Xiaoli; Wang, Miao; Li, Yuanyuan; Liu, Ning; Bao, Yuxin; Liu, Minghao; Li, Xiaoqian; Wang, Yinyin; Qian, Shenyan; Yue, Changwu; Huang, Ying

2016-09-01

This study aimed to obtain the natural products synthesized by the three putative xiamycin biosynthesis gene clusters that were predicted via antiSMASH during the genome mining of marine Streptomyces sp. FXJ 7.388, Streptomyces sp. FXJ 8.012, and Streptomyces olivaceus FXJ 7.023. Sixteen genes involved in xiamycin assembly, modification, and regulation with higher identity than the newest reported xiamycin biosynthetic gene cluster from marine Streptomyces sp. SCSIO 02999, Streptomyces sp. HKI0576, and Streptomyces sp. FXJ 7.388 were discovered via gene cluster comparative analysis. A ribosome engineering strategy was adopted to activate such cryptic gene clusters with different final concentrations of antibiotics that act on the ribosome, and two indolosesquiterpenes were isolated from idlethaldose streptomycin-resistant Streptomyces sp. FXJ 7.388 strains. However, no such product was detected in Streptomyces sp. FXJ 8.012 and Streptomyces olivaceus FXJ 7.023 under the same treatment. This result suggested that these genes might represent the minimal gene content required for xiamycin biosynthesis.
Genomic organization of the rat alpha 2u-globulin gene cluster.

PubMed

McFadyen, D A; Addison, W; Locke, J

1999-05-01

The alpha 2u-globulins are a group of similar proteins, belonging to the lipocalin superfamily of proteins, that are synthesized in a subset of secretory tissues in rats. The many alpha 2u-globulin isoforms are encoded by a multigene family that exhibits extensive homology. Despite a high degree of sequence identity, individual family members show diverse expression patterns involving complex hormonal, tissue-specific, and developmental regulation. Analysis suggests that there are approximately 20 alpha 2u-globulin genes in the rat genome. We have used fluorescence in situ hybridization (FISH) to show that the alpha 2u-globulin genes are clustered at a single site on rat Chromosome (Chr) 5 (5q22-24). Southern blots of rat genomic DNA separated by pulsed field gel electrophoresis indicated that the alpha 2u-globulin genes are contained on two NruI fragments with a total size of 880 kbp. Analysis of three P1 clones containing alpha 2u-globulin genes indicated that the alpha 2u-globulin genes are tandemly arranged in a head-to-tail fashion. The organization of the alpha 2u-globulin genes in the rat as a tandem array of single genes differs from the homologous major urinary protein genes in the mouse, which are organized as tandem arrays of divergently oriented gene pairs. The structure of these gene clusters may have consequences for the proposed function, as a pheromone transporter, for the protein products encoded by these genes.
Inference from clustering with application to gene-expression microarrays.

PubMed

Dougherty, Edward R; Barrera, Junior; Brun, Marcel; Kim, Seungchan; Cesar, Roberto M; Chen, Yidong; Bittner, Michael; Trent, Jeffrey M

2002-01-01

There are many algorithms to cluster sample data points based on nearness or a similarity measure. Often the implication is that points in different clusters come from different underlying classes, whereas those in the same cluster come from the same class. Stochastically, the underlying classes represent different random processes. The inference is that clusters represent a partition of the sample points according to which process they belong. This paper discusses a model-based clustering toolbox that evaluates cluster accuracy. Each random process is modeled as its mean plus independent noise, sample points are generated, the points are clustered, and the clustering error is the number of points clustered incorrectly according to the generating random processes. Various clustering algorithms are evaluated based on process variance and the key issue of the rate at which algorithmic performance improves with increasing numbers of experimental replications. The model means can be selected by hand to test the separability of expected types of biological expression patterns. Alternatively, the model can be seeded by real data to test the expected precision of that output or the extent of improvement in precision that replication could provide. In the latter case, a clustering algorithm is used to form clusters, and the model is seeded with the means and variances of these clusters. Other algorithms are then tested relative to the seeding algorithm. Results are averaged over various seeds. Output includes error tables and graphs, confusion matrices, principal-component plots, and validation measures. Five algorithms are studied in detail: K-means, fuzzy C-means, self-organizing maps, hierarchical Euclidean-distance-based and correlation-based clustering. The toolbox is applied to gene-expression clustering based on cDNA microarrays using real data. Expression profile graphics are generated and error analysis is displayed within the context of these profile graphics. A
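The evaluation loop described in this abstract (model each process as a mean plus independent noise, generate points, cluster them, count misclustered points) can be sketched as follows. This is a minimal illustration and not the authors' toolbox. The function names, the Gaussian noise model, the plain k-means stand-in, and the example means are assumptions.

```python
import itertools
import random


def generate(means, n_per_class, sigma, rng):
    """Draw n_per_class points per process: mean profile plus independent Gaussian noise."""
    points, labels = [], []
    for label, mean in enumerate(means):
        for _ in range(n_per_class):
            points.append([m + rng.gauss(0.0, sigma) for m in mean])
            labels.append(label)
    return points, labels


def sqdist(a, b):
    """Squared Euclidean distance between two profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, rng, iters=50):
    """Plain k-means (a stand-in for any algorithm under test); returns cluster ids."""
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k), key=lambda c: sqdist(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign


def clustering_error(true_labels, assign, k):
    """Misclustered points under the best matching of cluster ids to generating processes."""
    best = len(true_labels)
    for perm in itertools.permutations(range(k)):
        errors = sum(1 for t, a in zip(true_labels, assign) if perm[a] != t)
        best = min(best, errors)
    return best


rng = random.Random(0)
means = [[0.0, 0.0, 0.0], [10.0, 10.0, 10.0]]  # hypothetical expression profiles
points, labels = generate(means, n_per_class=20, sigma=0.5, rng=rng)
assign = kmeans(points, k=2, rng=rng)
print("misclustered points:", clustering_error(labels, assign, k=2))
```

Raising `sigma` or moving the means closer together, then averaging the error over several seeds, would mimic the study of how performance changes with process variance and replication.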
Identification of a G‐Protein Subunit‐α11 Gain‐of‐Function Mutation, Val340Met, in a Family With Autosomal Dominant Hypocalcemia Type 2 (ADH2)

PubMed Central

Piret, Sian E; Gorvin, Caroline M; Pagnamenta, Alistair T; Howles, Sarah A; Cranston, Treena; Rust, Nigel; Nesbit, M Andrew; Glaser, Ben; Taylor, Jenny C; Buchs, Andreas E; Hannan, Fadil M

2016-01-01

ABSTRACT Autosomal dominant hypocalcemia (ADH) is characterized by hypocalcemia, inappropriately low serum parathyroid hormone concentrations and hypercalciuria. ADH is genetically heterogeneous with ADH type 1 (ADH1), the predominant form, being caused by germline gain‐of‐function mutations of the G‐protein coupled calcium‐sensing receptor (CaSR), and ADH2 caused by germline gain‐of‐function mutations of G‐protein subunit α‐11 (Gα11). To date Gα11 mutations causing ADH2 have been reported in only five probands. We investigated a multigenerational nonconsanguineous family, from Iran, with ADH and keratoconus which are not known to be associated, for causative mutations by whole‐exome sequencing in two individuals with hypoparathyroidism, of whom one also had keratoconus, followed by cosegregation analysis of variants. This identified a novel heterozygous germline Val340Met Gα11 mutation in both individuals, and this was also present in the other two relatives with hypocalcemia that were tested. Three‐dimensional modeling revealed the Val340Met mutation to likely alter the conformation of the C‐terminal α5 helix, which may affect G‐protein coupled receptor binding and G‐protein activation. In vitro functional expression of wild‐type (Val340) and mutant (Met340) Gα11 proteins in HEK293 cells stably expressing the CaSR, demonstrated that the intracellular calcium responses following stimulation with extracellular calcium, of the mutant Met340 Gα11 led to a leftward shift of the concentration‐response curve with a significantly (p < 0.0001) reduced mean half‐maximal concentration (EC50) value of 2.44 mM (95% CI, 2.31 to 2.77 mM) when compared to the wild‐type EC50 of 3.14 mM (95% CI, 3.03 to 3.26 mM), consistent with a gain‐of‐function mutation. A novel His403Gln variant in transforming growth factor, beta‐induced (TGFBI), that may be causing keratoconus was also identified, indicating likely digenic
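The leftward shift reported for the concentration-response curve can be visualized with a small numerical sketch. It assumes a Hill-type curve with a Hill coefficient of 1, which is an illustrative assumption and not the model fitted in the study. Only the two EC50 values are taken from the abstract, and the function name is hypothetical.

```python
def hill_response(conc_mM, ec50_mM, hill_coefficient=1.0):
    """Fractional response (0 to 1) under an assumed Hill-type model."""
    c = conc_mM ** hill_coefficient
    return c / (ec50_mM ** hill_coefficient + c)


WILD_TYPE_EC50 = 3.14  # mM, wild-type Val340 value quoted in the abstract
MUTANT_EC50 = 2.44     # mM, mutant Met340 value quoted in the abstract

# Compare the two assumed curves at a few extracellular calcium concentrations.
for conc in (1.0, 2.44, 3.14, 5.0):
    wt = hill_response(conc, WILD_TYPE_EC50)
    mut = hill_response(conc, MUTANT_EC50)
    print(f"{conc:5.2f} mM  wild-type {wt:.2f}  Met340 {mut:.2f}")
```

Under this assumed model, a lower EC50 gives a larger response at every concentration, which is the sense in which a leftward shift is consistent with a gain of function.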
Clavine Alkaloids Gene Clusters of Penicillium and Related Fungi: Evolutionary Combination of Prenyltransferases, Monooxygenases and Dioxygenases

PubMed Central

Martín, Juan F.; Liras, Paloma

2017-01-01

The clavine alkaloids produced by the fungi of the Aspergillaceae and Arthrodermataceae families differ from the ergot alkaloids produced by Claviceps and Neotyphodium. The clavine alkaloids lack the extensive peptide chain modifications that occur in lysergic acid derived ergot alkaloids. Both clavine and ergot alkaloids arise from the condensation of tryptophan and dimethylallylpyrophosphate by the action of the dimethylallyltryptophan synthase. The first five steps of the biosynthetic pathway that convert tryptophan and dimethylallyl-pyrophosphate (DMA-PP) into chanoclavine-1-aldehyde are common to both clavine and ergot alkaloids. The biosynthesis of ergot alkaloids has been extensively studied and is not considered in this article. We focus this review on recent advances in the gene clusters for clavine alkaloids in the species of Penicillium, Aspergillus (Neosartorya), Arthroderma and Trichophyton and the enzymes encoded by them. The final products of the clavine alkaloids pathways derive from the tetracyclic ergoline ring, which is modified by late enzymes, including a reverse type prenyltransferase, P450 monooxygenases and acetyltransferases. In Aspergillus japonicus, an α-ketoglutarate and Fe2+-dependent dioxygenase is involved in the cyclization of a festuclavine-like unknown type intermediate into cycloclavine. Related dioxygenases occur in the biosynthetic gene clusters of ergot alkaloids in Claviceps purpurea and also in the clavine clusters in Penicillium species. The final products of the clavine alkaloid pathway in these fungi differ from each other depending on the late biosynthetic enzymes involved. An important difference between clavine and ergot alkaloid pathways is that clavine producers lack the enzyme CloA, a P450 monooxygenase, involved in one of the steps of the conversion of chanoclavine-1-aldehyde into lysergic acid. Bioinformatic analysis of the sequenced genomes of the Aspergillaceae and Arthrodermataceae fungi showed the presence of
Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in xanthomonas.

PubMed

Lu, Hong; Patil, Prabhu; Van Sluys, Marie-Anne; White, Frank F; Ryan, Robert P; Dow, J Maxwell; Rabinowicz, Pablo; Salzberg, Steven L; Leach, Jan E; Sonti, Ramesh; Brendel, Volker; Bogdanove, Adam J

2008-01-01

Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors) cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale exchange of clusters, but subtle changes in a small number of genes or
A novel harmony search-K means hybrid algorithm for clustering gene expression data

PubMed Central

Nazeer, KA Abdul; Sebastian, MP; Kumar, SD Madhu

2013-01-01

Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze a large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k-means clustering algorithm is widely used for many practical applications. But the original k-means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms. PMID:23390351
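The dependence of k-means on its initial centroids can be seen in a small sketch. It runs plain Lloyd-style k-means from two hand-picked starting points. It does not implement harmony search or the HSKH hybrid, and the function name and the one-dimensional toy data are assumptions for illustration.

```python
def lloyd(points, centers, iters=100):
    """Plain k-means on 1-D data from given initial centers; returns (centers, SSE)."""
    centers = list(centers)
    for _ in range(iters):
        # Assignment step: each point joins its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda c: (p - centers[c]) ** 2)
            clusters[nearest].append(p)
        # Update step: each center moves to the mean of its members.
        new_centers = [sum(m) / len(m) if m else centers[i]
                       for i, m in enumerate(clusters)]
        if new_centers == centers:  # converged
            break
        centers = new_centers
    sse = sum((p - centers[i]) ** 2 for i, m in enumerate(clusters) for p in m)
    return centers, sse


# Hypothetical toy data with three obvious groups.
points = [0, 1, 10, 11, 20, 21]

good_centers, good_sse = lloyd(points, [0, 10, 20])
poor_centers, poor_sse = lloyd(points, [0, 1, 15])

print("good start:", good_centers, good_sse)
print("poor start:", poor_centers, poor_sse)
```

Both runs converge, but the second stops at a much worse local optimum because two of its initial centroids fall in the same group. A global search over starting configurations, such as the harmony search described in the abstract, is one way to reduce this sensitivity.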
A novel harmony search-K means hybrid algorithm for clustering gene expression data.

PubMed

Nazeer, Ka Abdul; Sebastian, Mp; Kumar, Sd Madhu

2013-01-01

Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze a large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k-means clustering algorithm is widely used for many practical applications. But the original k-means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms.
Identification of the Viridicatumtoxin and Griseofulvin Gene Clusters from Penicillium aethiopicum

PubMed Central

Chooi, Yit-Heng; Cacho, Ralph; Tang, Yi

2010-01-01

SUMMARY Penicillium aethiopicum produces two structurally interesting and biologically active polyketides: the tetracycline-like viridicatumtoxin 1 and the classic antifungal agent griseofulvin 2. Here, we report the concurrent discovery of the two corresponding biosynthetic gene clusters (vrt and gsf) by 454 shotgun sequencing. Gene deletions confirmed two nonreducing PKSs (NRPKS), vrtA and gsfA, are required for the biosynthesis of 1 and 2, respectively. Both PKSs share similar domain architectures and lack a C-terminal thioesterase domain. We identified gsfI as the chlorinase involved in the biosynthesis of 2, as deletion of gsfI resulted in the accumulation of dechlorogriseofulvin 3. Comparative analysis with the P. chrysogenum genome revealed that both clusters are embedded within conserved syntenic regions of P. aethiopicum chromosomes. Discovery of the vrt and gsf clusters provided the basis for genetic and biochemical studies of the pathways. PMID:20534346

MeSH key terms for validation and annotation of gene expression clusters

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rechtsteiner, A.; Rocha, L. M.

2004-01-01

Integration of different sources of information is a great challenge for the analysis of gene expression data, and for the field of Functional Genomics in general. As the availability of numerical data from high-throughput methods increases, so does the need for technologies that assist in the validation and evaluation of the biological significance of results extracted from these data. In mRNA assaying with microarrays, for example, numerical analysis often attempts to identify clusters of co-expressed genes. The important task to find the biological significance of the results and validate them has so far mostly fallen to the biological expert who had to perform this task manually. One of the most promising avenues to develop automated and integrative technology for such tasks lies in the application of modern Information Retrieval (IR) and Knowledge Management (KM) algorithms to databases with biomedical publications and data. Examples of databases available for the field are bibliographic databases containing scientific publications (e.g. MEDLINE/PUBMED), databases containing sequence data (e.g. GenBank) and databases of semantic annotations (e.g. the Gene Ontology Consortium and Medical Subject Headings (MeSH)). We present here an approach that uses the MeSH terms and their concept hierarchies to validate and obtain functional information for gene expression clusters. The controlled and hierarchical MeSH vocabulary is used by the National Library of Medicine (NLM) to index all the articles cited in MEDLINE. Such indexing with a controlled vocabulary eliminates some of the ambiguity due to polysemy (terms that have multiple meanings) and synonymy (multiple terms have similar meaning) that would be encountered if terms would be extracted directly from the articles due to differing article contexts or author preferences and background. Further, the hierarchical organization of the MeSH terms can illustrate the conceptual/functional relationships of genes
antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

PubMed Central

Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko

2015-01-01

Abstract Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. PMID:25948579
A remarkably stable TipE gene cluster: evolution of insect Para sodium channel auxiliary subunits

PubMed Central

2011-01-01

Background First identified in fruit flies with temperature-sensitive paralysis phenotypes, the Drosophila melanogaster TipE locus encodes four voltage-gated sodium (NaV) channel auxiliary subunits. This cluster of TipE-like genes on chromosome 3L, and a fifth family member on chromosome 3R, are important for the optimal expression and functionality of the Para NaV channel but appear quite distinct from auxiliary subunits in vertebrates. Here, we exploited available arthropod genomic resources to trace the origin of TipE-like genes by mapping their evolutionary histories and examining their genomic architectures. Results We identified a remarkably conserved synteny block of TipE-like orthologues with well-maintained local gene arrangements from 21 insect species. Homologues in the water flea, Daphnia pulex, suggest an ancestral pancrustacean repertoire of four TipE-like genes; a subsequent gene duplication may have generated functional redundancy allowing gene losses in the silk moth and mosquitoes. Intronic nesting of the insect TipE gene cluster probably occurred following the divergence from crustaceans, but in the flour beetle and silk moth genomes the clusters apparently escaped from nesting. Across Pancrustacea, TipE gene family members have experienced intronic nesting, escape from nesting, retrotransposition, translocation, and gene loss events while generally maintaining their local gene neighbourhoods. D. melanogaster TipE-like genes exhibit coordinated spatial and temporal regulation of expression distinct from their host gene but well-correlated with their regulatory target, the Para NaV channel, suggesting that functional constraints may preserve the TipE gene cluster. We identified homology between TipE-like NaV channel regulators and vertebrate Slo-beta auxiliary subunits of big-conductance calcium-activated potassium (BKCa) channels, which suggests that ion channel regulatory partners have evolved distinct lineage-specific characteristics
Establishment of the Inducible Tet-On System for the Activation of the Silent Trichosetin Gene Cluster in Fusarium fujikuroi

PubMed Central

Janevska, Slavica; Arndt, Birgit; Baumann, Leonie; Apken, Lisa Helene; Mauriz Marques, Lucas Maciel; Humpf, Hans-Ulrich; Tudzynski, Bettina

2017-01-01

The PKS-NRPS-derived tetramic acid equisetin and its N-desmethyl derivative trichosetin exhibit remarkable biological activities against a variety of organisms, including plants and bacteria, e.g., Staphylococcus aureus. The equisetin biosynthetic gene cluster was first described in Fusarium heterosporum, a species distantly related to the notorious rice pathogen Fusarium fujikuroi. Here we present the activation and characterization of a homologous, but silent, gene cluster in F. fujikuroi. Bioinformatic analysis revealed that this cluster does not contain the equisetin N-methyltransferase gene eqxD and consequently, trichosetin was isolated as final product. The adaption of the inducible, tetracycline-dependent Tet-on promoter system from Aspergillus niger achieved a controlled overproduction of this toxic metabolite and a functional characterization of each cluster gene in F. fujikuroi. Overexpression of one of the two cluster-specific transcription factor (TF) genes, TF22, led to an activation of the three biosynthetic cluster genes, including the PKS-NRPS key gene. In contrast, overexpression of TF23, encoding a second Zn(II)2Cys6 TF, did not activate adjacent cluster genes. Instead, TF23 was induced by the final product trichosetin and was required for expression of the transporter-encoding gene MFS-T. TF23 and MFS-T likely act in consort and contribute to detoxification of trichosetin and therefore, self-protection of the producing fungus. PMID:28379186
Establishment of the Inducible Tet-On System for the Activation of the Silent Trichosetin Gene Cluster in Fusarium fujikuroi.

PubMed

Janevska, Slavica; Arndt, Birgit; Baumann, Leonie; Apken, Lisa Helene; Mauriz Marques, Lucas Maciel; Humpf, Hans-Ulrich; Tudzynski, Bettina

2017-04-05

The PKS-NRPS-derived tetramic acid equisetin and its N-desmethyl derivative trichosetin exhibit remarkable biological activities against a variety of organisms, including plants and bacteria, e.g., Staphylococcus aureus. The equisetin biosynthetic gene cluster was first described in Fusarium heterosporum, a species distantly related to the notorious rice pathogen Fusarium fujikuroi. Here we present the activation and characterization of a homologous, but silent, gene cluster in F. fujikuroi. Bioinformatic analysis revealed that this cluster does not contain the equisetin N-methyltransferase gene eqxD and consequently, trichosetin was isolated as final product. The adaption of the inducible, tetracycline-dependent Tet-on promoter system from Aspergillus niger achieved a controlled overproduction of this toxic metabolite and a functional characterization of each cluster gene in F. fujikuroi. Overexpression of one of the two cluster-specific transcription factor (TF) genes, TF22, led to an activation of the three biosynthetic cluster genes, including the PKS-NRPS key gene. In contrast, overexpression of TF23, encoding a second Zn(II)₂Cys₆ TF, did not activate adjacent cluster genes. Instead, TF23 was induced by the final product trichosetin and was required for expression of the transporter-encoding gene MFS-T. TF23 and MFS-T likely act in consort and contribute to detoxification of trichosetin and therefore, self-protection of the producing fungus.
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

PubMed

Sakoda, H; Imanaka, T

1992-02-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH.
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

PubMed Central

Sakoda, H; Imanaka, T

1992-01-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a slightly lower level of activity than wild-type ADH-T. This result indicates that the OH group of serine can substitute for that of threonine in catalysis. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH. PMID:1735726
The Genetics of a Small Chromosome Region of DROSOPHILA MELANOGASTER Containing the Structural Gene for Alcohol Dehydrogenase. IV: Scutoid, an Antimorphic Mutation

PubMed Central

Ashburner, M.; Tsubota, S.; Woodruff, R. C.

1982-01-01

Exchange mapping locates the dominant mutation Scutoid to the right of Adh on chromosome arm 2L of D. melanogaster. However, deletion mapping indicates that Sco is to the left of Adh. The phenotype of Sco is sensitive to mutation, or deletion, of noc+ and of three genes, el, l(2)br22, and l(2)br29 mapping immediately distal to noc. The four contiguous loci, el, l(2)br22, l(2)br29 and noc, although separable by deletion end points, interact, because certain (or all) alleles of these four loci show partial failure of complementation, or even negative complementation. The simplest hypothesis is that Sco is a small reciprocal transposition, the genes noc, osp, and Adh exchanging places with three genes normally mapping proximal to them: l(2)br34, l(2)br35 and rd. The Sco phenotype is thought to result from a position effect at the newly created noc/l(2)br28 junction. PMID:6816673
Dynamically Coupled Food-web and Hydrodynamic Modeling with ADH-CASM

NASA Astrophysics Data System (ADS)

Piercy, C.; Swannack, T. M.

2012-12-01

Oysters and freshwater mussels are "ecological engineers," modifying the local water quality by filtering zooplankton and other suspended particulate matter from the water column and flow hydraulics by impinging on the near-bed flow environment. The success of sessile, benthic invertebrates such as oysters depends on environmental factors including but not limited to temperature, salinity, and flow regime. Typically food-web and other types of ecological models use flow and water quality data as direct input without regard to the feedback between the ecosystem and the physical environment. The USACE-ERDC has developed a coupled hydrodynamic-ecological modeling approach that dynamically couples a 2-D hydrodynamic and constituent transport model, Adaptive Hydraulics (ADH), with a bioenergetics food-web model, the Comprehensive Aquatics Systems Model (CASM), which captures the dynamic feedback between aquatic ecological systems and the environment. We present modeling results from restored oyster reefs in the Great Wicomico River on the western shore of the Chesapeake Bay, which quantify ecosystem services such as the influence of the benthic ecosystem on water quality. Preliminary results indicate that while the influence of oyster reefs on bulk flow dynamics is limited due to the localized influence of oyster reefs, large reefs and the associated benthic ecosystem can create measurable changes in the concentrations of nitrogen, phosphorus, and carbon in the areas around reefs. We also present a sensitivity analysis to quantify the relative sensitivity of the coupled ADH-CASM model to both hydrodynamic and ecological parameter choice.
Structure-related clustering of gene expression fingerprints of THP-1 cells exposed to smaller polycyclic aromatic hydrocarbons.

PubMed

Wan, B; Yarbrough, J W; Schultz, T W

2008-01-01

This study was undertaken to test the hypothesis that structurally similar PAHs induce similar gene expression profiles. THP-1 cells were exposed to a series of 12 selected PAHs at 50 microM for 24 hours and gene expression profiles were analyzed using both unsupervised and supervised methods. Clustering analysis of gene expression profiles revealed that the 12 tested chemicals were grouped into five clusters. Within each cluster, the gene expression profiles are more similar to each other than to the ones outside the cluster. 1-Methylanthracene and 1-methylfluorene were found to have the most similar profiles; dibenzothiophene and dibenzofuran were found to share common profiles with fluorene. As expression pattern comparisons were expanded, similarity in genomic fingerprint dropped off dramatically. Prediction analysis of microarrays (PAM) based on the clustering pattern generated 49 predictor genes that can be used for sample discrimination. Moreover, significance analysis of microarrays (SAM) identified 598 genes modulated by the tested chemicals, with a variety of biological processes, such as cell cycle, metabolism, and protein binding, and KEGG pathways being significantly (p < 0.05) affected. It is feasible to distinguish structurally different PAHs based on their genomic fingerprints, which are mechanism based.
Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages.

PubMed

Elmore, M Holly; McGary, Kriston L; Wisecaver, Jennifer H; Slot, Jason C; Geiser, David M; Sink, Stacy; O'Donnell, Kerry; Rokas, Antonis

2015-02-06

Fungi that have the enzymes cyanase and carbonic anhydrase show a limited capacity to detoxify cyanate, a fungicide employed by both plants and humans. Here, we describe a novel two-gene cluster that comprises duplicated cyanase and carbonic anhydrase copies, which we name the CCA gene cluster, trace its evolution across Ascomycetes, and examine the evolutionary dynamics of its spread among lineages of the Fusarium oxysporum species complex (hereafter referred to as the FOSC), a cosmopolitan clade of purportedly clonal vascular wilt plant pathogens. Phylogenetic analysis of fungal cyanase and carbonic anhydrase genes reveals that the CCA gene cluster arose independently at least twice and is now present in three lineages, namely Cochliobolus lunatus, Oidiodendron maius, and the FOSC. Genome-wide surveys within the FOSC indicate that the CCA gene cluster varies in copy number across isolates, is always located on accessory chromosomes, and is absent in FOSC's closest relatives. Phylogenetic reconstruction of the CCA gene cluster in 163 FOSC strains from a wide variety of hosts suggests a recent history of rampant transfers between isolates. We hypothesize that the independent formation of the CCA gene cluster in different fungal lineages and its spread across FOSC strains may be associated with resistance to plant-produced cyanates or to use of cyanate fungicides in agriculture. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Integrating Horizontal Gene Transfer and Common Descent to Depict Evolution and Contrast It with “Common Design”

PubMed Central

GUILLERMO PAZ-Y-MIÑO-C; ESPINOSA, AVELINA

2016-01-01

Horizontal gene transfer (HGT) and common descent interact in space and time. Because events of HGT co-occur with phylogenetic evolution, it is difficult to depict evolutionary patterns graphically. Tree-like representations of life’s diversification are useful, but they ignore the significance of HGT in evolutionary history, particularly of unicellular organisms, ancestors of multicellular life. Here we integrate the reticulated-tree model, ring of life, symbiogenesis whole-organism model, and eliminative pattern pluralism to represent evolution. Using Entamoeba histolytica alcohol dehydrogenase 2 (EhADH2), a bifunctional enzyme in the glycolytic pathway of amoeba, we illustrate how EhADH2 could be the product of both horizontally acquired features from ancestral prokaryotes (i.e. aldehyde dehydrogenase [ALDH] and alcohol dehydrogenase [ADH]), and subsequent functional integration of these enzymes into EhADH2, which is now inherited by amoeba via common descent. Natural selection has driven the evolution of EhADH2 active sites, which require specific amino acids (cysteine 252 in the ALDH domain; histidine 754 in the ADH domain), iron- and NAD+ as cofactors, and the substrates acetyl-CoA for ALDH and acetaldehyde for ADH. Alternative views invoking “common design” (i.e. the non-naturalistic emergence of major taxa independent from ancestry) to explain the interaction between horizontal and vertical evolution are unfounded. PMID:20021546
Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5.

PubMed

Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A

2009-03-30

Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of
Comprehensive annotation of secondary metabolite biosynthetic genes and gene clusters of Aspergillus nidulans, A. fumigatus, A. niger and A. oryzae

PubMed Central

2013-01-01

Background: Secondary metabolite production, a hallmark of filamentous fungi, is an expanding area of research for the Aspergilli. These compounds are potent chemicals, ranging from deadly toxins to therapeutic antibiotics to potential anti-cancer drugs. The genome sequences for multiple Aspergilli have been determined, and provide a wealth of predictive information about secondary metabolite production. Sequence analysis and gene overexpression strategies have enabled the discovery of novel secondary metabolites and the genes involved in their biosynthesis. The Aspergillus Genome Database (AspGD) provides a central repository for gene annotation and protein information for Aspergillus species. These annotations include Gene Ontology (GO) terms, phenotype data, gene names and descriptions and they are crucial for interpreting both small- and large-scale data and for aiding in the design of new experiments that further Aspergillus research. Results: We have manually curated Biological Process GO annotations for all genes in AspGD with recorded functions in secondary metabolite production, adding new GO terms that specifically describe each secondary metabolite. We then leveraged these new annotations to predict roles in secondary metabolism for genes lacking experimental characterization. As a starting point for manually annotating Aspergillus secondary metabolite gene clusters, we used antiSMASH (antibiotics and Secondary Metabolite Analysis SHell) and SMURF (Secondary Metabolite Unknown Regions Finder) algorithms to identify potential clusters in A. nidulans, A. fumigatus, A. niger and A. oryzae, which we subsequently refined through manual curation. Conclusions: This set of 266 manually curated secondary metabolite gene clusters will facilitate the investigation of novel Aspergillus secondary metabolites. PMID:23617571
Impact of missing data imputation methods on gene expression clustering and classification.

PubMed

de Souto, Marcilio C P; Jaskowiak, Pablo A; Costa, Ivan G

2015-02-26

Several missing value imputation methods for gene expression data have been proposed in the literature. In the past few years, researchers have been putting a great deal of effort into presenting systematic evaluations of the different imputation algorithms. Initially, most algorithms were assessed with an emphasis on the accuracy of the imputation, using metrics such as the root mean squared error. However, it has become clear that the success of the estimation of the expression value should be evaluated in more practical terms as well. One can consider, for example, the ability of the method to preserve the significant genes in the dataset, or its discriminative/predictive power for classification/clustering purposes. We performed a broad analysis of the impact of five well-known missing value imputation methods on three clustering and four classification methods, in the context of 12 cancer gene expression datasets. We employed a statistical framework, for the first time in this field, to assess whether different imputation methods improve the performance of the clustering/classification methods. Our results suggest that the imputation methods evaluated have a minor impact on the classification and downstream clustering analyses. Simple methods such as replacing the missing values by mean or the median values performed as well as more complex strategies. The datasets analyzed in this study are available at http://costalab.org/Imputation/ .
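As a rough illustration of the simple strategies mentioned in this abstract, the sketch below replaces missing expression values with the per-gene mean or median. It is not the authors' evaluation code: the toy matrix, the use of `None` to mark a missing value, and the function name `impute_rows` are all assumptions made for the example.

```python
from statistics import mean, median

def impute_rows(matrix, strategy="mean"):
    """Fill missing entries (None) in each row with that row's mean or median.

    Each row is assumed to hold one gene's expression values across samples.
    """
    if strategy not in ("mean", "median"):
        raise ValueError("strategy must be 'mean' or 'median'")
    fill = mean if strategy == "mean" else median
    imputed = []
    for row in matrix:
        observed = [v for v in row if v is not None]
        if not observed:
            raise ValueError("row has no observed values to impute from")
        value = fill(observed)
        # Keep observed values untouched; only the gaps receive the estimate.
        imputed.append([value if v is None else v for v in row])
    return imputed

# Hypothetical toy data: two genes measured in four samples.
expression = [
    [2.0, None, 4.0, 6.0],
    [1.0, 1.0, None, 10.0],
]
mean_filled = impute_rows(expression, "mean")
median_filled = impute_rows(expression, "median")
```

The second row shows why the two strategies can differ: a single large value pulls the mean upward while the median stays with the majority of the observations.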
Variability among Cucurbitaceae species (melon, cucumber and watermelon) in a genomic region containing a cluster of NBS-LRR genes.

PubMed

Morata, Jordi; Puigdomènech, Pere

2017-02-08

Cucurbitaceae species contain a significantly lower number of genes coding for proteins with similarity to plant resistance genes belonging to the NBS-LRR family than other plant species of similar genome size. A large proportion of these genes are organized in clusters that appear to be hotspots of variability. The genomes of the Cucurbitaceae species measured until now are intermediate in size (between 350 and 450 Mb) and they apparently have not undergone any genome duplications besides those at the origin of eudicots. The cluster containing the largest number of NBS-LRR genes has previously been analyzed in melon and related species and showed a high degree of interspecific and intraspecific variability. It was of interest to study whether similar behavior occurred in other clusters of the same family of genes. The cluster of NBS-LRR genes located in melon chromosome 9 was analyzed and compared with the syntenic regions in other cucurbit genomes. This is the second largest cluster within this species and it contains nine sequences with an NBS-LRR annotation including two genes, Fom1 and Prv, providing resistance against Fusarium and Papaya ring-spot virus (PRSV). The variability within the melon species appears to consist essentially of single nucleotide polymorphisms. Clusters of similar genes are present in the syntenic regions of the two species of Cucurbitaceae that were sequenced, cucumber and watermelon. Most of the genes in the syntenic clusters can be aligned between species and a hypothesis of generation of the cluster is proposed. The number of genes in the watermelon cluster is similar to that in melon while a higher number of genes (12) is present in cucumber, a species with a smaller genome than melon. After comparing genome resequencing data of 115 cucumber varieties, deletion of a group of genes is observed in a group of varieties of Indian origin. Clusters of genes coding for NBS-LRR proteins in cucurbits appear to have specific variability in
A novel polyketide biosynthesis gene cluster is involved in fruiting body morphogenesis in the filamentous fungi Sordaria macrospora and Neurospora crassa.

PubMed

Nowrousian, Minou

2009-04-01

During fungal fruiting body development, hyphae aggregate to form multicellular structures that protect and disperse the sexual spores. Analysis of microarray data revealed a gene cluster strongly upregulated during fruiting body development in the ascomycete Sordaria macrospora. Real time PCR analysis showed that the genes from the orthologous cluster in Neurospora crassa are also upregulated during development. The cluster encodes putative polyketide biosynthesis enzymes, including a reducing polyketide synthase. Analysis of knockout strains of a predicted dehydrogenase gene from the cluster showed that mutants in N. crassa and S. macrospora are delayed in fruiting body formation. In addition to the upregulated cluster, the N. crassa genome comprises another cluster containing a polyketide synthase gene, and five additional reducing polyketide synthase (rpks) genes that are not part of clusters. To study the role of these genes in sexual development, expression of the predicted rpks genes in S. macrospora (five genes) and N. crassa (six genes) was analyzed; all but one are upregulated during sexual development. Analysis of knockout strains for the N. crassa rpks genes showed that one of them is essential for fruiting body formation. These data indicate that polyketides produced by RPKSs are involved in sexual development in filamentous ascomycetes.
CRAWview: for viewing splicing variation, gene families, and polymorphism in clusters of ESTs and full-length sequences.

PubMed

Chou, A; Burke, J

1999-05-01

DNA sequence clustering has become a valuable method in support of gene discovery and gene expression analysis. Our interest lies in leveraging the sequence diversity within clusters of expressed sequence tags (ESTs) to model gene structure for the study of gene variants that arise from, among other things, alternative mRNA splicing, polymorphism, and divergence after gene duplication, fusion, and translocation events. In previous work, CRAW was developed to discover gene variants from assembled clusters of ESTs. Most importantly, novel gene features (the differing units between gene variants, for example alternative exons, polymorphisms, transposable elements, etc.) that are specialized to tissue, disease, population, or developmental states can be identified when these tools collate DNA source information with gene variant discrimination. While the goal is complete automation of novel feature and gene variant detection, current methods are far from perfect and hence the development of effective tools for visualization and exploratory data analysis is of paramount importance in the process of sifting through candidate genes and validating targets. We present CRAWview, a Java based visualization extension to CRAW. Features that vary between gene forms are displayed using an automatically generated color coded index. The reporting format of CRAWview gives a brief, high level summary report to display overlap and divergence within clusters of sequences as well as the ability to 'drill down' and see detailed information concerning regions of interest. Additionally, the alignment viewing and editing capabilities of CRAWview make it possible to interactively correct frame-shifts and otherwise edit cluster assemblies. We have implemented CRAWview as a Java application across Windows NT/95 and UNIX platforms. A beta version of CRAWview will be freely available to academic users from Pangea Systems (http://www.pangeasystems.com). Contact :
Identification of aflatoxin biosynthesis genes by genetic complementation in an Aspergillus flavus mutant lacking the aflatoxin gene cluster.

PubMed Central

Prieto, R; Yousibova, G L; Woloshuk, C P

1996-01-01

Aspergillus flavus mutant strain 649, which has a genomic DNA deletion of at least 120 kb covering the aflatoxin biosynthesis cluster, was transformed with a series of overlapping cosmids that contained DNA harboring the cluster of genes. The mutant phenotype of strain 649 was rescued by transformation with a combination of cosmid clones 5E6, 8B9, and 13B9, indicating that the cluster of genes involved in aflatoxin biosynthesis resides in the 90 kb of A. flavus genomic DNA carried by these clones. Transformants 5E6 and 20B11 and transformants 5E6 and 8B9 accumulated intermediate metabolites of the aflatoxin pathway, which were identified as averufanin and/or averufin, respectively. These data suggest that avf1, which is involved in the conversion of averufin to versiconal hemiacetal acetate, was present in the cosmid 13B9. Deletion analysis of 13B9 located the gene on a 7-kb DNA fragment of the cosmid. Transformants containing cosmid 8B9 converted exogenously supplied O-methylsterigmatocystin to aflatoxin, indicating that the oxidoreductase gene (ord1), which mediates the conversion of O-methylsterigmatocystin to aflatoxin, is carried by this cosmid. The analysis of transformants containing deletions of 8B9 led to the localization of ord1 on a 3.3-kb A. flavus genomic DNA fragment of the cosmid. PMID:8967772
[Distribution of genotypes of alcohol dehydrogenase 2 and aldehyde dehydrogenase 2 in Japanese twin children].

PubMed

Qu, W; Yamagata, Z; Wu, D; Zhang, B; Zhang, Y

1999-03-01

In order to prevent alcohol-related diseases, this study investigated the distribution of the genes controlling alcohol metabolism in Japanese twins. The restriction fragment length polymorphism-polymerase chain reaction (RFLP-PCR) technique was used to determine the genotypes of the genes encoding the alcohol-metabolizing enzymes alcohol dehydrogenase 2 (ADH2) and aldehyde dehydrogenase 2 (ALDH2) in Japanese twins. At the same time, sensitive individuals were screened from the study subjects according to their genotypes. The distributions of the ADH2 and ALDH2 genotypes were consistent with Hardy-Weinberg equilibrium. The three genotypes of the ADH2 gene were ADH2(1)/ADH2(1) (1.1%), ADH2(1)/ADH2(2) (44.6%) and ADH2(2)/ADH2(2) (54.3%), and those of the ALDH2 gene were ALDH2(1)/ALDH2(1) (41.3%), ALDH2(1)/ALDH2(2) (39.1%) and ALDH2(2)/ALDH2(2) (19.6%). The allele frequencies were 0.255 and 0.745 for ADH2 and 0.609 and 0.391 for ALDH2, respectively. Knowing the genotype distribution of ADH2 and ALDH2 and identifying sensitive individuals can help prevent alcohol-related disease.
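The relationship between genotype and allele frequencies in this record can be sketched as follows. This is an illustrative calculation only, using the ALDH2 genotype percentages quoted in the abstract; the function names are hypothetical, and a formal Hardy-Weinberg test would need the genotype counts, which the abstract does not give.

```python
def allele_frequencies(homo1, hetero, homo2):
    """Return (p, q) for a biallelic locus from genotype counts or percentages.

    Each homozygote carries two copies of its allele and each heterozygote
    carries one copy of each, so p = (homo1 + hetero / 2) / total.
    """
    total = homo1 + hetero + homo2
    p = (homo1 + hetero / 2) / total
    return p, 1 - p

def hardy_weinberg_expected(p):
    """Expected genotype proportions (p^2, 2pq, q^2) under Hardy-Weinberg."""
    q = 1 - p
    return p * p, 2 * p * q, q * q

# ALDH2 genotype percentages as quoted in the abstract above.
aldh2_p, aldh2_q = allele_frequencies(41.3, 39.1, 19.6)
expected = hardy_weinberg_expected(aldh2_p)
```

With these inputs the allele frequencies come out at roughly 0.6085 and 0.3915, in line with the 0.609 and 0.391 reported for ALDH2.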

Function Clustering Self-Organization Maps (FCSOMs) for mining differentially expressed genes in Drosophila and its correlation with the growth medium.

PubMed

Liu, L L; Liu, M J; Ma, M

2015-09-28

The central task of this study was to mine the gene-to-medium relationship. Adequate knowledge of this relationship could potentially improve the accuracy of differentially expressed gene mining. One of the approaches to differentially expressed gene mining uses conventional clustering algorithms to identify the gene-to-medium relationship. Compared to conventional clustering algorithms, self-organization maps (SOMs) identify the nonlinear aspects of the gene-to-medium relationships by mapping the input space into another higher dimensional feature space. However, SOMs are not suitable for huge datasets consisting of millions of samples. Therefore, a new computational model, the Function Clustering Self-Organization Maps (FCSOMs), was developed. FCSOMs take advantage of the theory of granular computing as well as advanced statistical learning methodologies, and are built specifically for each information granule (a function cluster of genes), which are intelligently partitioned by the clustering algorithm provided by the DAVID_6.7 software platform. However, only the gene functions, and not their expression values, are considered in the fuzzy clustering algorithm of DAVID. Compared to the clustering algorithm of DAVID, these experimental results show a marked improvement in the accuracy of classification with the application of FCSOMs. FCSOMs can handle huge datasets and their complex classification problems, as each FCSOM (modeled for each function cluster) can be easily parallelized.
Characterisation of the paralytic shellfish toxin biosynthesis gene clusters in Anabaena circinalis AWQC131C and Aphanizomenon sp. NH-5

PubMed Central

Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A

2009-01-01

Background: Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results: We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion: The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may
Two divergent Symbiodinium genomes reveal conservation of a gene cluster for sunscreen biosynthesis and recently lost genes.

PubMed

Shoguchi, Eiichi; Beedessee, Girish; Tada, Ipputa; Hisata, Kanako; Kawashima, Takeshi; Takeuchi, Takeshi; Arakaki, Nana; Fujie, Manabu; Koyanagi, Ryo; Roy, Michael C; Kawachi, Masanobu; Hidaka, Michio; Satoh, Noriyuki; Shinzato, Chuya

2018-06-14

The marine dinoflagellate, Symbiodinium, is a well-known photosynthetic partner for coral and other diverse, non-photosynthetic hosts in subtropical and tropical shallows, where it comprises an essential component of marine ecosystems. Using molecular phylogenetics, the genus Symbiodinium has been classified into nine major clades, A-I, and one of the reported differences among phenotypes is their capacity to synthesize mycosporine-like amino acids (MAAs), which absorb UV radiation. However, the genetic basis for this difference in synthetic capacity is unknown. To understand genetics underlying Symbiodinium diversity, we report two draft genomes, one from clade A, presumed to have been the earliest branching clade, and the other from clade C, in the terminal branch. The nuclear genome of Symbiodinium clade A (SymA) has more gene families than that of clade C, with larger numbers of organelle-related genes, including mitochondrial transcription terminal factor (mTERF) and Rubisco. While clade C (SymC) has fewer gene families, it displays specific expansions of repeat domain-containing genes, such as leucine-rich repeats (LRRs) and retrovirus-related dUTPases. Interestingly, the SymA genome encodes a gene cluster for MAA biosynthesis, potentially transferred from an endosymbiotic red alga (probably of bacterial origin), while SymC has completely lost these genes. Our analysis demonstrates that SymC appears to have evolved by losing gene families, such as the MAA biosynthesis gene cluster. In contrast to the conservation of genes related to photosynthetic ability, the terminal clade has suffered more gene family losses than other clades, suggesting a possible adaptation to symbiosis. Overall, this study implies that Symbiodinium ecology drives acquisition and loss of gene families.
The PhytoClust tool for metabolic gene clusters discovery in plant genomes

PubMed Central

Fuchs, Lisa-Maria

2017-01-01

The existence of Metabolic Gene Clusters (MGCs) in plant genomes has recently raised increased interest. Thus far, MGCs were commonly identified for pathways of specialized metabolism, mostly those associated with terpene type products. For efficient identification of novel MGCs, computational approaches are essential. Here, we present PhytoClust, a tool for the detection of candidate MGCs in plant genomes. The algorithm employs a collection of enzyme families related to plant specialized metabolism, translated into hidden Markov models, to mine given genome sequences for physically co-localized metabolic enzymes. Our tool accurately identifies previously characterized plant MGCs. An exhaustive search of 31 plant genomes detected 1232 and 5531 putative gene cluster types and candidates, respectively. Clustering analysis of putative MGCs types by species reflected plant taxonomy. Furthermore, enrichment analysis revealed taxa- and species-specific enrichment of certain enzyme families in MGCs. When operating through our web-interface, PhytoClust users can mine a genome either based on a list of known cluster types or by defining new cluster rules. Moreover, for selected plant species, the output can be complemented by co-expression analysis. Altogether, we envisage PhytoClust to enhance novel MGCs discovery which will in turn impact the exploration of plant metabolism. PMID:28486689
Organization of the qa Gene Cluster in NEUROSPORA CRASSA: Direction of Transcription of the qa-3 Gene

PubMed Central

Strøman, Per; Reinert, William; Case, Mary E.; Giles, Norman H.

1979-01-01

In Neurospora crassa, the enzyme quinate (shikimate) dehydrogenase catalyzes the first reaction in the inducible quinic acid catabolic pathway and is encoded in the qa-3 gene of the qa cluster. In this cluster, the order of genes has been established as qa-1 qa-3 qa-4 qa-2. Amino-terminal sequences have been determined for purified quinate dehydrogenase from wild type and from UV-induced revertants in two different qa-3 mutants. These two mutants (M16 and M45) map at opposite ends of the qa-3 locus. In addition, mapping data (Case et al. 1978) indicate that the end of the qa-3 gene specified by M45 is closer to the adjacent qa-1 gene than is the end specified by the M16 mutant site. In one of the revertants (R45 from qa-3 mutant M45), the amino-terminal sequence for the first ten amino acids is identical to that of wild type. The other revertant (R1 from qa-3 mutant M16) differs from wild type at the amino-terminal end by a single altered residue at position three in the sequence. The observed change involves the substitution of an isoleucine in M16-R1 for a proline in wild type. This substitution requires a two-nucleotide change in the corresponding wild-type codon. The combined genetic and biochemical data indicate that the qa-3 mutants M16 and M45 carry amino acid substitutions near the amino-terminal and carboxyl-terminal ends of the quinate dehydrogenase enzyme, respectively. On this basis we conclude that transcription of the qa-3 gene proceeds from the end specified by the M16 mutant site in the direction of the qa-1 gene. It appears probable that transcription is initiated from a promoter site within the qa cluster, possibly immediately adjacent to the qa-3 gene. PMID:159203
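The statement that a proline-to-isoleucine substitution needs two nucleotide changes can be checked against the standard genetic code. The sketch below is only an illustration of that check; the helper names are invented here and do not come from the paper.

```python
from itertools import product

# Standard genetic code (RNA alphabet) for the two residues involved.
PRO_CODONS = ["CCU", "CCC", "CCA", "CCG"]
ILE_CODONS = ["AUU", "AUC", "AUA"]

def hamming(a, b):
    """Number of positions at which two equal-length codons differ."""
    return sum(x != y for x, y in zip(a, b))

def min_nucleotide_changes(src, dst):
    """Smallest number of single-base changes turning any src codon into any dst codon."""
    return min(hamming(s, d) for s, d in product(src, dst))

print(min_nucleotide_changes(PRO_CODONS, ILE_CODONS))  # -> 2
```

Every proline codon starts with CC and every isoleucine codon with AU, so the first two positions must both change while the third can stay the same.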
Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster

PubMed Central

2012-01-01

Background: The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results: We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions: These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster. PMID:22577841
Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster.

PubMed

Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René

2012-05-11

The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.
Purification of acetaldehyde dehydrogenase and alcohol dehydrogenases from Thermoanaerobacter ethanolicus 39E and characterization of the secondary-alcohol dehydrogenase (2° Adh) as a bifunctional alcohol dehydrogenase--acetyl-CoA reductive thioesterase.

PubMed Central

Burdette, D; Zeikus, J G

1994-01-01

The purification and characterization of three enzymes involved in ethanol formation from acetyl-CoA in Thermoanaerobacter ethanolicus 39E (formerly Clostridium thermohydrosulfuricum 39E) is described. The secondary-alcohol dehydrogenase (2° Adh) was determined to be a homotetramer of 40 kDa subunits (SDS/PAGE) with a molecular mass of 160 kDa. The 2° Adh had a lower catalytic efficiency for the oxidation of primary (1°) alcohols, including ethanol, than for the oxidation of secondary (2°) alcohols or the reduction of ketones or aldehydes. This enzyme possesses a significant acetyl-CoA reductive thioesterase activity as determined by NADPH oxidation, thiol formation and ethanol production. The primary-alcohol dehydrogenase (1° Adh) was determined to be a homotetramer of 41.5 kDa (SDS/PAGE) subunits with a molecular mass of 170 kDa. The 1° Adh used both NAD(H) and NADP(H) and displayed higher catalytic efficiencies for NADP(+)-dependent ethanol oxidation and NADH-dependent acetaldehyde (identical to ethanal) reduction than for NADPH-dependent acetaldehyde reduction or NAD(+)-dependent ethanol oxidation. The NAD(H)-linked acetaldehyde dehydrogenase was a homotetramer (360 kDa) of identical subunits (100 kDa) that readily catalysed thioester cleavage and condensation. The 1° Adh was expressed at 5-20% of the level of the 2° Adh throughout the growth cycle on glucose. The results suggest that the 2° Adh primarily functions in ethanol production from acetyl-CoA and acetaldehyde, whereas the 1° Adh functions in ethanol consumption for nicotinamide-cofactor recycling. PMID:8068002
Overproduction of Ristomycin A by Activation of a Silent Gene Cluster in Amycolatopsis japonicum MG417-CF17

PubMed Central

Spohn, Marius; Kirchner, Norbert; Kulik, Andreas; Jochim, Angelika; Wolf, Felix; Muenzer, Patrick; Borst, Oliver; Gross, Harald; Wohlleben, Wolfgang

2014-01-01

The emergence of antibiotic-resistant pathogenic bacteria within the last decades is one reason for the urgent need for new antibacterial agents. A strategy to discover new anti-infective compounds is the evaluation of the genetic capacity of secondary metabolite producers and the activation of cryptic gene clusters (genome mining). One genus known for its potential to synthesize medically important products is Amycolatopsis. However, Amycolatopsis japonicum does not produce an antibiotic under standard laboratory conditions. In contrast to most Amycolatopsis strains, A. japonicum is genetically tractable with different methods. In order to activate a possible silent glycopeptide cluster, we introduced a gene encoding the transcriptional activator of balhimycin biosynthesis, the bbr gene from Amycolatopsis balhimycina (bbrAba), into A. japonicum. This resulted in the production of an antibiotically active compound. Following whole-genome sequencing of A. japonicum, 29 cryptic gene clusters were identified by genome mining. One of these gene clusters is a putative glycopeptide biosynthesis gene cluster. Using bioinformatic tools, ristomycin (syn. ristocetin), a type III glycopeptide, which has antibacterial activity and which is used for the diagnosis of von Willebrand disease and Bernard-Soulier syndrome, was deduced as a possible product of the gene cluster. Chemical analyses by high-performance liquid chromatography and mass spectrometry (HPLC-MS), tandem mass spectrometry (MS/MS), and nuclear magnetic resonance (NMR) spectroscopy confirmed the in silico prediction that the recombinant A. japonicum/pRM4-bbrAba synthesizes ristomycin A. PMID:25114137
Leveraging long sequencing reads to investigate R-gene clustering and variation in sugar beet

USDA-ARS's Scientific Manuscript database

Host-pathogen interactions are of prime importance to modern agriculture. Plants utilize various types of resistance genes to mitigate pathogen damage. Identification of the specific gene responsible for a specific resistance can be difficult due to duplication and clustering within R-gene families....
Deletion and Gene Expression Analyses Define the Paxilline Biosynthetic Gene Cluster in Penicillium paxilli

PubMed Central

Scott, Barry; Young, Carolyn A.; Saikia, Sanjay; McMillan, Lisa K.; Monahan, Brendon J.; Koulman, Albert; Astin, Jonathan; Eaton, Carla J.; Bryant, Andrea; Wrenn, Ruth E.; Finch, Sarah C.; Tapper, Brian A.; Parker, Emily J.; Jameson, Geoffrey B.

2013-01-01

The indole-diterpene paxilline is an abundant secondary metabolite synthesized by Penicillium paxilli. In total, 21 genes have been identified at the PAX locus, of which six have been previously confirmed to have a functional role in paxilline biosynthesis. A combination of bioinformatics, gene expression and targeted gene replacement analyses was used to define the boundaries of the PAX gene cluster. Targeted gene replacement identified seven genes, paxG, paxA, paxM, paxB, paxC, paxP and paxQ, that were all required for paxilline production, with one additional gene, paxD, required for regular prenylation of the indole ring post paxilline synthesis. The two putative transcription factors, PP104 and PP105, were not co-regulated with the pax genes and, based on targeted gene replacement, including the double knockout, did not have a role in paxilline production. The indole dimethylallyl transferases involved in prenylation of indole-diterpenes such as paxilline or lolitrem B fall into two disparate clades that do not correspond to prenylation type (e.g., regular or reverse). This paper provides insight into the P. paxilli indole-diterpene locus and reviews the recent advances identified in paxilline biosynthesis. PMID:23949005
A multi-Poisson dynamic mixture model to cluster developmental patterns of gene expression by RNA-seq.

PubMed

Ye, Meixia; Wang, Zhong; Wang, Yaqun; Wu, Rongling

2015-03-01

Dynamic changes of gene expression reflect an intrinsic mechanism of how an organism responds to developmental and environmental signals. With the increasing availability of expression data across a time-space scale by RNA-seq, the classification of genes as per their biological function using RNA-seq data has become one of the most significant challenges in contemporary biology. Here we develop a clustering mixture model to discover distinct groups of genes expressed during a period of organ development. By integrating the density function of the multivariate Poisson distribution, the model accommodates the discrete property of read counts characteristic of RNA-seq data. The temporal dependence of gene expression is modeled by a first-order autoregressive process. The model is implemented with the Expectation-Maximization algorithm and model selection to determine the optimal number of gene clusters and obtain the estimates of Poisson parameters that describe the pattern of time-dependent expression of genes from each cluster. The model has been demonstrated by analyzing real data from an experiment aimed at linking the pattern of gene expression to catkin development in white poplar. The usefulness of the model has been validated through computer simulation. The model provides a valuable tool for clustering RNA-seq data, facilitating our global view of expression dynamics and understanding of gene regulation mechanisms. © The Author 2014. Published by Oxford University Press.
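To make the Expectation-Maximization step concrete, here is a deliberately simplified sketch of Poisson mixture clustering of read counts over time points. It assumes independent Poisson counts at each time point and therefore omits the first-order autoregressive dependence described in the abstract; it is not the authors' model. The function name, the random initialization scheme, and the toy count matrix are all assumptions made for illustration.

```python
import numpy as np

def poisson_mixture_em(counts, k, n_iter=100, seed=0):
    """Cluster rows of a genes-by-timepoints count matrix with a Poisson mixture."""
    rng = np.random.default_rng(seed)
    n, _ = counts.shape
    # Initialise cluster rates from k randomly chosen genes (offset avoids log(0)).
    lam = counts[rng.choice(n, size=k, replace=False)].astype(float) + 0.5
    pi = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # E-step: log-likelihood per gene and cluster, up to the log(x!) term,
        # which is the same for every cluster and cancels in the responsibilities.
        log_p = counts @ np.log(lam).T - lam.sum(axis=1) + np.log(pi)
        log_p -= log_p.max(axis=1, keepdims=True)  # numerical stability
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: update mixing proportions and per-timepoint Poisson rates.
        nk = resp.sum(axis=0) + 1e-12
        pi = nk / n
        lam = (resp.T @ counts) / nk[:, None] + 1e-9
    return pi, lam, resp.argmax(axis=1)

# Toy data (invented): three late-expressed genes, three early-expressed genes.
counts = np.array([[1, 2, 30], [0, 3, 28], [2, 1, 31],
                   [29, 2, 1], [31, 1, 0], [30, 3, 2]])
pi, lam, labels = poisson_mixture_em(counts, k=2)
print(labels)
```

Choosing the number of clusters, which the abstract handles through model selection, would need an extra step such as comparing an information criterion across values of `k`.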
Genomic insights into the evolution of hybrid isoprenoid biosynthetic gene clusters in the MAR4 marine streptomycete clade

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gallagher, Kelley A.; Jensen, Paul R.

Background: Considerable advances have been made in our understanding of the molecular genetics of secondary metabolite biosynthesis. Coupled with increased access to genome sequence data, new insight can be gained into the diversity and distributions of secondary metabolite biosynthetic gene clusters and the evolutionary processes that generate them. Here we examine the distribution of gene clusters predicted to encode the biosynthesis of a structurally diverse class of molecules called hybrid isoprenoids (HIs) in the genus Streptomyces. These compounds are derived from a mixed biosynthetic origin that is characterized by the incorporation of a terpene moiety onto a variety of chemical scaffolds and include many potent antibiotic and cytotoxic agents. Results: One hundred and twenty Streptomyces genomes were searched for HI biosynthetic gene clusters using ABBA prenyltransferases (PTases) as queries. These enzymes are responsible for a key step in HI biosynthesis. The strains included 12 that belong to the ‘MAR4’ clade, a largely marine-derived lineage linked to the production of diverse HI secondary metabolites. We found ABBA PTase homologs in all of the MAR4 genomes, which averaged five copies per strain, compared with 21% of the non-MAR4 genomes, which averaged one copy per strain. Phylogenetic analyses suggest that MAR4 PTase diversity has arisen by a combination of horizontal gene transfer and gene duplication. Furthermore, there is evidence that HI gene cluster diversity is generated by the horizontal exchange of orthologous PTases among clusters. Many putative HI gene clusters have not been linked to their secondary metabolic products, suggesting that MAR4 strains will yield additional new compounds in this structure class. Finally, we confirm that the mevalonate pathway is not always present in genomes that contain HI gene clusters and thus is not a reliable query for identifying strains with the potential to produce HI secondary metabolites.
Genomic insights into the evolution of hybrid isoprenoid biosynthetic gene clusters in the MAR4 marine streptomycete clade

DOE PAGES

Gallagher, Kelley A.; Jensen, Paul R.

2015-11-17

Background: Considerable advances have been made in our understanding of the molecular genetics of secondary metabolite biosynthesis. Coupled with increased access to genome sequence data, new insight can be gained into the diversity and distributions of secondary metabolite biosynthetic gene clusters and the evolutionary processes that generate them. Here we examine the distribution of gene clusters predicted to encode the biosynthesis of a structurally diverse class of molecules called hybrid isoprenoids (HIs) in the genus Streptomyces. These compounds are derived from a mixed biosynthetic origin that is characterized by the incorporation of a terpene moiety onto a variety of chemical scaffolds and include many potent antibiotic and cytotoxic agents. Results: One hundred and twenty Streptomyces genomes were searched for HI biosynthetic gene clusters using ABBA prenyltransferases (PTases) as queries. These enzymes are responsible for a key step in HI biosynthesis. The strains included 12 that belong to the ‘MAR4’ clade, a largely marine-derived lineage linked to the production of diverse HI secondary metabolites. We found ABBA PTase homologs in all of the MAR4 genomes, which averaged five copies per strain, compared with 21% of the non-MAR4 genomes, which averaged one copy per strain. Phylogenetic analyses suggest that MAR4 PTase diversity has arisen by a combination of horizontal gene transfer and gene duplication. Furthermore, there is evidence that HI gene cluster diversity is generated by the horizontal exchange of orthologous PTases among clusters. Many putative HI gene clusters have not been linked to their secondary metabolic products, suggesting that MAR4 strains will yield additional new compounds in this structure class. Finally, we confirm that the mevalonate pathway is not always present in genomes that contain HI gene clusters and thus is not a reliable query for identifying strains with the potential to produce HI secondary metabolites.
Strategies to regulate transcription factor-mediated gene positioning and interchromosomal clustering at the nuclear periphery.

PubMed

Randise-Hinchliff, Carlo; Coukos, Robert; Sood, Varun; Sumner, Michael Chas; Zdraljevic, Stefan; Meldi Sholl, Lauren; Garvey Brickner, Donna; Ahmed, Sara; Watchmaker, Lauren; Brickner, Jason H

2016-03-14

In budding yeast, targeting of active genes to the nuclear pore complex (NPC) and interchromosomal clustering is mediated by transcription factor (TF) binding sites in the gene promoters. For example, the binding sites for the TFs Put3, Ste12, and Gcn4 are necessary and sufficient to promote positioning at the nuclear periphery and interchromosomal clustering. However, in all three cases, gene positioning and interchromosomal clustering are regulated. Under uninducing conditions, local recruitment of the Rpd3(L) histone deacetylase by transcriptional repressors blocks Put3 DNA binding. This is a general function of yeast repressors: 16 of 21 repressors blocked Put3-mediated subnuclear positioning; 11 of these required Rpd3. In contrast, Ste12-mediated gene positioning is regulated independently of DNA binding by mitogen-activated protein kinase phosphorylation of the Dig2 inhibitor, and Gcn4-dependent targeting is up-regulated by increasing Gcn4 protein levels. These different regulatory strategies provide either qualitative switch-like control or quantitative control of gene positioning over different time scales. © 2016 Randise-Hinchliff et al.
Motif-independent prediction of a secondary metabolism gene cluster using comparative genomics: application to sequenced genomes of Aspergillus and ten other filamentous fungal species.

PubMed

Takeda, Itaru; Umemura, Myco; Koike, Hideaki; Asai, Kiyoshi; Machida, Masayuki

2014-08-01

Despite their biological importance, a significant number of genes for secondary metabolite biosynthesis (SMB) remain undetected due largely to the fact that they are highly diverse and are not expressed under a variety of cultivation conditions. Several software tools including SMURF and antiSMASH have been developed to predict fungal SMB gene clusters by finding core genes encoding polyketide synthase, nonribosomal peptide synthetase and dimethylallyltryptophan synthase as well as several others typically present in the cluster. In this work, we have devised a novel comparative genomics method to identify SMB gene clusters that is independent of motif information of the known SMB genes. The method detects SMB gene clusters by searching for a similar order of genes and their presence in nonsyntenic blocks. With this method, we were able to identify many known SMB gene clusters with the core genes in the genomic sequences of 10 filamentous fungi. Furthermore, we have also detected SMB gene clusters without core genes, including the kojic acid biosynthesis gene cluster of Aspergillus oryzae. By varying the detection parameters of the method, a significant difference in the sequence characteristics was detected between the genes residing inside the clusters and those outside the clusters. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Spatial expression of Hox cluster genes in the ontogeny of a sea urchin

NASA Technical Reports Server (NTRS)

Arenas-Mena, C.; Cameron, A. R.; Davidson, E. H.

2000-01-01

The Hox cluster of the sea urchin Strongylocentrotus purpuratus contains ten genes in a 500 kb span of the genome. Only two of these genes are expressed during embryogenesis, while all of eight genes tested are expressed during development of the adult body plan in the larval stage. We report the spatial expression during larval development of the five 'posterior' genes of the cluster: SpHox7, SpHox8, SpHox9/10, SpHox11/13a and SpHox11/13b. The five genes exhibit a dynamic, largely mesodermal program of expression. Only SpHox7 displays extensive expression within the pentameral rudiment itself. A spatially sequential and colinear arrangement of expression domains is found in the somatocoels, the paired posterior mesodermal structures that will become the adult perivisceral coeloms. No such sequential expression pattern is observed in endodermal, epidermal or neural tissues of either the larva or the presumptive juvenile sea urchin. The spatial expression patterns of the Hox genes illuminate the evolutionary process by which the pentameral echinoderm body plan emerged from a bilateral ancestor.
Effects of Parecoxib in the Prevention of Postoperative Abdominal Adhesions: a randomized experimental study in rats

PubMed Central

Arung, Willy; Tshilombo, François; Odimba, Etienne

2015-01-01

Introduction: Many studies have been conducted on intraperitoneal adhesions, but no consensus has yet been reached on their prevention. The aim of our study was to evaluate the potential effect of an anti-inflammatory drug, parecoxib, on the prevention of adhesions and on healing in rats. Methods: In an experimental model of postoperative adhesions secondary to peritoneal burn lesions, 30 rats were randomized into three groups according to the route of parecoxib administration (control group; intraperitoneal; intramuscular). Results: Parecoxib significantly decreased the quantity (p < 0.05) and severity (p < 0.01) of postoperative adhesions in both experimental models. In total, 21 rats developed adhesions: 9 (100%) in group A, 5 (50%) in group B and 7 (70%) in group C (p = 0.05). Regarding adhesion formation at the site of trauma, nineteen rats developed adhesions: 9 (100%) in group A and 5 (50%) in each of the other two groups, B and C. A significant difference was found when the groups were compared pairwise: A vs B (p < 0.05); A vs C (p < 0.05). Parecoxib did not compromise intestinal or skin healing. Conclusion: This study showed that parecoxib could reduce the formation of postoperative adhesions. The safety of parecoxib for intestinal anastomoses remains to be confirmed in further experiments. PMID:26966478
Alcohol and aldehyde dehydrogenase gene polymorphisms and oropharyngolaryngeal, esophageal and stomach cancers in Japanese alcoholics.

PubMed

Yokoyama, A; Muramatsu, T; Omori, T; Yokoyama, T; Matsushita, S; Higuchi, S; Maruyama, K; Ishii, H

2001-03-01

Alcohol dehydrogenase-2 (ADH2) and aldehyde dehydrogenase-2 (ALDH2) gene polymorphisms play roles in ethanol metabolism, drinking behavior and esophageal carcinogenesis in Japanese; however, the combined influence of ADH2 and ALDH2 genotypes on other aerodigestive tract cancers has not been investigated. ADH2/ALDH2 genotyping was performed on lymphocyte DNA samples from Japanese alcoholic men (526 cancer-free; 159 with solitary or multiple aerodigestive tract cancers, including 33 oropharyngolaryngeal, 112 esophageal, 38 stomach and 22 multiple primary cancers in two or three organs). After adjustment for age, drinking and smoking habits, and ADH2/ALDH2 genotypes, the presence of either ADH2*1/2*1 or ALDH2*1/2*2 significantly increased the risk for oropharyngolaryngeal cancer [odds ratios (ORs), 6.68 with ADH2*1/2*1 and 18.52 with ALDH2*1/2*2] and esophageal cancer (ORs, 2.64 and 13.50, respectively). For patients with both ADH2*1/2*1 and ALDH2*1/2*2, the risks for oropharyngolaryngeal and esophageal cancers were enhanced in a multiplicative fashion (OR = 121.77 and 40.40, respectively). A positive association with ALDH2*1/2*2 alone was observed for stomach cancer patients who also had oropharyngolaryngeal and/or esophageal cancer (OR = 110.58), but it was not observed for those with stomach cancer alone. Furthermore, in the presence of ALDH2*1/2*2, the risks for multiple intra-esophageal cancers (OR = 3.43) and for esophageal cancer with oropharyngolaryngeal and/or stomach cancer (OR = 3.95) were higher than the risks for solitary intra-esophageal cancer and for esophageal cancer alone, but these tendencies were not observed for the ADH2*1/2*1 genotype. Alcoholics' population attributable risks due to ADH2/ALDH2 polymorphisms were estimated to be 82.0% for oropharyngolaryngeal cancer and 63.9% for esophageal cancer.
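The "multiplicative" description can be compared with the single-genotype odds ratios quoted in the abstract by simply multiplying them. This is a rough arithmetic check under the assumption that a purely multiplicative joint effect equals the product of the two adjusted ORs; it is not the authors' calculation, and the variable names are invented here.

```python
# Odds ratios quoted in the abstract:
# (ADH2*1/2*1 alone, ALDH2*1/2*2 alone, both genotypes together)
reported = {
    "oropharyngolaryngeal": (6.68, 18.52, 121.77),
    "esophageal": (2.64, 13.50, 40.40),
}

products = {}
for cancer, (or_adh2, or_aldh2, or_joint) in reported.items():
    # Expected joint OR if the two effects combined purely multiplicatively.
    products[cancer] = or_adh2 * or_aldh2
    print(f"{cancer}: product = {products[cancer]:.2f}, reported joint OR = {or_joint:.2f}")
```

The products (about 123.71 and 35.64) are of the same order as the reported joint ORs of 121.77 and 40.40, which fits the abstract's wording.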
Genome-Wide Analysis of Secondary Metabolite Gene Clusters in Ophiostoma ulmi and Ophiostoma novo-ulmi Reveals a Fujikurin-Like Gene Cluster with a Putative Role in Infection.

PubMed

Sbaraini, Nicolau; Andreis, Fábio C; Thompson, Claudia E; Guedes, Rafael L M; Junges, Ângela; Campos, Thais; Staats, Charley C; Vainstein, Marilene H; Ribeiro de Vasconcelos, Ana T; Schrank, Augusto

2017-01-01

The emergence of new microbial pathogens can result in destructive outbreaks, since their hosts have limited resistance and pathogens may be excessively aggressive. Described as the major ecological incident of the twentieth century, Dutch elm disease, caused by ascomycete fungi from the Ophiostoma genus, has caused a significant decline in elm tree populations (Ulmus sp.) in North America and Europe. Genome sequencing of the two main causative agents of Dutch elm disease (Ophiostoma ulmi and Ophiostoma novo-ulmi), along with closely related species with different lifestyles, allows for unique comparisons to be made to identify how pathogens and virulence determinants have emerged. Among several established virulence determinants, secondary metabolites (SMs) have been suggested to play significant roles during phytopathogen infection. Interestingly, the secondary metabolism of Dutch elm pathogens remains almost unexplored, and little is known about how SM biosynthetic genes are organized in these species. To better understand the metabolic potential of O. ulmi and O. novo-ulmi, we performed a deep survey and description of SM biosynthetic gene clusters (BGCs) in these species and assessed their conservation among eight species from the Ophiostomataceae family. Among 19 identified BGCs, a fujikurin-like gene cluster (OpPKS8) was unique to Dutch elm pathogens. Phylogenetic analysis revealed that orthologs for this gene cluster are widespread among phytopathogens and plant-associated fungi, suggesting that OpPKS8 may have been horizontally acquired by the Ophiostoma genus. Moreover, the detailed identification of several BGCs paves the way for future in-depth research and supports the potential impact of secondary metabolism on the lifestyle of the Ophiostoma genus.

A Zn(II)2Cys6 DNA binding protein regulates the sirodesmin PL biosynthetic gene cluster in Leptosphaeria maculans

PubMed Central

Fox, Ellen M.; Gardiner, Donald M.; Keller, Nancy P.; Howlett, Barbara J.

2008-01-01

A gene, sirZ, encoding a Zn(II)2Cys6 DNA binding protein is present in a cluster of genes responsible for the biosynthesis of the epipolythiodioxopiperazine (ETP) toxin, sirodesmin PL in the ascomycete plant pathogen, Leptosphaeria maculans. RNA-mediated silencing of sirZ gives rise to transformants that produce only residual amounts of sirodesmin PL and display a decrease in the transcription of several sirodesmin PL biosynthetic genes. This indicates that SirZ is a major regulator of this gene cluster. Proteins similar to SirZ are encoded in the gliotoxin biosynthetic gene cluster of Aspergillus fumigatus (gliZ) and in an ETP-like cluster in Penicillium lilacinoechinulatum (PlgliZ). Despite its high level of sequence similarity to gliZ, PlgliZ is unable to complement the gliotoxin-deficiency of a mutant of gliZ in A. fumigatus. Putative binding sites for these regulatory proteins in the promoters of genes in these clusters were predicted using bioinformatic analysis. These sites are similar to those commonly bound by other proteins with Zn(II)2Cys6 DNA binding domains. PMID:18023597
Identification of the Monooxygenase Gene Clusters Responsible for the Regioselective Oxidation of Phenol to Hydroquinone in Mycobacteria

PubMed Central

Furuya, Toshiki; Hirose, Satomi; Osanai, Hisashi; Semba, Hisashi; Kino, Kuniki

2011-01-01

Mycobacterium goodii strain 12523 is an actinomycete that is able to oxidize phenol regioselectively at the para position to produce hydroquinone. In this study, we investigated the genes responsible for this unique regioselective oxidation. On the basis of the fact that the oxidation activity of M. goodii strain 12523 toward phenol is induced in the presence of acetone, we first identified acetone-induced proteins in this microorganism by two-dimensional electrophoretic analysis. The N-terminal amino acid sequence of one of these acetone-induced proteins shares 100% identity with that of the protein encoded by the open reading frame Msmeg_1971 in Mycobacterium smegmatis strain mc2155, whose genome sequence has been determined. Since Msmeg_1971, Msmeg_1972, Msmeg_1973, and Msmeg_1974 constitute a putative binuclear iron monooxygenase gene cluster, we cloned this gene cluster of M. smegmatis strain mc2155 and its homologous gene cluster found in M. goodii strain 12523. Sequence analysis of these binuclear iron monooxygenase gene clusters revealed the presence of four genes designated mimABCD, which encode an oxygenase large subunit, a reductase, an oxygenase small subunit, and a coupling protein, respectively. When the mimA gene (Msmeg_1971) of M. smegmatis strain mc2155, which was also found to be able to oxidize phenol to hydroquinone, was deleted, this mutant lost the oxidation ability. This ability was restored by introduction of the mimA gene of M. smegmatis strain mc2155 or of M. goodii strain 12523 into this mutant. Interestingly, we found that these gene clusters also play essential roles in propane and acetone metabolism in these mycobacteria. PMID:21183637
Genomic and expression analysis of the vanG-like gene cluster of Clostridium difficile.

PubMed

Peltier, Johann; Courtin, Pascal; El Meouche, Imane; Catel-Ferreira, Manuella; Chapot-Chartier, Marie-Pierre; Lemée, Ludovic; Pons, Jean-Louis

2013-07-01

Primary antibiotic treatment of Clostridium difficile intestinal diseases requires metronidazole or vancomycin therapy. A cluster of genes homologous to enterococcal glycopeptides resistance vanG genes was found in the genome of C. difficile 630, although this strain remains sensitive to vancomycin. This vanG-like gene cluster was found to consist of five ORFs: the regulatory region consisting of vanR and vanS and the effector region consisting of vanG, vanXY and vanT. We found that 57 out of 83 C. difficile strains, representative of the main lineages of the species, harbour this vanG-like cluster. The cluster is expressed as an operon and, when present, is found at the same genomic location in all strains. The vanG, vanXY and vanT homologues in C. difficile 630 are co-transcribed and expressed to a low level throughout the growth phases in the absence of vancomycin. Conversely, the expression of these genes is strongly induced in the presence of subinhibitory concentrations of vancomycin, indicating that the vanG-like operon is functional at the transcriptional level in C. difficile. Hydrophilic interaction liquid chromatography (HILIC-HPLC) and MS analysis of cytoplasmic peptidoglycan precursors of C. difficile 630 grown without vancomycin revealed the exclusive presence of a UDP-MurNAc-pentapeptide with an alanine at the C terminus. UDP-MurNAc-pentapeptide [d-Ala] was also the only peptidoglycan precursor detected in C. difficile grown in the presence of vancomycin, corroborating the lack of vancomycin resistance. Peptidoglycan structures of a vanG-like mutant strain and of a strain lacking the vanG-like cluster did not differ from the C. difficile 630 strain, indicating that the vanG-like cluster also has no impact on cell-wall composition.
Biosynthetic Investigations of Lactonamycin and Lactonamycin Z: Cloning of the Biosynthetic Gene Clusters and Discovery of an Unusual Starter Unit

PubMed Central

Zhang, Xiujun; Alemany, Lawrence B.; Fiedler, Hans-Peter; Goodfellow, Michael; Parry, Ronald J.

2008-01-01

The antibiotics lactonamycin and lactonamycin Z provide attractive leads for antibacterial drug development. Both antibiotics contain a novel aglycone core called lactonamycinone. To gain insight into lactonamycinone biosynthesis, cloning and precursor incorporation experiments were undertaken. The lactonamycin gene cluster was initially cloned from Streptomyces rishiriensis. Sequencing of ca. 61 kb of S. rishiriensis DNA revealed the presence of 57 open reading frames. These included genes coding for the biosynthesis of l-rhodinose, the sugar found in lactonamycin, and genes similar to those in the tetracenomycin biosynthetic gene cluster. Since lactonamycin production by S. rishiriensis could not be sustained, additional proof for the identity of the S. rishiriensis cluster was obtained by cloning the lactonamycin Z gene cluster from Streptomyces sanglieri. Partial sequencing of the S. sanglieri cluster revealed 15 genes that exhibited a very high degree of similarity to genes within the lactonamycin cluster, as well as an identical organization. Double-crossover disruption of one gene in the S. sanglieri cluster abolished lactonamycin Z production, and production was restored by complementation. These results confirm the identity of the genetic locus cloned from S. sanglieri and indicate that the highly similar locus in S. rishiriensis encodes lactonamycin biosynthetic genes. Precursor incorporation experiments with S. sanglieri revealed that lactonamycinone is biosynthesized in an unusual manner whereby glycine or a glycine derivative serves as a starter unit that is extended by nine acetate units. Analysis of the gene clusters and of the precursor incorporation data suggested a hypothetical scheme for lactonamycinone biosynthesis. PMID:18070976
Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

PubMed

Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

2010-08-01

Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.
VRprofile: gene-cluster-detection-based profiling of virulence and antibiotic resistance traits encoded within genome sequences of pathogenic bacteria.

PubMed

Li, Jun; Tai, Cui; Deng, Zixin; Zhong, Weihong; He, Yongqun; Ou, Hong-Yu

2017-01-10

VRprofile is a Web server that facilitates rapid investigation of virulence and antibiotic resistance genes, and of the genetic contexts related to the transfer of these traits, in newly sequenced pathogenic bacterial genomes. Its backend database, MobilomeDB, was first built from sets of known gene cluster loci of bacterial type III/IV/VI/VII secretion systems and mobile genetic elements, including integrative and conjugative elements, prophages, class I integrons, IS elements and pathogenicity/antibiotic resistance islands. VRprofile is thus able to co-localize the homologs of these conserved gene clusters using HMMer or BLASTp searches. By integrating the homologous gene cluster search module with a sequence composition module, VRprofile has exhibited better performance for island-like region predictions than the other widely used methods. In addition, VRprofile provides an integrated Web interface for aligning and visualizing identified gene clusters with MobilomeDB-archived gene clusters, or with a varied set of bacterial genomes. VRprofile may help meet the increasing demand for re-annotation of bacterial variable regions and aid the real-time definition of disease-relevant gene clusters in pathogenic bacteria of interest. VRprofile is freely available at http://bioinfo-mml.sjtu.edu.cn/VRprofile. © The Author 2017. Published by Oxford University Press. All rights reserved.
Genome-wide association study identifies the SERPINB gene cluster as a susceptibility locus for food allergy.

PubMed

Marenholz, Ingo; Grosche, Sarah; Kalb, Birgit; Rüschendorf, Franz; Blümchen, Katharina; Schlags, Rupert; Harandi, Neda; Price, Mareike; Hansen, Gesine; Seidenberg, Jürgen; Röblitz, Holger; Yürek, Songül; Tschirner, Sebastian; Hong, Xiumei; Wang, Xiaobin; Homuth, Georg; Schmidt, Carsten O; Nöthen, Markus M; Hübner, Norbert; Niggemann, Bodo; Beyer, Kirsten; Lee, Young-Ae

2017-10-20

Genetic factors and mechanisms underlying food allergy are largely unknown. Due to heterogeneity of symptoms a reliable diagnosis is often difficult to make. Here, we report a genome-wide association study on food allergy diagnosed by oral food challenge in 497 cases and 2387 controls. We identify five loci at genome-wide significance, the clade B serpin (SERPINB) gene cluster at 18q21.3, the cytokine gene cluster at 5q31.1, the filaggrin gene, the C11orf30/LRRC32 locus, and the human leukocyte antigen (HLA) region. Stratifying the results for the causative food demonstrates that association of the HLA locus is peanut allergy-specific whereas the other four loci increase the risk for any food allergy. Variants in the SERPINB gene cluster are associated with SERPINB10 expression in leukocytes. Moreover, SERPINB genes are highly expressed in the esophagus. All identified loci are involved in immunological regulation or epithelial barrier function, emphasizing the role of both mechanisms in food allergy.
The impact of polyploidy on the evolution of a complex NB-LRR resistance gene cluster in soybean

USDA-ARS's Scientific Manuscript database

A comparative genomics approach was used to investigate the evolution of a complex NB-LRR gene cluster found in soybean (Glycine max), common bean (Phaseolus vulgaris), and other legumes. In soybean, the cluster is associated with several disease resistance (R) genes of known function including Rpg1...
Association between ALDH2 and ADH1B polymorphisms, alcohol drinking and gastric cancer: a replication and mediation analysis.

PubMed

Ishioka, Kuka; Masaoka, Hiroyuki; Ito, Hidemi; Oze, Isao; Ito, Seiji; Tajika, Masahiro; Shimizu, Yasuhiro; Niwa, Yasumasa; Nakamura, Shigeo; Matsuo, Keitaro

2018-04-03

Aldehyde dehydrogenase 2 (ALDH2; rs671, Glu504Lys) and alcohol dehydrogenase 1B (ADH1B; rs1229984, His47Arg) polymorphisms have a strong impact on carcinogenic acetaldehyde accumulation after alcohol drinking. To date, however, evidence for a significant ALDH2-alcohol drinking interaction and a mediation effect of ALDH2/ADH1B through alcohol drinking on gastric cancer have remained unclear. We conducted two case-control studies to validate the interaction and to estimate the mediation effect on gastric cancer. We calculated odds ratios (OR) and 95% confidence intervals (CI) for ALDH2/ADH1B genotypes and alcohol drinking using conditional logistic regression models after adjustment for potential confounding in the HERPACC-2 (697 cases and 1372 controls) and HERPACC-3 studies (678 cases and 678 controls). We also conducted a mediation analysis of the combination of the two studies to assess whether the effects of these polymorphisms operated through alcohol drinking or through other pathways. ALDH2 Lys alleles had a higher risk with increased alcohol consumption compared with ALDH2 Glu/Glu (OR for heavy drinking, 3.57; 95% CI 2.04-6.27; P for trend = 0.007), indicating a significant ALDH2-alcohol drinking interaction (P interaction = 0.024). The mediation analysis indicated a significant positive direct effect (OR 1.67; 95% CI 1.38-2.03) and a protective indirect effect (OR 0.84; 95% CI 0.76-0.92) of the ALDH2 Lys alleles with the ALDH2-alcohol drinking interaction. No significant association of ADH1B with gastric cancer was observed. The observed ALDH2-alcohol drinking interaction and the direct effect of ALDH2 Lys alleles may suggest the involvement of acetaldehyde in the development of gastric cancer.
Organization and differential regulation of a cluster of lignin peroxidase genes of Phanerochaete chrysosporium

Treesearch

Philip Stewart; Daniel Cullen

1999-06-01

The lignin peroxidases of Phanerochaete chrysosporium are encoded by a minimum of 10 closely related genes. Physical and genetic mapping of a cluster of eight lip genes revealed six genes occurring in pairs and transcriptionally convergent, suggesting that portions of the lip family arose by gene duplication events. The completed sequence of lipG and lipJ, together...
WordCluster: detecting clusters of DNA words and genomic elements

PubMed Central

2011-01-01

Background: Many k-mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well-established examples are genes, transcription factor binding sites (TFBSs), CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results: We introduce here an algorithm to detect clusters of DNA words (k-mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method as a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation varies drastically between the inside and the outside of the clusters. As another example, we used WordCluster to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions: WordCluster seems to predict biologically meaningful clusters of DNA words (k-mers) and genomic entities. The implementation of the method as a web server is available at http://bioinfo2.ugr.es/wordCluster/wordCluster.php, including additional features such as the detection of co-localization with gene regions and an annotation enrichment tool for functional analysis of overlapped genes. PMID:21261981
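The idea of grouping copies of a DNA word by the distance between consecutive copies can be illustrated with a minimal Python sketch. This is not the WordCluster implementation: the statistical-significance step described in the abstract is deliberately omitted, and the function names, the hand-picked `max_gap` threshold and the `min_copies` cutoff are all assumptions introduced here for illustration only.

```python
from typing import List, Tuple


def find_word_positions(sequence: str, word: str) -> List[int]:
    """Return 0-based start positions of every (possibly overlapping) copy of word."""
    positions = []
    start = sequence.find(word)
    while start != -1:
        positions.append(start)
        start = sequence.find(word, start + 1)
    return positions


def chain_clusters(
    positions: List[int], max_gap: int, min_copies: int = 3
) -> List[Tuple[int, int, int]]:
    """Chain consecutive copies whose start-to-start distance is <= max_gap.

    Returns one (first_start, last_start, n_copies) tuple per chain that
    contains at least min_copies copies. max_gap is a hypothetical,
    user-supplied threshold, not a statistically derived one.
    """
    clusters = []
    if not positions:
        return clusters
    run = [positions[0]]
    for pos in positions[1:]:
        if pos - run[-1] <= max_gap:
            run.append(pos)  # still inside the current chain
        else:
            if len(run) >= min_copies:
                clusters.append((run[0], run[-1], len(run)))
            run = [pos]  # start a new chain
    if len(run) >= min_copies:
        clusters.append((run[0], run[-1], len(run)))
    return clusters
```

A call such as `chain_clusters(find_word_positions(seq, "CAG"), max_gap=5)` returns the chains of closely spaced CAG copies in `seq`. A method of the kind the abstract describes would replace the fixed `max_gap` with a criterion tied to the observed distance distribution and would attach a significance value to each chain.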
Persistence of hAQP1 expression in human salivary gland cells following AdhAQP1 transduction is associated with a lack of methylation of hCMV promoter

PubMed Central

Zheng, C; Baum, BJ; Liu, X; Goldsmith, CM; Perez, P; Jang, S-I; Cotrim, AP; McCullagh, L; Ambudkar, IS; Alevizos, I

2017-01-01

In 2012, we reported that 5 out of 11 subjects in a clinical trial (NCT00372320) administering AdhAQP1 to radiation-damaged parotid glands showed increased saliva flow rates and decreased symptoms over the initial 42 days. AdhAQP1 is a first-generation, E1-deleted, replication-defective, serotype 5 adenoviral vector encoding human aquaporin-1 (hAQP1). This vector uses the human cytomegalovirus enhancer/promoter (hCMVp). As subject peak responses were at times much longer (7–42 days) than expected, we hypothesized that the hCMVp may not be methylated in human salivary gland cells to the extent previously observed in rodent salivary gland cells. This hypothesis was supported in human salivary gland primary cultures and human salivary gland cell lines after transduction with AdhAQP1. Importantly, hAQP1 maintained its function in those cells. Conversely, when we transduced mouse and rat cell lines in vitro and submandibular glands in vivo with AdhAQP1, the hCMVp was gradually methylated over time and associated with decreased hAQP1 expression and function in vitro and decreased hAQP1 expression in vivo. These data suggest that the hCMVp in AdhAQP1 was probably not methylated in transduced human salivary gland cells of responding subjects, resulting in an unexpectedly longer functional expression of hAQP1. PMID:26177970
Identification of an Imprinted Gene Cluster in the X-Inactivation Center

PubMed Central

Kobayashi, Shin; Totoki, Yasushi; Soma, Miki; Matsumoto, Kazuya; Fujihara, Yoshitaka; Toyoda, Atsushi; Sakaki, Yoshiyuki; Okabe, Masaru; Ishino, Fumitoshi

2013-01-01

Mammalian development is strongly influenced by the epigenetic phenomenon called genomic imprinting, in which either the paternal or the maternal allele of imprinted genes is expressed. Paternally expressed Xist, an imprinted gene, has been considered as a single cis-acting factor to inactivate the paternally inherited X chromosome (Xp) in preimplantation mouse embryos. This means that X-chromosome inactivation also entails gene imprinting at a very early developmental stage. However, the precise mechanism of imprinted X-chromosome inactivation remains unknown and there is little information about imprinted genes on X chromosomes. In this study, we examined whether there are imprinted genes other than Xist that are expressed from the inactive paternal X chromosome and expressed in female embryos at the preimplantation stage. We focused on small RNAs and compared their expression patterns between sexes by tagging the female X chromosome with green fluorescent protein. As a result, we identified two microRNAs (miRNAs), miR-374-5p and miR-421-3p, mapped adjacent to Xist, that were predominantly expressed in female blastocysts. Allelic expression analysis revealed that these miRNAs were indeed imprinted and expressed from the Xp. Further analysis of the imprinting status of adjacent loci led to the discovery of a large cluster of imprinted genes expressed from the Xp: Jpx, Ftx and Zcchc13. To our knowledge, this is the first identified cluster of imprinted genes in the cis-acting regulatory region termed the X-inactivation center. This finding may help in understanding the molecular mechanisms regulating imprinted X-chromosome inactivation during early mammalian development. PMID:23940725
The PhytoClust tool for metabolic gene clusters discovery in plant genomes.

PubMed

Töpfer, Nadine; Fuchs, Lisa-Maria; Aharoni, Asaph

2017-07-07

The existence of Metabolic Gene Clusters (MGCs) in plant genomes has recently attracted increased interest. Thus far, MGCs have commonly been identified for pathways of specialized metabolism, mostly those associated with terpene-type products. For efficient identification of novel MGCs, computational approaches are essential. Here, we present PhytoClust, a tool for the detection of candidate MGCs in plant genomes. The algorithm employs a collection of enzyme families related to plant specialized metabolism, translated into hidden Markov models, to mine given genome sequences for physically co-localized metabolic enzymes. Our tool accurately identifies previously characterized plant MGCs. An exhaustive search of 31 plant genomes detected 1232 and 5531 putative gene cluster types and candidates, respectively. Clustering analysis of putative MGC types by species reflected plant taxonomy. Furthermore, enrichment analysis revealed taxon- and species-specific enrichment of certain enzyme families in MGCs. When operating through our web interface, PhytoClust users can mine a genome either based on a list of known cluster types or by defining new cluster rules. Moreover, for selected plant species, the output can be complemented by co-expression analysis. Altogether, we envisage PhytoClust enhancing the discovery of novel MGCs, which will in turn impact the exploration of plant metabolism. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages

USDA-ARS's Scientific Manuscript database

Fungi that have the enzymes cyanase and carbonic anhydrase show a limited capacity to detoxify cyanate, a fungicide employed by both plants and humans. Here, we describe a novel two-gene cluster that comprises duplicated cyanase and carbonic anhydrase copies, which we name the CCA gene cluster, trac...
Amplification of the entire kanamycin biosynthetic gene cluster during empirical strain improvement of Streptomyces kanamyceticus.

PubMed

Yanai, Koji; Murakami, Takeshi; Bibb, Mervyn

2006-06-20

Streptomyces kanamyceticus 12-6 is a derivative of the wild-type strain developed for industrial kanamycin (Km) production. Southern analysis and DNA sequencing revealed amplification of a large genomic segment including the entire Km biosynthetic gene cluster in the chromosome of strain 12-6. At 145 kb, the amplifiable unit of DNA (AUD) is the largest AUD reported in Streptomyces. Striking repetitive DNA sequences belonging to the clustered regularly interspaced short palindromic repeats family were found in the AUD and may play a role in its amplification. Strain 12-6 contains a mixture of different chromosomes with varying numbers of AUDs, sometimes exceeding 36 copies and producing an amplified region >5.7 Mb. The level of Km production depended on the copy number of the Km biosynthetic gene cluster, suggesting that DNA amplification occurred during strain improvement as a consequence of selection for increased Km resistance. Amplification of DNA segments including entire antibiotic biosynthetic gene clusters might be a common mechanism leading to increased antibiotic production in industrial strains.
Bacillus sp. CDB3 isolated from cattle dip-sites possesses two ars gene clusters.

PubMed

Bhat, Somanath; Luo, Xi; Xu, Zhiqiang; Liu, Lixia; Zhang, Ren

2011-01-01

Contamination of soil and water by arsenic is a global problem. In Australia, the dipping of cattle in arsenic-containing solution to control cattle ticks during the last century has left many sites heavily contaminated with arsenic and other toxicants. We had previously isolated five soil bacterial strains (CDB1-5) highly resistant to arsenic. To understand the resistance mechanism, molecular studies have been carried out. Two chromosome-encoded arsenic resistance (ars) gene clusters have been cloned from CDB3 (Bacillus sp.). Both function in Escherichia coli, and cluster 1 confers a much higher resistance to the toxic metalloid. Cluster 2 is smaller, possessing four open reading frames (ORFs), arsRorf2BC, similar to that identified in the Bacillus subtilis Skin element. Of the eight ORFs in cluster 1, five are analogs of common ars genes found in other bacteria but are organized in a unique order, arsRBCDA instead of arsRDABC. Three other putative genes are located directly downstream and designated arsTIP based on the homologies of their theoretical translation sequences to thioredoxin reductases, iron-sulphur cluster proteins and protein phosphatases, respectively. The latter two are novel to any known ars operon. The arsD gene was cloned from a Bacillus species for the first time, and the predicted protein differs from the well-studied E. coli ArsD by lacking two pairs of C-terminal cysteine residues. Its functional involvement in arsenic resistance has been confirmed by a deletion experiment. There is also an inverted repeat in the intergenic region between arsC and arsD, implying some as yet unknown transcriptional regulation.
An Integrated workflow for phenazine biosynthetic gene cluster discovery and characterization

USDA-ARS's Scientific Manuscript database

Increasing availability of new genomes and putative biosynthetic gene clusters (BGCs) has extended the opportunity to access novel chemical diversity for agriculture, medicine, environmental and industrial purposes. However, functional characterization of BGCs through heterologous expression is limi...
An NPC1L1 gene promoter variant is associated with autosomal dominant hypercholesterolemia.

PubMed

Martín, B; Solanas-Barca, M; García-Otín, A-L; Pampín, S; Cofán, M; Ros, E; Rodríguez-Rey, J-C; Pocoví, M; Civeira, F

2010-05-01

A substantial number of subjects with autosomal dominant hypercholesterolemia (ADH) do not have LDL receptor (LDLR) or apolipoprotein B (APOB) mutations. Some ADH subjects appear to hyperabsorb sterols from the intestine, thus we hypothesized that they could have variants of the Niemann-Pick C1-Like 1 gene (NPC1L1). NPC1L1 encodes a crucial protein involved in intestinal sterol absorption. Four NPC1L1 variants (-133A>G, -18C>A, 1679C>G, 28650A>G) were analyzed in 271 (155 women and 116 men) ADH bearers without mutations in LDLR or APOB aged 30-70 years and 274 (180 women and 94 men) control subjects aged 25-65 years. The AC haplotype determined by the -133A>G and -18C>A variants was underrepresented in ADH subjects compared to controls (p=0.01). In the ADH group, cholesterol absorption/synthesis markers were significantly lower in AC homozygotes than in all other haplotypes. Electrophoretic mobility shift assay (EMSA) results revealed that the -133A-specific oligonucleotide produced a stronger retarded band than the -133G allele. Luciferase activity with the NPC1L1 -133G variant was 2.5-fold higher than with the -133A variant. The -133A>G polymorphism exerts a significant effect on NPC1L1 promoter activity. NPC1L1 promoter variants might explain in part the hypercholesterolemic phenotype of some subjects with nonLDLR/nonAPOB ADH. Copyright 2009 Elsevier B.V. All rights reserved.

Characterization of bafilomycin biosynthesis in Kitasatospora setae KM-6054 and comparative analysis of gene clusters in Actinomycetales microorganisms.

PubMed

Nara, Ayako; Hashimoto, Takuya; Komatsu, Mamoru; Nishiyama, Makoto; Kuzuyama, Tomohisa; Ikeda, Haruo

2017-05-01

Bafilomycins A1, C1 and B1 (setamycin) produced by Kitasatospora setae KM-6054 belong to the plecomacrolide family, whose members exhibit antibacterial, antifungal, antineoplastic and immunosuppressive activities. An analysis of gene clusters from K. setae KM-6054 governing the biosynthesis of bafilomycins revealed that the region contains five large open reading frames (ORFs) encoding the multifunctional polypeptides of bafilomycin polyketide synthases (PKSs). These clustered PKS genes, which are responsible for bafilomycin biosynthesis, together encode 11 homologous sets of enzyme activities, each catalyzing a specific round of polyketide chain elongation. The region contains an additional 13 ORFs spanning a distance of 73,287 bp, some of which encode polypeptides governing other key steps in bafilomycin biosynthesis. Five ORFs, BfmB, BfmC, BfmD, BfmE and BfmF, were involved in the formation of methoxymalonyl-acyl carrier protein (ACP). Two possible regulatory genes, bfmR and bfmH, were found downstream of the above genes. A gene-knockout analysis revealed that BfmR was only a transcriptional regulator for the transcription of bafilomycin biosynthetic genes. Two genes, bfmI and bfmJ, were found downstream of bfmH. An analysis of these gene-disruption mutants in addition to an enzymatic analysis of BfmI and BfmJ revealed that BfmJ activated fumarate and BfmI functioned as a catalyst to form a fumaryl ester at the C21 hydroxyl residue of bafilomycin A1. A comparative analysis of bafilomycin gene clusters in K. setae KM-6054, Streptomyces lohii JCM 14114 and Streptomyces griseus DSM 2608 revealed that the corresponding ORFs of the gene clusters in the two Streptomyces strains were quite similar to each other. However, each ORF of the gene cluster in K. setae KM-6054 showed lower similarity to the corresponding ORF in the two Streptomyces species.
pySAPC, a python package for sparse affinity propagation clustering: Application to odontogenesis whole genome time series gene-expression data.

PubMed

Cao, Huojun; Amendt, Brad A

2016-11-01

Developmental dental anomalies are common forms of congenital defects. The molecular mechanisms of dental anomalies are poorly understood. Systematic approaches such as clustering genes based on similar expression patterns could identify novel genes involved in dental anomalies and provide a framework for understanding molecular regulatory mechanisms of these genes during tooth development (odontogenesis). A Python package (pySAPC) implementing a sparse affinity propagation clustering algorithm for large datasets was developed. Whole-genome pairwise similarity was calculated from expression patterns across 45 microarrays covering several stages of odontogenesis. pySAPC identified 743 gene clusters based on expression pattern similarity during mouse tooth development. Three clusters are significantly enriched for genes associated with dental anomalies (with FDR <0.1). The three clusters of genes have distinct expression patterns during odontogenesis. Clustering genes based on similar expression profiles recovered several known regulatory relationships for genes involved in odontogenesis, as well as many novel genes that may be involved with the same genetic pathways as genes that have already been shown to contribute to dental defects. By using a sparse similarity matrix, pySAPC uses much less memory and CPU time than the original affinity propagation program, which uses a full similarity matrix. This Python package will be useful for many applications where dataset(s) are too large to use a full similarity matrix. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016. Published by Elsevier B.V.
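The memory saving from a sparse similarity matrix can be illustrated with a short Python sketch. It does not use or reproduce the pySAPC API and does not perform affinity propagation itself. It only shows the preprocessing idea of storing the gene pairs whose similarity clears a cutoff rather than all n × n entries. Pearson correlation as the similarity measure, the `min_r` cutoff and the function names are assumptions made here for illustration.

```python
from itertools import combinations
from math import sqrt
from typing import Dict, List, Tuple


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def sparse_similarity(
    profiles: Dict[str, List[float]], min_r: float
) -> Dict[Tuple[str, str], float]:
    """Keep only gene pairs with correlation >= min_r.

    The result maps (gene_a, gene_b) with gene_a < gene_b to the correlation.
    Every pair is still evaluated once; only the storage is sparse.
    """
    kept = {}
    for gene_a, gene_b in combinations(sorted(profiles), 2):
        r = pearson(profiles[gene_a], profiles[gene_b])
        if r >= min_r:
            kept[(gene_a, gene_b)] = r
    return kept
```

With a strict cutoff most pairs are dropped, so a clustering step that exchanges messages only along the stored pairs touches far fewer entries than one working on the full matrix.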
Cloning and characterization of a Pseudomonas mendocina KR1 gene cluster encoding toluene-4-monooxygenase.

PubMed Central

Yen, K M; Karl, M R; Blatt, L M; Simon, M J; Winter, R B; Fausset, P R; Lu, H S; Harcourt, A A; Chen, K K

1991-01-01

Pseudomonas mendocina KR1 metabolizes toluene as a carbon source by a previously unknown pathway. The initial step of the pathway is hydroxylation of toluene to form p-cresol by a multicomponent toluene-4-monooxygenase (T4MO) system. The T4MO enzyme system has broad substrate specificity and provides a new opportunity for biodegradation of toxic compounds and bioconversions. Its known activities include conversion of a variety of phenyl compounds into the phenolic derivatives and the complete degradation of trichloroethylene. We have cloned and characterized a gene cluster from KR1 that determines the T4MO activity. To clone the T4MO genes, KR1 DNA libraries were constructed in Escherichia coli HB101 by using a broad-host-range vector and transferred to a KR1 mutant able to grow on p-cresol but not on toluene. An insert consisting of two SacI fragments of identical size (10.2 kb) was shown to complement the mutant for growth on toluene. One of the SacI fragments, when cloned into the E. coli vector pUC19, was found to direct the synthesis of indigo dye. The indigo-forming property was correlated with the presence of T4MO activity. The T4MO genes were mapped to a 3.6-kb region, and the direction of transcription was determined. DNA sequencing and N-terminal amino acid determination identified a five-gene cluster, tmoABCDE, within this region. Expression of this cluster carrying a single mutation in each gene demonstrated that each of the five genes is essential for T4MO activity. Other evidence presented indicated that none of the tmo genes was involved in the regulation of the tmo gene cluster, in the control of substrate transport for the T4MO system, or in major processing of the products of the tmo genes. It was tentatively concluded that the tmoABCDE genes encode structural polypeptides of the T4MO enzyme system. One of the tmo genes was tentatively identified as a ferredoxin gene. PMID:1885512
The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter

PubMed Central

Darbani, Behrooz; Motawia, Mohammed Saddik; Olsen, Carl Erik; Nour-Eldin, Hussam H.; Møller, Birger Lindberg; Rook, Fred

2016-01-01

Genomic gene clusters for the biosynthesis of chemical defence compounds are increasingly identified in plant genomes. We previously reported the independent evolution of biosynthetic gene clusters for cyanogenic glucoside biosynthesis in three plant lineages. Here we report that the gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor additionally contains a gene, SbMATE2, encoding a transporter of the multidrug and toxic compound extrusion (MATE) family, which is co-expressed with the biosynthetic genes. The predicted localisation of SbMATE2 to the vacuolar membrane was demonstrated experimentally by transient expression of a SbMATE2-YFP fusion protein and confocal microscopy. Transport studies in Xenopus laevis oocytes demonstrate that SbMATE2 is able to transport dhurrin. In addition, SbMATE2 was able to transport non-endogenous cyanogenic glucosides, but not the anthocyanin cyanidin 3-O-glucoside or the glucosinolate indol-3-yl-methyl glucosinolate. The genomic co-localisation of a transporter gene with the biosynthetic genes producing the transported compound is discussed in relation to the role self-toxicity of chemical defence compounds may play in the formation of gene clusters. PMID:27841372
A Granular Self-Organizing Map for Clustering and Gene Selection in Microarray Data.

PubMed

Ray, Shubhra Sankar; Ganivada, Avatharam; Pal, Sankar K

2016-09-01

A new granular self-organizing map (GSOM) is developed by integrating the concept of a fuzzy rough set with the SOM. While training the GSOM, the weights of a winning neuron and the neighborhood neurons are updated through a modified learning procedure. The neighborhood is newly defined using the fuzzy rough sets. The clusters (granules) evolved by the GSOM are presented to a decision table as its decision classes. Based on the decision table, a method of gene selection is developed. The effectiveness of the GSOM is shown in both clustering samples and developing an unsupervised fuzzy rough feature selection (UFRFS) method for gene selection in microarray data. While the superior results of the GSOM, as compared with the related clustering methods, are provided in terms of β-index, DB-index, Dunn-index, and fuzzy rough entropy, the genes selected by the UFRFS are not only better in terms of classification accuracy and a feature evaluation index, but also statistically more significant than those selected by the related unsupervised methods. The C code for the GSOM and UFRFS is available online at http://avatharamg.webs.com/software-code.
Chassis organism from Corynebacterium glutamicum--a top-down approach to identify and delete irrelevant gene clusters.

PubMed

Unthan, Simon; Baumgart, Meike; Radek, Andreas; Herbst, Marius; Siebert, Daniel; Brühl, Natalie; Bartsch, Anna; Bott, Michael; Wiechert, Wolfgang; Marin, Kay; Hans, Stephan; Krämer, Reinhard; Seibold, Gerd; Frunzke, Julia; Kalinowski, Jörn; Rückert, Christian; Wendisch, Volker F; Noack, Stephan

2015-02-01

For synthetic biology applications, a robust structural basis is required, which can be constructed either from scratch or in a top-down approach starting from any existing organism. In this study, we initiated the top-down construction of a chassis organism from Corynebacterium glutamicum ATCC 13032, aiming for the relevant gene set to maintain its fast growth on defined medium. We evaluated each native gene for its essentiality considering expression levels, phylogenetic conservation, and knockout data. Based on this classification, we determined 41 gene clusters ranging from 3.7 to 49.7 kbp as target sites for deletion. 36 deletions were successful and 10 genome-reduced strains showed impaired growth rates, indicating that the deletions hit genes that are relevant for maintaining biological fitness at wild-type level. In contrast, 26 deleted clusters were found to include exclusively irrelevant genes for growth on defined medium. A combinatory deletion of all irrelevant gene clusters would, in a prophage-free strain, decrease the size of the native genome by about 722 kbp (22%) to 2561 kbp. Finally, five combinatory deletions of irrelevant gene clusters were investigated. The study introduces the novel concept of relevant genes and demonstrates general strategies to construct a chassis suitable for biotechnological application. © 2014 The Authors. Biotechnology Journal published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim. This is an open access article under the terms of the Creative Commons Attribution-Non-Commercial-NoDerivs Licence, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
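The counts and genome sizes quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function name is hypothetical, and the starting genome size is not stated in the abstract but is inferred here as 2561 + 722 kbp.

```python
def genome_reduction(remaining_kbp, deleted_kbp):
    """Return (starting size in kbp, deleted fraction of the starting size).

    The starting size is an inference (remaining + deleted), not a value
    quoted in the abstract.
    """
    starting_kbp = remaining_kbp + deleted_kbp
    return starting_kbp, deleted_kbp / starting_kbp


# Figures quoted in the abstract
successful_deletions = 36
impaired_growth = 10
irrelevant_clusters = 26

# The two outcome classes should account for every successful deletion.
outcomes_consistent = (impaired_growth + irrelevant_clusters) == successful_deletions

starting_kbp, fraction = genome_reduction(remaining_kbp=2561, deleted_kbp=722)
print(f"outcome counts consistent: {outcomes_consistent}")
print(f"inferred starting size: {starting_kbp} kbp, deleted: {fraction:.1%}")
```

Under these assumptions the deleted fraction rounds to the 22% given in the abstract.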
Alcohol and aldehyde dehydrogenase gene polymorphisms influence susceptibility to esophageal cancer in Japanese alcoholics.

PubMed

Yokoyama, A; Muramatsu, T; Omori, T; Matsushita, S; Yoshimizu, H; Higuchi, S; Yokoyama, T; Maruyama, K; Ishii, H

1999-11-01

screening procedure for the highest risk gene combination (ADH2*1/2*1 and ALDH2*1/2*2) will require further investigation.
Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

PubMed Central

Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

1993-01-01

Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. PMID:8497043
Biogeochemical sampling in the Mahd Adh Dhahab District, Kingdom of Saudi Arabia

USGS Publications Warehouse

Ebens, Richard J.; Shacklette, Hansford T.; Worl, Ronald G.

1983-01-01

A biogeochemical reconnaissance of the Mahd adh Dhahab district, Kingdom of Saudi Arabia, confirms the ability of deep-rooted Acacia trees to reflect bedrock concentrations of some trace elements. The analytical values for lead, zinc, selenium, and cadmium in ash of tree branches are significantly higher in samples from areas of known mineralization (13 sites) than in samples from areas of no known mineralization (12 sites). Geometric mean concentrations of these elements in the two areas (mineralized; nonmineralized), quoted as parts per million in ash, are lead (122; 28), zinc (713; 443), selenium (1.2; 0.6), and cadmium (1.4; 0.5). The range of molybdenum values in ash from the two areas is similar, but a cluster of four sites in an area classified as nonmineralized corresponds to an area where the U.S. Geological Survey reported anomalous molybdenum values in rock in 1965. Results for other elements were either equivocal (mercury, tellurium, silver) or showed no correspondence to the two areas. Mean values for barium, manganese, potassium, and sodium are significantly higher in areas of no known mineralization, but we conclude that this reflects a difference in country rock major-element chemistry rather than the effect of ore-forming processes. The pattern of trace-metal values in Acacia ash is present whether the sampled tree grows on bedrock, on talus, or on residual or modern alluvium. This fact suggests that the trace-element chemistry of the trees reflects bedrock geochemistry and implies that Acacia biogeochemistry could be applied as a prospecting tool in areas where bedrock is not well exposed.
Identification and manipulation of the pleuromutilin gene cluster from Clitopilus passeckerianus for increased rapid antibiotic production

NASA Astrophysics Data System (ADS)

Bailey, Andy M.; Alberti, Fabrizio; Kilaru, Sreedhar; Collins, Catherine M.; de Mattos-Shipley, Kate; Hartley, Amanda J.; Hayes, Patrick; Griffin, Alison; Lazarus, Colin M.; Cox, Russell J.; Willis, Christine L.; O'Dwyer, Karen; Spence, David W.; Foster, Gary D.

2016-05-01

Semi-synthetic derivatives of the tricyclic diterpene antibiotic pleuromutilin from the basidiomycete Clitopilus passeckerianus are important in combatting bacterial infections in human and veterinary medicine. These compounds belong to the only new class of antibiotics for human applications, with a novel mode of action and lack of cross-resistance, representing a class with great potential. Basidiomycete fungi, being dikaryotic, are not generally amenable to strain improvement. We report identification of the seven-gene pleuromutilin gene cluster and verify that, using various targeted approaches aimed at increasing antibiotic production in C. passeckerianus, no improvement in yield was achieved. The seven-gene pleuromutilin cluster was reconstructed within Aspergillus oryzae, giving production of pleuromutilin in an ascomycete, with a significant increase (2106%) in production. This is the first gene cluster from a basidiomycete to be successfully expressed in an ascomycete, and paves the way for the exploitation of a metabolically rich but traditionally overlooked group of fungi.
Molecular evolution of the nif gene cluster carrying nifI1 and nifI2 genes in the Gram-positive phototrophic bacterium Heliobacterium chlorum.

PubMed

Enkh-Amgalan, Jigjiddorj; Kawasaki, Hiroko; Seki, Tatsuji

2006-01-01

A major nif cluster was detected in the strictly anaerobic, Gram-positive phototrophic bacterium Heliobacterium chlorum. The cluster consisted of 11 genes arranged within a 10 kb region in the order nifI1, nifI2, nifH, nifD, nifK, nifE, nifN, nifX, fdx, nifB and nifV. The phylogenetic position of Hbt. chlorum was the same in the NifH, NifD, NifK, NifE and NifN trees; Hbt. chlorum formed a cluster with Desulfitobacterium hafniense, the closest neighbour of heliobacteria based on the 16S rRNA phylogeny, and two species of the genus Geobacter belonging to the Deltaproteobacteria. Two nifI genes, known to occur in the nif clusters of methanogenic archaea between nifH and nifD, were found upstream of the nifH gene of Hbt. chlorum. The organization of the nif operon and the phylogeny of individual and concatenated gene products showed that the Hbt. chlorum nif operon carrying nifI genes upstream of the nifH gene was an intermediate between the nif operon with nifI downstream of nifH (group II and III of the nitrogenase classification) and the nif operon lacking nifI (group I). Thus, the phylogenetic position of Hbt. chlorum nitrogenase may reflect an evolutionary stage of a divergence of the two nitrogenase groups, with group I consisting of the aerobic diazotrophs and group II consisting of strictly anaerobic prokaryotes.
Silencing of a second dimethylallyltryptophan synthase of Penicillium roqueforti reveals a novel clavine alkaloid gene cluster.

PubMed

Fernández-Bodega, Ángeles; Álvarez-Álvarez, Rubén; Liras, Paloma; Martín, Juan F

2017-08-01

Penicillium roqueforti produces several prenylated indole alkaloids, including roquefortine C and clavine alkaloids. The first step in the biosynthesis of roquefortine C is the prenylation of tryptophan-derived dipeptides by a dimethylallyltryptophan synthase, specific for roquefortine biosynthesis (roquefortine prenyltransferase). A second dimethylallyltryptophan synthase, DmaW2, different from the roquefortine prenyltransferase, has been studied in this article. Silencing the gene encoding this second dimethylallyltryptophan synthase, dmaW2, proved that inactivation of this gene does not prevent the production of roquefortine C, but suppresses the formation of other indole alkaloids. Mass spectrometry studies have identified these compounds as isofumigaclavine A, the final product of the pathway, and prenylated intermediates. The silencing does not affect the production of mycophenolic acid and andrastin A. A bioinformatic study of the genome of P. roqueforti revealed that DmaW2 (renamed IfgA) is a prenyltransferase involved in isofumigaclavine A biosynthesis encoded by a gene located in a six-gene cluster (cluster A). A second, three-gene cluster (cluster B) encodes the so-called yellow enzyme and enzymes for the late steps of the conversion of festuclavine to isofumigaclavine A. The yellow enzyme contains a tyrosine-181 at its active center, as occurs in Neosartorya fumigata, but in contrast to the Clavicipitaceae fungi. A complete isofumigaclavines A and B biosynthetic pathway is proposed based on the findings of these studies on the biosynthesis of clavine alkaloids.
Characterisation of the gene cluster for L-rhamnose catabolism in the yeast Scheffersomyces (Pichia) stipitis

Treesearch

Outi M. Koivistoinen; Mikko Arvas; Jennifer R. Headman; Martina Andberg; Merja Penttilä; Thomas W. Jeffries; Peter Richard

2012-01-01

In Scheffersomyces (Pichia) stipitis and related fungal species the genes for L-rhamnose catabolism RHA1, LRA2, LRA3 and LRA4 but not LADH are clustered. We find that located next to the cluster is a transcription...
Association of paraoxonase gene cluster polymorphisms with ALS in France, Quebec, and Sweden.

PubMed

Valdmanis, P N; Kabashi, E; Dyck, A; Hince, P; Lee, J; Dion, P; D'Amour, M; Souchon, F; Bouchard, J-P; Salachas, F; Meininger, V; Andersen, P M; Camu, W; Dupré, N; Rouleau, G A

2008-08-12

The paraoxonase gene cluster on chromosome 7 comprising the PON1-3 genes is an attractive candidate for association in amyotrophic lateral sclerosis (ALS) given the role of paraoxonase genes during the response to oxidative stress and their contribution to the enzymatic breakdown of nerve toxins. Oxidative stress is considered one of the mechanisms involved in ALS pathogenesis. Evidence for this includes the fact that mutations of SOD1, which normally reduces the production of toxic superoxide anion, account for 12% to 23% of familial cases in ALS. In addition, PON variants were shown to be associated with susceptibility to ALS in several North American and European populations. We extended this analysis to examine 20 single nucleotide polymorphisms (SNPs) across the PON gene cluster in a set of patients from France (480 cases, 475 controls), Quebec (159 cases, 95 controls), and Sweden (558 cases, 506 controls). Although individual SNPs were not considered associated on their own, a haplotype of SNPs at the C-terminal portion of PON2 that includes the PON2 C311S amino acid change was significant in the French (p value 0.0075) and Quebec (p value 0.026) populations as well as all three populations combined (p value 1.69 × 10⁻⁶). Stratification of the samples showed that this variation was pertinent to ALS susceptibility as a whole, and not to a particular subset of patients. These findings contribute to the increasing weight of evidence that genetic variants in the paraoxonase gene cluster are associated with amyotrophic lateral sclerosis.
Average correlation clustering algorithm (ACCA) for grouping of co-regulated genes with similar pattern of variation in their expression values.

PubMed

Bhattacharya, Anindya; De, Rajat K

2010-08-01

Distance-based clustering algorithms can group genes that show similar expression values under multiple experimental conditions. They are unable to identify a group of genes that have a similar pattern of variation in their expression values. Previously we developed an algorithm called divisive correlation clustering algorithm (DCCA) to tackle this situation, which is based on the concept of correlation clustering. But this algorithm may also fail in certain cases. In order to overcome these situations, we propose a new clustering algorithm, called average correlation clustering algorithm (ACCA), which is able to produce a better clustering solution than some other algorithms. ACCA is able to find groups of genes having more common transcription factors and a similar pattern of variation in their expression values. Moreover, ACCA is more efficient than DCCA with respect to the time of execution. Like DCCA, ACCA uses the concept of correlation clustering introduced by Bansal et al. ACCA uses the correlation matrix in such a way that all genes in a cluster have the highest average correlation values with the genes in that cluster. We have applied ACCA and some well-known conventional methods including DCCA to two artificial and nine gene expression datasets, and compared the performance of the algorithms. The clustering results of ACCA are found to be more significantly relevant to the biological annotations than those of the other methods. Analysis of the results shows the superiority of ACCA over some others in determining a group of genes having more common transcription factors and with a similar pattern of variation in their expression profiles. Availability of the software: The software has been developed using C and Visual Basic languages, and can be executed on the Microsoft Windows platforms. The software may be downloaded as a zip file from http://www.isical.ac.in/~rajat. Then it needs to be installed. Two word files (included in the zip file) need to
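The core idea stated in this abstract, that every gene should sit in the cluster with which it has the highest average correlation, can be illustrated with a short sketch. This is not the authors' ACCA implementation (which is distributed in C and Visual Basic); it is a simplified reassignment loop written under stated assumptions: Pearson correlation from NumPy, a random balanced starting partition, and a hypothetical function name.

```python
import numpy as np


def average_correlation_clustering(expr, k, max_iter=100, seed=0):
    """Toy reassignment loop (illustrative, not the published ACCA).

    expr: array of shape (genes, conditions).
    Each gene is moved to the cluster whose other members it correlates
    with most strongly on average, until no gene moves.
    """
    rng = np.random.default_rng(seed)
    corr = np.corrcoef(expr)          # gene-by-gene Pearson correlation
    n = corr.shape[0]
    labels = np.arange(n) % k         # balanced start (an assumption)
    rng.shuffle(labels)
    for _ in range(max_iter):
        changed = False
        for g in range(n):
            best, best_avg = labels[g], -np.inf
            for c in range(k):
                members = np.flatnonzero(labels == c)
                members = members[members != g]   # exclude the gene itself
                if members.size == 0:
                    continue                      # skip empty clusters
                avg = corr[g, members].mean()
                if avg > best_avg:
                    best, best_avg = c, avg
            if best != labels[g]:
                labels[g] = best
                changed = True
        if not changed:
            break
    return labels


# Three "rising" and three "falling" profiles with very different magnitudes.
base = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
expr = np.vstack([base, 10 * base, base + 100, -base, -5 * base, 50 - base])
print(average_correlation_clustering(expr, k=2))
```

The toy profiles differ greatly in magnitude but share a pattern of variation within each group, which is the situation the abstract says distance-based methods handle poorly. Correlation ignores the scale and offset, so the rising and falling genes end up in separate clusters.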
The Lineage-Specific Evolution of Aquaporin Gene Clusters Facilitated Tetrapod Terrestrial Adaptation

PubMed Central

Finn, Roderick Nigel; Chauvigné, François; Hlidberg, Jón Baldur; Cutler, Christopher P.; Cerdà, Joan

2014-01-01

A major physiological barrier for aquatic organisms adapting to terrestrial life is desiccation in the aerial environment. This barrier was nevertheless overcome by the Devonian ancestors of extant Tetrapoda, but the origin of specific molecular mechanisms that solved this water problem remains largely unknown. Here we show that an ancient aquaporin gene cluster evolved specifically in the sarcopterygian lineage, and subsequently diverged into paralogous forms of AQP2, -5, or -6 to mediate water conservation in extant Tetrapoda. To determine the origin of these apomorphic genomic traits, we combined aquaporin sequencing from jawless and jawed vertebrates with broad taxon assembly of >2,000 transcripts amongst 131 deuterostome genomes and developed a model based upon Bayesian inference that traces their convergent roots to stem subfamilies in basal Metazoa and Prokaryota. This approach uncovered an unexpected diversity of aquaporins in every lineage investigated, and revealed that the vertebrate superfamily consists of 17 classes of aquaporins (Aqp0 - Aqp16). The oldest orthologs associated with water conservation in modern Tetrapoda are traced to a cluster of three aqp2-like genes in Actinistia that likely arose >500 Ma through duplication of an aqp0-like gene present in a jawless ancestor. In sea lamprey, we show that aqp0 first arose in a protocluster comprised of a novel aqp14 paralog and a fused aqp01 gene. To corroborate these findings, we conducted phylogenetic analyses of five syntenic nuclear receptor subfamilies, which, together with observations of extensive genome rearrangements, support the coincident loss of ancestral aqp2-like orthologs in Actinopterygii. We thus conclude that the divergence of sarcopterygian-specific aquaporin gene clusters was permissive for the evolution of water conservation mechanisms that facilitated tetrapod terrestrial adaptation. PMID:25426855
Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C

PubMed Central

Crnovčić, Ivana; Rückert, Christian; Semsary, Siamak; Lang, Manuel; Kalinowski, Jörn; Keller, Ullrich

2017-01-01

Sequencing the actinomycin (acm) biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X), revealed 20 genes organized into a highly similar framework as in the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached additional extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues including the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G+C content different from that of their counterparts in the principal arm. Moreover, the similarities of translation products from the extra arm were all higher to the corresponding translation products of orthologue genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from previous fusion between two one-armed acm gene clusters each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore are non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm biosynthetic gene clusters lack a kynurenine-3-monooxygenase gene necessary for biosynthesis of 3-hydroxy-4-methylanthranilic acid, the building block of the Acm chromophore, which suggests participation of a genome-encoded relevant monooxygenase during Acm biosynthesis in both S. chrysomallus and S
Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C.

PubMed

Crnovčić, Ivana; Rückert, Christian; Semsary, Siamak; Lang, Manuel; Kalinowski, Jörn; Keller, Ullrich

2017-01-01

Sequencing the actinomycin (acm) biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X), revealed 20 genes organized into a highly similar framework as in the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached additional extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues including the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G+C content different from that of their counterparts in the principal arm. Moreover, the similarities of translation products from the extra arm were all higher to the corresponding translation products of orthologue genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from previous fusion between two one-armed acm gene clusters each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore are non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm biosynthetic gene clusters lack a kynurenine-3-monooxygenase gene necessary for biosynthesis of 3-hydroxy-4-methylanthranilic acid, the building block of the Acm chromophore, which suggests participation of a genome-encoded relevant monooxygenase during Acm biosynthesis in both S. chrysomallus and S
Cloning, expression, and characterization of a novel (S)-specific alcohol dehydrogenase from Lactobacillus kefir.

PubMed

Chen, Qilei; Hu, Youjia; Zhao, Wenjie; Zhu, Chunbao; Zhu, Baoquan

2010-01-01

A gene encoding a novel (S)-specific NADH-dependent alcohol dehydrogenase (LK-ADH) was isolated from the genomic DNA of Lactobacillus kefir DSM 20587 by thermal asymmetric interlaced-polymerase chain reaction. The nucleotide sequence of (S)-LK-ADH gene (adhS) was determined, which consists of an open reading frame of 1,044 bp, coding for 347 amino acids with a molecular mass of 37.065 kDa. After a BLAST similarity search in GenBank database, the amino acid sequence of (S)-LK-ADH showed some homologies to several zinc containing medium-chain alcohol dehydrogenases. This novel gene was deposited into GenBank with the accession number of EU877965. adhS gene was subcloned into plasmid pET-28a(+), and recombinant (S)-LK-ADH was successfully expressed in E. coli BL21(DE3) by isopropyl-beta-D-1-thiogalactopyranoside induction. Purified enzyme showed a high enantioselectivity in the reduction of acetophenone to (S)-phenylethanol with an ee value of 99.4%. The substrate specificity and cofactor preference of recombinant (S)-LK-ADH were also tested.
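The ORF length, residue count, and molecular mass quoted in this abstract are mutually consistent, which the following sketch checks. It assumes, as the abstract does not state explicitly, that the 1,044 bp figure includes the stop codon; the function names are hypothetical.

```python
def orf_to_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp base pairs.

    Assumes the ORF is in frame (length divisible by 3). If the quoted
    length includes the stop codon, that codon encodes no residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


def mean_residue_mass(protein_da, n_residues):
    """Average mass per residue in daltons."""
    return protein_da / n_residues


# Figures quoted in the abstract: 1,044 bp ORF, 347 residues, 37.065 kDa
residues = orf_to_protein_length(1044)
print(residues)
print(round(mean_residue_mass(37065, residues), 1))
```

An average residue mass of a little over 100 Da is within the usual range for proteins, so the three reported figures agree with one another under the stop-codon assumption.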
CYP76M7 Is an ent-Cassadiene C11α-Hydroxylase Defining a Second Multifunctional Diterpenoid Biosynthetic Gene Cluster in Rice

PubMed Central

Swaminathan, Sivakumar; Morrone, Dana; Wang, Qiang; Fulton, D. Bruce; Peters, Reuben J.

2009-01-01

Biosynthetic gene clusters are common in microbial organisms, but rare in plants, raising questions regarding the evolutionary forces that drive their assembly in multicellular eukaryotes. Here, we characterize the biochemical function of a rice (Oryza sativa) cytochrome P450 monooxygenase, CYP76M7, which seems to act in the production of antifungal phytocassanes and defines a second diterpenoid biosynthetic gene cluster in rice. This cluster is uniquely multifunctional, containing enzymatic genes involved in the production of two distinct sets of phytoalexins, the antifungal phytocassanes and antibacterial oryzalides/oryzadiones, with the corresponding genes being subject to distinct transcriptional regulation. The lack of uniform coregulation of the genes within this multifunctional cluster suggests that this was not a primary driving force in its assembly. However, the cluster is dedicated to specialized metabolism, as all genes in the cluster are involved in phytoalexin metabolism. We hypothesize that this dedication to specialized metabolism led to the assembly of the corresponding biosynthetic gene cluster. Consistent with this hypothesis, molecular phylogenetic comparison demonstrates that the two rice diterpenoid biosynthetic gene clusters have undergone independent elaboration to their present-day forms, indicating continued evolutionary pressure for coclustering of enzymatic genes encoding components of related biosynthetic pathways. PMID:19825834

Identifying driving gene clusters in complex diseases through critical transition theory

NASA Astrophysics Data System (ADS)

Wolanyk, Nathaniel; Wang, Xujing; Hessner, Martin; Gao, Shouguo; Chen, Ye; Jia, Shuang

A novel approach of looking at the human body using critical transition theory has yielded positive results: clusters of genes that act in tandem to drive complex disease progression. This cluster of genes can be thought of as the first part of a large genetic force that pushes the body from a curable, but sick, point to an incurable diseased point through a catastrophic bifurcation. The data analyzed are time course microarray blood assay data of 7 high-risk individuals for Type 1 Diabetes who progressed into a clinical onset, with an additional larger study requested to be presented at the conference. The normalized data are 25,000 genes strong, which were narrowed down based on statistical metrics, and finally a machine learning algorithm using critical transition metrics found the driving network. This approach was created to be repeatable across multiple complex diseases with only progression time course data needed so that it would be applicable to identifying when an individual is at risk of developing a complex disease. Thus, preventative measures can be enacted, and in the longer term the approach offers a possible solution to prevent all Type 1 Diabetes.
Comprehensive cluster analysis with Transitivity Clustering.

PubMed

Wittkop, Tobias; Emig, Dorothea; Truss, Anke; Albrecht, Mario; Böcker, Sebastian; Baumbach, Jan

2011-03-01

Transitivity Clustering is a method for the partitioning of biological data into groups of similar objects, such as genes. It provides integrated access to various functions addressing each step of a typical cluster analysis. To facilitate this, Transitivity Clustering is accessible online and offers three user-friendly interfaces: a powerful stand-alone version, a web interface, and a collection of Cytoscape plug-ins. In this paper, we describe three major workflows: (i) protein (super)family detection with Cytoscape, (ii) protein homology detection with incomplete gold standards and (iii) clustering of gene expression data. This protocol guides the user through the most important features of Transitivity Clustering and takes ∼1 h to complete.
Transcriptional interference networks coordinate the expression of functionally related genes clustered in the same genomic loci

PubMed Central

Boldogköi, Zsolt

2012-01-01

The regulation of gene expression is essential for normal functioning of biological systems in every form of life. Gene expression is primarily controlled at the level of transcription, especially at the phase of initiation. Non-coding RNAs are one of the major players at every level of genetic regulation, including the control of chromatin organization, transcription, various post-transcriptional processes, and translation. In this study, the Transcriptional Interference Network (TIN) hypothesis was put forward in an attempt to explain the global expression of antisense RNAs and the overall occurrence of tandem gene clusters in the genomes of various biological systems ranging from viruses to mammalian cells. The TIN hypothesis suggests the existence of a novel layer of genetic regulation, based on the interactions between the transcriptional machineries of neighboring genes at their overlapping regions, which are assumed to play a fundamental role in coordinating gene expression within a cluster of functionally linked genes. It is claimed that the transcriptional overlaps between adjacent genes are much more widespread in genomes than is thought today. The Waterfall model of the TIN hypothesis postulates a unidirectional effect of upstream genes on the transcription of downstream genes within a cluster of tandemly arrayed genes, while the Seesaw model proposes a mutual interdependence of gene expression between the oppositely oriented genes. The TIN represents an auto-regulatory system with an exquisitely timed and highly synchronized cascade of gene expression in functionally linked genes located in close physical proximity to each other. In this study, we focused on herpesviruses. The reason for this lies in the compressed nature of viral genes, which allows a tight regulation and an easier investigation of the transcriptional interactions between genes. However, I believe that the same or similar principles can be applied to cellular organisms too. PMID
Transcriptional interference networks coordinate the expression of functionally related genes clustered in the same genomic loci.

PubMed

Boldogköi, Zsolt

2012-01-01

The regulation of gene expression is essential for normal functioning of biological systems in every form of life. Gene expression is primarily controlled at the level of transcription, especially at the phase of initiation. Non-coding RNAs are one of the major players at every level of genetic regulation, including the control of chromatin organization, transcription, various post-transcriptional processes, and translation. In this study, the Transcriptional Interference Network (TIN) hypothesis was put forward in an attempt to explain the global expression of antisense RNAs and the overall occurrence of tandem gene clusters in the genomes of various biological systems ranging from viruses to mammalian cells. The TIN hypothesis suggests the existence of a novel layer of genetic regulation, based on the interactions between the transcriptional machineries of neighboring genes at their overlapping regions, which are assumed to play a fundamental role in coordinating gene expression within a cluster of functionally linked genes. It is claimed that the transcriptional overlaps between adjacent genes are much more widespread in genomes than is thought today. The Waterfall model of the TIN hypothesis postulates a unidirectional effect of upstream genes on the transcription of downstream genes within a cluster of tandemly arrayed genes, while the Seesaw model proposes a mutual interdependence of gene expression between the oppositely oriented genes. The TIN represents an auto-regulatory system with an exquisitely timed and highly synchronized cascade of gene expression in functionally linked genes located in close physical proximity to each other. In this study, we focused on herpesviruses. The reason for this lies in the compressed nature of viral genes, which allows a tight regulation and an easier investigation of the transcriptional interactions between genes. However, I believe that the same or similar principles can be applied to cellular organisms too.
Cloning and Characterization of the Pyrrolomycin Biosynthetic Gene Clusters from Actinosporangium vitaminophilum ATCC 31673 and Streptomyces sp. Strain UC 11065

PubMed Central

Zhang, Xiujun; Parry, Ronald J.

2007-01-01

The pyrrolomycins are a family of polyketide antibiotics, some of which contain a nitro group. To gain insight into the nitration mechanism associated with the formation of these antibiotics, the pyrrolomycin biosynthetic gene cluster from Actinosporangium vitaminophilum was cloned. Sequencing of ca. 56 kb of A. vitaminophilum DNA revealed 35 open reading frames (ORFs). Sequence analysis revealed a clear relationship between some of these ORFs and the biosynthetic gene cluster for pyoluteorin, a structurally related antibiotic. Since a gene transfer system could not be devised for A. vitaminophilum, additional proof for the identity of the cloned gene cluster was sought by cloning the pyrrolomycin gene cluster from Streptomyces sp. strain UC 11065, a transformable pyrrolomycin producer. Sequencing of ca. 26 kb of UC 11065 DNA revealed the presence of 17 ORFs, 15 of which exhibit strong similarity to ORFs in the A. vitaminophilum cluster as well as a nearly identical organization. Single-crossover disruption of two genes in the UC 11065 cluster abolished pyrrolomycin production in both cases. These results confirm that the genetic locus cloned from UC 11065 is essential for pyrrolomycin production, and they also confirm that the highly similar locus in A. vitaminophilum encodes pyrrolomycin biosynthetic genes. Sequence analysis revealed that both clusters contain genes encoding the two components of an assimilatory nitrate reductase. This finding suggests that nitrite is required for the formation of the nitrated pyrrolomycins. However, sequence analysis did not provide additional insights into the nitration process, suggesting the operation of a novel nitration mechanism. PMID:17158935
The role of chalcones: helichrysetin, xanthohumol, and flavokawin-C in promoting neurite outgrowth in PC12 Adh cells.

PubMed

Phan, Chia-Wei; Sabaratnam, Vikineswary; Yong, Wai-Kuan; Abd Malek, Sri Nurestri

2018-05-01

Chalcones are a group of compounds widely distributed in the plant kingdom. The aim of this study was to assess the neurite outgrowth stimulatory activity of selected chalcones, namely helichrysetin, xanthohumol and flavokawin-C. Using adherent rat pheochromocytoma (PC12 Adh) cells, the chalcones were subjected to a neurite outgrowth assay and the extracellular nerve growth factor (NGF) levels were determined. Xanthohumol (10 μg/mL) produced the highest (p < 0.05) percentage of neurite-bearing PC12 Adh cells and the highest (p < 0.05) NGF level in the culture medium. While helichrysetin induced a moderately high number of neurite-bearing cells, flavokawin-C did not stimulate neurite outgrowth. This work supports the use of xanthohumol as a potential neuroactive compound to stimulate neurite outgrowth.
Resolving misassembled cattle immune gene clusters with hierarchical, long read sequencing

USDA-ARS's Scientific Manuscript database

Animal health is a critical component of productivity; however, current genomic selection genotyping tools have a paucity of genetic markers within key immune gene clusters (IGC) involved in the cattle innate and adaptive immune systems. With diseases such as Bovine Tuberculosis and Johne’s disease ...
A physical map of the human regulator of complement activation gene cluster linking the complement genes CR1, CR2, DAF, and C4BP

PubMed Central

1988-01-01

We report the organization of the human genes encoding the complement components C4-binding protein (C4BP), C3b/C4b receptor (CR1), decay accelerating factor (DAF), and C3dg receptor (CR2) within the regulator of complement activation (RCA) gene cluster. Using pulsed field gel electrophoresis analysis, these genes have been physically linked and aligned as CR1-CR2-DAF-C4BP in an 800-kb DNA segment. The very tight linkage between the CR1 and the C4BP loci, contrasted with the relatively long DNA distance between these genes, suggests the existence of mechanisms interfering with recombination within the RCA gene cluster. PMID:2450163
Arabidopsis gene expression patterns are altered during spaceflight

NASA Astrophysics Data System (ADS)

Paul, Anna-Lisa; Popp, Michael P.; Gurley, William B.; Guy, Charles; Norwood, Kelly L.; Ferl, Robert J.

The exposure of Arabidopsis thaliana (Arabidopsis) plants to spaceflight environments results in differential gene expression. A 5-day mission on orbiter Columbia in 1999 (STS-93) carried transgenic Arabidopsis plants engineered with a transgene composed of the alcohol dehydrogenase (Adh) gene promoter linked to the β-Glucuronidase (GUS) reporter gene. The plants were used to evaluate the effects of spaceflight on gene expression patterns initially by using the Adh/GUS transgene to address specifically the possibility that spaceflight induces a hypoxic stress response (Paul, A.L., Daugherty, C.J., Bihn, E.A., Chapman, D.K., Norwood, K.L., Ferl, R.J., 2001. Transgene expression patterns indicate that spaceflight affects stress signal perception and transduction in arabidopsis, Plant Physiol. 126, 613-621). As a follow-on to the reporter gene analysis, we report here the evaluation of genome-wide patterns of native gene expression within Arabidopsis shoots utilizing the Agilent DNA array of 21,000 Arabidopsis genes. As a control for the veracity of the array analyses, a selection of genes was further characterized with quantitative Real-Time RT PCR (ABI - Taqman®). Comparison of the patterns of expression for arrays probed with RNA isolated from plants exposed to spaceflight compared to RNA isolated from ground control plants revealed 182 genes that were differentially expressed in response to the spaceflight mission by more than 4-fold, and of those only 50 genes were expressed at levels chosen to support a conservative change call. None of the genes that are hallmarks of hypoxic stress were induced to this level. However, genes related to heat shock were dramatically induced - but in a pattern and under growth conditions that are not easily explained by elevated temperatures. These gene expression data are discussed in light of current models for plant responses to the spaceflight environment and with regard to potential future spaceflight experiment
DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

PubMed

El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

2007-01-01

We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.
Expression of a DNA Replication Gene Cluster in Bacteriophage T4: Genetic Linkage and the Control of Gene Product Interactions

PubMed Central

Gerald, W. L.; Karam, J. D.

1984-01-01

The results of this study bear on the relationship between genetic linkage and control of interactions between the protein products of different cistrons. In T4 bacteriophage, genes 45 and 44 encode essential components of the phage DNA replication multiprotein complex. T4 gene 45 maps directly upstream of gene 44 relative to the overall direction of reading of this region of the phage chromosome, but it is not known whether these two genes are cotranscribed. It has been shown that a nonsense lesion of T4 gene 45 exerts a cis-dominant inhibitory effect on growth of a missense mutant of gene 44 but not on growth of phage carrying the wild-type gene 44 allele. In previous work, we confirmed these observations on polarity of the gene 45 mutation but detected no polar effects by this lesion on synthesis of either mutant or wild-type gene 44 protein. In the present study, we demonstrate that mRNA for gene 44 protein is separable by gel electrophoresis from gene 45-protein-encoding mRNA. That is, the two proteins are not synthesized from one polycistronic message, and the cis-dominant inhibitory effect of the gene 45 mutation on gene 44 function is probably expressed at a posttranslational stage. We propose that close genetic linkage, whether or not it provides shared transcriptional and translational regulatory signals for certain clusters of functionally related cistrons, may determine the intracellular compartmentalization for synthesis of proteins encoded by these clusters. In prokaryotes, such linkage-dependent compartmentation may minimize the diffusion distances between gene products that are synthesized at low levels and are destined to interact. PMID:6745641
A Genetic System for Clostridium ljungdahlii: a Chassis for Autotrophic Production of Biocommodities and a Model Homoacetogen

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leang, C; Ueki, T; Nevin, KP

Methods for genetic manipulation of Clostridium ljungdahlii are of interest because of the potential for production of fuels and other biocommodities from carbon dioxide via microbial electrosynthesis or more traditional modes of autotrophy with hydrogen or carbon monoxide as the electron donor. Furthermore, acetogenesis plays an important role in the global carbon cycle. Gene deletion strategies required for physiological studies of C. ljungdahlii have not previously been demonstrated. An electroporation procedure for introducing plasmids was optimized, and four different replicative origins for plasmid propagation in C. ljungdahlii were identified. Chromosomal gene deletion via double-crossover homologous recombination with a suicide vector was demonstrated initially with deletion of the gene for FliA, a putative sigma factor involved in flagellar biogenesis and motility in C. ljungdahlii. Deletion of fliA yielded a strain that lacked flagella and was not motile. To evaluate the potential utility of gene deletions for functional genomic studies and to redirect carbon and electron flow, the genes for the putative bifunctional aldehyde/alcohol dehydrogenases, adhE1 and adhE2, were deleted individually or together. Deletion of adhE1, but not adhE2, diminished ethanol production with a corresponding carbon recovery in acetate. The double deletion mutant had a phenotype similar to that of the adhE1-deficient strain. Expression of adhE1 in trans partially restored the capacity for ethanol production. These results demonstrate the feasibility of genetic investigations of acetogen physiology and the potential for genetic manipulation of C. ljungdahlii to optimize autotrophic biocommodity production.
Structure and gene cluster of the O-antigen of Escherichia coli O54.

PubMed

Naumenko, Olesya I; Guo, Xi; Senchenkova, Sof'ya N; Geng, Peng; Perepelov, Andrei V; Shashkov, Alexander S; Liu, Bin; Knirel, Yuriy A

2018-06-15

Mild acid hydrolysis of the lipopolysaccharide of Escherichia coli O54 afforded an O-polysaccharide, which was studied by sugar analysis, solvolysis with anhydrous trifluoroacetic acid, and 1H and 13C NMR spectroscopy. Solvolysis cleaved predominantly the linkage of β-d-Ribf and, to a lesser extent, that of β-d-GlcpNAc, whereas the other linkages, including the linkage of α-l-Rhap, were stable under selected conditions (40 °C, 5 h). The following structure of the O-polysaccharide was established: →4)-α-d-GalpA-(1→2)-α-l-Rhap-(1→2)-β-d-Ribf-(1→4)-β-d-Galp-(1→3)-β-d-GlcpNAc-(1→. The O-antigen gene cluster of E. coli O54 was analyzed and found to be consistent in general with the O-polysaccharide structure established, but there were two exceptions: i) in the cluster, there were genes for phosphoserine phosphatase and serine transferase, which have no apparent role in the O-polysaccharide synthesis, and ii) no ribofuranosyltransferase gene was present in the cluster. Both uncommon features are shared by some other enteric bacteria. Copyright © 2018 Elsevier Ltd. All rights reserved.
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

PubMed

Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

2017-04-12

Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs); (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.
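The refinement idea in the abstract above (start from an existing module partition, then iterate k-means-style reassignment) can be sketched as follows. This is a minimal illustration and not the km2gcn implementation: the function names are hypothetical, the centroid is taken as the plain mean expression profile of each module (an assumption standing in for a module eigengene), and similarity is Pearson correlation.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def centroid(profiles):
    """Mean profile of a module, computed sample by sample (assumed centroid)."""
    n = len(profiles)
    return [sum(column) / n for column in zip(*profiles)]

def refine_modules(expr, modules, max_iter=50):
    """Reassign each gene to the module whose centroid it correlates with best.

    expr:    dict mapping gene -> list of expression values (one per sample)
    modules: dict mapping gene -> initial module label (e.g. from a prior clustering)
    Iterates until the partition stops changing or max_iter is reached.
    """
    modules = dict(modules)
    for _ in range(max_iter):
        labels = sorted(set(modules.values()))
        centroids = {
            m: centroid([expr[g] for g in expr if modules[g] == m]) for m in labels
        }
        updated = {
            g: max(labels, key=lambda m: pearson(expr[g], centroids[m])) for g in expr
        }
        if updated == modules:
            break
        modules = updated
    return modules

# Toy data: g5 rises across samples like module A but starts out in module B.
expr = {
    "g1": [1, 2, 3, 4], "g2": [2, 4, 6, 8.5],
    "g3": [4, 3, 2, 1], "g4": [8, 6, 4, 2.5],
    "g5": [1, 2, 3, 5],
}
start = {"g1": "A", "g2": "A", "g3": "B", "g4": "B", "g5": "B"}
print(refine_modules(expr, start))
```

Seeding the iteration with an existing partition, rather than random centroids, is what makes this a post-processing step: genes that already sit closest to their own module centroid are left alone, and only misplaced genes move.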
Alcoholism and alcohol drinking habits predicted from alcohol dehydrogenase genes.

PubMed

Tolstrup, Janne Schurmann; Nordestgaard, Børge Grønne; Rasmussen, Søren; Tybjaerg-Hansen, Anne; Grønbaek, Morten

2008-06-01

Alcohol drinking habits and alcoholism are partly genetically determined. Alcohol is degraded primarily by alcohol dehydrogenase (ADH) wherein genetic variation that affects the rate of alcohol degradation is found in ADH1B and ADH1C. It is biologically plausible that these variations may be associated with alcohol drinking habits and alcoholism. By genotyping 9080 white men and women from the general population, we found that men and women with ADH1B slow vs fast alcohol degradation drank more alcohol and had a higher risk of everyday drinking, heavy drinking, excessive drinking and of alcoholism. For example, the weekly alcohol intake was 9.8 drinks (95% confidence interval (CI): 9.1-11) among men with the ADH1B.1/1 genotype compared to 7.5 drinks (95% CI: 6.4-8.7) among men with the ADH1B.1/2 genotype, and the odds ratio (OR) for heavy drinking was 3.1 (95% CI: 1.7-5.7) among men with the ADH1B.1/1 genotype compared to men with the ADH1B.1/2 genotype. Furthermore, individuals with ADH1C slow vs fast alcohol degradation had a higher risk of heavy and excessive drinking. For example, the OR for heavy drinking was 1.4 (95% CI: 1.1-1.8) among men with the ADH1C.1/2 genotype and 1.4 (95% CI: 1.0-1.9) among men with the ADH1C.2/2 genotype, compared with men with the ADH1C.1/1 genotype. Results for ADH1B and ADH1C genotypes among men and women were similar. Finally, because slow ADH1B alcohol degradation is found in more than 90% of the white population compared to less than 10% of East Asians, the population attributable risk of heavy drinking and alcoholism by ADH1B.1/1 genotype was 67 and 62% among the white population compared with 9 and 24% among the East Asian population.
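The closing argument of the abstract above, that the same genotype effect yields a very different population attributable risk depending on how common the genotype is, can be illustrated with Levin's formula, PAR = p(RR - 1) / (1 + p(RR - 1)), where p is the exposure prevalence and RR the relative risk. The abstract does not state which estimator the authors used, so this is only a sketch under that assumption; the function name and the input values are hypothetical and are not the study's data. An odds ratio approximates RR only when the outcome is rare.

```python
def population_attributable_risk(prevalence, relative_risk):
    """Levin's formula: fraction of cases in the population attributable to exposure.

    prevalence:    proportion of the population carrying the risk genotype (0..1)
    relative_risk: risk in carriers divided by risk in non-carriers
    """
    excess = prevalence * (relative_risk - 1.0)
    return excess / (1.0 + excess)

# Illustrative values only: identical relative risk, very different prevalence.
for label, prevalence in [("common genotype", 0.9), ("rare genotype", 0.1)]:
    par = population_attributable_risk(prevalence, relative_risk=3.0)
    print(f"{label}: prevalence={prevalence:.0%}, PAR={par:.1%}")
```

With the relative risk held fixed, the attributable fraction rises steeply with prevalence, which is the qualitative pattern the abstract reports for the two populations.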
SMCHD1 regulates a limited set of gene clusters on autosomal chromosomes.

PubMed

Mason, Amanda G; Slieker, Roderick C; Balog, Judit; Lemmers, Richard J L F; Wong, Chao-Jen; Yao, Zizhen; Lim, Jong-Won; Filippova, Galina N; Ne, Enrico; Tawil, Rabi; Heijmans, Bas T; Tapscott, Stephen J; van der Maarel, Silvère M

2017-06-06

Facioscapulohumeral muscular dystrophy (FSHD) is in most cases caused by a contraction of the D4Z4 macrosatellite repeat on chromosome 4 (FSHD1) or by mutations in the SMCHD1 or DNMT3B gene (FSHD2). Both situations result in the incomplete epigenetic repression of the D4Z4-encoded retrogene DUX4 in somatic cells, leading to the aberrant expression of DUX4 in the skeletal muscle. In mice, Smchd1 regulates chromatin repression at different loci, having a role in CpG methylation establishment and/or maintenance. To investigate the global effects of harboring heterozygous SMCHD1 mutations on DNA methylation in humans, we combined 450k methylation analysis on mononuclear monocytes from female heterozygous SMCHD1 mutation carriers and unaffected controls with reduced representation bisulfite sequencing (RRBS) on FSHD2 and control myoblast cell lines. Candidate loci were then evaluated for SMCHD1 binding using ChIP-qPCR and expression was evaluated using RT-qPCR. We identified a limited number of clustered autosomal loci with CpG hypomethylation in SMCHD1 mutation carriers: the protocadherin (PCDH) cluster on chromosome 5, the transfer RNA (tRNA) and 5S rRNA clusters on chromosome 1, the HOXB and HOXD clusters on chromosomes 17 and 2, respectively, and the D4Z4 repeats on chromosomes 4 and 10. Furthermore, minor increases in RNA expression were seen in FSHD2 myoblasts for some of the PCDHβ cluster isoforms, tRNA isoforms, and a HOXB isoform in comparison to controls, in addition to the previously reported effects on DUX4 expression. SMCHD1 was bound at DNAseI hypersensitivity sites known to regulate the PCDHβ cluster and at the chromosome 1 tRNA cluster, with decreased binding in SMCHD1 mutation carriers at the PCDHβ cluster sites. Our study is the first to investigate the global methylation effects in humans resulting from heterozygous mutations in SMCHD1. Our results suggest that SMCHD1 acts as a repressor on a limited set of autosomal gene clusters, as an observed
Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA

NASA Technical Reports Server (NTRS)

Siefert, J. L.; Martin, K. A.; Abdi, F.; Widger, W. R.; Fox, G. E.

1997-01-01

Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome.
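The kind of comparison described in the abstract above, looking for genes that stay adjacent across several genomes, can be sketched as a set intersection over neighbouring gene pairs. This is a simplified illustration and not the authors' procedure: it assumes genes have already been mapped to shared ortholog labels, treats each genome as a linear list, ignores strand, and only reports conserved pairs. The function name and the labels are hypothetical.

```python
def conserved_adjacent_pairs(genomes):
    """Return the unordered adjacent gene pairs present in every genome.

    genomes: list of gene-order lists that use shared ortholog labels.
    A pair counts as conserved if the two genes are neighbours in all genomes,
    regardless of which one comes first.
    """
    def adjacent_pairs(order):
        return {frozenset(pair) for pair in zip(order, order[1:])}

    common = adjacent_pairs(genomes[0])
    for order in genomes[1:]:
        common &= adjacent_pairs(order)
    return common

# Toy gene orders with made-up labels.
genomes = [
    ["a", "b", "c", "d", "e"],
    ["c", "b", "a", "e", "d"],
    ["d", "e", "x", "a", "b"],
]
for pair in sorted(sorted(p) for p in conserved_adjacent_pairs(genomes)):
    print(pair)
```

Longer conserved clusters could be recovered by chaining pairs that share a gene, at the cost of handling rearrangements inside the run.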
Characterization of the fumonisin B2 biosynthetic gene cluster in Aspergillus niger and A. awamori.

USDA-ARS's Scientific Manuscript database

Aspergillus niger and A. awamori strains isolated from grapes cultivated in Mediterranean basin were examined for fumonisin B2 (FB2) production and presence/absence of sequences within the fumonisin biosynthetic gene (fum) cluster. Presence of 13 regions in the fum cluster was evaluated by PCR assay...
Degradation of Benzene by Pseudomonas veronii 1YdBTEX2 and 1YB2 Is Catalyzed by Enzymes Encoded in Distinct Catabolism Gene Clusters.

PubMed

de Lima-Morales, Daiana; Chaves-Moreno, Diego; Wos-Oxley, Melissa L; Jáuregui, Ruy; Vilchez-Vargas, Ramiro; Pieper, Dietmar H

2016-01-01

Pseudomonas veronii 1YdBTEX2, a benzene and toluene degrader, and Pseudomonas veronii 1YB2, a benzene degrader, have previously been shown to be key players in a benzene-contaminated site. These strains harbor unique catabolic pathways for the degradation of benzene comprising a gene cluster encoding an isopropylbenzene dioxygenase where genes encoding downstream enzymes were interrupted by stop codons. Extradiol dioxygenases were recruited from gene clusters comprising genes encoding a 2-hydroxymuconic semialdehyde dehydrogenase necessary for benzene degradation but typically absent from isopropylbenzene dioxygenase-encoding gene clusters. The benzene dihydrodiol dehydrogenase-encoding gene was not clustered with any other aromatic degradation genes, and the encoded protein was only distantly related to dehydrogenases of aromatic degradation pathways. The involvement of the different gene clusters in the degradation pathways was suggested by real-time quantitative reverse transcription PCR. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Two Different Secondary Metabolism Gene Clusters Occupied the Same Ancestral Locus in Fungal Dermatophytes of the Arthrodermataceae

PubMed Central

Zhang, Han; Rokas, Antonis; Slot, Jason C.

2012-01-01

Background: Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. Results: The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. Conclusions: We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity. PMID

Two different secondary metabolism gene clusters occupied the same ancestral locus in fungal dermatophytes of the arthrodermataceae.

PubMed

Zhang, Han; Rokas, Antonis; Slot, Jason C

2012-01-01

Dermatophyte fungi of the family Arthrodermataceae (Eurotiomycetes) colonize keratinized tissue, such as skin, frequently causing superficial mycoses in humans and other mammals, reptiles, and birds. Competition with native microflora likely underlies the propensity of these dermatophytes to produce a diversity of antibiotics and compounds for scavenging iron, which is extremely scarce, as well as the presence of an unusually large number of putative secondary metabolism gene clusters, most of which contain non-ribosomal peptide synthetases (NRPS), in their genomes. To better understand the historical origins and diversification of NRPS-containing gene clusters we examined the evolution of a variable locus (VL) that exists in one of three alternative conformations among the genomes of seven dermatophyte species. The first conformation of the VL (termed VLA) contains only 539 base pairs of sequence and lacks protein-coding genes, whereas the other two conformations (termed VLB and VLC) span 36 Kb and 27 Kb and contain 12 and 10 genes, respectively. Interestingly, both VLB and VLC appear to contain distinct secondary metabolism gene clusters; VLB contains a NRPS gene as well as four porphyrin metabolism genes never found to be physically linked in the genomes of 128 other fungal species, whereas VLC also contains a NRPS gene as well as several others typically found associated with secondary metabolism gene clusters. Phylogenetic evidence suggests that the VL locus was present in the ancestor of all seven species achieving its present distribution through subsequent differential losses or retentions of specific conformations. We propose that the existence of variable loci, similar to the one we studied, in fungal genomes could potentially explain the dramatic differences in secondary metabolic diversity between closely related species of filamentous fungi, and contribute to host adaptation and the generation of metabolic diversity.
Characterization of a Gene Cluster Involved in 4-Chlorocatechol Degradation by Pseudomonas reinekei MT1

PubMed Central

Cámara, Beatriz; Nikodem, Patricia; Bielecki, Piotr; Bobadilla, Roberto; Junca, Howard; Pieper, Dietmar H.

2009-01-01

Pseudomonas reinekei MT1 has previously been reported to degrade 4- and 5-chlorosalicylate by a pathway with 4-chlorocatechol, 3-chloromuconate, 4-chloromuconolactone, and maleylacetate as intermediates, and a gene cluster channeling various salicylates into an intradiol cleavage route has been reported. We now report that during growth on 5-chlorosalicylate, besides a novel (chloro)catechol 1,2-dioxygenase, C12OccaA, a novel (chloro)muconate cycloisomerase, MCIccaB, which showed features not yet reported, was induced. This cycloisomerase, which was practically inactive with muconate, evolved for the turnover of 3-substituted muconates and transforms 3-chloromuconate into equal amounts of cis-dienelactone and protoanemonin, suggesting that it is a functional intermediate between chloromuconate cycloisomerases and muconate cycloisomerases. The corresponding genes, ccaA (C12OccaA) and ccaB (MCIccaB), were located in a 5.1-kb genomic region clustered with genes encoding trans-dienelactone hydrolase (ccaC) and maleylacetate reductase (ccaD) and a putative regulatory gene, ccaR, homologous to regulators of the IclR-type family. Thus, this region includes genes sufficient to enable MT1 to transform 4-chlorocatechol to 3-oxoadipate. Phylogenetic analysis showed that C12OccaA and MCIccaB are only distantly related to previously described catechol 1,2-dioxygenases and muconate cycloisomerases. Kinetic analysis indicated that MCIccaB and the previously identified C12OsalD, rather than C12OccaA, are crucial for 5-chlorosalicylate degradation. Thus, MT1 uses enzymes encoded by a completely novel gene cluster for degradation of chlorosalicylates, which, together with a gene cluster encoding enzymes for channeling salicylates into the ortho-cleavage pathway, form an effective pathway for 4- and 5-chlorosalicylate mineralization. PMID:19465655
An improved Pearson's correlation proximity-based hierarchical clustering for mining biological association between genes.

PubMed

Booma, P M; Prabhakaran, S; Dhanalakshmi, R

2014-01-01

Microarray gene expression datasets have attracted great attention among molecular biologists, statisticians, and computer scientists. Data mining that extracts hidden and usual information from datasets fails to identify the most significant biological associations between genes. A heuristic search for a standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process is not efficiently addressed. To monitor a higher rate of expression levels between genes, a hierarchical clustering model was proposed, in which the biological association between genes is measured simultaneously using a proximity measure based on an improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify associations between the clusters. Moreover, GL-PCPHC applies a pattern-growing method to mine the PCPHC patterns. Compared to existing gene expression analysis methods, the PCPHC model achieves better performance. Experimental evaluations are conducted for the GL-PCPHC model with standard benchmark gene expression datasets extracted from the UCI repository and the GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.
An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

PubMed Central

Booma, P. M.; Prabhakaran, S.; Dhanalakshmi, R.

2014-01-01

Microarray gene expression datasets have attracted great attention among molecular biologists, statisticians, and computer scientists. Data mining that extracts hidden and usual information from datasets fails to identify the most significant biological associations between genes. A heuristic search for a standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process is not efficiently addressed. To monitor a higher rate of expression levels between genes, a hierarchical clustering model was proposed, in which the biological association between genes is measured simultaneously using a proximity measure based on an improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify associations between the clusters. Moreover, GL-PCPHC applies a pattern-growing method to mine the PCPHC patterns. Compared to existing gene expression analysis methods, the PCPHC model achieves better performance. Experimental evaluations are conducted for the GL-PCPHC model with standard benchmark gene expression datasets extracted from the UCI repository and the GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. PMID:25136661
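For orientation, the baseline that the abstract above builds on (a Pearson-correlation proximity measure combined with average-linkage hierarchical clustering) can be sketched in a few lines. This shows only the standard technique; it does not reproduce the paper's improved correlation measure, the Seed Augment algorithm, or the GL-PCPHC model, none of which are specified in the abstract. Function names and data are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def average_linkage(profiles, n_clusters):
    """Agglomerative clustering with distance 1 - r and average linkage.

    profiles:   dict mapping gene -> list of expression values
    n_clusters: number of clusters at which merging stops
    Returns a list of frozensets of gene names.
    """
    genes = list(profiles)
    dist = {
        (a, b): 1.0 - pearson(profiles[a], profiles[b]) for a in genes for b in genes
    }
    clusters = [frozenset([g]) for g in genes]

    def linkage(c1, c2):
        # Average of all pairwise distances between members of the two clusters.
        return sum(dist[a, b] for a in c1 for b in c2) / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        candidates = (
            (i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))
        )
        i, j = min(candidates, key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        merged = clusters[i] | clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters

# Toy profiles: two genes rising across samples, two falling.
profiles = {
    "g1": [1, 2, 3, 4], "g2": [2, 4, 6, 9],
    "g3": [4, 3, 2, 1], "g4": [9, 6, 4, 2],
}
print([sorted(c) for c in average_linkage(profiles, n_clusters=2)])
```

Using 1 - r as the distance groups genes by the shape of their expression profile rather than its magnitude, which is why g1 and g2 cluster together despite different scales.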
Identification of the first diphenyl ether gene cluster for pestheic acid biosynthesis in plant endophyte Pestalotiopsis fici.

PubMed

Xu, Xinxin; Liu, Ling; Zhang, Fan; Wang, Wenzhao; Li, Jinyang; Guo, Liangdong; Che, Yongsheng; Liu, Gang

2014-01-24

The diphenyl ether pestheic acid, which is proposed to be the biosynthetic precursor of the unique chloropupukeananes, was isolated from the endophytic fungus Pestalotiopsis fici. The pestheic acid biosynthetic gene (pta) cluster was identified in the fungus through genome scanning. Sequence analysis revealed that this gene cluster encodes a nonreducing polyketide synthase, a number of modification enzymes, and three regulators. Gene disruption and intermediate analysis demonstrated that the biosynthesis proceeded through formation of the polyketide backbone, cyclization of a polyketo acid to a benzophenone, chlorination, and formation of the diphenyl ether skeleton through oxidation and hydrolysis. A dihydrogeodin oxidase gene, ptaE, was essential for diphenyl ether formation, and ptaM encoded a flavin-dependent halogenase catalyzing chlorination in the biosynthesis. Identification of the pta cluster laid the foundation to decipher the genetic and biochemical mechanisms involved in the pathway. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genetic homogeneity of Clostridium botulinum type A1 strains with unique toxin gene clusters.

PubMed

Raphael, Brian H; Luquez, Carolina; McCroskey, Loretta M; Joseph, Lavin A; Jacobson, Mark J; Johnson, Eric A; Maslanka, Susan E; Andreadis, Joanne D

2008-07-01

A group of five clonally related Clostridium botulinum type A strains isolated from different sources over a period of nearly 40 years harbored several conserved genetic properties. These strains contained a variant bont/A1 with five nucleotide polymorphisms compared to the gene in C. botulinum strain ATCC 3502. The strains also had a common toxin gene cluster composition (ha-/orfX+) similar to that associated with bont/A in type A strains containing an unexpressed bont/B [termed A(B) strains]. However, bont/B was not identified in the strains examined. Comparative genomic hybridization demonstrated identical genomic content among the strains relative to C. botulinum strain ATCC 3502. In addition, microarray data demonstrated the absence of several genes flanking the toxin gene cluster among the ha-/orfX+ A1 strains, suggesting the presence of genomic rearrangements with respect to this region compared to the C. botulinum ATCC 3502 strain. All five strains were shown to have identical flaA variable region nucleotide sequences. The pulsed-field gel electrophoresis patterns of the strains were indistinguishable when digested with SmaI, and a shift in the size of at least one band was observed in a single strain when digested with XhoI. These results demonstrate surprising genomic homogeneity among a cluster of unique C. botulinum type A strains of diverse origin.
Teaching Gene Technology in an Outreach Lab: Students' Assigned Cognitive Load Clusters and the Clusters' Relationships to Learner Characteristics, Laboratory Variables, and Cognitive Achievement

ERIC Educational Resources Information Center

Scharfenberg, Franz-Josef; Bogner, Franz X.

2013-01-01

This study classified students into different cognitive load (CL) groups by means of cluster analysis based on their experienced CL in a gene technology outreach lab which has instructionally been designed with regard to CL theory. The relationships of the identified student CL clusters to learner characteristics, laboratory variables, and…
Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species

DOE Office of Scientific and Technical Information (OSTI.GOV)

Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna

Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.
Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species

DOE PAGES

Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; ...

2015-04-09

Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.
Molecular Networking and Pattern-Based Genome Mining Improves discovery of biosynthetic gene clusters and their products from Salinispora species

PubMed Central

Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; Sarkar, Anindita; Li, Jie; Ziemert, Nadine; Wang, Mingxun; Bandeira, Nuno; Moore, Bradley S.; Dorrestein, Pieter C.; Jensen, Paul R.

2015-01-01

Summary: Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. Here we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. These efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches. PMID:25865308
Characterization of the biosynthetic gene cluster of rebeccamycin from Lechevalieria aerocolonigenes ATCC 39243.

PubMed

Onaka, Hiroyasu; Taniguchi, Shin-ichi; Igarashi, Yasuhiro; Furumai, Tamotsu

2003-01-01

The biosynthetic gene cluster for rebeccamycin, an indolocarbazole antibiotic, from Lechevalieria aerocolonigenes ATCC 39243 has 11 ORFs. To clarify their functions, mutants with rebG, rebD, rebC, rebP, rebM, rebR, rebH, rebT, or orfD2 disrupted were constructed, and the gene products were examined. rebP disruptants produced 11,11'-dichlorochromopyrrolic acid, found to be a biosynthetic intermediate by a bioconversion experiment. Other genes encoded N-glycosyltransferase (rebG), monooxygenase (rebC), methyltransferase (rebM), a transcriptional activator (rebR), and halogenase (rebH). rebT disruptants produced as much rebeccamycin as the wild strain, so rebT was probably not involved in rebeccamycin production. Biosynthetic genes of staurosporine, another indolocarbazole antibiotic, were cloned from Streptomyces sp. TP-A0274. staO, staD, and staP were similar to rebO, rebD, and rebP, respectively, all of which are responsible for indolocarbazole biosynthesis, but a rebC homolog, encoding a putative enzyme oxidizing the C-7 site of pyrrole rings, was not found in the staurosporine biosynthetic gene cluster. These results suggest that indolocarbazole is constructed by oxidative decarboxylation of chromopyrrolic acid (11,11'-dichlorochromopyrrolic acid in rebeccamycin) generated from two molecules of tryptophan by coupling and that the oxidation state at the C-7 position depends on the additional enzyme(s) encoded by the biosynthetic genes.
Characterization of a gene cluster responsible for the biosynthesis of anticancer agent FK228 in Chromobacterium violaceum No. 968.

PubMed

Cheng, Yi-Qiang; Yang, Min; Matter, Andrea M

2007-06-01

A gene cluster responsible for the biosynthesis of anticancer agent FK228 has been identified, cloned, and partially characterized in Chromobacterium violaceum no. 968. First, a genome-scanning approach was applied to identify three distinctive C. violaceum no. 968 genomic DNA clones that code for portions of nonribosomal peptide synthetase and polyketide synthase. Next, a gene replacement system developed originally for Pseudomonas aeruginosa was adapted to inactivate the genomic DNA-associated candidate natural product biosynthetic genes in vivo with high efficiency. Inactivation of a nonribosomal peptide synthetase-encoding gene completely abolished FK228 production in mutant strains. Subsequently, the entire FK228 biosynthetic gene cluster was cloned and sequenced. This gene cluster is predicted to encompass a 36.4-kb DNA region that includes 14 genes. The products of nine biosynthetic genes are proposed to constitute an unusual hybrid nonribosomal peptide synthetase-polyketide synthase-nonribosomal peptide synthetase assembly line including accessory activities for the biosynthesis of FK228. In particular, a putative flavin adenine dinucleotide-dependent pyridine nucleotide-disulfide oxidoreductase is proposed to catalyze disulfide bond formation between two sulfhydryl groups of cysteine residues as the final step in FK228 biosynthesis. Acquisition of the FK228 biosynthetic gene cluster and acclimation of an efficient genetic system should enable genetic engineering of the FK228 biosynthetic pathway in C. violaceum no. 968 for the generation of structural analogs as anticancer drug candidates.
Simultaneous clustering of gene expression data with clinical chemistry and pathological evaluations reveals phenotypic prototypes

PubMed Central

Bushel, Pierre R; Wolfinger, Russell D; Gibson, Greg

2007-01-01

Background: Commonly employed clustering methods for analysis of gene expression data do not directly incorporate phenotypic data about the samples. Furthermore, clustering of samples with known phenotypes is typically performed in an informal fashion. The inability of clustering algorithms to incorporate biological data in the grouping process can limit proper interpretation of the data and its underlying biology. Results: We present a more formal approach, the modk-prototypes algorithm, for clustering biological samples based on simultaneously considering microarray gene expression data and classes of known phenotypic variables such as clinical chemistry evaluations and histopathologic observations. The strategy involves constructing an objective function with the sum of the squared Euclidean distances for numeric microarray and clinical chemistry data and simple matching for histopathology categorical values in order to measure dissimilarity of the samples. Separate weighting terms are used for microarray, clinical chemistry and histopathology measurements to control the influence of each data domain on the clustering of the samples. The dynamic validity index for numeric data was modified with a category utility measure for determining the number of clusters in the data sets. A cluster's prototype, formed from the mean of the values for numeric features and the mode of the categorical values of all the samples in the group, is representative of the phenotype of the cluster members. The approach is shown to work well with a simulated mixed data set and two real data examples containing numeric and categorical data types, one from a heart disease study and another from acetaminophen (an analgesic) exposure in rat liver that causes centrilobular necrosis. Conclusion: The modk-prototypes algorithm partitioned the simulated data into clusters with samples in their respective class group and the heart disease samples into two groups (sick and buff denoting samples
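The dissimilarity and prototype described in the Results of this record can be sketched as follows. This is only an illustration of the stated ingredients (weighted squared Euclidean distance for the two numeric domains, simple matching for the categorical domain, mean/mode prototypes). The dictionary keys, weight values, and toy samples are hypothetical, and the dynamic validity index and the clustering loop itself are not reproduced.

```python
from collections import Counter


def squared_euclidean(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))


def mixed_dissimilarity(sample, prototype, weights):
    """Weighted sum over the three data domains named in the abstract."""
    # Simple matching: count categorical values that differ.
    mismatches = sum(a != b for a, b in zip(sample["histo"], prototype["histo"]))
    return (weights["array"] * squared_euclidean(sample["array"], prototype["array"])
            + weights["chem"] * squared_euclidean(sample["chem"], prototype["chem"])
            + weights["histo"] * mismatches)


def prototype_of(samples):
    """Mean of each numeric feature, mode of each categorical feature."""
    def mean(rows):
        return [sum(col) / len(col) for col in zip(*rows)]

    def mode(rows):
        return [Counter(col).most_common(1)[0][0] for col in zip(*rows)]

    return {"array": mean([s["array"] for s in samples]),
            "chem": mean([s["chem"] for s in samples]),
            "histo": mode([s["histo"] for s in samples])}


# Toy (made-up) samples: two microarray values, one clinical chemistry
# value, and one histopathology call each.
s1 = {"array": [1.0, 2.0], "chem": [10.0], "histo": ["necrosis"]}
s2 = {"array": [3.0, 2.0], "chem": [14.0], "histo": ["necrosis"]}
s3 = {"array": [2.0, 2.0], "chem": [12.0], "histo": ["normal"]}

proto = prototype_of([s1, s2, s3])
weights = {"array": 1.0, "chem": 0.5, "histo": 2.0}  # hypothetical weights
d1 = mixed_dissimilarity(s1, proto, weights)
d3 = mixed_dissimilarity(s3, proto, weights)
```

Raising a domain's weight makes disagreement in that domain count for more, which is how the abstract describes controlling each domain's influence on the grouping.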
Genomic characterization of a new endophytic Streptomyces kebangsaanensis identifies biosynthetic pathway gene clusters for novel phenazine antibiotic production

PubMed Central

Remali, Juwairiah; Sarmin, Nurul ‘Izzah Mohd; Ng, Chyan Leong; Tiong, John J.L.; Aizat, Wan M.; Keong, Loke Kok

2017-01-01

Background: Streptomyces are well known for their capability to produce many bioactive secondary metabolites with medical and industrial importance. Here we report a novel bioactive phenazine compound, 6-((2-hydroxy-4-methoxyphenoxy) carbonyl) phenazine-1-carboxylic acid (HCPCA) extracted from Streptomyces kebangsaanensis, an endophyte isolated from the ethnomedicinal Portulaca oleracea. Methods: The HCPCA chemical structure was determined using nuclear magnetic resonance spectroscopy. We conducted whole genome sequencing for the identification of the gene cluster(s) believed to be responsible for phenazine biosynthesis in order to map its corresponding pathway, in addition to bioinformatics analysis to assess the potential of S. kebangsaanensis in producing other useful secondary metabolites. Results: The S. kebangsaanensis genome comprises an 8,328,719 bp linear chromosome with high GC content (71.35%) consisting of 12 rRNA operons, 81 tRNA, and 7,558 protein coding genes. We identified 24 gene clusters involved in polyketide, nonribosomal peptide, terpene, bacteriocin, and siderophore biosynthesis, as well as a gene cluster predicted to be responsible for phenazine biosynthesis. Discussion: The HCPCA phenazine structure was hypothesized to derive from the combination of two biosynthetic pathways, phenazine-1,6-dicarboxylic acid and 4-methoxybenzene-1,2-diol, originated from the shikimic acid pathway. The identification of a biosynthesis pathway gene cluster for phenazine antibiotics might facilitate future genetic engineering design of new synthetic phenazine antibiotics. Additionally, these findings confirm the potential of S. kebangsaanensis for producing various antibiotics and secondary metabolites. PMID:29201559
Prognostic value of alcohol dehydrogenase mRNA expression in gastric cancer.

PubMed

Guo, Erna; Wei, Haotang; Liao, Xiwen; Xu, Yang; Li, Shu; Zeng, Xiaoyun

2018-04-01

Previous studies have reported that alcohol dehydrogenase (ADH) isoenzymes possess diagnostic value in gastric cancer (GC). However, the prognostic value of ADH isoenzymes in GC remains unclear. The aim of the present study was to identify the prognostic value of ADH genes in patients with GC. The prognostic value of ADH genes was investigated in patients with GC using the Kaplan-Meier plotter tool. Kaplan-Meier plots were used to assess the difference between groups of patients with GC with different prognoses. Hazard ratios (HR) and 95% confidence intervals (CI) were used to assess the relative risk of GC survival. Overall, 593 patients with GC and 7 ADH genes were included in the survival analysis. High expression of ADH 1A (class 1), α polypeptide (ADH1A; log-rank P=0.043; HR=0.79; 95% CI: 0.64-0.99), ADH 1B (class 1), β polypeptide (ADH1B; log-rank P=1.9×10−05; HR=0.65; 95% CI: 0.53-0.79) and ADH 5 (class III), χ polypeptide (ADH5; log-rank P=0.0011; HR=0.73; 95% CI: 0.6-0.88) resulted in a significantly decreased risk of mortality in all patients with GC compared with patients with low expression of those genes. Furthermore, protective effects may additionally be observed in patients with intestinal-type GC with high expression of ADH1B (log-rank P=0.031; HR=0.64; 95% CI: 0.43-0.96) and patients with diffuse-type GC with high expression of ADH1A (log-rank P=0.014; HR=0.51; 95% CI: 0.3-0.88), ADH1B (log-rank P=0.04; HR=0.53; 95% CI: 0.29-0.98), ADH 4 (class II), π polypeptide (log-rank P=0.033; HR=0.58; 95% CI: 0.35-0.96) and ADH 6 (class V) (log-rank P=0.037; HR=0.59; 95% CI: 0.35-0.97) resulting in a significantly decreased risk of mortality compared with patients with low expression of those genes. In contrast, patients with diffuse-type GC with high expression of ADH5 (log-rank P=0.044; HR=1.66; 95% CI: 1.01-2.74) were significantly correlated with a poor prognosis. The results of the present study suggest that ADH1A and ADH1B may be potential
Prognostic value of alcohol dehydrogenase mRNA expression in gastric cancer

PubMed Central

Guo, Erna; Wei, Haotang; Liao, Xiwen; Xu, Yang; Li, Shu; Zeng, Xiaoyun

2018-01-01

Previous studies have reported that alcohol dehydrogenase (ADH) isoenzymes possess diagnostic value in gastric cancer (GC). However, the prognostic value of ADH isoenzymes in GC remains unclear. The aim of the present study was to identify the prognostic value of ADH genes in patients with GC. The prognostic value of ADH genes was investigated in patients with GC using the Kaplan-Meier plotter tool. Kaplan-Meier plots were used to assess the difference between groups of patients with GC with different prognoses. Hazard ratios (HR) and 95% confidence intervals (CI) were used to assess the relative risk of GC survival. Overall, 593 patients with GC and 7 ADH genes were included in the survival analysis. High expression of ADH 1A (class 1), α polypeptide (ADH1A; log-rank P=0.043; HR=0.79; 95% CI: 0.64–0.99), ADH 1B (class 1), β polypeptide (ADH1B; log-rank P=1.9×10−05; HR=0.65; 95% CI: 0.53–0.79) and ADH 5 (class III), χ polypeptide (ADH5; log-rank P=0.0011; HR=0.73; 95% CI: 0.6–0.88) resulted in a significantly decreased risk of mortality in all patients with GC compared with patients with low expression of those genes. Furthermore, protective effects may additionally be observed in patients with intestinal-type GC with high expression of ADH1B (log-rank P=0.031; HR=0.64; 95% CI: 0.43–0.96) and patients with diffuse-type GC with high expression of ADH1A (log-rank P=0.014; HR=0.51; 95% CI: 0.3–0.88), ADH1B (log-rank P=0.04; HR=0.53; 95% CI: 0.29–0.98), ADH 4 (class II), π polypeptide (log-rank P=0.033; HR=0.58; 95% CI: 0.35–0.96) and ADH 6 (class V) (log-rank P=0.037; HR=0.59; 95% CI: 0.35–0.97) resulting in a significantly decreased risk of mortality compared with patients with low expression of those genes. In contrast, patients with diffuse-type GC with high expression of ADH5 (log-rank P=0.044; HR=1.66; 95% CI: 1.01–2.74) were significantly correlated with a poor prognosis. The results of the present study suggest that ADH1A and ADH1B may
Patterning in time and space: HoxB cluster gene expression in the developing chick embryo.

PubMed

Gouveia, Analuce; Marcelino, Hugo M; Gonçalves, Lisa; Palmeirim, Isabel; Andrade, Raquel P

2015-01-01

The developing embryo is a paradigmatic model to study molecular mechanisms of time control in Biology. Hox genes are key players in the specification of tissue identity during embryo development and their expression is under strict temporal regulation. However, the molecular mechanisms underlying timely Hox activation in the early embryo remain unknown. This is hindered by the lack of a rigorous temporal framework of sequential Hox expression within a single cluster. Herein, a thorough characterization of HoxB cluster gene expression was performed over time and space in the early chick embryo. Clear temporal collinearity of HoxB cluster gene expression activation was observed. Spatial collinearity of HoxB expression was evidenced in different stages of development and in multiple tissues. Using embryo explant cultures we showed that HoxB2 is cyclically expressed in the rostral presomitic mesoderm with the same periodicity as somite formation, suggesting a link between timely tissue specification and somite formation. We foresee that the molecular framework herein provided will facilitate experimental approaches aimed at identifying the regulatory mechanisms underlying Hox expression in Time and Space.
Patterning in time and space: HoxB cluster gene expression in the developing chick embryo

PubMed Central

Gouveia, Analuce; Marcelino, Hugo M; Gonçalves, Lisa; Palmeirim, Isabel; Andrade, Raquel P

2015-01-01

The developing embryo is a paradigmatic model to study molecular mechanisms of time control in Biology. Hox genes are key players in the specification of tissue identity during embryo development and their expression is under strict temporal regulation. However, the molecular mechanisms underlying timely Hox activation in the early embryo remain unknown. This is hindered by the lack of a rigorous temporal framework of sequential Hox expression within a single cluster. Herein, a thorough characterization of HoxB cluster gene expression was performed over time and space in the early chick embryo. Clear temporal collinearity of HoxB cluster gene expression activation was observed. Spatial collinearity of HoxB expression was evidenced in different stages of development and in multiple tissues. Using embryo explant cultures we showed that HoxB2 is cyclically expressed in the rostral presomitic mesoderm with the same periodicity as somite formation, suggesting a link between timely tissue specification and somite formation. We foresee that the molecular framework herein provided will facilitate experimental approaches aimed at identifying the regulatory mechanisms underlying Hox expression in Time and Space. PMID:25602523
Drug repositioning for orphan genetic diseases through Conserved Anticoexpressed Gene Clusters (CAGCs)

PubMed Central

2013-01-01

Background: The development of new therapies for orphan genetic diseases represents an extremely important medical and social challenge. Drug repositioning, i.e. finding new indications for approved drugs, could be one of the most cost- and time-effective strategies to cope with this problem, at least in a subset of cases. Therefore, many computational approaches based on the analysis of high throughput gene expression data have so far been proposed to reposition available drugs. However, most of these methods require gene expression profiles directly relevant to the pathologic conditions under study, such as those obtained from patient cells and/or from suitable experimental models. In this work we have developed a new approach for drug repositioning, based on identifying known drug targets showing conserved anti-correlated expression profiles with human disease genes, which is completely independent from the availability of ‘ad hoc’ gene expression data-sets. Results: By analyzing available data, we provide evidence that the genes displaying conserved anti-correlation with drug targets are antagonistically modulated in their expression by treatment with the relevant drugs. We then identified clusters of genes associated with similar phenotypes and showing conserved anticorrelation with drug targets. On this basis, we generated a list of potential candidate drug-disease associations. Importantly, we show that some of the proposed associations are already supported by independent experimental evidence. Conclusions: Our results support the hypothesis that the identification of gene clusters showing conserved anticorrelation with drug targets can be an effective method for drug repositioning and provide a wide list of new potential drug-disease associations for experimental validation. PMID:24088245
[Abnormal expression of genes that regulate retinoid metabolism and signaling in non-small-cell lung cancer].

PubMed

Kuznetsova, E S; Zinovieva, O L; Oparina, N Yu; Prokofjeva, M M; Spirin, P V; Favorskaya, I A; Zborovskaya, I B; Lisitsyn, N A; Prassolov, V S; Mashkova, T D

2016-01-01

Retinoids are signaling molecules that control a wide variety of cellular processes and possess antitumor activity. This work presents a comprehensive description of changes in the expression of 23 genes that regulate retinoid metabolism and signaling in non-small-cell lung cancer tumors compared to adjacent normal tissues, obtained using RT-PCR. Even at early stages of malignant transformation, a significant decrease in ADH1B, ADH3, RDHL, and RALDH1 mRNA levels was observed in 82, 79, 73, and 64% of tumor specimens, respectively, and a considerable increase in AKR1B10 mRNA content was observed in 80% of tumors. Dramatic changes in the levels of these mRNAs can impair the synthesis of all-trans retinoic acid, a key natural regulatory retinoid. Apart from that, it was found that mRNA levels of nuclear retinoid receptor genes RXRγ, RARα, RXRα, and gene RDH11 were significantly decreased in 80, 67, 57, and 66% of tumor specimens, respectively. Thus, neoplastic transformation of lung tissue cells is accompanied by deregulated expression of key genes of retinoid metabolism and function.

Early Dysregulation of Cell Adhesion and Extracellular Matrix Pathways in Breast Cancer Progression

PubMed Central

Emery, Lyndsey A.; Tripathi, Anusri; King, Chialin; Kavanah, Maureen; Mendez, Jane; Stone, Michael D.; de las Morenas, Antonio; Sebastiani, Paola; Rosenberg, Carol L.

2009-01-01

Proliferative breast lesions, such as simple ductal hyperplasia (SH) and atypical ductal hyperplasia (ADH), are candidate precursors to ductal carcinoma in situ (DCIS) and invasive cancer. To better understand the relationship of breast lesions to more advanced disease, we used microdissection and DNA microarrays to profile the gene expression of patient-matched histologically normal (HN), ADH, and DCIS from 12 patients with estrogen receptor positive sporadic breast cancer. SH were profiled from a subset of cases. We found 837 differentially expressed genes between DCIS-HN and 447 between ADH-HN, with >90% of the ADH-HN genes also present among the DCIS-HN genes. Only 61 genes were identified between ADH-DCIS. Expression differences were reproduced in an independent cohort of patient-matched lesions by quantitative real-time PCR. Many breast cancer-related genes and pathways were dysregulated in ADH and maintained in DCIS. Particularly, cell adhesion and extracellular matrix interactions were overrepresented. Focal adhesion was the top pathway in each gene set. We conclude that ADH and DCIS share highly similar gene expression and are distinct from HN. In contrast, SH appear more similar to HN. These data provide genetic evidence that ADH (but not SH) are often precursors to cancer and suggest cancer-related genetic changes, particularly adhesion and extracellular matrix pathways, are dysregulated before invasion and even before malignancy is apparent. These findings could lead to novel risk stratification, prevention, and treatment approaches. PMID:19700746
Molecular characterization of the PR-toxin gene cluster in Penicillium roqueforti and Penicillium chrysogenum: cross talk of secondary metabolite pathways.

PubMed

Hidalgo, Pedro I; Ullán, Ricardo V; Albillos, Silvia M; Montero, Olimpio; Fernández-Bodega, María Ángeles; García-Estrada, Carlos; Fernández-Aguado, Marta; Martín, Juan-Francisco

2014-01-01

The PR-toxin is a potent mycotoxin produced by Penicillium roqueforti in moulded grains and grass silages and may contaminate blue-veined cheese. The PR-toxin derives from the 15-carbon sesquiterpene aristolochene formed by the aristolochene synthase (encoded by ari1). We have cloned and sequenced a four-gene cluster that includes the ari1 gene from P. roqueforti. Gene silencing of each of the four genes (named prx1 to prx4) resulted in a reduction of 65-75% in the production of PR-toxin, indicating that the four genes encode enzymes involved in PR-toxin biosynthesis. Interestingly, the four silenced mutants overproduce large amounts of mycophenolic acid, an antitumor compound formed by an unrelated pathway, suggesting a cross-talk of PR-toxin and mycophenolic acid production. An eleven-gene cluster that includes the above-mentioned four prx genes and a 14-TMS drug/H(+) antiporter was found in the genome of Penicillium chrysogenum. This eleven-gene cluster has been reported to be very poorly expressed in a transcriptomic study of P. chrysogenum genes under conditions of penicillin production (strongly aerated cultures). We found that this apparently silent gene cluster is able to produce PR-toxin in P. chrysogenum under static culture conditions on hydrated rice medium. Notably, the production of PR-toxin was 2.6-fold higher in P. chrysogenum npe10, a strain deleted in the 56.8kb amplifiable region containing the pen gene cluster, than in the parental strain Wisconsin 54-1255, providing another example of cross-talk between secondary metabolite pathways in this fungus. A detailed PR-toxin biosynthesis pathway is proposed based on all available evidence. Copyright © 2013 Elsevier Inc. All rights reserved.
RNase 1 genes from the Family Sciuridae define a novel rodent ribonuclease cluster

PubMed Central

Siegel, Steven J.; Percopo, Caroline M.; Dyer, Kimberly D.; Zhao, Wei; Roth, V. Louise; Mercer, John M.; Rosenberg, Helene F.

2009-01-01

The RNase A ribonucleases are a complex group of functionally diverse secretory proteins with conserved enzymatic activity. We have identified novel RNase 1 genes from four species of squirrel (order Rodentia, family Sciuridae). Squirrel RNase 1 genes encode typical RNase A ribonucleases, each with eight cysteines, a conserved CKXXNTF signature motif, and a canonical His12-Lys41-His119 catalytic triad. Two alleles encode Callosciurus prevostii RNase 1, which include a Ser18↔Pro, analogous to the sequence polymorphisms found among the RNase 1 duplications in the genome of Rattus exulans. Interestingly, although the squirrel RNase 1 genes are closely related to one another (77 to 95% amino acid sequence identity), the cluster as a whole is distinct and divergent from the clusters including RNase 1 genes from other rodent species. We examined the specific sites at which Sciuridae RNase 1s diverge from Muridae / Cricetidae RNase 1s, and determined that the divergent sites are located on the external surface, with complete sparing of the catalytic crevice. The full significance of these findings awaits a more complete understanding of the biological role of mammalian RNase 1s. PMID:19771477
Contribution of the Pmra Promoter to Expression of Genes in the Escherichia coli mra Cluster of Cell Envelope Biosynthesis and Cell Division Genes

PubMed Central

Mengin-Lecreulx, Dominique; Ayala, Juan; Bouhss, Ahmed; van Heijenoort, Jean; Parquet, Claudine; Hara, Hiroshi

1998-01-01

Recently, a promoter for the essential gene ftsI, which encodes penicillin-binding protein 3 of Escherichia coli, was precisely localized 1.9 kb upstream from this gene, at the beginning of the mra cluster of cell division and cell envelope biosynthesis genes (H. Hara, S. Yasuda, K. Horiuchi, and J. T. Park, J. Bacteriol. 179:5802–5811, 1997). Disruption of this promoter (Pmra) on the chromosome and its replacement by the lac promoter (Pmra::Plac) led to isopropyl-β-d-thiogalactopyranoside (IPTG)-dependent cells that lysed in the absence of inducer, a defect which was complemented only when the whole region from Pmra to ftsW, the fifth gene downstream from ftsI, was provided in trans on a plasmid. In the present work, the levels of various proteins involved in peptidoglycan synthesis and cell division were precisely determined in cells in which Pmra::Plac promoter expression was repressed or fully induced. It was confirmed that the Pmra promoter is required for expression of the first nine genes of the mra cluster: mraZ (orfC), mraW (orfB), ftsL (mraR), ftsI, murE, murF, mraY, murD, and ftsW. Interestingly, three- to sixfold-decreased levels of MurG and MurC enzymes were observed in uninduced Pmra::Plac cells. This was correlated with an accumulation of the nucleotide precursors UDP–N-acetylglucosamine and UDP–N-acetylmuramic acid, substrates of these enzymes, and with a depletion of the pool of UDP–N-acetylmuramyl pentapeptide, resulting in decreased cell wall peptidoglycan synthesis. Moreover, the expression of ftsZ, the penultimate gene from this cluster, was significantly reduced when Pmra expression was repressed. It was concluded that the transcription of the genes located downstream from ftsW in the mra cluster, from murG to ftsZ, is also mainly (but not exclusively) dependent on the Pmra promoter. PMID:9721276
Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets.

PubMed

Salem, Saeed; Ozcaglar, Cagri

2014-01-01

Advances in genomic technologies have enabled the accumulation of vast amounts of genomic data, including gene expression data for multiple species under various biological and environmental conditions. Integration of these gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on a single gene expression dataset, which suffers from spurious coexpression. We propose a joint mining algorithm that constructs a weighted hybrid similarity graph whose nodes are the coexpression links. The weight of an edge between two coexpression links in this hybrid graph is a linear combination of the topological similarities and co-appearance similarities of the corresponding two coexpression links. Clustering the weighted hybrid similarity graph yields recurrent coexpression link clusters (modules). Experimental results on human gene expression datasets show that the reported modules are functionally homogeneous, as evidenced by their enrichment with biological process GO terms and KEGG pathways.
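
The edge-weight definition in this abstract can be illustrated with a small Python sketch. The abstract does not say how the two similarities are computed, so the Jaccard-based measures, the mixing parameter `alpha`, the function names, and the toy gene and dataset identifiers below are all assumptions made for illustration, not the authors' definitions.

```python
from itertools import combinations

def jaccard(a, b):
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def hybrid_weight(link1, link2, occurrences, alpha=0.5):
    """Linear combination of topological and co-appearance similarity.

    link1, link2: frozensets holding the two genes of a coexpression link.
    occurrences:  dict mapping each link to the set of dataset ids containing it.
    alpha:        hypothetical mixing weight in [0, 1].
    """
    topo = jaccard(link1, link2)                             # assumption: shared endpoints
    coapp = jaccard(occurrences[link1], occurrences[link2])  # assumption: shared datasets
    return alpha * topo + (1 - alpha) * coapp

def build_hybrid_graph(occurrences, alpha=0.5):
    """Return {(link1, link2): weight} for every pair of links with non-zero weight."""
    graph = {}
    for l1, l2 in combinations(sorted(occurrences, key=sorted), 2):
        w = hybrid_weight(l1, l2, occurrences, alpha)
        if w > 0:
            graph[(l1, l2)] = w
    return graph

# Toy input: three links observed across three hypothetical datasets
occurrences = {
    frozenset({"geneA", "geneB"}): {1, 2, 3},
    frozenset({"geneB", "geneC"}): {1, 2},
    frozenset({"geneD", "geneE"}): {3},
}
graph = build_hybrid_graph(occurrences, alpha=0.5)
```

In this sketch the nodes of `graph` are links rather than genes, so any graph clustering method applied to it would group links that recur together across datasets, which is the idea the abstract describes.
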
Crystal structures of OrfX2 and P47 from a Botulinum neurotoxin OrfX-type gene cluster.

PubMed

Gustafsson, Robert; Berntsson, Ronnie P-A; Martínez-Carranza, Markel; El Tekle, Geniver; Odegrip, Richard; Johnson, Eric A; Stenmark, Pål

2017-11-01

Botulinum neurotoxins are highly toxic substances and are all encoded together with one of two alternative gene clusters, the HA or the OrfX gene cluster. Very little is known about the function and structure of the proteins encoded in the OrfX gene cluster, which in addition to the toxin contains five proteins (OrfX1, OrfX2, OrfX3, P47, and NTNH). We here present the structures of OrfX2 and P47, solved to 2.1 and 1.8 Å, respectively. We show that they belong to the TULIP protein superfamily, whose members are often involved in lipid binding. OrfX1 and OrfX2 were both found to bind phosphatidylinositol lipids. © 2017 Federation of European Biochemical Societies.
Tissue-specific impact of FADS cluster variants on FADS1 and FADS2 gene expression.

PubMed

Reynolds, Lindsay M; Howard, Timothy D; Ruczinski, Ingo; Kanchan, Kanika; Seeds, Michael C; Mathias, Rasika A; Chilton, Floyd H

2018-01-01

Omega-6 (n-6) and omega-3 (n-3) long (≥ 20 carbon) chain polyunsaturated fatty acids (LC-PUFAs) play a critical role in human health and disease. Biosynthesis of LC-PUFAs from dietary 18 carbon PUFAs in tissues such as the liver is highly associated with genetic variation within the fatty acid desaturase (FADS) gene cluster, containing FADS1 and FADS2 that encode the rate-limiting desaturation enzymes in the LC-PUFA biosynthesis pathway. However, the molecular mechanisms by which FADS genetic variants affect LC-PUFA biosynthesis, and in which tissues, are unclear. The current study examined associations between common single nucleotide polymorphisms (SNPs) within the FADS gene cluster and FADS1 and FADS2 gene expression in 44 different human tissues (sample sizes ranging from 70 to 361) from the Genotype-Tissue Expression (GTEx) Project. FADS1 and FADS2 expression was detected in all 44 tissues. Significant cis-eQTLs (within 1 megabase of each gene, False Discovery Rate, FDR<0.05, as defined by GTEx) were identified in 12 tissues for FADS1 gene expression and 23 tissues for FADS2 gene expression. Six tissues had significant (FDR<0.05) eQTLs associated with both FADS1 and FADS2 (including artery, esophagus, heart, muscle, nerve, and thyroid). Interestingly, the identified eQTLs were consistently found to be associated in opposite directions for FADS1 and FADS2 expression. Taken together, findings from this study suggest common SNPs within the FADS gene cluster impact the transcription of FADS1 and FADS2 in numerous tissues and raise important questions about how the inverse expression of these two genes impacts intermediate molecular phenotypes (such as LC-PUFA and LC-PUFA-containing glycerolipid levels) and ultimately clinical phenotypes associated with inflammatory diseases and brain health.
Comparing large covariance matrices under weak conditions on the dependence structure and its application to gene clustering.

PubMed

Chang, Jinyuan; Zhou, Wen; Zhou, Wen-Xin; Wang, Lan

2017-03-01

Comparing large covariance matrices has important applications in modern genomics, where scientists are often interested in understanding whether relationships (e.g., dependencies or co-regulations) among a large number of genes vary between different biological states. We propose a computationally fast procedure for testing the equality of two large covariance matrices when the dimensions of the covariance matrices are much larger than the sample sizes. A distinguishing feature of the new procedure is that it imposes no structural assumptions on the unknown covariance matrices. Hence, the test is robust with respect to various complex dependence structures that frequently arise in genomics. We prove that the proposed procedure is asymptotically valid under weak moment conditions. As an interesting application, we derive a new gene clustering algorithm which shares the same nice property of avoiding restrictive structural assumptions for high-dimensional genomics data. Using an asthma gene expression dataset, we illustrate how the new test helps compare the covariance matrices of the genes across different gene sets/pathways between the disease group and the control group, and how the gene clustering algorithm provides new insights on the way gene clustering patterns differ between the two groups. The proposed methods have been implemented in an R-package HDtest and are available on CRAN. © 2016, The International Biometric Society.
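
To make the testing problem in this abstract concrete, the Python sketch below compares two sample covariance matrices with a max-type statistic. It is not the procedure implemented in HDtest: it swaps in a plain permutation test on the unstandardized largest entrywise difference, and the function names, the group centering step, and the simulated data are assumptions chosen for illustration only.

```python
import random
from statistics import mean

def center(samples):
    """Subtract the per-variable mean from every observation vector."""
    p = len(samples[0])
    mu = [mean(row[j] for row in samples) for j in range(p)]
    return [[row[j] - mu[j] for j in range(p)] for row in samples]

def covariance(samples):
    """Sample covariance matrix (list of rows) for a list of observation vectors."""
    n, p = len(samples), len(samples[0])
    mu = [mean(row[j] for row in samples) for j in range(p)]
    return [[sum((r[j] - mu[j]) * (r[k] - mu[k]) for r in samples) / (n - 1)
             for k in range(p)] for j in range(p)]

def max_abs_difference(x, y):
    """Largest entrywise gap between the two sample covariance matrices."""
    cx, cy = covariance(x), covariance(y)
    return max(abs(a - b) for rx, ry in zip(cx, cy) for a, b in zip(rx, ry))

def permutation_pvalue(x, y, n_perm=500, seed=0):
    """Permutation p-value for H0: both groups share one covariance matrix."""
    rng = random.Random(seed)
    observed = max_abs_difference(x, y)
    # Assumption: centre each group first so that mean shifts do not drive the test
    pooled = center(x) + center(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if max_abs_difference(pooled[:len(x)], pooled[len(x):]) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Simulated example: group_b has a larger variance than group_a
rng = random.Random(1)
group_a = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(20)]
group_b = [[rng.gauss(0, 3) for _ in range(3)] for _ in range(20)]
p_value = permutation_pvalue(group_a, group_b, n_perm=200)
```

A permutation approach like this becomes expensive when the number of genes is large, which is part of the motivation for the computationally fast procedure the abstract proposes.
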
Spatiotemporal clustering of the epigenome reveals rules of dynamic gene regulation

PubMed Central

Yu, Pengfei; Xiao, Shu; Xin, Xiaoyun; Song, Chun-Xiao; Huang, Wei; McDee, Darina; Tanaka, Tetsuya; Wang, Ting; He, Chuan; Zhong, Sheng

2013-01-01

Spatial organization of different epigenomic marks was used to infer functions of the epigenome. It remains unclear what can be learned from the temporal changes of the epigenome. Here, we developed a probabilistic model to cluster genomic sequences based on the similarity of temporal changes of multiple epigenomic marks during a cellular differentiation process. We differentiated mouse embryonic stem (ES) cells into mesendoderm cells. At three time points during this differentiation process, we used high-throughput sequencing to measure seven histone modifications and variants—H3K4me1/2/3, H3K27ac, H3K27me3, H3K36me3, and H2A.Z; two DNA modifications—5-mC and 5-hmC; and transcribed mRNAs and noncoding RNAs (ncRNAs). Genomic sequences were clustered based on the spatiotemporal epigenomic information. These clusters not only clearly distinguished gene bodies, promoters, and enhancers, but also were predictive of bidirectional promoters, miRNA promoters, and piRNAs. This suggests specific epigenomic patterns exist on piRNA genes much earlier than germ cell development. Temporal changes of H3K4me2, unmethylated CpG, and H2A.Z were predictive of 5-hmC changes, suggesting unmethylated CpG and H3K4me2 as potential upstream signals guiding TETs to specific sequences. Several rules on combinatorial epigenomic changes and their effects on mRNA expression and ncRNA expression were derived, including a simple rule governing the relationship between 5-hmC and gene expression levels. A Sox17 enhancer containing a FOXA2 binding site and a Foxa2 enhancer containing a SOX17 binding site were identified, suggesting a positive feedback loop between the two mesendoderm transcription factors. These data illustrate the power of using epigenome dynamics to investigate regulatory functions. PMID:23033340
Mineral exploration, Mahd adh Dhahab District, Kingdom of Saudi Arabia

USGS Publications Warehouse

Worl, Ronald G.

1978-01-01

Mahd adh Dhahab is the largest of numerous ancient gold mines scattered through the Precambrian shield of Saudi Arabia and the only one with recent production. During the period 1939-54, 765,768 fine ounces of gold and 1,002,029 ounces of silver were produced from the mines by the Saudi Arabian Mining Syndicate. Ore minerals at Mahd adh Dhahab include free gold and silver, tellurides, sphalerite, and chalcopyrite in and associated with a system of north-trending quartz veins and quartz veinlet stockworks. Pyrite is a common sulfide gangue mineral. Country rocks are a north dipping sequence of pyroclastic and transported pyroclastic rocks of the Hulayfah Group that are locally highly silicified and potassium-feldspathized. The prime target for this exploration program was a north-trending zone of quartz veins and breccias, faults, alteration, and metalization approximately 400 m wide and 1000 m long. The ancient and recent mine workings are located in the northern part of this zone. Although the quartz veins and alteration cut all lithologies, the major metalization is confined to the intersection of veins and agglomerate. Ten holes were diamond drilled to explore geochemical, geological, and geophysical targets in the area. A significant new zone of metalization was discovered 700 m south of the ancient and recent mine workings and within the same major zone of quartz veins, alteration, and faults. Metalization in this southern mineralized zone is at the intersection of the quartz veins and a distinctive and highly altered agglomerate. The total zone of vein and agglomerate intercept is potentially metalized and comprises a block of ground 40 m thick and 400 m wide along the strike of the agglomerate and projected downdip 250 m. Tonnage of this block is 17.2 million tons. The explored zone, approximately 25 percent of the potentially metalized rock, has a potential resource of 1.1 million tons containing 27 g/t gold and 73 g/t silver.
IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes.

PubMed

Hadjithomas, Michalis; Chen, I-Min A; Chu, Ken; Huang, Jinghua; Ratner, Anna; Palaniappan, Krishna; Andersen, Evan; Markowitz, Victor; Kyrpides, Nikos C; Ivanova, Natalia N

2017-01-04

Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publicly available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
ADH1B polymorphism, alcohol consumption, and binge drinking in Slavic Caucasians: results from the Czech HAPIEE study.

PubMed

Hubacek, Jaroslav A; Pikhart, Hynek; Peasey, Anne; Kubinova, Ruzena; Bobak, Martin

2012-05-01

Several genetic polymorphisms influence the risk of heavy alcohol consumption but it is not well understood whether the genetic effects are similar in different populations and drinking cultures, nor whether the genetic influences on binge drinking are similar to those seen for alcoholism. We have analyzed the effect of the Arg47His (rs1229984) variant within the alcohol dehydrogenase (ADH1B) gene on a range of drinking related variables in a large Eastern European Slavic population (Czech HAPIEE study), which recruited random samples of men and women aged 45-69 years in 7 Czech towns (3,016 males and 3,481 females with complete data). Drinking frequency, annual alcohol intake, prevalence of binge drinking (≥100 g in men and ≥60 g in women at least once a month) and the mean dose of alcohol per occasion were measured by the graduated frequency questionnaire. Alcohol intake in a typical week was used to define heavy drinking (≥350 g/wk in men and ≥210 g/wk in women). Problem drinking (≥2 positive answers on CAGE) and negative consequences of drinking on different aspects of life were also measured. The frequency of the His47 allele carriers was 11%. Homozygotes for the common allele (Arg47Arg), among both males and females, had significantly higher drinking frequency, and annual and weekly intake of alcohol than His47 carriers. The odds ratio of heavy drinking in Arg47Arg homozygotes versus His47 carriers was 2.1 (95% confidence interval 1.1-3.2) in men and 2.2 (1.0-4.7) in women. In females, but not in males, Arg47Arg homozygotes had marginally significantly higher prevalence of binge drinking and mean alcohol dose per drinking session. There was no consistent association with problem drinking and negative consequences of drinking. The ADH1B genotype was associated with the frequency and volume of drinking but its associations with binge drinking and problem drinking were less consistent. Copyright © 2011 by the Research Society on Alcoholism.
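
The three drinking definitions used in this abstract can be written down as a short Python sketch. The numeric cut-offs come from the abstract; the record layout, field names, and the `classify` helper are hypothetical and only show how the thresholds combine, not how the HAPIEE questionnaire data were actually coded.

```python
from dataclasses import dataclass

# Cut-offs in grams of ethanol, as stated in the abstract
BINGE_DOSE_G = {"male": 100, "female": 60}      # single occasion, at least once a month
HEAVY_WEEKLY_G = {"male": 350, "female": 210}   # intake in a typical week
CAGE_PROBLEM_MIN = 2                            # positive CAGE answers

@dataclass
class DrinkingRecord:
    sex: str                     # "male" or "female"
    monthly_peak_dose_g: float   # hypothetical field: largest dose reached at least monthly
    typical_week_g: float        # hypothetical field: grams consumed in a typical week
    cage_positive: int           # number of positive CAGE answers

def classify(record: DrinkingRecord) -> dict:
    """Apply the three cut-offs and return one boolean flag per category."""
    return {
        "binge": record.monthly_peak_dose_g >= BINGE_DOSE_G[record.sex],
        "heavy": record.typical_week_g >= HEAVY_WEEKLY_G[record.sex],
        "problem": record.cage_positive >= CAGE_PROBLEM_MIN,
    }

# A made-up respondent, not a study participant
example = classify(DrinkingRecord("female", 60, 200, 1))
```

Keeping the sex-specific thresholds in dictionaries makes it explicit that the same intake can be classified differently for men and women, as the study definitions require.
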
Discovery of a widely distributed toxin biosynthetic gene cluster

PubMed Central

Lee, Shaun W.; Mitchell, Douglas A.; Markley, Andrew L.; Hensler, Mary E.; Gonzalez, David; Wohlrab, Aaron; Dorrestein, Pieter C.; Nizet, Victor; Dixon, Jack E.

2008-01-01

Bacteriocins represent a large family of ribosomally produced peptide antibiotics. Here we describe the discovery of a widely conserved biosynthetic gene cluster for the synthesis of thiazole and oxazole heterocycles on ribosomally produced peptides. These clusters encode a toxin precursor and all necessary proteins for toxin maturation and export. Using the toxin precursor peptide and heterocycle-forming synthetase proteins from the human pathogen Streptococcus pyogenes, we demonstrate the in vitro reconstitution of streptolysin S activity. We provide evidence that the synthetase enzymes, as predicted from our bioinformatics analysis, introduce heterocycles onto precursor peptides, thereby providing molecular insight into the chemical structure of streptolysin S. Furthermore, our studies reveal that the synthetase exhibits relaxed substrate specificity and modifies toxin precursors from both related and distant species. Given our findings, it is likely that the discovery of similar peptidic toxins will rapidly expand to existing and emerging genomes. PMID:18375757
Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets

PubMed Central

2014-01-01

Background Advances in genomic technologies have enabled the accumulation of vast amounts of genomic data, including gene expression data for multiple species under various biological and environmental conditions. Integration of these gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on a single gene expression dataset, which suffers from spurious coexpression. Results We propose a joint mining algorithm that constructs a weighted hybrid similarity graph whose nodes are the coexpression links. The weight of an edge between two coexpression links in this hybrid graph is a linear combination of the topological similarities and co-appearance similarities of the corresponding two coexpression links. Clustering the weighted hybrid similarity graph yields recurrent coexpression link clusters (modules). Experimental results on human gene expression datasets show that the reported modules are functionally homogeneous, as evidenced by their enrichment with biological process GO terms and KEGG pathways. PMID:25221624
Organization of the hao gene cluster of Nitrosomonas europaea: genes for two tetraheme c cytochromes.

PubMed

Bergmann, D J; Arciero, D M; Hooper, A B

1994-06-01

The organization of genes for three proteins involved in ammonia oxidation in Nitrosomonas europaea has been investigated. The amino acid sequence of the N-terminal region and four heme-containing peptides produced by proteolysis of the tetraheme cytochrome c554 of N. europaea were determined by Edman degradation. The gene (cycA) encoding this cytochrome is present in three copies per genome (H. McTavish, F. LaQuier, D. Arciero, M. Logan, G. Mundfrom, J.A. Fuchs, and A. B. Hooper, J. Bacteriol. 175:2445-2447, 1993). Three clones, representing at least two copies of cycA, were isolated and sequenced by the dideoxy-chain termination procedure. In both copies, the sequences of 211 amino acids derived from the gene sequence are identical and include all amino acids predicted by the proteolytic peptides. In two copies, the cycA open reading frame (ORF) is followed closely (three bases in one copy) by a second ORF predicted to encode a 28-kDa tetraheme c cytochrome not previously characterized but similar to the nirT gene product of Pseudomonas stutzeri. In one copy of the cycA gene cluster, the second ORF is absent.
Linkage of the Nit1C gene cluster to bacterial cyanide assimilation as a nitrogen source.

PubMed

Jones, Lauren B; Ghosh, Pallab; Lee, Jung-Hyun; Chou, Chia-Ni; Kunz, Daniel A

2018-05-21

A genetic linkage between a conserved gene cluster (Nit1C) and the ability of bacteria to utilize cyanide as the sole nitrogen source was demonstrated for nine different bacterial species. These included three strains whose cyanide nutritional ability has formerly been documented (Pseudomonas fluorescens Pf11764, Pseudomonas putida BCN3 and Klebsiella pneumoniae BCN33), and six not previously known to have this ability [Burkholderia (Paraburkholderia) xenovorans LB400, Paraburkholderia phymatum STM815, Paraburkholderia phytofirmans PsJN, Cupriavidus (Ralstonia) eutropha H16, Gluconacetobacter diazotrophicus PAl 5 and Methylobacterium extorquens AM1]. For all bacteria, growth on or exposure to cyanide led to the induction of the canonical nitrilase (NitC) linked to the gene cluster, and in the case of Pf11764 in particular, transcript levels of cluster genes (nitBCDEFGH) were raised, and a nitC knock-out mutant failed to grow. Further studies demonstrated that the highly conserved nitB gene product was also significantly elevated. Collectively, these findings provide strong evidence for a genetic linkage between Nit1C and bacterial growth on cyanide, supporting use of the term cyanotrophy in describing what may represent a new nutritional paradigm in microbiology. A broader search of Nit1C genes in presently available genomes revealed its presence in 270 different bacteria, all contained within the domain Bacteria, including Gram-positive Firmicutes and Actinobacteria, and Gram-negative Proteobacteria and Cyanobacteria. Absence of the cluster in the Archaea is congruent with events that may have led to the inception of Nit1C occurring coincidentally with the first appearance of cyanogenic species on Earth, dating back 400-500 million years.
A cluster of bacterial genes for anaerobic benzene ring biodegradation

PubMed Central

Egland, Paul G.; Pelletier, Dale A.; Dispensa, Marilyn; Gibson, Jane; Harwood, Caroline S.

1997-01-01

A reductive benzoate pathway is the central conduit for the anaerobic biodegradation of aromatic pollutants and lignin monomers. Benzene ring reduction requires a large input of energy and this metabolic capability has, so far, been reported only in bacteria. To determine the molecular basis for this environmentally important process, we cloned and analyzed genes required for the anaerobic degradation of benzoate and related compounds from the phototrophic bacterium, Rhodopseudomonas palustris. A cluster of 24 genes was identified that includes twelve genes likely to be involved in anaerobic benzoate degradation and additional genes that convert the related compounds 4-hydroxybenzoate and cyclohexanecarboxylate to benzoyl-CoA. Genes encoding benzoyl-CoA reductase, a novel enzyme able to overcome the resonance stability of the aromatic ring, were identified by directed mutagenesis. The gene encoding the ring-cleavage enzyme, 2-ketocyclohexanecarboxyl-CoA hydrolase, was identified by assaying the enzymatic activity of the protein expressed in Escherichia coli. Physiological data and DNA sequence analyses indicate that the benzoate pathway consists of unusual enzymes for ring reduction and cleavage interposed among enzymes homologous to those catalyzing fatty acid degradation. The cloned genes should be useful as probes to identify benzoate degradation genes from other metabolically distinct groups of anaerobic bacteria, such as denitrifying bacteria and sulfate-reducing bacteria. PMID:9177244
A Functional Bikaverin Biosynthesis Gene Cluster in Rare Strains of Botrytis cinerea Is Positively Controlled by VELVET

PubMed Central

Schumacher, Julia; Gautier, Angélique; Morgant, Guillaume; Studt, Lena; Ducrot, Paul-Henri; Le Pêcheur, Pascal; Azeddine, Saad; Fillinger, Sabine; Leroux, Pierre; Tudzynski, Bettina; Viaud, Muriel

2013-01-01

The gene cluster responsible for the biosynthesis of the red polyketidic pigment bikaverin has only been characterized in Fusarium spp. so far. Recently, a highly homologous but incomplete and nonfunctional bikaverin cluster has been found in the genome of the unrelated phytopathogenic fungus Botrytis cinerea. In this study, we provided evidence that rare B. cinerea strains such as 1750 have a complete and functional cluster comprising the six genes orthologous to Fusarium fujikuroi ffbik1-ffbik6 and do produce bikaverin. Phylogenetic analysis confirmed that the whole cluster was acquired from Fusarium through a horizontal gene transfer (HGT). In the bikaverin-nonproducing strain B05.10, the genes encoding bikaverin biosynthesis enzymes are nonfunctional due to deleterious mutations (bcbik2-3) or missing (bcbik1) but interestingly, the genes encoding the regulatory proteins BcBIK4 and BcBIK5 do not harbor deleterious mutations which suggests that they may still be functional. Heterologous complementation of the F. fujikuroi Δffbik4 mutant confirmed that bcbik4 of strain B05.10 is indeed fully functional. Deletion of bcvel1 in the pink strain 1750 resulted in loss of bikaverin and overproduction of melanin indicating that the VELVET protein BcVEL1 regulates the biosynthesis of the two pigments in an opposite manner. Although strain 1750 itself expresses a truncated BcVEL1 protein (100 instead of 575 aa) that is nonfunctional with regard to sclerotia formation, virulence and oxalic acid formation, it is sufficient to regulate pigment biosynthesis (bikaverin and melanin) and fenhexamid HydR2 type of resistance. Finally, a genetic cross between strain 1750 and a bikaverin-nonproducing strain sensitive to fenhexamid revealed that the functional bikaverin cluster is genetically linked to the HydR2 locus. PMID:23308280
Functional characterization of KanP, a methyltransferase from the kanamycin biosynthetic gene cluster of Streptomyces kanamyceticus.

PubMed

Nepal, Keshav Kumar; Yoo, Jin Cheol; Sohng, Jae Kyung

2010-09-20

KanP, a putative methyltransferase, is located in the kanamycin biosynthetic gene cluster of Streptomyces kanamyceticus ATCC12853. Amino acid sequence analysis of KanP revealed the presence of S-adenosyl-L-methionine binding motifs, which are present in other O-methyltransferases. The kanP gene was expressed in Escherichia coli BL21 (DE3) to generate the E. coli KANP recombinant strain. The conversion of external quercetin to methylated quercetin in the culture extract of E. coli KANP proved that KanP functions as an S-adenosyl-L-methionine-dependent methyltransferase. This is the first report concerning the identification of an O-methyltransferase gene from the kanamycin gene cluster. The resistant activity assay and RT-PCR analysis demonstrated the leeway for obtaining methylated kanamycin derivatives from the wild-type strain of kanamycin producer. © 2009 Elsevier GmbH. All rights reserved.
Characterization and detection of a widely distributed gene cluster that predicts anaerobic choline utilization by human gut bacteria.

PubMed

Martínez-del Campo, Ana; Bodea, Smaranda; Hamer, Hilary A; Marks, Jonathan A; Haiser, Henry J; Turnbaugh, Peter J; Balskus, Emily P

2015-04-14

Elucidation of the molecular mechanisms underlying the human gut microbiota's effects on health and disease has been complicated by difficulties in linking metabolic functions associated with the gut community as a whole to individual microorganisms and activities. Anaerobic microbial choline metabolism, a disease-associated metabolic pathway, exemplifies this challenge, as the specific human gut microorganisms responsible for this transformation have not yet been clearly identified. In this study, we established the link between a bacterial gene cluster, the choline utilization (cut) cluster, and anaerobic choline metabolism in human gut isolates by combining transcriptional, biochemical, bioinformatic, and cultivation-based approaches. Quantitative reverse transcription-PCR analysis and in vitro biochemical characterization of two cut gene products linked the entire cluster to growth on choline and supported a model for this pathway. Analyses of sequenced bacterial genomes revealed that the cut cluster is present in many human gut bacteria, is predictive of choline utilization in sequenced isolates, and is widely but discontinuously distributed across multiple bacterial phyla. Given that bacterial phylogeny is a poor marker for choline utilization, we were prompted to develop a degenerate PCR-based method for detecting the key functional gene choline TMA-lyase (cutC) in genomic and metagenomic DNA. Using this tool, we found that new choline-metabolizing gut isolates universally possessed cutC. We also demonstrated that this gene is widespread in stool metagenomic data sets. Overall, this work represents a crucial step toward understanding anaerobic choline metabolism in the human gut microbiota and underscores the importance of examining this microbial community from a function-oriented perspective. Anaerobic choline utilization is a bacterial metabolic activity that occurs in the human gut and is linked to multiple diseases. While bacterial genes responsible for

The human TREM gene cluster at 6p21.1 encodes both activating and inhibitory single IgV domain receptors and includes NKp44.

PubMed

Allcock, Richard J N; Barrow, Alexander D; Forbes, Simon; Beck, Stephan; Trowsdale, John

2003-02-01

We have characterized a cluster of single immunoglobulin variable (IgV) domain receptors centromeric of the major histocompatibility complex (MHC) on human chromosome 6. In addition to triggering receptor expressed on myeloid cells (TREM)-1 and TREM2, the cluster contains NKp44, a triggering receptor whose expression is limited to NK cells. We identified three new related genes and two gene fragments within a cluster of approximately 200 kb. Two of the three new genes lack charged residues in their transmembrane domain tails. Further, one of the genes contains two potential immunotyrosine inhibitory motifs in its cytoplasmic tail, suggesting that it delivers inhibitory signals. The human and mouse TREM clusters appear to have diverged such that there are unique sequences in each species. Finally, each gene in the TREM cluster was expressed in a different range of cell types.
Analysis of genetic association using hierarchical clustering and cluster validation indices.

PubMed

Pagnuco, Inti A; Pastore, Juan I; Abras, Guillermo; Brun, Marcel; Ballarin, Virginia L

2017-10-01

It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, based on some criteria of similarity. This task is usually performed by clustering algorithms, where the genes are clustered into meaningful groups based on their expression values in a set of experiments. In this work, we propose a method to find sets of co-expressed genes, based on cluster validation indices as a measure of similarity for individual gene groups, and a combination of variants of hierarchical clustering to generate the candidate groups. We evaluated its ability to retrieve significant sets on simulated correlated and real genomics data, where the performance is measured based on its detection ability of co-regulated sets against a full search. Additionally, we analyzed the quality of the best ranked groups using an online bioinformatics tool that provides network information for the selected genes. Copyright © 2017 Elsevier Inc. All rights reserved.
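
The general idea in this abstract, generating candidate gene groups with several hierarchical clustering variants and then ranking them with a validation index, can be sketched in Python as follows. The abstract does not name the linkage variants or the indices, so the single and complete linkage, the silhouette-like score, the function names, and the toy expression profiles are assumptions for illustration rather than the authors' method.

```python
from itertools import combinations
from statistics import mean

def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den if den else 0.0

def distance_table(profiles):
    """Distance 1 - correlation for every unordered pair of genes."""
    return {frozenset((g, h)): 1.0 - pearson(profiles[g], profiles[h])
            for g, h in combinations(profiles, 2)}

def candidate_groups(genes, dist, linkage):
    """Agglomerative clustering; every merged cluster becomes a candidate group."""
    clusters = [frozenset([g]) for g in genes]
    candidates = set()
    while len(clusters) > 1:
        a, b = min(
            combinations(clusters, 2),
            key=lambda pair: linkage(dist[frozenset((g, h))]
                                     for g in pair[0] for h in pair[1]),
        )
        merged = a | b
        clusters = [c for c in clusters if c not in (a, b)] + [merged]
        candidates.add(merged)
    return candidates

def validation_index(group, genes, dist):
    """Silhouette-like score (b - a) / max(a, b); higher means tighter and better separated."""
    outside = [g for g in genes if g not in group]
    if len(group) < 2 or not outside:
        return 0.0
    a = mean(dist[frozenset(p)] for p in combinations(group, 2))        # within-group distance
    b = mean(dist[frozenset((g, h))] for g in group for h in outside)   # distance to the rest
    return (b - a) / max(a, b) if max(a, b) else 0.0

# Toy expression profiles (made-up values, five experiments per gene)
profiles = {
    "g1": [1, 2, 3, 4, 5],
    "g2": [2, 4, 6, 8, 10.5],
    "g3": [5, 4, 3, 2, 1],
    "g4": [5.2, 4.1, 2.9, 2.0, 1.1],
    "g5": [1, 3, 2, 3, 1],
}
genes = list(profiles)
dist = distance_table(profiles)
# min -> single linkage, max -> complete linkage; pool candidates from both variants
candidates = candidate_groups(genes, dist, min) | candidate_groups(genes, dist, max)
ranked = sorted(candidates, key=lambda grp: validation_index(grp, genes, dist), reverse=True)
```

Scoring each group on its own, instead of scoring a whole partition, lets a tight pair of co-expressed genes rank highly even when the remaining genes do not cluster well.
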
Organization of nif gene cluster in Frankia sp. EuIK1 strain, a symbiont of Elaeagnus umbellata.

PubMed

Oh, Chang Jae; Kim, Ho Bang; Kim, Jitae; Kim, Won Jin; Lee, Hyoungseok; An, Chung Sun

2012-01-01

The nucleotide sequence of a 20.5-kb genomic region harboring nif genes was determined and analyzed. The fragment was obtained from Frankia sp. EuIK1 strain, an indigenous symbiont of Elaeagnus umbellata. A total of 20 ORFs including 12 nif genes were identified and subjected to comparative analysis with the genome sequences of 3 Frankia strains representing diverse host plant specificities. The nucleotide and deduced amino acid sequences showed highest levels of identity with orthologous genes from an Elaeagnus-infecting strain. The gene organization patterns around the nif gene clusters were well conserved among all 4 Frankia strains. However, characteristic features appeared in the location of the nifV gene for each Frankia strain, depending on the type of host plant. Sequence analysis was performed to determine the transcription units and suggested that there could be an independent operon starting from the nifW gene in the EuIK1 strain. Considering the organization patterns and their total extensions on the genome, we propose that the nif gene clusters remained stable despite genetic variations occurring in the Frankia genomes.
Lactose-Inducible System for Metabolic Engineering of Clostridium ljungdahlii

DOE Office of Scientific and Technical Information (OSTI.GOV)

Banerjee, A; Leang, C; Ueki, T

2014-03-25

The development of tools for genetic manipulation of Clostridium ljungdahlii has increased its attractiveness as a chassis for autotrophic production of organic commodities and biofuels from syngas and microbial electrosynthesis and established it as a model organism for the study of the basic physiology of acetogenesis. In an attempt to expand the genetic toolbox for C. ljungdahlii, the possibility of adapting a lactose-inducible system for gene expression, previously reported for Clostridium perfringens, was investigated. The plasmid pAH2, originally developed for C. perfringens with a gusA reporter gene, functioned as an effective lactose-inducible system in C. ljungdahlii. Lactose induction of C. ljungdahlii containing pB1, in which the gene for the aldehyde/alcohol dehydrogenase AdhE1 was downstream of the lactose-inducible promoter, increased expression of adhE1 30-fold over the wild-type level, increasing ethanol production 1.5-fold, with a corresponding decrease in acetate production. Lactose-inducible expression of adhE1 in a strain in which adhE1 and the adhE1 homolog adhE2 had been deleted from the chromosome restored ethanol production to levels comparable to those in the wild-type strain. Inducing expression of adhE2 similarly failed to restore ethanol production, suggesting that adhE1 is the homolog responsible for ethanol production. Lactose-inducible expression of the four heterologous genes necessary to convert acetyl coenzyme A (acetyl-CoA) to acetone diverted ca. 60% of carbon flow to acetone production during growth on fructose, and 25% of carbon flow went to acetone when carbon monoxide was the electron donor. These studies demonstrate that the lactose-inducible system described here will be useful for redirecting carbon and electron flow for the biosynthesis of products more valuable than acetate. Furthermore, this tool should aid in optimizing microbial electrosynthesis and for basic studies on the physiology of acetogenesis.
eMBI: Boosting Gene Expression-based Clustering for Cancer Subtypes.

PubMed

Chang, Zheng; Wang, Zhenjia; Ashby, Cody; Zhou, Chuan; Li, Guojun; Zhang, Shuzhong; Huang, Xiuzhen

2014-01-01

Identifying clinically relevant subtypes of a cancer using gene expression data is a challenging and important problem in medicine, and is a necessary premise to provide specific and efficient treatments for patients of different subtypes. Matrix factorization provides a solution by finding checkerboard patterns in the matrices of gene expression data. In the context of gene expression profiles of cancer patients, these checkerboard patterns correspond to genes that are up- or down-regulated in patients with particular cancer subtypes. Recently, a new matrix factorization framework for biclustering called Maximum Block Improvement (MBI) was proposed; however, it still suffers from several problems when applied to cancer gene expression data analysis. In this study, we developed several strategies to improve MBI and designed a new program called enhanced MBI (eMBI), which is more effective and efficient at identifying cancer subtypes. Our tests on several gene expression profiling datasets of cancer patients consistently indicate that eMBI achieves significant improvements in comparison with MBI, in terms of cancer subtype prediction accuracy, robustness, and running time. In addition, the performance of eMBI is much better than that of another widely used matrix factorization method, nonnegative matrix factorization (NMF), and that of hierarchical clustering, which is often the first choice of clinical analysts in practice.
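The abstract above names nonnegative matrix factorization (NMF) as a baseline for finding block patterns in a genes-by-patients expression matrix. The following is a minimal sketch of that baseline only, not of MBI or eMBI, whose details are not given here. It assumes the standard multiplicative-update rule for the squared-error objective; the toy matrix `v`, the function names, and the seeded random initialization are illustrative choices rather than anything taken from the record.

```python
import random

def matmul(a, b):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def frobenius_error(v, w, h):
    """Squared Frobenius norm of v - w @ h."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))

def nmf(v, rank, iterations=200, seed=0, eps=1e-9):
    """Approximate a nonnegative matrix v as w @ h with nonnegative factors.

    Uses multiplicative updates for the squared-error objective. The seed
    makes the (random) starting point reproducible.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.uniform(0.1, 1.0) for _ in range(rank)] for _ in range(n)]
    h = [[rng.uniform(0.1, 1.0) for _ in range(m)] for _ in range(rank)]
    for _ in range(iterations):
        # Update h: h <- h * (w^T v) / (w^T w h)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(m)]
             for k in range(rank)]
        # Update w: w <- w * (v h^T) / (w h h^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(n)]
    return w, h

# Toy "checkerboard": genes 0-1 are high in patients 0-1, genes 2-3 in patients 2-3.
v = [[5.0, 5.0, 0.0, 0.0],
     [5.0, 5.0, 0.0, 0.0],
     [0.0, 0.0, 3.0, 3.0],
     [0.0, 0.0, 3.0, 3.0]]
w0, h0 = nmf(v, rank=2, iterations=0)   # starting factors only
w, h = nmf(v, rank=2)                   # factors after 200 updates
```

In such a sketch, each column of `h` could be read as a patient's loading on the two components, and assigning a patient to the component with the larger loading would give a crude subtype label. Whether that matches how the authors ran NMF in their comparison is not stated in the abstract.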
eMBI: Boosting Gene Expression-based Clustering for Cancer Subtypes

PubMed Central

Chang, Zheng; Wang, Zhenjia; Ashby, Cody; Zhou, Chuan; Li, Guojun; Zhang, Shuzhong; Huang, Xiuzhen

2014-01-01

Identifying clinically relevant subtypes of a cancer using gene expression data is a challenging and important problem in medicine, and is a necessary premise to provide specific and efficient treatments for patients of different subtypes. Matrix factorization provides a solution by finding checkerboard patterns in the matrices of gene expression data. In the context of gene expression profiles of cancer patients, these checkerboard patterns correspond to genes that are up- or down-regulated in patients with particular cancer subtypes. Recently, a new matrix factorization framework for biclustering called Maximum Block Improvement (MBI) was proposed; however, it still suffers from several problems when applied to cancer gene expression data analysis. In this study, we developed several strategies to improve MBI and designed a new program called enhanced MBI (eMBI), which is more effective and efficient at identifying cancer subtypes. Our tests on several gene expression profiling datasets of cancer patients consistently indicate that eMBI achieves significant improvements in comparison with MBI, in terms of cancer subtype prediction accuracy, robustness, and running time. In addition, the performance of eMBI is much better than that of another widely used matrix factorization method, nonnegative matrix factorization (NMF), and that of hierarchical clustering, which is often the first choice of clinical analysts in practice. PMID:25374455
Wide Distribution of Foxicin Biosynthetic Gene Clusters in Streptomyces Strains – An Unusual Secondary Metabolite with Various Properties

PubMed Central

Greule, Anja; Marolt, Marija; Deubel, Denise; Peintner, Iris; Zhang, Songya; Jessen-Trefzer, Claudia; De Ford, Christian; Burschel, Sabrina; Li, Shu-Ming; Friedrich, Thorsten; Merfort, Irmgard; Lüdeke, Steffen; Bisel, Philippe; Müller, Michael; Paululat, Thomas; Bechthold, Andreas

2017-01-01

Streptomyces diastatochromogenes Tü6028 is known to produce the polyketide antibiotic polyketomycin. The deletion of the pokOIV oxygenase gene led to a non-polyketomycin-producing mutant. Instead, novel compounds were produced by the mutant, which have not been detected before in the wild type strain. Four different compounds were identified and named foxicins A–D. Foxicin A was isolated and its structure was elucidated as an unusual nitrogen-containing quinone derivative using various spectroscopic methods. Through genome mining, the foxicin biosynthetic gene cluster was identified in the draft genome sequence of S. diastatochromogenes. The cluster spans 57 kb and encodes three PKS type I modules, one NRPS module and 41 additional enzymes. A foxBII gene-inactivated mutant of S. diastatochromogenes Tü6028 ΔpokOIV is unable to produce foxicins. Homologous fox biosynthetic gene clusters were found in more than 20 additional Streptomyces strains, overall in about 2.6% of all sequenced Streptomyces genomes. However, the production of foxicin-like compounds in these strains has never been described indicating that the clusters are expressed at a very low level or are silent under fermentation conditions. Foxicin A acts as a siderophore through interacting with ferric ions. Furthermore, it is a weak inhibitor of the Escherichia coli aerobic respiratory chain and shows moderate antibiotic activity. The wide distribution of the cluster and the various properties of the compound indicate a major role of foxicins in Streptomyces strains. PMID:28270798
Identification of Pseudomonas mosselii BS011 gene clusters required for suppression of Rice Blast Fungus Magnaporthe oryzae.

PubMed

Wu, Lijuan; Xiao, Wei; Chen, Guoqing; Song, Dawei; Khaskheli, Maqsood Ahmed; Li, Pei; Zhang, Shiying; Feng, Guozhong

2018-04-25

Pseudomonas is a genus of Gram-negative, rod-shaped bacteria. Many members of this genus display remarkable physiological and metabolic activity against different plant pathogens. However, Pseudomonas mosselii has not yet been characterized as a biocontrol agent against plant disease. Here we isolated P. mosselii strain BS011 from the rhizosphere soil of rice plants, and the isolate showed strong inhibitory activity against the rice blast fungus Magnaporthe oryzae. We further sequenced the complete genome of BS011, which consists of 5.75 Mb with a circular chromosome, 5,170 protein-coding genes, 23 rRNA and 78 tRNA operons. Bioinformatic analysis revealed that seven gene clusters may be involved in the biosynthesis of metabolites. Gene deletion experiments demonstrated that the gene cluster c-xtl is required for inhibitory activity against M. oryzae. Bioassays showed that the crude extract from a BS011 fermentation sample significantly inhibited the development of M. oryzae at a concentration of 10 μg/ml. In addition, we showed that the crude extract of BS011 impaired appressorial formation in a dose-dependent manner. Collectively, our results reveal that P. mosselii BS011 is a promising biocontrol agent and that the gene cluster c-xtl is essential for inhibiting the development of M. oryzae. Copyright © 2018. Published by Elsevier B.V.
Genetic diversity of K-antigen gene clusters of Escherichia coli and their molecular typing using a suspension array.

PubMed

Yang, Shuang; Xi, Daoyi; Jing, Fuyi; Kong, Deju; Wu, Junli; Feng, Lu; Cao, Boyang; Wang, Lei

2018-04-01

Capsular polysaccharides (CPSs), or K-antigens, are the major surface antigens of Escherichia coli. More than 80 serologically unique K-antigens are classified into 4 groups (Groups 1-4) of capsules. Groups 1 and 4 contain the Wzy-dependent polymerization pathway and the gene clusters are in the order galF to gnd; Groups 2 and 3 contain the ABC-transporter-dependent pathway and the gene clusters consist of 3 regions, regions 1, 2 and 3. Little is known about the variations among the gene clusters. In this study, 9 serotypes of K-antigen gene clusters (K2ab, K11, K20, K24, K38, K84, K92, K96, and K102) were sequenced and correlated with their CPS chemical structures. On the basis of sequence data, a K-antigen-specific suspension array that detects 10 distinct CPSs, including the above 9 CPSs plus K30, was developed. This is the first report to catalog the genetic features of E. coli K-antigen variations and to develop a suspension array for their molecular typing. The method has a number of advantages over traditional bacteriophage and serum agglutination methods and lays the foundation for straightforward identification and detection of additional K-antigens in the future.
Molecular evolution and functional divergence of alcohol dehydrogenases in animals, fungi and plants.

PubMed

Thompson, Claudia E; Freitas, Loreta B; Salzano, Francisco M

2018-01-01

Alcohol dehydrogenases belong to the large superfamily of medium-chain dehydrogenases/reductases, which occur throughout the biological world and are involved with many important metabolic routes. We considered the phylogeny of 190 ADH sequences of animals, fungi, and plants. Non-class III Caenorhabditis elegans ADHs were seen closely related to tetrameric fungal ADHs. ADH3 forms a sister group to amphibian, reptilian, avian and mammalian non-class III ADHs. In fishes, two main forms are identified: ADH1 and ADH3, whereas in amphibians there is a new ADH form (ADH8). ADH2 is found in Mammalia and Aves, and they formed a monophyletic group. Additionally, mammalian ADH4 seems to result from an ADH1 duplication, while in Fungi, ADH formed clusters based on types and genera. The plant ADH isoforms constitute a basal clade in relation to ADHs from animals. We identified amino acid residues responsible for functional divergence between ADH types in fungi, mammals, and fishes. In mammals, these differences occur mainly between ADH1/ADH4 and ADH3/ADH5, whereas functional divergence occurred in fungi between ADH1/ADH5, ADH5/ADH4, and ADH5/ADH3. In fishes, the forms also seem to be functionally divergent. The ADH family expansion exemplifies a neofunctionalization process where reiterative duplication events are related to new activities.
An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.

PubMed

Nidheesh, N; Abdul Nazeer, K A; Ameer, P M

2017-12-01

Clustering algorithms with steps involving randomness usually give different results on different executions for the same dataset. This non-deterministic nature of algorithms such as the K-Means clustering algorithm limits their applicability in areas such as cancer subtype prediction using gene expression data. It is hard to sensibly compare the results of such algorithms with those of other algorithms. The non-deterministic nature of K-Means is due to its random selection of data points as initial centroids. We propose an improved, density based version of K-Means, which involves a novel and systematic method for selecting initial centroids. The key idea of the algorithm is to select data points which belong to dense regions and which are adequately separated in feature space as the initial centroids. We compared the proposed algorithm to a set of eleven widely used single clustering algorithms and a prominent ensemble clustering algorithm which is being used for cancer data classification, based on the performances on a set of datasets comprising ten cancer gene expression datasets. The proposed algorithm has shown better overall performance than the others. There is a pressing need in the Biomedical domain for simple, easy-to-use and more accurate Machine Learning tools for cancer subtype prediction. The proposed algorithm is simple, easy-to-use and gives stable results. Moreover, it provides comparatively better predictions of cancer subtypes from gene expression data. Copyright © 2017 Elsevier Ltd. All rights reserved.
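The key idea stated above, choosing initial centroids from dense regions that are adequately separated, can be illustrated with a small sketch. This is not the authors' algorithm: the density measure (neighbour count within a fixed `radius`), the separation rule (more than `radius` apart), and the function name are all assumptions made for illustration.

```python
import math

def density_based_centroids(points, k, radius):
    """Pick up to k initial centroids deterministically (illustrative heuristic).

    Density of a point = number of points within `radius` of it (itself included).
    Candidates are visited from densest to sparsest; a candidate is accepted
    only if it lies more than `radius` away from every centroid chosen so far.
    Fewer than k centroids are returned if not enough separated points exist.
    """
    density = [sum(1 for q in points if math.dist(p, q) <= radius) for p in points]
    # Sort by density (descending); ties are broken by index, so there is no randomness.
    order = sorted(range(len(points)), key=lambda i: (-density[i], i))
    chosen = []
    for i in order:
        if all(math.dist(points[i], points[j]) > radius for j in chosen):
            chosen.append(i)
        if len(chosen) == k:
            break
    return [points[i] for i in chosen]

# Two tight groups plus one isolated point.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
          (5.0, 5.0), (5.1, 5.0), (5.0, 5.1),
          (9.0, 0.0)]
centroids = density_based_centroids(points, k=2, radius=0.5)
print(centroids)  # [(0.0, 0.0), (5.0, 5.0)]
```

Because nothing in the selection is random, repeated runs on the same data give the same starting centroids, which is the property the abstract argues makes results comparable across algorithms. The centroids would then be passed to an ordinary K-Means loop.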
Identification and functional analysis of the aspergillic acid gene cluster in Aspergillus flavus

USDA-ARS's Scientific Manuscript database

Aspergillus flavus can colonize important food staples and produces aflatoxins, toxic and carcinogenic secondary metabolites. In silico analysis of the A. flavus genome revealed 56 gene clusters encoding for secondary metabolites. How many of these metabolites affect fungal development, surviv...
Architectural roles of multiple chromatin insulators at the human apolipoprotein gene cluster

PubMed Central

Mishiro, Tsuyoshi; Ishihara, Ko; Hino, Shinjiro; Tsutsumi, Shuichi; Aburatani, Hiroyuki; Shirahige, Katsuhiko; Kinoshita, Yoshikazu; Nakao, Mitsuyoshi

2009-01-01

Long-range regulatory elements and higher-order chromatin structure coordinate the expression of multiple genes in cluster, and CTCF/cohesin-mediated chromatin insulator may be a key in this regulation. The human apolipoprotein (APO) A1/C3/A4/A5 gene region, whose alterations increase the risk of dyslipidemia and atherosclerosis, is partitioned at least by three CTCF-enriched sites and three cohesin protein RAD21-enriched sites (two overlap with the CTCF sites), resulting in the formation of two transcribed chromatin loops by interactions between insulators. The C3 enhancer and APOC3/A4/A5 promoters reside in the same loop, where the APOC3/A4 promoters are pointed towards the C3 enhancer, whereas the APOA1 promoter is present in the different loop. The depletion of either CTCF or RAD21 disrupts the chromatin loop structure, together with significant changes in the APO expression and the localization of transcription factor hepatocyte nuclear factor (HNF)-4α and transcriptionally active form of RNA polymerase II at the APO promoters. Thus, CTCF/cohesin-mediated insulators maintain the chromatin loop formation and the localization of transcriptional apparatus at the promoters, suggesting an essential role of chromatin insulation in controlling the expression of clustered genes. PMID:19322193
Rapid Detection of Positive Selection in Genes and Genomes Through Variation Clusters

PubMed Central

Wagner, Andreas

2007-01-01

Positive selection in genes and genomes can point to the evolutionary basis for differences among species and among races within a species. The detection of positive selection can also help identify functionally important protein regions and thus guide protein engineering. Many existing tests for positive selection are excessively conservative, vulnerable to artifacts caused by demographic population history, or computationally very intensive. I here propose a simple and rapid test that is complementary to existing tests and that overcomes some of these problems. It relies on the null hypothesis that neutrally evolving DNA regions should show a Poisson distribution of nucleotide substitutions. The test detects significant deviations from this expectation in the form of variation clusters, highly localized groups of amino acid changes in a coding region. In applying this test to several thousand human–chimpanzee gene orthologs, I show that such variation clusters are not generally caused by relaxed selection. They occur in well-defined domains of a protein's tertiary structure and show a large excess of amino acid replacement over silent substitutions. I also identify multiple new human–chimpanzee orthologs subject to positive selection, among them genes that are involved in reproductive functions, immune defense, and the nervous system. PMID:17603100
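The null hypothesis described above, that substitutions in a neutrally evolving region follow a Poisson distribution, suggests a simple window-based check. The sketch below is an illustration under stated assumptions, not the published test: it assumes a uniform per-site rate estimated from the whole region, a fixed window width, and it reports an uncorrected tail probability for the densest window. The function names and example positions are hypothetical.

```python
import math

def poisson_tail(c, mu):
    """P(X >= c) for X ~ Poisson(mu)."""
    cdf = sum(math.exp(-mu) * mu ** i / math.factorial(i) for i in range(c))
    return 1.0 - cdf

def densest_window(positions, length, window):
    """Return (start, count, p_value) for the window holding the most substitutions.

    positions: 0-based sites that carry a substitution
    length:    total number of sites in the region
    window:    window width in sites
    The p-value is the Poisson tail for that one window and is NOT corrected
    for having scanned many windows.
    """
    rate = len(positions) / length   # substitutions per site if spread uniformly
    mu = rate * window               # expected substitutions per window
    best_start, best_count = 0, 0
    for start in range(0, length - window + 1):
        count = sum(1 for p in positions if start <= p < start + window)
        if count > best_count:       # strict '>' keeps the earliest best window
            best_start, best_count = start, count
    return best_start, best_count, poisson_tail(best_count, mu)

# Seven substitutions in a 300-site region, five of them packed near site 100.
positions = [12, 100, 101, 103, 104, 107, 250]
start, count, p = densest_window(positions, length=300, window=10)
```

A small tail probability for the densest window would flag a candidate variation cluster. A real analysis would also need a multiple-testing correction and, as the abstract notes, a comparison of amino acid replacements against silent substitutions.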
Electrical characteristics of Graphene based Field Effect Transistor (GFET) biosensor for ADH detection

NASA Astrophysics Data System (ADS)

Selvarajan, Reena Sri; Hamzah, Azrul Azlan; Majlis, Burhanuddin Yeop

2017-08-01

Pristine graphene was first produced by mechanical exfoliation and electrically characterized in 2004 by Andre Geim and Konstantin Novoselov at the University of Manchester. Since then, graphene, also known as a `super' material, has enticed many researchers and engineers to explore its potential for ultrasensitive detection of analytes in biosensing applications. Among the myriad reported sensors, biosensors based on field effect transistors (FETs) have attracted much attention. Implementing graphene as the conducting channel material thus opens opportunities for producing ultrasensitive biosensors for future device applications. Herein, we report the electrical characteristics of a graphene based field effect transistor (GFET) for ADH detection. The GFET was modelled and simulated using the Lumerical DEVICE charge transport solver (DEVICE CT). Electrical characteristics comprising transfer and output characteristic curves are reported in this study. The device shows an ambipolar curve and achieved a minimum conductivity of 0.23912 e5A at the Dirac point. However, the curve shifts to the left and the minimum conductivity changes significantly as the drain voltage is increased. The output characteristics of the GFET exhibit a linear Id - Vd dependence for gate voltages ranging from 0 to 1.5 V. In addition, the behavior of electrical transport through the GFET was analyzed for various simulation temperatures. The results show that electrical transport in the GFET depends on the simulation temperature, as it may vary the maximum resistance in the channel of the device. Therefore, these unique electrical characteristics make the GFET a promising candidate for ultrasensitive detection of small biomolecules such as ADH in biosensing applications.
The Cremeomycin Biosynthetic Gene Cluster Encodes a Pathway for Diazo Formation.

PubMed

Waldman, Abraham J; Pechersky, Yakov; Wang, Peng; Wang, Jennifer X; Balskus, Emily P

2015-10-12

Diazo groups are found in a range of natural products that possess potent biological activities. Despite longstanding interest in these metabolites, diazo group biosynthesis is not well understood, in part because of difficulties in identifying specific genes linked to diazo formation. Here we describe the discovery of the gene cluster that produces the o-diazoquinone natural product cremeomycin and its heterologous expression in Streptomyces lividans. We used stable isotope feeding experiments and in vitro characterization of biosynthetic enzymes to decipher the order of events in this pathway and establish that diazo construction involves late-stage N-N bond formation. This work represents the first successful production of a diazo-containing metabolite in a heterologous host, experimentally linking a set of genes with diazo formation. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Molecular and physiological aspects of alcohol dehydrogenases in the ethanol metabolism of Saccharomyces cerevisiae.

PubMed

de Smidt, Olga; du Preez, James C; Albertyn, Jacobus

2012-02-01

The physiological role and possible functional substitution of each of the five alcohol dehydrogenase (Adh) isozymes in Saccharomyces cerevisiae were investigated in five quadruple deletion mutants designated strains Q1-Q5, with the number indicating the sole intact ADH gene. Their growth in aerobic batch cultures was characterised in terms of kinetic and stoichiometric parameters. Cultivation with glucose or ethanol as carbon substrate revealed that Adh1 was the only alcohol dehydrogenase capable of efficiently catalysing the reduction of acetaldehyde to ethanol. The oxidation of produced or added ethanol could also be attributed to Adh1. Growth of strains lacking the ADH1 gene resulted in the production of glycerol as a major fermentation product, concomitant with the production of a significant amount of acetaldehyde. Strains Q2 and Q3, expressing only ADH2 or ADH3, respectively, produced ethanol from glucose, albeit less than strain Q1, and were also able to oxidise added ethanol. Strains Q4 and Q5 grew poorly on glucose and produced ethanol, but were neither able to utilise the produced ethanol nor grow on added ethanol. Transcription profiles of the ADH4 and ADH5 genes suggested that participation of these gene products in ethanol production from glucose was unlikely. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Discovery of Gene Cluster for Mycosporine-Like Amino Acid Biosynthesis from Actinomycetales Microorganisms and Production of a Novel Mycosporine-Like Amino Acid by Heterologous Expression

PubMed Central

Miyamoto, Kiyoko T.; Komatsu, Mamoru

2014-01-01

Mycosporines and mycosporine-like amino acids (MAAs), including shinorine (mycosporine-glycine-serine) and porphyra-334 (mycosporine-glycine-threonine), are UV-absorbing compounds produced by cyanobacteria, fungi, and marine micro- and macroalgae. These MAAs have the ability to protect these organisms from damage by environmental UV radiation. Although no reports have described the production of MAAs and the corresponding genes involved in MAA biosynthesis from Gram-positive bacteria to date, genome mining of the Gram-positive bacterial database revealed that two microorganisms belonging to the order Actinomycetales, Actinosynnema mirum DSM 43827 and Pseudonocardia sp. strain P1, possess a gene cluster homologous to the biosynthetic gene clusters identified from cyanobacteria. When the two strains were grown in liquid culture, Pseudonocardia sp. accumulated a very small amount of MAA-like compound in a medium-dependent manner, whereas A. mirum did not produce MAAs under any culture conditions, indicating that the biosynthetic gene cluster of A. mirum was in a cryptic state in this microorganism. In order to characterize these biosynthetic gene clusters, each biosynthetic gene cluster was heterologously expressed in an engineered host, Streptomyces avermitilis SUKA22. Since the resultant transformants carrying the entire biosynthetic gene cluster controlled by an alternative promoter produced mainly shinorine, this is the first confirmation of a biosynthetic gene cluster for MAA from Gram-positive bacteria. Furthermore, S. avermitilis SUKA22 transformants carrying the biosynthetic gene cluster for MAA of A. mirum accumulated not only shinorine and porphyra-334 but also a novel MAA. Structure elucidation revealed that the novel MAA is mycosporine-glycine-alanine, which substitutes l-alanine for the l-serine of shinorine. PMID:24907338
Discovery of gene cluster for mycosporine-like amino acid biosynthesis from Actinomycetales microorganisms and production of a novel mycosporine-like amino acid by heterologous expression.

PubMed

Miyamoto, Kiyoko T; Komatsu, Mamoru; Ikeda, Haruo

2014-08-01

Mycosporines and mycosporine-like amino acids (MAAs), including shinorine (mycosporine-glycine-serine) and porphyra-334 (mycosporine-glycine-threonine), are UV-absorbing compounds produced by cyanobacteria, fungi, and marine micro- and macroalgae. These MAAs have the ability to protect these organisms from damage by environmental UV radiation. Although no reports have described the production of MAAs and the corresponding genes involved in MAA biosynthesis from Gram-positive bacteria to date, genome mining of the Gram-positive bacterial database revealed that two microorganisms belonging to the order Actinomycetales, Actinosynnema mirum DSM 43827 and Pseudonocardia sp. strain P1, possess a gene cluster homologous to the biosynthetic gene clusters identified from cyanobacteria. When the two strains were grown in liquid culture, Pseudonocardia sp. accumulated a very small amount of MAA-like compound in a medium-dependent manner, whereas A. mirum did not produce MAAs under any culture conditions, indicating that the biosynthetic gene cluster of A. mirum was in a cryptic state in this microorganism. In order to characterize these biosynthetic gene clusters, each biosynthetic gene cluster was heterologously expressed in an engineered host, Streptomyces avermitilis SUKA22. Since the resultant transformants carrying the entire biosynthetic gene cluster controlled by an alternative promoter produced mainly shinorine, this is the first confirmation of a biosynthetic gene cluster for MAA from Gram-positive bacteria. Furthermore, S. avermitilis SUKA22 transformants carrying the biosynthetic gene cluster for MAA of A. mirum accumulated not only shinorine and porphyra-334 but also a novel MAA. Structure elucidation revealed that the novel MAA is mycosporine-glycine-alanine, which substitutes l-alanine for the l-serine of shinorine. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Aromatic Polyketide GTRI-02 is a Previously Unidentified Product of the act Gene Cluster in Streptomyces coelicolor A3(2).

PubMed

Wu, Changsheng; Ichinose, Koji; Choi, Young Hae; van Wezel, Gilles P

2017-07-18

The biosynthesis of aromatic polyketides derived from type II polyketide synthases (PKSs) is complex, and it is not uncommon that highly similar gene clusters give rise to diverse structural architectures. The act biosynthetic gene cluster (BGC) of the model actinomycete Streptomyces coelicolor A3(2) is an archetypal type II PKS. Here we show that the act BGC also specifies the aromatic polyketide GTRI-02 (1) and propose a mechanism for the biogenesis of its 3,4-dihydronaphthalen-1(2H)-one backbone. Polyketide 1 was also produced by Streptomyces sp. MBT76 after activation of the act-like qin gene cluster by overexpression of the pathway-specific activator. Mining of this strain also identified dehydroxy-GTRI-02 (2), which most likely originated from dehydration of 1 during the isolation process. This work shows that even extensively studied model gene clusters such as act of S. coelicolor can still produce new chemistry, offering new perspectives for drug discovery. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

Development of a gene cloning system in a fast-growing and moderately thermophilic Streptomyces species and heterologous expression of Streptomyces antibiotic biosynthetic gene clusters

PubMed Central

2011-01-01

Background: Streptomyces species are a major source of antibiotics. They usually grow slowly at their optimal temperature and fermentation of industrial strains on a large scale often takes a long time, consuming more energy and materials than some other bacterial industrial strains (e.g., E. coli and Bacillus). Most thermophilic Streptomyces species grow fast, but no gene cloning systems have been developed in such strains. Results: We report here the isolation of 41 fast-growing (about twice the rate of S. coelicolor), moderately thermophilic (growing at both 30°C and 50°C) Streptomyces strains, detection of one linear and three circular plasmids in them, and sequencing of a 6996-bp plasmid, pTSC1, from one of them. pTSC1-derived pCWH1 could replicate in both thermophilic and mesophilic Streptomyces strains. On the other hand, several Streptomyces replicons function in thermophilic Streptomyces species. By examining ten well-sporulating strains, we found two promising cloning hosts, 2C and 4F. A gene cloning system was established by using the two strains. The actinorhodin and anthramycin biosynthetic gene clusters from mesophilic S. coelicolor A3(2) and thermophilic S. refuineus were heterologously expressed in one of the hosts. Conclusions: We have developed a gene cloning and expression system in a fast-growing and moderately thermophilic Streptomyces species. Although just a few plasmids and one antibiotic biosynthetic gene cluster from mesophilic Streptomyces were successfully expressed in thermophilic Streptomyces species, we expect that by utilizing thermophilic Streptomyces-specific promoters, more genes and especially antibiotic gene clusters of mesophilic Streptomyces should be heterologously expressed. PMID:22032628
IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes

DOE PAGES

Hadjithomas, Michalis; Chen, I-Min A.; Chu, Ken; ...

2016-11-29

Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publicly available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.
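Searching for clusters "with similar Pfam composition" can be pictured as comparing sets of protein-family identifiers. The sketch below assumes Jaccard similarity over Pfam accession sets. The abstract does not say which measure IMG-ABC uses, so this is only one plausible reading, and the cluster names and domain lists are made-up placeholders rather than database records.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def rank_by_pfam_similarity(query, clusters):
    """Return (name, similarity) pairs sorted from most to least similar to `query`."""
    scored = [(name, jaccard(query, pfams)) for name, pfams in clusters.items()]
    # Sort by similarity (descending), then by name so ties are ordered reproducibly.
    return sorted(scored, key=lambda item: (-item[1], item[0]))

# Hypothetical clusters described by Pfam-style accession strings (placeholders).
query = {"PF00109", "PF02801", "PF00698", "PF00550"}
clusters = {
    "cluster_b": {"PF00109", "PF02801", "PF00550", "PF00501"},
    "cluster_c": {"PF00501", "PF00668"},
}
ranking = rank_by_pfam_similarity(query, clusters)
print(ranking)  # [('cluster_b', 0.6), ('cluster_c', 0.0)]
```

Pairwise scores of this kind could also serve as edge weights in a similarity network, although how the BC similarity network graph mentioned in the abstract is actually built is not described there.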
IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hadjithomas, Michalis; Chen, I-Min A.; Chu, Ken

Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publicly available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.
The powdery mildew resistance gene REN1 co-segregates with an NBS-LRR gene cluster in two Central Asian grapevines

PubMed Central

2009-01-01

Background: Grape powdery mildew is caused by the North American native pathogen Erysiphe necator. Eurasian Vitis vinifera varieties were all believed to be susceptible. REN1 is the first resistance gene naturally found in cultivated plants of Vitis vinifera. Results: REN1 is present in 'Kishmish vatkana' and 'Dzhandzhal kara', two grapevines documented in Central Asia since the 1920's. These cultivars have a second-degree relationship (half sibs, grandparent-grandchild, or avuncular), and share by descent the chromosome on which the resistance allele REN1 is located. The REN1 interval was restricted to 1.4 cM using 38 SSR markers distributed across the locus and the segregation of the resistance phenotype in two progenies of collectively 461 offspring, derived from either resistant parent. The boundary markers delimit a 1.4-Mbp sequence in the PN40024 reference genome, which contains 27 genes with known functions, 2 full-length coiled-coil NBS-LRR genes, and 9 NBS-LRR pseudogenes. In the REN1 locus of PN40024, NBS genes have proliferated through a mixture of segmental duplications, tandem gene duplications, and intragenic recombination between paralogues, indicating that the REN1 locus has been inherently prone to producing genetic variation. Three SSR markers co-segregate with REN1, the outer ones confining the 908-kb array of NBS-LRR genes. Kinship and clustering analyses based on genetic distances with susceptible cultivars representative of Central Asian Vitis vinifera indicated that 'Kishmish vatkana' and 'Dzhandzhal kara' fit well into local germplasm. 'Kishmish vatkana' also has a parent-offspring relationship with the seedless table grape 'Sultanina'. In addition, the distant genetic relatedness to rootstocks, some of which are derived from North American species resistant to powdery mildew and have been used worldwide to guard against phylloxera since the late 1800's, argues against REN1 being infused into Vitis vinifera from a recent interspecific
Cloning, sequencing, and expression of the Zymomonas mobilis phosphoglycerate mutase gene (pgm) in Escherichia coli.

PubMed Central

Yomano, L P; Scopes, R K; Ingram, L O

1993-01-01

Phosphoglycerate mutase is an essential glycolytic enzyme for Zymomonas mobilis, catalyzing the reversible interconversion of 3-phosphoglycerate and 2-phosphoglycerate. The pgm gene encoding this enzyme was cloned on a 5.2-kbp DNA fragment and expressed in Escherichia coli. Recombinants were identified by using antibodies directed against purified Z. mobilis phosphoglycerate mutase. The pgm gene contains a canonical ribosome-binding site, a biased pattern of codon usage, a long upstream untranslated region, and four promoters which share sequence homology. Interestingly, adhA and a D-specific 2-hydroxyacid dehydrogenase were found on the same DNA fragment and appear to form a cluster of genes which function in central metabolism. The translated sequence for Z. mobilis pgm was in full agreement with the 40 N-terminal amino acid residues determined by protein sequencing. The primary structure of the translated sequence is highly conserved (52 to 60% identity with other phosphoglycerate mutases) and also shares extensive homology with bisphosphoglycerate mutases (51 to 59% identity). Since Southern blots indicated the presence of only a single copy of pgm in the Z. mobilis chromosome, it is likely that the cloned pgm gene functions to provide both activities. Z. mobilis phosphoglycerate mutase is unusual in that it lacks the flexible tail and lysines at the carboxy terminus which are present in the enzyme isolated from all other organisms examined. PMID:8320209
Glycosulfatase-Encoding Gene Cluster in Bifidobacterium breve UCC2003.

PubMed

Egan, Muireann; Jiang, Hao; O'Connell Motherway, Mary; Oscarson, Stefan; van Sinderen, Douwe

2016-11-15

Bifidobacteria constitute a specific group of commensal bacteria typically found in the gastrointestinal tract (GIT) of humans and other mammals. Bifidobacterium breve strains are numerically prevalent among the gut microbiota of many healthy breastfed infants. In the present study, we investigated glycosulfatase activity in a bacterial isolate from a nursling stool sample, B. breve UCC2003. Two putative sulfatases were identified on the genome of B. breve UCC2003. The sulfated monosaccharide N-acetylglucosamine-6-sulfate (GlcNAc-6-S) was shown to support the growth of B. breve UCC2003, while N-acetylglucosamine-3-sulfate, N-acetylgalactosamine-3-sulfate, and N-acetylgalactosamine-6-sulfate did not support appreciable growth. By using a combination of transcriptomic and functional genomic approaches, a gene cluster designated ats2 was shown to be specifically required for GlcNAc-6-S metabolism. Transcription of the ats2 cluster is regulated by a repressor open reading frame kinase (ROK) family transcriptional repressor. This study represents the first description of glycosulfatase activity within the Bifidobacterium genus. Bifidobacteria are saccharolytic organisms naturally found in the digestive tract of mammals and insects. Bifidobacterium breve strains utilize a variety of plant- and host-derived carbohydrates that allow them to be present as prominent members of the infant gut microbiota as well as being present in the gastrointestinal tract of adults. In this study, we introduce a previously unexplored area of carbohydrate metabolism in bifidobacteria, namely, the metabolism of sulfated carbohydrates. B. breve UCC2003 was shown to metabolize N-acetylglucosamine-6-sulfate (GlcNAc-6-S) through one of two sulfatase-encoding gene clusters identified on its genome. GlcNAc-6-S can be found in terminal or branched positions of mucin oligosaccharides, the glycoprotein component of the mucous layer that covers the digestive tract. The results of this study provide
Glycosulfatase-Encoding Gene Cluster in Bifidobacterium breve UCC2003

PubMed Central

Egan, Muireann; Jiang, Hao; O'Connell Motherway, Mary; Oscarson, Stefan

2016-01-01

ABSTRACT Bifidobacteria constitute a specific group of commensal bacteria typically found in the gastrointestinal tract (GIT) of humans and other mammals. Bifidobacterium breve strains are numerically prevalent among the gut microbiota of many healthy breastfed infants. In the present study, we investigated glycosulfatase activity in a bacterial isolate from a nursling stool sample, B. breve UCC2003. Two putative sulfatases were identified on the genome of B. breve UCC2003. The sulfated monosaccharide N-acetylglucosamine-6-sulfate (GlcNAc-6-S) was shown to support the growth of B. breve UCC2003, while N-acetylglucosamine-3-sulfate, N-acetylgalactosamine-3-sulfate, and N-acetylgalactosamine-6-sulfate did not support appreciable growth. By using a combination of transcriptomic and functional genomic approaches, a gene cluster designated ats2 was shown to be specifically required for GlcNAc-6-S metabolism. Transcription of the ats2 cluster is regulated by a repressor open reading frame kinase (ROK) family transcriptional repressor. This study represents the first description of glycosulfatase activity within the Bifidobacterium genus. IMPORTANCE Bifidobacteria are saccharolytic organisms naturally found in the digestive tract of mammals and insects. Bifidobacterium breve strains utilize a variety of plant- and host-derived carbohydrates that allow them to be present as prominent members of the infant gut microbiota as well as being present in the gastrointestinal tract of adults. In this study, we introduce a previously unexplored area of carbohydrate metabolism in bifidobacteria, namely, the metabolism of sulfated carbohydrates. B. breve UCC2003 was shown to metabolize N-acetylglucosamine-6-sulfate (GlcNAc-6-S) through one of two sulfatase-encoding gene clusters identified on its genome. GlcNAc-6-S can be found in terminal or branched positions of mucin oligosaccharides, the glycoprotein component of the mucous layer that covers the digestive tract. The results of
Gene prioritization and clustering by multi-view text mining

PubMed Central

2010-01-01

Background Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effectiveness of disease-gene identification varies between text mining models. Thus, incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. Results We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than the other methods compared. Conclusions In practical research, the relevance of a specific vocabulary to the task is usually unknown. In such cases, multi-view text mining is a superior and promising strategy for text-based disease gene identification. PMID:20074336
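To make the idea of combining several "views" concrete, here is a minimal sketch of gene prioritization by mean-rank aggregation. This is a deliberately simple stand-in and not the multi-source learning algorithms evaluated in the paper; the function name `aggregate_views`, the input layout, and the example view names are all assumptions made for illustration.

```python
def aggregate_views(views):
    """Combine per-view gene scores into one ranking by mean rank.

    views: dict mapping a view name (hypothetical, e.g. one controlled
    vocabulary) to a dict of gene -> relevance score (higher is better).
    Only genes scored in every view are ranked.
    Returns a list of (gene, mean_rank) sorted from best to worst.
    """
    if not views:
        return []
    # Keep only genes that appear in all views
    genes = set.intersection(*(set(scores) for scores in views.values()))
    rank_sums = {gene: 0 for gene in genes}
    for scores in views.values():
        # Rank 1 = highest score; ties broken alphabetically for determinism
        ordered = sorted(genes, key=lambda g: (-scores[g], g))
        for rank, gene in enumerate(ordered, start=1):
            rank_sums[gene] += rank
    n_views = len(views)
    combined = [(gene, rank_sums[gene] / n_views) for gene in genes]
    return sorted(combined, key=lambda item: (item[1], item[0]))


# Illustrative scores only; not data from the study
example = {
    "view_a": {"GENE1": 0.9, "GENE2": 0.5, "GENE3": 0.1},
    "view_b": {"GENE1": 0.8, "GENE2": 0.2, "GENE3": 0.4},
}
ranking = aggregate_views(example)
```

Averaging ranks rather than raw scores avoids having to put scores from different vocabularies on a common scale, which is one reason it is a common baseline for this kind of integration.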
Diversity of nonribosomal peptide synthetase and polyketide synthase gene clusters among taxonomically close Streptomyces strains.

PubMed

Komaki, Hisayuki; Sakurai, Kenta; Hosoyama, Akira; Kimura, Akane; Igarashi, Yasuhiro; Tamura, Tomohiko

2018-05-02

To identify the species of butyrolactol-producing Streptomyces strain TP-A0882, whole-genome sequencing of three type strains in a close taxonomic relationship was performed. In silico DNA-DNA hybridization using the genome sequences suggested that Streptomyces sp. TP-A0882 is classified as Streptomyces diastaticus subsp. ardesiacus. Strain TP-A0882, S. diastaticus subsp. ardesiacus NBRC 15402T, Streptomyces coelicoflavus NBRC 15399T, and Streptomyces rubrogriseus NBRC 15455T harbor at least 14, 14, 10, and 12 biosynthetic gene clusters (BGCs), respectively, coding for nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs). All 14 gene clusters were shared by S. diastaticus subsp. ardesiacus strains TP-A0882 and NBRC 15402T, while only four gene clusters were shared by the three distinct species. Although BGCs for bacteriocin, ectoine, indole, melanin, siderophores such as deferrioxamine, and terpenes such as albaflavenone, hopene, carotenoid and geosmin are shared by the three species, many BGCs for secondary metabolites such as butyrolactone, lantipeptides, oligosaccharide, and some terpenes are species-specific. These results indicate the possibility that strains belonging to the same species possess the same set of secondary metabolite-biosynthetic pathways, whereas strains belonging to distinct species have species-specific pathways, in addition to some common pathways, even if the strains are taxonomically close.
SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering.

PubMed

Van den Eynden, Jimmy; Fierro, Ana Carolina; Verbeke, Lieven P C; Marchal, Kathleen

2015-04-23

With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutations do not influence tumour biology. In this study, SomInaClust is introduced, a method that accurately identifies driver genes based on their mutation pattern across tumour samples and then classifies them as oncogenes or tumour suppressor genes. SomInaClust starts from the observation that oncogenes mainly contain mutations that, due to positive selection, cluster at similar positions in a gene across patient samples, whereas tumour suppressor genes contain a high number of protein-truncating mutations throughout the entire gene length. The method was shown to prioritize driver genes in 9 different solid cancers. Furthermore it was found to be complementary to existing similar-purpose methods with the additional advantages that it has a higher sensitivity, also for rare mutations (occurring in less than 1% of all samples), and it accurately classifies candidate driver genes into putative oncogenes and tumour suppressor genes. Pathway enrichment analysis showed that the identified genes belong to known cancer signalling pathways, and that the distinction between oncogenes and tumour suppressor genes is biologically relevant. SomInaClust was shown to detect candidate driver genes based on somatic mutation patterns of inactivation and clustering and to distinguish oncogenes from tumour suppressor genes. The method could be used for the identification of new cancer genes or to filter mutation data for further data-integration purposes.
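The observation the method starts from (recurrent positions for oncogenes, many truncating mutations for tumour suppressors) can be illustrated with a toy classifier. This is not the SomInaClust implementation: the function name, the input layout, and both thresholds are hypothetical choices made only for this sketch.

```python
from collections import Counter


def classify_gene(mutations, cluster_min=0.5, truncating_min=0.3):
    """Toy classification of one gene from its somatic mutations.

    mutations: list of (protein_position, is_truncating) tuples pooled
    across tumour samples. Thresholds are assumptions, not published values.
    """
    if not mutations:
        return "unclassified"
    total = len(mutations)
    # Fraction of mutations that truncate the protein
    truncating_fraction = sum(1 for _, trunc in mutations if trunc) / total
    # Fraction of mutations that are non-truncating and hit a position
    # mutated more than once (a crude proxy for positional clustering)
    position_counts = Counter(pos for pos, trunc in mutations if not trunc)
    recurrent = sum(c for c in position_counts.values() if c > 1)
    cluster_fraction = recurrent / total
    if cluster_fraction >= cluster_min and cluster_fraction > truncating_fraction:
        return "oncogene-like"
    if truncating_fraction >= truncating_min:
        return "tumour-suppressor-like"
    return "unclassified"


# Illustrative inputs: a hotspot-dominated gene and a truncation-dominated gene
hotspot_gene = [(600, False)] * 8 + [(10, False), (50, True)]
truncated_gene = [(i * 10, True) for i in range(6)] + [
    (5, False), (7, False), (9, False), (11, False)
]
```

Here `classify_gene(hotspot_gene)` falls on the oncogene-like side because 8 of 10 mutations share one position, while `classify_gene(truncated_gene)` falls on the tumour-suppressor-like side because 6 of 10 mutations are truncating and none recur.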
Genes Involved in Degradation of para-Nitrophenol Are Differentially Arranged in Form of Non-Contiguous Gene Clusters in Burkholderia sp. strain SJ98

PubMed Central

Vikram, Surendra; Pandey, Janmejay; Kumar, Shailesh; Raghava, Gajendra Pal Singh

2013-01-01

Biodegradation of para-Nitrophenol (PNP) proceeds via two distinct pathways, having 1,2,3-benzenetriol (BT) and hydroquinone (HQ) as their respective terminal aromatic intermediates. Genes involved in these pathways have already been studied in different PNP degrading bacteria. Burkholderia sp. strain SJ98 degrades PNP via both the pathways. Earlier, we have sequenced and analyzed a ~41 kb fragment from the genomic library of strain SJ98. This DNA fragment was found to harbor all the lower pathway genes; however, genes responsible for the initial transformation of PNP could not be identified within this fragment. Now, we have sequenced and annotated the whole genome of strain SJ98 and found two ORFs (viz., pnpA and pnpB) showing maximum identity at amino acid level with p-nitrophenol 4-monooxygenase (PnpM) and p-benzoquinone reductase (BqR). Unlike the other PNP gene clusters reported earlier in different bacteria, these two ORFs in SJ98 genome are physically separated from the other genes of PNP degradation pathway. In order to ascertain the identity of ORFs pnpA and pnpB, we have performed in-vitro assays using recombinant proteins heterologously expressed and purified to homogeneity. Purified PnpA was found to be a functional PnpM and transformed PNP into benzoquinone (BQ), while PnpB was found to be a functional BqR which catalyzed the transformation of BQ into hydroquinone (HQ). Noticeably, PnpM from strain SJ98 could also transform a number of PNP analogues. Based on the above observations, we propose that the genes for PNP degradation in strain SJ98 are arranged differentially in form of non-contiguous gene clusters. This is the first report for such arrangement for gene clusters involved in PNP degradation. Therefore, we propose that PNP degradation in strain SJ98 could be an important model system for further studies on differential evolution of PNP degradation functions. PMID:24376843
The 987P fimbrial gene cluster of enterotoxigenic Escherichia coli is plasmid encoded.

PubMed Central

Schifferli, D M; Beachey, E H; Taylor, R K

1990-01-01

A clone containing the 987P fimbrial gene cluster was selected from a cosmid library of total DNA of the prototype Escherichia coli strain 987 by using 987P-specific antiserum. A subclone of 12 kilobases containing all of the genes required for fimbrial expression on a nonfimbriated K-12 strain of E. coli and a DNA fragment internal to the fimbrial subunit gene were used to probe the prototype strain and various isolates of 987P-fimbriated enterotoxigenic E. coli. All strains had several plasmids, as shown by agarose gel electrophoresis, and each of five strains which expressed 987P fimbriae showed a plasmid of 35 to 40 megadaltons (MDa) hybridizing to both 987P-specific probes. Hybridization to restricted DNA of strain 987 supported a plasmid origin for the cloned 987P gene cluster. Moreover, an isogenic strain which had lost its 35-MDa plasmid was no longer capable of synthesizing fimbrial subunits, but regained fimbrial expression after reintroduction of the TnphoA (Tn5 IS50L::phoA)-tagged 35-MDa plasmid. Absence of fimbrial subunit synthesis in K-12 strains transformed with the 35-MDa plasmid alone suggested the requirement of regulatory elements existing in strain 987 but missing in K-12 strains. A probe for the heat-stable enterotoxin STIa hybridized in each of the 987P-fimbriated strains to the plasmid containing the 987P genes and in most of these strains to an additional plasmid which contained the gene for the heat-stable enterotoxin STII. Occurrence of the 987P and STIa genes on the same replicon correlates with epidemiological observations, STIa being the most prevalent toxin produced by 987P-fimbriated E. coli. PMID:1967167
Genetic recombination as a major cause of mutagenesis in the human globin gene clusters.

PubMed

Borg, Joseph; Georgitsi, Marianthi; Aleporou-Marinou, Vassiliki; Kollia, Panagoula; Patrinos, George P

2009-12-01

Homologous recombination is a frequent phenomenon in multigene families and as such it occurs several times in both the alpha- and beta-like globin gene families. On numerous occasions, genetic recombination has been implicated as a major mechanism that drives mutagenesis in the human globin gene clusters, either in the form of unequal crossover or gene conversion. Unequal crossover results in the increase or decrease of the human globin gene copies, accompanied in the majority of cases by minor phenotypic consequences, while gene conversion contributes either to maintaining sequence homogeneity or generating sequence diversity. The role of genetic recombination, particularly gene conversion, in the evolution of the human globin gene families has been discussed elsewhere. Here, we summarize our current knowledge and review existing experimental evidence outlining the role of genetic recombination in the mutagenic process in the human globin gene families.
A Minimal Nitrogen Fixation Gene Cluster from Paenibacillus sp. WLY78 Enables Expression of Active Nitrogenase in Escherichia coli

PubMed Central

Zhao, Dehua; Liu, Xiaomeng; Zhang, Bo; Xie, Jianbo; Hong, Yuanyuan; Li, Pengfei; Chen, Sanfeng; Dixon, Ray; Li, Jilun

2013-01-01

Most biological nitrogen fixation is catalyzed by molybdenum-dependent nitrogenase, an enzyme complex comprising two component proteins that contains three different metalloclusters. Diazotrophs contain a common core of nitrogen fixation nif genes that encode the structural subunits of the enzyme and components required to synthesize the metalloclusters. However, the complement of nif genes required to enable diazotrophic growth varies significantly amongst nitrogen fixing bacteria and archaea. In this study, we identified a minimal nif gene cluster consisting of nine nif genes in the genome of Paenibacillus sp. WLY78, a gram-positive, facultative anaerobe isolated from the rhizosphere of bamboo. We demonstrate that the nif genes in this organism are organized as an operon comprising nifB, nifH, nifD, nifK, nifE, nifN, nifX, hesA and nifV and that the nif cluster is under the control of a σ70 (σA)-dependent promoter located upstream of nifB. To investigate genetic requirements for diazotrophy, we transferred the Paenibacillus nif cluster to Escherichia coli. The minimal nif gene cluster enables synthesis of catalytically active nitrogenase in this host, when expressed either from the native nifB promoter or from the T7 promoter. Deletion analysis indicates that in addition to the core nif genes, hesA plays an important role in nitrogen fixation and is responsive to the availability of molybdenum. Whereas nif transcription in Paenibacillus is regulated in response to nitrogen availability and by the external oxygen concentration, transcription from the nifB promoter is constitutive in E. coli, indicating that negative regulation of nif transcription is bypassed in the heterologous host. This study demonstrates the potential for engineering nitrogen fixation in a non-nitrogen fixing organism with a minimum set of nine nif genes. PMID:24146630
The Epipolythiodiketopiperazine Gene Cluster in Claviceps purpurea: Dysfunctional Cytochrome P450 Enzyme Prevents Formation of the Previously Unknown Clapurines.

PubMed

Dopstadt, Julian; Neubauer, Lisa; Tudzynski, Paul; Humpf, Hans-Ulrich

2016-01-01

Claviceps purpurea is an important food contaminant and well known for the production of the toxic ergot alkaloids. Apart from that, little is known about its secondary metabolism and not all toxic substances going along with the food contamination with Claviceps are known yet. We explored the metabolite profile of a gene cluster in C. purpurea with a high homology to gene clusters, which are responsible for the formation of epipolythiodiketopiperazine (ETP) toxins in other fungi. By overexpressing the transcription factor, we were able to activate the cluster in the standard C. purpurea strain 20.1. Although all necessary genes for the formation of the characteristic disulfide bridge were expressed in the overexpression mutants, the fungus did not produce any ETPs. Isolation of pathway intermediates showed that the common biosynthetic pathway stops after the first steps. Our results demonstrate that hydroxylation of the diketopiperazine backbone is the critical step during the ETP biosynthesis. Due to a dysfunctional enzyme, the fungus is not able to produce toxic ETPs. Instead, the pathway end-products are new unusual metabolites with a unique nitrogen-sulfur bond. By heterologous expression of the Leptosphaeria maculans cytochrome P450 encoding gene sirC, we were able to identify the end-products of the ETP cluster in C. purpurea. The thioclapurines are so far unknown ETPs, which might contribute to the toxicity of other C. purpurea strains with a potentially intact ETP cluster.
The Epipolythiodiketopiperazine Gene Cluster in Claviceps purpurea: Dysfunctional Cytochrome P450 Enzyme Prevents Formation of the Previously Unknown Clapurines

PubMed Central

Tudzynski, Paul; Humpf, Hans-Ulrich

2016-01-01

Claviceps purpurea is an important food contaminant and well known for the production of the toxic ergot alkaloids. Apart from that, little is known about its secondary metabolism and not all toxic substances going along with the food contamination with Claviceps are known yet. We explored the metabolite profile of a gene cluster in C. purpurea with a high homology to gene clusters, which are responsible for the formation of epipolythiodiketopiperazine (ETP) toxins in other fungi. By overexpressing the transcription factor, we were able to activate the cluster in the standard C. purpurea strain 20.1. Although all necessary genes for the formation of the characteristic disulfide bridge were expressed in the overexpression mutants, the fungus did not produce any ETPs. Isolation of pathway intermediates showed that the common biosynthetic pathway stops after the first steps. Our results demonstrate that hydroxylation of the diketopiperazine backbone is the critical step during the ETP biosynthesis. Due to a dysfunctional enzyme, the fungus is not able to produce toxic ETPs. Instead, the pathway end-products are new unusual metabolites with a unique nitrogen-sulfur bond. By heterologous expression of the Leptosphaeria maculans cytochrome P450 encoding gene sirC, we were able to identify the end-products of the ETP cluster in C. purpurea. The thioclapurines are so far unknown ETPs, which might contribute to the toxicity of other C. purpurea strains with a potentially intact ETP cluster. PMID:27390873
Comparative genomic analysis of secondary metabolite biosynthetic gene clusters in 207 isolates of Fusarium

USDA-ARS's Scientific Manuscript database

Fusarium species are known for their ability to produce secondary metabolites (SMs), including plant hormones, pigments, mycotoxins, and other compounds with potential agricultural, pharmaceutical, and biotechnological impact. Understanding the distribution of SM biosynthetic gene clusters across th...
IMG-ABC. A knowledge base to fuel discovery of biosynthetic gene clusters and novel secondary metabolites

DOE PAGES

Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; ...

2015-07-14

In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of “big” genomic data for discovering small molecules. IMG-ABC relies on IMG’s comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC’s focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG’s extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG
IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites.

PubMed

Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Szeto, Ernest; Huang, Jinghua; Reddy, T B K; Cimermančič, Peter; Fischbach, Michael A; Ivanova, Natalia N; Markowitz, Victor M; Kyrpides, Nikos C; Pati, Amrita

2015-07-14

In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to
A pyrosequencing assay for the quantitative methylation analysis of the PCDHB gene cluster, the major factor in neuroblastoma methylator phenotype.

PubMed

Banelli, Barbara; Brigati, Claudio; Di Vinci, Angela; Casciano, Ida; Forlani, Alessandra; Borzì, Luana; Allemanni, Giorgio; Romani, Massimo

2012-03-01

Epigenetic alterations are hallmarks of cancer and powerful biomarkers, whose clinical utilization is made difficult by the absence of standardization and of common methods of data interpretation. The coordinate methylation of many loci in cancer is defined as 'CpG island methylator phenotype' (CIMP) and identifies clinically distinct groups of patients. In neuroblastoma (NB), CIMP is defined by a methylation signature, which includes different loci, but its predictive power on outcome is entirely recapitulated by the PCDHB cluster only. We have developed a robust and cost-effective pyrosequencing-based assay that could facilitate the clinical application of CIMP in NB. This assay permits the unbiased simultaneous amplification and sequencing of 17 out of 19 genes of the PCDHB cluster for quantitative methylation analysis, taking into account all the sequence variations. As some of these variations were at CpG doublets, we bypassed the data interpretation conducted by the methylation analysis software to assign the corrected methylation value at these sites. The final result of the assay is the mean methylation level of 17 gene fragments in the protocadherin B (PCDHB) cluster. We have utilized this assay to compare the methylation levels of the PCDHB cluster between high-risk and very low-risk NB patients, confirming the predictive value of CIMP. Our results demonstrate that the pyrosequencing-based assay herein described is a powerful instrument for the analysis of this gene cluster that may simplify the data comparison between different laboratories and, in perspective, could facilitate its clinical application. Furthermore, our results demonstrate that, in principle, pyrosequencing can be efficiently utilized for the methylation analysis of gene clusters with high internal homologies.
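The assay's final readout, as described above, is a single mean methylation level across the gene fragments. A minimal sketch of that summary step follows, assuming each fragment has already been reduced to one methylation percentage; the function name, the input layout, and the example values are hypothetical and not taken from the study.

```python
from statistics import mean


def cluster_methylation(fragment_levels):
    """Return the mean methylation level (percent) across gene fragments.

    fragment_levels: dict mapping a fragment label to its methylation
    percentage (0-100). How each per-fragment value is derived from the
    pyrosequencing signal is outside the scope of this sketch.
    """
    if not fragment_levels:
        raise ValueError("at least one fragment is required")
    for label, level in fragment_levels.items():
        if not 0 <= level <= 100:
            raise ValueError(f"{label}: methylation must be within 0-100")
    return mean(list(fragment_levels.values()))


# Illustrative values only; the real assay reports 17 fragments
sample = {"fragment_1": 15.0, "fragment_2": 50.0, "fragment_3": 25.0}
sample_mean = cluster_methylation(sample)
```

Reducing the cluster to one number per patient is what makes results comparable between laboratories, which is the use case the abstract emphasizes.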

Comparison of Ergot Alkaloid Biosynthesis Gene Clusters in Claviceps Species Indicates Loss of Late Pathway Steps in Evolution of C. fusiformis

PubMed Central

Lorenz, Nicole; Wilson, Ella V.; Machado, Caroline; Schardl, Christopher L.; Tudzynski, Paul

2007-01-01

The grass parasites Claviceps purpurea and Claviceps fusiformis produce ergot alkaloids (EA) in planta and in submerged culture. Whereas EA synthesis (EAS) in C. purpurea proceeds via clavine intermediates to lysergic acid and the complex ergopeptines, C. fusiformis produces only agroclavine and elymoclavine. In C. purpurea the EAS gene (EAS) cluster includes dmaW (encoding the first pathway step), cloA (elymoclavine oxidation to lysergic acid), and the lpsA/lpsB genes (ergopeptine formation). We analyzed the corresponding C. fusiformis EAS cluster to investigate the evolutionary basis for chemotypic differences between the Claviceps species. Other than three peptide synthetase genes (lpsC and the tandem paralogues lpsA1 and lpsA2), homologues of all C. purpurea EAS genes were identified in C. fusiformis, including homologues of lpsB and cloA, which in C. purpurea encode enzymes for steps after clavine synthesis. Rearrangement of the cluster was evident around lpsB, which is truncated in C. fusiformis. This and several frameshift mutations render CflpsB a pseudogene (CflpsBΨ). No obvious inactivating mutation was identified in CfcloA. All C. fusiformis EAS genes, including CflpsBΨ and CfcloA, were expressed in culture. Cross-complementation analyses demonstrated that CfcloA and CflpsBΨ were expressed in C. purpurea but did not encode functional enzymes. In contrast, CpcloA catalyzed lysergic acid biosynthesis in C. fusiformis, indicating that C. fusiformis terminates its EAS pathway at elymoclavine because the cloA gene product is inactive. We propose that the C. fusiformis EAS cluster evolved from a more complete cluster by loss of some lps genes and by rearrangements and mutations inactivating lpsB and cloA. PMID:17720822
A candidate gene study in low HDL-cholesterol families provides evidence for the involvement of the APOA2 gene and the APOA1C3A4 gene cluster.

PubMed

Lilja, Heidi E; Soro, Aino; Ylitalo, Kati; Nuotio, Ilpo; Viikari, Jorma S A; Salomaa, Veikko; Vartiainen, Erkki; Taskinen, Marja-Riitta; Peltonen, Leena; Pajukanta, Päivi

2002-09-01

In patients with premature coronary heart disease, the most common lipoprotein abnormality is high-density lipoprotein (HDL) deficiency. To assess the genetic background of the low HDL-cholesterol trait, we performed a candidate gene study in 25 families with low HDL, collected from the genetically isolated population of Finland. We studied 21 genes encoding essential proteins involved in the HDL metabolism by genotyping intragenic and flanking markers for these genes. We found suggestive evidence for linkage in two candidate regions: Marker D1S2844, in the apolipoprotein A-II (APOA2) region, yielded a LOD score of 2.14 and marker D11S939 flanking the apolipoprotein A-I/C-III/A-IV gene cluster (APOA1C3A4) produced a LOD score of 1.69. Interestingly, we identified potential shared haplotypes in these two regions in a subset of low HDL families. These families also contributed to the obtained positive LOD scores, whereas the rest of the families produced negative LOD scores. None of the remaining candidate regions provided any evidence for linkage. Since only a limited number of loci were tested in this candidate gene study, these LOD scores suggest significant involvement of the APOA2 gene and the APOA1C3A4 gene cluster, or loci in their immediate vicinity, in the pathogenesis of low HDL.
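For readers unfamiliar with the statistic, a LOD score is the base-10 logarithm of the likelihood of the data at a given recombination fraction θ relative to the likelihood under free recombination (θ = 0.5). The sketch below covers only the simplest textbook case of phase-known meioses with counted recombinants; the function name and inputs are illustrative, and the abstract does not state which linkage model or software the study used.

```python
import math


def lod_score(recombinants, meioses, theta):
    """Two-point LOD score for phase-known meioses.

    LOD(theta) = log10( theta^r * (1 - theta)^(n - r) / 0.5^n )
    where r is the number of recombinants among n informative meioses.
    """
    if not 0 < theta <= 0.5:
        raise ValueError("theta must be in (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must be between 0 and meioses")
    non_recombinants = meioses - recombinants
    log_likelihood_theta = (
        recombinants * math.log10(theta)
        + non_recombinants * math.log10(1 - theta)
    )
    log_likelihood_null = meioses * math.log10(0.5)
    return log_likelihood_theta - log_likelihood_null


# Example: 1 recombinant in 10 meioses, evaluated at theta = 0.1
example_lod = lod_score(1, 10, 0.1)
```

By the conventional thresholds, a LOD of 3 or more is taken as significant evidence for linkage and values between roughly 1 and 3 as suggestive, which is why the scores of 2.14 and 1.69 reported above are described as suggestive evidence.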
Analysis of FOXF1 and the FOX gene cluster in patients with VACTERL association

PubMed Central

Agochukwu, Nneamaka B.; Pineda-Alvarez, Daniel E.; Keaton, Amelia A.; Warren-Mora, Nicole; Raam, Manu S.; Kamat, Aparna; Chandrasekharappa, Settara C.; Solomon, Benjamin D.

2011-01-01

VACTERL association, a relatively common condition with an incidence of approximately 1 in 20,000 – 35,000 births, is a non-random association of birth defects that includes vertebral defects (V), anal atresia (A), cardiac defects (C), tracheo-esophageal fistula (TE), renal anomalies (R) and limb malformations (L). Although the etiology is unknown in the majority of patients, there is evidence that it is causally heterogeneous. Several studies have shown evidence for inheritance in VACTERL, implying a role for genetic loci. Recently, patients with component features of VACTERL and a lethal developmental pulmonary disorder, alveolar capillary dysplasia with misalignment of pulmonary veins (ACD/MPV), were found to harbor deletions or mutations affecting FOXF1 and the FOX gene cluster on chromosome 16q24. We investigated this gene through direct sequencing and high-density SNP microarray in 12 patients with VACTERL association but without ACD/MPV. Our mutational analysis of FOXF1 showed normal sequences and no genomic imbalances affecting the FOX gene cluster on chromosome 16q24 in the studied patients. Possible explanations for these results include the etiologic and clinical heterogeneity of VACTERL association, the possibility that mutations affecting this gene may occur only in more severely affected individuals, and insufficient study sample size. PMID:21315191
Sequencing and Transcriptional Analysis of the Biosynthesis Gene Cluster of Putrescine-Producing Lactococcus lactis

PubMed Central

Ladero, Victor; Rattray, Fergal P.; Mayo, Baltasar; Martín, María Cruz; Fernández, María; Alvarez, Miguel A.

2011-01-01

Lactococcus lactis is a prokaryotic microorganism with great importance as a culture starter and has become the model species among the lactic acid bacteria. The long and safe history of use of L. lactis in dairy fermentations has resulted in the classification of this species as GRAS (Generally Recognized As Safe) or QPS (Qualified Presumption of Safety). However, our group has identified several strains of L. lactis subsp. lactis and L. lactis subsp. cremoris that are able to produce putrescine from agmatine via the agmatine deiminase (AGDI) pathway. Putrescine is a biogenic amine that confers undesirable flavor characteristics and may even have toxic effects. The AGDI cluster of L. lactis is composed of a putative regulatory gene, aguR, followed by the genes (aguB, aguD, aguA, and aguC) encoding the catabolic enzymes. These genes are transcribed as an operon that is induced in the presence of agmatine. In some strains, an insertion sequence (IS) element interrupts the transcription of the cluster, which results in a non-putrescine-producing phenotype. Based on this knowledge, a PCR-based test was developed in order to differentiate nonproducing L. lactis strains from those with a functional AGDI cluster. The analysis of the AGDI cluster and its flanking regions revealed that the capacity to produce putrescine via the AGDI pathway could be a specific characteristic that was lost during the adaptation to the milk environment by a process of reductive genome evolution. PMID:21803900
Association between alcohol dehydrogenase 1C gene *1/*2 polymorphism and pancreatitis risk: a meta-analysis.

PubMed

Fang, F; Pan, J; Su, G H; Xu, L X; Li, G; Li, Z H; Zhao, H; Wang, J

2015-11-30

Numerous studies have focused on the relationship between alcohol dehydrogenase 1C gene (ADH1C) *1/*2 polymorphism (Ile350Val, rs698, also known as ADH1C *1/*2) and pancreatitis risk, but the results have been inconsistent. Thus, we conducted a meta-analysis to more precisely estimate this association. Relevant publications were searched in several widely used databases and 9 eligible studies were included in the meta-analysis. Pooled odds ratios (ORs) and 95% confidence intervals (CIs) were calculated to evaluate the strength of the association. Significant associations between ADH1C *1/*2 polymorphism and pancreatitis risk were observed in both overall meta-analysis for 12 vs 22 (OR = 1.53, 95%CI = 1.12-2.10) and 11 + 12 vs 22 (OR = 1.44, 95%CI = 1.07-1.95), and the chronic alcoholic pancreatitis subgroup for 12 vs 22 (OR = 1.64, 95%CI = 1.17-2.29) and 11 + 12 vs 22 (OR = 1.53, 95%CI = 1.11-2.11). Significant pancreatitis risk variation was also detected in Caucasians for 11 + 12 vs 22 (OR = 1.45, 95%CI = 1.07-1.98). In conclusion, the ADH1C *1/*2 polymorphism is likely associated with pancreatitis risk, particularly chronic alcoholic pancreatitis risk, with the *1 allele functioning as a risk factor.
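To make the pooled odds ratios and confidence intervals above more concrete, here is a minimal sketch of how an OR with a 95% CI can be derived from a 2x2 genotype table (Woolf's log method) and how several studies can be combined by fixed-effect inverse-variance weighting. The abstract does not state which pooling model the authors used, so the fixed-effect choice, the function names, and any counts passed in are assumptions for illustration only.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and CI for one 2x2 table (Woolf's log method).

    a: cases with the risk genotype      b: cases with the reference genotype
    c: controls with the risk genotype   d: controls with the reference genotype
    All four counts must be positive. Returns (OR, lower, upper).
    """
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of ln(OR)
    return (math.exp(log_or),
            math.exp(log_or - z * se),
            math.exp(log_or + z * se))


def pooled_or_fixed(tables, z=1.96):
    """Fixed-effect (inverse-variance) pooled OR over several 2x2 tables.

    tables: iterable of (a, b, c, d) tuples, one per study (hypothetical data).
    Returns (pooled OR, lower, upper).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        variance = 1 / a + 1 / b + 1 / c + 1 / d
        weight = 1 / variance  # more precise studies get more weight
        weighted_sum += weight * log_or
        weight_total += weight
    pooled_log_or = weighted_sum / weight_total
    se = math.sqrt(1 / weight_total)
    return (math.exp(pooled_log_or),
            math.exp(pooled_log_or - z * se),
            math.exp(pooled_log_or + z * se))
```

An association is conventionally called significant at the 5% level when the 95% CI excludes 1, as is the case for every interval quoted in the abstract.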
NFκB-mediated activation of the cellular FUT3, 5 and 6 gene cluster by herpes simplex virus type 1.

PubMed

Nordén, Rickard; Samuelsson, Ebba; Nyström, Kristina

2017-11-01

Herpes simplex virus type 1 has the ability to induce expression of a human gene cluster located on chromosome 19 upon infection. This gene cluster contains three fucosyltransferases (encoded by FUT3, FUT5 and FUT6) with the ability to add a fucose to an N-acetylglucosamine residue. Little is known regarding the transcriptional activation of these three genes in human cells. Intriguingly, herpes simplex virus type 1 activates all three genes simultaneously during infection, a situation not observed in uninfected tissue, pointing towards a virus specific mechanism for transcriptional activation. The aim of this study was to define the underlying mechanism for the herpes simplex virus type 1 activation of FUT3, FUT5 and FUT6 transcription. The transcriptional activation of the FUT-gene cluster on chromosome 19 in fibroblasts was specific, not involving adjacent genes. Moreover, inhibition of NFκB signaling through panepoxydone treatment significantly decreased the induction of FUT3, FUT5 and FUT6 transcriptional activation, as did siRNA targeting of p65, in herpes simplex virus type 1 infected fibroblasts. NFκB and p65 signaling appears to play an important role in the regulation of FUT3, FUT5 and FUT6 transcriptional activation by herpes simplex virus type 1 although additional, unidentified, viral factors might account for part of the mechanism as direct interferon mediated stimulation of NFκB was not sufficient to induce the fucosyltransferase encoding gene cluster in uninfected cells. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Novel linkage disequilibrium clustering algorithm identifies new lupus genes on meta-analysis of GWAS datasets.

PubMed

Saeed, Mohammad

2017-05-01

Systemic lupus erythematosus (SLE) is a complex disorder. Genetic association studies of complex disorders suffer from the following three major issues: phenotypic heterogeneity, false positive (type I error), and false negative (type II error) results. Hence, genes with low to moderate effects are missed in standard analyses, especially after statistical corrections. OASIS is a novel linkage disequilibrium clustering algorithm that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders such as SLE. OASIS was applied to two SLE dbGAP GWAS datasets (6077 subjects; ∼0.75 million single-nucleotide polymorphisms). OASIS identified three known SLE genes viz. IFIH1, TNIP1, and CD44, not previously reported using these GWAS datasets. In addition, 22 novel loci for SLE were identified and the 5 SLE genes previously reported using these datasets were verified. OASIS methodology was validated using single-variant replication and gene-based analysis with GATES. This led to the verification of 60% of OASIS loci. New SLE genes that OASIS identified and were further verified include TNFAIP6, DNAJB3, TTF1, GRIN2B, MON2, LATS2, SNX6, RBFOX1, NCOA3, and CHAF1B. This study presents the OASIS algorithm, software, and the meta-analyses of two publicly available SLE GWAS datasets along with the novel SLE genes. Hence, OASIS is a novel linkage disequilibrium clustering method that can be universally applied to existing GWAS datasets for the identification of new genes.
Complete sequence of a plasmid from a bovine methicillin-resistant Staphylococcus aureus harbouring a novel ica-like gene cluster in addition to antimicrobial and heavy metal resistance genes.

PubMed

Feßler, Andrea T; Zhao, Qin; Schoenfelder, Sonja; Kadlec, Kristina; Brenner Michael, Geovana; Wang, Yang; Ziebuhr, Wilma; Shen, Jianzhong; Schwarz, Stefan

2017-02-01

The multiresistance plasmid pAFS11, obtained from a bovine methicillin-resistant Staphylococcus aureus (MRSA) isolate, was completely sequenced and analysed for its structure and organisation. Moreover, the susceptibility to the heavy metals cadmium and copper was determined by broth macrodilution. The 49,189-bp plasmid harboured the apramycin resistance gene apmA, two copies of the macrolide/lincosamide/streptogramin B resistance gene erm(B) (both located on remnants of a truncated transposon Tn917), the kanamycin/neomycin resistance gene aadD, the tetracycline resistance gene tet(L) and the trimethoprim resistance gene dfrK. The latter three genes were part of a 7,284-bp segment which was bracketed by two copies of IS431. In addition, the cadmium resistance operon cadDX as well as the copper resistance genes copA and mco were located on the plasmid and mediated a reduced susceptibility to cadmium and copper. Moreover, a complete novel ica-like gene cluster of so far unknown genetic origin was detected on this plasmid. The ica-like gene cluster comprised four different genes whose products showed 64.4-76.9% homology to the Ica proteins known to be involved in biofilm formation of the S. aureus strains Mu50, Mu3 and N315. However, 96.2-99.4% homology was seen to proteins from S. sciuri NS1 indicating an S. sciuri origin. The finding of five different antibiotic resistance genes co-located on a plasmid with heavy metal resistance genes and an ica-like gene cluster is alarming. With the acquisition of this plasmid, antimicrobial multiresistance, heavy metal resistances and potential virulence properties may be co-selected and spread via a single horizontal gene transfer event. Copyright © 2016 Elsevier B.V. All rights reserved.
Functional Angucycline-Like Antibiotic Gene Cluster in the Terminal Inverted Repeats of the Streptomyces ambofaciens Linear Chromosome

PubMed Central

Pang, Xiuhua; Aigle, Bertrand; Girardet, Jean-Michel; Mangenot, Sophie; Pernodet, Jean-Luc; Decaris, Bernard; Leblond, Pierre

2004-01-01

Streptomyces ambofaciens has an 8-Mb linear chromosome ending in 200-kb terminal inverted repeats. Analysis of the F6 cosmid overlapping the terminal inverted repeats revealed a locus similar to type II polyketide synthase (PKS) gene clusters. Sequence analysis identified 26 open reading frames, including genes encoding the β-ketoacyl synthase (KS), chain length factor (CLF), and acyl carrier protein (ACP) that make up the minimal PKS. These KS, CLF, and ACP subunits are highly homologous to minimal PKS subunits involved in the biosynthesis of angucycline antibiotics. The genes encoding the KS and ACP subunits are transcribed constitutively but show a remarkable increase in expression after entering transition phase. Five genes, including those encoding the minimal PKS, were replaced by resistance markers to generate single and double mutants (replacement in one and both terminal inverted repeats). Double mutants were unable to produce either diffusible orange pigment or antibacterial activity against Bacillus subtilis. Single mutants showed an intermediate phenotype, suggesting that each copy of the cluster was functional. Transformation of double mutants with a conjugative and integrative form of F6 partially restored both phenotypes. The pigmented and antibacterial compounds were shown to be two distinct molecules produced from the same biosynthetic pathway. High-pressure liquid chromatography analysis of culture extracts from wild-type and double mutants revealed a peak with an associated bioactivity that was absent from the mutants. Two additional genes encoding KS and CLF were present in the cluster. However, disruption of the second KS gene had no effect on either pigment or antibiotic production. PMID:14742212
Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.

PubMed

Makarova, Kira S; Sorokin, Alexander V; Novichkov, Pavel S; Wolf, Yuri I; Koonin, Eugene V

2007-11-27

An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genomes) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that
Identification of suitable genes contributes to lung adenocarcinoma clustering by multiple meta-analysis methods.

PubMed

Yang, Ze-Hui; Zheng, Rui; Gao, Yuan; Zhang, Qiang

2016-09-01

With the widespread application of high-throughput technology, numerous meta-analysis methods have been proposed for differential expression profiling across multiple studies. We identified the suitable differentially expressed (DE) genes that contributed to lung adenocarcinoma (ADC) clustering based on seven popular multiple meta-analysis methods. Seven microarray expression profiles of ADC and normal controls were extracted from the ArrayExpress database. Bioconductor was used for preliminary data preprocessing. Then, DE genes across multiple studies were identified. Hierarchical clustering was applied to compare the classification performance for microarray data samples. The classification efficiency was compared based on accuracy, sensitivity and specificity. Across seven datasets, 573 ADC cases and 222 normal controls were collected. After filtering out unexpressed and noninformative genes, 3688 genes remained for further analysis. The classification efficiency analysis showed that DE genes identified by the sum of ranks method separated ADC from normal controls with the best accuracy, sensitivity and specificity of 0.953, 0.969 and 0.932, respectively. The gene set with the highest classification accuracy mainly participated in the regulation of response to external stimulus (P = 7.97E-04), cyclic nucleotide-mediated signaling (P = 0.01), regulation of cell morphogenesis (P = 0.01) and regulation of cell proliferation (P = 0.01). Evaluating the classification efficiency of DE genes identified by different meta-analysis methods provided a new perspective on choosing a suitable method for a given application. Different meta-analysis methods perform differently, so the choice of method for a particular study should weigh several criteria together. © 2015 John Wiley & Sons Ltd.
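The three classification measures named above follow directly from a confusion matrix. The sketch below shows the standard definitions, treating ADC samples as the positive class. The function name and the counts in the usage line are hypothetical and are not the study's data.

```python
def classification_metrics(tp: int, fn: int, tn: int, fp: int) -> dict:
    """Accuracy, sensitivity and specificity from confusion-matrix counts.

    tp: cases correctly clustered as cases      fn: cases clustered as controls
    tn: controls correctly clustered as controls fp: controls clustered as cases
    """
    positives = tp + fn
    negatives = tn + fp
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one case and one control")
    return {
        "accuracy": (tp + tn) / (positives + negatives),
        "sensitivity": tp / positives,   # true positive rate
        "specificity": tn / negatives,   # true negative rate
    }


# Hypothetical counts, for illustration only
metrics = classification_metrics(tp=90, fn=10, tn=45, fp=5)
```

Because accuracy depends on the case-to-control ratio while sensitivity and specificity do not, reporting all three, as the abstract does, gives a fuller picture when classes are imbalanced.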
Quick identification of acetic acid bacteria based on nucleotide sequences of the 16S-23S rDNA internal transcribed spacer region and of the PQQ-dependent alcohol dehydrogenase gene.

PubMed

Trcek, Janja

2005-10-01

Acetic acid bacteria (AAB) are well known for oxidizing different ethanol-containing substrates into various types of vinegar. They are also used for production of some biotechnologically important products, such as sorbose and gluconic acids. However, their presence is not always appreciated since certain species also spoil wine, juice, beer and fruits. To be able to follow AAB in all these processes, the species involved must be identified accurately and quickly. Because phenotypic analysis of AAB is inaccurate and very time-consuming, the application of molecular methods is necessary. Since the pairwise comparison among the 16S rRNA gene sequences of AAB shows very high similarity (up to 99.9%), other DNA targets should be used. Our previous studies showed that the restriction analysis of the 16S-23S rDNA internal transcribed spacer region is a suitable approach for quick affiliation of an acetic acid bacterium to a distinct group of restriction types and also for quick identification of a potentially novel species of acetic acid bacterium (Trcek & Teuber 2002; Trcek 2002). However, with the exception of two conserved genes, encoding tRNAIle and tRNAAla, the sequences of 16S-23S rDNA are highly divergent among AAB species. For this reason we analyzed in this study a gene encoding PQQ-dependent ADH as a possible DNA target. First we confirmed the expression of subunit I of PQQ-dependent ADH (AdhA) also in Asaia, the only genus of AAB which exhibits little or no ADH activity. We then analyzed the partial sequences of adhA among some representative species of the genera Acetobacter, Gluconobacter and Gluconacetobacter. The conserved and variable regions in these sequences made possible the construction of an A. aceti-specific oligonucleotide, the specificity of which was confirmed by PCR using 45 well-defined strains of AAB as DNA templates. The primer was also successfully used in direct identification of A. aceti from home made cider vinegar as well as for
The Human Paraoxonase Gene Cluster As a Target in the Treatment of Atherosclerosis

PubMed Central

She, Zhi-Gang; Chen, Hou-Zao; Yan, Yunfei; Li, Hongliang

2012-01-01

The paraoxonase (PON) gene cluster contains three adjacent gene members, PON1, PON2, and PON3. Originating from the same fungal lactonase precursor, all three PON genes share high sequence identity and a similar β propeller protein structure. PON1 and PON3 are primarily expressed in the liver and secreted into the serum upon expression, whereas PON2 is ubiquitously expressed and remains inside the cell. Each PON member has high catalytic activity toward corresponding artificial organophosphate, and all exhibit activities to lactones. Therefore, all three members of the family are regarded as lactonases. Under physiological conditions, they act to degrade metabolites of polyunsaturated fatty acids and homocysteine (Hcy) thiolactone, among other compounds. By detoxifying both oxidized low-density lipoprotein and Hcy thiolactone, PONs protect against atherosclerosis and coronary artery diseases, as has been illustrated by many types of in vitro and in vivo experimental evidence. Clinical observations focusing on gene polymorphisms also indicate that PON1, PON2, and PON3 are protective against coronary artery disease. Many other conditions, such as diabetes, metabolic syndrome, and aging, have been shown to relate to PONs. The abundance and/or activity of PONs can be regulated by lipoproteins and their metabolites, biological macromolecules, pharmacological treatments, dietary factors, and lifestyle. In conclusion, both previous results and ongoing studies provide evidence, making the PON cluster a prospective target for the treatment of atherosclerosis. Antioxid. Redox Signal. 16, 597–632. PMID:21867409
Comparison of ergot alkaloid biosynthesis gene clusters in Claviceps species indicates loss of late pathway steps in evolution of C. fusiformis.

PubMed

Lorenz, Nicole; Wilson, Ella V; Machado, Caroline; Schardl, Christopher L; Tudzynski, Paul

2007-11-01

The grass parasites Claviceps purpurea and Claviceps fusiformis produce ergot alkaloids (EA) in planta and in submerged culture. Whereas EA synthesis (EAS) in C. purpurea proceeds via clavine intermediates to lysergic acid and the complex ergopeptines, C. fusiformis produces only agroclavine and elymoclavine. In C. purpurea the EAS gene (EAS) cluster includes dmaW (encoding the first pathway step), cloA (elymoclavine oxidation to lysergic acid), and the lpsA/lpsB genes (ergopeptine formation). We analyzed the corresponding C. fusiformis EAS cluster to investigate the evolutionary basis for chemotypic differences between the Claviceps species. Other than three peptide synthetase genes (lpsC and the tandem paralogues lpsA1 and lpsA2), homologues of all C. purpurea EAS genes were identified in C. fusiformis, including homologues of lpsB and cloA, which in C. purpurea encode enzymes for steps after clavine synthesis. Rearrangement of the cluster was evident around lpsB, which is truncated in C. fusiformis. This and several frameshift mutations render CflpsB a pseudogene (CflpsBΨ). No obvious inactivating mutation was identified in CfcloA. All C. fusiformis EAS genes, including CflpsBΨ and CfcloA, were expressed in culture. Cross-complementation analyses demonstrated that CfcloA and CflpsBΨ were expressed in C. purpurea but did not encode functional enzymes. In contrast, CpcloA catalyzed lysergic acid biosynthesis in C. fusiformis, indicating that C. fusiformis terminates its EAS pathway at elymoclavine because the cloA gene product is inactive. We propose that the C. fusiformis EAS cluster evolved from a more complete cluster by loss of some lps genes and by rearrangements and mutations inactivating lpsB and cloA.
Hierarchical Bayesian modelling of gene expression time series across irregularly sampled replicates and clusters.

PubMed

Hensman, James; Lawrence, Neil D; Rattray, Magnus

2013-08-20

Time course data from microarrays and high-throughput sequencing experiments require simple, computationally efficient and powerful statistical models to extract meaningful biological signal, and for tasks such as data fusion and clustering. Existing methodologies fail to capture either the temporal or replicated nature of the experiments, and often impose constraints on the data collection process, such as regularly spaced samples, or similar sampling schema across replications. We propose hierarchical Gaussian processes as a general model of gene expression time-series, with application to a variety of problems. In particular, we illustrate the method's capacity for missing data imputation, data fusion and clustering. The method can impute data which is missing both systematically and at random: in a hold-out test on real data, performance is significantly better than commonly used imputation methods. The method's ability to model inter- and intra-cluster variance leads to more biologically meaningful clusters. The approach removes the necessity for evenly spaced samples, an advantage illustrated on a developmental Drosophila dataset with irregular replications. The hierarchical Gaussian process model provides an excellent statistical basis for several gene-expression time-series tasks. It has only a few additional parameters over a regular GP, has negligible additional complexity, is easily implemented and can be integrated into several existing algorithms. Our experiments were implemented in Python, and are available from the authors' website: http://staffwww.dcs.shef.ac.uk/people/J.Hensman/.
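One common way to express a hierarchical Gaussian process over replicates is through its covariance: every pair of observations shares a gene-level kernel, and pairs from the same replicate additionally share a replicate-level kernel. The sketch below builds such a covariance matrix for irregularly sampled time points. It is an illustration under stated assumptions, not the authors' code: the squared-exponential kernel, the hyperparameter values, and all function names are hypothetical choices.

```python
import math


def rbf(t1: float, t2: float, variance: float, lengthscale: float) -> float:
    """Squared-exponential kernel between two time points."""
    return variance * math.exp(-0.5 * ((t1 - t2) / lengthscale) ** 2)


def hierarchical_cov(times, replicates,
                     gene_var=1.0, gene_len=2.0,
                     rep_var=0.3, rep_len=1.0):
    """Covariance matrix for a two-level (gene, replicate) hierarchical GP.

    times:      observation times, one per sample (need not be evenly spaced)
    replicates: replicate label for each sample, same length as times
    The hyperparameter defaults are arbitrary illustrative values.
    """
    if len(times) != len(replicates):
        raise ValueError("times and replicates must have the same length")
    n = len(times)
    cov = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            # Shared gene-level structure: applies to every pair of samples
            k = rbf(times[i], times[j], gene_var, gene_len)
            # Replicate-specific deviation: only within the same replicate
            if replicates[i] == replicates[j]:
                k += rbf(times[i], times[j], rep_var, rep_len)
            cov[i][j] = k
    return cov


# Two replicates sampled at different, irregular times (hypothetical values)
K = hierarchical_cov(times=[0.0, 1.5, 0.5, 3.0], replicates=[0, 0, 1, 1])
```

Because the kernel is evaluated on the actual sample times, nothing in this construction requires the replicates to share a sampling schedule, which is the property the abstract highlights.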
New gene cluster from the thermophile Bacillus fordii MH602 in the conversion of DL-5-substituted hydantoins to L-amino acids.

PubMed

Mei, Yan-Zhen; Wan, Yong-Min; He, Bing-Fang; Ying, Han-Jie; Ouyang, Ping-Kai

2009-12-01

The thermophile Bacillus fordii MH602 was selected in a screen for stereospecific hydrolysis of DL-5-substituted hydantoins to L-alpha-amino acids. Because the reaction runs at higher temperature, it benefits from enhanced substrate solubility and from racemization of DL-5-substituted hydantoins during the conversion. This paper reports the first hydantoin metabolism gene cluster from a thermophile. The genes involved in hydantoin utilization (hyu) were isolated on an 8.2 kb DNA fragment by Restriction Site-dependent PCR, and six ORFs were identified by DNA sequence analysis. The hyu gene cluster contained four genes with novel cluster organization characteristics: the hydantoinase gene hyuH, putative transport protein hyuP, hyperprotein hyuHP, and L-carbamoylase gene hyuC. The hyuH and hyuC genes were heterologously expressed in E. coli. The results indicated that hyuH and hyuC are involved in the conversion of DL-5-substituted hydantoins to an N-carbamyl intermediate that is subsequently converted to L-alpha-amino acids. Compared with previously reported enzymes, the hydantoinase and carbamoylase from B. fordii MH602 showed highest identities of 71% and 39%, respectively. The novel cluster organization and the differences in the key enzymes between the thermophile B. fordii MH602 and mesophiles were presumed to be related to the evolutionary origins of the metabolism concerned.
Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

PubMed

Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

2018-04-26

Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcript levels of genes encoding for secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding for growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.

PubMed

Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U

1999-02-01

A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.
New natural products isolated from Metarhizium robertsii ARSEF 23 by chemical screening and identification of the gene cluster through engineered biosynthesis in Aspergillus nidulans A1145.

PubMed

Kato, Hiroki; Tsunematsu, Yuta; Yamamoto, Tsuyoshi; Namiki, Takuya; Kishimoto, Shinji; Noguchi, Hiroshi; Watanabe, Kenji

2016-07-01

To rapidly identify novel natural products and their associated biosynthetic genes from underutilized and genetically difficult-to-manipulate microbes, we developed a method that uses (1) chemical screening to isolate novel microbial secondary metabolites, (2) bioinformatic analyses to identify a potential biosynthetic gene cluster and (3) heterologous expression of the genes in a convenient host to confirm the identity of the gene cluster and the proposed biosynthetic mechanism. The chemical screen was achieved by searching known natural product databases with data from liquid chromatographic and high-resolution mass spectrometric analyses collected on the extract from a target microbe culture. Using this method, we were able to isolate two new meroterpenes, subglutinols C (1) and D (2), from an entomopathogenic filamentous fungus Metarhizium robertsii ARSEF 23. Bioinformatics analysis of the genome allowed us to identify a gene cluster likely to be responsible for the formation of subglutinols. Heterologous expression of three genes from the gene cluster encoding a polyketide synthase, a prenyltransferase and a geranylgeranyl pyrophosphate synthase in Aspergillus nidulans A1145 afforded an α-pyrone-fused uncyclized diterpene, the expected intermediate of the subglutinol biosynthesis, thereby confirming the gene cluster to be responsible for the subglutinol biosynthesis. These results indicate the usefulness of our methodology in isolating new natural products and identifying their associated biosynthetic gene cluster from microbes that are not amenable to genetic manipulation. Our method should facilitate the natural product discovery efforts by expediting the identification of new secondary metabolites and their associated biosynthetic genes from a wider source of microbes.
The Fdb3 transcription factor of the Fusarium Detoxification of Benzoxazolinone gene cluster is required for MBOA but not BOA degradation in Fusarium pseudograminearum.

PubMed

Kettle, Andrew J; Carere, Jason; Batley, Jacqueline; Manners, John M; Kazan, Kemal; Gardiner, Donald M

2016-03-01

A number of cereals produce the benzoxazolinone class of phytoalexins. Fusarium species pathogenic towards these hosts can typically degrade these compounds via an aminophenol intermediate, and the ability to do so is encoded by a group of genes found in the Fusarium Detoxification of Benzoxazolinone (FDB) cluster. A zinc finger transcription factor encoded by one of the FDB cluster genes (FDB3) has been proposed to regulate the expression of other genes in the cluster and hence is potentially involved in benzoxazolinone degradation. Herein we show that Fdb3 is essential for the ability of Fusarium pseudograminearum to efficiently detoxify the predominant wheat benzoxazolinone, 6-methoxy-benzoxazolin-2-one (MBOA), but not benzoxazolin-2-one (BOA). Furthermore, additional genes thought to be part of the FDB gene cluster, based upon transcriptional response to benzoxazolinones, are regulated by Fdb3. However, deletion mutants for these latter genes remain capable of benzoxazolinone degradation, suggesting that they are not essential for this process. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.

Invasive Species Management on Military Lands: Clustered Regularly Interspaced Short Palindromic Repeat/CRISPR-associated protein 9 (CRISPR/Cas9)-based Gene Drives

DTIC Science & Technology

2017-06-30

Environmental Laborat... Ping... “CRISPR/Cas9-based Gene Drives for Invasive Species Management on Military Lands,” ERDC/EL SR-17-2. Abstract: Applications of genetic engineering ...
Heterogeneic dynamics of the structures of multiple gene clusters in two pathogenetically different lines originating from the same phytoplasma.

PubMed

Arashida, Ryo; Kakizawa, Shigeyuki; Hoshi, Ayaka; Ishii, Yoshiko; Jung, Hee-Young; Kagiwada, Satoshi; Yamaji, Yasuyuki; Oshima, Kenro; Namba, Shigetou

2008-04-01

Phytoplasmas are phloem-limited plant pathogens that are transmitted by insect vectors and are associated with diseases in hundreds of plant species. Despite their small sizes, phytoplasma genomes have repeat-rich sequences, which are due to several genes that are encoded as multiple copies. These multiple genes exist in a gene cluster, the potential mobile unit (PMU). PMUs are present at several distinct regions in the phytoplasma genome. The multicopy genes encoded by PMUs (herein named mobile unit genes [MUGs]) and similar genes elsewhere in the genome (herein named fundamental genes [FUGs]) are likely to have the same function based on their annotations. In this manuscript we show evidence that MUGs and FUGs do not cluster together within the same clade. Each MUG is in a cluster with a short branch length, suggesting that MUGs are recently diverged paralogs, whereas the origin of FUGs is different from that of MUGs. We also compared the genome structures around the lplA gene in two derivative lines of the 'Candidatus Phytoplasma asteris' OY strain, the severe-symptom line W (OY-W) and the mild-symptom line M (OY-M). The gene organizations of the nucleotide sequences upstream of the lplA genes of OY-W and OY-M were dramatically different. The tra5 insertion sequence, an element of PMUs, was found only in this region in OY-W. These results suggest that transposition of entire PMUs and PMU sections has occurred frequently in the OY phytoplasma genome. The difference in the pathogenicities of OY-W and OY-M might be caused by the duplication and transposition of PMUs, followed by genome rearrangement.
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.

PubMed

Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J

2017-08-29

The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.
Sequence and genetic organization of a Zymomonas mobilis gene cluster that encodes several enzymes of glucose metabolism

DOE Office of Scientific and Technical Information (OSTI.GOV)

Barnell, W.O.; Kyung Cheol Yi; Conway, T.

1990-12-01

The Zymomonas mobilis genes that encode glucose-6-phosphate dehydrogenase (zwf), 6-phosphogluconate dehydratase (edd), and glucokinase (glk) were cloned independently by genetic complementation of specific defects in Escherichia coli metabolism. The identity of these cloned genes was confirmed by various biochemical means. Nucleotide sequence analysis established that these three genes are clustered on the genome and revealed an additional open reading frame in this region that has significant amino acid identity to the E. coli xylose-proton symporter and the human glucose transporter. On the basis of this evidence and structural analysis of the deduced primary amino acid sequence, this gene is believed to encode the Z. mobilis glucose-facilitated diffusion protein, glf. The four genes in the 6-kb cluster are organized in the order glf, zwf, edd, glk. The glf and zwf genes are separated by 146 bp. The zwf and edd genes overlap by 8 bp, and their expression may be translationally coupled. The edd and glk genes are separated by 203 bp. The glk gene is followed by tandem transcriptional terminators. The four genes appear to be organized in an operon. Such an arrangement of the genes that govern glucose uptake and the first three steps of the Entner-Doudoroff glycolytic pathway provides the organism with a mechanism for carefully regulating the levels of the enzymes that control carbon flux into the pathway.
CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks.

PubMed

Li, Min; Li, Dongyan; Tang, Yu; Wu, Fangxiang; Wang, Jianxin

2017-08-31

Cluster analysis of biological networks has become one of the most important approaches to identifying functional modules and predicting protein complexes and network biomarkers. Furthermore, visualization of clustering results is crucial for displaying the structure of biological networks. Here we present CytoCluster, a Cytoscape plugin integrating six clustering algorithms, HC-PIN (Hierarchical Clustering algorithm in Protein Interaction Networks), OH-PIN (identifying Overlapping and Hierarchical modules in Protein Interaction Networks), IPCA (Identifying Protein Complex Algorithm), ClusterONE (Clustering with Overlapping Neighborhood Expansion), DCU (Detecting Complexes based on Uncertain graph model), and IPC-MCE (Identifying Protein Complexes based on Maximal Complex Extension), together with the BinGO (the Biological networks Gene Ontology) function. Users can select different clustering algorithms according to their requirements. The main function of these six clustering algorithms is to detect protein complexes or functional modules. In addition, BinGO is used to determine which Gene Ontology (GO) categories are statistically overrepresented in a set of genes or a subgraph of a biological network. CytoCluster can be easily expanded, so that more clustering algorithms and functions can be added to this plugin. Since it was created in July 2013, CytoCluster has been downloaded more than 9700 times in the Cytoscape App Store and has already been applied to the analysis of different biological networks. CytoCluster is available from http://apps.cytoscape.org/apps/cytocluster.
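The over-representation question described above (is a GO category seen more often in a gene set than chance would predict?) is commonly framed as a hypergeometric tail probability. The sketch below illustrates that general idea only; it is not taken from the plugin, the function name `hypergeom_enrichment_p` is hypothetical, and the counts are made-up illustration values rather than data from the paper. Whether a given analysis uses this test or another one (and which multiple-testing correction) is an assumption to check against the tool's settings.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """Return P(X >= hits) for a hypergeometric draw.

    population: number of genes in the reference set
    annotated:  genes in the reference set that carry the GO category
    sample:     number of genes in the cluster or subgraph being tested
    hits:       genes in the sample that carry the GO category
    """
    total = comb(population, sample)
    upper = min(annotated, sample)
    # Sum the probability mass of every outcome at least as extreme as `hits`.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical counts, for illustration only.
p_value = hypergeom_enrichment_p(population=20, annotated=5, sample=4, hits=2)
print(f"enrichment p-value: {p_value:.4f}")
```

In practice the p-value would be computed for every candidate category and then corrected for multiple testing; that step is left out here to keep the sketch short.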
The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

PubMed

Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

2011-02-01

... display a different cellular localization compared to that of the gsdf gene, indicating that the latter gene is not co-regulated. Interestingly, our study identifies new clustered genes that are specifically expressed in previtellogenic oocytes (nup54, aff1, klhl8, sdad1). Copyright © 2010 Elsevier B.V. All rights reserved.
The redox-sensing protein Rex modulates ethanol production in Thermoanaerobacterium saccharolyticum

PubMed Central

Lanahan, Anthony A.; Lynd, Lee R.

2018-01-01

Thermoanaerobacterium saccharolyticum is a thermophilic anaerobe that has been engineered to produce high amounts of ethanol, reaching ~90% theoretical yield at a titer of 70 g/L. Here we report the physiological changes that occur upon deleting the redox-sensing transcriptional regulator Rex in wild type T. saccharolyticum: a single deletion of rex resulted in a two-fold increase in ethanol yield (from 40% to 91% theoretical yield), but the resulting strains grew only about a third as fast as the wild type strain. Deletion of the rex gene also had the effect of increasing expression of alcohol dehydrogenase genes, adhE and adhA. After several serial transfers, the ethanol yield decreased from an average of 91% to 55%, and the growth rates had increased. We performed whole-genome resequencing to identify secondary mutations in the Δrex strains adapted for faster growth. In several cases, secondary mutations had appeared in the adhE gene. Furthermore, in these strains the NADH-linked alcohol dehydrogenase activity was greatly reduced. Complementation studies were done to reintroduce rex into the Δrex strains: reintroducing rex decreased ethanol yield to below wild type levels in the Δrex strain without adhE mutations, but did not change the ethanol yield in the Δrex strain where an adhE mutation occurred. PMID:29621294
Transient regulation of three clustered tomato class-I small heat-shock chaperone genes by ethylene is mediated by SlMADS-RIN transcription factor

USDA-ARS's Scientific Manuscript database

An intronless cluster of three class I small heat shock protein (sHSP) chaperone genes, Sl17.6, Sl20.0 and Sl20.1, resident on the short arm of chromosome 6 in tomato, was previously characterized (Goyal et al., 2012). This shsp chaperone gene cluster was found decorated with cis sequences known to ...
Elucidating the contributions of multiple aldehyde/alcohol dehydrogenases to butanol and ethanol production in Clostridium acetobutylicum.

PubMed

Dai, Zongjie; Dong, Hongjun; Zhang, Yanping; Li, Yin

2016-06-20

Ethanol and butanol biosynthesis in Clostridium acetobutylicum share common aldehyde/alcohol dehydrogenases. However, little is known about the relative contributions of these multiple dehydrogenases to ethanol and butanol production. The contributions of six aldehyde/alcohol dehydrogenases of C. acetobutylicum to butanol and ethanol production were evaluated by inactivating each of the corresponding genes. For butanol production, the relative contributions from these enzymes were: AdhE1 > BdhB > BdhA ≈ YqhD > SMB_P058 > AdhE2. For ethanol production, the contributions were: AdhE1 > BdhB > YqhD > SMB_P058 > AdhE2 > BdhA. AdhE1 and BdhB are two essential enzymes for butanol and ethanol production. AdhE1 was relatively specific for butanol production over ethanol, while BdhB, YqhD, and SMB_P058 favor ethanol production over butanol. Butanol synthesis was increased in the adhE2 mutant, which had a higher butanol/ethanol ratio (8.15:1) compared with the wild type strain (6.65:1). Both the SMB_P058 mutant and the yqhD mutant produced less ethanol without loss of butanol formation, which led to higher butanol/ethanol ratios, 10.12:1 and 10.17:1, respectively. To engineer a more efficient butanol-producing strain, adhE1 could be overexpressed; furthermore, adhE2, SMB_P058, and yqhD are promising gene inactivation targets. This work provides useful information guiding future strain improvement for butanol production.
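The butanol/ethanol ratios quoted above can be compared against the wild type with a few lines of arithmetic. The sketch below uses only the four ratios stated in the abstract; the helper names `fold_change` and `rank_by_ratio` are illustrative and not part of the original work.

```python
# Butanol/ethanol ratios as reported in the abstract above.
WILD_TYPE_RATIO = 6.65
mutant_ratios = {"adhE2": 8.15, "SMB_P058": 10.12, "yqhD": 10.17}


def fold_change(ratio, baseline=WILD_TYPE_RATIO):
    """Ratio of a mutant's butanol/ethanol ratio to the wild-type ratio."""
    return ratio / baseline


def rank_by_ratio(ratios):
    """Mutant names ordered from highest to lowest butanol/ethanol ratio."""
    return sorted(ratios, key=ratios.get, reverse=True)


for gene in rank_by_ratio(mutant_ratios):
    ratio = mutant_ratios[gene]
    print(f"{gene}: {ratio:.2f}:1 ({fold_change(ratio):.2f}x wild type)")
```

A higher ratio here reflects product selectivity only; it says nothing about absolute butanol titer, which the abstract reports separately in qualitative terms.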
Genes encoding major light-harvesting polypeptides are clustered on the genome of the cyanobacterium Fremyella diplosiphon.

PubMed Central

Conley, P B; Lemaux, P G; Lomax, T L; Grossman, A R

1986-01-01

The polypeptide composition of the phycobilisome, the major light-harvesting complex of prokaryotic cyanobacteria and certain eukaryotic algae, can be modulated by different light qualities in cyanobacteria exhibiting chromatic adaptation. We have identified genomic fragments encoding a cluster of phycobilisome polypeptides (phycobiliproteins) from the chromatically adapting cyanobacterium Fremyella diplosiphon using previously characterized DNA fragments of phycobiliprotein genes from the eukaryotic alga Cyanophora paradoxa and from F. diplosiphon. Characterization of two lambda-EMBL3 clones containing overlapping genomic fragments indicates that three sets of phycobiliprotein genes--the alpha- and beta-allophycocyanin genes plus two sets of alpha- and beta-phycocyanin genes--are clustered within 13 kilobases on the cyanobacterial genome and transcribed off the same strand. The gene order (alpha-allophycocyanin followed by beta-allophycocyanin and beta-phycocyanin followed by alpha-phycocyanin) appears to be a conserved arrangement found previously in a eukaryotic alga and another cyanobacterium. We have reported that one set of phycocyanin genes is transcribed as two abundant red light-induced mRNAs (1600 and 3800 bases). We now present data showing that the allophycocyanin genes and a second set of phycocyanin genes are transcribed into major mRNAs of 1400 and 1600 bases, respectively. These transcripts are present in RNA isolated from cultures grown in red and green light, although lower levels of the 1600-base phycocyanin transcript are present in cells grown in green light. Furthermore, a larger transcript of 1750 bases hybridizes to the allophycocyanin genes and may be a precursor to the 1400-base species. PMID:3086870
A cluster of culture positive gonococcal infections but with false negative cppB gene based PCR.

PubMed

Lum, G; Freeman, K; Nguyen, N L; Limnios, E A; Tabrizi, S N; Carter, I; Chambers, I W; Whiley, D M; Sloots, T P; Garland, S M; Tapsall, J W

2005-10-01

To describe the prevalence and characteristics of isolates of Neisseria gonorrhoeae grown from urine samples that produced negative results with nucleic acid amplification assays (NAA) targeting the cppB gene. An initial cluster of culture positive, but cppB gene based NAA negative, gonococcal infections was recognised. Urine samples and suspensions of gonococci isolated over 9 months in the Northern Territory of Australia were examined using cppB gene based and other non-cppB gene based NAA. The gonococcal isolates were phenotyped by determining the auxotype/serovar (A/S) class and genotyped by pulsed field gel electrophoresis (PFGE). 14 (9.8%) of 143 gonococci isolated were of A/S class Pro(-)/Brpyut, indistinguishable on PFGE and negative in cppB gene based, but not other, NAA. This cluster represents a temporal and geographic expansion of a gonococcal subtype lacking the cppB gene with consequent loss of sensitivity of NAA dependent on amplification of this target. Gonococci lacking the cppB gene have in the past been more commonly associated with the PAU-/PCU- auxotype, a gonococcal subtype hitherto infrequently encountered in Australia. NAA based on the cppB gene as a target may produce false positive as well as false negative results. This suggests that unless there is continuing comparison with culture to show their utility, cppB gene based NAA should be regarded as suboptimal for use either as a diagnostic or supplemental assay for diagnosis of gonorrhoea, and NAA with alternative amplification targets should be substituted.
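The reported proportion (14 of 143 isolates, 9.8%) can be reproduced directly, and a confidence interval gives a sense of how precisely a sample of this size pins it down. The sketch below is an added illustration: the abstract reports no interval, and the choice of the Wilson score interval and the function name `proportion_with_wilson_ci` are assumptions made here, not part of the study.

```python
from math import sqrt


def proportion_with_wilson_ci(successes, total, z=1.96):
    """Return (proportion, lower, upper) using the Wilson score interval.

    z=1.96 corresponds to an approximate 95% interval.
    """
    p = successes / total
    denom = 1 + z**2 / total
    centre = (p + z**2 / (2 * total)) / denom
    half_width = z * sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return p, centre - half_width, centre + half_width


# Counts taken from the abstract: 14 cppB-negative isolates out of 143.
p, lower, upper = proportion_with_wilson_ci(14, 143)
print(f"proportion: {100 * p:.1f}%  (interval: {100 * lower:.1f}% to {100 * upper:.1f}%)")
```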
Expression-based clustering of CAZyme-encoding genes of Aspergillus niger.

PubMed

Gruben, Birgit S; Mäkelä, Miia R; Kowalczyk, Joanna E; Zhou, Miaomiao; Benoit-Gelber, Isabelle; De Vries, Ronald P

2017-11-23

The Aspergillus niger genome contains a large repertoire of genes encoding carbohydrate active enzymes (CAZymes) that are targeted to plant polysaccharide degradation, enabling A. niger to grow on a wide range of plant biomass substrates. Which genes need to be activated in certain environmental conditions depends on the composition of the available substrate. Previous studies have demonstrated the involvement of a number of transcriptional regulators in plant biomass degradation and have identified sets of target genes for each regulator. In this study, a broad transcriptional analysis was performed of the A. niger genes encoding (putative) plant polysaccharide degrading enzymes. Microarray data focusing on the initial response of A. niger to the presence of plant biomass related carbon sources were analyzed for the wild-type strain N402, grown on a large range of carbon sources, and for the regulatory mutant strains ΔxlnR, ΔaraR, ΔamyR, ΔrhaR and ΔgalX, grown on their specific inducing compounds. The cluster analysis of the expression data revealed several groups of co-regulated genes, which go beyond the traditionally described co-regulated gene sets. Additional putative target genes of the selected regulators were identified, based on their expression profile. Notably, in several cases the expression profile calls into question the function assignment of uncharacterized genes that was based on homology searches, highlighting the need for more extensive biochemical studies into the substrate specificity of enzymes encoded by these non-characterized genes. The data also revealed sets of genes that were upregulated in the regulatory mutants, suggesting interaction between the regulatory systems and therefore an even more complex overall regulatory network than has been reported so far. Expression profiling on a large number of substrates provides better insight into the complex regulatory systems that drive the conversion of plant biomass by fungi. In
Interactions of Environmental Factors and APOA1-APOC3-APOA4-APOA5 Gene Cluster Gene Polymorphisms with Metabolic Syndrome.

PubMed

Wu, Yanhua; Yu, Yaqin; Zhao, Tiancheng; Wang, Shibin; Fu, Yingli; Qi, Yue; Yang, Guang; Yao, Wenwang; Su, Yingying; Ma, Yue; Shi, Jieping; Jiang, Jing; Kou, Changgui

2016-01-01

The present study investigated the prevalence of and risk factors for metabolic syndrome (MetS). We evaluated the association between single nucleotide polymorphisms (SNPs) in the apolipoprotein APOA1/C3/A4/A5 gene cluster and MetS risk and analyzed the interactions of environmental factors and APOA1/C3/A4/A5 gene cluster polymorphisms with MetS. A study on the prevalence and risk factors for MetS was conducted using data from a large cross-sectional survey representative of the population of Jilin Province situated in northeastern China. A total of 16,831 participants were randomly chosen by multistage stratified cluster sampling of residents aged from 18 to 79 years in all nine administrative areas of the province. Environmental factors associated with MetS were examined using univariate and multivariate logistic regression analyses based on the weighted sample data. A sub-sample of 1813 survey subjects who met the criteria for MetS patients and 2037 controls from this case-control study were used to evaluate the association between SNPs and MetS risk. Genomic DNA was extracted from peripheral blood lymphocytes, and SNP genotyping was determined by MALDI-TOF-MS. The associations between SNPs and MetS were examined using a case-control study design. The interactions of environmental factors and APOA1/C3/A4/A5 gene cluster polymorphisms with MetS were assessed using multivariate logistic regression analysis. The overall adjusted prevalence of MetS was 32.86% in Jilin province. The prevalence of MetS in men was 36.64%, which was significantly higher than the prevalence in women (29.66%). MetS was more common in urban areas (33.86%) than in rural areas (31.80%). The prevalence of MetS significantly increased with age (OR = 8.621, 95%CI = 6.594-11.272). Mental labor (OR = 1.098, 95%CI = 1.008-1.195), current smoking (OR = 1.259, 95%CI = 1.108-1.429), excess salt intake (OR = 1.252, 95%CI = 1.149-1.363), and a fruit and dairy intake less than 2 servings a week were
Transcriptional organization of the DNA region controlling expression of the K99 gene cluster.

PubMed

Roosendaal, B; Damoiseaux, J; Jordi, W; de Graaf, F K

1989-01-01

The transcriptional organization of the K99 gene cluster was investigated in two ways: first, the DNA region containing the transcriptional signals was analyzed using a transcription vector system with Escherichia coli galactokinase (GalK) as an assayable marker; second, an in vitro transcription system was employed. A detailed analysis of the transcription signals revealed that a strong promoter PA and a moderate promoter PB are located upstream of fanA and fanB, respectively. No promoter activity was detected in the intercistronic region between fanB and fanC. Factor-dependent terminators of transcription were detected and are probably located in the intercistronic region between fanA and fanB (T1), and between fanB and fanC (T2). A third terminator (T3) was observed between fanC and fanD and has an efficiency of 90%. Analysis of the regulatory region in an in vitro transcription system confirmed the location of the respective transcription signals. A model for the transcriptional organization of the K99 cluster is presented. Indications were obtained that the trans-acting regulatory polypeptides FanA and FanB both function as anti-terminators. A model for the regulation of expression of the K99 gene cluster is postulated.
Ancient Expansion of the Hox Cluster in Lepidoptera Generated Four Homeobox Genes Implicated in Extra-Embryonic Tissue Formation

PubMed Central

Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.

2014-01-01

Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of the large clade Ditrysia; Shx genes are not found in a caddisfly or in a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendant Lepidoptera except the silkmoth, which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822
Complex regulation of the aflatoxin biosynthesis gene cluster of Aspergillus flavus in relation to various combinations of water activity and temperature.

PubMed

Schmidt-Heydt, Markus; Abdel-Hadi, Ahmed; Magan, Naresh; Geisen, Rolf

2009-11-15

A microarray analysis was performed to study the effect of varying combinations of water activity and temperature on the activation of aflatoxin biosynthesis genes in Aspergillus flavus grown on YES medium. Generally, A. flavus expressed the aflatoxin biosynthetic genes at all parameter combinations tested. Certain combinations of a(w) and temperature, especially combinations which imposed stress on the fungus, resulted in a significant reduction of the growth rate. Under these conditions, the whole aflatoxin biosynthesis gene cluster was induced; however, aflatoxin B1 production was low. At all other combinations (25 °C/0.95 and 0.99; 30 °C/0.95 and 0.99; 35 °C/0.95 and 0.99) a reduced basal level of cluster gene expression occurred. At these combinations a high growth rate was obtained as well as high aflatoxin production. When single genes were compared, two groups with different expression profiles in relation to water activity/temperature combinations occurred. These two groups were co-ordinately localized within the aflatoxin gene cluster. The ratio of aflR/aflJ expression was correlated with increased aflatoxin biosynthesis.
The major resistance gene cluster in lettuce is highly duplicated and spans several megabases.

PubMed Central

Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W

1998-01-01

At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over. PMID:9811791
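The nucleotide identity figure quoted above (as low as 53% between the most distant RGC2 copies) is a pairwise measure over aligned sequences. The sketch below shows one simple way such a percentage can be computed. The abstract does not state how gaps were treated, so skipping gapped columns is an assumption, the function name `percent_identity` is hypothetical, and the short sequences are toy examples rather than RGC2 data.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent of identical positions across two pre-aligned sequences.

    Columns where either sequence has a gap are skipped (an assumed
    convention; other definitions count gaps in the denominator).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned sequences, for illustration only.
print(percent_identity("ACGTACGT", "ACGTTCGA"))
```

Because the result depends on the alignment and on the gap convention, identity values from different studies are only comparable when those choices match.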
Ribulose bisphosphate carboxylase activity and a Calvin cycle gene cluster in Sulfobacillus species.

PubMed

Caldwell, Paul E; MacLean, Martin R; Norris, Paul R

2007-07-01

The Calvin-Benson-Bassham (CBB) cycle has been extensively studied in proteobacteria, cyanobacteria, algae and plants, but hardly at all in Gram-positive bacteria. Some characteristics of ribulose bisphosphate carboxylase/oxygenase (RuBisCO) and a cluster of potential CBB cycle genes in a Gram-positive bacterium are described in this study with two species of Sulfobacillus (Gram-positive, facultatively autotrophic, mineral sulfide-oxidizing acidophiles). In contrast to the Gram-negative, iron-oxidizing acidophile Acidithiobacillus ferrooxidans, Sulfobacillus thermosulfidooxidans grew poorly autotrophically unless the CO2 concentration was enhanced over that in air. However, the RuBisCO of each organism showed similar affinities for CO2 and for ribulose 1,5-bisphosphate, and similar apparent derepression of activity under CO2 limitation. The red-type, form I RuBisCO of Sulfobacillus acidophilus was confirmed as closely related to that of the anoxygenic phototroph Oscillochloris trichoides. Eight genes potentially involved in the CBB cycle in S. acidophilus were clustered in the order cbbA, cbbP, cbbE, cbbL, cbbS, cbbX, cbbG and cbbT.
antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences.

PubMed

Medema, Marnix H; Blin, Kai; Cimermancic, Peter; de Jager, Victor; Zakrzewski, Piotr; Fischbach, Michael A; Weber, Tilmann; Takano, Eriko; Breitling, Rainer

2011-07-01

Bacterial and fungal secondary metabolism is a rich source of novel bioactive compounds with potential pharmaceutical applications as antibiotics, anti-tumor drugs or cholesterol-lowering drugs. To find new drug candidates, microbiologists are increasingly relying on sequencing genomes of a wide variety of microbes. However, rapidly and reliably pinpointing all the potential gene clusters for secondary metabolites in dozens of newly sequenced genomes has been extremely challenging, due to their biochemical heterogeneity, the presence of unknown enzymes and the dispersed nature of the necessary specialized bioinformatics tools and resources. Here, we present antiSMASH (antibiotics & Secondary Metabolite Analysis Shell), the first comprehensive pipeline capable of identifying biosynthetic loci covering the whole range of known secondary metabolite compound classes (polyketides, non-ribosomal peptides, terpenes, aminoglycosides, aminocoumarins, indolocarbazoles, lantibiotics, bacteriocins, nucleosides, beta-lactams, butyrolactones, siderophores, melanins and others). It aligns the identified regions at the gene cluster level to their nearest relatives from a database containing all other known gene clusters, and integrates or cross-links all previously available secondary-metabolite specific gene analysis methods in one interactive view. antiSMASH is available at http://antismash.secondarymetabolites.org.
A Metabolic Gene Cluster in the Wheat W1 and the Barley Cer-cqu Loci Determines β-Diketone Biosynthesis and Glaucousness

PubMed Central

Lee, Wing-Sham; Malitsky, Sergey; Almekias-Siegl, Efrat; Levy, Matan; Ben-Zvi, Gil; Alkan, Noam; Uauy, Cristobal; Jetter, Reinhard

2016-01-01

The glaucous appearance of wheat (Triticum aestivum) and barley (Hordeum vulgare) plants, that is the light bluish-gray look of flag leaf, stem, and spike surfaces, results from deposition of cuticular β-diketone wax on their surfaces; this phenotype is associated with high yield, especially under drought conditions. Despite extensive genetic and biochemical characterization, the molecular genetic basis underlying the biosynthesis of β-diketones remains unclear. Here, we discovered that the wheat W1 locus contains a metabolic gene cluster mediating β-diketone biosynthesis. The cluster comprises genes encoding proteins of several families including type-III polyketide synthases, hydrolases, and cytochrome P450s related to known fatty acid hydroxylases. The cluster region was identified in both genetic and physical maps of glaucous and glossy tetraploid wheat, demonstrating entirely different haplotypes in these accessions. Complementary evidence obtained through gene silencing in planta and heterologous expression in bacteria supports a model for a β-diketone biosynthesis pathway involving members of these three protein families. Mutations in homologous genes were identified in the barley eceriferum mutants defective in β-diketone biosynthesis, demonstrating a gene cluster also in the β-diketone biosynthesis Cer-cqu locus in barley. Hence, our findings open new opportunities to breed major cereal crops for surface features that impact yield and stress response. PMID:27225753
