ketoreductase gilu gene: Topics by Science.gov

Sample records for ketoreductase gilu gene

Polyketide intermediate mimics as probes for revealing cryptic stereochemistry of ketoreductase domains.

PubMed

Li, Yang; Fiers, William D; Bernard, Steffen M; Smith, Janet L; Aldrich, Courtney C; Fecik, Robert A

2014-12-19

Among natural product families, polyketides have shown the most promise for combinatorial biosynthesis of natural product-like libraries. Though recent research in the area has provided many mechanistic revelations, a basic-level understanding of kinetic and substrate tolerability is still needed before the full potential of combinatorial biosynthesis can be realized. We have developed a novel set of chemical probes for the study of ketoreductase domains of polyketide synthases. This chemical tool-based approach was validated using the ketoreductase of pikromycin module 2 (PikKR2) as a model system. Triketide substrate mimics 12 and 13 were designed to increase stability (incorporating a nonhydrolyzable thioether linkage) and minimize nonessential functionality (truncating the phosphopantetheinyl arm). PikKR2 reduction product identities as well as steady-state kinetic parameters were determined by a combination of LC-MS/MS analysis of synthetic standards and a NADPH consumption assay. The d-hydroxyl product is consistent with bioinformatic analysis and results from a complementary biochemical and molecular biological approach. When compared to widely employed substrates in previous studies, diketide 63 and trans-decalone 64, substrates 12 and 13 showed 2-10 fold lower K(M) values (2.4 ± 0.8 and 7.8 ± 2.7 mM, respectively), indicating molecular recognition of intermediate-like substrates. Due to an abundance of the nonreducable enol-tautomer, the k(cat) values were attenuated by as much as 15-336 fold relative to known substrates. This study reveals the high stereoselectivity of PikKR2 in the face of gross substrate permutation, highlighting the utility of a chemical probe-based approach in the study of polyketide ketoreductases.
Inhibition Kinetics And Emodin Cocrystal Structure of a Type II Polyketide Ketoreductase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Korman, T.P.; Tan, Y.-H.; Wong, J.

Type II polyketides are a class of natural products that include pharmaceutically important aromatic compounds such as the antibiotic tetracycline and antitumor compound doxorubicin. The type II polyketide synthase (PKS) is a complex consisting of 5-10 standalone domains homologous to fatty acid synthase (FAS). Polyketide ketoreductase (KR) provides regio- and stereochemical diversity during the reduction. How the type II polyketide KR specifically reduces only the C9 carbonyl group is not well understood. The cocrystal structures of actinorhodin polyketide ketoreductase (actKR) bound with NADPH or NADP{sup +} and the inhibitor emodin were solved with the wild type and P94L mutant ofmore » actKR, revealing the first observation of a bent p-quinone in an enzyme active site. Molecular dynamics simulation help explain the origin of the bent geometry. Extensive screening for in vitro substrates shows that unlike FAS KR, the actKR prefers bicyclic substrates. Inhibition kinetics indicate that actKR follows an ordered Bi Bi mechanism. Together with docking simulations that identified a potential phosphopantetheine binding groove, the structural and functional studies reveal that the C9 specificity is a result of active site geometry and substrate ring constraints. The results lay the foundation for the design of novel aromatic polyketide natural products with different reduction patterns.« less
Synthesis of Vibegron Enabled by a Ketoreductase Rationally Designed for High pH Dynamic Kinetic Reduction.

PubMed

Xu, Feng; Kosjek, Birgit; Cabirol, Fabien L; Chen, Haibin; Desmond, Richard; Park, Jeonghan; Gohel, Anupam P; Collier, Steven J; Smith, Derek J; Liu, Zhuqing; Janey, Jacob M; Chung, John Y L; Alvizo, Oscar

2018-06-04

Described here is an efficient stereoselective synthesis of vibegron enabled by an enzymatic dynamic kinetic reduction that proceeds in a high-pH environment. To overcome enzyme performance limitations under these conditions, a ketoreductase was evolved by a computationally and structurally aided strategy to increase cofactor stability through tighter binding. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
SimC7 Is a Novel NAD(P)H-Dependent Ketoreductase Essential for the Antibiotic Activity of the DNA Gyrase Inhibitor Simocyclinone.

PubMed

Schäfer, Martin; Le, Tung B K; Hearnshaw, Stephen J; Maxwell, Anthony; Challis, Gregory L; Wilkinson, Barrie; Buttner, Mark J

2015-06-19

Simocyclinone D8 (SD8) is a potent DNA gyrase inhibitor produced by Streptomyces antibioticus Tü6040. The simocyclinone (sim) biosynthetic gene cluster has been sequenced and a hypothetical biosynthetic pathway has been proposed. The tetraene linker in SD8 was suggested to be the product of a modular type I polyketide synthase working in trans with two monofunctional enzymes. One of these monofunctional enzymes, SimC7, was proposed to supply a dehydratase activity missing from two modules of the polyketide synthase. In this study, we report the function of SimC7. We isolated the entire ~72-kb sim cluster on a single phage artificial chromosome clone and produced simocyclinone heterologously in a Streptomyces coelicolor strain engineered for improved antibiotic production. Deletion of simC7 resulted in the production of a novel simocyclinone, 7-oxo-SD8, which unexpectedly carried a normal tetraene linker but was altered in the angucyclinone moiety. We demonstrate that SimC7 is an NAD(P)H-dependent ketoreductase that catalyzes the conversion of 7-oxo-SD8 into SD8. 7-oxo-SD8 was essentially inactive as a DNA gyrase inhibitor, and the reduction of the keto group by SimC7 was shown to be crucial for high-affinity binding to the enzyme. Thus, SimC7 is an angucyclinone ketoreductase that is essential for the biological activity of simocyclinone. Copyright © 2015. Published by Elsevier Ltd.
Enantiomeric scaffolding of α-tetralone and related scaffolds by EKR (enzymatic kinetic resolution) and stereoselective ketoreduction with ketoreductases.

PubMed

Bhuniya, Rajib; Nanda, Samik

2012-01-21

Stereochemically pure compounds containing an all carbon quaternary stereocenter based on 1-tetralone, 1-indanone and 4-chromanone scaffolds have been synthesized by employing Lipase PS (Burkholderia cepacia) catalyzed kinetic resolution. These scaffolds are further functionalized by microbial ketoreductase enzymes (Geotrichum candidum, Candida parapsilosis and Aspergillus niger) to access stereochemically pure diols which, on further synthetic manipulation, yield novel cyclic compounds.
Origins of stereoselectivity in evolved ketoreductases.

PubMed

Noey, Elizabeth L; Tibrewal, Nidhi; Jiménez-Osés, Gonzalo; Osuna, Sílvia; Park, Jiyong; Bond, Carly M; Cascio, Duilio; Liang, Jack; Zhang, Xiyun; Huisman, Gjalt W; Tang, Yi; Houk, Kendall N

2015-12-22

Mutants of Lactobacillus kefir short-chain alcohol dehydrogenase, used here as ketoreductases (KREDs), enantioselectively reduce the pharmaceutically relevant substrates 3-thiacyclopentanone and 3-oxacyclopentanone. These substrates differ by only the heteroatom (S or O) in the ring, but the KRED mutants reduce them with different enantioselectivities. Kinetic studies show that these enzymes are more efficient with 3-thiacyclopentanone than with 3-oxacyclopentanone. X-ray crystal structures of apo- and NADP(+)-bound selected mutants show that the substrate-binding loop conformational preferences are modified by these mutations. Quantum mechanical calculations and molecular dynamics (MD) simulations are used to investigate the mechanism of reduction by the enzyme. We have developed an MD-based method for studying the diastereomeric transition state complexes and rationalize different enantiomeric ratios. This method, which probes the stability of the catalytic arrangement within the theozyme, shows a correlation between the relative fractions of catalytically competent poses for the enantiomeric reductions and the experimental enantiomeric ratio. Some mutations, such as A94F and Y190F, induce conformational changes in the active site that enlarge the small binding pocket, facilitating accommodation of the larger S atom in this region and enhancing S-selectivity with 3-thiacyclopentanone. In contrast, in the E145S mutant and the final variant evolved for large-scale production of the intermediate for the antibiotic sulopenem, R-selectivity is promoted by shrinking the small binding pocket, thereby destabilizing the pro-S orientation.
Origins of stereoselectivity in evolved ketoreductases

PubMed Central

Noey, Elizabeth L.; Tibrewal, Nidhi; Jiménez-Osés, Gonzalo; Osuna, Sílvia; Park, Jiyong; Bond, Carly M.; Cascio, Duilio; Liang, Jack; Zhang, Xiyun; Huisman, Gjalt W.; Tang, Yi; Houk, Kendall N.

2015-01-01

Mutants of Lactobacillus kefir short-chain alcohol dehydrogenase, used here as ketoreductases (KREDs), enantioselectively reduce the pharmaceutically relevant substrates 3-thiacyclopentanone and 3-oxacyclopentanone. These substrates differ by only the heteroatom (S or O) in the ring, but the KRED mutants reduce them with different enantioselectivities. Kinetic studies show that these enzymes are more efficient with 3-thiacyclopentanone than with 3-oxacyclopentanone. X-ray crystal structures of apo- and NADP+-bound selected mutants show that the substrate-binding loop conformational preferences are modified by these mutations. Quantum mechanical calculations and molecular dynamics (MD) simulations are used to investigate the mechanism of reduction by the enzyme. We have developed an MD-based method for studying the diastereomeric transition state complexes and rationalize different enantiomeric ratios. This method, which probes the stability of the catalytic arrangement within the theozyme, shows a correlation between the relative fractions of catalytically competent poses for the enantiomeric reductions and the experimental enantiomeric ratio. Some mutations, such as A94F and Y190F, induce conformational changes in the active site that enlarge the small binding pocket, facilitating accommodation of the larger S atom in this region and enhancing S-selectivity with 3-thiacyclopentanone. In contrast, in the E145S mutant and the final variant evolved for large-scale production of the intermediate for the antibiotic sulopenem, R-selectivity is promoted by shrinking the small binding pocket, thereby destabilizing the pro-S orientation. PMID:26644568
Stereoselective reduction of aromatic ketones by a new ketoreductase from Pichia glucozyma.

PubMed

Contente, Martina Letizia; Serra, Immacolata; Brambilla, Marta; Eberini, Ivano; Gianazza, Elisabetta; De Vitis, Valerio; Molinari, Francesco; Zambelli, Paolo; Romano, Diego

2016-01-01

A new NADPH-dependent benzil reductase (KRED1-Pglu) was identified from the genome of the non-conventional yeast Pichia glucozyma CBS 5766 and overexpressed in E. coli. The new protein was characterised and reaction parameters were optimised for the enantioselective reduction of benzil to (S)-benzoin. A thorough study of the substrate range of KRED1-Pglu was conducted; in contrast to most other known ketoreductases, KRED1-Pglu prefers space-demanding substrates, which are often converted with high stereoselectivity. A molecular modelling study was carried out for understanding the structural determinants involved in the stereorecognition experimentally observed and unpredictable on the basis of steric properties of the substrates. As a result, a new useful catalyst was identified, enabling the enantioselective preparation of different aromatic alcohols and hydroxyketones.
Recognition of Acyl Carrier Proteins by Ketoreductases in Assembly Line Polyketide Synthases

PubMed Central

Ostrowski, Matthew P.; Cane, David E.; Khosla, Chaitan

2016-01-01

Ketoreductases (KRs) are the most widespread tailoring domains found in individual modules of assembly line polyketide synthases (PKSs), and are responsible for controlling the configurations of both the α-methyl and β-hydroxyl stereogenic centers in the growing polyketide chain. Because they recognize substrates that are covalently bound to acyl carrier proteins (ACPs) within the same PKS module, we sought to quantify the extent to which protein-protein recognition contributes to the turnover of these oxidoreductive enzymes using stand-alone domains from the 6-deoxyerythronolide B synthase (DEBS). Reduced 2-methyl-3-hydroxyacyl-ACP substrates derived from two enantiomeric acyl chains and four distinct ACP domains were synthesized and presented to four distinct KR domains. Two KRs, from DEBS modules 2 and 5, displayed little preference for oxidation of substrates tethered to their cognate ACP domains over those attached to the other ACP domains tested. In contrast, the KR from DEBS module 1 showed a ca. 10-50-fold preference for substrate attached to its native ACP domain, whereas the KR from DEBS module 6 actually displayed a ca. 10-fold preference for the ACP from DEBS module 5. Our findings suggest that recognition of the ACP by a KR domain is unlikely to affect the rate of native assembly line polyketide biosynthesis. In some cases, however, unfavorable KR-ACP interactions may suppress the rate of substrate processing when KR domains are swapped to construct hybrid PKS modules. PMID:27118242
Identification and Characterization of the Pyridomycin Biosynthetic Gene Cluster of Streptomyces pyridomyceticus NRRL B-2517*

PubMed Central

Huang, Tingting; Wang, Yemin; Yin, Jun; Du, Yanhua; Tao, Meifeng; Xu, Jing; Chen, Wenqing; Lin, Shuangjun; Deng, Zixin

2011-01-01

Pyridomycin is a structurally unique antimycobacterial cyclodepsipeptide containing rare 3-(3-pyridyl)-l-alanine and 2-hydroxy-3-methylpent-2-enoic acid moieties. The biosynthetic gene cluster for pyridomycin has been cloned and identified from Streptomyces pyridomyceticus NRRL B-2517. Sequence analysis of a 42.5-kb DNA region revealed 26 putative open reading frames, including two nonribosomal peptide synthetase (NRPS) genes and a polyketide synthase gene. A special feature is the presence of a polyketide synthase-type ketoreductase domain embedded in an NRPS. Furthermore, we showed that PyrA functioned as an NRPS adenylation domain that activates 3-hydroxypicolinic acid and transfers it to a discrete peptidyl carrier protein, PyrU, which functions as a loading module that initiates pyridomycin biosynthesis in vivo and in vitro. PyrA could also activate other aromatic acids, generating three pyridomycin analogues in vivo. PMID:21454714
Structural and Biochemical Analyses of Regio- and Stereo-Specificities Observed in a Type II Polyketide Ketoreductase

PubMed Central

Javidpour, Pouya; Korman, Tyler Paz; Shakya, Gaurav; Tsai, Shiou-Chuan

2011-01-01

Type II polyketides include antibiotics such as tetracycline, and chemotherapeutics such as daunorubicin. Type II polyketides are biosynthesized by the type II polyketide synthase (PKS) that consists of 5 – 10 stand-alone domains. In many type II PKSs, the type II ketoreductase (KR) specifically reduce the C9-carbonyl group. How the type II KR achieves such a high regio-specificity, and the nature of stereo-specificity, are not well understood. Sequence alignment of KRs led to a hypothesis that a well-conserved 94-XGG-96 motif may be involved in controlling the stereochemistry. The stereo-specificity of single, double and triple mutant combinations of P94L, G95D and G96D were analyzed in vitro and in vivo for the actinorhodin KR (actKR). The P94L mutation is sufficient to change the stereospecificity of actKR. Binary and ternary crystal structures of both wild type and P94L actKR were solved. Together with assay results, docking simulations, and co-crystal structures, a model for stereochemical control is presented herein that elucidates how type II polyketides are introduced into the substrate pocket such that the C9-carbonyl can be reduced with high regio- and stereo-specificities. The molecular features of actKR important for regio- and stereo-specificities can potentially be applied to biosynthesize new polyketides via protein engineering that rationally controls polyketide ketoreduction. PMID:21506596
Generation of human endometrial knockout cell lines with the CRISPR/Cas9 system confirms the prostaglandin F2α synthase activity of aldo-ketoreductase 1B1.

PubMed

Lacroix Pépin, Nicolas; Chapdelaine, Pierre; Rodriguez, Yoima; Tremblay, Jacques-P; Fortier, Michel A

2014-07-01

Prostaglandins (PGs) are important regulators of female reproductive function. The primary PGs produced in the endometrium are PGE2 and PGF2α. Relatively little is known about the biosynthetic pathways leading to the formation of PGF2α. We have described the role of aldo-ketoreductase (AKR)1B1 in increased PGF2α production by human endometrial cells following stimulation with interleukin-1β (IL-1β). However, alternate PGF synthases are expressed concurrently in endometrial cells. A definite proof of the role of AKR1B1 would require gene knockout; unfortunately, this gene has no direct equivalent in the mouse. Recently, an efficient genome-editing technology using RNA-guided DNase Cas9 and the clustered regularly interspaced short palindromic repeats (CRISPR) system has been developed. We have adapted this approach to knockout AKR1B1 gene expression in human endometrial cell lines. One clone (16-2) of stromal origin generated by the CRISPR/Cas9 system exhibited a complete loss of AKR1B1 protein and mRNA expression, whereas other clones presented with partial edition. The present report focuses on the characterization of clone 16-2 exhibiting deletion of 68 and 2 nucleotides, respectively, on each of the alleles. Cells from this clone lost their ability to produce PGF2α but maintained their original stromal cell (human endometrial stromal cells-2) phenotype including the capacity to decidualize in the presence of progesterone (medroxyprogesterone acetate) and 8-bromo-cAMP. Knockout cells also maintained their ability to increase PGE2 production in response to IL-1β. In summary, we demonstrate that the new genome editing CRISPR/Cas9 system can be used in human cells to generate stable knockout cell line models. Our results suggest that genome editing of human cell lines can be used to complement mouse KO models to validate the function of genes in differentiated tissues and cells. Our results also confirm that AKR1B1 is involved in the synthesis of PGF2α. �
Cloning, sequencing, and analysis of the griseusin polyketide synthase gene cluster from Streptomyces griseus.

PubMed Central

Yu, T W; Bibb, M J; Revill, W P; Hopwood, D A

1994-01-01

A fragment of DNA was cloned from the Streptomyces griseus K-63 genome by using genes (act) for the actinorhodin polyketide synthase (PKS) of Streptomyces coelicolor as a probe. Sequencing of a 5.4-kb segment of the cloned DNA revealed a set of five gris open reading frames (ORFs), corresponding to the act PKS genes, in the following order: ORF1 for a ketosynthase, ORF2 for a chain length-determining factor, ORF3 for an acyl carrier protein, ORF5 for a ketoreductase, and ORF4 for a cyclase-dehydrase. Replacement of the gris genes with a marker gene in the S. griseus genome by using a single-stranded suicide vector propagated in Escherichia coli resulted in loss of the ability to produce griseusins A and B, showing that the five gris genes do indeed encode the type II griseusin PKS. These genes, encoding a PKS that is programmed differently from those for other aromatic PKSs so far available, will provide further valuable material for analysis of the programming mechanism by the construction and analysis of strains carrying hybrid PKS. Images PMID:8169211
A ketoreductase domain in the PksJ protein of the bacillaene assembly line carries out both α- and β-ketone reduction during chain growth

PubMed Central

Calderone, Christopher T.; Bumpus, Stefanie B.; Kelleher, Neil L.; Walsh, Christopher T.; Magarvey, Nathan A.

2008-01-01

The polyketide signaling metabolites bacillaene and dihydrobacillaene are biosynthesized in Bacillus subtilis on an enzymatic assembly line with both nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) modules acting along with catalytic domains servicing the assembly line in trans. These signaling metabolites possess the unusual starter unit α-hydroxyisocaproate (α-HIC). We show here that it arises from initial activation of α-ketoisocaproate (α-KIC) by the first adenylation domain of PksJ (a hybrid PKS/NRPS) and installation on the pantetheinyl arm of the adjacent thiolation (T) domain. The α-KIC unit is elongated to α-KIC-Gly by the second NRPS module in PksJ as demonstrated by mass spectrometric analysis. The third module of PksJ uses PKS logic and contains an embedded ketoreductase (KR) domain along with two adjacent T domains. We show that this KR domain reduces canonical 3-ketobutyryl chains but also the α-keto group of α-KIC-containing intermediates on the PksJ T-domain doublet. This KR activity accounts for the α-HIC moiety found in the dihydrobacillaene/bacillaene pair and represents an example of an assembly-line dual-function α- and β-KR acting on disparate positions of a growing chain intermediate. PMID:18723688
Engineered Biosynthesis of a Novel Amidated Polyketide, Using the Malonamyl-Specific Initiation Module from the Oxytetracycline Polyketide Synthase

PubMed Central

Zhang, Wenjun; Ames, Brian D.; Tsai, Shiou-Chuan; Tang, Yi

2006-01-01

Tetracyclines are aromatic polyketides biosynthesized by bacterial type II polyketide synthases (PKSs). Understanding the biochemistry of tetracycline PKSs is an important step toward the rational and combinatorial manipulation of tetracycline biosynthesis. To this end, we have sequenced the gene cluster of oxytetracycline (oxy and otc genes) PKS genes from Streptomyces rimosus. Sequence analysis revealed a total of 21 genes between the otrA and otrB resistance genes. We hypothesized that an amidotransferase, OxyD, synthesizes the malonamate starter unit that is a universal building block for tetracycline compounds. In vivo reconstitution using strain CH999 revealed that the minimal PKS and OxyD are necessary and sufficient for the biosynthesis of amidated polyketides. A novel alkaloid (WJ35, or compound 2) was synthesized as the major product when the oxy-encoded minimal PKS, the C-9 ketoreductase (OxyJ), and OxyD were coexpressed in CH999. WJ35 is an isoquinolone compound derived from an amidated decaketide backbone and cyclized with novel regioselectivity. The expression of OxyD with a heterologous minimal PKS did not afford similarly amidated polyketides, suggesting that the oxy-encoded minimal PKS possesses novel starter unit specificity. PMID:16597959
Combined application of plasma mutagenesis and gene engineering leads to 5-oxomilbemycins A3/A4 as main components from Streptomyces bingchenggensis.

PubMed

Wang, Hai-Yan; Zhang, Ji; Zhang, Yue-Jing; Zhang, Bo; Liu, Chong-Xi; He, Hai-Rong; Wang, Xiang-Jing; Xiang, Wen-Sheng

2014-12-01

Milbemycin oxime has been commercialized as effective anthelmintics in the fields of animal health, agriculture, and human infections. Currently, milbemycin oxime is synthesized by a two-step chemical reaction, which involves the ketonization of milbemycins A3/A4 to yield the intermediates 5-oxomilbemycins A3/A4 using CrO3 as catalyst. Due to the low efficiency and environmental unfriendliness of the ketonization of milbemycins A3/A4, it is imperative to develop alternative strategies to produce 5-oxomilbemycins A3/A4. In this study, the atmospheric and room temperature plasma (ARTP) mutation system was first employed to treat milbemycin-producing strain Streptomyces bingchenggensis, and a mutant strain BC-120-4 producing milbemycins A3, A4, B2, and B3 as main components was obtained, which favors the construction of genetically engineered strains producing 5-oxomilbemycins. Importantly, the milbemycins A3/A4 yield of BC-120-4 reached 3,890 ± 52 g/l, which was approximately two times higher than that of the initial strain BC-109-6 (1,326 ± 37 g/l). The subsequent interruption of the gene milF encoding a C5-ketoreductase responsible for the ketonization of milbemycins led to strain BCJ60 (∆milF) with the production of 5-oxomilbemycins A3/A4 and the elimination of milbemycins A3, A4, B2, and B3. The high 5-oxomilbemycins A3/A4 yield (3,470 ± 147 g/l) and genetic stability of BCJ60 implied the potential use in industry to prepare 5-oxomilbemycins A3/A4 for the semisynthesis of milbemycins oxime.
Inhibition Kinetics and Emodin Cocrystal Structure of a Type II Polyketide Ketoreductase†,‡

PubMed Central

Korman, Tyler Paz; Tan, Yuhong; Wong, Justin; Luo, Rui; Tsai, Shiou-Chuan

2008-01-01

Type II polyketides are a class of natural products that include pharmaceutically important aromatic compounds such as the antibiotic tetracycline and antitumor compound doxorubicin. The type II polyketide synthase (PKS) is a complex consisting of 5–10 standalone domains homologous to fatty acid synthase (FAS). Polyketide ketoreductase (KR) provides regio- and stereochemical diversity during the reduction. How the type II polyketide KR specifically reduces only the C9 carbonyl group is not well understood. The cocrystal structures of actinorhodin polyketide ketoreductase (actKR) bound with NADPH or NADP+ and the inhibitor emodin were solved with the wild type and P94L mutant of actKR, revealing the first observation of a bent p-quinone in an enzyme active site. Molecular dynamics simulation help explain the origin of the bent geometry. Extensive screening for in vitro substrates shows that unlike FAS KR, the actKR prefers bicyclic substrates. Inhibition kinetics indicate that actKR follows an ordered Bi Bi mechanism. Together with docking simulations that identified a potential phosphopantetheine binding groove, the structural and functional studies reveal that the C9 specificity is a result of active site geometry and substrate ring constraints. The results lay the foundation for the design of novel aromatic polyketide natural products with different reduction patterns. PMID:18205400
Role of Modular Polyketide Synthases in the Production of Polyether Ladder Compounds in Ciguatoxin-Producing Gambierdiscus polynesiensis and G. excentricus (Dinophyceae).

PubMed

Kohli, Gurjeet S; Campbell, Katrina; John, Uwe; Smith, Kirsty F; Fraga, Santiago; Rhodes, Lesley L; Murray, Shauna A

2017-09-01

Gambierdiscus, a benthic dinoflagellate, produces ciguatoxins that cause the human illness Ciguatera. Ciguatoxins are polyether ladder compounds that have a polyketide origin, indicating that polyketide synthases (PKS) are involved in their production. We sequenced transcriptomes of Gambierdiscus excentricus and Gambierdiscus polynesiensis and found 264 contigs encoding single domain ketoacyl synthases (KS; G. excentricus: 106, G. polynesiensis: 143) and ketoreductases (KR; G. excentricus: 7, G. polynesiensis: 8) with sequence similarity to type I PKSs, as reported in other dinoflagellates. In addition, 24 contigs (G. excentricus: 3, G. polynesiensis: 21) encoding multiple PKS domains (forming typical type I PKSs modules) were found. The proposed structure produced by one of these megasynthases resembles a partial carbon backbone of a polyether ladder compound. Seventeen contigs encoding single domain KS, KR, s-malonyltransacylase, dehydratase and enoyl reductase with sequence similarity to type II fatty acid synthases (FAS) in plants were found. Type I PKS and type II FAS genes were distinguished based on the arrangement of domains on the contigs and their sequence similarity and phylogenetic clustering with known PKS/FAS genes in other organisms. This differentiation of PKS and FAS pathways in Gambierdiscus is important, as it will facilitate approaches to investigating toxin biosynthesis pathways in dinoflagellates. © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
Isolation and characterization of a cDNA from Cuphea lanceolata encoding a beta-ketoacyl-ACP reductase.

PubMed

Klein, B; Pawlowski, K; Höricke-Grandpierre, C; Schell, J; Töpfer, R

1992-05-01

A cDNA encoding beta-ketoacyl-ACP reductase (EC 1.1.1.100), an integral part of the fatty acid synthase type II, was cloned from Cuphea lanceolata. This cDNA of 1276 bp codes for a polypeptide of 320 amino acids with 63 N-terminal residues presumably representing a transit peptide and 257 residues corresponding to the mature protein of 27 kDa. The encoded protein shows strong homology with the amino-terminal sequence and two tryptic peptides from avocado mesocarp beta-ketoacyl-ACP reductase, and its total amino acid composition is highly similar to those of the beta-ketoacyl-ACP reductases of avocado and spinach. Amino acid sequence homologies to polyketide synthase, beta-ketoreductases and short-chain alcohol dehydrogenases are discussed. An engineered fusion protein lacking most of the transit peptide, which was produced in Escherichia coli, was isolated and proved to possess beta-ketoacyl-ACP reductase activity. Hybridization studies revealed that in C. lanceolata beta-ketoacyl-ACP reductase is encoded by a small family of at least two genes and that members of this family are expressed in roots, leaves, flowers and seeds.
Enzymatic reduction of acetophenone derivatives with a benzil reductase from Pichia glucozyma (KRED1-Pglu): electronic and steric effects on activity and enantioselectivity.

PubMed

Contente, Martina L; Serra, Immacolata; Palazzolo, Luca; Parravicini, Chiara; Gianazza, Elisabetta; Eberini, Ivano; Pinto, Andrea; Guidi, Benedetta; Molinari, Francesco; Romano, Diego

2016-04-07

A recombinant ketoreductase from Pichia glucozyma (KRED1-Pglu) was used for the enantioselective reduction of various mono-substituted acetophenones. Reaction rates of meta- and para-derivatives were consistent with the electronic effects described by σ-Hammett coefficients; on the other hand, enantioselectivity was determined by an opposite orientation of the substrate in the binding pocket. Reduction of ortho-derivatives occurred only with substrates bearing substituents with low steric impact (i.e., F and CN). Reactivity was controlled by stereoelectronic features (C[double bond, length as m-dash]O length and charge, shape of LUMO frontier molecular orbitals), which can be theoretically calculated.

Genes and Gene Therapy

MedlinePlus

... a child can have a genetic disorder. Gene therapy is an experimental technique that uses genes to ... prevent disease. The most common form of gene therapy involves inserting a normal gene to replace an ...
Gene network biological validity based on gene-gene interaction relevance.

PubMed

Gómez-Vela, Francisco; Díaz-Díaz, Norberto

2014-01-01

In recent years, gene networks have become one of the most useful tools for modeling biological processes. Many inference gene network algorithms have been developed as techniques for extracting knowledge from gene expression data. Ensuring the reliability of the inferred gene relationships is a crucial task in any study in order to prove that the algorithms used are precise. Usually, this validation process can be carried out using prior biological knowledge. The metabolic pathways stored in KEGG are one of the most widely used knowledgeable sources for analyzing relationships between genes. This paper introduces a new methodology, GeneNetVal, to assess the biological validity of gene networks based on the relevance of the gene-gene interactions stored in KEGG metabolic pathways. Hence, a complete KEGG pathway conversion into a gene association network and a new matching distance based on gene-gene interaction relevance are proposed. The performance of GeneNetVal was established with three different experiments. Firstly, our proposal is tested in a comparative ROC analysis. Secondly, a randomness study is presented to show the behavior of GeneNetVal when the noise is increased in the input network. Finally, the ability of GeneNetVal to detect biological functionality of the network is shown.
Gene-for-genes interactions between cotton R genes and Xanthomonas campestris pv. malvacearum avr genes.

PubMed

De Feyter, R; Yang, Y; Gabriel, D W

1993-01-01

Six plasmid-borne avirulence (avr) genes were previously cloned from strain XcmH of the cotton pathogen, Xanthomonas campestris pv. malvacearum. We have now localized all six avr genes on the cloned fragments by subcloning and Tn5-gusA insertional mutagenesis. None of these avr genes appeared to exhibit exclusively gene-for-gene patterns of interactions with cotton R genes, and avrB4 was demonstrated to confer avr gene-for-R genes (plural) avirulence to X. c. pv. malvacearum on congenic cotton lines carrying either of two different resistance loci, B1 or B4. Furthermore, the B1 locus appeared to confer R gene-for-avr genes resistance to cotton against isogenic X. c. pv. malvacearum strains carrying any one of three avr genes: avrB4, avrb6, or avrB102. Restriction enzyme, Southern blot hybridization, and DNA sequence analyses showed that the XcmH avr genes are all highly similar to each other, to avrBs3 and avrBsP from the pepper pathogen X. c. pv. vesicatoria, and to the host-specific virulence gene pthA from the citrus pathogen X. citri. The XcmH avr genes differed primarily in the multiplicity of a tandemly repeated 102-base pair motif within the central portions of the genes, repeated from 14 to 23 times in members of this gene family. The complete nucleotide sequence of avrb6 revealed that it is 97% identical in DNA sequence to avrB4, avrBs3, avrBsP, and pthA and that 62-bp inverted terminal repeats mark the boundaries of homology between avrb6 and all members of this Xanthomonas virulence/avirulence gene family sequenced to date. The terminal 38 bp of both inverted repeats are highly similar to the 38-bp consensus terminal sequence of the Tn3 family of transposons. Up to 11 members of the avr gene family appear to be present in North American strains of X. c. pv. malvacearum, including XcmH. The high level of homology observed among these avr genes and their presence in multiple copies may explain the gene-for-genes interactions and also the observed high
Mechanism and Stereochemistry of Polyketide Chain Elongation and Methyl Group Epimerization in Polyether Biosynthesis.

PubMed

Xie, Xinqiang; Garg, Ashish; Khosla, Chaitan; Cane, David E

2017-03-01

The polyketide synthases responsible for the biosynthesis of the polyether antibiotics nanchangmycin (1) and salinomycin (4) harbor a number of redox-inactive ketoreductase (KR 0 ) domains that are implicated in the generation of C2-epimerized (2S)-2-methyl-3-ketoacyl-ACP intermediates. Evidence that the natural substrate for the polyether KR 0 domains is, as predicted, a (2R)-2-methyl-3-ketoacyl-ACP intermediate, came from a newly developed coupled ketosynthase (KS)-ketoreductase (KR) assay that established that the decarboxylative condensation of methylmalonyl-CoA with S-propionyl-N-acetylcysteamine catalyzed by the Nan[KS1][AT1] didomain from module 1 of the nanchangmycin synthase generates exclusively the corresponding (2R)-2-methyl-3-ketopentanoyl-ACP (7a) product. In tandem equilibrium isotope exchange experiments, incubation of [2- 2 H]-(2R,3S)-2-methyl-3-hydroxypentanoyl-ACP (6a) with redox-active, epimerase-inactive EryKR6 from module 6 of the 6-deoxyerythronolide B synthase and catalytic quantities of NADP + in the presence of redox-inactive, recombinant NanKR1 0 or NanKR5 0 , from modules 1 and 5 of the nanchangmycin synthase, or recombinant SalKR7 0 from module 7 of the salinomycin synthase, resulted in first-order, time-dependent washout of deuterium from 6a. Control experiments confirmed that this washout was due to KR 0 -catalyzed isotope exchange of the reversibly generated, transiently formed oxidation product [2- 2 H]-(2R)-2-methyl-3-ketopentanoyl-ACP (7a), consistent with the proposed epimerase activity of each of the KR 0 domains. Although they belong to the superfamily of short chain dehydrogenase-reductases, the epimerase-active KR 0 domains from polyether synthases lack one or both residues of the conserved Tyr-Ser dyad that has previously been implicated in KR-catalyzed epimerizations.
Mechanism and Stereochemistry of Polyketide Chain Elongation and Methyl Group Epimerization in Polyether Biosynthesis

PubMed Central

Xie, Xinqiang; Garg, Ashish; Khosla, Chaitan; Cane, David E.

2017-01-01

The polyketide synthases responsible for the biosynthesis of the polyether antibiotics nanchangmycin (1) and salinomycin (4) harbor a number of redox-inactive ketoreductase (KR0) domains that are implicated in the generation of C2-epimerized (2S)-2-methyl-3-ketoacyl-ACP intermediates. Evidence that the natural substrate for the polyether KR0 domains is, as predicted, a (2R)-2-methyl-3-ketoacyl-ACP intermediate, came from a newly developed coupled ketosynthase (KS)-ketoreductase (KR) assay that established that the decarboxylative condensation of methylmalonyl-CoA with S-propionyl-N-acetylcysteamine catalyzed by the Nan[KS1][AT1] didomain from module 1 of the nanchangmycin synthase generates exclusively the corresponding (2R)-2-methyl-3-ketopentanoyl-ACP (7a) product. In tandem equilibrium isotope exchange experiments, incubation of [2-2H]-(2R,3S)-2-methyl-3-hydroxypentanoyl-ACP (6a) with redox-active, epimerase-inactive EryKR6 from module 6 of the 6-deoxyerythronolide B synthase and catalytic quantities of NADP+ in the presence of redox-inactive, recombinant NanKR10 or NanKR50, from modules 1 and 5 of the nanchangmycin synthase, or recombinant SalKR70 from module 7 of the salinomycin synthase, resulted in first-order, time-dependent washout of deuterium from 6a. Control experiments confirmed that this washout was due to KR0-catalyzed isotope exchange of the reversibly-generated, transiently-formed oxidation product [2-2H]-(2R)-2-methyl-3-ketopentanoyl-ACP (7a), consistent with the proposed epimerase activity of each of the KR0 domains. Although they belong to the superfamily of short chain dehydrogenase-reductases, the epimerase-active KR0 domains from polyether synthases lack one or both residues of the conserved Tyr-Ser dyad that has previously been implicated in KR-catalyzed epimerizations. PMID:28157306
Avirulence Genes in Cereal Powdery Mildews: The Gene-for-Gene Hypothesis 2.0.

PubMed

Bourras, Salim; McNally, Kaitlin E; Müller, Marion C; Wicker, Thomas; Keller, Beat

2016-01-01

The gene-for-gene hypothesis states that for each gene controlling resistance in the host, there is a corresponding, specific gene controlling avirulence in the pathogen. Allelic series of the cereal mildew resistance genes Pm3 and Mla provide an excellent system for genetic and molecular analysis of resistance specificity. Despite this opportunity for molecular research, avirulence genes in mildews remain underexplored. Earlier work in barley powdery mildew (B.g. hordei) has shown that the reaction to some Mla resistance alleles is controlled by multiple genes. Similarly, several genes are involved in the specific interaction of wheat mildew (B.g. tritici) with the Pm3 allelic series. We found that two mildew genes control avirulence on Pm3f: one gene is involved in recognition by the resistance protein as demonstrated by functional studies in wheat and the heterologous host Nicotiana benthamiana. A second gene is a suppressor, and resistance is only observed in mildew genotypes combining the inactive suppressor and the recognized Avr. We propose that such suppressor/avirulence gene combinations provide the basis of specificity in mildews. Depending on the particular gene combinations in a mildew race, different genes will be genetically identified as the "avirulence" gene. Additionally, the observation of two LINE retrotransposon-encoded avirulence genes in B.g. hordei further suggests that the control of avirulence in mildew is more complex than a canonical gene-for-gene interaction. To fully understand the mildew-cereal interactions, more knowledge on avirulence determinants is needed and we propose ways how this can be achieved based on recent advances in the field.
Avirulence Genes in Cereal Powdery Mildews: The Gene-for-Gene Hypothesis 2.0

PubMed Central

Bourras, Salim; McNally, Kaitlin E.; Müller, Marion C.; Wicker, Thomas; Keller, Beat

2016-01-01

The gene-for-gene hypothesis states that for each gene controlling resistance in the host, there is a corresponding, specific gene controlling avirulence in the pathogen. Allelic series of the cereal mildew resistance genes Pm3 and Mla provide an excellent system for genetic and molecular analysis of resistance specificity. Despite this opportunity for molecular research, avirulence genes in mildews remain underexplored. Earlier work in barley powdery mildew (B.g. hordei) has shown that the reaction to some Mla resistance alleles is controlled by multiple genes. Similarly, several genes are involved in the specific interaction of wheat mildew (B.g. tritici) with the Pm3 allelic series. We found that two mildew genes control avirulence on Pm3f: one gene is involved in recognition by the resistance protein as demonstrated by functional studies in wheat and the heterologous host Nicotiana benthamiana. A second gene is a suppressor, and resistance is only observed in mildew genotypes combining the inactive suppressor and the recognized Avr. We propose that such suppressor/avirulence gene combinations provide the basis of specificity in mildews. Depending on the particular gene combinations in a mildew race, different genes will be genetically identified as the “avirulence” gene. Additionally, the observation of two LINE retrotransposon-encoded avirulence genes in B.g. hordei further suggests that the control of avirulence in mildew is more complex than a canonical gene-for-gene interaction. To fully understand the mildew–cereal interactions, more knowledge on avirulence determinants is needed and we propose ways how this can be achieved based on recent advances in the field. PMID:26973683
Converting cancer genes into killer genes.

PubMed Central

Da Costa, L T; Jen, J; He, T C; Chan, T A; Kinzler, K W; Vogelstein, B

1996-01-01

Over the past decade, it has become clear that tumorigenesis is driven by alterations in genes that control cell growth or cell death. Theoretically, the proteins encoded by these genes provide excellent targets for new therapeutic agents. Here, we describe a gene therapy approach to specifically kill tumor cells expressing such oncoproteins. In outline, the target oncoprotein binds to exogenously introduced gene products, resulting in transcriptional activation of a toxic gene. As an example, we show that this approach can be used to specifically kill cells overexpressing a mutant p53 gene in cell culture. The strategy may be generally applicable to neoplastic diseases in which the underlying patterns of genetic alterations or abnormal gene expression are known. Images Fig. 1 Fig. 2 Fig. 4 Fig. 5 PMID:8633039
Seasonal Pattern of Mycobacterium ulcerans, the Causative Agent of Buruli Ulcer, in the Environment in Ghana.

PubMed

Aboagye, Samuel Yaw; Ampah, Kobina Assan; Ross, Amanda; Asare, Prince; Otchere, Isaac Darko; Fyfe, Janet; Yeboah-Manu, Dorothy

2017-08-01

This study aimed to contribute to the understanding of Mycobacterium ulcerans (MU) ecology by analysing both clinical and environmental samples collected from ten communities along two major river basins (Offin and Densu) associated with Buruli ulcer (BU) at different seasons. We collected clinical samples from presumptive BU cases and environmental samples from ten communities. Following DNA extraction, clinical samples were confirmed by IS2404 PCR and environmental samples were confirmed by targeting MU-specific genes, IS2404, IS2606 and the ketoreductase (KR) using real-time PCR. Environmental samples were first analysed for IS2404; after which, IS2404-positive samples were multiplexed for the IS2606 and KR gene. Our findings indicate an overall decline in BU incidence along both river basins, although incidence at Densu outweighs that of Offin. Overall, 1600 environmental samples were screened along Densu (434, 27 %) and Offin (1166, 73 %) and MU was detected in 139 (9 %) of the combined samples. The positivity of MU along the Densu River basin was 89/434 (20.5 %), whilst that of the Offin River basin was 50/1166 (4.3 %). The DNA was detected mainly in snails (5/6, 83 %), moss (8/40, 20 %), soil (55/586, 9 %) and vegetation (55/675, 8 %). The proportion of MU positive samples recorded was higher during the months with higher rainfall levels (126/1175, 11 %) than during the dry season months (13/425, 3 %). This study indicates for the first time that there is a seasonal pattern in the presence of MU in the environment, which may be related to recent rainfall or water in the soil.
GoGene: gene annotation in the fast lane.

PubMed

Plake, Conrad; Royer, Loic; Winnenburg, Rainer; Hakenberg, Jörg; Schroeder, Michael

2009-07-01

High-throughput screens such as microarrays and RNAi screens produce huge amounts of data. They typically result in hundreds of genes, which are often further explored and clustered via enriched GeneOntology terms. The strength of such analyses is that they build on high-quality manual annotations provided with the GeneOntology. However, the weakness is that annotations are restricted to process, function and location and that they do not cover all known genes in model organisms. GoGene addresses this weakness by complementing high-quality manual annotation with high-throughput text mining extracting co-occurrences of genes and ontology terms from literature. GoGene contains over 4,000,000 associations between genes and gene-related terms for 10 model organisms extracted from more than 18,000,000 PubMed entries. It does not cover only process, function and location of genes, but also biomedical categories such as diseases, compounds, techniques and mutations. By bringing it all together, GoGene provides the most recent and most complete facts about genes and can rank them according to novelty and importance. GoGene accepts keywords, gene lists, gene sequences and protein sequences as input and supports search for genes in PubMed, EntrezGene and via BLAST. Since all associations of genes to terms are supported by evidence in the literature, the results are transparent and can be verified by the user. GoGene is available at http://gopubmed.org/gogene.
Down-weighting overlapping genes improves gene set analysis

PubMed Central

2012-01-01

Background The identification of gene sets that are significantly impacted in a given condition based on microarray data is a crucial step in current life science research. Most gene set analysis methods treat genes equally, regardless how specific they are to a given gene set. Results In this work we propose a new gene set analysis method that computes a gene set score as the mean of absolute values of weighted moderated gene t-scores. The gene weights are designed to emphasize the genes appearing in few gene sets, versus genes that appear in many gene sets. We demonstrate the usefulness of the method when analyzing gene sets that correspond to the KEGG pathways, and hence we called our method Pathway Analysis with Down-weighting of Overlapping Genes (PADOG). Unlike most gene set analysis methods which are validated through the analysis of 2-3 data sets followed by a human interpretation of the results, the validation employed here uses 24 different data sets and a completely objective assessment scheme that makes minimal assumptions and eliminates the need for possibly biased human assessments of the analysis results. Conclusions PADOG significantly improves gene set ranking and boosts sensitivity of analysis using information already available in the gene expression profiles and the collection of gene sets to be analyzed. The advantages of PADOG over other existing approaches are shown to be stable to changes in the database of gene sets to be analyzed. PADOG was implemented as an R package available at: http://bioinformaticsprb.med.wayne.edu/PADOG/or http://www.bioconductor.org. PMID:22713124
Gene-gene interactions and gene polymorphisms of VEGFA and EG-VEGF gene systems in recurrent pregnancy loss.

PubMed

Su, Mei-Tsz; Lin, Sheng-Hsiang; Chen, Yi-Chi; Kuo, Pao-Lin

2014-06-01

Both vascular endothelial growth factor A (VEGFA) and endocrine gland-derived vascular endothelial growth factor (EG-VEGF) systems play major roles in angiogenesis. A body of evidence suggests VEGFs regulate critical processes during pregnancy and have been associated with recurrent pregnancy loss (RPL). However, little information is available regarding the interaction of these two major major angiogenesis-related systems in early human pregnancy. This study was conducted to investigate the association of gene polymorphisms and gene-gene interaction among genes in VEGFA and EG-VEGF systems and idiopathic RPL. A total of 98 women with history of idiopathic RPL and 142 controls were included, and 5 functional SNPs selected from VEGFA, KDR, EG-VEGF (PROK1), PROKR1 and PROKR2 were genotyped. We used multifactor dimensionality reduction (MDR) analysis to choose a best model and evaluate gene-gene interactions. Ingenuity pathways analysis (IPA) was introduced to explore possible complex interactions. Two receptor gene polymorphisms [KDR (Q472H) and PROKR2 (V331M)] were significantly associated with idiopathic RPL (P<0.01). The MDR test revealed that the KDR (Q472H) polymorphism was the best loci to be associated with RPL (P=0.02). IPA revealed EG-VEGF and VEGFA systems shared several canonical signaling pathways that may contribute to gene-gene interactions, including the Akt, IL-8, EGFR, MAPK, SRC, VHL, HIF-1A and STAT3 signaling pathways. Two receptor gene polymorphisms [KDR (Q472H) and PROKR2 (V331M)] were significantly associated with idiopathic RPL. EG-VEGF and VEGFA systems shared several canonical signaling pathways that may contribute to gene-gene interactions, including the Akt, IL-8, EGFR, MAPK, SRC, VHL, HIF-1A and STAT3.
FunGene: the functional gene pipeline and repository.

PubMed

Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

2013-01-01

Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

PubMed

Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

2015-01-01

In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
Gene-gene, gene-environment, gene-nutrient interactions and single nucleotide polymorphisms of inflammatory cytokines.

PubMed

Nadeem, Amina; Mumtaz, Sadaf; Naveed, Abdul Khaliq; Aslam, Muhammad; Siddiqui, Arif; Lodhi, Ghulam Mustafa; Ahmad, Tausif

2015-05-15

Inflammation plays a significant role in the etiology of type 2 diabetes mellitus (T2DM). The rise in the pro-inflammatory cytokines is the essential step in glucotoxicity and lipotoxicity induced mitochondrial injury, oxidative stress and beta cell apoptosis in T2DM. Among the recognized markers are interleukin (IL)-6, IL-1, IL-10, IL-18, tissue necrosis factor-alpha (TNF-α), C-reactive protein, resistin, adiponectin, tissue plasminogen activator, fibrinogen and heptoglobins. Diabetes mellitus has firm genetic and very strong environmental influence; exhibiting a polygenic mode of inheritance. Many single nucleotide polymorphisms (SNPs) in various genes including those of pro and anti-inflammatory cytokines have been reported as a risk for T2DM. Not all the SNPs have been confirmed by unifying results in different studies and wide variations have been reported in various ethnic groups. The inter-ethnic variations can be explained by the fact that gene expression may be regulated by gene-gene, gene-environment and gene-nutrient interactions. This review highlights the impact of these interactions on determining the role of single nucleotide polymorphism of IL-6, TNF-α, resistin and adiponectin in pathogenesis of T2DM.
Neighboring Genes Show Correlated Evolution in Gene Expression.

PubMed

Ghanbarian, Avazeh T; Hurst, Laurence D

2015-07-01

When considering the evolution of a gene's expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Neighboring Genes Show Correlated Evolution in Gene Expression

PubMed Central

Ghanbarian, Avazeh T.; Hurst, Laurence D.

2015-01-01

When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543
Gene doping: gene delivery for olympic victory

PubMed Central

Gould, David

2013-01-01

With one recently recommended gene therapy in Europe and a number of other gene therapy treatments now proving effective in clinical trials it is feasible that the same technologies will soon be adopted in the world of sport by unscrupulous athletes and their trainers in so called ‘gene doping’. In this article an overview of the successful gene therapy clinical trials is provided and the potential targets for gene doping are highlighted. Depending on whether a doping gene product is secreted from the engineered cells or is retained locally to, or inside engineered cells will, to some extent, determine the likelihood of detection. It is clear that effective gene delivery technologies now exist and it is important that detection and prevention plans are in place. PMID:23082866
GeneSigDB—a curated database of gene expression signatures

PubMed Central

Culhane, Aedín C.; Schwarzl, Thomas; Sultana, Razvan; Picard, Kermshlise C.; Picard, Shaita C.; Lu, Tim H.; Franklin, Katherine R.; French, Simon J.; Papenhausen, Gerald; Correll, Mick; Quackenbush, John

2010-01-01

The primary objective of most gene expression studies is the identification of one or more gene signatures; lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb) a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cells gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and download gene signatures in most common gene identifier formats. PMID:19934259
Gene doping: gene delivery for olympic victory.

PubMed

Gould, David

2013-08-01

With one recently recommended gene therapy in Europe and a number of other gene therapy treatments now proving effective in clinical trials it is feasible that the same technologies will soon be adopted in the world of sport by unscrupulous athletes and their trainers in so called 'gene doping'. In this article an overview of the successful gene therapy clinical trials is provided and the potential targets for gene doping are highlighted. Depending on whether a doping gene product is secreted from the engineered cells or is retained locally to, or inside engineered cells will, to some extent, determine the likelihood of detection. It is clear that effective gene delivery technologies now exist and it is important that detection and prevention plans are in place. © 2012 The Author. British Journal of Clinical Pharmacology © 2012 The British Pharmacological Society.

Gene-Gene and Gene-Environment Interactions in Ulcerative Colitis

PubMed Central

Wang, Ming-Hsi; Fiocchi, Claudio; Zhu, Xiaofeng; Ripke, Stephan; Kamboh, M. Ilyas; Rebert, Nancy; Duerr, Richard H.; Achkar, Jean-Paul

2014-01-01

Genome-wide association studies (GWAS) have identified at least 133 ulcerative colitis (UC) associated loci. The role of genetic factors in clinical practice is not clearly defined. The relevance of genetic variants to disease pathogenesis is still uncertain because of not characterized gene-gene and gene-environment interactions. We examined the predictive value of combining the 133 UC risk loci with genetic interactions in an ongoing inflammatory bowel disease (IBD) GWAS. The Wellcome Trust Case-Control Consortium (WTCCC) IBD GWAS was used as a replication cohort. We applied logic regression (LR), a novel adaptive regression methodology, to search for high order interactions. Exploratory genotype correlations with UC sub-phenotypes (extent of disease, need of surgery, age of onset, extra-intestinal manifestations and primary sclerosing cholangitis (PSC)) were conducted. The combination of 133 UC loci yielded good UC risk predictability (area under the curve [AUC] of 0.86). A higher cumulative allele score predicted higher UC risk. Through LR, several lines of evidence for genetic interactions were identified and successfully replicated in the WTCCC cohort. The genetic interactions combined with the gene-smoking interaction significantly improved predictability in the model (AUC, from 0.86 to 0.89, P=3.26E-05). Explained UC variance increased from 37% to 42% after adding the interaction terms. A within case analysis found suggested genetic association with PSC. Our study demonstrates that the LR methodology allows the identification and replication of high order genetic interactions in UC GWAS datasets. UC risk can be predicted by a 133 loci and improved by adding gene-gene and gene-environment interactions. PMID:24241240
Gene doping.

PubMed

Azzazy, Hassan M E

2010-01-01

Gene doping abuses the legitimate approach of gene therapy. While gene therapy aims to correct genetic disorders by introducing a foreign gene to replace an existing faulty one or by manipulating existing gene(s) to achieve a therapeutic benefit, gene doping employs the same concepts to bestow performance advantages on athletes over their competitors. Recent developments in genetic engineering have contributed significantly to the progress of gene therapy research and currently numerous clinical trials are underway. Some athletes and their staff are probably watching this progress closely. Any gene that plays a role in muscle development, oxygen delivery to tissues, neuromuscular coordination, or even pain control is considered a candidate for gene dopers. Unfortunately, detecting gene doping is technically very difficult because the transgenic proteins expressed by the introduced genes are similar to their endogenous counterparts. Researchers today are racing the clock because assuring the continued integrity of sports competition depends on their ability to develop effective detection strategies in preparation for the 2012 Olympics, which may mark the appearance of genetically modified athletes.
Gene trap and gene inversion methods for conditional gene inactivation in the mouse

PubMed Central

Xin, Hong-Bo; Deng, Ke-Yu; Shui, Bo; Qu, Shimian; Sun, Qi; Lee, Jane; Greene, Kai Su; Wilson, Jason; Yu, Ying; Feldman, Morris; Kotlikoff, Michael I.

2005-01-01

Conditional inactivation of individual genes in mice using site-specific recombinases is an extremely powerful method for determining the complex roles of mammalian genes in developmental and tissue-specific contexts, a major goal of post-genomic research. However, the process of generating mice with recombinase recognition sequences placed at specific locations within a gene, while maintaining a functional allele, is time consuming, expensive and technically challenging. We describe a system that combines gene trap and site-specific DNA inversion to generate mouse embryonic stem (ES) cell clones for the rapid production of conditional knockout mice, and the use of this system in an initial gene trap screen. Gene trapping should allow the selection of thousands of ES cell clones with defined insertions that can be used to generate conditional knockout mice, thereby providing extensive parallelism that eliminates the time-consuming steps of targeting vector construction and homologous recombination for each gene. PMID:15659575
Differentially Coexpressed Disease Gene Identification Based on Gene Coexpression Network.

PubMed

Jiang, Xue; Zhang, Han; Quan, Xiongwen

2016-01-01

Screening disease-related genes by analyzing gene expression data has become a popular theme. Traditional disease-related gene selection methods always focus on identifying differentially expressed gene between case samples and a control group. These traditional methods may not fully consider the changes of interactions between genes at different cell states and the dynamic processes of gene expression levels during the disease progression. However, in order to understand the mechanism of disease, it is important to explore the dynamic changes of interactions between genes in biological networks at different cell states. In this study, we designed a novel framework to identify disease-related genes and developed a differentially coexpressed disease-related gene identification method based on gene coexpression network (DCGN) to screen differentially coexpressed genes. We firstly constructed phase-specific gene coexpression network using time-series gene expression data and defined the conception of differential coexpression of genes in coexpression network. Then, we designed two metrics to measure the value of gene differential coexpression according to the change of local topological structures between different phase-specific networks. Finally, we conducted meta-analysis of gene differential coexpression based on the rank-product method. Experimental results demonstrated the feasibility and effectiveness of DCGN and the superior performance of DCGN over other popular disease-related gene selection methods through real-world gene expression data sets.
Hox genes and study of Hox genes in crustacean

NASA Astrophysics Data System (ADS)

Hou, Lin; Chen, Zhijuan; Xu, Mingyu; Lin, Shengguo; Wang, Lu

2004-12-01

Homeobox genes have been discovered in many species. These genes are known to play a major role in specifying regional identity along the anterior-posterior axis of animals from a wide range of phyla. The products of the homeotic genes are a set of evolutionarily conserved transcription factors that control elaborate developmental processes and specify cell fates in metazoans. Crustacean, presenting a variety of body plans not encountered in any other class or phylum of the Metazoa, has been shown to possess a single set of homologous Hox genes like insect. The ancestral crustacean Hox gene complex comprised ten genes: eight homologous to the hometic Hox genes and two related to nonhomeotic genes presented within the insect Hox complexes. The crustacean in particular exhibits an abundant diversity segment specialization and tagmosis. This morphological diversity relates to the Hox genes. In crustacean body plan, different Hox genes control different segments and tagmosis.
Construction of ivermectin producer by domain swaps of avermectin polyketide synthase in Streptomyces avermitilis.

PubMed

Zhang, Xiaolin; Chen, Zhi; Li, Meng; Wen, Ying; Song, Yuan; Li, Jilun

2006-10-01

Ivermectin, 22, 23-dihydroavermectin B1, is commercially important in human, veterinary medicine, and pesticides. It is currently synthesized by chemical reduction of the double bond between C22 and C23 of avermectins B1, which are a mixture of B1a (>80%) and B1b (<20%) produced by fermentation of Streptomyces avermitilis. The cost of ivermectin is much higher than that of avermectins B1 owing to the necessity of region-specific hydrogenation at C22-C23 of avermectins B1 with rhodium chloride as the catalyst for producing ivermectin. Here we report that ivermectin can be produced directly by fermentation of recombinant strains constructed through targeted genetic engineering of the avermectin polyketide synthase (PKS) in S. avermitilis Olm73-12, which produces only avermectins B and not avermectins A and oligomycin. The DNA region encoding the dehydratase (DH) and ketoreductase (KR) domains of module 2 from the avermectin PKS in S. avermitilis Olm73-12 was replaced by the DNA fragment encoding the DH, enoylreductase, and KR domains from module 4 of the pikromycin PKS of Streptomyces venezuelae ATCC 15439 using a gene replacement vector pXL211. Twenty-seven of mutants were found to produce a small amount of 22, 23-dihydroavermectin B1a and avermectin B1a and B2a by high performance liquid chromatography and liquid chromatography mass spectrometry analysis. This study might provide a route to the low-cost production of ivermectin by fermentation.
GeneBuilder: interactive in silico prediction of gene structure.

PubMed

Milanesi, L; D'Angelo, D; Rogozin, I B

1999-01-01

Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
Gene: a gene-centered information resource at NCBI.

PubMed

Brown, Garth R; Hem, Vichet; Katz, Kenneth S; Ovetsky, Michael; Wallin, Craig; Ermolaeva, Olga; Tolstoy, Igor; Tatusova, Tatiana; Pruitt, Kim D; Maglott, Donna R; Murphy, Terence D

2015-01-01

The National Center for Biotechnology Information's (NCBI) Gene database (www.ncbi.nlm.nih.gov/gene) integrates gene-specific information from multiple data sources. NCBI Reference Sequence (RefSeq) genomes for viruses, prokaryotes and eukaryotes are the primary foundation for Gene records in that they form the critical association between sequence and a tracked gene upon which additional functional and descriptive content is anchored. Additional content is integrated based on the genomic location and RefSeq transcript and protein sequence data. The content of a Gene record represents the integration of curation and automated processing from RefSeq, collaborating model organism databases, consortia such as Gene Ontology, and other databases within NCBI. Records in Gene are assigned unique, tracked integers as identifiers. The content (citations, nomenclature, genomic location, gene products and their attributes, phenotypes, sequences, interactions, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programming utilities (E-Utilities and Entrez Direct) and for bulk transfer by FTP. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Gene-gene and gene-environment interactions: new insights into the prevention, detection and management of coronary artery disease.

PubMed

Lanktree, Matthew B; Hegele, Robert A

2009-02-26

Despite the recent success of genome-wide association studies (GWASs) in identifying loci consistently associated with coronary artery disease (CAD), a large proportion of the genetic components of CAD and its metabolic risk factors, including plasma lipids, type 2 diabetes and body mass index, remain unattributed. Gene-gene and gene-environment interactions might produce a meaningful improvement in quantification of the genetic determinants of CAD. Testing for gene-gene and gene-environment interactions is thus a new frontier for large-scale GWASs of CAD. There are several anecdotal examples of monogenic susceptibility to CAD in which the phenotype was worsened by an adverse environment. In addition, small-scale candidate gene association studies with functional hypotheses have identified gene-environment interactions. For future evaluation of gene-gene and gene-environment interactions to achieve the same success as the single gene associations reported in recent GWASs, it will be important to pre-specify agreed standards of study design and statistical power, environmental exposure measurement, phenomic characterization and analytical strategies. Here we discuss these issues, particularly in relation to the investigation and potential clinical utility of gene-gene and gene-environment interactions in CAD.
Sexy gene conversions: locating gene conversions on the X-chromosome.

PubMed

Lawson, Mark J; Zhang, Liqing

2009-08-01

Gene conversion can have a profound impact on both the short- and long-term evolution of genes and genomes. Here, we examined the gene families that are located on the X-chromosomes of human (Homo sapiens), chimpanzee (Pan troglodytes), mouse (Mus musculus) and rat (Rattus norvegicus) for evidence of gene conversion. We identified seven gene families (WD repeat protein family, Ferritin Heavy Chain family, RAS-related Protein RAB-40 family, Diphosphoinositol polyphosphate phosphohydrolase family, Transcription Elongation Factor A family, LDOC1-related family, Zinc Finger Protein ZIC, and GLI family) that show evidence of gene conversion. Through phylogenetic analyses and synteny evidence, we show that gene conversion has played an important role in the evolution of these gene families and that gene conversion has occurred independently in both primates and rodents. Comparing the results with those of two gene conversion prediction programs (GENECONV and Partimatrix), we found that both GENECONV and Partimatrix have very high false negative rates (i.e. failed to predict gene conversions), which leads to many undetected gene conversions. The combination of phylogenetic analyses with physical synteny evidence exhibits high resolution in the detection of gene conversions.
Gene doping.

PubMed

Harridge, Stephen D R; Velloso, Cristiana P

2008-01-01

Gene doping is the misuse of gene therapy to enhance athletic performance. It has recently been recognised as a potential threat and subsequently been prohibited by the World Anti-Doping Agency. Despite concerns with safety and efficacy of gene therapy, the technology is progressing steadily. Many of the genes/proteins which are involved in determining key components of athletic performance have been identified. Naturally occurring mutations in humans as well as gene-transfer experiments in adult animals have shown that altered expression of these genes does indeed affect physical performance. For athletes, however, the gains in performance must be weighed against the health risks associated with the gene-transfer process, whereas the detection of such practices will provide new challenges for the anti-doping authorities.
Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis.

PubMed

dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

2015-01-01

Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis.
Reconstructing directed gene regulatory network by only gene expression data.

PubMed

Zhang, Lu; Feng, Xi Kang; Ng, Yen Kaow; Li, Shuai Cheng

2016-08-18

Accurately identifying gene regulatory network is an important task in understanding in vivo biological activities. The inference of such networks is often accomplished through the use of gene expression data. Many methods have been developed to evaluate gene expression dependencies between transcription factor and its target genes, and some methods also eliminate transitive interactions. The regulatory (or edge) direction is undetermined if the target gene is also a transcription factor. Some methods predict the regulatory directions in the gene regulatory networks by locating the eQTL single nucleotide polymorphism, or by observing the gene expression changes when knocking out/down the candidate transcript factors; regrettably, these additional data are usually unavailable, especially for the samples deriving from human tissues. In this study, we propose the Context Based Dependency Network (CBDN), a method that is able to infer gene regulatory networks with the regulatory directions from gene expression data only. To determine the regulatory direction, CBDN computes the influence of source to target by evaluating the magnitude changes of expression dependencies between the target gene and the others with conditioning on the source gene. CBDN extends the data processing inequality by involving the dependency direction to distinguish between direct and transitive relationship between genes. We also define two types of important regulators which can influence a majority of the genes in the network directly or indirectly. CBDN can detect both of these two types of important regulators by averaging the influence functions of candidate regulator to the other genes. In our experiments with simulated and real data, even with the regulatory direction taken into account, CBDN outperforms the state-of-the-art approaches for inferring gene regulatory network. CBDN identifies the important regulators in the predicted network: 1. TYROBP influences a batch of genes that are
Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

PubMed Central

dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

2015-01-01

Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928
Gene Architectures that Minimize Cost of Gene Expression.

PubMed

Frumkin, Idan; Schirman, Dvir; Rotman, Aviv; Li, Fangfei; Zahavi, Liron; Mordret, Ernest; Asraf, Omer; Wu, Song; Levy, Sasha F; Pilpel, Yitzhak

2017-01-05

Gene expression burdens cells by consuming resources and energy. While numerous studies have investigated regulation of expression level, little is known about gene design elements that govern expression costs. Here, we ask how cells minimize production costs while maintaining a given protein expression level and whether there are gene architectures that optimize this process. We measured fitness of ∼14,000 E. coli strains, each expressing a reporter gene with a unique 5' architecture. By comparing cost-effective and ineffective architectures, we found that cost per protein molecule could be minimized by lowering transcription levels, regulating translation speeds, and utilizing amino acids that are cheap to synthesize and that are less hydrophobic. We then examined natural E. coli genes and found that highly expressed genes have evolved more forcefully to minimize costs associated with their expression. Our study thus elucidates gene design elements that improve the economy of protein expression in natural and heterologous systems. Copyright © 2017 Elsevier Inc. All rights reserved.
Gene transfer and gene mapping in mammalian cells in culture.

PubMed

Shows, T B; Sakaguchi, A Y

1980-01-01

The ability to transfer mammalian genes parasexually has opened new possibilities for gene mapping and fine structure mapping and offers great potential for contributing to several aspects of mammalian biology, including gene expression and genetic engineering. The DNA transferred has ranged from whole genomes to single genes and smaller segments of DNA. The transfer of whole genomes by cell fusion forms cell hybrids, which has promoted the extensive mapping of human and mouse genes. Transfer, by cell fusion, of rearranged chromosomes has contributed significantly to determining close linkage and the assignment of genes to specific chromosomal regions. Transfer of single chromosomes has been achieved utilizing microcells fused to recipient cells. Metaphase chromosomes have been isolated and used to transfer single-to-multigenic DNA segments. DNA-mediated gene transfer, simulating bacterial transformation, has achieved transfer of single-copy genes. By utilizing DNA cleaved with restriction endonucleases, gene transfer is being empolyed as a bioassay for the purification of genes. Gene mapping and the fate of transferred genes can be examined now at the molecular level using sequence-specific probles. Recently, single genes have been cloned into eucaryotic and procaryotic vectors for transfer into mammalian cells. Moreover, recombinant libraries in which entire mammalian genomes are represented collectively are a rich new source of transferable genes. Methodology for transferring mammalian genetic information and applications for mapping mammalian genes is presented and prospects for the future discussed.
Patenting human genes: Chinese academic articles' portrayal of gene patents.

PubMed

Du, Li

2018-04-24

The patenting of human genes has been the subject of debate for decades. While China has gradually come to play an important role in the global genomics-based testing and treatment market, little is known about Chinese scholars' perspectives on patent protection for human genes. A content analysis of academic literature was conducted to identify Chinese scholars' concerns regarding gene patents, including benefits and risks of patenting human genes, attitudes that researchers hold towards gene patenting, and any legal and policy recommendations offered for the gene patent regime in China. 57.2% of articles were written by law professors, but scholars from health sciences, liberal arts, and ethics also participated in discussions on gene patent issues. While discussions of benefits and risks were relatively balanced in the articles, 63.5% of the articles favored gene patenting in general and, of the articles (n = 41) that explored gene patents in the Chinese context, 90.2% supported patent protections for human genes in China. The patentability of human genes was discussed in 33 articles, and 75.8% of these articles reached the conclusion that human genes are patentable. Chinese scholars view the patent regime as an important legal tool to protect the interests of inventors and inventions as well as the genetic resources of China. As such, many scholars support a gene patent system in China. These attitudes towards gene patents remain unchanged following the court ruling in the Myriad case in 2013, but arguments have been raised about the scope of gene patents, in particular that the increasing numbers of gene patents may negatively impact public health in China.
Involvement of an octose ketoreductase and two acyltransferases in the biosynthesis of paulomycins

NASA Astrophysics Data System (ADS)

Li, Jine; Wang, Min; Ding, Yong; Tang, Yue; Zhang, Zhiguo; Chen, Yihua

2016-02-01

C-4 hydroxyethyl branched octoses have been observed in polysaccharides of several genera of gram negative bacteria and in various antibiotics produced by gram positive bacteria. The C-4 hydroxyethyl branch was proposed to be converted from C-4 acetyl branch by an uncharacterized ketoreduction step. Paulomycins (PAUs) are glycosylated antibiotics with potent inhibitory activity against gram positive bacteria and are structurally defined by its unique C-4‧ hydroxyethyl branched paulomycose moiety. A novel aldo-keto-reductase, Pau7 was characterized as the enzyme catalyzing the stereospecific ketoreduction of 7‧-keto of PAU E (1) to give the C-4‧ hydroxyethyl branched paulomycose moiety of PAU F (2). An acyltransferase Pau6 further decorates the C-4‧ hydroxyethyl branch of paulomycose moiety of 2 by attaching various fatty acyl chains to 7‧-OH to generate diverse PAUs. In addition, another acyltransferase Pau24 was proposed to be responsible for the 13-O-acetylation of PAUs.
Gene Circuit Analysis of the Terminal Gap Gene huckebein

PubMed Central

Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

2009-01-01

The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378
Gene circuit analysis of the terminal gap gene huckebein.

PubMed

Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

2009-10-01

The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network.

Genes from scratch--the evolutionary fate of de novo genes.

PubMed

Schlötterer, Christian

2015-04-01

Although considered an extremely unlikely event, many genes emerge from previously noncoding genomic regions. This review covers the entire life cycle of such de novo genes. Two competing hypotheses about the process of de novo gene birth are discussed as well as the high death rate of de novo genes. Despite the high death rate, some de novo genes are retained and remain functional, even in distantly related species, through their integration into gene networks. Further studies combining gene expression with ribosome profiling in multiple populations across different species will be instrumental for an improved understanding of the evolutionary processes operating on de novo genes. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
Initial description of primate-specific cystine-knot Prometheus genes and differential gene expansions of D-dopachrome tautomerase genes

PubMed Central

Premzl, Marko

2015-01-01

Using eutherian comparative genomic analysis protocol and public genomic sequence data sets, the present work attempted to update and revise two gene data sets. The most comprehensive third party annotation gene data sets of eutherian adenohypophysis cystine-knot genes (128 complete coding sequences), and d-dopachrome tautomerases and macrophage migration inhibitory factor genes (30 complete coding sequences) were annotated. For example, the present study first described primate-specific cystine-knot Prometheus genes, as well as differential gene expansions of D-dopachrome tautomerase genes. Furthermore, new frameworks of future experiments of two eutherian gene data sets were proposed. PMID:25941635
GenePRIMP: A Gene Prediction Improvement Pipeline For Prokaryotic Genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kyrpides, Nikos C.; Ivanova, Natalia N.; Pati, Amrita

2010-07-08

GenePRIMP (Gene Prediction Improvement Pipeline, Http://geneprimp.jgi-psf.org), a computational process that performs evidence-based evaluation of gene models in prokaryotic genomes and reports anomalies including inconsistent start sites, missing genes, and split genes. We show that manual curation of gene models using the anomaly reports generated by GenePRIMP improves their quality and demonstrate the applicability of GenePRIMP in improving finishing quality and comparing different genome sequencing and annotation technologies. Keywords in context: Gene model, Quality Control, Translation start sites, Automatic correction. Hardware requirements; PC, MAC; Operating System: UNIX/LINUX; Compiler/Version: Perl 5.8.5 or higher; Special requirements: NCBI Blast and nr installation; File Types:more » Source Code, Executable module(s), Sample problem input data; installation instructions other; programmer documentation. Location/transmission: http://geneprimp.jgi-psf.org/gp.tar.gz« less
Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms.

PubMed

Li, Zhen; Defoort, Jonas; Tasdighian, Setareh; Maere, Steven; Van de Peer, Yves; De Smet, Riet

2016-02-01

Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of "gene duplicability" is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. © 2016 American Society of Plant Biologists. All rights reserved.
Cytokine-related genes and oxidation-related genes detected in preeclamptic placentas.

PubMed

Lee, Gui Se Ra; Joe, Yoon Seong; Kim, Sa Jin; Shin, Jong Chul

2010-10-01

To investigate cytokine- and oxidation-related genes for preeclampsia using DNA microarray analysis. Placentas were collected from 13 normal pregnancies and 13 patients with preeclampsia. Gene expression was studied using DNA microarray. Among significantly expressed genes, we focused on genes associated with cytokines and oxidation, and the results were confirmed using quantitative real time-polymerase chain reaction (QRT-PCR). 415 genes out of 30,940 genes were altered by > or =2-fold in the microarray analysis. 121 up-regulated genes and 294 down-regulated genes were found to be in preeclamptic placenta. Six cytokine-related genes and 5 oxidation-related genes were found from among the 121 up-regulated genes. The cytokine-related genes studied included oncostatin M (OSM), fms-related tyrosine kinase (FLT1) and vascular endothelial growth factor A (VEGFA), and the oxidation-related genes studied included spermine oxidase (SMOX), l cytochrome P450, family 26, subfamily A, polypeptide 1 (CYP26A1), acetate dehydrogenase A (LDHA). These six genes were also significantly higher in placentas from patients with preeclampsia than in those from women with normal pregnancies. The placental tissue of patients with preeclampsia showed significantly higher mRNA expression of these six genes than the normal group, using QRT-PCR. DNA microarray analysis is one of the great methods for simultaneously detecting the functionally associated genes of preeclampsia. The cytokine-related genes such as OSM, FLT1 and VEGFA, and the oxidation-related genes such as LDHA, CYP26A1 and SMOX might prove to be the starting point in the elucidation of the pathogenesis of preeclampsia.
Down-Regulation of Gene Expression by RNA-Induced Gene Silencing

NASA Astrophysics Data System (ADS)

Travella, Silvia; Keller, Beat

Down-regulation of endogenous genes via post-transcriptional gene silencing (PTGS) is a key to the characterization of gene function in plants. Many RNA-based silencing mechanisms such as post-transcriptional gene silencing, co-suppression, quelling, and RNA interference (RNAi) have been discovered among species of different kingdoms (plants, fungi, and animals). One of the most interesting discoveries was RNAi, a sequence-specific gene-silencing mechanism initiated by the introduction of double-stranded RNA (dsRNA), homologous in sequence to the silenced gene, which triggers degradation of mRNA. Infection of plants with modified viruses can also induce RNA silencing and is referred to as virus-induced gene silencing (VIGS). In contrast to insertional mutagenesis, these emerging new reverse genetic approaches represent a powerful tool for exploring gene function and for manipulating gene expression experimentally in cereal species such as barley and wheat. We examined how RNAi and VIGS have been used to assess gene function in barley and wheat, including molecular mechanisms involved in the process and available methodological elements, such as vectors, inoculation procedures, and analysis of silenced phenotypes.
Discovering Implicit Entity Relation with the Gene-Citation-Gene Network

PubMed Central

Song, Min; Han, Nam-Gi; Kim, Yong-Hwan; Ding, Ying; Chambers, Tamy

2013-01-01

In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner. PMID:24358368
Endovascular Gene Delivery from a Stent Platform: Gene- Eluting Stents

PubMed Central

Fishbein, Ilia; Chorny, Michael; Adamo, Richard F; Forbes, Scott P; Corrales, Ricardo A; Alferiev, Ivan S; Levy, Robert J

2015-01-01

A synergistic impact of research in the fields of post-angioplasty restenosis, drug-eluting stents and vascular gene therapy over the past 15 years has shaped the concept of gene-eluting stents. Gene-eluting stents hold promise of overcoming some biological and technical problems inherent to drug-eluting stent technology. As the field of gene-eluting stents matures it becomes evident that all three main design modules of a gene-eluting stent: a therapeutic transgene, a vector and a delivery system are equally important for accomplishing sustained inhibition of neointimal formation in arteries treated with gene delivery stents. This review summarizes prior work on stent-based gene delivery and discusses the main optimization strategies required to move the field of gene-eluting stents to clinical translation. PMID:26225356
Androgen-responsive gene database: integrated knowledge on androgen-responsive genes.

PubMed

Jiang, Mei; Ma, Yunsheng; Chen, Congcong; Fu, Xuping; Yang, Shu; Li, Xia; Yu, Guohua; Mao, Yumin; Xie, Yi; Li, Yao

2009-11-01

Androgen signaling plays an important role in many biological processes. Androgen Responsive Gene Database (ARGDB) is devoted to providing integrated knowledge on androgen-controlled genes. Gene records were collected on the basis of PubMed literature collections. More than 6000 abstracts and 950 original publications were manually screened, leading to 1785 human genes, 993 mouse genes, and 583 rat genes finally included in the database. All the collected genes were experimentally proved to be regulated by androgen at the expression level or to contain androgen-responsive regions. For each gene important details of the androgen regulation experiments were collected from references, such as expression change, androgen-responsive sequence, response time, tissue/cell type, experimental method, ligand identity, and androgen amount, which will facilitate further evaluation by researchers. Furthermore, the database was integrated with multiple annotation resources, including National Center for Biotechnology Information, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes pathway, to reveal the biological characteristics and significance of androgen-regulated genes. The ARGDB web site is mainly composed of the Browse, Search, Element Scan, and Submission modules. It is user friendly and freely accessible at http://argdb.fudan.edu.cn. Preliminary analysis of the collected data was performed. Many disease pathways, such as prostate carcinogenesis, were found to be enriched in androgen-regulated genes. The discovered androgen-response motifs were similar to those in previous reports. The analysis results are displayed in the web site. In conclusion, ARGDB provides a unified gateway to storage, retrieval, and update of information on androgen-regulated genes.
Allelic-based gene-gene interaction associated with quantitative traits.

PubMed

Jung, Jeesun; Sun, Bin; Kwon, Deukwoo; Koller, Daniel L; Foroud, Tatiana M

2009-05-01

Recent studies have shown that quantitative phenotypes may be influenced not only by multiple single nucleotide polymorphisms (SNPs) within a gene but also by the interaction between SNPs at unlinked genes. We propose a new statistical approach that can detect gene-gene interactions at the allelic level which contribute to the phenotypic variation in a quantitative trait. By testing for the association of allelic combinations at multiple unlinked loci with a quantitative trait, we can detect the SNP allelic interaction whether or not it can be detected as a main effect. Our proposed method assigns a score to unrelated subjects according to their allelic combination inferred from observed genotypes at two or more unlinked SNPs, and then tests for the association of the allelic score with a quantitative trait. To investigate the statistical properties of the proposed method, we performed a simulation study to estimate type I error rates and power and demonstrated that this allelic approach achieves greater power than the more commonly used genotypic approach to test for gene-gene interaction. As an example, the proposed method was applied to data obtained as part of a candidate gene study of sodium retention by the kidney. We found that this method detects an interaction between the calcium-sensing receptor gene (CaSR), the chloride channel gene (CLCNKB) and the Na, K, 2Cl cotransporter gene (CLC12A1) that contributes to variation in diastolic blood pressure.
A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

PubMed

Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

2015-01-01

Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.
A Hybrid Approach of Gene Sets and Single Genes for the Prediction of Survival Risks with Gene Expression Data

PubMed Central

Seok, Junhee; Davis, Ronald W.; Xiao, Wenzhong

2015-01-01

Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn’t been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge. PMID:25933378
Time-Course Gene Set Analysis for Longitudinal Gene Expression Data

PubMed Central

Hejblum, Boris P.; Skinner, Jason; Thiébaut, Rodolphe

2015-01-01

Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA) introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR) measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial), and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA) for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package. PMID:26111374
GeneMachine: gene prediction and sequence annotation.

PubMed

Makalowska, I; Ryan, J F; Baxevanis, A D

2001-09-01

A number of free-standing programs have been developed in order to help researchers find potential coding regions and deduce gene structure for long stretches of what is essentially 'anonymous DNA'. As these programs apply inherently different criteria to the question of what is and is not a coding region, multiple algorithms should be used in the course of positional cloning and positional candidate projects to assure that all potential coding regions within a previously-identified critical region are identified. We have developed a gene identification tool called GeneMachine which allows users to query multiple exon and gene prediction programs in an automated fashion. BLAST searches are also performed in order to see whether a previously-characterized coding region corresponds to a region in the query sequence. A suite of Perl programs and modules are used to run MZEF, GENSCAN, GRAIL 2, FGENES, RepeatMasker, Sputnik, and BLAST. The results of these runs are then parsed and written into ASN.1 format. Output files can be opened using NCBI Sequin, in essence using Sequin as both a workbench and as a graphical viewer. The main feature of GeneMachine is that the process is fully automated; the user is only required to launch GeneMachine and then open the resulting file with Sequin. Annotations can then be made to these results prior to submission to GenBank, thereby increasing the intrinsic value of these data. GeneMachine is freely-available for download at http://genome.nhgri.nih.gov/genemachine. A public Web interface to the GeneMachine server for academic and not-for-profit users is available at http://genemachine.nhgri.nih.gov. The Web supplement to this paper may be found at http://genome.nhgri.nih.gov/genemachine/supplement/.
Gene-for-gene disease resistance: bridging insect pest and pathogen defense.

PubMed

Kaloshian, Isgouhi

2004-12-01

Active plant defense, also known as gene-for-gene resistance, is triggered when a plant resistance (R) gene recognizes the intrusion of a specific insect pest or pathogen. Activation of plant defense includes an array of physiological and transcriptional reprogramming. During the past decade, a large number of plant R genes that confer resistance to diverse group of pathogens have been cloned from a number of plant species. Based on predicted protein structures, these genes are classified into a small number of groups, indicating that structurally related R genes recognize phylogenetically distinct pathogens. An extreme example is the tomato Mi-1 gene, which confers resistance to potato aphid (Macrosiphum euphorbiae), whitefly (Bemisia tabaci), and root-knot nematodes (Meloidogyne spp.). While Mi-1 remains the only cloned insect R gene, there is evidence that gene-for-gene type of plant defense against piercing-sucking insects exists in a number of plant species.
LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network.

PubMed

Wei, Pi-Jing; Zhang, Di; Xia, Junfeng; Zheng, Chun-Hou

2016-12-23

Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of the next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data etc., can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contributes to tumorigenesis, from millions of functional neutral passenger mutations. In this paper, in order to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. Then we integrated the mutation data and expression data into the gene-gene interaction network. Lastly, greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. Also, it can identify not only frequently mutated drivers, but also rare candidate driver genes.
Identification of gene expression profiles and key genes in subchondral bone of osteoarthritis using weighted gene coexpression network analysis.

PubMed

Guo, Sheng-Min; Wang, Jian-Xiong; Li, Jin; Xu, Fang-Yuan; Wei, Quan; Wang, Hai-Ming; Huang, Hou-Qiang; Zheng, Si-Lin; Xie, Yu-Jie; Zhang, Chi

2018-06-15

Osteoarthritis (OA) significantly influences the quality life of people around the world. It is urgent to find an effective way to understand the genetic etiology of OA. We used weighted gene coexpression network analysis (WGCNA) to explore the key genes involved in the subchondral bone pathological process of OA. Fifty gene expression profiles of GSE51588 were downloaded from the Gene Expression Omnibus database. The OA-associated genes and gene ontologies were acquired from JuniorDoc. Weighted gene coexpression network analysis was used to find disease-related networks based on 21756 gene expression correlation coefficients, hub-genes with the highest connectivity in each module were selected, and the correlation between module eigengene and clinical traits was calculated. The genes in the traits-related gene coexpression modules were subject to functional annotation and pathway enrichment analysis using ClusterProfiler. A total of 73 gene modules were identified, of which, 12 modules were found with high connectivity with clinical traits. Five modules were found with enriched OA-associated genes. Moreover, 310 OA-associated genes were found, and 34 of them were among hub-genes in each module. Consequently, enrichment results indicated some key metabolic pathways, such as extracellular matrix (ECM)-receptor interaction (hsa04512), focal adhesion (hsa04510), the phosphatidylinositol 3'-kinase (PI3K)-Akt signaling pathway (PI3K-AKT) (hsa04151), transforming growth factor beta pathway, and Wnt pathway. We intended to identify some core genes, collagen (COL)6A3, COL6A1, ITGA11, BAMBI, and HCK, which could influence downstream signaling pathways once they were activated. In this study, we identified important genes within key coexpression modules, which associate with a pathological process of subchondral bone in OA. Functional analysis results could provide important information to understand the mechanism of OA. © 2018 Wiley Periodicals, Inc.
GeneSeqToFamily: a Galaxy workflow to find gene families based on the Ensembl Compara GeneTrees pipeline.

PubMed

Thanki, Anil S; Soranzo, Nicola; Haerty, Wilfried; Davey, Robert P

2018-03-01

Gene duplication is a major factor contributing to evolutionary novelty, and the contraction or expansion of gene families has often been associated with morphological, physiological, and environmental adaptations. The study of homologous genes helps us to understand the evolution of gene families. It plays a vital role in finding ancestral gene duplication events as well as identifying genes that have diverged from a common ancestor under positive selection. There are various tools available, such as MSOAR, OrthoMCL, and HomoloGene, to identify gene families and visualize syntenic information between species, providing an overview of syntenic regions evolution at the family level. Unfortunately, none of them provide information about structural changes within genes, such as the conservation of ancestral exon boundaries among multiple genomes. The Ensembl GeneTrees computational pipeline generates gene trees based on coding sequences, provides details about exon conservation, and is used in the Ensembl Compara project to discover gene families. A certain amount of expertise is required to configure and run the Ensembl Compara GeneTrees pipeline via command line. Therefore, we converted this pipeline into a Galaxy workflow, called GeneSeqToFamily, and provided additional functionality. This workflow uses existing tools from the Galaxy ToolShed, as well as providing additional wrappers and tools that are required to run the workflow. GeneSeqToFamily represents the Ensembl GeneTrees pipeline as a set of interconnected Galaxy tools, so they can be run interactively within the Galaxy's user-friendly workflow environment while still providing the flexibility to tailor the analysis by changing configurations and tools if necessary. Additional tools allow users to subsequently visualize the gene families produced by the workflow, using the Aequatus.js interactive tool, which has been developed as part of the Aequatus software project.
5. OVERHEAD VIEW OF GENE CAMP LOOKING SOUTH. GENE PUMP ...

Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

5. OVERHEAD VIEW OF GENE CAMP LOOKING SOUTH. GENE PUMP PLANT IS AT CENTER WITH ADMINISTRATIVE COMPLEX IN FOREGROUND AND RESIDENTIAL AREA BEYOND PLANT. - Gene Pump Plant, South of Gene Wash Reservoir, 2 miles west of Whitsett Pump Plant, Parker Dam, San Bernardino County, CA
Random forests-based differential analysis of gene sets for gene expression data.

PubMed

Hsueh, Huey-Miin; Zhou, Da-Wei; Tsai, Chen-An

2013-04-10

In DNA microarray studies, gene-set analysis (GSA) has become the focus of gene expression data analysis. GSA utilizes the gene expression profiles of functionally related gene sets in Gene Ontology (GO) categories or priori-defined biological classes to assess the significance of gene sets associated with clinical outcomes or phenotypes. Many statistical approaches have been proposed to determine whether such functionally related gene sets express differentially (enrichment and/or deletion) in variations of phenotypes. However, little attention has been given to the discriminatory power of gene sets and classification of patients. In this study, we propose a method of gene set analysis, in which gene sets are used to develop classifications of patients based on the Random Forest (RF) algorithm. The corresponding empirical p-value of an observed out-of-bag (OOB) error rate of the classifier is introduced to identify differentially expressed gene sets using an adequate resampling method. In addition, we discuss the impacts and correlations of genes within each gene set based on the measures of variable importance in the RF algorithm. Significant classifications are reported and visualized together with the underlying gene sets and their contribution to the phenotypes of interest. Numerical studies using both synthesized data and a series of publicly available gene expression data sets are conducted to evaluate the performance of the proposed methods. Compared with other hypothesis testing approaches, our proposed methods are reliable and successful in identifying enriched gene sets and in discovering the contributions of genes within a gene set. The classification results of identified gene sets can provide an valuable alternative to gene set testing to reveal the unknown, biologically relevant classes of samples or patients. In summary, our proposed method allows one to simultaneously assess the discriminatory ability of gene sets and the importance of genes for

Bacterial avirulence genes.

PubMed

Leach, J E; White, F F

1996-01-01

Although more than 30 bacterial avirulence genes have been cloned and characterized, the function of the gene products in the elictitation of resistance is unknown in all cases but one. The product of avrD from Pseudomonas syringae pv. glycinea likely functions indirectly to elicit resistance in soybean, that is, evidence suggests the gene product is an enzyme involved in elicitor production. In most if not all cases, bacterial avirulence gene function is dependent on interactions with the hypersensitive response and pathogenicity (hrp) genes. Many hrp genes are similar to genes involved in delivery of pathogenicity factors in mammalian bacterial pathogens. Thus, analogies between mammalian and plant pathogens may provide needed clues to elucidate how virulence gene products control induction of resistance.
Direct protein interaction underlies gene-for-gene specificity and coevolution of the flax resistance genes and flax rust avirulence genes

PubMed Central

Dodds, Peter N.; Lawrence, Gregory J.; Catanzariti, Ann-Maree; Teh, Trazel; Wang, Ching-I. A.; Ayliffe, Michael A.; Kobe, Bostjan; Ellis, Jeffrey G.

2006-01-01

Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R–Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R–Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R–Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant–pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes. PMID:16731621
Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms[OPEN

PubMed Central

Li, Zhen; Van de Peer, Yves; De Smet, Riet

2016-01-01

Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of “gene duplicability” is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. PMID:26744215
Genes Downregulated in Endometriosis Are Located Near the Known Imprinting Genes

PubMed Central

Higashiura, Yumi; Koike, Natsuki; Akasaka, Juria; Uekuri, Chiharu; Iwai, Kana; Niiro, Emiko; Morioka, Sachiko; Yamada, Yuki

2014-01-01

There is now accumulating evidence that endometriosis is a disease associated with an epigenetic disorder. Genomic imprinting is an epigenetic phenomenon known to regulate DNA methylation of either maternal or paternal alleles. We hypothesize that hypermethylated endometriosis-associated genes may be enriched at imprinted gene loci. We sought to determine whether downregulated genes associated with endometriosis susceptibility are associated with chromosomal location of the known paternally and maternally expressed imprinting genes. Gene information has been gathered from National Center for Biotechnology Information database geneimprint.com. Several researchers have identified specific loci with strong DNA methylation in eutopic endometrium and ectopic lesion with endometriosis. Of the 29 hypermethylated genes in endometriosis, 19 genes were located near 45 known imprinted foci. There may be an association of the genomic location between genes specifically downregulated in endometriosis and epigenetically imprinted genes. PMID:24615936
Magnetic nanoparticles: Applications in gene delivery and gene therapy.

PubMed

Majidi, Sima; Zeinali Sehrig, Fatemeh; Samiei, Mohammad; Milani, Morteza; Abbasi, Elham; Dadashzadeh, Kianoosh; Akbarzadeh, Abolfazl

2016-06-01

Gene therapy is defined as the direct transfer of genetic material to tissues or cells for the treatment of inherited disorders and acquired diseases. For gene delivery, magnetic nanoparticles (MNPs) are typically combined with a delivery platform to encapsulate the gene, and promote cell uptake. Delivery technologies that have been used with MNPs contain polymeric, viral, as well as non-viral platforms. In this review, we focus on targeted gene delivery using MNPs.
Effect of the absolute statistic on gene-sampling gene-set analysis methods.

PubMed

Nam, Dougu

2017-06-01

Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.
Human Gene Therapy: Genes without Frontiers?

ERIC Educational Resources Information Center

Simon, Eric J.

2002-01-01

Describes the latest advancements and setbacks in human gene therapy to provide reference material for biology teachers to use in their science classes. Focuses on basic concepts such as recombinant DNA technology, and provides examples of human gene therapy such as severe combined immunodeficiency syndrome, familial hypercholesterolemia, and…
Gene function prediction with gene interaction networks: a context graph kernel approach.

PubMed

Li, Xin; Chen, Hsinchun; Li, Jiexun; Zhang, Zhu

2010-01-01

Predicting gene functions is a challenge for biologists in the postgenomic era. Interactions among genes and their products compose networks that can be used to infer gene functions. Most previous studies adopt a linkage assumption, i.e., they assume that gene interactions indicate functional similarities between connected genes. In this study, we propose to use a gene's context graph, i.e., the gene interaction network associated with the focal gene, to infer its functions. In a kernel-based machine-learning framework, we design a context graph kernel to capture the information in context graphs. Our experimental study on a testbed of p53-related genes demonstrates the advantage of using indirect gene interactions and shows the empirical superiority of the proposed approach over linkage-assumption-based methods, such as the algorithm to minimize inconsistent connected genes and diffusion kernels.
The limitations of simple gene set enrichment analysis assuming gene independence.

PubMed

Tamayo, Pablo; Steinhardt, George; Liberzon, Arthur; Mesirov, Jill P

2016-02-01

Since its first publication in 2003, the Gene Set Enrichment Analysis method, based on the Kolmogorov-Smirnov statistic, has been heavily used, modified, and also questioned. Recently a simplified approach using a one-sample t-test score to assess enrichment and ignoring gene-gene correlations was proposed by Irizarry et al. 2009 as a serious contender. The argument criticizes Gene Set Enrichment Analysis's nonparametric nature and its use of an empirical null distribution as unnecessary and hard to compute. We refute these claims by careful consideration of the assumptions of the simplified method and its results, including a comparison with Gene Set Enrichment Analysis's on a large benchmark set of 50 datasets. Our results provide strong empirical evidence that gene-gene correlations cannot be ignored due to the significant variance inflation they produced on the enrichment scores and should be taken into account when estimating gene set enrichment significance. In addition, we discuss the challenges that the complex correlation structure and multi-modality of gene sets pose more generally for gene set enrichment methods. © The Author(s) 2012.
Simple F Test Reveals Gene-Gene Interactions in Case-Control Studies

PubMed Central

Chen, Guanjie; Yuan, Ao; Zhou, Jie; Bentley, Amy R.; Adeyemo, Adebowale; Rotimi, Charles N.

2012-01-01

Missing heritability is still a challenge for Genome Wide Association Studies (GWAS). Gene-gene interactions may partially explain this residual genetic influence and contribute broadly to complex disease. To analyze the gene-gene interactions in case-control studies of complex disease, we propose a simple, non-parametric method that utilizes the F-statistic. This approach consists of three steps. First, we examine the joint distribution of a pair of SNPs in cases and controls separately. Second, an F-test is used to evaluate the ratio of dependence in cases to that of controls. Finally, results are adjusted for multiple tests. This method was used to evaluate gene-gene interactions that are associated with risk of Type 2 Diabetes among African Americans in the Howard University Family Study. We identified 18 gene-gene interactions (P < 0.0001). Compared with the commonly-used logistical regression method, we demonstrate that the F-ratio test is an efficient approach to measuring gene-gene interactions, especially for studies with limited sample size. PMID:22837643
Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

PubMed

Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

2018-02-23

Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.
Transcriptional Coupling of Neighboring Genes and Gene Expression Noise: Evidence that Gene Orientation and Noncoding Transcripts Are Modulators of Noise

PubMed Central

Wang, Guang-Zhong; Lercher, Martin J.; Hurst, Laurence D.

2011-01-01

Abstract How is noise in gene expression modulated? Do mechanisms of noise control impact genome organization? In yeast, the expression of one gene can affect that of a very close neighbor. As the effect is highly regionalized, we hypothesize that genes in different orientations will have differing degrees of coupled expression and, in turn, different noise levels. Divergently organized gene pairs, in particular those with bidirectional promoters, have close promoters, maximizing the likelihood that expression of one gene affects the neighbor. With more distant promoters, the same is less likely to hold for gene pairs in nondivergent orientation. Stochastic models suggest that coupled chromatin dynamics will typically result in low abundance-corrected noise (ACN). Transcription of noncoding RNA (ncRNA) from a bidirectional promoter, we thus hypothesize to be a noise-reduction, expression-priming, mechanism. The hypothesis correctly predicts that protein-coding genes with a bidirectional promoter, including those with a ncRNA partner, have lower ACN than other genes and divergent gene pairs uniquely have correlated ACN. Moreover, as predicted, ACN increases with the distance between promoters. The model also correctly predicts ncRNA transcripts to be often divergently transcribed from genes that a priori would be under selection for low noise (essential genes, protein complex genes) and that the latter genes should commonly reside in divergent orientation. Likewise, that genes with bidirectional promoters are rare subtelomerically, cluster together, and are enriched in essential gene clusters is expected and observed. We conclude that gene orientation and transcription of ncRNAs are candidate modulators of noise. PMID:21402863
Reranking candidate gene models with cross-species comparison for improved gene prediction

PubMed Central

Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S

2008-01-01

Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
A Nonlinear Model for Gene-Based Gene-Environment Interaction.

PubMed

Sa, Jian; Liu, Xu; He, Tao; Liu, Guifen; Cui, Yuehua

2016-06-04

A vast amount of literature has confirmed the role of gene-environment (G×E) interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP) and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects) are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single variant based approach, in this work, we proposed a sparse principle component regression (sPCR) model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC) model. The model can jointly model variants in a gene in which their effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR) model has nice interpretation property since the sparsity on the principal component loadings can tell the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset in Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.
Compare Gene Calls

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ecale Zhou, Carol L.

2016-07-05

Compare Gene Calls (CGC) is a Python code used for combining and comparing gene calls from any number of gene callers. A gene caller is a computer program that predicts the extends of open reading frames within genomes of biological organisms.
Gene-breaking: A new paradigm for human retrotransposon-mediated gene evolution

PubMed Central

Wheelan, Sarah J.; Aizawa, Yasunori; Han, Jeffrey S.; Boeke, Jef D.

2005-01-01

The L1 retrotransposon is the most highly successful autonomous retrotransposon in mammals. This prolific genome parasite may on occasion benefit its host through genome rearrangements or adjustments of host gene expression. In examining possible effects of L1 elements on host gene expression, we investigated whether a full-length L1 element inserted in the antisense orientation into an intron of a cellular gene may actually split the gene's transcript into two smaller transcripts: (1) a transcript containing the upstream exons and terminating in the major antisense polyadenylation site (MAPS) of the L1, and (2) a transcript derived from the L1 antisense promoter (ASP) that includes the downstream exons of the gene. Bioinformatic analysis and experimental follow-up provide evidence for this L1 “gene-breaking” hypothesis. We identified three human genes apparently “broken” by L1 elements, as well as 12 more candidate genes. Most of the inserted L1 elements in our 15 candidate genes predate the human/chimp divergence. If indeed split, the transcripts of these genes may in at least one case encode potentially interacting proteins, and in another case may encode novel proteins. Gene-breaking represents a new mechanism through which L1 elements remodel mammalian genomes. PMID:16024818
The Ocean Gene Atlas: exploring the biogeography of plankton genes online.

PubMed

Villar, Emilie; Vannier, Thomas; Vernette, Caroline; Lescot, Magali; Cuenca, Miguelangel; Alexandre, Aurélien; Bachelerie, Paul; Rosnet, Thomas; Pelletier, Eric; Sunagawa, Shinichi; Hingamp, Pascal

2018-05-21

The Ocean Gene Atlas is a web service to explore the biogeography of genes from marine planktonic organisms. It allows users to query protein or nucleotide sequences against global ocean reference gene catalogs. With just one click, the abundance and location of target sequences are visualized on world maps as well as their taxonomic distribution. Interactive results panels allow for adjusting cutoffs for alignment quality and displaying the abundances of genes in the context of environmental features (temperature, nutrients, etc.) measured at the time of sampling. The ease of use enables non-bioinformaticians to explore quantitative and contextualized information on genes of interest in the global ocean ecosystem. Currently the Ocean Gene Atlas is deployed with (i) the Ocean Microbial Reference Gene Catalog (OM-RGC) comprising 40 million non-redundant mostly prokaryotic gene sequences associated with both Tara Oceans and Global Ocean Sampling (GOS) gene abundances and (ii) the Marine Atlas of Tara Ocean Unigenes (MATOU) composed of >116 million eukaryote unigenes. Additional datasets will be added upon availability of further marine environmental datasets that provide the required complement of sequence assemblies, raw reads and contextual environmental parameters. Ocean Gene Atlas is a freely-available web service at: http://tara-oceans.mio.osupytheas.fr/ocean-gene-atlas/.
The human RHOX gene cluster: target genes and functional analysis of gene variants in infertile men.

PubMed

Borgmann, Jennifer; Tüttelmann, Frank; Dworniczak, Bernd; Röpke, Albrecht; Song, Hye-Won; Kliesch, Sabine; Wilkinson, Miles F; Laurentino, Sandra; Gromoll, Jörg

2016-11-15

The X-linked reproductive homeobox (RHOX) gene cluster encodes transcription factors preferentially expressed in reproductive tissues. This gene cluster has important roles in male fertility based on phenotypic defects of Rhox-mutant mice and the finding that aberrant RHOX promoter methylation is strongly associated with abnormal human sperm parameters. However, little is known about the molecular mechanism of RHOX function in humans. Using gene expression profiling, we identified genes regulated by members of the human RHOX gene cluster. Some genes were uniquely regulated by RHOXF1 or RHOXF2/2B, while others were regulated by both of these transcription factors. Several of these regulated genes encode proteins involved in processes relevant to spermatogenesis; e.g. stress protection and cell survival. One of the target genes of RHOXF2/2B is RHOXF1, suggesting cross-regulation to enhance transcriptional responses. The potential role of RHOX in human infertility was addressed by sequencing all RHOX exons in a group of 250 patients with severe oligozoospermia. This revealed two mutations in RHOXF1 (c.515G > A and c.522C > T) and four in RHOXF2/2B (-73C > G, c.202G > A, c.411C > T and c.679G > A), of which only one (c.202G > A) was found in a control group of men with normal sperm concentration. Functional analysis demonstrated that c.202G > A and c.679G > A significantly impaired the ability of RHOXF2/2B to regulate downstream genes. Molecular modelling suggested that these mutations alter RHOXF2/F2B protein conformation. By combining clinical data with in vitro functional analysis, we demonstrate how the X-linked RHOX gene cluster may function in normal human spermatogenesis and we provide evidence that it is impaired in human male fertility.
Linking genes to diseases with a SNPedia-Gene Wiki mashup

PubMed Central

2012-01-01

Background A variety of topic-focused wikis are used in the biomedical sciences to enable the mass-collaborative synthesis and distribution of diverse bodies of knowledge. To address complex problems such as defining the relationships between genes and disease, it is important to bring the knowledge from many different domains together. Here we show how advances in wiki technology and natural language processing can be used to automatically assemble ‘meta-wikis’ that present integrated views over the data collaboratively created in multiple source wikis. Results We produced a semantic meta-wiki called the Gene Wiki+ that automatically mirrors and integrates data from the Gene Wiki and SNPedia. The Gene Wiki+, available at (http://genewikiplus.org/), captures 8,047 distinct gene-disease relationships. SNPedia accounts for 4,149 of the gene-disease pairs, the Gene Wiki provides 4,377 and only 479 appear independently in both sources. All of this content is available to query and browse and is provided as linked open data. Conclusions Wikis contain increasing amounts of diverse, biological information useful for elucidating the connections between genes and disease. The Gene Wiki+ shows how wiki technology can be used in concert with natural language processing to provide integrated views over diverse underlying data sources. PMID:22541597
Immunoglobulin λ Gene Rearrangement Can Precede κ Gene Rearrangement

DOE PAGES

Berg, Jörg; Mcdowell, Mindy; Jäck, Hans-Martin; ...

1990-01-01

Imore » mmunoglobulin genes are generated during differentiation of B lymphocytes by joining gene segments. A mouse pre-B cell contains a functional immunoglobulin heavy-chain gene, but no light-chain gene. Although there is only one heavy-chain locus, there are two lightchain loci: κ and λ .t has been reported that κ loci in the germ-line configuration are never (in man) or very rarely (in the mouse) present in cells with functionally rearranged λ -chain genes. Two explanations have been proposed to explain this: (a) the ordered rearrangement theory, which postulates that light-chain gene rearrangement in the pre-B cell is first attempted at the κ locus, and that only upon failure to produce a functional κ chain is there an attempt to rearrange the λ locus; and (b) the stochastic theory, which postulates that rearrangement at the λ locus proceeds at a rate that is intrinsically much slower than that at the κ locus. We show here that λ -chain genes are generated whether or not the κ locus has lost its germ-line arrangement, a result that is compatible only with the stochastic theory.« less

Evolution of homeobox genes.

PubMed

Holland, Peter W H

2013-01-01

Many homeobox genes encode transcription factors with regulatory roles in animal and plant development. Homeobox genes are found in almost all eukaryotes, and have diversified into 11 gene classes and over 100 gene families in animal evolution, and 10 to 14 gene classes in plants. The largest group in animals is the ANTP class which includes the well-known Hox genes, plus other genes implicated in development including ParaHox (Cdx, Xlox, Gsx), Evx, Dlx, En, NK4, NK3, Msx, and Nanog. Genomic data suggest that the ANTP class diversified by extensive tandem duplication to generate a large array of genes, including an NK gene cluster and a hypothetical ProtoHox gene cluster that duplicated to generate Hox and ParaHox genes. Expression and functional data suggest that NK, Hox, and ParaHox gene clusters acquired distinct roles in patterning the mesoderm, nervous system, and gut. The PRD class is also diverse and includes Pax2/5/8, Pax3/7, Pax4/6, Gsc, Hesx, Otx, Otp, and Pitx genes. PRD genes are not generally arranged in ancient genomic clusters, although the Dux, Obox, and Rhox gene clusters arose in mammalian evolution as did several non-clustered PRD genes. Tandem duplication and genome duplication expanded the number of homeobox genes, possibly contributing to the evolution of developmental complexity, but homeobox gene loss must not be ignored. Evolutionary changes to homeobox gene expression have also been documented, including Hox gene expression patterns shifting in concert with segmental diversification in vertebrates and crustaceans, and deletion of a Pitx1 gene enhancer in pelvic-reduced sticklebacks. WIREs Dev Biol 2013, 2:31-45. doi: 10.1002/wdev.78 For further resources related to this article, please visit the WIREs website. The author declares that he has no conflicts of interest. Copyright © 2012 Wiley Periodicals, Inc.
Horizontal acquisition of multiple mitochondrial genes from a parasitic plant followed by gene conversion with host mitochondrial genes

PubMed Central

2010-01-01

Background Horizontal gene transfer (HGT) is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR) survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR) were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT)-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. The discovery of gene conversion between co-resident foreign and native mitochondrial copies suggests
Human AZU-1 gene, variants thereof and expressed gene products

DOEpatents

Chen, Huei-Mei; Bissell, Mina

2004-06-22

A human AZU-1 gene, mutants, variants and fragments thereof. Protein products encoded by the AZU-1 gene and homologs encoded by the variants of AZU-1 gene acting as tumor suppressors or markers of malignancy progression and tumorigenicity reversion. Identification, isolation and characterization of AZU-1 and AZU-2 genes localized to a tumor suppressive locus at chromosome 10q26, highly expressed in nonmalignant and premalignant cells derived from a human breast tumor progression model. A recombinant full length protein sequences encoded by the AZU-1 gene and nucleotide sequences of AZU-1 and AZU-2 genes and variant and fragments thereof. Monoclonal or polyclonal antibodies specific to AZU-1, AZU-2 encoded protein and to AZU-1, or AZU-2 encoded protein homologs.
The drug target genes show higher evolutionary conservation than non-target genes.

PubMed

Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

2016-01-26

Although evidence indicates that drug target genes share some common evolutionary features, there have been few studies analyzing evolutionary features of drug targets from an overall level. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. Then we got four results as following: compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.
Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

PubMed Central

Xu, Pingzhen

2018-01-01

Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
Novel gene sets improve set-level classification of prokaryotic gene expression data.

PubMed

Holec, Matěj; Kuželka, Ondřej; Železný, Filip

2015-10-28

Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.
[Gene doping: gene transfer and possible molecular detection].

PubMed

Argüelles, Carlos Francisco; Hernández-Zamora, Edgar

2007-01-01

The use of illegal substances in sports to enhance athletic performance during competition has caused international sports organizations such as the COI and WADA to take anti doping measures. A new doping method know as gene doping is defined as "the non-therapeutic use of genes, genetic elements and/or cells that have the capacity to enhance athletic performance". However, gene doping in sports is not easily identified and can cause serious consequences. Molecular biology techniques are needed in order to distinguish the difference between a "normal" and an "altered" genome. Further, we need to develop new analytic methods and biological molecular techniques in anti-doping laboratories, and design programs that avoid the non therapeutic use of genes.
Progress in gene targeting and gene therapy for retinitis pigmentosa

DOE Office of Scientific and Technical Information (OSTI.GOV)

Farrar, G.J.; Humphries, M.M.; Erven, A.

1994-09-01

Previously, we localized disease genes involved in retinitis pigmentosa (RP), an inherited retinal degeneration, close to the rhodopsin and peripherin genes on 3q and 6p. Subsequently, we and others identified mutations in these genes in RP patients. Currently animal models for human retinopathies are being generated using gene targeting by homologous recombination in embryonic stem (ES) cells. Genomic clones for retinal genes including rhodopsin and peripherin have been obtained from a phage library carrying mouse DNA isogenic with the ES cell line (CC1.2). The peripherin clone has been sequenced to establish the genomic structure of the mouse gene. Targeting vectorsmore » for rhodopsin and peripherin including a neomycin cassette for positive selection and thymidine kinase genes enabling selection against random intergrants are under construction. Progress in vector construction will be presented. Simultaneously we are developing systems for delivery of gene therapies to retinal tissues utilizing replication-deficient adenovirus (Ad5). Efficacy of infection subsequent to various methods of intraocular injection and with varying viral titers is being assayed using an adenovirus construct containing a CMV promoter LacZ fusion as reporter and the range of tissues infected and the level of duration of LacZ expression monitored. Viral constructs with the LacZ reporter gene under the control of retinal specific promoters such as rhodopsin and IRBP cloned into pXCJL.1 are under construction. An update on developments in photoreceptor cell-directed expression of virally delivered genes will be presented.« less
Analysis of bHLH coding genes using gene co-expression network approach.

PubMed

Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

2016-07-01

Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrices for module evolution. These could be used to identify module of highly connected genes showing variation in co-expression network. In this study, a co-expression network-based approach was used for analyzing the genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were considered to create and analyze a gene co-expression network. The analysis provide highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation of genes expression, according to the time period of stress through co-expression network approach. As the result, the seed genes were identified showing multiple connections with other genes in the same cluster. Seed genes were found to be vary in different time periods of stress. These analyzed seed genes may be utilized further as marker genes for developing the stress tolerant plant species.
Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets.

PubMed

Springer, Mark S; Gatesy, John

2018-02-26

coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset-the 'recombination ratchet'-is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d'etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent methods are powerful
Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets

PubMed Central

Springer, Mark S.; Gatesy, John

2018-01-01

Summary coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset—the ‘recombination ratchet’—is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d’etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent
Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions.

PubMed

Chatterjee, Nilanjan; Kalaylioglu, Zeynep; Moslehi, Roxana; Peters, Ulrike; Wacholder, Sholom

2006-12-01

In modern genetic epidemiology studies, the association between the disease and a genomic region, such as a candidate gene, is often investigated using multiple SNPs. We propose a multilocus test of genetic association that can account for genetic effects that might be modified by variants in other genes or by environmental factors. We consider use of the venerable and parsimonious Tukey's 1-degree-of-freedom model of interaction, which is natural when individual SNPs within a gene are associated with disease through a common biological mechanism; in contrast, many standard regression models are designed as if each SNP has unique functional significance. On the basis of Tukey's model, we propose a novel but computationally simple generalized test of association that can simultaneously capture both the main effects of the variants within a genomic region and their interactions with the variants in another region or with an environmental exposure. We compared performance of our method with that of two standard tests of association, one ignoring gene-gene/gene-environment interactions and the other based on a saturated model of interactions. We demonstrate major power advantages of our method both in analysis of data from a case-control study of the association between colorectal adenoma and DNA variants in the NAT2 genomic region, which are well known to be related to a common biological phenotype, and under different models of gene-gene interactions with use of simulated data.
FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data.

PubMed

Manijak, Mieszko P; Nielsen, Henrik B

2011-06-11

Although, systematic analysis of gene annotation is a powerful tool for interpreting gene expression data, it sometimes is blurred by incomplete gene annotation, missing expression response of key genes and secondary gene expression responses. These shortcomings may be partially circumvented by instead matching gene expression signatures to signatures of other experiments. To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700 Arabidopsis microarray experiments. Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/.
Bayesian Variable Selection for Hierarchical Gene-Environment and Gene-Gene Interactions

PubMed Central

Liu, Changlu; Ma, Jianzhong; Amos, Christopher I.

2014-01-01

We propose a Bayesian hierarchical mixture model framework that allows us to investigate the genetic and environmental effects, gene by gene interactions and gene by environment interactions in the same model. Our approach incorporates the natural hierarchical structure between the main effects and interaction effects into a mixture model, such that our methods tend to remove the irrelevant interaction effects more effectively, resulting in more robust and parsimonious models. We consider both strong and weak hierarchical models. For a strong hierarchical model, both of the main effects between interacting factors must be present for the interactions to be considered in the model development, while for a weak hierarchical model, only one of the two main effects is required to be present for the interaction to be evaluated. Our simulation results show that the proposed strong and weak hierarchical mixture models work well in controlling false positive rates and provide a powerful approach for identifying the predisposing effects and interactions in gene-environment interaction studies, in comparison with the naive model that does not impose this hierarchical constraint in most of the scenarios simulated. We illustrated our approach using data for lung cancer and cutaneous melanoma. PMID:25154630
4. AERIAL VIEW OF GENE WASH RESERVOIR AND GENE CAMP ...

Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

4. AERIAL VIEW OF GENE WASH RESERVOIR AND GENE CAMP LOOKING SOUTHWEST. DAM AND SPILLWAY VISIBLE IN BOTTOM OF PHOTO. - Gene Wash Reservoir & Dam, 2 miles west of Parker Dam, Parker Dam, San Bernardino County, CA
Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization.

PubMed

Jung, Sang-Kyu; McDonald, Karen

2011-08-16

Direct gene synthesis is becoming more popular owing to decreases in gene synthesis pricing. Compared with using natural genes, gene synthesis provides a good opportunity to optimize gene sequence for specific applications. In order to facilitate gene optimization, we have developed a stand-alone software called Visual Gene Developer. The software not only provides general functions for gene analysis and optimization along with an interactive user-friendly interface, but also includes unique features such as programming capability, dedicated mRNA secondary structure prediction, artificial neural network modeling, network & multi-threaded computing, and user-accessible programming modules. The software allows a user to analyze and optimize a sequence using main menu functions or specialized module windows. Alternatively, gene optimization can be initiated by designing a gene construct and configuring an optimization strategy. A user can choose several predefined or user-defined algorithms to design a complicated strategy. The software provides expandable functionality as platform software supporting module development using popular script languages such as VBScript and JScript in the software programming environment. Visual Gene Developer is useful for both researchers who want to quickly analyze and optimize genes, and those who are interested in developing and testing new algorithms in bioinformatics. The software is available for free download at http://www.visualgenedeveloper.net.
Gene essentiality, conservation index and co-evolution of genes in cyanobacteria.

PubMed

Tiruveedula, Gopi Siva Sai; Wangikar, Pramod P

2017-01-01

Cyanobacteria, a group of photosynthetic prokaryotes, dominate the earth with ~ 1015 g wet biomass. Despite diversity in habitats and an ancient origin, cyanobacterial phylum has retained a significant core genome. Cyanobacteria are being explored for direct conversion of solar energy and carbon dioxide into biofuels. For this, efficient cyanobacterial strains will need to be designed via metabolic engineering. This will require identification of target knockouts to channelize the flow of carbon toward the product of interest while minimizing deletions of essential genes. We propose "Gene Conservation Index" (GCI) as a quick measure to predict gene essentiality in cyanobacteria. GCI is based on phylogenetic profile of a gene constructed with a reduced dataset of cyanobacterial genomes. GCI is the percentage of organism clusters in which the query gene is present in the reduced dataset. Of the 750 genes deemed to be essential in the experimental study on S. elongatus PCC 7942, we found 494 to be conserved across the phylum which largely comprise of the essential metabolic pathways. On the contrary, the conserved but non-essential genes broadly comprise of genes required under stress conditions. Exceptions to this rule include genes such as the glycogen synthesis and degradation enzymes, deoxyribose-phosphate aldolase (DERA), glucose-6-phosphate 1-dehydrogenase (zwf) and fructose-1,6-bisphosphatase class1, which are conserved but non-essential. While the essential genes are to be avoided during gene knockout studies as potentially lethal deletions, the non-essential but conserved set of genes could be interesting targets for metabolic engineering. Further, we identify clusters of co-evolving genes (CCG), which provide insights that may be useful in annotation. Principal component analysis (PCA) plots of the CCGs are demonstrated as data visualization tools that are complementary to the conventional heatmaps. Our dataset consists of phylogenetic profiles for 23
Testing Gene-Gene Interactions in the Case-Parents Design

PubMed Central

Yu, Zhaoxia

2011-01-01

The case-parents design has been widely used to detect genetic associations as it can prevent spurious association that could occur in population-based designs. When examining the effect of an individual genetic locus on a disease, logistic regressions developed by conditioning on parental genotypes provide complete protection from spurious association caused by population stratification. However, when testing gene-gene interactions, it is unknown whether conditional logistic regressions are still robust. Here we evaluate the robustness and efficiency of several gene-gene interaction tests that are derived from conditional logistic regressions. We found that in the presence of SNP genotype correlation due to population stratification or linkage disequilibrium, tests with incorrectly specified main-genetic-effect models can lead to inflated type I error rates. We also found that a test with fully flexible main genetic effects always maintains correct test size and its robustness can be achieved with negligible sacrifice of its power. When testing gene-gene interactions is the focus, the test allowing fully flexible main effects is recommended to be used. PMID:21778736
GeneNetFinder2: Improved Inference of Dynamic Gene Regulatory Relations with Multiple Regulators.

PubMed

Han, Kyungsook; Lee, Jeonghoon

2016-01-01

A gene involved in complex regulatory interactions may have multiple regulators since gene expression in such interactions is often controlled by more than one gene. Another thing that makes gene regulatory interactions complicated is that regulatory interactions are not static, but change over time during the cell cycle. Most research so far has focused on identifying gene regulatory relations between individual genes in a particular stage of the cell cycle. In this study we developed a method for identifying dynamic gene regulations of several types from the time-series gene expression data. The method can find gene regulations with multiple regulators that work in combination or individually as well as those with single regulators. The method has been implemented as the second version of GeneNetFinder (hereafter called GeneNetFinder2) and tested on several gene expression datasets. Experimental results with gene expression data revealed the existence of genes that are not regulated by individual genes but rather by a combination of several genes. Such gene regulatory relations cannot be found by conventional methods. Our method finds such regulatory relations as well as those with multiple, independent regulators or single regulators, and represents gene regulatory relations as a dynamic network in which different gene regulatory relations are shown in different stages of the cell cycle. GeneNetFinder2 is available at http://bclab.inha.ac.kr/GeneNetFinder and will be useful for modeling dynamic gene regulations with multiple regulators.
Validation of reference genes for quantitative gene expression analysis in experimental epilepsy.

PubMed

Sadangi, Chinmaya; Rosenow, Felix; Norwood, Braxton A

2017-12-01

To grasp the molecular mechanisms and pathophysiology underlying epilepsy development (epileptogenesis) and epilepsy itself, it is important to understand the gene expression changes that occur during these phases. Quantitative real-time polymerase chain reaction (qPCR) is a technique that rapidly and accurately determines gene expression changes. It is crucial, however, that stable reference genes are selected for each experimental condition to ensure that accurate values are obtained for genes of interest. If reference genes are unstably expressed, this can lead to inaccurate data and erroneous conclusions. To date, epilepsy studies have used mostly single, nonvalidated reference genes. This is the first study to systematically evaluate reference genes in male Sprague-Dawley rat models of epilepsy. We assessed 15 potential reference genes in hippocampal tissue obtained from 2 different models during epileptogenesis, 1 model during chronic epilepsy, and a model of noninjurious seizures. Reference gene ranking varied between models and also differed between epileptogenesis and chronic epilepsy time points. There was also some variance between the four mathematical models used to rank reference genes. Notably, we found novel reference genes to be more stably expressed than those most often used in experimental epilepsy studies. The consequence of these findings is that reference genes suitable for one epilepsy model may not be appropriate for others and that reference genes can change over time. It is, therefore, critically important to validate potential reference genes before using them as normalizing factors in expression analysis in order to ensure accurate, valid results. © 2017 Wiley Periodicals, Inc.

BioGPS and MyGene.info: organizing online, gene-centric information.

PubMed

Wu, Chunlei; Macleod, Ian; Su, Andrew I

2013-01-01

Fast-evolving technologies have enabled researchers to easily generate data at genome scale, and using these technologies to compare biological states typically results in a list of candidate genes. Researchers are then faced with the daunting task of prioritizing these candidate genes for follow-up studies. There are hundreds, possibly even thousands, of web-based gene annotation resources available, but it quickly becomes impractical to manually access and review all of these sites for each gene in a candidate gene list. BioGPS (http://biogps.org) was created as a centralized gene portal for aggregating distributed gene annotation resources, emphasizing community extensibility and user customizability. BioGPS serves as a convenient tool for users to access known gene-centric resources, as well as a mechanism to discover new resources that were previously unknown to the user. This article describes updates to BioGPS made after its initial release in 2008. We summarize recent additions of features and data, as well as the robust user activity that underlies this community intelligence application. Finally, we describe MyGene.info (http://mygene.info) and related web services that provide programmatic access to BioGPS.
Fuzzy measures on the Gene Ontology for gene product similarity.

PubMed

Popescu, Mihail; Keller, James M; Mitchell, Joyce A

2006-01-01

One of the most important objects in bioinformatics is a gene product (protein or RNA). For many gene products, functional information is summarized in a set of Gene Ontology (GO) annotations. For these genes, it is reasonable to include similarity measures based on the terms found in the GO or other taxonomy. In this paper, we introduce several novel measures for computing the similarity of two gene products annotated with GO terms. The fuzzy measure similarity (FMS) has the advantage that it takes into consideration the context of both complete sets of annotation terms when computing the similarity between two gene products. When the two gene products are not annotated by common taxonomy terms, we propose a method that avoids a zero similarity result. To account for the variations in the annotation reliability, we propose a similarity measure based on the Choquet integral. These similarity measures provide extra tools for the biologist in search of functional information for gene products. The initial testing on a group of 194 sequences representing three proteins families shows a higher correlation of the FMS and Choquet similarities to the BLAST sequence similarities than the traditional similarity measures such as pairwise average or pairwise maximum.
Speciation genes in plants

PubMed Central

Rieseberg, Loren H.; Blackman, Benjamin K.

2010-01-01

Background Analyses of speciation genes – genes that contribute to the cessation of gene flow between populations – can offer clues regarding the ecological settings, evolutionary forces and molecular mechanisms that drive the divergence of populations and species. This review discusses the identities and attributes of genes that contribute to reproductive isolation (RI) in plants, compares them with animal speciation genes and investigates what these genes can tell us about speciation. Scope Forty-one candidate speciation genes were identified in the plant literature. Of these, seven contributed to pre-pollination RI, one to post-pollination, prezygotic RI, eight to hybrid inviability, and 25 to hybrid sterility. Genes, gene families and genetic pathways that were frequently found to underlie the evolution of RI in different plant groups include the anthocyanin pathway and its regulators (pollinator isolation), S RNase-SI genes (unilateral incompatibility), disease resistance genes (hybrid necrosis), chimeric mitochondrial genes (cytoplasmic male sterility), and pentatricopeptide repeat family genes (cytoplasmic male sterility). Conclusions The most surprising conclusion from this review is that identities of genes underlying both prezygotic and postzygotic RI are often predictable in a broad sense from the phenotype of the reproductive barrier. Regulatory changes (both cis and trans) dominate the evolution of pre-pollination RI in plants, whereas a mix of regulatory mutations and changes in protein-coding genes underlie intrinsic postzygotic barriers. Also, loss-of-function mutations and copy number variation frequently contribute to RI. Although direct evidence of positive selection on speciation genes is surprisingly scarce in plants, analyses of gene family evolution, along with theoretical considerations, imply an important role for diversifying selection and genetic conflict in the evolution of RI. Unlike in animals, however, most candidate speciation
Gene and enhancer trap tagging of vascular-expressed genes in poplar trees

Treesearch

Andrew Groover; Joseph R. Fontana; Gayle Dupper; Caiping Ma; Robert Martienssen; Steven Strauss; Richard Meilan

2004-01-01

We report a gene discovery system for poplar trees based on gene and enhancer traps. Gene and enhancer trap vectors carrying the β-glucuronidase (GUS) reporter gene were inserted into the poplar genome via Agrobacterium tumefaciens transformation, where they reveal the expression pattern of genes at or near the insertion sites. Because GUS...
Cardiac Gene Therapy: Optimization of Gene Delivery Techniques In Vivo

PubMed Central

Katz, Michael G.; Swain, JaBaris D.; White, Jennifer D.; Low, David; Stedman, Hansell

2010-01-01

Abstract Vector-mediated cardiac gene therapy holds tremendous promise as a translatable platform technology for treating many cardiovascular diseases. The ideal technique is one that is efficient and practical, allowing for global cardiac gene expression, while minimizing collateral expression in other organs. Here we survey the available in vivo vector-mediated cardiac gene delivery methods—including transcutaneous, intravascular, intramuscular, and cardiopulmonary bypass techniques—with consideration of the relative merits and deficiencies of each. Review of available techniques suggests that an optimal method for vector-mediated gene delivery to the large animal myocardium would ideally employ retrograde and/or anterograde transcoronary gene delivery,extended vector residence time in the coronary circulation, an increased myocardial transcapillary gradient using physical methods, increased endothelial permeability with pharmacological agents, minimal collateral gene expression by isolation of the cardiac circulation from the systemic, and have low immunogenicity. PMID:19947886
Occurrence and expression of gene transfer agent genes in marine bacterioplankton.

PubMed

Biers, Erin J; Wang, Kui; Pennington, Catherine; Belas, Robert; Chen, Feng; Moran, Mary Ann

2008-05-01

Genes with homology to the transduction-like gene transfer agent (GTA) were observed in genome sequences of three cultured members of the marine Roseobacter clade. A broader search for homologs for this host-controlled virus-like gene transfer system identified likely GTA systems in cultured Alphaproteobacteria, and particularly in marine bacterioplankton representatives. Expression of GTA genes and extracellular release of GTA particles ( approximately 50 to 70 nm) was demonstrated experimentally for the Roseobacter clade member Silicibacter pomeroyi DSS-3, and intraspecific gene transfer was documented. GTA homologs are surprisingly infrequent in marine metagenomic sequence data, however, and the role of this lateral gene transfer mechanism in ocean bacterioplankton communities remains unclear.
Direct Introduction of Genes into Rats and Expression of the Genes

NASA Astrophysics Data System (ADS)

Benvenisty, Nissim; Reshef, Lea

1986-12-01

A method of introducing actively expressed genes into intact mammals is described. DNA precipitated with calcium phosphate has been injected intraperitoneally into newborn rats. The injected genes have been taken up and expressed by the animal tissues. To examine the generality of the method we have injected newborn rats with the chloramphenicol acetyltransferase prokaryotic gene fused with various viral and cellular gene promoters and the gene for hepatitis B surface antigen, and we observed appearance of chloramphenicol acetyltransferase activity and hepatitis B surface antigen in liver and spleen. In addition, administration of genes coding for hormones (insulin or growth hormone) resulted in their expression.
A powerful score-based test statistic for detecting gene-gene co-association.

PubMed

Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

2016-01-29

The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Autism and Genes

ERIC Educational Resources Information Center

National Institutes of Health, 2005

2005-01-01

This document defines and discusses autism and how genes play a role in the condition. Answers to the following questions are covered: (1) What are genes? (2) What is autism? (3) What causes autism? (4) Why study genes to learn about autism? (5) How do researchers look for the genes involved in autism? (screen the whole genome; conduct cytogenetic…
Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

PubMed

Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

2014-01-01

Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.
Constructing an integrated gene similarity network for the identification of disease genes.

PubMed

Tian, Zhen; Guo, Maozu; Wang, Chunyu; Xing, LinLin; Wang, Lei; Zhang, Yin

2017-09-20

Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .
Machine Learning for Detecting Gene-Gene Interactions

PubMed Central

McKinney, Brett A.; Reif, David M.; Ritchie, Marylyn D.; Moore, Jason H.

2011-01-01

Complex interactions among genes and environmental factors are known to play a role in common human disease aetiology. There is a growing body of evidence to suggest that complex interactions are ‘the norm’ and, rather than amounting to a small perturbation to classical Mendelian genetics, interactions may be the predominant effect. Traditional statistical methods are not well suited for detecting such interactions, especially when the data are high dimensional (many attributes or independent variables) or when interactions occur between more than two polymorphisms. In this review, we discuss machine-learning models and algorithms for identifying and characterising susceptibility genes in common, complex, multifactorial human diseases. We focus on the following machine-learning methods that have been used to detect gene-gene interactions: neural networks, cellular automata, random forests, and multifactor dimensionality reduction. We conclude with some ideas about how these methods and others can be integrated into a comprehensive and flexible framework for data mining and knowledge discovery in human genetics. PMID:16722772
Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization

PubMed Central

2011-01-01

Background Direct gene synthesis is becoming more popular owing to decreases in gene synthesis pricing. Compared with using natural genes, gene synthesis provides a good opportunity to optimize gene sequence for specific applications. In order to facilitate gene optimization, we have developed a stand-alone software called Visual Gene Developer. Results The software not only provides general functions for gene analysis and optimization along with an interactive user-friendly interface, but also includes unique features such as programming capability, dedicated mRNA secondary structure prediction, artificial neural network modeling, network & multi-threaded computing, and user-accessible programming modules. The software allows a user to analyze and optimize a sequence using main menu functions or specialized module windows. Alternatively, gene optimization can be initiated by designing a gene construct and configuring an optimization strategy. A user can choose several predefined or user-defined algorithms to design a complicated strategy. The software provides expandable functionality as platform software supporting module development using popular script languages such as VBScript and JScript in the software programming environment. Conclusion Visual Gene Developer is useful for both researchers who want to quickly analyze and optimize genes, and those who are interested in developing and testing new algorithms in bioinformatics. The software is available for free download at http://www.visualgenedeveloper.net. PMID:21846353
Gene delivery to the lungs: pulmonary gene therapy for cystic fibrosis.

PubMed

Villate-Beitia, Ilia; Zarate, Jon; Puras, Gustavo; Pedraz, José Luis

2017-07-01

Cystic fibrosis (CF) is a monogenic autosomal recessive disorder where the defective gene, the cystic fibrosis transmembrane conductance regulator (CFTR), is well identified. Moreover, the respiratory tract can be targeted through noninvasive aerosolized formulations for inhalation. Therefore, gene therapy is considered a plausible strategy to address this disease. Conventional gene therapy strategies rely on the addition of a correct copy of the CFTR gene into affected cells in order to restore the channel activity. In recent years, genome correction strategies have emerged, such as zinc-finger nucleases, transcription activator-like effector nucleases and clustered regularly interspaced short palindromic repeats associated to Cas9 nucleases. These gene editing tools aim to repair the mutated gene at its original genomic locus with high specificity. Besides, the success of gene therapy critically depends on the nucleic acids carriers. To date, several clinical studies have been carried out to add corrected copies of the CFTR gene into target cells using viral and non-viral vectors, some of them with encouraging results. Regarding genome editing systems, preliminary in vitro studies have been performed in order to repair the CFTR gene. In this review, after briefly introducing the basis of CF, we discuss the up-to-date gene therapy strategies to address the disease. The review focuses on the main factors to take into consideration when developing gene delivery strategies, such as the design of vectors and plasmid DNA, in vitro/in vivo tests, translation to human use, administration methods, manufacturing conditions and regulatory issues.
Computational gene network study on antibiotic resistance genes of Acinetobacter baumannii.

PubMed

Anitha, P; Anbarasu, Anand; Ramaiah, Sudha

2014-05-01

Multi Drug Resistance (MDR) in Acinetobacter baumannii is one of the major threats for emerging nosocomial infections in hospital environment. Multidrug-resistance in A. baumannii may be due to the implementation of multi-combination resistance mechanisms such as β-lactamase synthesis, Penicillin-Binding Proteins (PBPs) changes, alteration in porin proteins and in efflux pumps against various existing classes of antibiotics. Multiple antibiotic resistance genes are involved in MDR. These resistance genes are transferred through plasmids, which are responsible for the dissemination of antibiotic resistance among Acinetobacter spp. In addition, these resistance genes may also have a tendency to interact with each other or with their gene products. Therefore, it becomes necessary to understand the impact of these interactions in antibiotic resistance mechanism. Hence, our study focuses on protein and gene network analysis on various resistance genes, to elucidate the role of the interacting proteins and to study their functional contribution towards antibiotic resistance. From the search tool for the retrieval of interacting gene/protein (STRING), a total of 168 functional partners for 15 resistance genes were extracted based on the confidence scoring system. The network study was then followed up with functional clustering of associated partners using molecular complex detection (MCODE). Later, we selected eight efficient clusters based on score. Interestingly, the associated protein we identified from the network possessed greater functional similarity with known resistance genes. This network-based approach on resistance genes of A. baumannii could help in identifying new genes/proteins and provide clues on their association in antibiotic resistance. Copyright © 2014 Elsevier Ltd. All rights reserved.
Efficient disruption of Zebrafish genes using a Gal4-containing gene trap

PubMed Central

2013-01-01

Background External development and optical transparency of embryos make zebrafish exceptionally suitable for in vivo insertional mutagenesis using fluorescent proteins to visualize expression patterns of mutated genes. Recently developed Gene Breaking Transposon (GBT) vectors greatly improve the fidelity and mutagenicity of transposon-based gene trap vectors. Results We constructed and tested a bipartite GBT vector with Gal4-VP16 as the primary gene trap reporter. Our vector also contains a UAS:eGFP cassette for direct detection of gene trap events by fluorescence. To confirm gene trap events, we generated a UAS:mRFP tester line. We screened 270 potential founders and established 41 gene trap lines. Three of our gene trap alleles display homozygous lethal phenotypes ranging from embryonic to late larval: nsf tpl6, atp1a3atpl10 and flrtpl19. Our gene trap cassette is flanked by direct loxP sites, which enabled us to successfully revert nsf tpl6, atp1a3atpl10 and flrtpl19 gene trap alleles by injection of Cre mRNA. The UAS:eGFP cassette is flanked by direct FRT sites. It can be readily removed by injection of Flp mRNA for use of our gene trap alleles with other tissue-specific GFP-marked lines. The Gal4-VP16 component of our vector provides two important advantages over other GBT vectors. The first is increased sensitivity, which enabled us to detect previously unnoticed expression of nsf in the pancreas. The second advantage is that all our gene trap lines, including integrations into non-essential genes, can be used as highly specific Gal4 drivers for expression of other transgenes under the control of Gal4 UAS. Conclusions The Gal4-containing bipartite Gene Breaking Transposon vector presented here retains high specificity for integrations into genes, high mutagenicity and revertibility by Cre. These features, together with utility as highly specific Gal4 drivers, make gene trap mutants presented here especially useful to the research community. PMID:24034702
Regulatory systems for hypoxia-inducible gene expression in ischemic heart disease gene therapy.

PubMed

Kim, Hyun Ah; Rhim, Taiyoun; Lee, Minhyung

2011-07-18

Ischemic heart diseases are caused by narrowed coronary arteries that decrease the blood supply to the myocardium. In the ischemic myocardium, hypoxia-responsive genes are up-regulated by hypoxia-inducible factor-1 (HIF-1). Gene therapy for ischemic heart diseases uses genes encoding angiogenic growth factors and anti-apoptotic proteins as therapeutic genes. These genes increase blood supply into the myocardium by angiogenesis and protect cardiomyocytes from cell death. However, non-specific expression of these genes in normal tissues may be harmful, since growth factors and anti-apoptotic proteins may induce tumor growth. Therefore, tight gene regulation is required to limit gene expression to ischemic tissues, to avoid unwanted side effects. For this purpose, various gene expression strategies have been developed for ischemic-specific gene expression. Transcriptional, post-transcriptional, and post-translational regulatory strategies have been developed and evaluated in ischemic heart disease animal models. The regulatory systems can limit therapeutic gene expression to ischemic tissues and increase the efficiency of gene therapy. In this review, recent progresses in ischemic-specific gene expression systems are presented, and their applications to ischemic heart diseases are discussed. Copyright © 2011 Elsevier B.V. All rights reserved.
The Pathway From Genes to Gene Therapy in Glaucoma: A Review of Possibilities for Using Genes as Glaucoma Drugs

PubMed Central

Borrás, Teresa

2018-01-01

Treatment of diseases with gene therapy is advancing rapidly. The use of gene therapy has expanded from the original concept of replacing the mutated gene causing the disease to the use of genes to control nonphysiological levels of expression or to modify pathways known to affect the disease. Genes offer numerous advantages over conventional drugs. They have longer duration of action and are more specific. Genes can be delivered to the target site by naked DNA, cells, nonviral, and viral vectors. The enormous progress of the past decade in molecular biology and delivery systems has provided ways for targeting genes to the intended cell/tissue and safe, long-term vectors. The eye is an ideal organ for gene therapy. It is easily accessible and it is an immune-privileged site. Currently, there are clinical trials for diseases affecting practically every tissue of the eye, including those to restore vision in patients with Leber congenital amaurosis. However, the number of eye trials compared with those for systemic diseases is quite low (1.8%). Nevertheless, judging by the vast amount of ongoing preclinical studies, it is expected that such number will increase considerably in the near future. One area of great need for eye gene therapy is glaucoma, where a long-term gene drug would eliminate daily applications and compliance issues. Here, we review the current state of gene therapy for glaucoma and the possibilities for treating the trabecular meshwork to lower intraocular pressure and the retinal ganglion cells to protect them from neurodegeneration. PMID:28161916
Tissue-specific epigenetics in gene neighborhoods: myogenic transcription factor genes

PubMed Central

Chandra, Sruti; Terragni, Jolyon; Zhang, Guoqiang; Pradhan, Sriharsa; Haushka, Stephen; Johnston, Douglas; Baribault, Carl; Lacey, Michelle; Ehrlich, Melanie

2015-01-01

Myogenic regulatory factor (MRF) genes, MYOD1, MYOG, MYF6 and MYF5, are critical for the skeletal muscle lineage. Here, we used various epigenome profiles from human myoblasts (Mb), myotubes (Mt), muscle and diverse non-muscle samples to elucidate the involvement of multigene neighborhoods in the regulation of MRF genes. We found more far-distal enhancer chromatin associated with MRF genes in Mb and Mt than previously reported from studies in mice. For the MYF5/MYF6 gene-pair, regions of Mb-associated enhancer chromatin were located throughout the adjacent 236-kb PTPRQ gene even though Mb expressed negligible amounts of PTPRQ mRNA. Some enhancer chromatin regions inside PTPRQ in Mb were also seen in PTPRQ mRNA-expressing non-myogenic cells. This suggests dual-purpose PTPRQ enhancers that upregulate expression of PTPRQ in non-myogenic cells and MYF5/MYF6 in myogenic cells. In contrast, the myogenic enhancer chromatin regions distal to MYOD1 were intergenic and up to 19 kb long. Two of them contain small, known MYOD1 enhancers, and one displayed an unusually high level of 5-hydroxymethylcytosine in a quantitative DNA hydroxymethylation assay. Unexpectedly, three regions of MYOD1-distal enhancer chromatin in Mb and Mt overlapped enhancer chromatin in umbilical vein endothelial cells, which might upregulate a distant gene (PIK3C2A). Lastly, genes surrounding MYOG were preferentially transcribed in Mt, like MYOG itself, and exhibited nearby myogenic enhancer chromatin. These neighboring chromatin regions may be enhancers acting in concert to regulate myogenic expression of multiple adjacent genes. Our findings reveal the very different and complex organization of gene neighborhoods containing closely related transcription factor genes. PMID:26041816
Occurrence and Expression of Gene Transfer Agent Genes in Marine Bacterioplankton▿

PubMed Central

Biers, Erin J.; Wang, Kui; Pennington, Catherine; Belas, Robert; Chen, Feng; Moran, Mary Ann

2008-01-01

Genes with homology to the transduction-like gene transfer agent (GTA) were observed in genome sequences of three cultured members of the marine Roseobacter clade. A broader search for homologs for this host-controlled virus-like gene transfer system identified likely GTA systems in cultured Alphaproteobacteria, and particularly in marine bacterioplankton representatives. Expression of GTA genes and extracellular release of GTA particles (∼50 to 70 nm) was demonstrated experimentally for the Roseobacter clade member Silicibacter pomeroyi DSS-3, and intraspecific gene transfer was documented. GTA homologs are surprisingly infrequent in marine metagenomic sequence data, however, and the role of this lateral gene transfer mechanism in ocean bacterioplankton communities remains unclear. PMID:18359833

Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

PubMed Central

Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

2014-01-01

It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342
GeneTopics - interpretation of gene sets via literature-driven topic models

PubMed Central

2013-01-01

Background Annotation of a set of genes is often accomplished through comparison to a library of labelled gene sets such as biological processes or canonical pathways. However, this approach might fail if the employed libraries are not up to date with the latest research, don't capture relevant biological themes or are curated at a different level of granularity than is required to appropriately analyze the input gene set. At the same time, the vast biomedical literature offers an unstructured repository of the latest research findings that can be tapped to provide thematic sub-groupings for any input gene set. Methods Our proposed method relies on a gene-specific text corpus and extracts commonalities between documents in an unsupervised manner using a topic model approach. We automatically determine the number of topics summarizing the corpus and calculate a gene relevancy score for each topic allowing us to eliminate non-specific topics. As a result we obtain a set of literature topics in which each topic is associated with a subset of the input genes providing directly interpretable keywords and corresponding documents for literature research. Results We validate our method based on labelled gene sets from the KEGG metabolic pathway collection and the genetic association database (GAD) and show that the approach is able to detect topics consistent with the labelled annotation. Furthermore, we discuss the results on three different types of experimentally derived gene sets, (1) differentially expressed genes from a cardiac hypertrophy experiment in mice, (2) altered transcript abundance in human pancreatic beta cells, and (3) genes implicated by GWA studies to be associated with metabolite levels in a healthy population. In all three cases, we are able to replicate findings from the original papers in a quick and semi-automated manner. Conclusions Our approach provides a novel way of automatically generating meaningful annotations for gene sets that are directly
Fluorogenic kinetic assay for high-throughput discovery of stereoselective ketoreductases relevant to pharmaceutical synthesis.

PubMed

Thai, Yen-Chi; Szekrenyi, Anna; Qi, Yuyin; Black, Gary W; Charnock, Simon J; Fessner, Wolf-Dieter

2018-04-01

Enantiomerically pure 1-(6-methoxynaphth-2-yl) and 1-(6-(dimethylamino)naphth-2-yl) carbinols are fluorogenic substrates for aldo/keto reductase (KRED) enzymes, which allow the highly sensitive and reliable determination of activity and kinetic constants of known and unknown enzymes, as well as an immediate enantioselectivity typing. Because of its simplicity in microtiter plate format, the assay qualifies for the discovery of novel KREDs of yet unknown specificity among this vast enzyme superfamily. The suitability of this approach for enzyme typing is illustrated by an exemplary screening of a large collection of short-chain dehydrogenase/reductase (SDR) enzymes arrayed from a metagenomic approach. We believe that this assay format should match well the pharmaceutical industry's demand for acetophenone-type substrates and the continuing interest in new enzymes with broad substrate promiscuity for the synthesis of chiral, non-racemic carbinols. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Gene doping in sports.

PubMed

Unal, Mehmet; Ozer Unal, Durisehvar

2004-01-01

Gene or cell doping is defined by the World Anti-Doping Agency (WADA) as "the non-therapeutic use of genes, genetic elements and/or cells that have the capacity to enhance athletic performance". New research in genetics and genomics will be used not only to diagnose and treat disease, but also to attempt to enhance human performance. In recent years, gene therapy has shown progress and positive results that have highlighted the potential misuse of this technology and the debate of 'gene doping'. Gene therapies developed for the treatment of diseases such as anaemia (the gene for erythropoietin), muscular dystrophy (the gene for insulin-like growth factor-1) and peripheral vascular diseases (the gene for vascular endothelial growth factor) are potential doping methods. With progress in gene technology, many other genes with this potential will be discovered. For this reason, it is important to develop timely legal regulations and to research the field of gene doping in order to develop methods of detection. To protect the health of athletes and to ensure equal competitive conditions, the International Olympic Committee, WADA and International Sports Federations have accepted performance-enhancing substances and methods as being doping, and have forbidden them. Nevertheless, the desire to win causes athletes to misuse these drugs and methods. This paper reviews the current status of gene doping and candidate performance enhancement genes, and also the use of gene therapy in sports medicine and ethics of genetic enhancement. Copyright 2004 Adis Data Information BV
Estimation of gene induction enables a relevance-based ranking of gene sets.

PubMed

Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens

2009-07-01

In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

PubMed

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
A Critical Look at Entropy-Based Gene-Gene Interaction Measures.

PubMed

Lee, Woojoo; Sjölander, Arvid; Pawitan, Yudi

2016-07-01

Several entropy-based measures for detecting gene-gene interaction have been proposed recently. It has been argued that the entropy-based measures are preferred because entropy can better capture the nonlinear relationships between genotypes and traits, so they can be useful to detect gene-gene interactions for complex diseases. These suggested measures look reasonable at intuitive level, but so far there has been no detailed characterization of the interactions captured by them. Here we study analytically the properties of some entropy-based measures for detecting gene-gene interactions in detail. The relationship between interactions captured by the entropy-based measures and those of logistic regression models is clarified. In general we find that the entropy-based measures can suffer from a lack of specificity in terms of target parameters, i.e., they can detect uninteresting signals as interactions. Numerical studies are carried out to confirm theoretical findings. © 2016 WILEY PERIODICALS, INC.
Horizontal gene transfer is a significant driver of gene innovation in dinoflagellates.

PubMed

Wisecaver, Jennifer H; Brosnahan, Michael L; Hackett, Jeremiah D

2013-01-01

The dinoflagellates are an evolutionarily and ecologically important group of microbial eukaryotes. Previous work suggests that horizontal gene transfer (HGT) is an important source of gene innovation in these organisms. However, dinoflagellate genomes are notoriously large and complex, making genomic investigation of this phenomenon impractical with currently available sequencing technology. Fortunately, de novo transcriptome sequencing and assembly provides an alternative approach for investigating HGT. We sequenced the transcriptome of the dinoflagellate Alexandrium tamarense Group IV to investigate how HGT has contributed to gene innovation in this group. Our comprehensive A. tamarense Group IV gene set was compared with those of 16 other eukaryotic genomes. Ancestral gene content reconstruction of ortholog groups shows that A. tamarense Group IV has the largest number of gene families gained (314-1,563 depending on inference method) relative to all other organisms in the analysis (0-782). Phylogenomic analysis indicates that genes horizontally acquired from bacteria are a significant proportion of this gene influx, as are genes transferred from other eukaryotes either through HGT or endosymbiosis. The dinoflagellates also display curious cases of gene loss associated with mitochondrial metabolism including the entire Complex I of oxidative phosphorylation. Some of these missing genes have been functionally replaced by bacterial and eukaryotic xenologs. The transcriptome of A. tamarense Group IV lends strong support to a growing body of evidence that dinoflagellate genomes are extraordinarily impacted by HGT.
Genes and gene networks implicated in aggression related behaviour.

PubMed

Malki, Karim; Pain, Oliver; Du Rietz, Ebba; Tosto, Maria Grazia; Paya-Cano, Jose; Sandnabba, Kenneth N; de Boer, Sietse; Schalkwyk, Leonard C; Sluyter, Frans

2014-10-01

Aggressive behaviour is a major cause of mortality and morbidity. Despite of moderate heritability estimates, progress in identifying the genetic factors underlying aggressive behaviour has been limited. There are currently three genetic mouse models of high and low aggression created using selective breeding. This is the first study to offer a global transcriptomic characterization of the prefrontal cortex across all three genetic mouse models of aggression. A systems biology approach has been applied to transcriptomic data across the three pairs of selected inbred mouse strains (Turku Aggressive (TA) and Turku Non-Aggressive (TNA), Short Attack Latency (SAL) and Long Attack Latency (LAL) mice and North Carolina Aggressive (NC900) and North Carolina Non-Aggressive (NC100)), providing novel insight into the neurobiological mechanisms and genetics underlying aggression. First, weighted gene co-expression network analysis (WGCNA) was performed to identify modules of highly correlated genes associated with aggression. Probe sets belonging to gene modules uncovered by WGCNA were carried forward for network analysis using ingenuity pathway analysis (IPA). The RankProd non-parametric algorithm was then used to statistically evaluate expression differences across the genes belonging to modules significantly associated with aggression. IPA uncovered two pathways, involving NF-kB and MAPKs. The secondary RankProd analysis yielded 14 differentially expressed genes, some of which have previously been implicated in pathways associated with aggressive behaviour, such as Adrbk2. The results highlighted plausible candidate genes and gene networks implicated in aggression-related behaviour.
UniGene Tabulator: a full parser for the UniGene format.

PubMed

Lenzi, Luca; Frabetti, Flavia; Facchin, Federica; Casadei, Raffaella; Vitale, Lorenza; Canaider, Silvia; Carinci, Paolo; Zannotti, Maria; Strippoli, Pierluigi

2006-10-15

UniGene Tabulator 1.0 provides a solution for full parsing of UniGene flat file format; it implements a structured graphical representation of each data field present in UniGene following import into a common database managing system usable in a personal computer. This database includes related tables for sequence, protein similarity, sequence-tagged site (STS) and transcript map interval (TXMAP) data, plus a summary table where each record represents a UniGene cluster. UniGene Tabulator enables full local management of UniGene data, allowing parsing, querying, indexing, retrieving, exporting and analysis of UniGene data in a relational database form, usable on Macintosh (OS X 10.3.9 or later) and Windows (2000, with service pack 4, XP, with service pack 2 or later) operating systems-based computers. The current release, including both the FileMaker runtime applications, is freely available at http://apollo11.isto.unibo.it/software/
The Renilla luciferase gene as a reference gene for normalization of gene expression in transiently transfected cells.

PubMed

Jiwaji, Meesbah; Daly, Rónán; Pansare, Kshama; McLean, Pauline; Yang, Jingli; Kolch, Walter; Pitt, Andrew R

2010-12-31

The importance of appropriate normalization controls in quantitative real-time polymerase chain reaction (qPCR) experiments has become more apparent as the number of biological studies using this methodology has increased. In developing a system to study gene expression from transiently transfected plasmids, it became clear that normalization using chromosomally encoded genes is not ideal, at it does not take into account the transfection efficiency and the significantly lower expression levels of the plasmids. We have developed and validated a normalization method for qPCR using a co-transfected plasmid. The best chromosomal gene for normalization in the presence of the transcriptional activators used in this study, cadmium, dexamethasone, forskolin and phorbol-12-myristate 13-acetate was first identified. qPCR data was analyzed using geNorm, Normfinder and BestKeeper. Each software application was found to rank the normalization controls differently with no clear correlation. Including a co-transfected plasmid encoding the Renilla luciferase gene (Rluc) in this analysis showed that its calculated stability was not as good as the optimised chromosomal genes, most likely as a result of the lower expression levels and transfection variability. Finally, we validated these analyses by testing two chromosomal genes (B2M and ActB) and a co-transfected gene (Rluc) under biological conditions. When analyzing co-transfected plasmids, Rluc normalization gave the smallest errors compared to the chromosomal reference genes. Our data demonstrates that transfected Rluc is the most appropriate normalization reference gene for transient transfection qPCR analysis; it significantly reduces the standard deviation within biological experiments as it takes into account the transfection efficiencies and has easily controllable expression levels. This improves reproducibility, data validity and most importantly, enables accurate interpretation of qPCR data.
Finding pathway-modulating genes from a novel Ontology Fingerprint-derived gene network.

PubMed

Qin, Tingting; Matmati, Nabil; Tsoi, Lam C; Mohanty, Bidyut K; Gao, Nan; Tang, Jijun; Lawson, Andrew B; Hannun, Yusuf A; Zheng, W Jim

2014-10-01

To enhance our knowledge regarding biological pathway regulation, we took an integrated approach, using the biomedical literature, ontologies, network analyses and experimental investigation to infer novel genes that could modulate biological pathways. We first constructed a novel gene network via a pairwise comparison of all yeast genes' Ontology Fingerprints--a set of Gene Ontology terms overrepresented in the PubMed abstracts linked to a gene along with those terms' corresponding enrichment P-values. The network was further refined using a Bayesian hierarchical model to identify novel genes that could potentially influence the pathway activities. We applied this method to the sphingolipid pathway in yeast and found that many top-ranked genes indeed displayed altered sphingolipid pathway functions, initially measured by their sensitivity to myriocin, an inhibitor of de novo sphingolipid biosynthesis. Further experiments confirmed the modulation of the sphingolipid pathway by one of these genes, PFA4, encoding a palmitoyl transferase. Comparative analysis showed that few of these novel genes could be discovered by other existing methods. Our novel gene network provides a unique and comprehensive resource to study pathway modulations and systems biology in general. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Primetime for Learning Genes.

PubMed

Keifer, Joyce

2017-02-11

Learning genes in mature neurons are uniquely suited to respond rapidly to specific environmental stimuli. Expression of individual learning genes, therefore, requires regulatory mechanisms that have the flexibility to respond with transcriptional activation or repression to select appropriate physiological and behavioral responses. Among the mechanisms that equip genes to respond adaptively are bivalent domains. These are specific histone modifications localized to gene promoters that are characteristic of both gene activation and repression, and have been studied primarily for developmental genes in embryonic stem cells. In this review, studies of the epigenetic regulation of learning genes in neurons, particularly the brain-derived neurotrophic factor gene ( BDNF ), by methylation/demethylation and chromatin modifications in the context of learning and memory will be highlighted. Because of the unique function of learning genes in the mature brain, it is proposed that bivalent domains are a characteristic feature of the chromatin landscape surrounding their promoters. This allows them to be "poised" for rapid response to activate or repress gene expression depending on environmental stimuli.
Horizontal Gene Transfer is a Significant Driver of Gene Innovation in Dinoflagellates

PubMed Central

Wisecaver, Jennifer H.; Brosnahan, Michael L.; Hackett, Jeremiah D.

2013-01-01

The dinoflagellates are an evolutionarily and ecologically important group of microbial eukaryotes. Previous work suggests that horizontal gene transfer (HGT) is an important source of gene innovation in these organisms. However, dinoflagellate genomes are notoriously large and complex, making genomic investigation of this phenomenon impractical with currently available sequencing technology. Fortunately, de novo transcriptome sequencing and assembly provides an alternative approach for investigating HGT. We sequenced the transcriptome of the dinoflagellate Alexandrium tamarense Group IV to investigate how HGT has contributed to gene innovation in this group. Our comprehensive A. tamarense Group IV gene set was compared with those of 16 other eukaryotic genomes. Ancestral gene content reconstruction of ortholog groups shows that A. tamarense Group IV has the largest number of gene families gained (314–1,563 depending on inference method) relative to all other organisms in the analysis (0–782). Phylogenomic analysis indicates that genes horizontally acquired from bacteria are a significant proportion of this gene influx, as are genes transferred from other eukaryotes either through HGT or endosymbiosis. The dinoflagellates also display curious cases of gene loss associated with mitochondrial metabolism including the entire Complex I of oxidative phosphorylation. Some of these missing genes have been functionally replaced by bacterial and eukaryotic xenologs. The transcriptome of A. tamarense Group IV lends strong support to a growing body of evidence that dinoflagellate genomes are extraordinarily impacted by HGT. PMID:24259313
The Gene Set Builder: collation, curation, and distribution of sets of genes

PubMed Central

Yusuf, Dimas; Lim, Jonathan S; Wasserman, Wyeth W

2005-01-01

Background In bioinformatics and genomics, there are many applications designed to investigate the common properties for a set of genes. Often, these multi-gene analysis tools attempt to reveal sequential, functional, and expressional ties. However, while tremendous effort has been invested in developing tools that can analyze a set of genes, minimal effort has been invested in developing tools that can help researchers compile, store, and annotate gene sets in the first place. As a result, the process of making or accessing a set often involves tedious and time consuming steps such as finding identifiers for each individual gene. These steps are often repeated extensively to shift from one identifier type to another; or to recreate a published set. In this paper, we present a simple online tool which – with the help of the gene catalogs Ensembl and GeneLynx – can help researchers build and annotate sets of genes quickly and easily. Description The Gene Set Builder is a database-driven, web-based tool designed to help researchers compile, store, export, and share sets of genes. This application supports the 17 eukaryotic genomes found in version 32 of the Ensembl database, which includes species from yeast to human. User-created information such as sets and customized annotations are stored to facilitate easy access. Gene sets stored in the system can be "exported" in a variety of output formats – as lists of identifiers, in tables, or as sequences. In addition, gene sets can be "shared" with specific users to facilitate collaborations or fully released to provide access to published results. The application also features a Perl API (Application Programming Interface) for direct connectivity to custom analysis tools. A downloadable Quick Reference guide and an online tutorial are available to help new users learn its functionalities. Conclusion The Gene Set Builder is an Ensembl-facilitated online tool designed to help researchers compile and manage sets of
Construction and analysis of gene-gene dynamics influence networks based on a Boolean model.

PubMed

Mazaya, Maulida; Trinh, Hung-Cuong; Kwon, Yung-Keun

2017-12-21

Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. Finally, we found biological evidences showing that the gene pairs which are not molecularly interacting but dynamically influential can be considered for novel gene-gene
Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

PubMed

Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

2010-10-07

PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

PubMed

Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

2016-02-27

In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a
Finding pathway-modulating genes from a novel Ontology Fingerprint-derived gene network

PubMed Central

Qin, Tingting; Matmati, Nabil; Tsoi, Lam C.; Mohanty, Bidyut K.; Gao, Nan; Tang, Jijun; Lawson, Andrew B.; Hannun, Yusuf A.; Zheng, W. Jim

2014-01-01

To enhance our knowledge regarding biological pathway regulation, we took an integrated approach, using the biomedical literature, ontologies, network analyses and experimental investigation to infer novel genes that could modulate biological pathways. We first constructed a novel gene network via a pairwise comparison of all yeast genes’ Ontology Fingerprints—a set of Gene Ontology terms overrepresented in the PubMed abstracts linked to a gene along with those terms’ corresponding enrichment P-values. The network was further refined using a Bayesian hierarchical model to identify novel genes that could potentially influence the pathway activities. We applied this method to the sphingolipid pathway in yeast and found that many top-ranked genes indeed displayed altered sphingolipid pathway functions, initially measured by their sensitivity to myriocin, an inhibitor of de novo sphingolipid biosynthesis. Further experiments confirmed the modulation of the sphingolipid pathway by one of these genes, PFA4, encoding a palmitoyl transferase. Comparative analysis showed that few of these novel genes could be discovered by other existing methods. Our novel gene network provides a unique and comprehensive resource to study pathway modulations and systems biology in general. PMID:25063300
Gene therapy in periodontics

PubMed Central

Chatterjee, Anirban; Singh, Nidhi; Saluja, Mini

2013-01-01

GENES are made of DNA - the code of life. They are made up of two types of base pair from different number of hydrogen bonds AT, GC which can be turned into instruction. Everyone inherits genes from their parents and passes them on in turn to their children. Every person's genes are different, and the changes in sequence determine the inherited differences between each of us. Some changes, usually in a single gene, may cause serious diseases. Gene therapy is ‘the use of genes as medicine’. It involves the transfer of a therapeutic or working gene copy into specific cells of an individual in order to repair a faulty gene copy. Thus it may be used to replace a faulty gene, or to introduce a new gene whose function is to cure or to favorably modify the clinical course of a condition. It has a promising era in the field of periodontics. Gene therapy has been used as a mode of tissue engineering in periodontics. The tissue engineering approach reconstructs the natural target tissue by combining four elements namely: Scaffold, signaling molecules, cells and blood supply and thus can help in the reconstruction of damaged periodontium including cementum, gingival, periodontal ligament and bone. PMID:23869119

Gene therapy in periodontics.

PubMed

Chatterjee, Anirban; Singh, Nidhi; Saluja, Mini

2013-03-01

GENES are made of DNA - the code of life. They are made up of two types of base pair from different number of hydrogen bonds AT, GC which can be turned into instruction. Everyone inherits genes from their parents and passes them on in turn to their children. Every person's genes are different, and the changes in sequence determine the inherited differences between each of us. Some changes, usually in a single gene, may cause serious diseases. Gene therapy is 'the use of genes as medicine'. It involves the transfer of a therapeutic or working gene copy into specific cells of an individual in order to repair a faulty gene copy. Thus it may be used to replace a faulty gene, or to introduce a new gene whose function is to cure or to favorably modify the clinical course of a condition. It has a promising era in the field of periodontics. Gene therapy has been used as a mode of tissue engineering in periodontics. The tissue engineering approach reconstructs the natural target tissue by combining four elements namely: Scaffold, signaling molecules, cells and blood supply and thus can help in the reconstruction of damaged periodontium including cementum, gingival, periodontal ligament and bone.
Norrie disease gene is distinct from the monoamine oxidase genes.

PubMed

Sims, K B; Ozelius, L; Corey, T; Rinehart, W B; Liberfarb, R; Haines, J; Chen, W J; Norio, R; Sankila, E; de la Chapelle, A

1989-09-01

The genes for MAO-A and MAO-B appear to be very close to the Norrie disease gene, on the basis of loss and/or disruption of the MAO genes and activities in atypical Norrie disease patients deleted for the DXS7 locus; linkage among the MAO genes, the Norrie disease gene, and the DXS7 locus; and mapping of all these loci to the chromosomal region Xp11. The present study provides evidence that the MAO genes are not disrupted in "classic" Norrie disease patients. Genomic DNA from these "nondeletion" Norrie disease patients did not show rearrangements at the MAOA or DXS7 loci. Normal levels of MAO-A activities, as well as normal amounts and size of the MAO-A mRNA, were observed in cultured skin fibroblasts from these patients, and MAO-B activity in their platelets was normal. Catecholamine metabolites evaluated in plasma and urine were in the control range. Thus, although some atypical Norrie disease patients lack both MAO-A and MAO-B activities, MAO does not appear to be an etiologic factor in classic Norrie disease.
Querying Co-regulated Genes on Diverse Gene Expression Datasets Via Biclustering.

PubMed

Deveci, Mehmet; Küçüktunç, Onur; Eren, Kemal; Bozdağ, Doruk; Kaya, Kamer; Çatalyürek, Ümit V

2016-01-01

Rapid development and increasing popularity of gene expression microarrays have resulted in a number of studies on the discovery of co-regulated genes. One important way of discovering such co-regulations is the query-based search since gene co-expressions may indicate a shared role in a biological process. Although there exist promising query-driven search methods adapting clustering, they fail to capture many genes that function in the same biological pathway because microarray datasets are fraught with spurious samples or samples of diverse origin, or the pathways might be regulated under only a subset of samples. On the other hand, a class of clustering algorithms known as biclustering algorithms which simultaneously cluster both the items and their features are useful while analyzing gene expression data, or any data in which items are related in only a subset of their samples. This means that genes need not be related in all samples to be clustered together. Because many genes only interact under specific circumstances, biclustering may recover the relationships that traditional clustering algorithms can easily miss. In this chapter, we briefly summarize the literature using biclustering for querying co-regulated genes. Then we present a novel biclustering approach and evaluate its performance by a thorough experimental analysis.
A gene-trap strategy identifies quiescence-induced genes in synchronized myoblasts.

PubMed

Sambasivan, Ramkumar; Pavlath, Grace K; Dhawan, Jyotsna

2008-03-01

Cellular quiescence is characterized not only by reduced mitotic and metabolic activity but also by altered gene expression. Growing evidence suggests that quiescence is not merely a basal state but is regulated by active mechanisms. To understand the molecular programme that governs reversible cell cycle exit, we focused on quiescence-related gene expression in a culture model of myogenic cell arrest and activation. Here we report the identification of quiescence-induced genes using a gene-trap strategy. Using a retroviral vector, we generated a library of gene traps in C2C12 myoblasts that were screened for arrest-induced insertions by live cell sorting (FACS-gal). Several independent gene- trap lines revealed arrest-dependent induction of betagal activity, confirming the efficacy of the FACS screen. The locus of integration was identified in 15 lines. In three lines,insertion occurred in genes previously implicated in the control of quiescence, i.e. EMSY - a BRCA2--interacting protein, p8/com1 - a p300HAT -- binding protein and MLL5 - a SET domain protein. Our results demonstrate that expression of chromatin modulatory genes is induced in G0, providing support to the notion that this reversibly arrested state is actively regulated.
Biased Gene Fractionation and Dominant Gene Expression among the Subgenomes of Brassica rapa

PubMed Central

Cheng, Feng; Wu, Jian; Fang, Lu; Sun, Silong; Liu, Bo; Lin, Ke; Bonnema, Guusje; Wang, Xiaowu

2012-01-01

Polyploidization, both ancient and recent, is frequent among plants. A “two-step theory" was proposed to explain the meso-triplication of the Brassica “A" genome: Brassica rapa. By accurately partitioning of this genome, we observed that genes in the less fractioned subgenome (LF) were dominantly expressed over the genes in more fractioned subgenomes (MFs: MF1 and MF2), while the genes in MF1 were slightly dominantly expressed over the genes in MF2. The results indicated that the dominantly expressed genes tended to be resistant against gene fractionation. By re-sequencing two B. rapa accessions: a vegetable turnip (VT117) and a Rapid Cycling line (L144), we found that genes in LF had less non-synonymous or frameshift mutations than genes in MFs; however mutation rates were not significantly different between MF1 and MF2. The differences in gene expression patterns and on-going gene death among the three subgenomes suggest that “two-step" genome triplication and differential subgenome methylation played important roles in the genome evolution of B. rapa. PMID:22567157
Biased gene fractionation and dominant gene expression among the subgenomes of Brassica rapa.

PubMed

Cheng, Feng; Wu, Jian; Fang, Lu; Sun, Silong; Liu, Bo; Lin, Ke; Bonnema, Guusje; Wang, Xiaowu

2012-01-01

Polyploidization, both ancient and recent, is frequent among plants. A "two-step theory" was proposed to explain the meso-triplication of the Brassica "A" genome: Brassica rapa. By accurately partitioning of this genome, we observed that genes in the less fractioned subgenome (LF) were dominantly expressed over the genes in more fractioned subgenomes (MFs: MF1 and MF2), while the genes in MF1 were slightly dominantly expressed over the genes in MF2. The results indicated that the dominantly expressed genes tended to be resistant against gene fractionation. By re-sequencing two B. rapa accessions: a vegetable turnip (VT117) and a Rapid Cycling line (L144), we found that genes in LF had less non-synonymous or frameshift mutations than genes in MFs; however mutation rates were not significantly different between MF1 and MF2. The differences in gene expression patterns and on-going gene death among the three subgenomes suggest that "two-step" genome triplication and differential subgenome methylation played important roles in the genome evolution of B. rapa.
GeneSigDB: a manually curated database and resource for analysis of gene expression signatures

PubMed Central

Culhane, Aedín C.; Schröder, Markus S.; Sultana, Razvan; Picard, Shaita C.; Martinelli, Enzo N.; Kelly, Caroline; Haibe-Kains, Benjamin; Kapushesky, Misha; St Pierre, Anne-Alyssa; Flahive, William; Picard, Kermshlise C.; Gusenleitner, Daniel; Papenhausen, Gerald; O'Connor, Niall; Correll, Mick; Quackenbush, John

2012-01-01

GeneSigDB (http://www.genesigdb.org or http://compbio.dfci.harvard.edu/genesigdb/) is a database of gene signatures that have been extracted and manually curated from the published literature. It provides a standardized resource of published prognostic, diagnostic and other gene signatures of cancer and related disease to the community so they can compare the predictive power of gene signatures or use these in gene set enrichment analysis. Since GeneSigDB release 1.0, we have expanded from 575 to 3515 gene signatures, which were collected and transcribed from 1604 published articles largely focused on gene expression in cancer, stem cells, immune cells, development and lung disease. We have made substantial upgrades to the GeneSigDB website to improve accessibility and usability, including adding a tag cloud browse function, facetted navigation and a ‘basket’ feature to store genes or gene signatures of interest. Users can analyze GeneSigDB gene signatures, or upload their own gene list, to identify gene signatures with significant gene overlap and results can be viewed on a dynamic editable heatmap that can be downloaded as a publication quality image. All data in GeneSigDB can be downloaded in numerous formats including .gmt file format for gene set enrichment analysis or as a R/Bioconductor data file. GeneSigDB is available from http://www.genesigdb.org. PMID:22110038
Systematic study of association of four GABAergic genes: glutamic acid decarboxylase 1 gene, glutamic acid decarboxylase 2 gene, GABA(B) receptor 1 gene and GABA(A) receptor subunit beta2 gene, with schizophrenia using a universal DNA microarray.

PubMed

Zhao, Xu; Qin, Shengying; Shi, Yongyong; Zhang, Aiping; Zhang, Jing; Bian, Li; Wan, Chunling; Feng, Guoyin; Gu, Niufan; Zhang, Guangqi; He, Guang; He, Lin

2007-07-01

Several studies have suggested the dysfunction of the GABAergic system as a risk factor in the pathogenesis of schizophrenia. In the present study, case-control association analysis was conducted in four GABAergic genes: two glutamic acid decarboxylase genes (GAD1 and GAD2), a GABA(A) receptor subunit beta2 gene (GABRB2) and a GABA(B) receptor 1 gene (GABBR1). Using a universal DNA microarray procedure we genotyped a total of 20 SNPs on the above four genes in a study involving 292 patients and 286 controls of Chinese descent. Statistically significant differences were observed in the allelic frequencies of the rs187269C/T polymorphism in the GABRB2 gene (P=0.0450, chi(2)=12.40, OR=1.65) and the -292A/C polymorphism in the GAD1 gene (P=0.0450, chi(2)=14.64 OR=1.77). In addition, using an electrophoretic mobility shift assay (EMSA), we discovered differences in the U251 nuclear protein binding to oligonucleotides representing the -292 SNP on the GAD1 gene, which suggests that the -292C allele has reduced transcription factor binding efficiency compared with the 292A allele. Using the multifactor-dimensionality reduction method (MDR), we found that the interactions among the rs187269C/T polymorphism in the GABRB2 gene, the -243A/G polymorphism in the GAD2 gene and the 27379C/T and 661C/T polymorphisms in the GAD1 gene revealed a significant association with schizophrenia (P<0.001). These findings suggest that the GABRB2 and GAD1 genes alone and the combined effects of the polymorphisms in the four GABAergic system genes may confer susceptibility to the development of schizophrenia in the Chinese population.
GSEH: A Novel Approach to Select Prostate Cancer-Associated Genes Using Gene Expression Heterogeneity.

PubMed

Kim, Hyunjin; Choi, Sang-Min; Park, Sanghyun

2018-01-01

When a gene shows varying levels of expression among normal people but similar levels in disease patients or shows similar levels of expression among normal people but different levels in disease patients, we can assume that the gene is associated with the disease. By utilizing this gene expression heterogeneity, we can obtain additional information that abets discovery of disease-associated genes. In this study, we used collaborative filtering to calculate the degree of gene expression heterogeneity between classes and then scored the genes on the basis of the degree of gene expression heterogeneity to find "differentially predicted" genes. Through the proposed method, we discovered more prostate cancer-associated genes than 10 comparable methods. The genes prioritized by the proposed method are potentially significant to biological processes of a disease and can provide insight into them.
Validation of miRNA genes suitable as reference genes in qPCR analyses of miRNA gene expression in Atlantic salmon (Salmo salar).

PubMed

Johansen, Ilona; Andreassen, Rune

2014-12-23

MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the post-transcriptional level. They play important roles by regulating genes that control multiple biological processes, and recent years there has been an increased interest in studying miRNA genes and miRNA gene expression. The most common method applied to study gene expression of single genes is quantitative PCR (qPCR). However, before expression of mature miRNAs can be studied robust qPCR methods (miRNA-qPCR) must be developed. This includes identification and validation of suitable reference genes. We are particularly interested in Atlantic salmon (Salmo salar). This is an economically important aquaculture species, but no reference genes dedicated for use in miRNA-qPCR methods has been validated for this species. Our aim was, therefore, to identify suitable reference genes for miRNA-qPCR methods in Salmo salar. We used a systematic approach where we utilized similar studies in other species, some biological criteria, results from deep sequencing of small RNAs and, finally, experimental validation of candidate reference genes by qPCR to identify the most suitable reference genes. Ssa-miR-25-3p was identified as most suitable single reference gene. The best combinations of two reference genes were ssa-miR-25-3p and ssa-miR-455-5p. These two genes were constitutively and stably expressed across many different tissues. Furthermore, infectious salmon anaemia did not seem to affect their expression levels. These genes were amplified with high specificity, good efficiency and the qPCR assays showed a good linearity when applying a simple cybergreen miRNA-PCR method using miRNA gene specific forward primers. We have identified suitable reference genes for miRNA-qPCR in Atlantic salmon. These results will greatly facilitate further studies on miRNA genes in this species. The reference genes identified are conserved genes that are identical in their mature
Human gene therapy: novel approaches to improve the current gene delivery systems.

PubMed

Cucchiarini, Magali

2016-06-01

Even though gene therapy made its way through the clinics to treat a number of human pathologies since the early years of experimental research and despite the recent approval of the first gene-based product (Glybera) in Europe, the safe and effective use of gene transfer vectors remains a challenge in human gene therapy due to the existence of barriers in the host organism. While work is under active investigation to improve the gene transfer systems themselves, the use of controlled release approaches may offer alternative, convenient tools of vector delivery to achieve a performant gene transfer in vivo while overcoming the various physiological barriers that preclude its wide use in patients. This article provides an overview of the most significant contributions showing how the principles of controlled release strategies may be adapted for human gene therapy.
Gene Therapy

MedlinePlus

... material into the cells' genes. Researchers remove the original disease-causing genes from the viruses, replacing them ... into the body, the viruses may recover their original ability to cause disease. Possibility of causing a ...
From gene engineering to gene modulation and manipulation: can we prevent or detect gene doping in sports?

PubMed

Fischetto, Giuseppe; Bermon, Stéphane

2013-10-01

During the last 2 decades, progress in deciphering the human gene map as well as the discovery of specific defective genes encoding particular proteins in some serious human diseases have resulted in attempts to treat sick patients with gene therapy. There has been considerable focus on human recombinant proteins which were gene-engineered and produced in vitro (insulin, growth hormone, insulin-like growth factor-1, erythropoietin). Unfortunately, these substances and methods also became improper tools for unscrupulous athletes. Biomedical research has focused on the possible direct insertion of gene material into the body, in order to replace some defective genes in vivo and/or to promote long-lasting endogenous synthesis of deficient proteins. Theoretically, diabetes, anaemia, muscular dystrophies, immune deficiency, cardiovascular diseases and numerous other illnesses could benefit from such innovative biomedical research, though much work remains to be done. Considering recent findings linking specific genotypes and physical performance, it is tempting to submit the young athletic population to genetic screening or, alternatively, to artificial gene expression modulation. Much research is already being conducted in order to achieve a safe transfer of genetic material to humans. This is of critical importance since uncontrolled production of the specifically coded protein, with serious secondary adverse effects (polycythaemia, acute cardiovascular problems, cancer, etc.), could occur. Other unpredictable reactions (immunogenicity of vectors or DNA-vector complex, autoimmune anaemia, production of wild genetic material) also remain possible at the individual level. Some new substances (myostatin blockers or anti-myostatin antibodies), although not gene material, might represent a useful and well-tolerated treatment to prevent progression of muscular dystrophies. Similarly, other molecules, in the roles of gene or metabolic activators [5-aminoimidazole-4
Identifying osteosarcoma metastasis associated genes by weighted gene co-expression network analysis (WGCNA).

PubMed

Tian, Honglai; Guan, Donghui; Li, Jianmin

2018-06-01

Osteosarcoma (OS), the most common malignant bone tumor, accounts for the heavy healthy threat in the period of children and adolescents. OS occurrence usually correlates with early metastasis and high death rate. This study aimed to better understand the mechanism of OS metastasis.Based on Gene Expression Omnibus (GEO) database, we downloaded 4 expression profile data sets associated with OS metastasis, and selected differential expressed genes. Weighted gene co-expression network analysis (WGCNA) approach allowed us to investigate the most OS metastasis-correlated module. Gene Ontology functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were used to give annotation of selected OS metastasis-associated genes.We select 897 differential expressed genes from OS metastasis and OS non-metastasis groups. Based on these selected genes, WGCNA further explored 142 genes included in the most OS metastasis-correlated module. Gene Ontology functional and KEGG pathway enrichment analyses showed that significantly OS metastasis-associated genes were involved in pathway correlated with insulin-like growth factor binding.Our research figured out several potential molecules participating in metastasis process and factors acting as biomarker. With this study, we could better explore the mechanism of OS metastasis and further discover more therapy targets.
Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

PubMed

Osato, Naoki

2018-01-19

Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.

PubMed

Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L

2015-01-01

Here we describe the systematic identification of single genes and gene pairs, whose knockout causes lethality in Escherichia coli K-12. During construction of precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Notch signaling genes

PubMed Central

Terragni, Jolyon; Zhang, Guoqiang; Sun, Zhiyi; Pradhan, Sriharsa; Song, Lingyun; Crawford, Gregory E; Lacey, Michelle; Ehrlich, Melanie

2014-01-01

Notch intercellular signaling is critical for diverse developmental pathways and for homeostasis in various types of stem cells and progenitor cells. Because Notch gene products need to be precisely regulated spatially and temporally, epigenetics is likely to help control expression of Notch signaling genes. Reduced representation bisulfite sequencing (RRBS) indicated significant hypomethylation in myoblasts, myotubes, and skeletal muscle vs. many nonmuscle samples at intragenic or intergenic regions of the following Notch receptor or ligand genes: NOTCH1, NOTCH2, JAG2, and DLL1. An enzymatic assay of sites in or near these genes revealed unusually high enrichment of 5-hydroxymethylcytosine (up to 81%) in skeletal muscle, heart, and cerebellum. Epigenetics studies and gene expression profiles suggest that hypomethylation and/or hydroxymethylation help control expression of these genes in heart, brain, myoblasts, myotubes, and within skeletal muscle myofibers. Such regulation could promote cell renewal, cell maintenance, homeostasis, and a poised state for repair of tissue damage. PMID:24670287
Linking Genes to Cardiovascular Diseases: Gene Action and Gene–Environment Interactions

PubMed Central

2016-01-01

A unique myocardial characteristic is its ability to grow/remodel in order to adapt; this is determined partly by genes and partly by the environment and the milieu intérieur. In the “post-genomic” era, a need is emerging to elucidate the physiologic functions of myocardial genes, as well as potential adaptive and maladaptive modulations induced by environmental/epigenetic factors. Genome sequencing and analysis advances have become exponential lately, with escalation of our knowledge concerning sometimes controversial genetic underpinnings of cardiovascular diseases. Current technologies can identify candidate genes variously involved in diverse normal/abnormal morphomechanical phenotypes, and offer insights into multiple genetic factors implicated in complex cardiovascular syndromes. The expression profiles of thousands of genes are regularly ascertained under diverse conditions. Global analyses of gene expression levels are useful for cataloging genes and correlated phenotypes, and for elucidating the role of genes in maladies. Comparative expression of gene networks coupled to complex disorders can contribute insights as to how “modifier genes” influence the expressed phenotypes. Increasingly, a more comprehensive and detailed systematic understanding of genetic abnormalities underlying, for example, various genetic cardiomyopathies is emerging. Implementing genomic findings in cardiology practice may well lead directly to better diagnosing and therapeutics. There is currently evolving a strong appreciation for the value of studying gene anomalies, and doing so in a non-disjointed, cohesive manner. However, it is challenging for many—practitioners and investigators—to comprehend, interpret, and utilize the clinically increasingly accessible and affordable cardiovascular genomics studies. This survey addresses the need for fundamental understanding in this vital area. PMID:26545598
A new type of gene-disruption cassette with a rescue gene for Pichia pastoris.

PubMed

Shibui, Tatsuro; Hara, Hiroyoshi

2017-09-01

Pichia pastoris has been used for the production of many recombinant proteins, and many useful mutant strains have been created. However, the efficiency of mutant isolation by gene-targeting is usually low and the procedure is difficult for those inexperienced in yeast genetics. In order to overcome these issues, we developed a new gene-disruption system with a rescue gene using an inducible Cre/mutant-loxP system. With only short homology regions, the gene-disruption cassette of the system replaces its target-gene locus containing a mutation with a compensatory rescue gene. As the cassette contains the AOX1 promoter-driven Cre gene, when targeted strains are grown on media containing methanol, the DNA fragment, i.e., the marker, rescue and Cre genes, between the mutant-loxP sequences in the cassette is excised, leaving only the remaining mutant-loxP sequence in the genome, and consequently a target gene-disrupted mutant can be isolated. The system was initially validated on ADE2 gene disruption, where the disruption can easily be detected by color-change of the colonies. Then, the system was applied for knocking-out URA3 and OCH1 genes, reported to be difficult to accomplish by conventional gene-targeting methods. All three gene-disruption cassettes with their rescue genes replaced their target genes, and the Cre/mutant-loxP system worked well to successfully isolate their knock-out mutants. This study identified a new gene-disruption system that could be used to effectively and strategically knock out genes of interest, especially whose deletion is detrimental to growth, without using special strains, e.g., deficient in nonhomologous end-joining, in P. pastoris. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:1201-1208, 2017. © 2017 American Institute of Chemical Engineers.
Mutations in nuclear genes alter post-transcriptional regulation of mitochondrial genes.

USDA-ARS?s Scientific Manuscript database

Nuclear gene products are required for the expression of mitochondrial genes and elaboration of functional mitochondrial protein complexes. To better understand the roles of these nuclear genes, we exploited the mitochondrial encoded S-type of cytoplasmic male sterility (CMS-S) and developed a nove...

A recently transferred cluster of bacterial genes in Trichomonas vaginalis - lateral gene transfer and the fate of acquired genes

PubMed Central

2014-01-01

Background Lateral Gene Transfer (LGT) has recently gained recognition as an important contributor to some eukaryote proteomes, but the mechanisms of acquisition and fixation in eukaryotic genomes are still uncertain. A previously defined norm for LGTs in microbial eukaryotes states that the majority are genes involved in metabolism, the LGTs are typically localized one by one, surrounded by vertically inherited genes on the chromosome, and phylogenetics shows that a broad collection of bacterial lineages have contributed to the transferome. Results A unique 34 kbp long fragment with 27 clustered genes (TvLF) of prokaryote origin was identified in the sequenced genome of the protozoan parasite Trichomonas vaginalis. Using a PCR based approach we confirmed the presence of the orthologous fragment in four additional T. vaginalis strains. Detailed sequence analyses unambiguously suggest that TvLF is the result of one single, recent LGT event. The proposed donor is a close relative to the firmicute bacterium Peptoniphilus harei. High nucleotide sequence similarity between T. vaginalis strains, as well as to P. harei, and the absence of homologs in other Trichomonas species, suggests that the transfer event took place after the radiation of the genus Trichomonas. Some genes have undergone pseudogenization and degradation, indicating that they may not be retained in the future. Functional annotations reveal that genes involved in informational processes are particularly prone to degradation. Conclusions We conclude that, although the majority of eukaryote LGTs are single gene occurrences, they may be acquired in clusters of several genes that are subsequently cleansed of evolutionarily less advantageous genes. PMID:24898731
Bacterial reference genes for gene expression studies by RT-qPCR: survey and analysis.

PubMed

Rocha, Danilo J P; Santos, Carolina S; Pacheco, Luis G C

2015-09-01

The appropriate choice of reference genes is essential for accurate normalization of gene expression data obtained by the method of reverse transcription quantitative real-time PCR (RT-qPCR). In 2009, a guideline called the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) highlighted the importance of the selection and validation of more than one suitable reference gene for obtaining reliable RT-qPCR results. Herein, we searched the recent literature in order to identify the bacterial reference genes that have been most commonly validated in gene expression studies by RT-qPCR (in the first 5 years following publication of the MIQE guidelines). Through a combination of different search parameters with the text mining tool MedlineRanker, we identified 145 unique bacterial genes that were recently tested as candidate reference genes. Of these, 45 genes were experimentally validated and, in most of the cases, their expression stabilities were verified using the software tools geNorm and NormFinder. It is noteworthy that only 10 of these reference genes had been validated in two or more of the studies evaluated. An enrichment analysis using Gene Ontology classifications demonstrated that genes belonging to the functional categories of DNA Replication (GO: 0006260) and Transcription (GO: 0006351) rendered a proportionally higher number of validated reference genes. Three genes in the former functional class were also among the top five most stable genes identified through an analysis of gene expression data obtained from the Pathosystems Resource Integration Center. These results may provide a guideline for the initial selection of candidate reference genes for RT-qPCR studies in several different bacterial species.
Norrie disease gene is distinct from the monoamine oxidase genes

PubMed Central

Sims, Katherine B.; Ozelius, Laurie; Corey, Timothy; Rinehart, William B.; Liberfarb, Ruth; Haines, Jonathan; Chen, Wei Jane; Norio, Reijo; Sankila, Eeva; de la Chapelle, Albert; Murphy, Dennis L.; Gusella, James; Breakefield, Xandra O.

1989-01-01

The genes for MAO-A and MAO-B appear to be very close to the Norrie disease gene, on the basis of loss and /or disruption of the MAO genes and activities in atypical Norrie disease patients deleted for the DXS7 locus; linkage among the MAO genes, the Norrie disease gene, and the DXS7 locus; and mapping of all these loci to the chromosomal region Xp11. The present study provides evidence that the MAO genes are not disrupted in “classic” Norrie disease patients. Genomic DNA from these “nondeletion” Norrie disease patients did not show rearrangements at the MAOA or DXS7 loci. Normal levels of MAO-A activities, as well as normal amounts and size of the MAO-A mRNA, were observed in cultured skin fibroblasts from these patients, and MAO-B activity in their platelets was normal. Catecholamine metabolites evaluated in plasma and urine were in the control range. Thus, although some atypical Norrie disease patients lack both MAO-A and MAO-B activities, MAO does not appear to be an etiologic factor in classic Norrie disease. ImagesFigure 2Figure 3 PMID:2773935
The association of multiple interacting genes with specific phenotypes in rice using gene coexpression networks.

PubMed

Ficklin, Stephen P; Luo, Feng; Feltus, F Alex

2010-09-01

Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes.
Gene doping.

PubMed

Haisma, H J; de Hon, O

2006-04-01

Together with the rapidly increasing knowledge on genetic therapies as a promising new branch of regular medicine, the issue has arisen whether these techniques might be abused in the field of sports. Previous experiences have shown that drugs that are still in the experimental phases of research may find their way into the athletic world. Both the World Anti-Doping Agency (WADA) and the International Olympic Committee (IOC) have expressed concerns about this possibility. As a result, the method of gene doping has been included in the list of prohibited classes of substances and prohibited methods. This review addresses the possible ways in which knowledge gained in the field of genetic therapies may be misused in elite sports. Many genes are readily available which may potentially have an effect on athletic performance. The sporting world will eventually be faced with the phenomena of gene doping to improve athletic performance. A combination of developing detection methods based on gene arrays or proteomics and a clear education program on the associated risks seems to be the most promising preventive method to counteract the possible application of gene doping.
funRiceGenes dataset for comprehensive understanding and application of rice functional genes.

PubMed

Yao, Wen; Li, Guangwei; Yu, Yiming; Ouyang, Yidan

2018-01-01

As a main staple food, rice is also a model plant for functional genomic studies of monocots. Decoding of every DNA element of the rice genome is essential for genetic improvement to address increasing food demands. The past 15 years have witnessed extraordinary advances in rice functional genomics. Systematic characterization and proper deposition of every rice gene are vital for both functional studies and crop genetic improvement. We built a comprehensive and accurate dataset of ∼2800 functionally characterized rice genes and ∼5000 members of different gene families by integrating data from available databases and reviewing every publication on rice functional genomic studies. The dataset accounts for 19.2% of the 39 045 annotated protein-coding rice genes, which provides the most exhaustive archive for investigating the functions of rice genes. We also constructed 214 gene interaction networks based on 1841 connections between 1310 genes. The largest network with 762 genes indicated that pleiotropic genes linked different biological pathways. Increasing degree of conservation of the flowering pathway was observed among more closely related plants, implying substantial value of rice genes for future dissection of flowering regulation in other crops. All data are deposited in the funRiceGenes database (https://funricegenes.github.io/). Functionality for advanced search and continuous updating of the database are provided by a Shiny application (http://funricegenes.ncpgr.cn/). The funRiceGenes dataset would enable further exploring of the crosslink between gene functions and natural variations in rice, which can also facilitate breeding design to improve target agronomic traits of rice. © The Authors 2017. Published by Oxford University Press.
Exploring the key genes and pathways in enchondromas using a gene expression microarray.

PubMed

Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing

2017-07-04

Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondromas samples compared with the control samples. Subsequently the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway.Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.
Turning publicly available gene expression data into discoveries using gene set context analysis.

PubMed

Ji, Zhicheng; Vokes, Steven A; Dang, Chi V; Ji, Hongkai

2016-01-08

Gene Set Context Analysis (GSCA) is an open source software package to help researchers use massive amounts of publicly available gene expression data (PED) to make discoveries. Users can interactively visualize and explore gene and gene set activities in 25,000+ consistently normalized human and mouse gene expression samples representing diverse biological contexts (e.g. different cells, tissues and disease types, etc.). By providing one or multiple genes or gene sets as input and specifying a gene set activity pattern of interest, users can query the expression compendium to systematically identify biological contexts associated with the specified gene set activity pattern. In this way, researchers with new gene sets from their own experiments may discover previously unknown contexts of gene set functions and hence increase the value of their experiments. GSCA has a graphical user interface (GUI). The GUI makes the analysis convenient and customizable. Analysis results can be conveniently exported as publication quality figures and tables. GSCA is available at https://github.com/zji90/GSCA. This software significantly lowers the bar for biomedical investigators to use PED in their daily research for generating and screening hypotheses, which was previously difficult because of the complexity, heterogeneity and size of the data. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
An atlas of gene expression and gene co-regulation in the human retina.

PubMed

Pinelli, Michele; Carissimo, Annamaria; Cutillo, Luisa; Lai, Ching-Hung; Mutarelli, Margherita; Moretti, Maria Nicoletta; Singh, Marwah Veer; Karali, Marianthi; Carrella, Diego; Pizzo, Mariateresa; Russo, Francesco; Ferrari, Stefano; Ponzin, Diego; Angelini, Claudia; Banfi, Sandro; di Bernardo, Diego

2016-07-08

The human retina is a specialized tissue involved in light stimulus transduction. Despite its unique biology, an accurate reference transcriptome is still missing. Here, we performed gene expression analysis (RNA-seq) of 50 retinal samples from non-visually impaired post-mortem donors. We identified novel transcripts with high confidence (Observed Transcriptome (ObsT)) and quantified the expression level of known transcripts (Reference Transcriptome (RefT)). The ObsT included 77 623 transcripts (23 960 genes) covering 137 Mb (35 Mb new transcribed genome). Most of the transcripts (92%) were multi-exonic: 81% with known isoforms, 16% with new isoforms and 3% belonging to new genes. The RefT included 13 792 genes across 94 521 known transcripts. Mitochondrial genes were among the most highly expressed, accounting for about 10% of the reads. Of all the protein-coding genes in Gencode, 65% are expressed in the retina. We exploited inter-individual variability in gene expression to infer a gene co-expression network and to identify genes specifically expressed in photoreceptor cells. We experimentally validated the photoreceptors localization of three genes in human retina that had not been previously reported. RNA-seq data and the gene co-expression network are available online (http://retina.tigem.it). © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A Luciferase Reporter Gene System for High-Throughput Screening of γ-Globin Gene Activators.

PubMed

Xie, Wensheng; Silvers, Robert; Ouellette, Michael; Wu, Zining; Lu, Quinn; Li, Hu; Gallagher, Kathleen; Johnson, Kathy; Montoute, Monica

2016-01-01

Luciferase reporter gene assays have long been used for drug discovery due to their high sensitivity and robust signal. A dual reporter gene system contains a gene of interest and a control gene to monitor non-specific effects on gene expression. In our dual luciferase reporter gene system, a synthetic promoter of γ-globin gene was constructed immediately upstream of the firefly luciferase gene, followed downstream by a synthetic β-globin gene promoter in front of the Renilla luciferase gene. A stable cell line with the dual reporter gene was cloned and used for all assay development and HTS work. Due to the low activity of the control Renilla luciferase, only the firefly luciferase activity was further optimized for HTS. Several critical factors, such as cell density, serum concentration, and miniaturization, were optimized using tool compounds to achieve maximum robustness and sensitivity. Using the optimized reporter assay, the HTS campaign was successfully completed and approximately 1000 hits were identified. In this chapter, we also describe strategies to triage hits that non-specifically interfere with firefly luciferase.
New genes from old: asymmetric divergence of gene duplicates and the evolution of development.

PubMed

Holland, Peter W H; Marlétaz, Ferdinand; Maeso, Ignacio; Dunwell, Thomas L; Paps, Jordi

2017-02-05

Gene duplications and gene losses have been frequent events in the evolution of animal genomes, with the balance between these two dynamic processes contributing to major differences in gene number between species. After gene duplication, it is common for both daughter genes to accumulate sequence change at approximately equal rates. In some cases, however, the accumulation of sequence change is highly uneven with one copy radically diverging from its paralogue. Such 'asymmetric evolution' seems commoner after tandem gene duplication than after whole-genome duplication, and can generate substantially novel genes. We describe examples of asymmetric evolution in duplicated homeobox genes of moths, molluscs and mammals, in each case generating new homeobox genes that were recruited to novel developmental roles. The prevalence of asymmetric divergence of gene duplicates has been underappreciated, in part, because the origin of highly divergent genes can be difficult to resolve using standard phylogenetic methods.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).
Novel candidate genes important for asthma and hypertension comorbidity revealed from associative gene networks.

PubMed

Saik, Olga V; Demenkov, Pavel S; Ivanisenko, Timofey V; Bragina, Elena Yu; Freidin, Maxim B; Goncharova, Irina A; Dosenko, Victor E; Zolotareva, Olga I; Hofestaedt, Ralf; Lavrik, Inna N; Rogaev, Evgeny I; Ivanisenko, Vladimir A

2018-02-13

Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or ~ 22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension - 713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in
Derivation of an artificial gene to improve classification accuracy upon gene selection.

PubMed

Seo, Minseok; Oh, Sejong

2012-02-01

Classification analysis has been developed continuously since 1936. This research field has advanced as a result of development of classifiers such as KNN, ANN, and SVM, as well as through data preprocessing areas. Feature (gene) selection is required for very high dimensional data such as microarray before classification work. The goal of feature selection is to choose a subset of informative features that reduces processing time and provides higher classification accuracy. In this study, we devised a method of artificial gene making (AGM) for microarray data to improve classification accuracy. Our artificial gene was derived from a whole microarray dataset, and combined with a result of gene selection for classification analysis. We experimentally confirmed a clear improvement of classification accuracy after inserting artificial gene. Our artificial gene worked well for popular feature (gene) selection algorithms and classifiers. The proposed approach can be applied to any type of high dimensional dataset. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
PCR-based detection of gene transfer vectors: application to gene doping surveillance.

PubMed

Perez, Irene C; Le Guiner, Caroline; Ni, Weiyi; Lyles, Jennifer; Moullier, Philippe; Snyder, Richard O

2013-12-01

Athletes who illicitly use drugs to enhance their athletic performance are at risk of being banned from sports competitions. Consequently, some athletes may seek new doping methods that they expect to be capable of circumventing detection. With advances in gene transfer vector design and therapeutic gene transfer, and demonstrations of safety and therapeutic benefit in humans, there is an increased probability of the pursuit of gene doping by athletes. In anticipation of the potential for gene doping, assays have been established to directly detect complementary DNA of genes that are top candidates for use in doping, as well as vector control elements. The development of molecular assays that are capable of exposing gene doping in sports can serve as a deterrent and may also identify athletes who have illicitly used gene transfer for performance enhancement. PCR-based methods to detect foreign DNA with high reliability, sensitivity, and specificity include TaqMan real-time PCR, nested PCR, and internal threshold control PCR.
Flanking genes of an essential gene give information about the evolution of metazoa.

PubMed

Zimek, Alexander; Weber, Klaus

2011-04-01

We collected as much information as possible on new lamin genes and their flanking genes. The number of lamin genes varies from 1 to 4 depending more or less on the phylogenetic position of the species. Strong genome drift is recognised by fewer and unusually placed introns and a change in flanking genes. This applies to the nematode Caenorhabditis elegans, the insect Drosophila melanogaster, the urochordate Ciona intestinalis, the annelid Capitella teleta and the planaria Schmidtea mediterranea. In contrast stable genomes show astonishing conservation of the flanking genes. These are identical in the sea anemone Nematostella vectensis and the cephalochordate Branchiostoma floridae lamin B1 gene. Even in the lamin B1 genes from Xenopus tropicalis and man one of the flanking genes is conserved. Finally our analysis forms the basis for a molecular analysis of metazoan phylogeny. Copyright © 2010 Elsevier GmbH. All rights reserved.
Reconstruction of a Functional Human Gene Network, with an Application for Prioritizing Positional Candidate Genes

PubMed Central

Franke, Lude; Bakel, Harm van; Fokkens, Like; de Jong, Edwin D.; Egmont-Petersen, Michael; Wijmenga, Cisca

2006-01-01

Most common genetic disorders have a complex inheritance and may result from variants in many genes, each contributing only weak effects to the disease. Pinpointing these disease genes within the myriad of susceptibility loci identified in linkage studies is difficult because these loci may contain hundreds of genes. However, in any disorder, most of the disease genes will be involved in only a few different molecular pathways. If we know something about the relationships between the genes, we can assess whether some genes (which may reside in different loci) functionally interact with each other, indicating a joint basis for the disease etiology. There are various repositories of information on pathway relationships. To consolidate this information, we developed a functional human gene network that integrates information on genes and the functional relationships between genes, based on data from the Kyoto Encyclopedia of Genes and Genomes, the Biomolecular Interaction Network Database, Reactome, the Human Protein Reference Database, the Gene Ontology database, predicted protein-protein interactions, human yeast two-hybrid interactions, and microarray coexpressions. We applied this network to interrelate positional candidate genes from different disease loci and then tested 96 heritable disorders for which the Online Mendelian Inheritance in Man database reported at least three disease genes. Artificial susceptibility loci, each containing 100 genes, were constructed around each disease gene, and we used the network to rank these genes on the basis of their functional interactions. By following up the top five genes per artificial locus, we were able to detect at least one known disease gene in 54% of the loci studied, representing a 2.8-fold increase over random selection. This suggests that our method can significantly reduce the cost and effort of pinpointing true disease genes in analyses of disorders for which numerous loci have been reported but for which
Validation of reference genes for quantifying changes in gene expression in virus-infected tobacco.

PubMed

Baek, Eseul; Yoon, Ju-Yeon; Palukaitis, Peter

2017-10-01

To facilitate quantification of gene expression changes in virus-infected tobacco plants, eight housekeeping genes were evaluated for their stability of expression during infection by one of three systemically-infecting viruses (cucumber mosaic virus, potato virus X, potato virus Y) or a hypersensitive-response-inducing virus (tobacco mosaic virus; TMV) limited to the inoculated leaf. Five reference-gene validation programs were used to establish the order of the most stable genes for the systemically-infecting viruses as ribosomal protein L25 > β-Tubulin > Actin, and the least stable genes Ubiquitin-conjugating enzyme (UCE) < PP2A < GAPDH. For local infection by TMV, the most stable genes were EF1α > Cysteine protease > Actin, and the least stable genes were GAPDH < PP2A < UCE. Using two of the most stable and the two least stable validated reference genes, three defense responsive genes were examined to compare their relative changes in gene expression caused by each virus. Copyright © 2017 Elsevier Inc. All rights reserved.
Using RNA-seq data to select reference genes for normalizing gene expression in apple roots.

PubMed

Zhou, Zhe; Cong, Peihua; Tian, Yi; Zhu, Yanmin

2017-01-01

Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for their potential use as reliable reference genes. These genes were selected based on their low variance of gene expression in apple root tissues from a recent RNA-seq data set, and a few previously reported apple reference genes for other tissue types. Four methods, Delta Ct, geNorm, NormFinder and BestKeeper, were used to evaluate their stability in apple root tissues of various genotypes and under different experimental conditions. A small panel of stably expressed genes, MDP0000095375, MDP0000147424, MDP0000233640, MDP0000326399 and MDP0000173025 were recommended for normalizing quantitative gene expression data in apple roots under various abiotic or biotic stresses. When the most stable and least stable reference genes were used for data normalization, significant differences were observed on the expression patterns of two target genes, MdLecRLK5 (MDP0000228426, a gene encoding a lectin receptor like kinase) and MdMAPK3 (MDP0000187103, a gene encoding a mitogen-activated protein kinase). Our data also indicated that for those carefully validated reference genes, a single reference gene is sufficient for reliable normalization of the quantitative gene expression. Depending on the experimental conditions, the most suitable reference genes can be specific to the sample of interest for more reliable RT-qPCR data normalization.
Using RNA-seq data to select reference genes for normalizing gene expression in apple roots

PubMed Central

Zhou, Zhe; Cong, Peihua; Tian, Yi

2017-01-01

Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for their potential use as reliable reference genes. These genes were selected based on their low variance of gene expression in apple root tissues from a recent RNA-seq data set, and a few previously reported apple reference genes for other tissue types. Four methods, Delta Ct, geNorm, NormFinder and BestKeeper, were used to evaluate their stability in apple root tissues of various genotypes and under different experimental conditions. A small panel of stably expressed genes, MDP0000095375, MDP0000147424, MDP0000233640, MDP0000326399 and MDP0000173025 were recommended for normalizing quantitative gene expression data in apple roots under various abiotic or biotic stresses. When the most stable and least stable reference genes were used for data normalization, significant differences were observed on the expression patterns of two target genes, MdLecRLK5 (MDP0000228426, a gene encoding a lectin receptor like kinase) and MdMAPK3 (MDP0000187103, a gene encoding a mitogen-activated protein kinase). Our data also indicated that for those carefully validated reference genes, a single reference gene is sufficient for reliable normalization of the quantitative gene expression. Depending on the experimental conditions, the most suitable reference genes can be specific to the sample of interest for more reliable RT-qPCR data normalization. PMID:28934340
Generation of novel resistance genes using mutation and targeted gene editing.

PubMed

Gal-On, Amit; Fuchs, Marc; Gray, Stewart

2017-10-01

Classical breeding for virus resistance is a lengthy process and is restricted by the availability of resistance genes. Precise genome editing is a 'dream technology' to improve plants for virus resistance and these tools have opened new and very promising ways to generate virus resistant plants by disrupting host susceptibility genes, or by increasing the expression of viral resistance genes. However, precise targets must be identified and their roles understood to minimize potential negative effects on the plant. Nonetheless, the opportunities for genome editing are expanding, as are the technologies to generate effective and broad-spectrum resistance against plant viruses. Here we provide insights into recent progress related to gene targets and gene editing technologies. Published by Elsevier B.V.

[Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

PubMed

Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

2012-07-01

In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.
Gene finding in metatranscriptomic sequences.

PubMed

Ismail, Wazim Mohammed; Ye, Yuzhen; Tang, Haixu

2014-01-01

Metatranscriptomic sequencing is a highly sensitive bioassay of functional activity in a microbial community, providing complementary information to the metagenomic sequencing of the community. The acquisition of the metatranscriptomic sequences will enable us to refine the annotations of the metagenomes, and to study the gene activities and their regulation in complex microbial communities and their dynamics. In this paper, we present TransGeneScan, a software tool for finding genes in assembled transcripts from metatranscriptomic sequences. By incorporating several features of metatranscriptomic sequencing, including strand-specificity, short intergenic regions, and putative antisense transcripts into a Hidden Markov Model, TranGeneScan can predict a sense transcript containing one or multiple genes (in an operon) or an antisense transcript. We tested TransGeneScan on a mock metatranscriptomic data set containing three known bacterial genomes. The results showed that TranGeneScan performs better than metagenomic gene finders (MetaGeneMark and FragGeneScan) on predicting protein coding genes in assembled transcripts, and achieves comparable or even higher accuracy than gene finders for microbial genomes (Glimmer and GeneMark). These results imply, with the assistance of metatranscriptomic sequencing, we can obtain a broad and precise picture about the genes (and their functions) in a microbial community. TransGeneScan is available as open-source software on SourceForge at https://sourceforge.net/projects/transgenescan/.
Gene regulatory network of unfolded protein response genes in endoplasmic reticulum stress.

PubMed

Takayanagi, Sayuri; Fukuda, Riga; Takeuchi, Yuuki; Tsukada, Sakiko; Yoshida, Kenichi

2013-01-01

In the endoplasmic reticulum (ER), secretory and membrane proteins are properly folded and modified, and the failure of these processes leads to ER stress. At the same time, unfolded protein response (UPR) genes are activated to maintain homeostasis. Despite the thorough characterization of the individual gene regulation of UPR genes to date, further investigation of the mutual regulation among UPR genes is required to understand the complex mechanism underlying the ER stress response. In this study, we aimed to reveal a gene regulatory network formed by UPR genes, including immunoglobulin heavy chain-binding protein (BiP), X-box binding protein 1 (XBP1), C/EBP [CCAAT/enhancer-binding protein]-homologous protein (CHOP), PKR-like endoplasmic reticulum kinase (PERK), inositol-requiring 1 (IRE1), activating transcription factor 6 (ATF6), and ATF4. For this purpose, we focused on promoter-luciferase reporters for BiP, XBP1, and CHOP genes, which bear an ER stress response element (ERSE), and p5 × ATF6-GL3, which bears an unfolded protein response element (UPRE). We demonstrated that the luciferase activities of the BiP and CHOP promoters were upregulated by all the UPR genes, whereas those of the XBP1 promoter and p5 × ATF6-GL3 were upregulated by all the UPR genes except for BiP, CHOP, and ATF4 in HeLa cells. Therefore, an ERSE- and UPRE-centered gene regulatory network of UPR genes could be responsible for the robustness of the ER stress response. Finally, we revealed that BiP protein was degraded when cells were treated with DNA-damaging reagents, such as etoposide and doxorubicin; this finding suggests that the expression level of BiP is tightly regulated at the post-translational level, rather than at the transcriptional level, in the presence of DNA damage.
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.

PubMed

Tintle, Nathan L; Sitarik, Alexandra; Boerema, Benjamin; Young, Kylie; Best, Aaron A; Dejongh, Matthew

2012-08-08

Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Candidate genes for panhypopituitarism identified by gene expression profiling

PubMed Central

Mortensen, Amanda H.; MacDonald, James W.; Ghosh, Debashis

2011-01-01

Mutations in the transcription factors PROP1 and PIT1 (POU1F1) lead to pituitary hormone deficiency and hypopituitarism in mice and humans. The dysmorphology of developing Prop1 mutant pituitaries readily distinguishes them from those of Pit1 mutants and normal mice. This and other features suggest that Prop1 controls the expression of genes besides Pit1 that are important for pituitary cell migration, survival, and differentiation. To identify genes involved in these processes we used microarray analysis of gene expression to compare pituitary RNA from newborn Prop1 and Pit1 mutants and wild-type littermates. Significant differences in gene expression were noted between each mutant and their normal littermates, as well as between Prop1 and Pit1 mutants. Otx2, a gene critical for normal eye and pituitary development in humans and mice, exhibited elevated expression specifically in Prop1 mutant pituitaries. We report the spatial and temporal regulation of Otx2 in normal mice and Prop1 mutants, and the results suggest Otx2 could influence pituitary development by affecting signaling from the ventral diencephalon and regulation of gene expression in Rathke's pouch. The discovery that Otx2 expression is affected by Prop1 deficiency provides support for our hypothesis that identifying molecular differences in mutants will contribute to understanding the molecular mechanisms that control pituitary organogenesis and lead to human pituitary disease. PMID:21828248
Zinc-finger protein-targeted gene regulation: Genomewide single-gene specificity

PubMed Central

Tan, Siyuan; Guschin, Dmitry; Davalos, Albert; Lee, Ya-Li; Snowden, Andrew W.; Jouvenot, Yann; Zhang, H. Steven; Howes, Katherine; McNamara, Andrew R.; Lai, Albert; Ullman, Chris; Reynolds, Lindsey; Moore, Michael; Isalan, Mark; Berg, Lutz-Peter; Campos, Bradley; Qi, Hong; Spratt, S. Kaye; Case, Casey C.; Pabo, Carl O.; Campisi, Judith; Gregory, Philip D.

2003-01-01

Zinc-finger protein transcription factors (ZFP TFs) can be designed to control the expression of any desired target gene, and thus provide potential therapeutic tools for the study and treatment of disease. Here we report that a ZFP TF can repress target gene expression with single-gene specificity within the human genome. A ZFP TF repressor that binds an 18-bp recognition sequence within the promoter of the endogenous CHK2 gene gives a >10-fold reduction in CHK2 mRNA and protein. This level of repression was sufficient to generate a functional phenotype, as demonstrated by the loss of DNA damage-induced CHK2-dependent p53 phosphorylation. We determined the specificity of repression by using DNA microarrays and found that the ZFP TF repressed a single gene (CHK2) within the monitored genome in two different cell types. These data demonstrate the utility of ZFP TFs as precise tools for target validation, and highlight their potential as clinical therapeutics. PMID:14514889
Identification of a Transcriptionally Forward α Gene and Two υ Genes within the Pigeon (Columba livia) IgH Gene Locus.

PubMed

Huang, Tian; Wang, Xifeng; Si, Run; Chi, Hao; Han, Binyue; Han, Haitang; Cao, Gengsheng; Zhao, Yaofeng

2018-06-01

Compared with mammals, the bird Ig genetic system relies on gene conversion to create an Ab repertoire, with inversion of the IgA-encoding gene and very few cases of Ig subclass diversification. Although gene conversion has been studied intensively, class-switch recombination, a mechanism by which the IgH C region is exchanged, has rarely been investigated in birds. In this study, based on the published genome of pigeon ( Columba livia ) and high-throughput transcriptome sequencing of immune-related tissues, we identified a transcriptionally forward α gene and found that the pigeon IgH gene locus is arranged as μ-α-υ1-υ2. In this article, we show that both DNA deletion and inversion may result from IgA and IgY class switching, and similar junction patterns were observed for both types of class-switch recombination. We also identified two subclasses of υ genes in pigeon, which share low sequence identity. Phylogenetic analysis suggests that divergence of the two pigeon υ genes occurred during the early stage of bird evolution. The data obtained in this study provide new insight into class-switch recombination and Ig gene evolution in birds. Copyright © 2018 by The American Association of Immunologists, Inc.
Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes.

PubMed

Lomsadze, Alexandre; Gemayel, Karl; Tang, Shiyuyun; Borodovsky, Mark

2018-05-17

In a conventional view of the prokaryotic genome organization, promoters precede operons and ribosome binding sites (RBSs) with Shine-Dalgarno consensus precede genes. However, recent experimental research suggesting a more diverse view motivated us to develop an algorithm with improved gene-finding accuracy. We describe GeneMarkS-2, an ab initio algorithm that uses a model derived by self-training for finding species-specific (native) genes, along with an array of precomputed "heuristic" models designed to identify harder-to-detect genes (likely horizontally transferred). Importantly, we designed GeneMarkS-2 to identify several types of distinct sequence patterns (signals) involved in gene expression control, among them the patterns characteristic for leaderless transcription as well as noncanonical RBS patterns. To assess the accuracy of GeneMarkS-2, we used genes validated by COG (Clusters of Orthologous Groups) annotation, proteomics experiments, and N-terminal protein sequencing. We observed that GeneMarkS-2 performed better on average in all accuracy measures when compared with the current state-of-the-art gene prediction tools. Furthermore, the screening of ∼5000 representative prokaryotic genomes made by GeneMarkS-2 predicted frequent leaderless transcription in both archaea and bacteria. We also observed that the RBS sites in some species with leadered transcription did not necessarily exhibit the Shine-Dalgarno consensus. The modeling of different types of sequence motifs regulating gene expression prompted a division of prokaryotic genomes into five categories with distinct sequence patterns around the gene starts. © 2018 Lomsadze et al.; Published by Cold Spring Harbor Laboratory Press.
Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

PubMed

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-04-21

To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

PubMed Central

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-01-01

Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis
Computational analysis of gene-gene interactions using multifactor dimensionality reduction.

PubMed

Moore, Jason H

2004-11-01

Understanding the relationship between DNA sequence variations and biologic traits is expected to improve the diagnosis, prevention and treatment of common human diseases. Success in characterizing genetic architecture will depend on our ability to address nonlinearities in the genotype-to-phenotype mapping relationship as a result of gene-gene interactions, or epistasis. This review addresses the challenges associated with the detection and characterization of epistasis. A novel strategy known as multifactor dimensionality reduction that was specifically designed for the identification of multilocus genetic effects is presented. Several case studies that demonstrate the detection of gene-gene interactions in common diseases such as atrial fibrillation, Type II diabetes and essential hypertension are also discussed.
A Partial Least Square Approach for Modeling Gene-gene and Gene-environment Interactions When Multiple Markers Are Genotyped

PubMed Central

Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C.

2008-01-01

Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense SNPs in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey’s 1-df model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women’s Health Initiative (WHI), this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with BMI. PMID:18615621
A partial least-square approach for modeling gene-gene and gene-environment interactions when multiple markers are genotyped.

PubMed

Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C

2009-01-01

Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense single nucleotype polymorphisms (SNPs) in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches, the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey's one-degree-of-freedom model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women's Health Initiative, this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with body mass index.
An intronic microRNA silences genes that are functionally antagonistic to its host gene.

PubMed

Barik, Sailen

2008-09-01

MicroRNAs (miRNAs) are short noncoding RNAs that down-regulate gene expression by silencing specific target mRNAs. While many miRNAs are transcribed from their own genes, nearly half map within introns of 'host' genes, the significance of which remains unclear. We report that transcriptional activation of apoptosis-associated tyrosine kinase (AATK), essential for neuronal differentiation, also generates miR-338 from an AATK gene intron that silences a family of mRNAs whose protein products are negative regulators of neuronal differentiation. We conclude that an intronic miRNA, transcribed together with the host gene mRNA, may serve the interest of its host gene by silencing a cohort of genes that are functionally antagonistic to the host gene itself.
Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

PubMed

Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

2012-01-01

Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
With Reference to Reference Genes: A Systematic Review of Endogenous Controls in Gene Expression Studies.

PubMed

Chapman, Joanne R; Waldenström, Jonas

2015-01-01

The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.
Lineage-specific expansion of IFIT gene family: an insight into coevolution with IFN gene family.

PubMed

Liu, Ying; Zhang, Yi-Bing; Liu, Ting-Kai; Gui, Jian-Fang

2013-01-01

In mammals, IFIT (Interferon [IFN]-induced proteins with Tetratricopeptide Repeat [TPR] motifs) family genes are involved in many cellular and viral processes, which are tightly related to mammalian IFN response. However, little is known about non-mammalian IFIT genes. In the present study, IFIT genes are identified in the genome databases from the jawed vertebrates including the cartilaginous elephant shark but not from non-vertebrates such as lancelet, sea squirt and acorn worm, suggesting that IFIT gene family originates from a vertebrate ancestor about 450 million years ago. IFIT family genes show conserved gene structure and gene arrangements. Phylogenetic analyses reveal that this gene family has expanded through lineage-specific and species-specific gene duplication. Interestingly, IFN gene family seem to share a common ancestor and a similar evolutionary mechanism; the function link of IFIT genes to IFN response is present early since the origin of both gene families, as evidenced by the finding that zebrafish IFIT genes are upregulated by fish IFNs, poly(I:C) and two transcription factors IRF3/IRF7, likely via the IFN-stimulated response elements (ISRE) within the promoters of vertebrate IFIT family genes. These coevolution features creates functional association of both family genes to fulfill a common biological process, which is likely selected by viral infection during evolution of vertebrates. Our results are helpful for understanding of evolution of vertebrate IFN system.
Good genes, complementary genes and human mate preferences.

PubMed

Roberts, S Craig; Little, Anthony C

2008-03-01

The past decade has witnessed a rapidly growing interest in the biological basis of human mate choice. Here we review recent studies that demonstrate preferences for traits which might reveal genetic quality to prospective mates, with potential but still largely unknown influence on offspring fitness. These include studies assessing visual, olfactory and auditory preferences for potential good-gene indicator traits, such as dominance or bilateral symmetry. Individual differences in these robust preferences mainly arise through within and between individual variation in condition and reproductive status. Another set of studies have revealed preferences for traits indicating complementary genes, focussing on discrimination of dissimilarity at genes in the major histocompatibility complex (MHC). As in animal studies, we are only just beginning to understand how preferences for specific traits vary and inter-relate, how consideration of good and compatible genes can lead to substantial variability in individual mate choice decisions and how preferences expressed in one sensory modality may reflect those in another. Humans may be an ideal model species in which to explore these interesting complexities.
Good genes, complementary genes and human mate preferences.

PubMed

Roberts, S Craig; Little, Anthony C

2008-09-01

The past decade has witnessed a rapidly growing interest in the biological basis of human mate choice. Here we review recent studies that demonstrate preferences for traits which might reveal genetic quality to prospective mates, with potential but still largely unknown influence on offspring fitness. These include studies assessing visual, olfactory and auditory preferences for potential good-gene indicator traits, such as dominance or bilateral symmetry. Individual differences in these robust preferences mainly arise through within and between individual variation in condition and reproductive status. Another set of studies have revealed preferences for traits indicating complementary genes, focussing on discrimination of dissimilarity at genes in the major histocompatibility complex (MHC). As in animal studies, we are only just beginning to understand how preferences for specific traits vary and inter-relate, how consideration of good and compatible genes can lead to substantial variability in individual mate choice decisions and how preferences expressed in one sensory modality may reflect those in another. Humans may be an ideal model species in which to explore these interesting complexities.
Gene expression studies of reference genes for quantitative real-time PCR: an overview in insects.

PubMed

Shakeel, Muhammad; Rodriguez, Alicia; Tahir, Urfa Bin; Jin, Fengliang

2018-02-01

Whenever gene expression is being examined, it is essential that a normalization process is carried out to eliminate non-biological variations. The use of reference genes, such as glyceraldehyde-3-phosphate dehydrogenase, actin, and ribosomal protein genes, is the usual method of choice for normalizing gene expression. Although reference genes are used to normalize target gene expression, a major problem is that the stability of these genes differs among tissues, developmental stages, species, and responses to abiotic factors. Therefore, the use and validation of multiple reference genes are required. This review discusses the reasons that why RT-qPCR has become the preferred method for validating results of gene expression profiles, the use of specific and non-specific dyes and the importance of use of primers and probes for qPCR as well as to discuss several statistical algorithms developed to help the validation of potential reference genes. The conflicts arising in the use of classical reference genes in gene normalization and their replacement with novel references are also discussed by citing the high stability and low stability of classical and novel reference genes under various biotic and abiotic experimental conditions by employing various methods applied for the reference genes amplification.

Gene therapy in pancreatic cancer

PubMed Central

Liu, Si-Xue; Xia, Zhong-Sheng; Zhong, Ying-Qiang

2014-01-01

Pancreatic cancer (PC) is a highly lethal disease and notoriously difficult to treat. Only a small proportion of PC patients are eligible for surgical resection, whilst conventional chemoradiotherapy only has a modest effect with substantial toxicity. Gene therapy has become a new widely investigated therapeutic approach for PC. This article reviews the basic rationale, gene delivery methods, therapeutic targets and developments of laboratory research and clinical trials in gene therapy of PC by searching the literature published in English using the PubMed database and analyzing clinical trials registered on the Gene Therapy Clinical Trials Worldwide website (http://www. wiley.co.uk/genmed/ clinical). Viral vectors are main gene delivery tools in gene therapy of cancer, and especially, oncolytic virus shows brighter prospect due to its tumor-targeting property. Efficient therapeutic targets for gene therapy include tumor suppressor gene p53, mutant oncogene K-ras, anti-angiogenesis gene VEGFR, suicide gene HSK-TK, cytosine deaminase and cytochrome p450, multiple cytokine genes and so on. Combining different targets or combination strategies with traditional chemoradiotherapy may be a more effective approach to improve the efficacy of cancer gene therapy. Cancer gene therapy is not yet applied in clinical practice, but basic and clinical studies have demonstrated its safety and clinical benefits. Gene therapy will be a new and promising field for the treatment of PC. PMID:25309069
Optimal design of gene knockout experiments for gene regulatory network inference

PubMed Central

Ud-Dean, S. M. Minhaz; Gunawan, Rudiyanto

2016-01-01

Motivation: We addressed the problem of inferring gene regulatory network (GRN) from gene expression data of knockout (KO) experiments. This inference is known to be underdetermined and the GRN is not identifiable from data. Past studies have shown that suboptimal design of experiments (DOE) contributes significantly to the identifiability issue of biological networks, including GRNs. However, optimizing DOE has received much less attention than developing methods for GRN inference. Results: We developed REDuction of UnCertain Edges (REDUCE) algorithm for finding the optimal gene KO experiment for inferring directed graphs (digraphs) of GRNs. REDUCE employed ensemble inference to define uncertain gene interactions that could not be verified by prior data. The optimal experiment corresponds to the maximum number of uncertain interactions that could be verified by the resulting data. For this purpose, we introduced the concept of edge separatoid which gave a list of nodes (genes) that upon their removal would allow the verification of a particular gene interaction. Finally, we proposed a procedure that iterates over performing KO experiments, ensemble update and optimal DOE. The case studies including the inference of Escherichia coli GRN and DREAM 4 100-gene GRNs, demonstrated the efficacy of the iterative GRN inference. In comparison to systematic KOs, REDUCE could provide much higher information return per gene KO experiment and consequently more accurate GRN estimates. Conclusions: REDUCE represents an enabling tool for tackling the underdetermined GRN inference. Along with advances in gene deletion and automation technology, the iterative procedure brings an efficient and fully automated GRN inference closer to reality. Availability and implementation: MATLAB and Python scripts of REDUCE are available on www.cabsel.ethz.ch/tools/REDUCE. Contact: rudi.gunawan@chem.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID
Single-nucleotide polymorphism-gene intermixed networking reveals co-linkers connected to multiple gene expression phenotypes

PubMed Central

Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia

2007-01-01

Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544
A multi-strategy approach to informative gene identification from gene expression data.

PubMed

Liu, Ziying; Phan, Sieu; Famili, Fazel; Pan, Youlian; Lenferink, Anne E G; Cantin, Christiane; Collins, Catherine; O'Connor-McCourt, Maureen D

2010-02-01

An unsupervised multi-strategy approach has been developed to identify informative genes from high throughput genomic data. Several statistical methods have been used in the field to identify differentially expressed genes. Since different methods generate different lists of genes, it is very challenging to determine the most reliable gene list and the appropriate method. This paper presents a multi-strategy method, in which a combination of several data analysis techniques are applied to a given dataset and a confidence measure is established to select genes from the gene lists generated by these techniques to form the core of our final selection. The remainder of the genes that form the peripheral region are subject to exclusion or inclusion into the final selection. This paper demonstrates this methodology through its application to an in-house cancer genomics dataset and a public dataset. The results indicate that our method provides more reliable list of genes, which are validated using biological knowledge, biological experiments, and literature search. We further evaluated our multi-strategy method by consolidating two pairs of independent datasets, each pair is for the same disease, but generated by different labs using different platforms. The results showed that our method has produced far better results.
A kernel regression approach to gene-gene interaction detection for case-control studies.

PubMed

Larson, Nicholas B; Schaid, Daniel J

2013-11-01

Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attentions have moved beyond single locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.
Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici

PubMed Central

McDonald, Megan C.; McGinness, Lachlan; Hane, James K.; Williams, Angela H.; Milgate, Andrew; Solomon, Peter S.

2016-01-01

Zymoseptoria tritici is a host-specific, necrotrophic pathogen of wheat. Infection by Z. tritici is characterized by its extended latent period, which typically lasts 2 wks, and is followed by extensive host cell death, and rapid proliferation of fungal biomass. This work characterizes the level of genomic variation in 13 isolates, for which we have measured virulence on 11 wheat cultivars with differential resistance genes. Between the reference isolate, IPO323, and the 13 Australian isolates we identified over 800,000 single nucleotide polymorphisms, of which ∼10% had an effect on the coding regions of the genome. Furthermore, we identified over 1700 probable presence/absence polymorphisms in genes across the Australian isolates using de novo assembly. Finally, we developed a gene tree sorting method that quickly identifies groups of isolates within a single gene alignment whose sequence haplotypes correspond with virulence scores on a single wheat cultivar. Using this method, we have identified < 100 candidate effector genes whose gene sequence correlates with virulence toward a wheat cultivar carrying a major resistance gene. PMID:26837952
Calcisponges have a ParaHox gene and dynamic expression of dispersed NK homeobox genes.

PubMed

Fortunato, Sofia A V; Adamski, Marcin; Ramos, Olivia Mendivil; Leininger, Sven; Liu, Jing; Ferrier, David E K; Adamska, Maja

2014-10-30

Sponges are simple animals with few cell types, but their genomes paradoxically contain a wide variety of developmental transcription factors, including homeobox genes belonging to the Antennapedia (ANTP) class, which in bilaterians encompass Hox, ParaHox and NK genes. In the genome of the demosponge Amphimedon queenslandica, no Hox or ParaHox genes are present, but NK genes are linked in a tight cluster similar to the NK clusters of bilaterians. It has been proposed that Hox and ParaHox genes originated from NK cluster genes after divergence of sponges from the lineage leading to cnidarians and bilaterians. On the other hand, synteny analysis lends support to the notion that the absence of Hox and ParaHox genes in Amphimedon is a result of secondary loss (the ghost locus hypothesis). Here we analysed complete suites of ANTP-class homeoboxes in two calcareous sponges, Sycon ciliatum and Leucosolenia complicata. Our phylogenetic analyses demonstrate that these calcisponges possess orthologues of bilaterian NK genes (Hex, Hmx and Msx), a varying number of additional NK genes and one ParaHox gene, Cdx. Despite the generation of scaffolds spanning multiple genes, we find no evidence of clustering of Sycon NK genes. All Sycon ANTP-class genes are developmentally expressed, with patterns suggesting their involvement in cell type specification in embryos and adults, metamorphosis and body plan patterning. These results demonstrate that ParaHox genes predate the origin of sponges, thus confirming the ghost locus hypothesis, and highlight the need to analyse the genomes of multiple sponge lineages to obtain a complete picture of the ancestral composition of the first animal genome.
Annotation of gene function in citrus using gene expression information and co-expression networks

PubMed Central

2014-01-01

Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. Conclusions Integration of citrus gene co-expression networks
Recommended nomenclature for five mammalian carboxylesterase gene families: human, mouse, and rat genes and proteins.

PubMed

Holmes, Roger S; Wright, Matthew W; Laulederkind, Stanley J F; Cox, Laura A; Hosokawa, Masakiyo; Imai, Teruko; Ishibashi, Shun; Lehner, Richard; Miyazaki, Masao; Perkins, Everett J; Potter, Phillip M; Redinbo, Matthew R; Robert, Jacques; Satoh, Tetsuo; Yamashita, Tetsuro; Yan, Bingfan; Yokoi, Tsuyoshi; Zechner, Rudolf; Maltais, Lois J

2010-10-01

Mammalian carboxylesterase (CES or Ces) genes encode enzymes that participate in xenobiotic, drug, and lipid metabolism in the body and are members of at least five gene families. Tandem duplications have added more genes for some families, particularly for mouse and rat genomes, which has caused confusion in naming rodent Ces genes. This article describes a new nomenclature system for human, mouse, and rat carboxylesterase genes that identifies homolog gene families and allocates a unique name for each gene. The guidelines of human, mouse, and rat gene nomenclature committees were followed and "CES" (human) and "Ces" (mouse and rat) root symbols were used followed by the family number (e.g., human CES1). Where multiple genes were identified for a family or where a clash occurred with an existing gene name, a letter was added (e.g., human CES4A; mouse and rat Ces1a) that reflected gene relatedness among rodent species (e.g., mouse and rat Ces1a). Pseudogenes were named by adding "P" and a number to the human gene name (e.g., human CES1P1) or by using a new letter followed by ps for mouse and rat Ces pseudogenes (e.g., Ces2d-ps). Gene transcript isoforms were named by adding the GenBank accession ID to the gene symbol (e.g., human CES1_AB119995 or mouse Ces1e_BC019208). This nomenclature improves our understanding of human, mouse, and rat CES/Ces gene families and facilitates research into the structure, function, and evolution of these gene families. It also serves as a model for naming CES genes from other mammalian species.
An improved method for functional similarity analysis of genes based on Gene Ontology.

PubMed

Tian, Zhen; Wang, Chunyu; Guo, Maozu; Liu, Xiaoyan; Teng, Zhixia

2016-12-23

Measures of gene functional similarity are essential tools for gene clustering, gene function prediction, evaluation of protein-protein interaction, disease gene prioritization and other applications. In recent years, many gene functional similarity methods have been proposed based on the semantic similarity of GO terms. However, these leading approaches may make errorprone judgments especially when they measure the specificity of GO terms as well as the IC of a term set. Therefore, how to estimate the gene functional similarity reliably is still a challenging problem. We propose WIS, an effective method to measure the gene functional similarity. First of all, WIS computes the IC of a term by employing its depth, the number of its ancestors as well as the topology of its descendants in the GO graph. Secondly, WIS calculates the IC of a term set by means of considering the weighted inherited semantics of terms. Finally, WIS estimates the gene functional similarity based on the IC overlap ratio of term sets. WIS is superior to some other representative measures on the experiments of functional classification of genes in a biological pathway, collaborative evaluation of GO-based semantic similarity measures, protein-protein interaction prediction and correlation with gene expression. Further analysis suggests that WIS takes fully into account the specificity of terms and the weighted inherited semantics of terms between GO terms. The proposed WIS method is an effective and reliable way to compare gene function. The web service of WIS is freely available at http://nclab.hit.edu.cn/WIS/ .
A vitamin D pathway gene-gene interaction affects low-density lipoprotein cholesterol levels.

PubMed

Grave, Nathália; Tovo-Rodrigues, Luciana; da Silveira, Janaína; Rovaris, Diego Luiz; Dal Bosco, Simone Morelo; Contini, Verônica; Genro, Júlia Pasqualini

2016-12-01

Much evidence suggests an association between vitamin D deficiency and chronic diseases such as obesity and dyslipidemia. Although genetic factors play an important role in the etiology of these diseases, only a few studies have investigated the relationship between vitamin D-related genes and anthropometric and lipid profiles. The aim of this study was to investigate the association of three vitamin D-related genes with anthropometric and lipid parameters in 542 adult individuals. We analyzed the rs2228570 polymorphism in the vitamin D receptor gene (VDR), rs2134095 in the retinoid X receptor gamma gene (RXRG) and rs7041 in the vitamin D-binding protein gene (GC). Polymorphisms were genotyped by TaqMan allelic discrimination. Gene-gene interactions were evaluated by the general linear model. The functionality of the polymorphisms was investigated using the following predictors and databases: SIFT (Sorting Intolerant from Tolerant), PolyPhen-2 (Polymorphism Phenotyping v2) and Human Splicing Finder 3. We identified a significant effect of the interaction between RXRG (rs2134095) and GC (rs7041) on low-density lipoprotein cholesterol (LDL-c) levels (P=.005). Furthermore, our in silico analysis suggested a functional role for both variants in the regulation of the gene products. Our results suggest that the vitamin D-related genes RXRG and GC affect LDL-c levels. These findings are in agreement with other studies that consistently associate vitamin D and lipid profile. Together, our results corroborate the idea that analyzing gene-gene interaction would be helpful to clarify the genetic component of lipid profile. Copyright © 2016 Elsevier Inc. All rights reserved.
Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.

PubMed

Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi

2007-10-04

In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.
Investigating Gene Function in Cereal Rust Fungi by Plant-Mediated Virus-Induced Gene Silencing.

PubMed

Panwar, Vinay; Bakkeren, Guus

2017-01-01

Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.
MorphDB: Prioritizing Genes for Specialized Metabolism Pathways and Gene Ontology Categories in Plants.

PubMed

Zwaenepoel, Arthur; Diels, Tim; Amar, David; Van Parys, Thomas; Shamir, Ron; Van de Peer, Yves; Tzfadia, Oren

2018-01-01

Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest.
Newer Gene Editing Technologies toward HIV Gene Therapy

PubMed Central

Manjunath, N.; Yi, Guohua; Dang, Ying; Shankar, Premlata

2013-01-01

Despite the great success of highly active antiretroviral therapy (HAART) in ameliorating the course of HIV infection, alternative therapeutic approaches are being pursued because of practical problems associated with life-long therapy. The eradication of HIV in the so-called “Berlin patient” who received a bone marrow transplant from a CCR5-negative donor has rekindled interest in genome engineering strategies to achieve the same effect. Precise gene editing within the cells is now a realistic possibility with recent advances in understanding the DNA repair mechanisms, DNA interaction with transcription factors and bacterial defense mechanisms. Within the past few years, four novel technologies have emerged that can be engineered for recognition of specific DNA target sequences to enable site-specific gene editing: Homing Endonuclease, ZFN, TALEN, and CRISPR/Cas9 system. The most recent CRISPR/Cas9 system uses a short stretch of complementary RNA bound to Cas9 nuclease to recognize and cleave target DNA, as opposed to the previous technologies that use DNA binding motifs of either zinc finger proteins or transcription activator-like effector molecules fused to an endonuclease to mediate sequence-specific DNA cleavage. Unlike RNA interference, which requires the continued presence of effector moieties to maintain gene silencing, the newer technologies allow permanent disruption of the targeted gene after a single treatment. Here, we review the applications, limitations and future prospects of novel gene-editing strategies for use as HIV therapy. PMID:24284874
Pyviko: an automated Python tool to design gene knockouts in complex viruses with overlapping genes.

PubMed

Taylor, Louis J; Strebel, Klaus

2017-01-07

Gene knockouts are a common tool used to study gene function in various organisms. However, designing gene knockouts is complicated in viruses, which frequently contain sequences that code for multiple overlapping genes. Designing mutants that can be traced by the creation of new or elimination of existing restriction sites further compounds the difficulty in experimental design of knockouts of overlapping genes. While software is available to rapidly identify restriction sites in a given nucleotide sequence, no existing software addresses experimental design of mutations involving multiple overlapping amino acid sequences in generating gene knockouts. Pyviko performed well on a test set of over 240,000 gene pairs collected from viral genomes deposited in the National Center for Biotechnology Information Nucleotide database, identifying a point mutation which added a premature stop codon within the first 20 codons of the target gene in 93.2% of all tested gene-overprinted gene pairs. This shows that Pyviko can be used successfully in a wide variety of contexts to facilitate the molecular cloning and study of viral overprinted genes. Pyviko is an extensible and intuitive Python tool for designing knockouts of overlapping genes. Freely available as both a Python package and a web-based interface ( http://louiejtaylor.github.io/pyViKO/ ), Pyviko simplifies the experimental design of gene knockouts in complex viruses with overlapping genes.
Turning the gene tap off; implications of regulating gene expression for cancer therapeutics

PubMed Central

Curtin, James F.; Candolfi, Marianela; Xiong, Weidong; Lowenstein, Pedro R.; Castro, Maria G.

2008-01-01

Cancer poses a tremendous therapeutic challenge worldwide, highlighting the critical need for developing novel therapeutics. A promising cancer treatment modality is gene therapy, which is a form of molecular medicine designed to introduce into target cells genetic material with therapeutic intent. Anticancer gene therapy strategies currently used in preclinical models, and in some cases in the clinic, include proapoptotic genes, oncolytic/replicative vectors, conditional cytotoxic approaches, inhibition of angiogenesis, inhibition of growth factor signaling, inactivation of oncogenes, inhibition of tumor invasion and stimulation of the immune system. The translation of these novel therapeutic modalities from the preclinical setting to the clinic has been driven by encouraging preclinical efficacy data and advances in gene delivery technologies. One area of intense research involves the ability to accurately regulate the levels of therapeutic gene expression to achieve enhanced efficacy and provide the capability to switch gene expression off completely if adverse side effects should arise. This feature could also be implemented to switch gene expression off when a successful therapeutic outcome ensues. Here, we will review recent developments related to the engineering of transcriptional switches within gene delivery systems, which could be implemented in clinical gene therapy applications directed at the treatment of cancer. PMID:18347132
Control of bacteriophage P2 gene expression: analysis of transcription of the ogr gene.

PubMed Central

Birkeland, N K; Lindqvist, B H; Christie, G E

1991-01-01

The bacteriophage P2 ogr gene encodes an 8.3-kDa protein that is a positive effector of P2 late gene transcription. The ogr gene is preceded by a promoter sequence (Pogr) resembling a normal Escherichia coli promoter and is located just downstream of a late transcription unit. We analyzed the kinetics and regulation of ogr gene transcription by using an ogr-specific antisense RNA probe in an S1 mapping assay. During a normal P2 infection, ogr gene transcription starts from Pogr at an intermediate time between the onset of early and late transcription. At late times after infection the ogr gene is cotranscribed with the late FETUD operon; the ogr gene product thus positively regulates its own synthesis from the P2 late promoter PF. Expression of the P2 late genes also requires P2 DNA replication. Complementation experiments and transcriptional analysis show that a nonreplicating P2 phage expresses the ogr gene from Pogr but is unable to transcribe the late genes. A P2 ogr-defective phage makes an increased level of ogr mRNA, consistent with autogenous control from Pogr. Transcription of the ogr gene in the prophage of a P2 heteroimmune lysogen is stimulated after infection with P2, suggesting that Pogr is under indirect immunity control and is activated by a yet-unidentified P2 early gene product during infection. Images FIG. 4 FIG. 5 FIG. 6 FIG. 7 FIG. 8 PMID:1938896
Genes, dreams, and cancer.

PubMed Central

Sikora, K.

1994-01-01

There have been tremendous advances in our understanding of cancer from the application of molecular biology over the past decade. The disease is caused by a series of defects in the genes that accelerate growth--oncogenes--and those that slow down cellular turnover--tumour suppressor genes. The proteins they encode provide a promising hunting ground in which to design and test new anticancer drugs. Several treatment strategies are now under clinical trial entailing direct gene transfer. These include the use of gene marking to detect minimal residual disease, the production of novel cancer vaccines by the insertion of genes which uncloak cancer cells so making them visible to the host's immune system, the isolation and coupling of cancer specific molecular switches upstream of drug activating genes, and the correction of aberrant oncogenes or tumour suppressor genes. The issues in these approaches are likely to have a profound impact on the management of cancer patients as we enter the next century. Images p1221-a PMID:8180542
GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains

PubMed Central

Lu, Zhiyong

2015-01-01

The automatic recognition of gene names and their associated database identifiers from biomedical text has been widely studied in recent years, as these tasks play an important role in many downstream text-mining applications. Despite significant previous research, only a small number of tools are publicly available and these tools are typically restricted to detecting only mention level gene names or only document level gene identifiers. In this work, we report GNormPlus: an end-to-end and open source system that handles both gene mention and identifier detection. We created a new corpus of 694 PubMed articles to support our development of GNormPlus, containing manual annotations for not only gene names and their identifiers, but also closely related concepts useful for gene name disambiguation, such as gene families and protein domains. GNormPlus integrates several advanced text-mining techniques, including SimConcept for resolving composite gene names. As a result, GNormPlus compares favorably to other state-of-the-art methods when evaluated on two widely used public benchmarking datasets, achieving 86.7% F1-score on the BioCreative II Gene Normalization task dataset and 50.1% F1-score on the BioCreative III Gene Normalization task dataset. The GNormPlus source code and its annotated corpus are freely available, and the results of applying GNormPlus to the entire PubMed are freely accessible through our web-based tool PubTator. PMID:26380306

Coexpression network based on natural variation in human gene expression reveals gene interactions and functions

PubMed Central

Nayak, Renuka R.; Kearns, Michael; Spielman, Richard S.; Cheung, Vivian G.

2009-01-01

Genes interact in networks to orchestrate cellular processes. Analysis of these networks provides insights into gene interactions and functions. Here, we took advantage of normal variation in human gene expression to infer gene networks, which we constructed using correlations in expression levels of more than 8.5 million gene pairs in immortalized B cells from three independent samples. The resulting networks allowed us to identify biological processes and gene functions. Among the biological pathways, we found processes such as translation and glycolysis that co-occur in the same subnetworks. We predicted the functions of poorly characterized genes, including CHCHD2 and TMEM111, and provided experimental evidence that TMEM111 is part of the endoplasmic reticulum-associated secretory pathway. We also found that IFIH1, a susceptibility gene of type 1 diabetes, interacts with YES1, which plays a role in glucose transport. Furthermore, genes that predispose to the same diseases are clustered nonrandomly in the coexpression network, suggesting that networks can provide candidate genes that influence disease susceptibility. Therefore, our analysis of gene coexpression networks offers information on the role of human genes in normal and disease processes. PMID:19797678
Phage-Mediated Gene Therapy.

PubMed

Hosseinidoust, Zeinab

2017-01-01

Bacteriophages (bacterial viruses) have long been under investigation as vectors for gene therapy. Similar to other viral vectors, the phage coat proteins have evolved over millions of years to protect the viral genome from degradation post injection, offering protection for the valuable therapeutic sequence. However, what sets phage apart from other viral gene delivery vectors is their safety for human use and the relative ease by which foreign molecules can be expressed on the phage outer surface, enabling highly targeted gene delivery. The latter property also makes phage a popular choice for gene therapy target discovery through directed evolution. Although promising, phage-mediated gene therapy faces several outstanding challenges, the most notable being lower gene delivery efficiency compared to animal viruses, vector stability, and nondesirable immune stimulation. This review presents a critical review of promises and challenges of employing phage as gene delivery vehicles as well as an introduction to the concept of phage-based microbiome therapy as the new frontier and perhaps the most promising application of phage-based gene therapy. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

PubMed

Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

2015-01-27

Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
Gene function prediction based on the Gene Ontology hierarchical structure.

PubMed

Cheng, Liangxi; Lin, Hongfei; Hu, Yuncui; Wang, Jian; Yang, Zhihao

2014-01-01

The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we transform it into a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance of positive and negative training samples. Meanwhile the method enhances the discriminating ability of classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier based on a tree structure takes the relationship of target classes into consideration and thus solves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value performance of 50.7% (precision: 52.7% recall: 48.9%). The experimental results demonstrate that when the size of training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to the set of texts in an ontology structure or with a hierarchical relationship.
Gene Editing and Gene-Based Therapeutics for Cardiomyopathies.

PubMed

Ohiri, Joyce C; McNally, Elizabeth M

2018-04-01

With an increasing understanding of genetic defects leading to cardiomyopathy, focus is shifting to correcting these underlying genetic defects. One approach involves treating mutant RNA through antisense oligonucleotides; the first drug has received regulatory approval to treat specific mutations associated with Duchenne muscular dystrophy. Gene editing is being evaluated in the preclinical setting. For inherited cardiomyopathies, genetic correction strategies require tight specificity for the mutant allele. Gene-editing methods are being tested to create deletions that may be useful to restore protein expression by through the bypass of mutations that restore protein production. Site-specific gene editing, which is required to correct many point mutations, is a less efficient process than inducing deletions. Copyright © 2017 Elsevier Inc. All rights reserved.
Gene Delivery in Neuro-Oncology.

PubMed

Dixit, Karan; Kumthekar, Priya

2017-09-02

Glioblastoma multiforme (GBM) is the most common primary malignant brain tumor in adults with a dismal prognosis despite aggressive multimodal management thus novel treatments are urgently needed. Gene therapy is a versatile treatment strategy being investigated in multiple cancers including GBM. In gene therapy, a variety of vectors or "carriers" are used to deliver genes designed for different anti-tumoral effects. Gene delivery vehicles and approaches to treatment will be addressed in this review. The most commonly studied vectors are viral based, however, driven by advances in biomedical engineering, mesenchymal and neural stem cells, as well as multiple different types of nanoparticles have been developed to improve tumor tropism and also increase gene transfer into tumor cells. Different genes have been studied including suicide genes, which convert non-toxic prodrug into cytotoxic drug; immunomodulatory genes, which stimulate the immune system; and tumor suppressor genes which repair the defect that allow cells to divide unchecked. Gene therapy may be a promising treatment strategy in neuro-oncology as it is versatile and flexible due to the ability to tailor vectors and genes for specific therapeutic activity. Pre-clinical studies and clinical trials have demonstrated feasibility and safety of gene therapy; however, further studies are required to determine efficacy.
Cdx ParaHox genes acquired distinct developmental roles after gene duplication in vertebrate evolution.

PubMed

Marlétaz, Ferdinand; Maeso, Ignacio; Faas, Laura; Isaacs, Harry V; Holland, Peter W H

2015-08-01

The functional consequences of whole genome duplications in vertebrate evolution are not fully understood. It remains unclear, for instance, why paralogues were retained in some gene families but extensively lost in others. Cdx homeobox genes encode conserved transcription factors controlling posterior development across diverse bilaterians. These genes are part of the ParaHox gene cluster. Multiple Cdx copies were retained after genome duplication, raising questions about how functional divergence, overlap, and redundancy respectively contributed to their retention and evolutionary fate. We examined the degree of regulatory and functional overlap between the three vertebrate Cdx genes using single and triple morpholino knock-down in Xenopus tropicalis followed by RNA-seq. We found that one paralogue, Cdx4, has a much stronger effect on gene expression than the others, including a strong regulatory effect on FGF and Wnt genes. Functional annotation revealed distinct and overlapping roles and subtly different temporal windows of action for each gene. The data also reveal a colinear-like effect of Cdx genes on Hox genes, with repression of Hox paralogy groups 1 and 2, and activation increasing from Hox group 5 to 11. We also highlight cases in which duplicated genes regulate distinct paralogous targets revealing pathway elaboration after whole genome duplication. Despite shared core pathways, Cdx paralogues have acquired distinct regulatory roles during development. This implies that the degree of functional overlap between paralogues is relatively low and that gene expression pattern alone should be used with caution when investigating the functional evolution of duplicated genes. We therefore suggest that developmental programmes were extensively rewired after whole genome duplication in the early evolution of vertebrates.
Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates.

PubMed

Bao, Yongbo; Xu, Fei; Shimeld, Sebastian M

2017-04-01

The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix-loop-helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56-88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies.

PubMed

Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D

2016-09-02

Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

PubMed

Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.
Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

PubMed Central

Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417
Gene Polymorphism Association with Type 2 Diabetes and Related Gene-Gene and Gene-Environment Interactions in a Uyghur Population

PubMed Central

Xiao, Shan; Zeng, Xiaoyun; Fan, Yong; Su, Yinxia; Ma, Qi; Zhu, Jun; Yao, Hua

2016-01-01

Background We investigated the association between 8 single-nucleotide polymorphisms (SNPs) at 3 genetic loci (CDKAL1, CDKN2A/2B and FTO) with type 2 diabetes (T2D) in a Uyghur population. Material/Methods A case-control study of 879 Uyghur patients with T2D and 895 non-diabetic Uyghur controls was conducted at the Hospital of Xinjiang Medical University between 2010 and 2013. Eight SNPs in CDKAL1, CDKN2A/2B and FTO were analyzed using Sequenom MassARRAY®SNP genotyping. Factors associated with T2D were assessed by logistic regression analyses. Gene-gene and gene-environment interactions were analyzed by generalized multifactor dimensionality reduction. Results Genotype distributions of rs10811661 (CDKN2A/2B), rs7195539, rs8050136, and rs9939609 (FTO) and allele frequencies of rs8050136 and rs9939609 differed significantly between diabetes and control groups (all P<0.05). While rs10811661, rs8050136, and rs9939609 were eliminated after adjusting for covariates (P>0.05), rs7195539 distribution differed significantly in co-dominant and dominant models (P<0.05). In gene-gene interaction analysis, after adjusting for covariates the two-locus rs10811661-rs7195539 interaction model had a cross-validation consistency of 10/10 and the highest balanced accuracy of 0.5483 (P=0.014). In gene-environment interaction analysis, the 3-locus interaction model TG-HDL-family history of diabetes had a cross-validation consistency of 10/10 and the highest balanced accuracy of 0.7072 (P<0.001). The 4-locus interaction model, rs7195539-TG-HDL-family history of diabetes had a cross-validation consistency of 8/10 (P<0.001). Conclusions Polymorphisms in CDKN2A/2B and FTO, but not CDKAL1, may be associated with T2D, and alleles rs8050136 and rs9939609 are likely risk alleles for T2D in this population. There were potential interactions among CDKN2A/2B (rs10811661) – FTO (rs7195539) or FTO (rs7195539)-TG-HDL-family history of diabetes in the pathogenesis of T2D in a Uyghur population. PMID
Reference genes for gene expression studies in wheat flag leaves grown under different farming conditions

PubMed Central

2011-01-01

Background Internal control genes with highly uniform expression throughout the experimental conditions are required for accurate gene expression analysis as no universal reference genes exists. In this study, the expression stability of 24 candidate genes from Triticum aestivum cv. Cubus flag leaves grown under organic and conventional farming systems was evaluated in two locations in order to select suitable genes that can be used for normalization of real-time quantitative reverse-transcription PCR (RT-qPCR) reactions. The genes were selected among the most common used reference genes as well as genes encoding proteins involved in several metabolic pathways. Findings Individual genes displayed different expression rates across all samples assayed. Applying geNorm, a set of three potential reference genes were suitable for normalization of RT-qPCR reactions in winter wheat flag leaves cv. Cubus: TaFNRII (ferredoxin-NADP(H) oxidoreductase; AJ457980.1), ACT2 (actin 2; TC234027), and rrn26 (a putative homologue to RNA 26S gene; AL827977.1). In addition of these three genes that were also top-ranked by NormFinder, two extra genes: CYP18-2 (Cyclophilin A, AY456122.1) and TaWIN1 (14-3-3 like protein, AB042193) were most consistently stably expressed. Furthermore, we showed that TaFNRII, ACT2, and CYP18-2 are suitable for gene expression normalization in other two winter wheat varieties (Tommi and Centenaire) grown under three treatments (organic, conventional and no nitrogen) and a different environment than the one tested with cv. Cubus. Conclusions This study provides a new set of reference genes which should improve the accuracy of gene expression analyses when using wheat flag leaves as those related to the improvement of nitrogen use efficiency for cereal production. PMID:21951810
Combining classifiers to predict gene function in Arabidopsis thaliana using large-scale gene expression measurements.

PubMed

Lan, Hui; Carson, Rachel; Provart, Nicholas J; Bonner, Anthony J

2007-09-21

Arabidopsis thaliana is the model species of current plant genomic research with a genome size of 125 Mb and approximately 28,000 genes. The function of half of these genes is currently unknown. The purpose of this study is to infer gene function in Arabidopsis using machine-learning algorithms applied to large-scale gene expression data sets, with the goal of identifying genes that are potentially involved in plant response to abiotic stress. Using in house and publicly available data, we assembled a large set of gene expression measurements for A. thaliana. Using those genes of known function, we first evaluated and compared the ability of basic machine-learning algorithms to predict which genes respond to stress. Predictive accuracy was measured using ROC50 and precision curves derived through cross validation. To improve accuracy, we developed a method for combining these classifiers using a weighted-voting scheme. The combined classifier was then trained on genes of known function and applied to genes of unknown function, identifying genes that potentially respond to stress. Visual evidence corroborating the predictions was obtained using electronic Northern analysis. Three of the predicted genes were chosen for biological validation. Gene knockout experiments confirmed that all three are involved in a variety of stress responses. The biological analysis of one of these genes (At1g16850) is presented here, where it is shown to be necessary for the normal response to temperature and NaCl. Supervised learning methods applied to large-scale gene expression measurements can be used to predict gene function. However, the ability of basic learning methods to predict stress response varies widely and depends heavily on how much dimensionality reduction is used. Our method of combining classifiers can improve the accuracy of such predictions - in this case, predictions of genes involved in stress response in plants - and it effectively chooses the appropriate amount
Mutagenesis of diploid mammalian genes by gene entrapment

PubMed Central

Lin, Qing; Donahue, Sarah L.; Moore-Jarrett, Tracy; Cao, Shang; Osipovich, Anna B.; Ruley, H. Earl

2006-01-01

The present study describes a genome-wide method for biallelic mutagenesis in mammalian cells. Novel poly(A) gene trap vectors, which contain features for direct cloning vector–cell fusion transcripts and for post-entrapment genome engineering, were used to generate a library of 979 mutant ES cells. The entrapment mutations generally disrupted gene expression and were readily transmitted through the germline, establishing the library as a resource for constructing mutant mice. Cells homozygous for most entrapment loci could be isolated by selecting for enhanced expression of an inserted neomycin-resistance gene that resulted from losses of heterozygosity (LOH). The frequencies of LOH measured at 37 sites in the genome ranged from 1.3 × 10−5 to 1.2 × 10−4 per cell and increased with increasing distance from the centromere, implicating mitotic recombination in the process. The ease and efficiency of obtaining homozygous mutations will (i) facilitate genetic studies of gene function in cultured cells, (ii) permit genome-wide studies of recombination events that result in LOH and mediate a type of chromosomal instability important in carcinogenesis, and (iii) provide new strategies for phenotype-driven mutagenesis screens in mammalian cells. PMID:17062627
No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

PubMed

Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

2017-11-15

A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
A survey of disease connections for CD4+ T cell master genes and their directly linked genes.

PubMed

Li, Wentian; Espinal-Enríquez, Jesús; Simpfendorfer, Kim R; Hernández-Lemus, Enrique

2015-12-01

Genome-wide association studies and other genetic analyses have identified a large number of genes and variants implicating a variety of disease etiological mechanisms. It is imperative for the study of human diseases to put these genetic findings into a coherent functional context. Here we use system biology tools to examine disease connections of five master genes for CD4+ T cell subtypes (TBX21, GATA3, RORC, BCL6, and FOXP3). We compiled a list of genes functionally interacting (protein-protein interaction, or by acting in the same pathway) with the master genes, then we surveyed the disease connections, either by experimental evidence or by genetic association. Embryonic lethal genes (also known as essential genes) are over-represented in master genes and their interacting genes (55% versus 40% in other genes). Transcription factors are significantly enriched among genes interacting with the master genes (63% versus 10% in other genes). Predicted haploinsufficiency is a feature of most these genes. Disease-connected genes are enriched in this list of genes: 42% of these genes have a disease connection according to Online Mendelian Inheritance in Man (OMIM) (versus 23% in other genes), and 74% are associated with some diseases or phenotype in a Genome Wide Association Study (GWAS) (versus 43% in other genes). Seemingly, not all of the diseases connected to genes surveyed were immune related, which may indicate pleiotropic functions of the master regulator genes and associated genes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Clock gene modulates roles of OXTR and AVPR1b genes in prosociality.

PubMed

Ci, Haipeng; Wu, Nan; Su, Yanjie

2014-01-01

The arginine vasopressin receptor (AVPR) and oxytocin receptor (OXTR) genes have been demonstrated to contribute to prosocial behavior. Recent research has focused on the manner by which these simple receptor genes influence prosociality, particularly with regard to the AVP system, which is modulated by the clock gene. The clock gene is responsible for regulating the human biological clock, affecting sleep, emotion and behavior. The current study examined in detail whether the influences of the OXTR and AVPR1b genes on prosociality are dependent on the clock gene. This study assessed interactions between the clock gene (rs1801260, rs6832769) and the OXTR (rs1042778, rs237887) and AVPR1b (rs28373064) genes in association with individual differences in prosociality in healthy male Chinese subjects (n = 436). The Prosocial Tendencies Measure (PTM-R) was used to assess prosociality. Participants carrying both the GG/GA variant of AVPR1b rs28373064 and the AA variant of clock rs6832769 showed the highest scores on the Emotional PTM. Carriers of both the T allele of OXTR rs1042778 and the C allele of clock rs1801260 showed the lowest total PTM scores compared with the other groups. The observed interaction effects provide converging evidence that the clock gene and OXT/AVP systems are intertwined and contribute to human prosociality.
Reference genes for normalization of gene expression studies in human osteoarthritic articular cartilage.

PubMed

Pombo-Suarez, Manuel; Calaza, Manuel; Gomez-Reino, Juan J; Gonzalez, Antonio

2008-01-29

Assessment of gene expression is an important component of osteoarthritis (OA) research, greatly improved by the development of quantitative real-time PCR (qPCR). This technique requires normalization for precise results, yet no suitable reference genes have been identified in human articular cartilage. We have examined ten well-known reference genes to determine the most adequate for this application. Analyses of expression stability in cartilage from 10 patients with hip OA, 8 patients with knee OA and 10 controls without OA were done with classical statistical tests and the software programs geNorm and NormFinder. Results from the three methods of analysis were broadly concordant. Some of the commonly used reference genes, GAPDH, ACTB and 18S RNA, performed poorly in our analysis. In contrast, the rarely used TBP, RPL13A and B2M genes were the best. It was necessary to use together several of these three genes to obtain the best results. The specific combination depended, to some extent, on the type of samples being compared. Our results provide a satisfactory set of previously unused reference genes for qPCR in hip and knee OA This confirms the need to evaluate the suitability of reference genes in every tissue and experimental situation before starting the quantitative assessment of gene expression by qPCR.
Defined single-gene and multi-gene deletion mutant collections in Salmonella enterica sv Typhimurium.

PubMed

Porwollik, Steffen; Santiviago, Carlos A; Cheng, Pui; Long, Fred; Desai, Prerak; Fredlund, Jennifer; Srikumar, Shabarinath; Silva, Cecilia A; Chu, Weiping; Chen, Xin; Canals, Rocío; Reynolds, M Megan; Bogomolnaya, Lydia; Shields, Christine; Cui, Ping; Guo, Jinbai; Zheng, Yi; Endicott-Yazdani, Tiana; Yang, Hee-Jeong; Maple, Aimee; Ragoza, Yury; Blondel, Carlos J; Valenzuela, Camila; Andrews-Polymenis, Helene; McClelland, Michael

2014-01-01

We constructed two collections of targeted single gene deletion (SGD) mutants and two collections of targeted multi-gene deletion (MGD) mutants in Salmonella enterica sv Typhimurium 14028s. The SGD mutant collections contain (1), 3517 mutants in which a single gene is replaced by a cassette containing a kanamycin resistance (KanR) gene oriented in the sense direction (SGD-K), and (2), 3376 mutants with a chloramphenicol resistance gene (CamR) oriented in the antisense direction (SGD-C). A combined total of 3773 individual genes were deleted across these SGD collections. The MGD collections contain mutants bearing deletions of contiguous regions of three or more genes and include (3), 198 mutants spanning 2543 genes replaced by a KanR cassette (MGD-K), and (4), 251 mutants spanning 2799 genes replaced by a CamR cassette (MGD-C). Overall, 3476 genes were deleted in at least one MGD collection. The collections with different antibiotic markers permit construction of all viable combinations of mutants in the same background. Together, the libraries allow hierarchical screening of MGDs for different phenotypic followed by screening of SGDs within the target MGD regions. The mutants of these collections are stored at BEI Resources (www.beiresources.org) and publicly available.

A new gene in A. rubens: A sea star Ig kappa gene.

PubMed

Vincent, Nadine; Osteras, Magne; Otten, Patricia; Leclerc, Michel

2014-12-01

The sea star Asterias rubens reacts specifically to the antigen:HRP (horse-radish peroxydase) and produces an antibody anti-HRP. We previously identified a candidate Ig kappa gene corresponding to this manuscript. We show now the gene referred to as: "sea star Ig kappa gene in its specificity".
Regulation of gene expression in plasmid ColE1: delayed expression of the kil gene.

PubMed Central

Zhang, S P; Yan, L F; Zubay, G

1988-01-01

cea, imm, and kil are a cluster of three functionally related genes of the plasmid ColE1. The cea and kil genes are in the same inducible operon, with transcription being initiated from a promoter adjacent to the cea gene. The imm gene is located between the cea and kil genes, but it is transcribed in the opposite direction. Complementary interaction between the imm mRNA and the anti-imm sequences in the middle of the cea-kil transcript causes a pronounced delay in expression of the kil gene when the cea-kil operon is induced. A segment in the overlapping region between the cea and imm genes causes delayed expression of the kil gene in the absence of imm gene transcription. This delay effect increases the yields of colicin synthesized in induced cells. Images PMID:3142845
Functional Genomic Analysis of Cotton Genes with Agrobacterium-Mediated Virus-Induced Gene Silencing

PubMed Central

Gao, Xiquan; Shan, Libo

2015-01-01

Cotton (Gossypium spp.) is one of the most agronomically important crops worldwide for its unique textile fiber production and serving as food and feed stock. Molecular breeding and genetic engineering of useful genes into cotton have emerged as advanced approaches to improve cotton yield, fiber quality, and resistance to various stresses. However, the understanding of gene functions and regulations in cotton is largely hindered by the limited molecular and biochemical tools. Here, we describe the method of an Agrobacterium infiltration-based virus-induced gene silencing (VIGS) assay to transiently silence endogenous genes in cotton at 2-week-old seedling stage. The genes of interest could be readily silenced with a consistently high efficiency. To monitor gene silencing efficiency, we have cloned cotton GrCla1 from G. raimondii, a homolog gene of Arabidopsis Cloroplastos alterados 1 (AtCla1) involved in chloroplast development, and inserted into a tobacco rattle virus (TRV) binary vector pYL156. Silencing of GrCla1 results in albino phenotype on the newly emerging leaves, serving as a visual marker for silencing efficiency. To further explore the possibility of using VIGS assay to reveal the essential genes mediating disease resistance to Verticillium dahliae, a fungal pathogen causing severe Verticillium wilt in cotton, we developed a seedling infection assay to inoculate cotton seedlings when the genes of interest are silenced by VIGS. The method we describe here could be further explored for functional genomic analysis of cotton genes involved in development and various biotic and abiotic stresses. PMID:23386302
Functional genomic analysis of cotton genes with agrobacterium-mediated virus-induced gene silencing.

PubMed

Gao, Xiquan; Shan, Libo

2013-01-01

Cotton (Gossypium spp.) is one of the most agronomically important crops worldwide for its unique textile fiber production and serving as food and feed stock. Molecular breeding and genetic engineering of useful genes into cotton have emerged as advanced approaches to improve cotton yield, fiber quality, and resistance to various stresses. However, the understanding of gene functions and regulations in cotton is largely hindered by the limited molecular and biochemical tools. Here, we describe the method of an Agrobacterium infiltration-based virus-induced gene silencing (VIGS) assay to transiently silence endogenous genes in cotton at 2-week-old seedling stage. The genes of interest could be readily silenced with a consistently high efficiency. To monitor gene silencing efficiency, we have cloned cotton GrCla1 from G. raimondii, a homolog gene of Arabidopsis Cloroplastos alterados 1 (AtCla1) involved in chloroplast development, and inserted into a tobacco rattle virus (TRV) binary vector pYL156. Silencing of GrCla1 results in albino phenotype on the newly emerging leaves, serving as a visual marker for silencing efficiency. To further explore the possibility of using VIGS assay to reveal the essential genes mediating disease resistance to Verticillium dahliae, a fungal pathogen causing severe Verticillium wilt in cotton, we developed a seedling infection assay to inoculate cotton seedlings when the genes of interest are silenced by VIGS. The method we describe here could be further explored for functional genomic analysis of cotton genes involved in development and various biotic and abiotic stresses.
Deregulated HOX genes in ameloblastomas are located in physical contiguity to keratin genes.

PubMed

Schiavo, Giulia; D'Antò, Vincenzo; Cantile, Monica; Procino, Alfredo; Di Giovanni, Stefano; Valletta, Rossella; Terracciano, Luigi; Baumhoer, Daniel; Jundt, Gernot; Cillo, Clemente

2011-11-01

The expression of the HOX gene network in mid-stage human tooth development mostly concerns the epithelial tooth germ compartment and involves the C and D HOX loci. To further dissect the HOX gene implication with tooth epithelium differentiation we compared the expression of the whole HOX network in human ameloblastomas, as paradigm of epithelial odontogenic tumors, with tooth germs. We identified two ameloblastoma molecular types with respectively low and high number of active HOX C genes. The highly expressing HOX C gene ameloblastomas were characterized by a strong keratinized phenotype. Locus C HOX genes are located on chromosome 12q13-15 in physical contiguity with one of the two keratin gene clusters included in the human genome. The most posterior HOX C gene, HOX C13, is capable to interact with hair keratin genes located on the other keratin gene cluster in physical contiguity with the HOX B locus on chromosome 17q21-22. Inside the HOX C locus, a 2.2 kb ncRNA (HOTAIR) able to repress transcription, in cis, along the entire HOX C locus and, in trans, at the posterior region of the HOX D locus has recently been identified. Interestingly both loci are deregulated in ameloblastomas. Our finding support an important role of the HOX network in characterizing the epithelial tooth compartment. Furthermore, the physical contiguity between locus C HOX and keratin genes in normal tooth epithelium and their deregulation in the neoplastic counterparts suggest they may act on the same mechanism potentially involved with epithelial tumorigenesis. Copyright © 2011 Wiley Periodicals, Inc.
Gene-based rare allele analysis identified a risk gene of Alzheimer's disease.

PubMed

Kim, Jong Hun; Song, Pamela; Lim, Hyunsun; Lee, Jae-Hyung; Lee, Jun Hong; Park, Sun Ah

2014-01-01

Alzheimer's disease (AD) has a strong propensity to run in families. However, the known risk genes excluding APOE are not clinically useful. In various complex diseases, gene studies have targeted rare alleles for unsolved heritability. Our study aims to elucidate previously unknown risk genes for AD by targeting rare alleles. We used data from five publicly available genetic studies from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the database of Genotypes and Phenotypes (dbGaP). A total of 4,171 cases and 9,358 controls were included. The genotype information of rare alleles was imputed using 1,000 genomes. We performed gene-based analysis of rare alleles (minor allele frequency≤3%). The genome-wide significance level was defined as meta P<1.8×10(-6) (0.05/number of genes in human genome = 0.05/28,517). ZNF628, which is located at chromosome 19q13.42, showed a genome-wide significant association with AD. The association of ZNF628 with AD was not dependent on APOE ε4. APOE and TREM2 were also significantly associated with AD, although not at genome-wide significance levels. Other genes identified by targeting common alleles could not be replicated in our gene-based rare allele analysis. We identified that rare variants in ZNF628 are associated with AD. The protein encoded by ZNF628 is known as a transcription factor. Furthermore, the associations of APOE and TREM2 with AD were highly significant, even in gene-based rare allele analysis, which implies that further deep sequencing of these genes is required in AD heritability studies.
Identification of crucial genes related to postmenopausal osteoporosis using gene expression profiling.

PubMed

Ma, Min; Chen, Xiaofei; Lu, Liangyu; Yuan, Feng; Zeng, Wen; Luo, Shulin; Yin, Feng; Cai, Junfeng

2016-12-01

Postmenopausal osteoporosis is a common bone disease and characterized by low bone mineral density. This study aimed to reveal key genes associated with postmenopausal osteoporosis (PMO), and provide a theoretical basis for subsequent experiments. The dataset GSE7429 was obtained from Gene Expression Omnibus. A total of 20 B cell samples (ten ones, respectively from postmenopausal women with low or high bone mineral density (BMD) were included in this dataset. Following screening of differentially expressed genes (DEGs), coexpression analysis of all genes was performed, and key genes in the coexpression network were screened using the random walk algorithm. Afterwards, functional and pathway analyses were conducted. Additionally, protein-protein interactions (PPIs) between DEGs and key genes were analyzed. A set of 308 DEGs (170 up-regulated ones and 138 down-regulated ones) between low BMD and high BMD samples were identified, and 101 key genes in the coexpression network were screened out. In the coexpression network, some genes had a higher score and degree, such as CSTA. The key genes in the coexpression network were mainly enriched in GO terms of the defense response (e.g., SERPINA1 and CST3), immune response (e.g., IL32 and CLEC7A); while, the DEGs were mainly enriched in structural constituent of cytoskeleton (e.g., CYLC2 and TUBA1B) and membrane-enclosed lumen (e.g., CCNE1 and INTS5). In the PPI network, CCNE1 interacted with REL; and TUBA1B interacted with ESR1. A series of interactions, such as CSTA/TYROBP, CCNE1/REL and TUBA1B/ESR1 might play pivotal roles in the occurrence and development of PMO.
Why Is the Correlation between Gene Importance and Gene Evolutionary Rate So Weak?

PubMed Central

Wang, Zhi; Zhang, Jianzhi

2009-01-01

One of the few commonly believed principles of molecular evolution is that functionally more important genes (or DNA sequences) evolve more slowly than less important ones. This principle is widely used by molecular biologists in daily practice. However, recent genomic analysis of a diverse array of organisms found only weak, negative correlations between the evolutionary rate of a gene and its functional importance, typically measured under a single benign lab condition. A frequently suggested cause of the above finding is that gene importance determined in the lab differs from that in an organism's natural environment. Here, we test this hypothesis in yeast using gene importance values experimentally determined in 418 lab conditions or computationally predicted for 10,000 nutritional conditions. In no single condition or combination of conditions did we find a much stronger negative correlation, which is explainable by our subsequent finding that always-essential (enzyme) genes do not evolve significantly more slowly than sometimes-essential or always-nonessential ones. Furthermore, we verified that functional density, approximated by the fraction of amino acid sites within protein domains, is uncorrelated with gene importance. Thus, neither the lab-nature mismatch nor a potentially biased among-gene distribution of functional density explains the observed weakness of the correlation between gene importance and evolutionary rate. We conclude that the weakness is factual, rather than artifactual. In addition to being weakened by population genetic reasons, the correlation is likely to have been further weakened by the presence of multiple nontrivial rate determinants that are independent from gene importance. These findings notwithstanding, we show that the principle of slower evolution of more important genes does have some predictive power when genes with vastly different evolutionary rates are compared, explaining why the principle can be practically useful
Why is the correlation between gene importance and gene evolutionary rate so weak?

PubMed

Wang, Zhi; Zhang, Jianzhi

2009-01-01

One of the few commonly believed principles of molecular evolution is that functionally more important genes (or DNA sequences) evolve more slowly than less important ones. This principle is widely used by molecular biologists in daily practice. However, recent genomic analysis of a diverse array of organisms found only weak, negative correlations between the evolutionary rate of a gene and its functional importance, typically measured under a single benign lab condition. A frequently suggested cause of the above finding is that gene importance determined in the lab differs from that in an organism's natural environment. Here, we test this hypothesis in yeast using gene importance values experimentally determined in 418 lab conditions or computationally predicted for 10,000 nutritional conditions. In no single condition or combination of conditions did we find a much stronger negative correlation, which is explainable by our subsequent finding that always-essential (enzyme) genes do not evolve significantly more slowly than sometimes-essential or always-nonessential ones. Furthermore, we verified that functional density, approximated by the fraction of amino acid sites within protein domains, is uncorrelated with gene importance. Thus, neither the lab-nature mismatch nor a potentially biased among-gene distribution of functional density explains the observed weakness of the correlation between gene importance and evolutionary rate. We conclude that the weakness is factual, rather than artifactual. In addition to being weakened by population genetic reasons, the correlation is likely to have been further weakened by the presence of multiple nontrivial rate determinants that are independent from gene importance. These findings notwithstanding, we show that the principle of slower evolution of more important genes does have some predictive power when genes with vastly different evolutionary rates are compared, explaining why the principle can be practically useful
Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

PubMed Central

Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

2013-01-01

The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer.

PubMed

Yang, Mary Qu; Li, Dan; Yang, William; Zhang, Yifan; Liu, Jun; Tong, Weida

2017-01-01

Clear cell renal cell carcinoma (ccRCC) is the most common and most aggressive form of renal cell cancer (RCC). The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1 , as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways.
The Caenorhabditis chemoreceptor gene families.

PubMed

Thomas, James H; Robertson, Hugh M

2008-10-06

Chemoreceptor proteins mediate the first step in the transduction of environmental chemical stimuli, defining the breadth of detection and conferring stimulus specificity. Animal genomes contain families of genes encoding chemoreceptors that mediate taste, olfaction, and pheromone responses. The size and diversity of these families reflect the biology of chemoperception in specific species. Based on manual curation and sequence comparisons among putative G-protein-coupled chemoreceptor genes in the nematode Caenorhabditis elegans, we identified approximately 1300 genes and 400 pseudogenes in the 19 largest gene families, most of which fall into larger superfamilies. In the related species C. briggsae and C. remanei, we identified most or all genes in each of the 19 families. For most families, C. elegans has the largest number of genes and C. briggsae the smallest number, suggesting changes in the importance of chemoperception among the species. Protein trees reveal family-specific and species-specific patterns of gene duplication and gene loss. The frequency of strict orthologs varies among the families, from just over 50% in two families to less than 5% in three families. Several families include large species-specific expansions, mostly in C. elegans and C. remanei. Chemoreceptor gene families in Caenorhabditis species are large and evolutionarily dynamic as a result of gene duplication and gene loss. These dynamics shape the chemoreceptor gene complements in Caenorhabditis species and define the receptor space available for chemosensory responses. To explain these patterns, we propose the gray pawn hypothesis: individual genes are of little significance, but the aggregate of a large number of diverse genes is required to cover a large phenotype space.
The Caenorhabditis chemoreceptor gene families

PubMed Central

Thomas, James H; Robertson, Hugh M

2008-01-01

Background Chemoreceptor proteins mediate the first step in the transduction of environmental chemical stimuli, defining the breadth of detection and conferring stimulus specificity. Animal genomes contain families of genes encoding chemoreceptors that mediate taste, olfaction, and pheromone responses. The size and diversity of these families reflect the biology of chemoperception in specific species. Results Based on manual curation and sequence comparisons among putative G-protein-coupled chemoreceptor genes in the nematode Caenorhabditis elegans, we identified approximately 1300 genes and 400 pseudogenes in the 19 largest gene families, most of which fall into larger superfamilies. In the related species C. briggsae and C. remanei, we identified most or all genes in each of the 19 families. For most families, C. elegans has the largest number of genes and C. briggsae the smallest number, suggesting changes in the importance of chemoperception among the species. Protein trees reveal family-specific and species-specific patterns of gene duplication and gene loss. The frequency of strict orthologs varies among the families, from just over 50% in two families to less than 5% in three families. Several families include large species-specific expansions, mostly in C. elegans and C. remanei. Conclusion Chemoreceptor gene families in Caenorhabditis species are large and evolutionarily dynamic as a result of gene duplication and gene loss. These dynamics shape the chemoreceptor gene complements in Caenorhabditis species and define the receptor space available for chemosensory responses. To explain these patterns, we propose the gray pawn hypothesis: individual genes are of little significance, but the aggregate of a large number of diverse genes is required to cover a large phenotype space. PMID:18837995
Gene therapy for haemophilia.

PubMed

Sharma, Akshay; Easow Mathew, Manu; Sriganesh, Vasumathi; Neely, Jessica A; Kalipatnapu, Sasank

2014-11-14

Haemophilia is a genetic disorder which is characterized by spontaneous or provoked, often uncontrolled, bleeding into joints, muscles and other soft tissues. Current methods of treatment are expensive, challenging and involve regular administration of clotting factors. Gene therapy has recently been prompted as a curative treatment modality. To evaluate the safety and efficacy of gene therapy for treating people with haemophilia A or B. We searched the Cochrane Cystic Fibrosis & Genetic Disorders Group's Coagulopathies Trials Register, compiled from electronic database searches and handsearching of journals and conference abstract books. We also searched the reference lists of relevant articles and reviews.Date of last search: 06 November 2014. Eligible trials included randomised or quasi-randomised clinical trials, including controlled clinical trials comparing gene therapy (with or without standard treatment) with standard treatment (factor replacement) or other 'curative' treatment such as stem cell transplantation individuals with haemophilia A or B of all ages who do not have inhibitors to factor VIII or IX. No trials of gene therapy for haemophilia were found. No trials of gene therapy for haemophilia were identified. No randomised or quasi-randomised clinical trials of gene therapy for haemophilia were identified. Thus, we are unable to determine the effects of gene therapy for haemophilia. Gene therapy for haemophilia is still in its nascent stages and there is a need for well-designed clinical trials to assess the long-term feasibility, success and risks of gene therapy for people with haemophilia.
Gene and domain duplication in the chordate Otx gene family: insights from amphioxus Otx.

PubMed

Williams, N A; Holland, P W

1998-05-01

We report the genomic organization and deduced protein sequence of a cephalochordate member of the Otx homeobox gene family (AmphiOtx) and show its probable single-copy state in the genome. We also present molecular phylogenetic analysis indicating that there was single ancestral Otx gene in the first chordates which was duplicated in the vertebrate lineage after it had split from the lineage leading to the cephalochordates. Duplication of a C-terminal protein domain has occurred specifically in the vertebrate lineage, strengthening the case for a single Otx gene in an ancestral chordate whose gene structure has been retained in an extant cephalochordate. Comparative analysis of protein sequences and published gene expression patterns suggest that the ancestral chordate Otx gene had roles in patterning the anterior mesendoderm and central nervous system. These roles were elaborated following Otx gene duplication in vertebrates, accompanied by regulatory and structural divergence, particularly of Otx1 descendant genes.
On meme--gene coevolution.

PubMed

Bull, L; Holland, O; Blackmore, S

2000-01-01

In this article we examine the effects of the emergence of a new replicator, memes, on the evolution of a pre-existing replicator, genes. Using a version of the NKCS model we examine the effects of increasing the rate of meme evolution in relation to the rate of gene evolution, for various degrees of interdependence between the two replicators. That is, the effects of memes' (suggested) more rapid rate of evolution in comparison to that of genes is investigated using a tunable model of coevolution. It is found that, for almost any degree of interdependence between the two replicators, as the rate of meme evolution increases, a phase transition-like dynamic occurs under which memes have a significantly detrimental effect on the evolution of genes, quickly resulting in the cessation of effective gene evolution. Conversely, the memes experience a sharp increase in benefit from increasing their rate of evolution. We then examine the effects of enabling genes to reduce the percentage of gene-detrimental evolutionary steps taken by memes. Here a critical region emerges as the comparative rate of meme evolution increases, such that if genes cannot effectively select memes a high percentage of the time, they suffer from meme evolution as if they had almost no selective capability.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Wagner, Drew T.; Zeng, Jia; Bailey, Constance B.

In an effort to uncover the structural motifs and biosynthetic logic of the relatively uncharacterized trans-acyltransferase polyketide synthases, we have begun the dissection of the enigmatic dehydrating bimodules common in these enzymatic assembly lines. We report the 1.98 Å resolution structure of a ketoreductase (KR) from the first half of a type A dehydrating bimodule and the 2.22 Å resolution structure of a dehydratase (DH) from the second half of a type B dehydrating bimodule. The KR, from the third module of the bacillaene synthase, and the DH, from the tenth module of the difficidin synthase, possess features not observedmore » in structurally characterized homologs. The DH architecture provides clues for how it catalyzes a unique double dehydration. Correlations between the chemistries proposed for dehydrating bimodules and bioinformatic analysis indicate that type A dehydrating bimodules generally produce an α/β-cis alkene moiety, while type B dehydrating bimodules generally produce an α/β-trans, γ/δ-cis diene moiety.« less
Evaluation of endogenous control gene(s) for gene expression studies in human blood exposed to 60Co γ-rays ex vivo.

PubMed

Vaiphei, S Thangminlal; Keppen, Joshua; Nongrum, Saibadaiahun; Chaubey, R C; Kma, L; Sharan, R N

2015-01-01

In gene expression studies, it is critical to normalize data using a stably expressed endogenous control gene in order to obtain accurate and reliable results. However, we currently do not have a universally applied endogenous control gene for normalization of data for gene expression studies, particularly those involving (60)Co γ-ray-exposed human blood samples. In this study, a comparative assessment of the gene expression of six widely used housekeeping endogenous control genes, namely 18S, ACTB, B2M, GAPDH, MT-ATP6 and CDKN1A, was undertaken for a range of (60)Co γ-ray doses (0.5, 1.0, 2.0 and 4.0 Gy) at 8.4 Gy min(-1) at 0 and 24 h post-irradiation time intervals. Using the NormFinder algorithm, real-time PCR data obtained from six individuals (three males and three females) were analyzed with respect to the threshold cycle (Ct) value and abundance, ΔCt pair-wise comparison, intra- and inter-group variability assessments, etc. GAPDH, either alone or in combination with 18S, was found to be the most suitable endogenous control gene and should be used in gene expression studies, especially those involving qPCR of γ-ray-exposed human blood samples. © The Author 2014. Published by Oxford University Press on behalf of The Japan Radiation Research Society and Japanese Society for Radiation Oncology.
G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

USDA-ARS?s Scientific Manuscript database

In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...
Tumor suppressor genes are larger than apoptosis-effector genes and have more regions of active chromatin: Connection to a stochastic paradigm for sequential gene expression programs.

PubMed

Garcia, Marlene; Mauro, James A; Ramsamooj, Michael; Blanck, George

2015-08-03

Apoptosis- and proliferation-effector genes are substantially regulated by the same transactivators, with E2F-1 and Oct-1 being notable examples. The larger proliferation-effector genes have more binding sites for the transactivators that regulate both sets of genes, and proliferation-effector genes have more regions of active chromatin, i.e, DNase I hypersensitive and histone 3, lysine-4 trimethylation sites. Thus, the size differences between the 2 classes of genes suggest a transcriptional regulation paradigm whereby the accumulation of transcription factors that regulate both sets of genes, merely as an aspect of stochastic behavior, accumulate first on the larger proliferation-effector gene "traps," and then accumulate on the apoptosis effector genes, thereby effecting sequential activation of the 2 different gene sets. As IRF-1 and p53 levels increase, tumor suppressor proteins are first activated, followed by the activation of apoptosis-effector genes, for example during S-phase pausing for DNA repair. Tumor suppressor genes are larger than apoptosis-effector genes and have more IRF-1 and p53 binding sites, thereby likewise suggesting a paradigm for transcription sequencing based on stochastic interactions of transcription factors with different gene classes. In this report, using the ENCODE database, we determined that tumor suppressor genes have a greater number of open chromatin regions and histone 3 lysine-4 trimethylation sites, consistent with the idea that a larger gene size can facilitate earlier transcriptional activation via the inclusion of more transactivator binding sites.

Applying horizontal gene transfer phenomena to enhance non-viral gene therapy

PubMed Central

Elmer, Jacob J.; Christensen, Matthew D.; Rege, Kaushal

2014-01-01

Horizontal gene transfer (HGT) is widespread amongst prokaryotes, but eukaryotes tend to be far less promiscuous with their genetic information. However, several examples of HGT from pathogens into eukaryotic cells have been discovered and mimicked to improve non-viral gene delivery techniques. For example, several viral proteins and DNA sequences have been used to significantly increase cytoplasmic and nuclear gene delivery. Plant genetic engineering is routinely performed with the pathogenic bacterium Agrobacterium tumefaciens and similar pathogens (e.g. Bartonella henselae) may also be able to transform human cells. Intracellular parasites like Trypanosoma cruzi may also provide new insights into overcoming cellular barriers to gene delivery. Finally, intercellular nucleic acid transfer between host cells will also be briefly discussed. This article will review the unique characteristics of several different viruses and microbes and discuss how their traits have been successfully applied to improve non-viral gene delivery techniques. Consequently, pathogenic traits that originally caused diseases may eventually be used to treat many genetic diseases. PMID:23994344
Deletion and Gene Expression Analyses Define the Paxilline Biosynthetic Gene Cluster in Penicillium paxilli

PubMed Central

Scott, Barry; Young, Carolyn A.; Saikia, Sanjay; McMillan, Lisa K.; Monahan, Brendon J.; Koulman, Albert; Astin, Jonathan; Eaton, Carla J.; Bryant, Andrea; Wrenn, Ruth E.; Finch, Sarah C.; Tapper, Brian A.; Parker, Emily J.; Jameson, Geoffrey B.

2013-01-01

The indole-diterpene paxilline is an abundant secondary metabolite synthesized by Penicillium paxilli. In total, 21 genes have been identified at the PAX locus of which six have been previously confirmed to have a functional role in paxilline biosynthesis. A combination of bioinformatics, gene expression and targeted gene replacement analyses were used to define the boundaries of the PAX gene cluster. Targeted gene replacement identified seven genes, paxG, paxA, paxM, paxB, paxC, paxP and paxQ that were all required for paxilline production, with one additional gene, paxD, required for regular prenylation of the indole ring post paxilline synthesis. The two putative transcription factors, PP104 and PP105, were not co-regulated with the pax genes and based on targeted gene replacement, including the double knockout, did not have a role in paxilline production. The relationship of indole dimethylallyl transferases involved in prenylation of indole-diterpenes such as paxilline or lolitrem B, can be found as two disparate clades, not supported by prenylation type (e.g., regular or reverse). This paper provides insight into the P. paxilli indole-diterpene locus and reviews the recent advances identified in paxilline biosynthesis. PMID:23949005
Efficient strategy for detecting gene × gene joint action and its application in schizophrenia.

PubMed

Won, Sungho; Kwon, Min-Seok; Mattheisen, Manuel; Park, Suyeon; Park, Changsoon; Kihara, Daisuke; Cichon, Sven; Ophoff, Roel; Nöthen, Markus M; Rietschel, Marcella; Baur, Max; Uitterlinden, Andre G; Hofmann, A; Lange, Christoph

2014-01-01

We propose a new approach to detect gene × gene joint action in genome-wide association studies (GWASs) for case-control designs. This approach offers an exhaustive search for all two-way joint action (including, as a special case, single gene action) that is computationally feasible at the genome-wide level and has reasonable statistical power under most genetic models. We found that the presence of any gene × gene joint action may imply differences in three types of genetic components: the minor allele frequencies and the amounts of Hardy-Weinberg disequilibrium may differ between cases and controls, and between the two genetic loci the degree of linkage disequilibrium may differ between cases and controls. Using Fisher's method, it is possible to combine the different sources of genetic information in an overall test for detecting gene × gene joint action. The proposed statistical analysis is efficient and its simplicity makes it applicable to GWASs. In the current study, we applied the proposed approach to a GWAS on schizophrenia and found several potential gene × gene interactions. Our application illustrates the practical advantage of the proposed method. © 2013 WILEY PERIODICALS, INC.
Identification of reference genes in human myelomonocytic cells for gene expression studies in altered gravity.

PubMed

Thiel, Cora S; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Unverdorben, Felix; Buttron, Isabell; Lauber, Beatrice; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E; Ullrich, Oliver

2015-01-01

Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes ("housekeeping genes") are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity.
The legumin gene family: structure of a B type gene of Vicia faba and a possible legumin gene specific regulatory element.

PubMed Central

Bäumlein, H; Wobus, U; Pustell, J; Kafatos, F C

1986-01-01

The field bean, Vicia faba L. var. minor, possesses two sub-families of 11 S legumin genes named A and B. We isolated from a genomic library a B-type gene (LeB4) and determined its primary DNA sequence. Gene LeB4 codes for a 484 amino acid residue prepropolypeptide, encompassing a signal peptide of 22 amino acid residues, an acidic, very hydrophilic alpha-chain of 281 residues and a basic, somewhat hydrophobic beta-chain of 181 residues. The latter two coding regions are immediately contiguous, but each is interrupted by a short intron. Type A legumin genes from soybean and pea are known to have introns in the same two positions, in addition to an extra intron (within the alpha-coding sequence). Sequence comparisons of legumin genes from these three plants revealed a highly conserved sequence element of at least 28 bp, centered at approximately 100 bp upstream of each cap site. The element is absent from the equivalent position of all non-legumin and other plant and fungal genes examined. We tentatively name this element "legumin box" and suggest that it may have a function in the regulation of legumin gene expression. PMID:3960730
RapGene: a fast and accurate strategy for synthetic gene assembly in Escherichia coli

PubMed Central

Zampini, Massimiliano; Stevens, Pauline Rees; Pachebat, Justin A.; Kingston-Smith, Alison; Mur, Luis A. J.; Hayes, Finbarr

2015-01-01

The ability to assemble DNA sequences de novo through efficient and powerful DNA fabrication methods is one of the foundational technologies of synthetic biology. Gene synthesis, in particular, has been considered the main driver for the emergence of this new scientific discipline. Here we describe RapGene, a rapid gene assembly technique which was successfully tested for the synthesis and cloning of both prokaryotic and eukaryotic genes through a ligation independent approach. The method developed in this study is a complete bacterial gene synthesis platform for the quick, accurate and cost effective fabrication and cloning of gene-length sequences that employ the widely used host Escherichia coli. PMID:26062748
Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies

PubMed Central

Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D.

2016-01-01

Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella. We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes—and that the butterfly proboscis is involved in digestive enzyme production. PMID:27553646
From Genes to Networks: Characterizing Gene-Regulatory Interactions in Plants.

PubMed

Kaufmann, Kerstin; Chen, Dijun

2017-01-01

Plants, like other eukaryotes, have evolved complex mechanisms to coordinate gene expression during development, environmental response, and cellular homeostasis. Transcription factors (TFs), accompanied by basic cofactors and posttranscriptional regulators, are key players in gene-regulatory networks (GRNs). The coordinated control of gene activity is achieved by the interplay of these factors and by physical interactions between TFs and DNA. Here, we will briefly outline recent technological progress made to elucidate GRNs in plants. We will focus on techniques that allow us to characterize physical interactions in GRNs in plants and to analyze their regulatory consequences. Targeted manipulation allows us to test the relevance of specific gene-regulatory interactions. The combination of genome-wide experimental approaches with mathematical modeling allows us to get deeper insights into key-regulatory interactions and combinatorial control of important processes in plants.
Trichoderma genes

DOEpatents

Foreman, Pamela [Los Altos, CA; Goedegebuur, Frits [Vlaardingen, NL; Van Solingen, Pieter [Naaldwijk, NL; Ward, Michael [San Francisco, CA

2012-06-19

Described herein are novel gene sequences isolated from Trichoderma reesei. Two genes encoding proteins comprising a cellulose binding domain, one encoding an arabionfuranosidase and one encoding an acetylxylanesterase are described. The sequences, CIP1 and CIP2, contain a cellulose binding domain. These proteins are especially useful in the textile and detergent industry and in pulp and paper industry.
Methylation of microRNA genes regulates gene expression in bisexual flower development in andromonoecious poplar

PubMed Central

Song, Yuepeng; Tian, Min; Ci, Dong; Zhang, Deqiang

2015-01-01

Previous studies showed sex-specific DNA methylation and expression of candidate genes in bisexual flowers of andromonoecious poplar, but the regulatory relationship between methylation and microRNAs (miRNAs) remains unclear. To investigate whether the methylation of miRNA genes regulates gene expression in bisexual flower development, the methylome, microRNA, and transcriptome were examined in female and male flowers of andromonoecious poplar. 27 636 methylated coding genes and 113 methylated miRNA genes were identified. In the coding genes, 64.5% of the methylated reads mapped to the gene body region; by contrast, 60.7% of methylated reads in miRNA genes mainly mapped in the 5′ and 3′ flanking regions. CHH methylation showed the highest methylation levels and CHG showed the lowest methylation levels. Correlation analysis showed a significant, negative, strand-specific correlation of methylation and miRNA gene expression (r=0.79, P <0.05). The methylated miRNA genes included eight long miRNAs (lmiRNAs) of 24 nucleotides and 11 miRNAs related to flower development. miRNA172b might play an important role in the regulation of bisexual flower development-related gene expression in andromonoecious poplar, via modification of methylation. Gynomonoecious, female, and male poplars were used to validate the methylation patterns of the miRNA172b gene, implying that hyper-methylation in andromonoecious and gynomonoecious poplar might function as an important regulator in bisexual flower development. Our data provide a useful resource for the study of flower development in poplar and improve our understanding of the effect of epigenetic regulation on genes other than protein-coding genes. PMID:25617468
Hox genes and chordate evolution.

PubMed

Holland, P W; Garcia-Fernàndez, J

1996-02-01

Hox genes are implicated in the control of axial patterning during embryonic development of many, perhaps all, animals. Here we review recent data on Hox gene diversity, genomic organization, and embryonic expression in chordates (including tunicates, amphioxus, hagfish, lampreys, teleosts) plus their putative sister group, the hemichordates. We consider the potential of comparative Hox gene data to resolve some outstanding controversies in chordate phylogeny. The use of Hox gene expression patterns to identify homologies between body plans both within the vertebrates and between the chordate subphyla is also discussed. Homology between the vertebrate hindbrain and an extensive region of amphioxus neural tube is suggested by comparison of Hox-3 homologues and strengthened by new data on amphioxus Hox-1 gene expression reported here. Finally, we give two examples of how Hox genes are giving glimpses into chordate developmental evolution. The first relates changes in Hox gene expression to transposition of vertebral of vertebral identities; the second describes a correlation between vertebrate origins and Hox gene cluster duplication. We suggest that the simultaneous duplication of many classes of genes, often interacting in gene networks, allowed the elaboration of new developmental control mechanisms at vertebrate origins.
Chapter 15: Disease Gene Prioritization

PubMed Central

Bromberg, Yana

2013-01-01

Disease-causing aberrations in the normal function of a gene define that gene as a disease gene. Proving a causal link between a gene and a disease experimentally is expensive and time-consuming. Comprehensive prioritization of candidate genes prior to experimental testing drastically reduces the associated costs. Computational gene prioritization is based on various pieces of correlative evidence that associate each gene with the given disease and suggest possible causal links. A fair amount of this evidence comes from high-throughput experimentation. Thus, well-developed methods are necessary to reliably deal with the quantity of information at hand. Existing gene prioritization techniques already significantly improve the outcomes of targeted experimental studies. Faster and more reliable techniques that account for novel data types are necessary for the development of new diagnostics, treatments, and cure for many diseases. PMID:23633938
Twenty Years of European Union Support to Gene Therapy and Gene Transfer.

PubMed

Gancberg, David

2017-11-01

For 20 years and throughout its research programmes, the European Union has supported the entire innovation chain for gene transfer and gene therapy. The fruits of this investment are ripening as gene therapy products are reaching the European market and as clinical trials are demonstrating the safety of this approach to treat previously untreatable diseases.
FlyBase: genes and gene models

PubMed Central

Drysdale, Rachel A.; Crosby, Madeline A.

2005-01-01

FlyBase (http://flybase.org) is the primary repository of genetic and molecular data of the insect family Drosophilidae. For the most extensively studied species, Drosophila melanogaster, a wide range of data are presented in integrated formats. Data types include mutant phenotypes, molecular characterization of mutant alleles and aberrations, cytological maps, wild-type expression patterns, anatomical images, transgenic constructs and insertions, sequence-level gene models and molecular classification of gene product functions. There is a growing body of data for other Drosophila species; this is expected to increase dramatically over the next year, with the completion of draft-quality genomic sequences of an additional 11 Drosphila species. PMID:15608223
Analysis of lamprey clustered Fox genes: insight into Fox gene evolution and expression in vertebrates.

PubMed

Wotton, Karl R; Shimeld, Sebastian M

2011-12-01

In the human genome, members of the FoxC, FoxF, FoxL1, and FoxQ1 gene families are found in two paralagous clusters. One cluster contains the genes FOXQ1, FOXF2, FOXC1 and the second consists of FOXF1, FOXC2, and FOXL1. In jawed vertebrates these genes are known to be expressed in different pharyngeal tissues and all, except FoxQ1, are involved in patterning the early embryonic mesoderm. We have previously traced the evolution of this cluster in the bony vertebrates, and the gene content is identical in the dogfish, a member of the most basally branching lineage of the jawed vertebrates. Here we extend these analyses to jawless vertebrates. Using genomic searches and molecular approaches we have identified homologues of these genes from lampreys. We identify two FoxC genes, two FoxF genes, two FoxQ1 genes and single FoxL1 gene. We examine the embryonic expression of one predominantly mesodermally expressed gene family, FoxC, and the endodermally expressed member of the cluster, FoxQ1. We identified FoxQ1 transcripts in the pharyngeal endoderm, while the two FoxC genes are differentially expressed in the pharyngeal mesenchyme and ectoderm. Furthermore we identify conserved expression of lamprey FoxC genes in the paraxial and intermediate mesoderms. We interpret our results through a chordate-wide comparison of expression patterns and discuss gene content in the context of theories on the evolution of the vertebrate genome. 2011 Elsevier B.V. All rights reserved.
Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.

PubMed

Chen, Lei; Zhang, Yu-Hang; Wang, ShaoPeng; Zhang, YunHua; Huang, Tao; Cai, Yu-Dong

2017-01-01

Identifying essential genes in a given organism is important for research on their fundamental roles in organism survival. Furthermore, if possible, uncovering the links between core functions or pathways with these essential genes will further help us obtain deep insight into the key roles of these genes. In this study, we investigated the essential and non-essential genes reported in a previous study and extracted gene ontology (GO) terms and biological pathways that are important for the determination of essential genes. Through the enrichment theory of GO and KEGG pathways, we encoded each essential/non-essential gene into a vector in which each component represented the relationship between the gene and one GO term or KEGG pathway. To analyze these relationships, the maximum relevance minimum redundancy (mRMR) was adopted. Then, the incremental feature selection (IFS) and support vector machine (SVM) were employed to extract important GO terms and KEGG pathways. A prediction model was built simultaneously using the extracted GO terms and KEGG pathways, which yielded nearly perfect performance, with a Matthews correlation coefficient of 0.951, for distinguishing essential and non-essential genes. To fully investigate the key factors influencing the fundamental roles of essential genes, the 21 most important GO terms and three KEGG pathways were analyzed in detail. In addition, several genes was provided in this study, which were predicted to be essential genes by our prediction model. We suggest that this study provides more functional and pathway information on the essential genes and provides a new way to investigate related problems.
Gene therapy for haemophilia.

PubMed

Sharma, Akshay; Easow Mathew, Manu; Sriganesh, Vasumathi; Reiss, Ulrike M

2016-12-20

Haemophilia is a genetic disorder characterized by spontaneous or provoked, often uncontrolled, bleeding into joints, muscles and other soft tissues. Current methods of treatment are expensive, challenging and involve regular administration of clotting factors. Gene therapy has recently been prompted as a curative treatment modality. This is an update of a published Cochrane Review. To evaluate the safety and efficacy of gene therapy for treating people with haemophilia A or B. We searched the Cochrane Cystic Fibrosis & Genetic Disorders Group's Coagulopathies Trials Register, compiled from electronic database searches and handsearching of journals and conference abstract books. We also searched the reference lists of relevant articles and reviews.Date of last search: 18 August 2016. Eligible trials include randomised or quasi-randomised clinical trials, including controlled clinical trials comparing gene therapy (with or without standard treatment) with standard treatment (factor replacement) or other 'curative' treatment such as stem cell transplantation for individuals with haemophilia A or B of all ages who do not have inhibitors to factor VIII or IX. No trials of gene therapy for haemophilia were found. No trials of gene therapy for haemophilia were identified. No randomised or quasi-randomised clinical trials of gene therapy for haemophilia were identified. Thus, we are unable to determine the safety and efficacy of gene therapy for haemophilia. Gene therapy for haemophilia is still in its nascent stages and there is a need for well-designed clinical trials to assess the long-term feasibility, success and risks of gene therapy for people with haemophilia.
Antioxidant Defense Enzyme Genes and Asthma Susceptibility: Gender-Specific Effects and Heterogeneity in Gene-Gene Interactions between Pathogenetic Variants of the Disease

PubMed Central

Polonikov, Alexey V.; Ivanov, Vladimir P.; Bogomazov, Alexey D.; Freidin, Maxim B.; Illig, Thomas; Solodilova, Maria A.

2014-01-01

Oxidative stress resulting from an increased amount of reactive oxygen species and an imbalance between oxidants and antioxidants plays an important role in the pathogenesis of asthma. The present study tested the hypothesis that genetic susceptibility to allergic and nonallergic variants of asthma is determined by complex interactions between genes encoding antioxidant defense enzymes (ADE). We carried out a comprehensive analysis of the associations between adult asthma and 46 single nucleotide polymorphisms of 34 ADE genes and 12 other candidate genes of asthma in Russian population using set association analysis and multifactor dimensionality reduction approaches. We found for the first time epistatic interactions between ADE genes underlying asthma susceptibility and the genetic heterogeneity between allergic and nonallergic variants of the disease. We identified GSR (glutathione reductase) and PON2 (paraoxonase 2) as novel candidate genes for asthma susceptibility. We observed gender-specific effects of ADE genes on the risk of asthma. The results of the study demonstrate complexity and diversity of interactions between genes involved in oxidative stress underlying susceptibility to allergic and nonallergic asthma. PMID:24895604
Rational confederation of genes and diseases: NGS interpretation via GeneCards, MalaCards and VarElect.

PubMed

Rappaport, Noa; Fishilevich, Simon; Nudel, Ron; Twik, Michal; Belinky, Frida; Plaschkes, Inbar; Stein, Tsippi Iny; Cohen, Dana; Oz-Levi, Danit; Safran, Marilyn; Lancet, Doron

2017-08-18

A key challenge in the realm of human disease research is next generation sequencing (NGS) interpretation, whereby identified filtered variant-harboring genes are associated with a patient's disease phenotypes. This necessitates bioinformatics tools linked to comprehensive knowledgebases. The GeneCards suite databases, which include GeneCards (human genes), MalaCards (human diseases) and PathCards (human pathways) together with additional tools, are presented with the focus on MalaCards utility for NGS interpretation as well as for large scale bioinformatic analyses. VarElect, our NGS interpretation tool, leverages the broad information in the GeneCards suite databases. MalaCards algorithms unify disease-related terms and annotations from 69 sources. Further, MalaCards defines hierarchical relatedness-aliases, disease families, a related diseases network, categories and ontological classifications. GeneCards and MalaCards delineate and share a multi-tiered, scored gene-disease network, with stringency levels, including the definition of elite status-high quality gene-disease pairs, coming from manually curated trustworthy sources, that includes 4500 genes for 8000 diseases. This unique resource is key to NGS interpretation by VarElect. VarElect, a comprehensive search tool that helps infer both direct and indirect links between genes and user-supplied disease/phenotype terms, is robustly strengthened by the information found in MalaCards. The indirect mode benefits from GeneCards' diverse gene-to-gene relationships, including SuperPaths-integrated biological pathways from 12 information sources. We are currently adding an important information layer in the form of "disease SuperPaths", generated from the gene-disease matrix by an algorithm similar to that previously employed for biological pathway unification. This allows the discovery of novel gene-disease and disease-disease relationships. The advent of whole genome sequencing necessitates capacities to go beyond
Gene Fusion Markup Language: a prototype for exchanging gene fusion data.

PubMed

Kalyana-Sundaram, Shanker; Shanmugam, Achiraman; Chinnaiyan, Arul M

2012-10-16

An avalanche of next generation sequencing (NGS) studies has generated an unprecedented amount of genomic structural variation data. These studies have also identified many novel gene fusion candidates with more detailed resolution than previously achieved. However, in the excitement and necessity of publishing the observations from this recently developed cutting-edge technology, no community standardization approach has arisen to organize and represent the data with the essential attributes in an interchangeable manner. As transcriptome studies have been widely used for gene fusion discoveries, the current non-standard mode of data representation could potentially impede data accessibility, critical analyses, and further discoveries in the near future. Here we propose a prototype, Gene Fusion Markup Language (GFML) as an initiative to provide a standard format for organizing and representing the significant features of gene fusion data. GFML will offer the advantage of representing the data in a machine-readable format to enable data exchange, automated analysis interpretation, and independent verification. As this database-independent exchange initiative evolves it will further facilitate the formation of related databases, repositories, and analysis tools. The GFML prototype is made available at http://code.google.com/p/gfml-prototype/. The Gene Fusion Markup Language (GFML) presented here could facilitate the development of a standard format for organizing, integrating and representing the significant features of gene fusion data in an inter-operable and query-able fashion that will enable biologically intuitive access to gene fusion findings and expedite functional characterization. A similar model is envisaged for other NGS data analyses.

Gene Fusion Markup Language: a prototype for exchanging gene fusion data

PubMed Central

2012-01-01

Background An avalanche of next generation sequencing (NGS) studies has generated an unprecedented amount of genomic structural variation data. These studies have also identified many novel gene fusion candidates with more detailed resolution than previously achieved. However, in the excitement and necessity of publishing the observations from this recently developed cutting-edge technology, no community standardization approach has arisen to organize and represent the data with the essential attributes in an interchangeable manner. As transcriptome studies have been widely used for gene fusion discoveries, the current non-standard mode of data representation could potentially impede data accessibility, critical analyses, and further discoveries in the near future. Results Here we propose a prototype, Gene Fusion Markup Language (GFML) as an initiative to provide a standard format for organizing and representing the significant features of gene fusion data. GFML will offer the advantage of representing the data in a machine-readable format to enable data exchange, automated analysis interpretation, and independent verification. As this database-independent exchange initiative evolves it will further facilitate the formation of related databases, repositories, and analysis tools. The GFML prototype is made available at http://code.google.com/p/gfml-prototype/. Conclusion The Gene Fusion Markup Language (GFML) presented here could facilitate the development of a standard format for organizing, integrating and representing the significant features of gene fusion data in an inter-operable and query-able fashion that will enable biologically intuitive access to gene fusion findings and expedite functional characterization. A similar model is envisaged for other NGS data analyses. PMID:23072312
EcoGene 3.0

PubMed Central

Zhou, Jindan; Rudd, Kenneth E.

2013-01-01

EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection. PMID:23197660
EcoGene 3.0.

PubMed

Zhou, Jindan; Rudd, Kenneth E

2013-01-01

EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection.
GeneMesh: a web-based microarray analysis tool for relating differentially expressed genes to MeSH terms.

PubMed

Jani, Saurin D; Argraves, Gary L; Barth, Jeremy L; Argraves, W Scott

2010-04-01

An important objective of DNA microarray-based gene expression experimentation is determining inter-relationships that exist between differentially expressed genes and biological processes, molecular functions, cellular components, signaling pathways, physiologic processes and diseases. Here we describe GeneMesh, a web-based program that facilitates analysis of DNA microarray gene expression data. GeneMesh relates genes in a query set to categories available in the Medical Subject Headings (MeSH) hierarchical index. The interface enables hypothesis driven relational analysis to a specific MeSH subcategory (e.g., Cardiovascular System, Genetic Processes, Immune System Diseases etc.) or unbiased relational analysis to broader MeSH categories (e.g., Anatomy, Biological Sciences, Disease etc.). Genes found associated with a given MeSH category are dynamically linked to facilitate tabular and graphical depiction of Entrez Gene information, Gene Ontology information, KEGG metabolic pathway diagrams and intermolecular interaction information. Expression intensity values of groups of genes that cluster in relation to a given MeSH category, gene ontology or pathway can be displayed as heat maps of Z score-normalized values. GeneMesh operates on gene expression data derived from a number of commercial microarray platforms including Affymetrix, Agilent and Illumina. GeneMesh is a versatile web-based tool for testing and developing new hypotheses through relating genes in a query set (e.g., differentially expressed genes from a DNA microarray experiment) to descriptors making up the hierarchical structure of the National Library of Medicine controlled vocabulary thesaurus, MeSH. The system further enhances the discovery process by providing links between sets of genes associated with a given MeSH category to a rich set of html linked tabular and graphic information including Entrez Gene summaries, gene ontologies, intermolecular interactions, overlays of genes onto KEGG
LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights

PubMed Central

Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

2016-01-01

Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher’s exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO’s usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher. PMID:26750448
LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

PubMed

Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

2016-01-11

Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.
Genome-Wide Analysis of Gene-Gene and Gene-Environment Interactions Using Closed-Form Wald Tests.

PubMed

Yu, Zhaoxia; Demetriou, Michael; Gillen, Daniel L

2015-09-01

Despite the successful discovery of hundreds of variants for complex human traits using genome-wide association studies, the degree to which genes and environmental risk factors jointly affect disease risk is largely unknown. One obstacle toward this goal is that the computational effort required for testing gene-gene and gene-environment interactions is enormous. As a result, numerous computationally efficient tests were recently proposed. However, the validity of these methods often relies on unrealistic assumptions such as additive main effects, main effects at only one variable, no linkage disequilibrium between the two single-nucleotide polymorphisms (SNPs) in a pair or gene-environment independence. Here, we derive closed-form and consistent estimates for interaction parameters and propose to use Wald tests for testing interactions. The Wald tests are asymptotically equivalent to the likelihood ratio tests (LRTs), largely considered to be the gold standard tests but generally too computationally demanding for genome-wide interaction analysis. Simulation studies show that the proposed Wald tests have very similar performances with the LRTs but are much more computationally efficient. Applying the proposed tests to a genome-wide study of multiple sclerosis, we identify interactions within the major histocompatibility complex region. In this application, we find that (1) focusing on pairs where both SNPs are marginally significant leads to more significant interactions when compared to focusing on pairs where at least one SNP is marginally significant; and (2) parsimonious parameterization of interaction effects might decrease, rather than increase, statistical power. © 2015 WILEY PERIODICALS, INC.
Genotype-based association models of complex diseases to detect gene-gene and gene-environment interactions.

PubMed

Lobach, Iryna; Fan, Ruzong; Manga, Prashiela

A central problem in genetic epidemiology is to identify and rank genetic markers involved in a disease. Complex diseases, such as cancer, hypertension, diabetes, are thought to be caused by an interaction of a panel of genetic factors, that can be identified by markers, which modulate environmental factors. Moreover, the effect of each genetic marker may be small. Hence, the association signal may be missed unless a large sample is considered, or a priori biomedical data are used. Recent advances generated a vast variety of a priori information, including linkage maps and information about gene regulatory dependence assembled into curated pathway databases. We propose a genotype-based approach that takes into account linkage disequilibrium (LD) information between genetic markers that are in moderate LD while modeling gene-gene and gene-environment interactions. A major advantage of our method is that the observed genetic information enters a model directly thus eliminating the need to estimate haplotype-phase. Our approach results in an algorithm that is inexpensive computationally and does not suffer from bias induced by haplotype-phase ambiguity. We investigated our model in a series of simulation experiments and demonstrated that the proposed approach results in estimates that are nearly unbiased and have small variability. We applied our method to the analysis of data from a melanoma case-control study and investigated interaction between a set of pigmentation genes and environmental factors defined by age and gender. Furthermore, an application of our method is demonstrated using a study of Alcohol Dependence.
Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

PubMed

Obayashi, Takeshi; Kinoshita, Kengo

2010-05-01

Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.
Repeated evolution of chimeric fusion genes in the β-globin gene family of laurasiatherian mammals.

PubMed

Gaudry, Michael J; Storz, Jay F; Butts, Gary Tyler; Campbell, Kevin L; Hoffmann, Federico G

2014-05-09

The evolutionary fate of chimeric fusion genes may be strongly influenced by their recombinational mode of origin and the nature of functional divergence between the parental genes. In the β-globin gene family of placental mammals, the two postnatally expressed δ- and β-globin genes (HBD and HBB, respectively) have a propensity for recombinational exchange via gene conversion and unequal crossing-over. In the latter case, there are good reasons to expect differences in retention rates for the reciprocal HBB/HBD and HBD/HBB fusion genes due to thalassemia pathologies associated with the HBD/HBB "Lepore" deletion mutant in humans. Here, we report a comparative genomic analysis of the mammalian β-globin gene cluster, which revealed that chimeric HBB/HBD fusion genes originated independently in four separate lineages of laurasiatherian mammals: Eulipotyphlans (shrews, moles, and hedgehogs), carnivores, microchiropteran bats, and cetaceans. In cases where an independently derived "anti-Lepore" duplication mutant has become fixed, the parental HBD and/or HBB genes have typically been inactivated or deleted, so that the newly created HBB/HBD fusion gene is primarily responsible for synthesizing the β-type subunits of adult and fetal hemoglobin (Hb). Contrary to conventional wisdom that the HBD gene is a vestigial relict that is typically inactivated or expressed at negligible levels, we show that HBD-like genes often encode a substantial fraction (20-100%) of β-chain Hbs in laurasiatherian taxa. Our results indicate that the ascendancy or resuscitation of genes with HBD-like coding sequence requires the secondary acquisition of HBB-like promoter sequence via unequal crossing-over or interparalog gene conversion. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gene Presence-Absence Polymorphism in Castrating Anther-Smut Fungi: Recent Gene Gains and Phylogeographic Structure.

PubMed

Hartmann, Fanny E; Rodríguez de la Vega, Ricardo C; Brandenburg, Jean-Tristan; Carpentier, Fantin; Giraud, Tatiana

2018-04-01

Gene presence-absence polymorphisms segregating within species are a significant source of genetic variation but have been little investigated to date in natural populations. In plant pathogens, the gain or loss of genes encoding proteins interacting directly with the host, such as secreted proteins, probably plays an important role in coevolution and local adaptation. We investigated gene presence-absence polymorphism in populations of two closely related species of castrating anther-smut fungi, Microbotryum lychnidis-dioicae (MvSl) and M. silenes-dioicae (MvSd), from across Europe, on the basis of Illumina genome sequencing data and high-quality genome references. We observed presence-absence polymorphism for 186 autosomal genes (2% of all genes) in MvSl, and only 51 autosomal genes in MvSd. Distinct genes displayed presence-absence polymorphism in the two species. Genes displaying presence-absence polymorphism were frequently located in subtelomeric and centromeric regions and close to repetitive elements, and comparison with outgroups indicated that most were present in a single species, being recently acquired through duplications in multiple-gene families. Gene presence-absence polymorphism in MvSl showed a phylogeographic structure corresponding to clusters detected based on SNPs. In addition, gene absence alleles were rare within species and skewed toward low-frequency variants. These findings are consistent with a deleterious or neutral effect for most gene presence-absence polymorphism. Some of the observed gene loss and gain events may however be adaptive, as suggested by the putative functions of the corresponding encoded proteins (e.g., secreted proteins) or their localization within previously identified selective sweeps. The adaptive roles in plant and anther-smut fungi interactions of candidate genes however need to be experimentally tested in future studies.
Methylation of microRNA genes regulates gene expression in bisexual flower development in andromonoecious poplar.

PubMed

Song, Yuepeng; Tian, Min; Ci, Dong; Zhang, Deqiang

2015-04-01

Previous studies showed sex-specific DNA methylation and expression of candidate genes in bisexual flowers of andromonoecious poplar, but the regulatory relationship between methylation and microRNAs (miRNAs) remains unclear. To investigate whether the methylation of miRNA genes regulates gene expression in bisexual flower development, the methylome, microRNA, and transcriptome were examined in female and male flowers of andromonoecious poplar. 27 636 methylated coding genes and 113 methylated miRNA genes were identified. In the coding genes, 64.5% of the methylated reads mapped to the gene body region; by contrast, 60.7% of methylated reads in miRNA genes mainly mapped in the 5' and 3' flanking regions. CHH methylation showed the highest methylation levels and CHG showed the lowest methylation levels. Correlation analysis showed a significant, negative, strand-specific correlation of methylation and miRNA gene expression (r=0.79, P <0.05). The methylated miRNA genes included eight long miRNAs (lmiRNAs) of 24 nucleotides and 11 miRNAs related to flower development. miRNA172b might play an important role in the regulation of bisexual flower development-related gene expression in andromonoecious poplar, via modification of methylation. Gynomonoecious, female, and male poplars were used to validate the methylation patterns of the miRNA172b gene, implying that hyper-methylation in andromonoecious and gynomonoecious poplar might function as an important regulator in bisexual flower development. Our data provide a useful resource for the study of flower development in poplar and improve our understanding of the effect of epigenetic regulation on genes other than protein-coding genes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Gene Presence–Absence Polymorphism in Castrating Anther-Smut Fungi: Recent Gene Gains and Phylogeographic Structure

PubMed Central

Rodríguez de la Vega, Ricardo C; Brandenburg, Jean-Tristan; Carpentier, Fantin; Giraud, Tatiana

2018-01-01

Abstract Gene presence–absence polymorphisms segregating within species are a significant source of genetic variation but have been little investigated to date in natural populations. In plant pathogens, the gain or loss of genes encoding proteins interacting directly with the host, such as secreted proteins, probably plays an important role in coevolution and local adaptation. We investigated gene presence–absence polymorphism in populations of two closely related species of castrating anther-smut fungi, Microbotryum lychnidis-dioicae (MvSl) and M. silenes-dioicae (MvSd), from across Europe, on the basis of Illumina genome sequencing data and high-quality genome references. We observed presence–absence polymorphism for 186 autosomal genes (2% of all genes) in MvSl, and only 51 autosomal genes in MvSd. Distinct genes displayed presence–absence polymorphism in the two species. Genes displaying presence–absence polymorphism were frequently located in subtelomeric and centromeric regions and close to repetitive elements, and comparison with outgroups indicated that most were present in a single species, being recently acquired through duplications in multiple-gene families. Gene presence–absence polymorphism in MvSl showed a phylogeographic structure corresponding to clusters detected based on SNPs. In addition, gene absence alleles were rare within species and skewed toward low-frequency variants. These findings are consistent with a deleterious or neutral effect for most gene presence–absence polymorphism. Some of the observed gene loss and gain events may however be adaptive, as suggested by the putative functions of the corresponding encoded proteins (e.g., secreted proteins) or their localization within previously identified selective sweeps. The adaptive roles in plant and anther-smut fungi interactions of candidate genes however need to be experimentally tested in future studies. PMID:29722826
The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

PubMed

Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

2011-02-01

The gonadal soma-derived factor (GSDF) belongs to the transforming growth factor-β superfamily and is conserved in teleostean fish species. Gsdf is specifically expressed in the gonads, and gene expression is restricted to the granulosa and Sertoli cells in trout and medaka. The gsdf gene expression is correlated to early testis differentiation in medaka and was shown to stimulate primordial germ cell and spermatogonia proliferation in trout. In the present study, we show that the gsdf gene localizes to a syntenic chromosomal fragment conserved among vertebrates although no gsdf-related gene is detected on the corresponding genomic region in tetrapods. We demonstrate using quantitative RT-PCR that most of the genes localized in the synteny are specifically expressed in medaka gonads. Gsdf is the only gene of the synteny with a much higher expression in the testis compared to the ovary. In contrast, gene expression pattern analysis of the gsdf surrounding genes (nup54, aff1, klhl8, sdad1, and ptpn13) indicates that these genes are preferentially expressed in the female gonads. The tissue distribution of these genes is highly similar in medaka and zebrafish, two teleostean species that have diverged more than 110 million years ago. The cellular localization of these genes was determined in medaka gonads using the whole-mount in situ hybridization technique. We confirm that gsdf gene expression is restricted to Sertoli and granulosa cells in contact with the premeiotic and meiotic cells. The nup54 gene is expressed in spermatocytes and previtellogenic oocytes. Transcripts corresponding to the ovary-specific genes (aff1, klhl8, and sdad1) are detected only in previtellogenic oocytes. No expression was detected in the gonocytes in 10 dpf embryos. In conclusion, we show that the gsdf gene localizes to a syntenic chromosomal fragment harboring evolutionary conserved genes in vertebrates. These genes are preferentially expressed in previtelloogenic oocytes, and thus, they
A high-throughput virus-induced gene silencing protocol identifies genes involved in multi-stress tolerance

PubMed Central

2013-01-01

Background Understanding the function of a particular gene under various stresses is important for engineering plants for broad-spectrum stress tolerance. Although virus-induced gene silencing (VIGS) has been used to characterize genes involved in abiotic stress tolerance, currently available gene silencing and stress imposition methodology at the whole plant level is not suitable for high-throughput functional analyses of genes. This demands a robust and reliable methodology for characterizing genes involved in abiotic and multi-stress tolerance. Results Our methodology employs VIGS-based gene silencing in leaf disks combined with simple stress imposition and effect quantification methodologies for easy and faster characterization of genes involved in abiotic and multi-stress tolerance. By subjecting leaf disks from gene-silenced plants to various abiotic stresses and inoculating silenced plants with various pathogens, we show the involvement of several genes for multi-stress tolerance. In addition, we demonstrate that VIGS can be used to characterize genes involved in thermotolerance. Our results also showed the functional relevance of NtEDS1 in abiotic stress, NbRBX1 and NbCTR1 in oxidative stress; NtRAR1 and NtNPR1 in salinity stress; NbSOS1 and NbHSP101 in biotic stress; and NtEDS1, NbETR1, NbWRKY2 and NbMYC2 in thermotolerance. Conclusions In addition to widening the application of VIGS, we developed a robust, easy and high-throughput methodology for functional characterization of genes involved in multi-stress tolerance. PMID:24289810
Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis

DTIC Science & Technology

2012-08-01

AD_________________ Award Number: W81XWH-10-1-0469 TITLE: Genetic Evaluation for the Scoliosis ...Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis PRINCIPAL INVESTIGATOR: David W. Polly, Jr., M.D...2011 – 31 July 2012 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis 1
Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae).

PubMed

Baker, Richard H; Narechania, Apurva; Johns, Philip M; Wilkinson, Gerald S

2012-08-19

Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict.
Why commercialization of gene therapy stalled; examining the life cycles of gene therapy technologies.

PubMed

Ledley, F D; McNamee, L M; Uzdil, V; Morgan, I W

2014-02-01

This report examines the commercialization of gene therapy in the context of innovation theories that posit a relationship between the maturation of a technology through its life cycle and prospects for successful product development. We show that the field of gene therapy has matured steadily since the 1980s, with the congruent accumulation of >35 000 papers, >16 000 US patents, >1800 clinical trials and >$4.3 billion in capital investment in gene therapy companies. Gene therapy technologies comprise a series of dissimilar approaches for gene delivery, each of which has introduced a distinct product architecture. Using bibliometric methods, we quantify the maturation of each technology through a characteristic life cycle S-curve, from a Nascent stage, through a Growing stage of exponential advance, toward an Established stage and projected limit. Capital investment in gene therapy is shown to have occurred predominantly in Nascent stage technologies and to be negatively correlated with maturity. Gene therapy technologies are now achieving the level of maturity that innovation research and biotechnology experience suggest may be requisite for efficient product development. Asynchrony between the maturation of gene therapy technologies and capital investment in development-focused business models may have stalled the commercialization of gene therapy.
Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae)

PubMed Central

Baker, Richard H.; Narechania, Apurva; Johns, Philip M.; Wilkinson, Gerald S.

2012-01-01

Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict. PMID:22777023
Radial chromatin positioning is shaped by local gene density, not by gene expression

PubMed Central

2009-01-01

G- and R-bands of metaphase chromosomes are characterized by profound differences in gene density, CG content, replication timing, and chromatin compaction. The preferential localization of gene-dense, transcriptionally active, and early replicating chromatin in the nuclear interior and of gene-poor, later replicating chromatin at the nuclear envelope has been demonstrated to be evolutionary-conserved in various cell types. Yet, the impact of different local chromatin features on the radial nuclear arrangement of chromatin is still not well understood. In particular, it is not known whether radial chromatin positioning is preferentially shaped by local gene density per se or by other related parameters such as replication timing or transcriptional activity. The interdependence of these distinct chromatin features on the linear deoxyribonucleic acid (DNA) sequence precludes a simple dissection of these parameters with respect to their importance for the reorganization of the linear DNA organization into the distinct radial chromatin arrangements observed in the nuclear space. To analyze this problem, we generated probe sets of pooled bacterial artificial chromosome (BAC) clones from HSA 11, 12, 18, and 19 representing R/G-band-assigned chromatin, segments with different gene density and gene loci with different expression levels. Using multicolor 3D flourescent in situ hybridization (FISH) and 3D image analysis, we determined their localization in the nucleus and their positions within or outside the corresponding chromosome territory (CT). For each BAC data on local gene density within 2- and 10-Mb windows, as well as GC (guanine and cytosine) content, replication timing and expression levels were determined. A correlation analysis of these parameters with nuclear positioning revealed regional gene density as the decisive parameter determining the radial positioning of chromatin in the nucleus in contrast to band assignment, replication timing, and transcriptional

Reference gene selection for quantitative gene expression studies during biological invasions: A test on multiple genes and tissues in a model ascidian Ciona savignyi.

PubMed

Huang, Xuena; Gao, Yangchun; Jiang, Bei; Zhou, Zunchun; Zhan, Aibin

2016-01-15

As invasive species have successfully colonized a wide range of dramatically different local environments, they offer a good opportunity to study interactions between species and rapidly changing environments. Gene expression represents one of the primary and crucial mechanisms for rapid adaptation to local environments. Here, we aim to select reference genes for quantitative gene expression analysis based on quantitative Real-Time PCR (qRT-PCR) for a model invasive ascidian, Ciona savignyi. We analyzed the stability of ten candidate reference genes in three tissues (siphon, pharynx and intestine) under two key environmental stresses (temperature and salinity) in the marine realm based on three programs (geNorm, NormFinder and delta Ct method). Our results demonstrated only minor difference for stability rankings among the three methods. The use of different single reference gene might influence the data interpretation, while multiple reference genes could minimize possible errors. Therefore, reference gene combinations were recommended for different tissues - the optimal reference gene combination for siphon was RPS15 and RPL17 under temperature stress, and RPL17, UBQ and TubA under salinity treatment; for pharynx, TubB, TubA and RPL17 were the most stable genes under temperature stress, while TubB, TubA and UBQ were the best under salinity stress; for intestine, UBQ, RPS15 and RPL17 were the most reliable reference genes under both treatments. Our results suggest that the necessity of selection and test of reference genes for different tissues under varying environmental stresses. The results obtained here are expected to reveal mechanisms of gene expression-mediated invasion success using C. savignyi as a model species. Copyright © 2015 Elsevier B.V. All rights reserved.
Discovery of time-delayed gene regulatory networks based on temporal gene expression profiling

PubMed Central

Li, Xia; Rao, Shaoqi; Jiang, Wei; Li, Chuanxing; Xiao, Yun; Guo, Zheng; Zhang, Qingpu; Wang, Lihong; Du, Lei; Li, Jing; Li, Li; Zhang, Tianwen; Wang, Qing K

2006-01-01

Background It is one of the ultimate goals for modern biological research to fully elucidate the intricate interplays and the regulations of the molecular determinants that propel and characterize the progression of versatile life phenomena, to name a few, cell cycling, developmental biology, aging, and the progressive and recurrent pathogenesis of complex diseases. The vast amount of large-scale and genome-wide time-resolved data is becoming increasing available, which provides the golden opportunity to unravel the challenging reverse-engineering problem of time-delayed gene regulatory networks. Results In particular, this methodological paper aims to reconstruct regulatory networks from temporal gene expression data by using delayed correlations between genes, i.e., pairwise overlaps of expression levels shifted in time relative each other. We have thus developed a novel model-free computational toolbox termed TdGRN (Time-delayed Gene Regulatory Network) to address the underlying regulations of genes that can span any unit(s) of time intervals. This bioinformatics toolbox has provided a unified approach to uncovering time trends of gene regulations through decision analysis of the newly designed time-delayed gene expression matrix. We have applied the proposed method to yeast cell cycling and human HeLa cell cycling and have discovered most of the underlying time-delayed regulations that are supported by multiple lines of experimental evidence and that are remarkably consistent with the current knowledge on phase characteristics for the cell cyclings. Conclusion We established a usable and powerful model-free approach to dissecting high-order dynamic trends of gene-gene interactions. We have carefully validated the proposed algorithm by applying it to two publicly available cell cycling datasets. In addition to uncovering the time trends of gene regulations for cell cycling, this unified approach can also be used to study the complex gene regulations related to
Transcriptome-Level Signatures in Gene Expression and Gene Expression Variability during Bacterial Adaptive Evolution.

PubMed

Erickson, Keesha E; Otoupal, Peter B; Chatterjee, Anushree

2017-01-01

Antibiotic-resistant bacteria are an increasingly serious public health concern, as strains emerge that demonstrate resistance to almost all available treatments. One factor that contributes to the crisis is the adaptive ability of bacteria, which exhibit remarkable phenotypic and gene expression heterogeneity in order to gain a survival advantage in damaging environments. This high degree of variability in gene expression across biological populations makes it a challenging task to identify key regulators of bacterial adaptation. Here, we research the regulation of adaptive resistance by investigating transcriptome profiles of Escherichia coli upon adaptation to disparate toxins, including antibiotics and biofuels. We locate potential target genes via conventional gene expression analysis as well as using a new analysis technique examining differential gene expression variability. By investigating trends across the diverse adaptation conditions, we identify a focused set of genes with conserved behavior, including those involved in cell motility, metabolism, membrane structure, and transport, and several genes of unknown function. To validate the biological relevance of the observed changes, we synthetically perturb gene expression using clustered regularly interspaced short palindromic repeat (CRISPR)-dCas9. Manipulation of select genes in combination with antibiotic treatment promotes adaptive resistance as demonstrated by an increased degree of antibiotic tolerance and heterogeneity in MICs. We study the mechanisms by which identified genes influence adaptation and find that select differentially variable genes have the potential to impact metabolic rates, mutation rates, and motility. Overall, this work provides evidence for a complex nongenetic response, encompassing shifts in gene expression and gene expression variability, which underlies adaptive resistance. IMPORTANCE Even initially sensitive bacteria can rapidly thwart antibiotic treatment through stress
Intracellular high cholesterol content disorders the clock genes, apoptosis-related genes and fibrinolytic-related genes rhythmic expressions in human plaque-derived vascular smooth muscle cells.

PubMed

Lin, Changpo; Tang, Xiao; Xu, Lirong; Qian, Ruizhe; Shi, Zhenyu; Wang, Lixin; Cai, Tingting; Yan, Dong; Fu, Weiguo; Guo, Daqiao

2017-07-10

The clock genes are involved in regulating cardiovascular functions, and their expression disorders would lead to circadian rhythm disruptions of clock-controlled genes (CCGs), resulting in atherosclerotic plaque formation and rupture. Our previous study revealed the rhythmic expression of clock genes were attenuated in human plaque-derived vascular smooth muscle cells (PVSMCs), but failed to detect the downstream CCGs expressions and the underlying molecular mechanism. In this study, we examined the difference of CCGs rhythmic expression between human normal carotid VSMCs (NVSMCs) and PVSMCs. Furthermore, we compared the cholesterol and triglycerides levels between two groups and the link to clock genes and CCGs expressions. Seven health donors' normal carotids and 19 carotid plaques yielded viable cultured NVSMCs and PVSMCs. The expression levels of target genes were measured by quantitative real-time PCR and Western-blot. The intracellular cholesterol and triglycerides levels were measured by kits. The circadian expressions of apoptosis-related genes and fibrinolytic-related genes were disordered. Besides, the cholesterol levels were significant higher in PVSMCs. After treated with cholesterol or oxidized low density lipoprotein (ox-LDL), the expressions of clock genes were inhibited; and the rhythmic expressions of clock genes, apoptosis-related genes and fibrinolytic-related genes were disturbed in NVSMCs, which were similar to PVSMCs. The results suggested that intracellular high cholesterol content of PVSMCs would lead to the disorders of clock genes and CCGs rhythmic expressions. And further studies should be conducted to demonstrate the specific molecular mechanisms involved.
Comparative studies of gene expression and the evolution of gene regulation

PubMed Central

Romero, Irene Gallego; Ruvinsky, Ilya; Gilad, Yoav

2014-01-01

The hypothesis that differences in gene regulation play an important role in speciation and adaptation is more than 40 years old. With the advent of new sequencing technologies, we are able to characterize and study gene expression levels and associated regulatory mechanisms in a large number of individuals and species at unprecedented resolution and scale. We have thus gained new insights into the evolutionary pressures that shape gene expression levels, as well as developed an appreciation for the relative importance of evolutionary changes in different regulatory genetic and epigenetic mechanisms. The current challenge is to link gene regulatory changes to adaptive evolution of complex phenotypes. Here we mainly focus on comparative studies in primates, and how they are complemented by studies in model organisms. PMID:22705669
Anti-inflammatory genes associated with multiple sclerosis: a gene expression study.

PubMed

Perga, S; Montarolo, F; Martire, S; Berchialla, P; Malucchi, S; Bertolotto, A

2015-02-15

Multiple sclerosis (MS) is an autoimmune inflammatory disease of the central nervous system caused by a complex interaction between multiple genes and environmental factors. HLA region is the strongest susceptibility locus, but recent huge genome-wide association studies identified new susceptibility genes. Among these, BACH2, PTGER4, RGS1 and ZFP36L1 were highlighted. Here, a gene expression analysis revealed that three of them, namely BACH2, PTGER4 and ZFP36L1, are down-regulated in MS patients' blood cells compared to healthy subjects. Interestingly, all these genes are involved in the immune system regulation with predominant anti-inflammatory role and their reduction could predispose to MS development. Copyright © 2015 Elsevier B.V. All rights reserved.
Leveraging multiple gene networks to prioritize GWAS candidate genes via network representation learning.

PubMed

Wu, Mengmeng; Zeng, Wanwen; Liu, Wenqiang; Lv, Hairong; Chen, Ting; Jiang, Rui

2018-06-03

Genome-wide association studies (GWAS) have successfully discovered a number of disease-associated genetic variants in the past decade, providing an unprecedented opportunity for deciphering genetic basis of human inherited diseases. However, it is still a challenging task to extract biological knowledge from the GWAS data, due to such issues as missing heritability and weak interpretability. Indeed, the fact that the majority of discovered loci fall into noncoding regions without clear links to genes has been preventing the characterization of their functions and appealing for a sophisticated approach to bridge genetic and genomic studies. Towards this problem, network-based prioritization of candidate genes, which performs integrated analysis of gene networks with GWAS data, has emerged as a promising direction and attracted much attention. However, most existing methods overlook the sparse and noisy properties of gene networks and thus may lead to suboptimal performance. Motivated by this understanding, we proposed a novel method called REGENT for integrating multiple gene networks with GWAS data to prioritize candidate genes for complex diseases. We leveraged a technique called the network representation learning to embed a gene network into a compact and robust feature space, and then designed a hierarchical statistical model to integrate features of multiple gene networks with GWAS data for the effective inference of genes associated with a disease of interest. We applied our method to six complex diseases and demonstrated the superior performance of REGENT over existing approaches in recovering known disease-associated genes. We further conducted a pathway analysis and showed that the ability of REGENT to discover disease-associated pathways. We expect to see applications of our method to a broad spectrum of diseases for post-GWAS analysis. REGENT is freely available at https://github.com/wmmthu/REGENT. Copyright © 2018 Elsevier Inc. All rights reserved.
Inverse gene-for-gene interactions contribute additively to tan spot susceptibility in wheat.

PubMed

Liu, Zhaohui; Zurn, Jason D; Kariyawasam, Gayan; Faris, Justin D; Shi, Gongjun; Hansen, Jana; Rasmussen, Jack B; Acevedo, Maricelis

2017-06-01

Tan spot susceptibility is conferred by multiple interactions of necrotrophic effector and host sensitivity genes. Tan spot of wheat, caused by Pyrenophora tritici-repentis, is an important disease in almost all wheat-growing areas of the world. The disease system is known to involve at least three fungal-produced necrotrophic effectors (NEs) that interact with the corresponding host sensitivity (S) genes in an inverse gene-for-gene manner to induce disease. However, it is unknown if the effects of these NE-S gene interactions contribute additively to the development of tan spot. In this work, we conducted disease evaluations using different races and quantitative trait loci (QTL) analysis in a wheat recombinant inbred line (RIL) population derived from a cross between two susceptible genotypes, LMPG-6 and PI 626573. The two parental lines each harbored a single known NE sensitivity gene with LMPG-6 having the Ptr ToxC sensitivity gene Tsc1 and PI 626573 having the Ptr ToxA sensitivity gene Tsn1. Transgressive segregation was observed in the population for all races. QTL mapping revealed that both loci (Tsn1 and Tsc1) were significantly associated with susceptibility to race 1 isolates, which produce both Ptr ToxA and Ptr ToxC, and the two genes contributed additively to tan spot susceptibility. For isolates of races 2 and 3, which produce only Ptr ToxA and Ptr ToxC, only Tsn1 and Tsc1 were associated with tan spot susceptibility, respectively. This work clearly demonstrates that tan spot susceptibility in this population is due primarily to two NE-S interactions. Breeders should remove both sensitivity genes from wheat lines to obtain high levels of tan spot resistance.
Gene therapy for ocular diseases.

PubMed

Liu, Melissa M; Tuo, Jingsheng; Chan, Chi-Chao

2011-05-01

The eye is an easily accessible, highly compartmentalised and immune-privileged organ that offers unique advantages as a gene therapy target. Significant advancements have been made in understanding the genetic pathogenesis of ocular diseases, and gene replacement and gene silencing have been implicated as potentially efficacious therapies. Recent improvements have been made in the safety and specificity of vector-based ocular gene transfer methods. Proof-of-concept for vector-based gene therapies has also been established in several experimental models of human ocular diseases. After nearly two decades of ocular gene therapy research, preliminary successes are now being reported in phase 1 clinical trials for the treatment of Leber congenital amaurosis. This review describes current developments and future prospects for ocular gene therapy. Novel methods are being developed to enhance the performance and regulation of recombinant adeno-associated virus- and lentivirus-mediated ocular gene transfer. Gene therapy prospects have advanced for a variety of retinal disorders, including retinitis pigmentosa, retinoschisis, Stargardt disease and age-related macular degeneration. Advances have also been made using experimental models for non-retinal diseases, such as uveitis and glaucoma. These methodological advancements are critical for the implementation of additional gene-based therapies for human ocular diseases in the near future.
Spectral gene set enrichment (SGSE).

PubMed

Frost, H Robert; Li, Zhigang; Moore, Jason H

2015-03-03

Gene set testing is typically performed in a supervised context to quantify the association between groups of genes and a clinical phenotype. In many cases, however, a gene set-based interpretation of genomic data is desired in the absence of a phenotype variable. Although methods exist for unsupervised gene set testing, they predominantly compute enrichment relative to clusters of the genomic variables with performance strongly dependent on the clustering algorithm and number of clusters. We propose a novel method, spectral gene set enrichment (SGSE), for unsupervised competitive testing of the association between gene sets and empirical data sources. SGSE first computes the statistical association between gene sets and principal components (PCs) using our principal component gene set enrichment (PCGSE) method. The overall statistical association between each gene set and the spectral structure of the data is then computed by combining the PC-level p-values using the weighted Z-method with weights set to the PC variance scaled by Tracy-Widom test p-values. Using simulated data, we show that the SGSE algorithm can accurately recover spectral features from noisy data. To illustrate the utility of our method on real data, we demonstrate the superior performance of the SGSE method relative to standard cluster-based techniques for testing the association between MSigDB gene sets and the variance structure of microarray gene expression data. Unsupervised gene set testing can provide important information about the biological signal held in high-dimensional genomic data sets. Because it uses the association between gene sets and samples PCs to generate a measure of unsupervised enrichment, the SGSE method is independent of cluster or network creation algorithms and, most importantly, is able to utilize the statistical significance of PC eigenvalues to ignore elements of the data most likely to represent noise.
Aberrant Gene Expression in Humans

PubMed Central

Yang, Ence; Ji, Guoli; Brinkmeyer-Langford, Candice L.; Cai, James J.

2015-01-01

Gene expression as an intermediate molecular phenotype has been a focus of research interest. In particular, studies of expression quantitative trait loci (eQTL) have offered promise for understanding gene regulation through the discovery of genetic variants that explain variation in gene expression levels. Existing eQTL methods are designed for assessing the effects of common variants, but not rare variants. Here, we address the problem by establishing a novel analytical framework for evaluating the effects of rare or private variants on gene expression. Our method starts from the identification of outlier individuals that show markedly different gene expression from the majority of a population, and then reveals the contributions of private SNPs to the aberrant gene expression in these outliers. Using population-scale mRNA sequencing data, we identify outlier individuals using a multivariate approach. We find that outlier individuals are more readily detected with respect to gene sets that include genes involved in cellular regulation and signal transduction, and less likely to be detected with respect to the gene sets with genes involved in metabolic pathways and other fundamental molecular functions. Analysis of polymorphic data suggests that private SNPs of outlier individuals are enriched in the enhancer and promoter regions of corresponding aberrantly-expressed genes, suggesting a specific regulatory role of private SNPs, while the commonly-occurring regulatory genetic variants (i.e., eQTL SNPs) show little evidence of involvement. Additional data suggest that non-genetic factors may also underlie aberrant gene expression. Taken together, our findings advance a novel viewpoint relevant to situations wherein common eQTLs fail to predict gene expression when heritable, rare inter-individual variation exists. The analytical framework we describe, taking into consideration the reality of differential phenotypic robustness, may be valuable for investigating
Gene expression analysis of pancreatic cell lines reveals genes overexpressed in pancreatic cancer.

PubMed

Alldinger, Ingo; Dittert, Dag; Peiper, Matthias; Fusco, Alberto; Chiappetta, Gennaro; Staub, Eike; Lohr, Matthias; Jesnowski, Ralf; Baretton, Gustavo; Ockert, Detlef; Saeger, Hans-Detlev; Grützmann, Robert; Pilarsky, Christian

2005-01-01

Pancreatic cancer is one of the leading causes of cancer-related death. Using DNA gene expression analysis based on a custom made Affymetrix cancer array, we investigated the expression pattern of both primary and established pancreatic carcinoma cell lines. We analyzed the gene expression of 5 established pancreatic cancer cell lines (AsPC-1, BxPC-3, Capan-1, Capan-2 and HPAF II) and 5 primary isolates, 1 of them derived from benign pancreatic duct cells. Out of 1,540 genes which were expressed in at least 3 experiments, we found 122 genes upregulated and 18 downregulated in tumor cell lines compared to benign cells with a fold change >3. Several of the upregulated genes (like Prefoldin 5, ADAM9 and E-cadherin) have been associated with pancreatic cancer before. The other differentially regulated genes, however, play a so far unknown role in the course of human pancreatic carcinoma. By means of immunohistochemistry we could show that thymosin beta-10 (TMSB10), upregulated in tumor cell lines, is expressed in human pancreatic carcinoma, but not in non-neoplastic pancreatic tissue, suggesting a role for TMSB10 in the carcinogenesis of pancreatic carcinoma. Using gene expression profiling of pancreatic cell lines we were able to identify genes differentially expressed in pancreatic adenocarcinoma, which might contribute to pancreatic cancer development. Copyright 2005 S. Karger AG, Basel.
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks

DOE PAGES

Peng, Jiajie; Uygun, Sahra; Kim, Taehyong; ...

2015-02-14

Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstratemore » that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.« less
Candidate qRT-PCR reference genes for barley that demonstrate better stability than traditional housekeeping genes

USDA-ARS?s Scientific Manuscript database

Gene transcript expression analysis is a useful tool for correlating gene activity with plant phenotype. For these studies, an appropriate reference gene is necessary to quantify the expression of target genes. Classic housekeeping genes have often been used for this purpose, but may not be consis...
Familial aggregation analysis of gene expressions

PubMed Central

Rao, Shao-Qi; Xu, Liang-De; Zhang, Guang-Mei; Li, Xia; Li, Lin; Shen, Gong-Qing; Jiang, Yang; Yang, Yue-Ying; Gong, Bin-Sheng; Jiang, Wei; Zhang, Fan; Xiao, Yun; Wang, Qing K

2007-01-01

Traditional studies of familial aggregation are aimed at defining the genetic (and non-genetic) causes of a disease from physiological or clinical traits. However, there has been little attempt to use genome-wide gene expressions, the direct phenotypic measures of genes, as the traits to investigate several extended issues regarding the distributions of familially aggregated genes on chromosomes or in functions. In this study we conducted a genome-wide familial aggregation analysis by using the in vitro cell gene expressions of 3300 human autosome genes (Problem 1 data provided to Genetic Analysis Workshop 15) in order to answer three basic genetics questions. First, we investigated how gene expressions aggregate among different types (degrees) of relative pairs. Second, we conducted a bioinformatics analysis of highly familially aggregated genes to see how they are distributed on chromosomes. Third, we performed a gene ontology enrichment test of familially aggregated genes to find evidence to support their functional consensus. The results indicated that 1) gene expressions did aggregate in families, especially between sibs. Of 3300 human genes analyzed, there were a total of 1105 genes with one or more significant (empirical p < 0.05) familial correlation; 2) there were several genomic hot spots where highly familially aggregated genes (e.g., the chromosome 6 HLA genes cluster) were clustered; 3) as we expected, gene ontology enrichment tests revealed that the 1105 genes were aggregating not only in families but also in functional categories. PMID:18466548
Genome-Wide Comparative Gene Family Classification

PubMed Central

Frech, Christian; Chen, Nansheng

2010-01-01

Correct classification of genes into gene families is important for understanding gene function and evolution. Although gene families of many species have been resolved both computationally and experimentally with high accuracy, gene family classification in most newly sequenced genomes has not been done with the same high standard. This project has been designed to develop a strategy to effectively and accurately classify gene families across genomes. We first examine and compare the performance of computer programs developed for automated gene family classification. We demonstrate that some programs, including the hierarchical average-linkage clustering algorithm MC-UPGMA and the popular Markov clustering algorithm TRIBE-MCL, can reconstruct manual curation of gene families accurately. However, their performance is highly sensitive to parameter setting, i.e. different gene families require different program parameters for correct resolution. To circumvent the problem of parameterization, we have developed a comparative strategy for gene family classification. This strategy takes advantage of existing curated gene families of reference species to find suitable parameters for classifying genes in related genomes. To demonstrate the effectiveness of this novel strategy, we use TRIBE-MCL to classify chemosensory and ABC transporter gene families in C. elegans and its four sister species. We conclude that fully automated programs can establish biologically accurate gene families if parameterized accordingly. Comparative gene family classification finds optimal parameters automatically, thus allowing rapid insights into gene families of newly sequenced species. PMID:20976221
Molecular transformation, gene cloning, and gene expression systems for filamentous fungi

USGS Publications Warehouse

Gold, Scott E.; Duick, John W.; Redman, Regina S.; Rodriguez, Rusty J.

2001-01-01

This chapter discusses the molecular transformation, gene cloning, and gene expression systems for filamentous fungi. Molecular transformation involves the movement of discrete amounts of DNA into cells, the expression of genes on the transported DNA, and the sustainable replication of the transforming DNA. The ability to transform fungi is dependent on the stable replication and expression of genes located on the transforming DNA. Three phenomena observed in bacteria, that is, competence, plasmids, and restriction enzymes to facilitate cloning, were responsible for the development of molecular transformation in fungi. Initial transformation success with filamentous fungi, involving the complementation of auxotrophic mutants by exposure to sheared genomic DNA or RNA from wt isolates, occurred with low transformation efficiencies. In addition, it was difficult to retrieve complementing DNA fragments and isolate genes of interest. This prompted the development of transformation vectors and methods to increase efficiencies. The physiological studies performed with fungi indicated that the cell wall could be removed to generate protoplasts. It was evident that protoplasts could be transformed with significantly greater efficiencies than walled cells.
Identification of Reference Genes in Human Myelomonocytic Cells for Gene Expression Studies in Altered Gravity

PubMed Central

Thiel, Cora S.; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E.

2015-01-01

Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes (“housekeeping genes”) are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity. PMID:25654098
Identification of Suitable Reference Genes for Gene Expression Normalization in qRT-PCR Analysis in Watermelon

PubMed Central

Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

2014-01-01

Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT–PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT–PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT–PCR analyses involving watermelon. PMID:24587403
Identification of suitable reference genes for gene expression normalization in qRT-PCR analysis in watermelon.

PubMed

Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

2014-01-01

Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT-PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT-PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT-PCR analyses involving watermelon.

Evaluation of androgen receptor gene as a candidate gene in female androgenetic alopecia.

PubMed

el-Samahy, May H; Shaheen, Maha A; Saddik, Dina E B; Abdel-Fattah, Nermeen S A; el-Sawi, Mohammad A; Mahran, Manal Z; Shehab, Abeer A A

2009-06-01

Genetic polymorphisms of the androgen receptor (AR) gene have been studied in male androgenetic alopecia (AGA); however, little is known about gene polymorphism and female AGA. To evaluate the AR gene as a candidate gene for female AGA. Thirty premenopausal Egyptian female patients with AGA (mean age, 32.3 +/- 7 years) and 11 age- and sex-matched controls were included. All subjects underwent laboratory and pelvic ultrasound evaluation to exclude other precipitating cause(s) of hair loss. Scalp biopsy was taken and the AR gene was evaluated using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). According to Ludwig's classification, all patients had type II AGA. Statistical analysis showed no statistically significant difference in genotype (chi(2) = 5.513, P > or = 0.05) or allele frequency (chi(2) = 1.312, P > or = 0.05) between patients and controls. There was also no statistically significant difference between the genotype and allele frequency with disease duration. In contrast with male AGA, no association was found between type II AGA in Egyptian women and the AR gene. Therefore, the genetic study of this gene does not serve as a biomarker for the identification of women with a predisposition to AGA.
An 'instant gene bank' method for gene cloning by mutant complementation.

PubMed

Gems, D; Aleksenko, A; Belenky, L; Robertson, S; Ramsden, M; Vinetski, Y; Clutterbuck, A J

1994-02-01

We describe a new method of gene cloning by complementation of mutant alleles which obviates the need for construction of a gene library in a plasmid vector in vitro and its amplification in Escherichia coli. The method involves simultaneous transformation of mutant strains of the fungus Aspergillus nidulans with (i) fragmented chromosomal DNA from a donor species and (ii) DNA of a plasmid without a selectable marker gene, but with a fungal origin of DNA replication ('helper plasmid'). Transformant colonies appear as the result of the joining of chromosomal DNA fragments carrying the wild-type copies of the mutant allele with the helper plasmid. Joining may occur either by ligation (if the helper plasmid is in linear form) or recombination (if it is cccDNA). This event occurs with high efficiency in vivo, and generates an autonomously replicating plasmid cointegrate. Transformants containing Penicillium chrysogenum genomic DNA complementing A. nidulans niaD, nirA and argB mutations have been obtained. While some of these cointegrates were evidently rearranged or consisted only of unaltered replicating plasmid, in other cases plasmids could be recovered into E. coli and were subsequently shown to contain the selected gene. The utility of this "instant gene bank" technique is demonstrated here by the molecular cloning of the P. canescens trpC gene.
A Combinatorial Approach to Detecting Gene-Gene and Gene-Environment Interactions in Family Studies

PubMed Central

Lou, Xiang-Yang; Chen, Guo-Bo; Yan, Lei; Ma, Jennie Z.; Mangold, Jamie E.; Zhu, Jun; Elston, Robert C.; Li, Ming D.

2008-01-01

Widespread multifactor interactions present a significant challenge in determining risk factors of complex diseases. Several combinatorial approaches, such as the multifactor dimensionality reduction (MDR) method, have emerged as a promising tool for better detecting gene-gene (G × G) and gene-environment (G × E) interactions. We recently developed a general combinatorial approach, namely the generalized multifactor dimensionality reduction (GMDR) method, which can entertain both qualitative and quantitative phenotypes and allows for both discrete and continuous covariates to detect G × G and G × E interactions in a sample of unrelated individuals. In this article, we report the development of an algorithm that can be used to study G × G and G × E interactions for family-based designs, called pedigree-based GMDR (PGMDR). Compared to the available method, our proposed method has several major improvements, including allowing for covariate adjustments and being applicable to arbitrary phenotypes, arbitrary pedigree structures, and arbitrary patterns of missing marker genotypes. Our Monte Carlo simulations provide evidence that the PGMDR method is superior in performance to identify epistatic loci compared to the MDR-pedigree disequilibrium test (PDT). Finally, we applied our proposed approach to a genetic data set on tobacco dependence and found a significant interaction between two taste receptor genes (i.e., TAS2R16 and TAS2R38) in affecting nicotine dependence. PMID:18834969
[Differential expression genes of bone tissues surrounding implants in diabetic rats by gene chip].

PubMed

Wang, Xin-xin; Ma, Yue; Li, Qing; Jiang, Bao-qi; Lan, Jing

2012-10-01

To compare mRNA expression profiles of bone tissues surrounding implants between normal rats and rats with diabetes using microarray technology. Six Wistar rats were randomly selected and divided into normal model group and diabetic group. Diabetic model condition was established by injecting Streptozotocin into peritoneal space. Titanium implants were implanted into the epiphyseal end of the rats' tibia. Bone tissues surrounding implant were harvested and sampled after 3 months to perform comprehensive RNA gene expression profiling, including 17983 for genome-wide association study.GO analysis was used to compare different gene expression and real-time PCR was used to confirm the results on core samples. The results indicated that there were 1084 differential gene expression. In the diabetic model, there were 352 enhanced expression genes, 732 suppressed expression genes. GO analysis involved 1154 different functional type. Osteoblast related gene expressions in bone tissue samples of diabetic rats were decreased, and lipid metabolism pathway related gene expression was increased.
[High gene conversion frequency between genes encoding 2-deoxyglucose-6-phosphate phosphatase in 3 Saccharomyces species].

PubMed

Piscopo, Sara-Pier; Drouin, Guy

2014-05-01

Gene conversions are nonreciprocal sequence exchanges between genes. They are relatively common in Saccharomyces cerevisiae, but few studies have investigated the evolutionary fate of gene conversions or their functional impacts. Here, we analyze the evolution and impact of gene conversions between the two genes encoding 2-deoxyglucose-6-phosphate phosphatase in S. cerevisiae, Saccharomyces paradoxus and Saccharomyces mikatae. Our results demonstrate that the last half of these genes are subject to gene conversions among these three species. The greater similarity and the greater percentage of GC nucleotides in the converted regions, as well as the absence of long regions of adjacent common converted sites, suggest that these gene conversions are frequent and occur independently in all three species. The high frequency of these conversions probably result from the fact that they have little impact on the protein sequences encoded by these genes.
Identification of Candidate B-Lymphoma Genes by Cross-Species Gene Expression Profiling

PubMed Central

Tompkins, Van S.; Han, Seong-Su; Olivier, Alicia; Syrbu, Sergei; Bair, Thomas; Button, Anna; Jacobus, Laura; Wang, Zebin; Lifton, Samuel; Raychaudhuri, Pradip; Morse, Herbert C.; Weiner, George; Link, Brian; Smith, Brian J.; Janz, Siegfried

2013-01-01

Comparative genome-wide expression profiling of malignant tumor counterparts across the human-mouse species barrier has a successful track record as a gene discovery tool in liver, breast, lung, prostate and other cancers, but has been largely neglected in studies on neoplasms of mature B-lymphocytes such as diffuse large B cell lymphoma (DLBCL) and Burkitt lymphoma (BL). We used global gene expression profiles of DLBCL-like tumors that arose spontaneously in Myc-transgenic C57BL/6 mice as a phylogenetically conserved filter for analyzing the human DLBCL transcriptome. The human and mouse lymphomas were found to have 60 concordantly deregulated genes in common, including 8 genes that Cox hazard regression analysis associated with overall survival in a published landmark dataset of DLBCL. Genetic network analysis of the 60 genes followed by biological validation studies indicate FOXM1 as a candidate DLBCL and BL gene, supporting a number of studies contending that FOXM1 is a therapeutic target in mature B cell tumors. Our findings demonstrate the value of the “mouse filter” for genomic studies of human B-lineage neoplasms for which a vast knowledge base already exists. PMID:24130802
Evolution of the bovine lysozyme gene family: changes in gene expression and reversion of function.

PubMed

Irwin, D M

1995-09-01

Recruitment of lysozyme to a digestive function in ruminant artiodactyls is associated with amplification of the gene. At least four of the approximately ten genes are expressed in the stomach, and several are expressed in nonstomach tissues. Characterization of additional lysozymelike sequences in the bovine genome has identified most, if not all, of the members of this gene family. There are at least six stomachlike lysozyme genes, two of which are pseudogenes. The stomach lysozyme pseudogenes show a pattern of concerted evolution similar to that of the functional stomach genes. At least four nonstomach lysozyme genes exist. The nonstomach lysozyme genes are not monophyletic. A gene encoding a tracheal lysozyme was isolated, and the stomach lysozyme of advanced ruminants was found to be more closely related to the tracheal lysozyme than to the stomach lysozyme of the camel or other nonstomach lysozyme genes of ruminants. The tracheal lysozyme shares with stomach lysozymes of advanced ruminants the deletion of amino acid 103, and several other adaptive sequence characteristics of stomach lysozymes. I suggest here that tracheal lysozyme has reverted from a functional stomach lysozyme. Tracheal lysozyme then represents a second instance of a change in lysozyme gene expression and function within ruminants.
Adenovirus-Mediated p202 Gene Transfer in Breast Cancer Gene Therapy

DTIC Science & Technology

2005-05-01

transcriptional regulation of genes important for cell cycle control, differentiation, and apoptosis (1, 3, 4). Our previous studies have shown that p202...leads to induction of p53 and activation of p53 target gene (e.g., p21 CIP 1). 10. The positive regulation of p53 by IFIXcd can be observed only in...cancers. Together, our data suggest that both Ad-p202 and IFIX may be further developed into efficient therapeutic agents for human cancer gene
Investigation of candidate genes for osteoarthritis based on gene expression profiles.

PubMed

Dong, Shuanghai; Xia, Tian; Wang, Lei; Zhao, Qinghua; Tian, Jiwei

2016-12-01

To explore the mechanism of osteoarthritis (OA) and provide valid biological information for further investigation. Gene expression profile of GSE46750 was downloaded from Gene Expression Omnibus database. The Linear Models for Microarray Data (limma) package (Bioconductor project, http://www.bioconductor.org/packages/release/bioc/html/limma.html) was used to identify differentially expressed genes (DEGs) in inflamed OA samples. Gene Ontology function enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways enrichment analysis of DEGs were performed based on Database for Annotation, Visualization and Integrated Discovery data, and protein-protein interaction (PPI) network was constructed based on the Search Tool for the Retrieval of Interacting Genes/Proteins database. Regulatory network was screened based on Encyclopedia of DNA Elements. Molecular Complex Detection was used for sub-network screening. Two sub-networks with highest node degree were integrated with transcriptional regulatory network and KEGG functional enrichment analysis was processed for 2 modules. In total, 401 up- and 196 down-regulated DEGs were obtained. Up-regulated DEGs were involved in inflammatory response, while down-regulated DEGs were involved in cell cycle. PPI network with 2392 protein interactions was constructed. Moreover, 10 genes including Interleukin 6 (IL6) and Aurora B kinase (AURKB) were found to be outstanding in PPI network. There are 214 up- and 8 down-regulated transcription factor (TF)-target pairs in the TF regulatory network. Module 1 had TFs including SPI1, PRDM1, and FOS, while module 2 contained FOSL1. The nodes in module 1 were enriched in chemokine signaling pathway, while the nodes in module 2 were mainly enriched in cell cycle. The screened DEGs including IL6, AGT, and AURKB might be potential biomarkers for gene therapy for OA by being regulated by TFs such as FOS and SPI1, and participating in the cell cycle and cytokine-cytokine receptor
T-cell lymphomas associated gene expression signature: Bioinformatics analysis based on gene expression Omnibus.

PubMed

Zhou, Lei-Lei; Xu, Xiao-Yue; Ni, Jie; Zhao, Xia; Zhou, Jian-Wei; Feng, Ji-Feng

2018-06-01

Due to the low incidence and the heterogeneity of subtypes, the biological process of T-cell lymphomas is largely unknown. Although many genes have been detected in T-cell lymphomas, the role of these genes in biological process of T-cell lymphomas was not further analyzed. Two qualified datasets were downloaded from Gene Expression Omnibus database. The biological functions of differentially expressed genes were evaluated by gene ontology enrichment and KEGG pathway analysis. The network for intersection genes was constructed by the cytoscape v3.0 software. Kaplan-Meier survival curves and log-rank test were employed to assess the association between differentially expressed genes and clinical characters. The intersection mRNAs were proved to be associated with fundamental processes of T-cell lymphoma cells. These intersection mRNAs were involved in the activation of some cancer-related pathways, including PI3K/AKT, Ras, JAK-STAT, and NF-kappa B signaling pathway. PDGFRA, CXCL12, and CCL19 were the most significant central genes in the signal-net analysis. The results of survival analysis are not entirely credible. Our findings uncovered aberrantly expressed genes and a complex RNA signal network in T-cell lymphomas and indicated cancer-related pathways involved in disease initiation and progression, providing a new insight for biotargeted therapy in T-cell lymphomas. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
The role of gene-gene interaction in the prediction of criminal behavior.

PubMed

Boutwell, Brian B; Menard, Scott; Barnes, J C; Beaver, Kevin M; Armstrong, Todd A; Boisvert, Danielle

2014-04-01

A host of research has examined the possibility that environmental risk factors might condition the influence of genes on various outcomes. Less research, however, has been aimed at exploring the possibility that genetic factors might interact to impact the emergence of human traits. Even fewer studies exist examining the interaction of genes in the prediction of behavioral outcomes. The current study expands this body of research by testing the interaction between genes involved in neural transmission. Our findings suggest that certain dopamine genes interact to increase the odds of criminogenic outcomes in a national sample of Americans. Copyright © 2014 Elsevier Inc. All rights reserved.
Gene-gene interactions among genetic variants from obesity candidate genes for nonobese and obese populations in type 2 diabetes.

PubMed

Lin, Eugene; Pei, Dee; Huang, Yi-Jen; Hsieh, Chang-Hsun; Wu, Lawrence Shih-Hsin

2009-08-01

Recent studies indicate that obesity may play a key role in modulating genetic predispositions to type 2 diabetes (T2D). This study examines the main effects of both single-locus and multilocus interactions among genetic variants in Taiwanese obese and nonobese individuals to test the hypothesis that obesity-related genes may contribute to the etiology of T2D independently and/or through such complex interactions. We genotyped 11 single nucleotide polymorphisms for 10 obesity candidate genes including adrenergic beta-2-receptor surface, adrenergic beta-3-receptor surface, angiotensinogen, fat mass and obesity associated gene, guanine nucleotide binding protein beta polypeptide 3 (GNB3), interleukin 6 receptor, proprotein convertase subtilisin/kexin type 1 (PCSK1), uncoupling protein 1, uncoupling protein 2, and uncoupling protein 3. There were 389 patients diagnosed with T2D and 186 age- and sex-matched controls. Single-locus analyses showed significant main effects of the GNB3 and PCSK1 genes on the risk of T2D among the nonobese group (p = 0.002 and 0.047, respectively). Further, interactions involving GNB3 and PCSK1 were suggested among the nonobese population using the generalized multifactor dimensionality reduction method (p = 0.001). In addition, interactions among angiotensinogen, fat mass and obesity associated gene, GNB3, and uncoupling protein 3 genes were found in a significant four-locus generalized multifactor dimensionality reduction model among the obese population (p = 0.001). The results suggest that the single nucleotide polymorphisms from the obesity candidate genes may contribute to the risk of T2D independently and/or in an interactive manner according to the presence or absence of obesity.
Homology-dependent Gene Silencing in Paramecium

PubMed Central

Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa

1998-01-01

Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389
Using RNA-Seq data to select refence genes for normalizing gene expression in apple roots

USDA-ARS?s Scientific Manuscript database

Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for t...
Identification of new genes in a cell envelope-cell division gene cluster of Escherichia coli: cell envelope gene murG.

PubMed Central

Salmond, G P; Lutkenhaus, J F; Donachie, W D

1980-01-01

We report the identification, cloning, and mapping of a new cell envelope gene, murG. This lies in a group of five genes of similar phenotype (in the order murE murF murG murC ddl) all concerned with peptidoglycan biosynthesis. This group is in a larger cluster of at least 10 genes, all of which are involved in some way with cell envelope growth. Images PMID:6998962
Reading and Generalist Genes

ERIC Educational Resources Information Center

Haworth, Claire M. A.; Meaburn, Emma L.; Harlaar, Nicole; Plomin, Robert

2007-01-01

Twin-study research suggests that many (but not all) of the same genes contribute to genetic influence on diverse learning abilities and disabilities, a hypothesis called "generalist genes". This generalist genes hypothesis was tested using a set of 10 DNA markers (single nucleotide polymorphisms [SNPs]) found to be associated with early reading…
Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

PubMed

Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

2017-09-01

The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative
Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

PubMed

Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

2018-04-23

Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis
Genes encoding cuticular proteins are components of the Nimrod gene cluster in Drosophila.

PubMed

Cinege, Gyöngyi; Zsámboki, János; Vidal-Quadras, Maite; Uv, Anne; Csordás, Gábor; Honti, Viktor; Gábor, Erika; Hegedűs, Zoltán; Varga, Gergely I B; Kovács, Attila L; Juhász, Gábor; Williams, Michael J; Andó, István; Kurucz, Éva

2017-08-01

The Nimrod gene cluster, located on the second chromosome of Drosophila melanogaster, is the largest synthenic unit of the Drosophila genome. Nimrod genes show blood cell specific expression and code for phagocytosis receptors that play a major role in fruit fly innate immune functions. We previously identified three homologous genes (vajk-1, vajk-2 and vajk-3) located within the Nimrod cluster, which are unrelated to the Nimrod genes, but are homologous to a fourth gene (vajk-4) located outside the cluster. Here we show that, unlike the Nimrod candidates, the Vajk proteins are expressed in cuticular structures of the late embryo and the late pupa, indicating that they contribute to cuticular barrier functions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis.

PubMed

Thomassen, Mads; Tan, Qihua; Kruse, Torben A

2009-01-01

Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes includes RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.

The evolution of milk casein genes from tooth genes before the origin of mammals.

PubMed

Kawasaki, Kazuhiko; Lafont, Anne-Gaelle; Sire, Jean-Yves

2011-07-01

Caseins are among cardinal proteins that evolved in the lineage leading to mammals. In milk, caseins and calcium phosphate (CaP) form a huge complex called casein micelle. By forming the micelle, milk maintains high CaP concentrations, which help altricial mammalian neonates to grow bone and teeth. Two types of caseins are known. Ca-sensitive caseins (α(s)- and β-caseins) bind Ca but precipitate at high Ca concentrations, whereas Ca-insensitive casein (κ-casein) does not usually interact with Ca but instead stabilizes the micelle. Thus, it is thought that these two types of caseins are both necessary for stable micelle formation. Both types of caseins show high substitution rates, which make it difficult to elucidate the evolution of caseins. Yet, recent studies have revealed that all casein genes belong to the secretory calcium-binding phosphoprotein (SCPP) gene family that arose by gene duplication. In the present study, we investigated exon-intron structures and phylogenetic distributions of casein and other SCPP genes, particularly the odontogenic ameloblast-associated (ODAM) gene, the SCPP-Pro-Gln-rich 1 (SCPPPQ1) gene, and the follicular dendritic cell secreted peptide (FDCSP) gene. The results suggest that contemporary Ca-sensitive casein genes arose from a putative common ancestor, which we refer to as CSN1/2. The six putative exons comprising CSN1/2 are all found in SCPPPQ1, although ODAM also shares four of these exons. By contrast, the five exons of the Ca-insensitive casein gene are all reminiscent of FDCSP. The phylogenetic distribution of these genes suggests that both SCPPPQ1 and FDCSP arose from ODAM. We thus argue that all casein genes evolved from ODAM via two different pathways; Ca-sensitive casein genes likely originated directly from SCPPPQ1, whereas the Ca-insensitive casein genes directly differentiated from FDCSP. Further, expression of ODAM, SCPPPQ1, and FDCSP was detected in dental tissues, supporting the idea that both types of caseins
Identification of genes associated with renal cell carcinoma using gene expression profiling analysis.

PubMed

Yao, Ting; Wang, Qinfu; Zhang, Wenyong; Bian, Aihong; Zhang, Jinping

2016-07-01

Renal cell carcinoma (RCC) is the most common type of kidney cancer in adults and accounts for ~80% of all kidney cancer cases. However, the pathogenesis of RCC has not yet been fully elucidated. To interpret the pathogenesis of RCC at the molecular level, gene expression data and bio-informatics methods were used to identify RCC associated genes. Gene expression data was downloaded from Gene Expression Omnibus (GEO) database and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in RCC patients compared with controls. In addition, a regulatory network was constructed using the known regulatory data between transcription factors (TFs) and target genes in the University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) and the regulatory impact factor of each TF was calculated. A total of 258,0427 pairs of DCGs were identified. The regulatory network contained 1,525 pairs of regulatory associations between 126 TFs and 1,259 target genes and these genes were mainly enriched in cancer pathways, ErbB and MAPK. In the regulatory network, the 10 most strongly associated TFs were FOXC1, GATA3, ESR1, FOXL1, PATZ1, MYB, STAT5A, EGR2, EGR3 and PELP1. GATA3, ERG and MYB serve important roles in RCC while FOXC1, ESR1, FOXL1, PATZ1, STAT5A and PELP1 may be potential genes associated with RCC. In conclusion, the present study constructed a regulatory network and screened out several TFs that may be used as molecular biomarkers of RCC. However, future studies are needed to confirm the findings of the present study.
Identification of genes associated with renal cell carcinoma using gene expression profiling analysis

PubMed Central

YAO, TING; WANG, QINFU; ZHANG, WENYONG; BIAN, AIHONG; ZHANG, JINPING

2016-01-01

Renal cell carcinoma (RCC) is the most common type of kidney cancer in adults and accounts for ~80% of all kidney cancer cases. However, the pathogenesis of RCC has not yet been fully elucidated. To interpret the pathogenesis of RCC at the molecular level, gene expression data and bio-informatics methods were used to identify RCC associated genes. Gene expression data was downloaded from Gene Expression Omnibus (GEO) database and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in RCC patients compared with controls. In addition, a regulatory network was constructed using the known regulatory data between transcription factors (TFs) and target genes in the University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) and the regulatory impact factor of each TF was calculated. A total of 258,0427 pairs of DCGs were identified. The regulatory network contained 1,525 pairs of regulatory associations between 126 TFs and 1,259 target genes and these genes were mainly enriched in cancer pathways, ErbB and MAPK. In the regulatory network, the 10 most strongly associated TFs were FOXC1, GATA3, ESR1, FOXL1, PATZ1, MYB, STAT5A, EGR2, EGR3 and PELP1. GATA3, ERG and MYB serve important roles in RCC while FOXC1, ESR1, FOXL1, PATZ1, STAT5A and PELP1 may be potential genes associated with RCC. In conclusion, the present study constructed a regulatory network and screened out several TFs that may be used as molecular biomarkers of RCC. However, future studies are needed to confirm the findings of the present study. PMID:27347102
Recent advances in the use of ZFN-mediated gene editing for human gene therapy.

PubMed

Chandrasegaran, Srinivasan

2017-01-01

Targeted genome editing with programmable nucleases has revolutionized biomedical research. The ability to make site-specific modifications to the human genome, has invoked a paradigm shift in gene therapy. Using gene editing technologies, the sequence in the human genome can now be precisely engineered to achieve a therapeutic effect. Zinc finger nucleases (ZFNs) were the first programmable nucleases designed to target and cleave custom sites. This article summarizes the advances in the use of ZFN-mediated gene editing for human gene therapy and discusses the challenges associated with translating this gene editing technology into clinical use.
Genes with stable DNA methylation levels show higher evolutionary conservation than genes with fluctuant DNA methylation levels.

PubMed

Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai

2015-11-24

Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.
Study on the association between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes.

PubMed

Ma, Quan-Ping; Su, Liang; Liu, Jing-Wen; Yao, Ming-Xiao; Yuan, Guang-Ying

2018-06-01

The aim of the present study was to investigate the correlation between the multi‑drug resistance of Shigella flexneri and the drug‑resistant gene cassette carried by integrons; in the meanwhile, to detect the associations between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes, including marOR, acrR and soxS. A total of 158 isolates were isolated from the stool samples of 1,026 children with diarrhoea aged 14 years old between May 2012 and October 2015 in Henan. The K‑B method was applied for the determination of drug resistance of Shigella flexneri, and polymerase chain reaction amplification was used for class 1, 2 and 3 integrase genes. Enzyme digestion and sequence analysis were performed for the variable regions of positive strains. Based on the drug sensitivity assessment, multi‑drug resistant strains that were resistant to five or more antibiotics, and sensitive strains were selected for amplification. Their active efflux pump genes, acrA and acrB, and regulatory genes, marOR, acrR and soxS, were selected for sequencing. The results revealed that 91.1% of the 158 strains were multi‑resistant to ampicillin, chloramphenicol, tetracycline and streptomycin, and 69.6% of the strains were multi‑resistant to sulfamethoxazole/trimethoprim. The resistance to ceftazidime, ciprofloxacin and levofloxacin was <32.9%. All strains (100%) were sensitive to cefoxitin, cefoperazone/sulbactam and imipenem. The rate of the class 1 integron positivity was 91.9% (144/158). Among these class 1 integron‑positive strains, 18 strains exhibited the resistance gene cassette dfrV in the variable region of the strain, four strains exhibited dfrA17‑aadA5 in the variable region and 140 strains exhibited blaOXA‑30‑aadA1 in the variable region. Four strains showed no resistance gene in the variable regions. The rate of class 2 integron positivity was 86.1% (136/158), and all positive strains harboured the
Evolutionary analysis of the kinesin light chain genes in the yellow fever mosquito Aedes aegypti: gene duplication as a source for novel early zygotic genes.

PubMed

Biedler, James K; Tu, Zhijian

2010-07-08

The maternal zygotic transition marks the time at which transcription from the zygotic genome is initiated and a subset of maternal RNAs are progressively degraded in the developing embryo. A number of early zygotic genes have been identified in Drosophila melanogaster and comparisons to sequenced mosquito genomes suggest that some of these early zygotic genes such as bottleneck are fast-evolving or subject to turnover in dipteran insects. One objective of this study is to identify early zygotic genes from the yellow fever mosquito Aedes aegypti to study their evolution. We are also interested in obtaining early zygotic promoters that will direct transgene expression in the early embryo as part of a Medea gene drive system. Two novel early zygotic kinesin light chain genes we call AaKLC2.1 and AaKLC2.2 were identified by transcriptome sequencing of Aedes aegypti embryos at various time points. These two genes have 98% nucleotide and amino acid identity in their coding regions and show transcription confined to the early zygotic stage according to gene-specific RT-PCR analysis. These AaKLC2 genes have a paralogous gene (AaKLC1) in Ae. aegypti. Phylogenetic inference shows that an ortholog to the AaKLC2 genes is only found in the sequenced genome of Culex quinquefasciatus. In contrast, AaKLC1 gene orthologs are found in all three sequenced mosquito species including Anopheles gambiae. There is only one KLC gene in D. melanogaster and other sequenced holometabolous insects that appears to be similar to AaKLC1. Unlike AaKLC2, AaKLC1 is expressed in all life stages and tissues tested, which is consistent with the expression pattern of the An. gambiae and D. melanogaster KLC genes. Phylogenetic inference also suggests that AaKLC2 genes and their likely C. quinquefasciatus ortholog are fast-evolving genes relative to the highly conserved AaKLC1-like paralogs. Embryonic injection of a luciferase reporter under the control of a 1 kb fragment upstream of the AaKLC2.1 start
Nanoparticle-mediated gene delivery.

PubMed

Jin, Sha; Leach, John C; Ye, Kaiming

2009-01-01

Nonviral gene delivery has been gaining considerable attention recently. Although the efficacy of DNA transfection, which is a major concern, is low in nonviral vector-mediated gene transfer compared with viral ones, nonviral vectors are relatively easy to prepare, less immunogenic and oncogenic, and have no potential of virus recombination and no limitation on the size of a transferred gene. The ability to incorporate genetic materials such as plasmid DNA, RNA, and siRNA into functionalized nanoparticles with little toxicity demonstrates a new era in pharmacotherapy for delivering genes selectively to tissues and cells. In this chapter, we highlight the basic concepts and applications of nonviral gene delivery using super paramagnetic iron oxide nanoparticles and functionalized silica nanoparticles. The experimental protocols related to these topics are described in the chapter.
Gene Therapy for Skin Diseases

PubMed Central

Gorell, Emily; Nguyen, Ngon; Lane, Alfred; Siprashvili, Zurab

2014-01-01

The skin possesses qualities that make it desirable for gene therapy, and studies have focused on gene therapy for multiple cutaneous diseases. Gene therapy uses a vector to introduce genetic material into cells to alter gene expression, negating a pathological process. This can be accomplished with a variety of viral vectors or nonviral administrations. Although results are promising, there are several potential pitfalls that must be addressed to improve the safety profile to make gene therapy widely available clinically. PMID:24692191
Genes with minimal phylogenetic information are problematic for coalescent analyses when gene tree estimation is biased.

PubMed

Xi, Zhenxiang; Liu, Liang; Davis, Charles C

2015-11-01

The development and application of coalescent methods are undergoing rapid changes. One little explored area that bears on the application of gene-tree-based coalescent methods to species tree estimation is gene informativeness. Here, we investigate the accuracy of these coalescent methods when genes have minimal phylogenetic information, including the implementation of the multilocus bootstrap approach. Using simulated DNA sequences, we demonstrate that genes with minimal phylogenetic information can produce unreliable gene trees (i.e., high error in gene tree estimation), which may in turn reduce the accuracy of species tree estimation using gene-tree-based coalescent methods. We demonstrate that this problem can be alleviated by sampling more genes, as is commonly done in large-scale phylogenomic analyses. This applies even when these genes are minimally informative. If gene tree estimation is biased, however, gene-tree-based coalescent analyses will produce inconsistent results, which cannot be remedied by increasing the number of genes. In this case, it is not the gene-tree-based coalescent methods that are flawed, but rather the input data (i.e., estimated gene trees). Along these lines, the commonly used program PhyML has a tendency to infer one particular bifurcating topology even though it is best represented as a polytomy. We additionally corroborate these findings by analyzing the 183-locus mammal data set assembled by McCormack et al. (2012) using ultra-conserved elements (UCEs) and flanking DNA. Lastly, we demonstrate that when employing the multilocus bootstrap approach on this 183-locus data set, there is no strong conflict between species trees estimated from concatenation and gene-tree-based coalescent analyses, as has been previously suggested by Gatesy and Springer (2014). Copyright © 2015 Elsevier Inc. All rights reserved.
The Bacillus thuringiensis cyt Genes for Hemolytic Endotoxins Constitute a Gene Family

PubMed Central

Guerchicoff, Alejandra; Delécluse, Armelle; Rubinstein, Clara P.

2001-01-01

In the same way that cry genes, coding for larvicidal delta endotoxins, constitute a large and diverse gene family, the cyt genes for hemolytic toxins seem to compose another set of highly related genes in Bacillus thuringiensis. Although the occurrence of Cyt hemolytic factors in B. thuringiensis has been typically associated with mosquitocidal strains, we have recently shown that cyt genes are also present in strains with different pathotypes; this is the case for the morrisoni subspecies, which includes strains biologically active against dipteran, lepidopteran, and coleopteran larvae. In addition, while one Cyt type of protein has been described in all of the mosquitocidal strains studied so far, the present study confirms that at least two Cyt toxins coexist in the more toxic antidipteran strains, such as B. thuringiensis subsp. israelensis and subsp. morrisoni PG14, and that this could also be the case for many others. In fact, PCR screening and Western blot analysis of 50 B. thuringiensis strains revealed that cyt2-related genes are present in all strains with known antidipteran activity, as well as in some others with different or unknown host ranges. Partial DNA sequences for several of these genes were determined, and protein sequence alignments revealed a high degree of conservation of the structural domains. These findings point to an important biological role for Cyt toxins in the final in vivo toxic activity of many B. thuringiensis strains. PMID:11229896
Horizontal gene transfer of microbial cellulases into nematode genomes is associated with functional assimilation and gene turnover

PubMed Central

2011-01-01

Background Natural acquisition of novel genes from other organisms by horizontal or lateral gene transfer is well established for microorganisms. There is now growing evidence that horizontal gene transfer also plays important roles in the evolution of eukaryotes. Genome-sequencing and EST projects of plant and animal associated nematodes such as Brugia, Meloidogyne, Bursaphelenchus and Pristionchus indicate horizontal gene transfer as a key adaptation towards parasitism and pathogenicity. However, little is known about the functional activity and evolutionary longevity of genes acquired by horizontal gene transfer and the mechanisms favoring such processes. Results We examine the transfer of cellulase genes to the free-living and beetle-associated nematode Pristionchus pacificus, for which detailed phylogenetic knowledge is available, to address predictions by evolutionary theory for successful gene transfer. We used transcriptomics in seven Pristionchus species and three other related diplogastrid nematodes with a well-defined phylogenetic framework to study the evolution of ancestral cellulase genes acquired by horizontal gene transfer. We performed intra-species, inter-species and inter-genic analysis by comparing the transcriptomes of these ten species and tested for cellulase activity in each species. Species with cellulase genes in their transcriptome always exhibited cellulase activity indicating functional integration into the host's genome and biology. The phylogenetic profile of cellulase genes was congruent with the species phylogeny demonstrating gene longevity. Cellulase genes show notable turnover with elevated birth and death rates. Comparison by sequencing of three selected cellulase genes in 24 natural isolates of Pristionchus pacificus suggests these high evolutionary dynamics to be associated with copy number variations and positive selection. Conclusion We could demonstrate functional integration of acquired cellulase genes into the nematode
Horizontal gene transfer of microbial cellulases into nematode genomes is associated with functional assimilation and gene turnover.

PubMed

Mayer, Werner E; Schuster, Lisa N; Bartelmes, Gabi; Dieterich, Christoph; Sommer, Ralf J

2011-01-13

Natural acquisition of novel genes from other organisms by horizontal or lateral gene transfer is well established for microorganisms. There is now growing evidence that horizontal gene transfer also plays important roles in the evolution of eukaryotes. Genome-sequencing and EST projects of plant and animal associated nematodes such as Brugia, Meloidogyne, Bursaphelenchus and Pristionchus indicate horizontal gene transfer as a key adaptation towards parasitism and pathogenicity. However, little is known about the functional activity and evolutionary longevity of genes acquired by horizontal gene transfer and the mechanisms favoring such processes. We examine the transfer of cellulase genes to the free-living and beetle-associated nematode Pristionchus pacificus, for which detailed phylogenetic knowledge is available, to address predictions by evolutionary theory for successful gene transfer. We used transcriptomics in seven Pristionchus species and three other related diplogastrid nematodes with a well-defined phylogenetic framework to study the evolution of ancestral cellulase genes acquired by horizontal gene transfer. We performed intra-species, inter-species and inter-genic analysis by comparing the transcriptomes of these ten species and tested for cellulase activity in each species. Species with cellulase genes in their transcriptome always exhibited cellulase activity indicating functional integration into the host's genome and biology. The phylogenetic profile of cellulase genes was congruent with the species phylogeny demonstrating gene longevity. Cellulase genes show notable turnover with elevated birth and death rates. Comparison by sequencing of three selected cellulase genes in 24 natural isolates of Pristionchus pacificus suggests these high evolutionary dynamics to be associated with copy number variations and positive selection. We could demonstrate functional integration of acquired cellulase genes into the nematode's biology as predicted by theory
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.

PubMed

Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil

2007-11-29

Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.
Systematic Search for Gene-Gene Interaction Effect on Prostate Cancer Risk

DTIC Science & Technology

2013-07-01

Systematic Search for Gene-Gene Interaction 5a. CONTRACT NUMBER Effect on Prostate Cancer Risk 5b. GRANT NUMBER W81XWH-09-1-0488 5c. PROGRAM...Supported by this grant ) 1. Tao S, Wang Z, Feng J, Hsu FC, Jin G, Kin ST, Zhang Z, Gronberg H, Zheng, SL, Isaacs WB, XU J, Sun J. A Genome-Wide Search for...order interactions among estrogen- metabolism genes in sporadic breast cancer. Am J Hum Genet, 69, 138-47. 48. Marchini, J., Donnelly, P. and Cardon
Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis

DTIC Science & Technology

2013-08-01

AD_________________ (Leave blank) Award Number: W81HWH-10-1-0469 TITLE: Genetic Evaluation for the Scoliosis Gene(s) in Patients with...Neurofibromatosis 1 and Scoliosis PRINCIPAL INVESTIGATOR: David W. Polly, Jr., MD CONTRACTING ORGANIZATION: UNIVERSITY OF MINNESOTA Minneapolis, MN 55455...the Scoliosis Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis 5b. GRANT NUMBER W81HWH-10- -0469 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S
Integrated analysis of gene expression and methylation profiles of 48 candidate genes in breast cancer patients.

PubMed

Li, Zibo; Heng, Jianfu; Yan, Jinhua; Guo, Xinwu; Tang, Lili; Chen, Ming; Peng, Limin; Wu, Yepeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Wang, Jun

2016-11-01

Gene-specific methylation and expression have shown biological and clinical importance for breast cancer diagnosis and prognosis. Integrated analysis of gene methylation and gene expression may identify genes associated with biology mechanism and clinical outcome of breast cancer and aid in clinical management. Using high-throughput microfluidic quantitative PCR, we analyzed the expression profiles of 48 candidate genes in 96 Chinese breast cancer patients and investigated their correlation with gene methylation and associations with breast cancer clinical parameters. Breast cancer-specific gene expression alternation was found in 25 genes with significant expression difference between paired tumor and normal tissues. A total of 9 genes (CCND2, EGFR, GSTP1, PGR, PTGS2, RECK, SOX17, TNFRSF10D, and WIF1) showed significant negative correlation between methylation and gene expression, which were validated in the TCGA database. Total 23 genes (ACADL, APC, BRCA2, CADM1, CAV1, CCND2, CST6, EGFR, ESR2, GSTP1, ICAM5, NPY, PGR, PTGS2, RECK, RUNX3, SFRP1, SOX17, SYK, TGFBR2, TNFRSF10D, WIF1, and WRN) annotated with potential TFBSs in the promoter regions showed negative correlation between methylation and expression. In logistics regression analysis, 31 of the 48 genes showed improved performance in disease prediction with combination of methylation and expression coefficient. Our results demonstrated the complex correlation and the possible regulatory mechanisms between DNA methylation and gene expression. Integration analysis of methylation and expression of candidate genes could improve performance in breast cancer prediction. These findings would contribute to molecular characterization and identification of biomarkers for potential clinical applications.
[Progress in research on pathogenic genes and gene therapy for inherited retinal diseases].

PubMed

Zhu, Ling; Cao, Cong; Sun, Jiji; Gao, Tao; Liang, Xiaoyang; Nie, Zhipeng; Ji, Yanchun; Jiang, Pingping; Guan, Minxin

2017-02-10

Inherited retinal diseases (IRDs), including retinitis pigmentosa, Usher syndrome, Cone-Rod degenerations, inherited macular dystrophy, Leber's congenital amaurosis, Leber's hereditary optic neuropathy are the most common and severe types of hereditary ocular diseases. So far more than 200 pathogenic genes have been identified. With the growing knowledge of the genetics and mechanisms of IRDs, a number of gene therapeutic strategies have been developed in the laboratory or even entered clinical trials. Here the progress of IRD research on the pathogenic genes and therapeutic strategies, particularly gene therapy, are reviewed.
Photochemical internalization-mediated nonviral gene transfection: polyamine core-shell nanoparticles as gene carrier

NASA Astrophysics Data System (ADS)

Zamora, Genesis; Wang, Frederick; Sun, Chung-Ho; Trinidad, Anthony; Kwon, Young Jik; Cho, Soo Kyung; Berg, Kristian; Madsen, Steen J.; Hirschberg, Henry

2014-10-01

The overall objective of the research was to investigate the utility of photochemical internalization (PCI) for the enhanced nonviral transfection of genes into glioma cells. The PCI-mediated introduction of the tumor suppressor gene phosphatase and tensin homolog (PTEN) or the cytosine deaminase (CD) pro-drug activating gene into U87 or U251 glioma cell monolayers and multicell tumor spheroids were evaluated. In the study reported here, polyamine-DNA gene polyplexes were encapsulated in a nanoparticle (NP) with an acid degradable polyketal outer shell. These NP synthetically mimic the roles of viral capsid and envelope, which transport and release the gene, respectively. The effects of PCI-mediated suppressor and suicide genes transfection efficiency employing either "naked" polyplex cores alone or as NP-shelled cores were compared. PCI was performed with the photosensitizer AlPcS2a and λ=670-nm laser irradiance. The results clearly demonstrated that the PCI can enhance the delivery of both the PTEN or CD genes in human glioma cell monolayers and multicell tumor spheroids. The transfection efficiency, as measured by cell survival and inhibition of spheroid growth, was found to be significantly greater at suboptimal light and DNA levels for shelled NPs compared with polyamine-DNA polyplexes alone.
MyGeneFriends: A Social Network Linking Genes, Genetic Diseases, and Researchers

PubMed Central

Allot, Alexis; Chennen, Kirsley; Nevers, Yannis; Poidevin, Laetitia; Kress, Arnaud; Ripp, Raymond; Thompson, Julie Dawn; Poch, Olivier

2017-01-01

Background The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. Objective MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. Methods MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. Results MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user’s specific interests and provides an efficient way to share information with collaborators. Furthermore, the user’s behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. Conclusions We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. MyGeneFriends is available at lbgi

Discovering potential driver genes through an integrated model of somatic mutation profiles and gene functional information.

PubMed

Xi, Jianing; Wang, Minghui; Li, Ao

2017-09-26

The accumulating availability of next-generation sequencing data offers an opportunity to pinpoint driver genes that are causally implicated in oncogenesis through computational models. Despite previous efforts made regarding this challenging problem, there is still room for improvement in the driver gene identification accuracy. In this paper, we propose a novel integrated approach called IntDriver for prioritizing driver genes. Based on a matrix factorization framework, IntDriver can effectively incorporate functional information from both the interaction network and Gene Ontology similarity, and detect driver genes mutated in different sets of patients at the same time. When evaluated through known benchmarking driver genes, the top ranked genes of our result show highly significant enrichment for the known genes. Meanwhile, IntDriver also detects some known driver genes that are not found by the other competing approaches. When measured by precision, recall and F1 score, the performances of our approach are comparable or increased in comparison to the competing approaches.
Aldehyde Dehydrogenase Gene Superfamily in Populus: Organization and Expression Divergence between Paralogous Gene Pairs.

PubMed

Tian, Feng-Xia; Zang, Jian-Lei; Wang, Tan; Xie, Yu-Li; Zhang, Jin; Hu, Jian-Jun

2015-01-01

Aldehyde dehydrogenases (ALDHs) constitute a superfamily of NAD(P)+-dependent enzymes that catalyze the irreversible oxidation of a wide range of reactive aldehydes to their corresponding nontoxic carboxylic acids. ALDHs have been studied in many organisms from bacteria to mammals; however, no systematic analyses incorporating genome organization, gene structure, expression profiles, and cis-acting elements have been conducted in the model tree species Populus trichocarpa thus far. In this study, a comprehensive analysis of the Populus ALDH gene superfamily was performed. A total of 26 Populus ALDH genes were found to be distributed across 12 chromosomes. Genomic organization analysis indicated that purifying selection may have played a pivotal role in the retention and maintenance of PtALDH gene families. The exon-intron organizations of PtALDHs were highly conserved within the same family, suggesting that the members of the same family also may have conserved functionalities. Microarray data and qRT-PCR analysis indicated that most PtALDHs had distinct tissue-specific expression patterns. The specificity of cis-acting elements in the promoter regions of the PtALDHs and the divergence of expression patterns between nine paralogous PtALDH gene pairs suggested that gene duplications may have freed the duplicate genes from the functional constraints. The expression levels of some ALDHs were up- or down-regulated by various abiotic stresses, implying that the products of these genes may be involved in the adaptation of Populus to abiotic stresses. Overall, the data obtained from our investigation contribute to a better understanding of the complexity of the Populus ALDH gene superfamily and provide insights into the function and evolution of ALDH gene families in vascular plants.
Gene targeting in mosquito cells: a demonstration of 'knockout' technology in extrachromosomal gene arrays

PubMed Central

Eggleston, Paul; Zhao, Yuguang

2001-01-01

Background Gene targeting would offer a number of advantages over current transposon-based strategies for insect transformation. These include freedom from both position effects associated with quasi-random integration and concerns over transgene instability mediated by endogenous transposases, independence from phylogenetic restrictions on transposon mobility and the ability to generate gene knockouts. Results We describe here our initial investigations of gene targeting in the mosquito. The target site was a hygromycin resistance gene, stably maintained as part of an extrachromosomal array. Using a promoter-trap strategy to enrich for targeted events, a neomycin resistance gene was integrated into the target site. This resulted in knockout of hygromycin resistance concurrent with the expression of high levels of neomycin resistance from the resident promoter. PCR amplification of the targeted site generated a product that was specific to the targeted cell line and consistent with precise integration of the neomycin resistance gene into the 5' end of the hygromycin resistance gene. Sequencing of the PCR product and Southern analysis of cellular DNA subsequently confirmed this molecular structure. Conclusions These experiments provide the first demonstration of gene targeting in mosquito tissue and show that mosquito cells possess the necessary machinery to bring about precise integration of exogenous sequences through homologous recombination. Further development of these procedures and their extension to chromosomally located targets hold much promise for the exploitation of gene targeting in a wide range of medically and economically important insect species. PMID:11513755
Fine-scale mergers of chloroplast and mitochondrial genes create functional, transcompartmentally chimeric mitochondrial genes.

PubMed

Hao, Weilong; Palmer, Jeffrey D

2009-09-29

The mitochondrial genomes of flowering plants possess a promiscuous proclivity for taking up sequences from the chloroplast genome. All characterized chloroplast integrants exist apart from native mitochondrial genes, and only a few, involving chloroplast tRNA genes that have functionally supplanted their mitochondrial counterparts, appear to be of functional consequence. We developed a novel computational approach to search for homologous recombination (gene conversion) in a large number of sequences and applied it to 22 mitochondrial and chloroplast gene pairs, which last shared common ancestry some 2 billion years ago. We found evidence of recurrent conversion of short patches of mitochondrial genes by chloroplast homologs during angiosperm evolution, but no evidence of gene conversion in the opposite direction. All 9 putative conversion events involve the atp1/atpA gene encoding the alpha subunit of ATP synthase, which is unusually well conserved between the 2 organelles and the only shared gene that is widely sequenced across plant mitochondria. Moreover, all conversions were limited to the 2 regions of greatest nucleotide and amino acid conservation of atp1/atpA. These observations probably reflect constraints operating on both the occurrence and fixation of recombination between ancient homologs. These findings indicate that recombination between anciently related sequences is more frequent than previously appreciated and creates functional mitochondrial genes of chimeric origin. These results also have implications for the widespread use of mitochondrial atp1 in phylogeny reconstruction.
Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?

PubMed

Kaur, Simranjeet; Pociot, Flemming

2015-07-13

Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value < 10e-16), which highlights their importance in T1D. Functional annotation of T1D genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value < 10e-6). We also identified eight T1D genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.
Evaluation and selection of reliable reference genes for gene expression under abiotic stress in cotton (Gossypium hirsutum L.).

PubMed

Wang, Min; Wang, Qinglian; Zhang, Baohong

2013-11-01

Reference genes are critical for normalization of the gene expression level of target genes. The widely used housekeeping genes may change their expression levels at different tissue under different treatment or stress conditions. Therefore, systematical evaluation on the housekeeping genes is required for gene expression analysis. Up to date, no work was performed to evaluate the housekeeping genes in cotton under stress treatment. In this study, we chose 10 housekeeping genes to systematically assess their expression levels at two different tissues (leaves and roots) under two different abiotic stresses (salt and drought) with three different concentrations. Our results show that there is no best reference gene for all tissues at all stress conditions. The reliable reference gene should be selected based on a specific condition. For example, under salt stress, UBQ7, GAPDH and EF1A8 are better reference genes in leaves; TUA10, UBQ7, CYP1, GAPDH and EF1A8 were better in roots. Under drought stress, UBQ7, EF1A8, TUA10, and GAPDH showed less variety of expression level in leaves and roots. Thus, it is better to identify reliable reference genes first before performing any gene expression analysis. However, using a combination of housekeeping genes as reference gene may provide a new strategy for normalization of gene expression. In this study, we found that combination of four housekeeping genes worked well as reference genes under all the stress conditions. © 2013.
Systematic prediction of gene function in Arabidopsis thaliana using a probabilistic functional gene network

PubMed Central

Hwang, Sohyun; Rhee, Seung Y; Marcotte, Edward M; Lee, Insuk

2012-01-01

AraNet is a functional gene network for the reference plant Arabidopsis and has been constructed in order to identify new genes associated with plant traits. It is highly predictive for diverse biological pathways and can be used to prioritize genes for functional screens. Moreover, AraNet provides a web-based tool with which plant biologists can efficiently discover novel functions of Arabidopsis genes (http://www.functionalnet.org/aranet/). This protocol explains how to conduct network-based prediction of gene functions using AraNet and how to interpret the prediction results. Functional discovery in plant biology is facilitated by combining candidate prioritization by AraNet with focused experimental tests. PMID:21886106
Cloning of novel rice blast resistance genes from two rapidly evolving NBS-LRR gene families in rice.

PubMed

Guo, Changjiang; Sun, Xiaoguang; Chen, Xiao; Yang, Sihai; Li, Jing; Wang, Long; Zhang, Xiaohui

2016-01-01

Most rice blast resistance genes (R-genes) encode proteins with nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domains. Our previous study has shown that more rice blast R-genes can be cloned in rapidly evolving NBS-LRR gene families. In the present study, two rapidly evolving R-gene families in rice were selected for cloning a subset of genes from their paralogs in three resistant rice lines. A total of eight functional blast R-genes were identified among nine NBS-LRR genes, and some of these showed resistance to three or more blast strains. Evolutionary analysis indicated that high nucleotide diversity of coding regions served as important parameters in the determination of gene resistance. We also observed that amino-acid variants (nonsynonymous mutations, insertions, or deletions) in essential motifs of the NBS domain contribute to the blast resistance capacity of NBS-LRR genes. These results suggested that the NBS regions might also play an important role in resistance specificity determination. On the other hand, different splicing patterns of introns were commonly observed in R-genes. The results of the present study contribute to improving the effectiveness of R-gene identification by using evolutionary analysis method and acquisition of novel blast resistance genes.
Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis

DTIC Science & Technology

2015-10-01

AWARD NUMBER: W81XWH-10-1-0469 TITLE: Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis...31Jul2015 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER "Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis 1 and Scoliosis." 5b...ABSTRACT Dystrophic or non-dystrophic forms of scoliosis are skeletal manifestations of Neurofibromatosis type 1 (NF1). Dystrophic scoliosis has a more
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene

PubMed Central

Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil

2007-01-01

Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains. PMID:18047649
Phylogenomics of MADS-Box Genes in Plants - Two Opposing Life Styles in One Gene Family.

PubMed

Gramzow, Lydia; Theißen, Günter

2013-09-12

The development of multicellular eukaryotes, according to their body plan, is often directed by members of multigene families that encode transcription factors. MADS (for MINICHROMOSOME MAINTENANCE1, AGAMOUS, DEFICIENS and SERUM RESPONSE FACTOR)-box genes form one of those families controlling nearly all major aspects of plant development. Knowing the complete complement of MADS-box genes in sequenced plant genomes will allow a better understanding of the evolutionary patterns of these genes and the association of their evolution with the evolution of plant morphologies. Here, we have applied a combination of automatic and manual annotations to identify the complete set of MADS-box genes in 17 plant genomes. Furthermore, three plant genomes were reanalyzed and published datasets were used for four genomes such that more than 2,600 genes from 24 species were classified into the two types of MADS-box genes, Type I and Type II. Our results extend previous studies, highlighting the remarkably different evolutionary patterns of Type I and Type II genes and provide a basis for further studies on the evolution and function of MADS-box genes.
Selection and validation of reference genes for gene expression analysis in apomictic and sexual Cenchrus ciliaris

PubMed Central

2013-01-01

Background Apomixis is a naturally occurring asexual mode of seed reproduction resulting in offspring genetically identical to the maternal plant. Identifying differential gene expression patterns between apomictic and sexual plants is valuable to help deconstruct the trait. Quantitative RT-PCR (qRT-PCR) is a popular method for analyzing gene expression. Normalizing gene expression data using proper reference genes which show stable expression under investigated conditions is critical in qRT-PCR analysis. We used qRT-PCR to validate expression and stability of six potential reference genes (EF1alpha, EIF4A, UBCE, GAPDH, ACT2 and TUBA) in vegetative and reproductive tissues of B-2S and B-12-9 accessions of C. ciliaris. Findings Among tissue types evaluated, EF1alpha showed the highest level of expression while TUBA showed the lowest. When all tissue types were evaluated and compared between genotypes, EIF4A was the most stable reference gene. Gene expression stability for specific ovary stages of B-2S and B-12-9 was also determined. Except for TUBA, all other tested reference genes could be used for any stage-specific ovary tissue normalization, irrespective of the mode of reproduction. Conclusion Our gene expression stability assay using six reference genes, in sexual and apomictic accessions of C. ciliaris, suggests that EIF4A is the most stable gene across all tissue types analyzed. All other tested reference genes, with the exception of TUBA, could be used for gene expression comparison studies between sexual and apomictic ovaries over multiple developmental stages. This reference gene validation data in C. ciliaris will serve as an important base for future apomixis-related transcriptome data validation. PMID:24083672
GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

PubMed

Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

2016-03-01

Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics
Characterization of the interferon genes in homozygous rainbow trout reveals two novel genes, alternate splicing and differential regulation of duplicated genes

USGS Publications Warehouse

Purcell, M.K.; Laing, K.J.; Woodson, J.C.; Thorgaard, G.H.; Hansen, J.D.

2009-01-01

The genes encoding the type I and type II interferons (IFNs) have previously been identified in rainbow trout and their proteins partially characterized. These previous studies reported a single type II IFN (rtIFN-??) and three rainbow trout type I IFN genes that are classified into either group I (rtIFN1, rtIFN2) or group II (rtIFN3). In this present study, we report the identification of a novel IFN-?? gene (rtIFN-??2) and a novel type I group II IFN (rtIFN4) in homozygous rainbow trout and predict that additional IFN genes or pseudogenes exist in the rainbow trout genome. Additionally, we provide evidence that short and long forms of rtIFN1 are actively and differentially transcribed in homozygous trout, and likely arose due to alternate splicing of the first exon. Quantitative reverse transcriptase PCR (qRT-PCR) assays were developed to systematically profile all of the rainbow trout IFN transcripts, with high specificity at an individual gene level, in na??ve fish and after stimulation with virus or viral-related molecules. Cloned PCR products were used to ensure the specificity of the qRT-PCR assays and as absolute standards to assess transcript abundance of each gene. All IFN genes were modulated in response to Infectious hematopoietic necrosis virus (IHNV), a DNA vaccine based on the IHNV glycoprotein, and poly I:C. The most inducible of the type I IFN genes, by all stimuli tested, were rtIFN3 and the short transcript form of rtIFN1. Gene expression of rtIFN-??1 and rtIFN-??2 was highly up-regulated by IHNV infection and DNA vaccination but rtIFN-??2 was induced to a greater magnitude. The specificity of the qRT-PCR assays reported here will be useful for future studies aimed at identifying which cells produce IFNs at early time points after infection. ?? 2008 Elsevier Ltd.
Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

PubMed

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-10-03

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.
Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes

PubMed Central

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-01-01

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes. PMID:29108274
Determining Physical Mechanisms of Gene Expression Regulation from Single Cell Gene Expression Data.

PubMed

Ezer, Daphne; Moignard, Victoria; Göttgens, Berthold; Adryan, Boris

2016-08-01

Many genes are expressed in bursts, which can contribute to cell-to-cell heterogeneity. It is now possible to measure this heterogeneity with high throughput single cell gene expression assays (single cell qPCR and RNA-seq). These experimental approaches generate gene expression distributions which can be used to estimate the kinetic parameters of gene expression bursting, namely the rate that genes turn on, the rate that genes turn off, and the rate of transcription. We construct a complete pipeline for the analysis of single cell qPCR data that uses the mathematics behind bursty expression to develop more accurate and robust algorithms for analyzing the origin of heterogeneity in experimental samples, specifically an algorithm for clustering cells by their bursting behavior (Simulated Annealing for Bursty Expression Clustering, SABEC) and a statistical tool for comparing the kinetic parameters of bursty expression across populations of cells (Estimation of Parameter changes in Kinetics, EPiK). We applied these methods to hematopoiesis, including a new single cell dataset in which transcription factors (TFs) involved in the earliest branchpoint of blood differentiation were individually up- and down-regulated. We could identify two unique sub-populations within a seemingly homogenous group of hematopoietic stem cells. In addition, we could predict regulatory mechanisms controlling the expression levels of eighteen key hematopoietic transcription factors throughout differentiation. Detailed information about gene regulatory mechanisms can therefore be obtained simply from high throughput single cell gene expression data, which should be widely applicable given the rapid expansion of single cell genomics.
Heterologous gene expression driven by carbonic anhydrase gene promoter in Dunaliella salina

NASA Astrophysics Data System (ADS)

Chai, Yurong; Lu, Yumin; Wang, Tianyun; Hou, Weihong; Xue, Lexun

2006-12-01

Dunaliella salina, a halotolerant unicellular green alga without a rigid cell wall, can live in salinities ranging from 0.05 to 5 mol/L NaCl. These features of D. salina make it an ideal host for the production of antibodies, oral vaccine, and commercially valuable polypeptides. To produce high level of heterologous proteins from D. salina, highly efficient promoters are required to drive expression of target genes under controlled condition. In the present study, we cloned a 5' franking region of 1.4 kb from the carbonic anhydrase ( CAH) gene of D. salina by genomic walking and PCR. The fragment was ligated to the pMD18-T vector and characterized. Sequence analysis indicated that this region contained conserved motifs, including a TATA- like box and CAAT-box. Tandem (GT)n repeats that had a potential role of transcriptional control, were also found in this region. The transcription start site (TSS) of the CAH gene was determined by 5' RACE and nested PCR method. Transformation assays showed that the 1.4 kb fragment was able to drive expression of the selectable bar (bialaphos resistance) gene when the fusion was transformed into D. salina by biolistics. Northern blotting hybridizations showed that the bar transcript was most abundant in cells grown in 2 mol/L NaCl, and less abundant in 0.5 mol/L NaCl, indicating that expression of the bar gene was induced at high salinity. These results suggest the potential use of the CAH gene promoter to induce the expression of heterologous genes in D. salina under varied salt condition.
Gene-based interaction analysis shows GABAergic genes interacting with parenting in adolescent depressive symptoms.

PubMed

Van Assche, Evelien; Moons, Tim; Cinar, Ozan; Viechtbauer, Wolfgang; Oldehinkel, Albertine J; Van Leeuwen, Karla; Verschueren, Karine; Colpin, Hilde; Lambrechts, Diether; Van den Noortgate, Wim; Goossens, Luc; Claes, Stephan; van Winkel, Ruud

2017-12-01

Most gene-environment interaction studies (G × E) have focused on single candidate genes. This approach is criticized for its expectations of large effect sizes and occurrence of spurious results. We describe an approach that accounts for the polygenic nature of most psychiatric phenotypes and reduces the risk of false-positive findings. We apply this method focusing on the role of perceived parental support, psychological control, and harsh punishment in depressive symptoms in adolescence. Analyses were conducted on 982 adolescents of Caucasian origin (M age (SD) = 13.78 (.94) years) genotyped for 4,947 SNPs in 263 genes, selected based on a literature survey. The Leuven Adolescent Perceived Parenting Scale (LAPPS) and the Parental Behavior Scale (PBS) were used to assess perceived parental psychological control, harsh punishment, and support. The Center for Epidemiologic Studies Depression Scale (CES-D) was the outcome. We used gene-based testing taking into account linkage disequilibrium to identify genes containing SNPs exhibiting an interaction with environmental factors yielding a p-value per single gene. Significant results at the corrected p-value of p < 1.90 × 10 -4 were examined in an independent replication sample of Dutch adolescents (N = 1354). Two genes showed evidence for interaction with perceived support: GABRR1 (p = 4.62 × 10 -5 ) and GABRR2 (p = 9.05 × 10 -6 ). No genes interacted significantly with psychological control or harsh punishment. Gene-based analysis was unable to confirm the interaction of GABRR1 or GABRR2 with support in the replication sample. However, for GABRR2, but not GABRR1, the correlation of the estimates between the two datasets was significant (r (46) = .32; p = .027) and a gene-based analysis of the combined datasets supported GABRR2 × support interaction (p = 1.63 × 10 -4 ). We present a gene-based method for gene-environment interactions in a polygenic context and show that genes
GENE EXPRESSION NETWORKS

EPA Science Inventory

"Gene expression network" is the term used to describe the interplay, simple or complex, between two or more gene products in performing a specific cellular function. Although the delineation of such networks is complicated by the existence of multiple and subtle types of intera...

Genes in sport and doping.

PubMed

Pokrywka, A; Kaliszewski, P; Majorczyk, E; Zembroń-Łacny, A

2013-09-01

Genes control biological processes such as muscle production of energy, mitochondria biogenesis, bone formation, erythropoiesis, angiogenesis, vasodilation, neurogenesis, etc. DNA profiling for athletes reveals genetic variations that may be associated with endurance ability, muscle performance and power exercise, tendon susceptibility to injuries and psychological aptitude. Already, over 200 genes relating to physical performance have been identified by several research groups. Athletes' genotyping is developing as a tool for the formulation of personalized training and nutritional programmes to optimize sport training as well as for the prediction of exercise-related injuries. On the other hand, development of molecular technology and gene therapy creates a risk of non-therapeutic use of cells, genes and genetic elements to improve athletic performance. Therefore, the World Anti-Doping Agency decided to include prohibition of gene doping within their World Anti-Doping Code in 2003. In this review article, we will provide a current overview of genes for use in athletes' genotyping and gene doping possibilities, including their development and detection techniques.
Simple Monitoring of Gene Targeting Efficiency in Human Somatic Cell Lines Using the PIGA Gene

PubMed Central

Karnan, Sivasundaram; Konishi, Yuko; Ota, Akinobu; Takahashi, Miyuki; Damdindorj, Lkhagvasuren; Hosokawa, Yoshitaka; Konishi, Hiroyuki

2012-01-01

Gene targeting in most of human somatic cell lines has been labor-intensive because of low homologous recombination efficiency. The development of an experimental system that permits a facile evaluation of gene targeting efficiency in human somatic cell lines is the first step towards the improvement of this technology and its application to a broad range of cell lines. In this study, we utilized phosphatidylinositol glycan anchor biosynthesis class A (PIGA), a gene essential for the synthesis of glycosylphosphatidyl inositol (GPI) anchors, as a reporter of gene targeting events in human somatic cell lines. Targeted disruption of PIGA was quantitatively detected with FLAER, a reagent that specifically binds to GPI anchors. Using this PIGA-based reporter system, we successfully detected adeno-associated virus (AAV)-mediated gene targeting events both with and without promoter-trap enrichment of gene-targeted cell population. The PIGA-based reporter system was also capable of reproducing previous findings that an AAV-mediated gene targeting achieves a remarkably higher ratio of homologous versus random integration (H/R ratio) of targeting vectors than a plasmid-mediated gene targeting. The PIGA-based system also detected an approximately 2-fold increase in the H/R ratio achieved by a small negative selection cassette introduced at the end of the AAV-based targeting vector with a promoter-trap system. Thus, our PIGA-based system is useful for monitoring AAV-mediated gene targeting and will assist in improving gene targeting technology in human somatic cell lines. PMID:23056640
Tuning Gene Activity by Inducible and Targeted Regulation of Gene Expression in Minimal Bacterial Cells.

PubMed

Mariscal, Ana M; Kakizawa, Shigeyuki; Hsu, Jonathan Y; Tanaka, Kazuki; González-González, Luis; Broto, Alicia; Querol, Enrique; Lluch-Senar, Maria; Piñero-Lambea, Carlos; Sun, Lijie; Weyman, Philip D; Wise, Kim S; Merryman, Chuck; Tse, Gavin; Moore, Adam J; Hutchison, Clyde A; Smith, Hamilton O; Tomita, Masaru; Venter, J Craig; Glass, John I; Piñol, Jaume; Suzuki, Yo

2018-05-22

Functional genomics studies in minimal mycoplasma cells enable unobstructed access to some of the most fundamental processes in biology. Conventional transposon bombardment and gene knockout approaches often fail to reveal functions of genes that are essential for viability, where lethality precludes phenotypic characterization. Conditional inactivation of genes is effective for characterizing functions central to cell growth and division, but tools are limited for this purpose in mycoplasmas. Here we demonstrate systems for inducible repression of gene expression based on clustered regularly interspaced short palindromic repeats-mediated interference (CRISPRi) in Mycoplasma pneumoniae and synthetic Mycoplasma mycoides, two organisms with reduced genomes actively used in systems biology studies. In the synthetic cell, we also demonstrate inducible gene expression for the first time. Time-course data suggest rapid kinetics and reversible engagement of CRISPRi. Targeting of six selected endogenous genes with this system results in lowered transcript levels or reduced growth rates that agree with lack or shortage of data in previous transposon bombardment studies, and now produces actual cells to analyze. The ksgA gene encodes a methylase that modifies 16S rRNA, rendering it vulnerable to inhibition by the antibiotic kasugamycin. Targeting the ksgA gene with CRISPRi removes the lethal effect of kasugamycin and enables cell growth, thereby establishing specific and effective gene modulation with our system. The facile methods for conditional gene activation and inactivation in mycoplasmas open the door to systematic dissection of genetic programs at the core of cellular life.
Genome Duplication and Gene Loss Affect the Evolution of Heat Shock Transcription Factor Genes in Legumes

PubMed Central

Jin, Jing; Jin, Xiaolei; Jiang, Haiyang; Yan, Hanwei; Cheng, Beijiu

2014-01-01

Whole-genome duplication events (polyploidy events) and gene loss events have played important roles in the evolution of legumes. Here we show that the vast majority of Hsf gene duplications resulted from whole genome duplication events rather than tandem duplication, and significant differences in gene retention exist between species. By searching for intraspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found that genome duplications accounted for 42 of 46 Hsf-containing segments in Glycine max, while paired segments were rarely identified in Lotus japonicas, Medicago truncatula and Cajanus cajan. However, by comparing interspecies microsynteny, we determined that the great majority of Hsf-containing segments in Lotus japonicas, Medicago truncatula and Cajanus cajan show extensive conservation with the duplicated regions of Glycine max. These segments formed 17 groups of orthologous segments. These results suggest that these regions shared ancient genome duplication with Hsf genes in Glycine max, but more than half of the copies of these genes were lost. On the other hand, the Glycine max Hsf gene family retained approximately 75% and 84% of duplicated genes produced from the ancient genome duplication and recent Glycine-specific genome duplication, respectively. Continuous purifying selection has played a key role in the maintenance of Hsf genes in Glycine max. Expression analysis of the Hsf genes in Lotus japonicus revealed their putative involvement in multiple tissue-/developmental stages and responses to various abiotic stimuli. This study traces the evolution of Hsf genes in legume species and demonstrates that the rates of gene gain and loss are far from equilibrium in different species. PMID:25047803
The glpD gene is a novel reporter gene for E. coli that is superior to established reporter genes like lacZ and gusA.

PubMed

Wegener, Marius; Vogtmann, Kristina; Huber, Madeleine; Laass, Sebastian; Soppa, Jörg

2016-12-01

Reporter genes facilitate the characterization of promoter activities, transcript stabilities, translational efficiencies, or intracellular localization. Various reporter genes for Escherichia coli have been established, however, most of them have drawbacks like transcript instability or the inability to be used in genetic selections. Therefore, the glpD gene encoding glycerol-3-phosphate dehydrogenase was introduced as a novel reporter gene for E. coli. The enzymatic assay was optimized, and it was verified that growth on glycerol strictly depends on the presence of GlpD. The 5'-UTRs of three E. coli genes were chosen and cloned upstream of the new reporter gene glpD as well as the established reporter genes lacZ and gusA. Protein and transcript levels were quantified and translational efficiencies were calculated. The lacZ transcript was very unstable and its level highly depended on its translation, compromising its use as a reporter. The results obtained with gusA and glpD were similar, however, only glpD can be used for genetic selections. Therefore, glpD was found to be a superior novel reporter gene compared to the established reporter genes lacZ and gusA. Copyright Â© 2016 Elsevier B.V. All rights reserved.
Gene therapy on the move

PubMed Central

Kaufmann, Kerstin B; Büning, Hildegard; Galy, Anne; Schambach, Axel; Grez, Manuel

2013-01-01

The first gene therapy clinical trials were initiated more than two decades ago. In the early days, gene therapy shared the fate of many experimental medicine approaches and was impeded by the occurrence of severe side effects in a few treated patients. The understanding of the molecular and cellular mechanisms leading to treatment- and/or vector-associated setbacks has resulted in the development of highly sophisticated gene transfer tools with improved safety and therapeutic efficacy. Employing these advanced tools, a series of Phase I/II trials were started in the past few years with excellent clinical results and no side effects reported so far. Moreover, highly efficient gene targeting strategies and site-directed gene editing technologies have been developed and applied clinically. With more than 1900 clinical trials to date, gene therapy has moved from a vision to clinical reality. This review focuses on the application of gene therapy for the correction of inherited diseases, the limitations and drawbacks encountered in some of the early clinical trials and the revival of gene therapy as a powerful treatment option for the correction of monogenic disorders. PMID:24106209
Genes@Work: an efficient algorithm for pattern discovery and multivariate feature selection in gene expression data.

PubMed

Lepre, Jorge; Rice, J Jeremy; Tu, Yuhai; Stolovitzky, Gustavo

2004-05-01

Despite the growing literature devoted to finding differentially expressed genes in assays probing different tissues types, little attention has been paid to the combinatorial nature of feature selection inherent to large, high-dimensional gene expression datasets. New flexible data analysis approaches capable of searching relevant subgroups of genes and experiments are needed to understand multivariate associations of gene expression patterns with observed phenotypes. We present in detail a deterministic algorithm to discover patterns of multivariate gene associations in gene expression data. The patterns discovered are differential with respect to a control dataset. The algorithm is exhaustive and efficient, reporting all existent patterns that fit a given input parameter set while avoiding enumeration of the entire pattern space. The value of the pattern discovery approach is demonstrated by finding a set of genes that differentiate between two types of lymphoma. Moreover, these genes are found to behave consistently in an independent dataset produced in a different laboratory using different arrays, thus validating the genes selected using our algorithm. We show that the genes deemed significant in terms of their multivariate statistics will be missed using other methods. Our set of pattern discovery algorithms including a user interface is distributed as a package called Genes@Work. This package is freely available to non-commercial users and can be downloaded from our website (http://www.research.ibm.com/FunGen).
Cognitive analysis of schizophrenia risk genes that function as epigenetic regulators of gene expression.

PubMed

Whitton, Laura; Cosgrove, Donna; Clarkson, Christopher; Harold, Denise; Kendall, Kimberley; Richards, Alex; Mantripragada, Kiran; Owen, Michael J; O'Donovan, Michael C; Walters, James; Hartmann, Annette; Konte, Betina; Rujescu, Dan; Gill, Michael; Corvin, Aiden; Rea, Stephen; Donohoe, Gary; Morris, Derek W

2016-12-01

Epigenetic mechanisms are an important heritable and dynamic means of regulating various genomic functions, including gene expression, to orchestrate brain development, adult neurogenesis, and synaptic plasticity. These processes when perturbed are thought to contribute to schizophrenia pathophysiology. A core feature of schizophrenia is cognitive dysfunction. For genetic disorders where cognitive impairment is more severe such as intellectual disability, there are a disproportionally high number of genes involved in the epigenetic regulation of gene transcription. Evidence now supports some shared genetic aetiology between schizophrenia and intellectual disability. GWAS have identified 108 chromosomal regions associated with schizophrenia risk that span 350 genes. This study identified genes mapping to those loci that have epigenetic functions, and tested the risk alleles defining those loci for association with cognitive deficits. We developed a list of 350 genes with epigenetic functions and cross-referenced this with the GWAS loci. This identified eight candidate genes: BCL11B, CHD7, EP300, EPC2, GATAD2A, KDM3B, RERE, SATB2. Using a dataset of Irish psychosis cases and controls (n = 1235), the schizophrenia risk SNPs at these loci were tested for effects on IQ, working memory, episodic memory, and attention. Strongest associations were for rs6984242 with both measures of IQ (P = 0.001) and episodic memory (P = 0.007). We link rs6984242 to CHD7 via a long range eQTL. These associations were not replicated in independent samples. Our study highlights that a number of genes mapping to risk loci for schizophrenia may function as epigenetic regulators of gene expression but further studies are required to establish a role for these genes in cognition. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don.

PubMed

Xiao, Zheng; Sun, Xiaobo; Liu, Xiaoqing; Li, Chang; He, Lisi; Chen, Shangping; Su, Jiale

2016-01-01

The quantitative real-time polymerase chain reaction (qRT-PCR) approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder, and BestKeeper. The results showed that EF1- α (elongation factor 1-alpha), 18S (18s ribosomal RNA), and RPL3 (ribosomal protein L3) were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin ( TUB ) was the least stable. ACT5 (actin), RPL3 , 18S , and EF1- α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle . Furthermore, the expression profiles of RmPSY (phytoene synthase) and RmPDS (phytoene dehydrogenase) were assessed using EF1- α, 18S , ACT5 , RPL3 , and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle .
Identifying key genes in glaucoma based on a benchmarked dataset and the gene regulatory network.

PubMed

Chen, Xi; Wang, Qiao-Ling; Zhang, Meng-Hui

2017-10-01

The current study aimed to identify key genes in glaucoma based on a benchmarked dataset and gene regulatory network (GRN). Local and global noise was added to the gene expression dataset to produce a benchmarked dataset. Differentially-expressed genes (DEGs) between patients with glaucoma and normal controls were identified utilizing the Linear Models for Microarray Data (Limma) package based on benchmarked dataset. A total of 5 GRN inference methods, including Zscore, GeneNet, context likelihood of relatedness (CLR) algorithm, Partial Correlation coefficient with Information Theory (PCIT) and GEne Network Inference with Ensemble of Trees (Genie3) were evaluated using receiver operating characteristic (ROC) and precision and recall (PR) curves. The interference method with the best performance was selected to construct the GRN. Subsequently, topological centrality (degree, closeness and betweenness) was conducted to identify key genes in the GRN of glaucoma. Finally, the key genes were validated by performing reverse transcription-quantitative polymerase chain reaction (RT-qPCR). A total of 176 DEGs were detected from the benchmarked dataset. The ROC and PR curves of the 5 methods were analyzed and it was determined that Genie3 had a clear advantage over the other methods; thus, Genie3 was used to construct the GRN. Following topological centrality analysis, 14 key genes for glaucoma were identified, including IL6 , EPHA2 and GSTT1 and 5 of these 14 key genes were validated by RT-qPCR. Therefore, the current study identified 14 key genes in glaucoma, which may be potential biomarkers to use in the diagnosis of glaucoma and aid in identifying the molecular mechanism of this disease.
Targeting gene expression selectively in cancer cells by using the progression-elevated gene-3 promoter.

PubMed

Su, Zhao-Zhong; Sarkar, Devanand; Emdad, Luni; Duigou, Gregory J; Young, Charles S H; Ware, Joy; Randolph, Aaron; Valerie, Kristoffer; Fisher, Paul B

2005-01-25

One impediment to effective cancer-specific gene therapy is the rarity of regulatory sequences targeting gene expression selectively in tumor cells. Although many tissue-specific promoters are recognized, few cancer-selective gene promoters are available. Progression-elevated gene-3 (PEG-3) is a rodent gene identified by subtraction hybridization that displays elevated expression as a function of transformation by diversely acting oncogenes, DNA damage, and cancer cell progression. The promoter of PEG-3, PEG-Prom, displays robust expression in a broad spectrum of human cancer cell lines with marginal expression in normal cellular counterparts. Whereas GFP expression, when under the control of a CMV promoter, is detected in both normal and cancer cells, when GFP is expressed under the control of the PEG-Prom, cancer-selective expression is evident. Mutational analysis identifies the AP-1 and PEA-3 transcription factors as primary mediators of selective, cancer-specific expression of the PEG-Prom. Synthesis of apoptosis-inducing genes, under the control of the CMV promoter, inhibits the growth of both normal and cancer cells, whereas PEG-Prom-mediated expression of these genes kills only cancer cells and spares normal cells. The efficacy of the PEG-Prom as part of a cancer gene therapeutic regimen is further documented by in vivo experiments in which PEG-Prom-controlled expression of an apoptosis-inducing gene completely inhibited prostate cancer xenograft growth in nude mice. These compelling observations indicate that the PEG-Prom, with its cancer-specific expression, provides a means of selectively delivering genes to cancer cells, thereby providing a crucial component in developing effective cancer gene therapies.
Gene therapy for achromatopsia.

PubMed

Michalakis, Stylianos; Schön, Christian; Becirovic, Elvir; Biel, Martin

2017-03-01

The present review summarizes the current status of achromatopsia (ACHM) gene therapy-related research activities and provides an outlook for their clinical application. ACHM is an inherited eye disease characterized by a congenital absence of cone photoreceptor function. As a consequence, ACHM is associated with strongly impaired daylight vision, photophobia, nystagmus and a lack of color discrimination. Currently, six genes have been linked to ACHM. Up to 80% of the patients carry mutations in the genes CNGA3 and CNGB3 encoding the two subunits of the cone cyclic nucleotide-gated channel. Various animal models of the disease have been established and their characterization has helped to increase our understanding of the pathophysiology associated with ACHM. With the advent of adeno-associated virus vectors as valuable gene delivery tools for retinal photoreceptors, a number of promising gene supplementation therapy programs have been initiated. In recent years, huge progress has been made towards bringing a curative treatment for ACHM into clinics. The first clinical trials are ongoing or will be launched soon and are expected to contribute important data on the safety and efficacy of ACHM gene supplementation therapy. Copyright © 2017 John Wiley & Sons, Ltd.
Autophagy genes in immunity

PubMed Central

Virgin, Herbert W; Levine, Beth

2009-01-01

In its classical form, autophagy is a pathway by which cytoplasmic constituents, including intracellular pathogens, are sequestered in a double-membrane–bound autophagosome and delivered to the lysosome for degradation. This pathway has been linked to diverse aspects of innate and adaptive immunity, including pathogen resistance, production of type I interferon, antigen presentation, tolerance and lymphocyte development, as well as the negative regulation of cytokine signaling and inflammation. Most of these links have emerged from studies in which genes encoding molecules involved in autophagy are inactivated in immune effector cells. However, it is not yet known whether all of the critical functions of such genes in immunity represent ‘classical autophagy’ or possible as-yet-undefined autophagolysosome-independent functions of these genes. This review summarizes phenotypes that result from the inactivation of autophagy genes in the immune system and discusses the pleiotropic functions of autophagy genes in immunity. PMID:19381141
PlantTribes: a gene and gene family resource for comparative genomics in plants

PubMed Central

Wall, P. Kerr; Leebens-Mack, Jim; Müller, Kai F.; Field, Dawn; Altman, Naomi S.; dePamphilis, Claude W.

2008-01-01

The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575–1584)] to classify all of these species’ protein-coding genes into putative gene families, called tribes, using three clustering stringencies (low, medium and high). For all tribes, we have generated protein and DNA alignments and maximum-likelihood phylogenetic trees. A parallel database of microarray experimental results is linked to the genes, which lets researchers identify groups of related genes and their expression patterns. Unified nomenclatures were developed, and tribes can be related to traditional gene families and conserved domain identifiers. SuperTribes, constructed through a second iteration of MCL clustering, connect distant, but potentially related gene clusters. The global classification of nearly 200 000 plant proteins was used as a scaffold for sorting ∼4 million additional cDNA sequences from over 200 plant species. All data and analyses are accessible through a flexible interface allowing users to explore the classification, to place query sequences within the classification, and to download results for further study. PMID:18073194
Genes essential for phototrophic growth by a purple alphaproteobacterium: Genes for phototrophic growth

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yang, Jianming; Yin, Liang; Lessner, Faith H.

Anoxygenic purple phototrophic bacteria have served as important models for studies of photophosphorylation. The pigment-protein complexes responsible for converting light energy to ATP are relatively simple and these bacteria can grow heterotrophically under aerobic conditions, thus allowing for the study of mutants defective in photophosphorylation. In the past, genes responsible for anoxygenic phototrophic growth have been identified in a number of different bacterial species. Here we systematically studied the genetic basis for this metabolism by using Tn-seq to identify genes essential for the anaerobic growth of the purple bacterium Rhodopseudomonas palustris on acetate in light. We identified 171 genes requiredmore » for growth in this condition, 35 of which are annotated as photosynthesis genes. Among these are a few new genes not previously shown to be essential for phototrophic growth. We verified the essentiality of many of the genes we identified by analyzing the phenotypes of mutants we generated by Tn mutagenesis that had altered pigmentation. We used directed mutagenesis to verify that the R. palustris NADH:quinone oxidoreductase complex IE is essential for phototrophic growth. As a complement to the genetic data, we carried out proteomics experiments in which we found that 429 proteins were present in significantly higher amounts in cells grown anaerobically in light compared to aerobically. Among these were proteins encoded by subset of the phototrophic growth-essential genes.« less
Gene identification for risk of relapse in stage I lung adenocarcinoma patients: a combined methodology of gene expression profiling and computational gene network analysis.

PubMed

Ludovini, Vienna; Bianconi, Fortunato; Siggillino, Annamaria; Piobbico, Danilo; Vannucci, Jacopo; Metro, Giulio; Chiari, Rita; Bellezza, Guido; Puma, Francesco; Della Fazia, Maria Agnese; Servillo, Giuseppe; Crinò, Lucio

2016-05-24

Risk assessment and treatment choice remains a challenge in early non-small-cell lung cancer (NSCLC). The aim of this study was to identify novel genes involved in the risk of early relapse (ER) compared to no relapse (NR) in resected lung adenocarcinoma (AD) patients using a combination of high throughput technology and computational analysis. We identified 18 patients (n.13 NR and n.5 ER) with stage I AD. Frozen samples of patients in ER, NR and corresponding normal lung (NL) were subjected to Microarray technology and quantitative-PCR (Q-PCR). A gene network computational analysis was performed to select predictive genes. An independent set of 79 ADs stage I samples was used to validate selected genes by Q-PCR.From microarray analysis we selected 50 genes, using the fold change ratio of ER versus NR. They were validated both in pool and individually in patient samples (ER and NR) by Q-PCR. Fourteen increased and 25 decreased genes showed a concordance between two methods. They were used to perform a computational gene network analysis that identified 4 increased (HOXA10, CLCA2, AKR1B10, FABP3) and 6 decreased (SCGB1A1, PGC, TFF1, PSCA, SPRR1B and PRSS1) genes. Moreover, in an independent dataset of ADs samples, we showed that both high FABP3 expression and low SCGB1A1 expression was associated with a worse disease-free survival (DFS).Our results indicate that it is possible to define, through gene expression and computational analysis, a characteristic gene profiling of patients with an increased risk of relapse that may become a tool for patient selection for adjuvant therapy.
Gene therapy for arthritis

PubMed Central

Traister, Russell S.

2008-01-01

Arthritis is among the leading causes of disability in the developed world. There remains no cure for this disease and the current treatments are only modestly effective at slowing the disease's progression and providing symptomatic relief. The clinical effectiveness of current treatment regimens has been limited by short half-lives of the drugs and the requirement for repeated systemic administration. Utilizing gene transfer approaches for the treatment of arthritis may overcome some of the obstacles associated with current treatment strategies. The present review examines recent developments in gene therapy for arthritis. Delivery strategies, gene transfer vectors, candidate genes, and safety are also discussed. PMID:18176779
GeneBreak: detection of recurrent DNA copy number aberration-associated chromosomal breakpoints within genes.

PubMed

van den Broek, Evert; van Lieshout, Stef; Rausch, Christian; Ylstra, Bauke; van de Wiel, Mark A; Meijer, Gerrit A; Fijneman, Remond J A; Abeln, Sanne

2016-01-01

Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs) of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large) series of tumor samples. 'GeneBreak' is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH) or by (low-pass) whole genome sequencing (WGS). First, 'GeneBreak' collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, 'GeneBreak', is implemented in R ( www.cran.r-project.org ) and is available from Bioconductor ( www.bioconductor.org/packages/release/bioc/html/GeneBreak.html ).
Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*

PubMed Central

Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar

2014-01-01

Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215
[Gene method for inconsistent hydrological frequency calculation. 2: Diagnosis system of hydrological genes and method of hydrological moment genes with inconsistent characters].

PubMed

Xie, Ping; Zhao, Jiang Yan; Wu, Zi Yi; Sang, Yan Fang; Chen, Jie; Li, Bin Bin; Gu, Hai Ting

2018-04-01

The analysis of inconsistent hydrological series is one of the major problems that should be solved for engineering hydrological calculation in changing environment. In this study, the diffe-rences of non-consistency and non-stationarity were analyzed from the perspective of composition of hydrological series. The inconsistent hydrological phenomena were generalized into hydrological processes with inheritance, variability and evolution characteristics or regulations. Furthermore, the hydrological genes were identified following the theory of biological genes, while their inheritance bases and variability bases were determined based on composition of hydrological series under diffe-rent time scales. To identify and test the components of hydrological genes, we constructed a diagnosis system of hydrological genes. With the P-3 distribution as an example, we described the process of construction and expression of the moment genes to illustrate the inheritance, variability and evolution principles of hydrological genes. With the annual minimum 1-month runoff series of Yunjinghong station in Lancangjiang River basin as an example, we verified the feasibility and practicability of hydrological gene theory for the calculation of inconsistent hydrological frequency. The results showed that the method could be used to reveal the evolution of inconsistent hydrological series. Therefore, it provided a new research pathway for engineering hydrological calculation in changing environment and an essential reference for the assessment of water security.

Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

PubMed

Beauparlant, Marc A; Drouin, Guy

2014-02-01

Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.
GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies

PubMed Central

Zhang, Bing; Schmoyer, Denise; Kirov, Stefan; Snoddy, Jay

2004-01-01

Background Microarray and other high-throughput technologies are producing large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in the gene sets. Results We have created a web-based tool for data analysis and data visualization for sets of genes called GOTree Machine (GOTM). This tool was originally intended to analyze sets of co-regulated genes identified from microarray analysis but is adaptable for use with other gene sets from other high-throughput analyses. GOTree Machine generates a GOTree, a tree-like structure to navigate the Gene Ontology Directed Acyclic Graph for input gene sets. This system provides user friendly data navigation and visualization. Statistical analysis helps users to identify the most important Gene Ontology categories for the input gene sets and suggests biological areas that warrant further study. GOTree Machine is available online at . Conclusion GOTree Machine has a broad application in functional genomic, proteomic and other high-throughput methods that generate large sets of interesting genes; its primary purpose is to help users sort for interesting patterns in gene sets. PMID:14975175
Multiple inter-kingdom horizontal gene transfers in the evolution of the phosphoenolpyruvate carboxylase gene family.

PubMed

Peng, Yingmei; Cai, Jing; Wang, Wen; Su, Bing

2012-01-01

Pepcase is a gene encoding phosphoenolpyruvate carboxylase that exists in bacteria, archaea and plants,playing an important role in plant metabolism and development. Most plants have two or more pepcase genes belonging to two gene sub-families, while only one gene exists in other organisms. Previous research categorized one plant pepcase gene as plant-type pepcase (PTPC) while the other as bacteria-type pepcase (BTPC) because of its similarity with the pepcase gene found in bacteria. Phylogenetic reconstruction showed that PTPC is the ancestral lineage of plant pepcase, and that all bacteria, protistpepcase and BTPC in plants are derived from a lineage of pepcase closely related with PTPC in algae. However, their phylogeny contradicts the species tree and traditional chronology of organism evolution. Because the diversification of bacteria occurred much earlier than the origin of plants, presumably all bacterialpepcase derived from the ancestral PTPC of algal plants after divergingfrom the ancestor of vascular plant PTPC. To solve this contradiction, we reconstructed the phylogeny of pepcase gene family. Our result showed that both PTPC and BTPC are derived from an ancestral lineage of gamma-proteobacteriapepcases, possibly via an ancient inter-kingdom horizontal gene transfer (HGT) from bacteria to the eukaryotic common ancestor of plants, protists and cellular slime mold. Our phylogenetic analysis also found 48other pepcase genes originated from inter-kingdom HGTs. These results imply that inter-kingdom HGTs played important roles in the evolution of the pepcase gene family and furthermore that HGTsare a more frequent evolutionary event than previouslythought.
Evolution of the F-Box Gene Family in Euarchontoglires: Gene Number Variation and Selection Patterns

PubMed Central

Wang, Ailan; Fu, Mingchuan; Jiang, Xiaoqian; Mao, Yuanhui; Li, Xiangchen; Tao, Shiheng

2014-01-01

F-box proteins are substrate adaptors used by the SKP1–CUL1–F-box protein (SCF) complex, a type of E3 ubiquitin ligase complex in the ubiquitin proteasome system (UPS). SCF-mediated ubiquitylation regulates proteolysis of hundreds of cellular proteins involved in key signaling and disease systems. However, our knowledge of the evolution of the F-box gene family in Euarchontoglires is limited. In the present study, 559 F-box genes and nine related pseudogenes were identified in eight genomes. Lineage-specific gene gain and loss events occurred during the evolution of Euarchontoglires, resulting in varying F-box gene numbers ranging from 66 to 81 among the eight species. Both tandem duplication and retrotransposition were found to have contributed to the increase of F-box gene number, whereas mutation in the F-box domain was the main mechanism responsible for reduction in the number of F-box genes, resulting in a balance of expansion and contraction in the F-box gene family. Thus, the Euarchontoglire F-box gene family evolved under a birth-and-death model. Signatures of positive selection were detected in substrate-recognizing domains of multiple F-box proteins, and adaptive changes played a role in evolution of the Euarchontoglire F-box gene family. In addition, single nucleotide polymorphism (SNP) distributions were found to be highly non-random among different regions of F-box genes in 1092 human individuals, with domain regions having a significantly lower number of non-synonymous SNPs. PMID:24727786
A study of structural properties of gene network graphs for mathematical modeling of integrated mosaic gene networks.

PubMed

Petrovskaya, Olga V; Petrovskiy, Evgeny D; Lavrik, Inna N; Ivanisenko, Vladimir A

2017-04-01

Gene network modeling is one of the widely used approaches in systems biology. It allows for the study of complex genetic systems function, including so-called mosaic gene networks, which consist of functionally interacting subnetworks. We conducted a study of a mosaic gene networks modeling method based on integration of models of gene subnetworks by linear control functionals. An automatic modeling of 10,000 synthetic mosaic gene regulatory networks was carried out using computer experiments on gene knockdowns/knockouts. Structural analysis of graphs of generated mosaic gene regulatory networks has revealed that the most important factor for building accurate integrated mathematical models, among those analyzed in the study, is data on expression of genes corresponding to the vertices with high properties of centrality.
MMTV insertional mutagenesis identifies genes, gene families and pathways involved in mammary cancer.

PubMed

Theodorou, Vassiliki; Kimm, Melanie A; Boer, Mandy; Wessels, Lodewyk; Theelen, Wendy; Jonkers, Jos; Hilkens, John

2007-06-01

We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.
A mechanistic explanation of popularity: genes, rule breaking, and evocative gene-environment correlations.

PubMed

Burt, Alexandra

2009-04-01

Previous work has suggested that the serotonergic system plays a key role in "popularity" or likeability. A polymorphism within the 5HT-sub(2A) serotonin receptor gene (-G1438A) has also been associated with popularity, suggesting that genes may predispose individuals to particular social experiences. However, because genes cannot code directly for others' reactions, any legitimate association should be mediated via the individual's behavior (i.e., genes-->behaviors-->social consequences), a phenomenon referred to as an evocative gene-environment correlation (rGE). The current study aimed to identify one such mediating behavior. The author focused on rule breaking given its prior links to both the serotonergic system and to increased popularity during adolescence. Two samples of previously unacquainted late-adolescent boys completed a peer-based interaction paradigm designed to assess their popularity. Analyses revealed that rule breaking partially mediated the genetic effect on popularity, thereby furthering our understanding of the biological mechanisms that underlie popularity. Moreover, the present results represent the first meaningfully explicated evidence that genes predispose individuals not only to particular behaviors but also to the social consequences of those behaviors. (c) 2009 APA, all rights reserved.
Gene Expression Profile Analysis is Directly Affected by the Selected Reference Gene: The Case of Leaf-Cutting Atta Sexdens

PubMed Central

Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.

2018-01-01

Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample and the result generated by RefFinder differed about which is the most suitable reference gene. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample set. In contrast, when using less stable reference genes for normalization a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The choice of reference genes on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794
Genetic Evaluation for the Scoliosis Gene(s) in Patients with Neurofibromatosis Type I and Scoliosis

DTIC Science & Technology

2011-08-01

AWARD NUMBER: W81XWH-10-1-0469 TITLE: Genetic Evaluation for the Scoliosis Gene(s) in...Patients with Neurofibromatosis Type I and Scoliosis PRINCIPAL INVESTIGATOR: David W. Polly, Jr., M.D. CONTRACTING ORGANIZATION: University...for the Scoliosis Gene(s) in Patients with Neurofibromatosis Type I and Scoliosis 5b. GRANT NUMBER W81XWH-10-1-0469 5c. PROGRAM ELEMENT NUMBER 6
GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

PubMed Central

Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.

2011-01-01

GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647
Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Raymond, Amy; Lovell, Scott; Lorimer, Don

2009-12-01

With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38{alpha}), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. colimore » and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.« less
Gene network polymorphism is the raw material of natural selection: the selfish gene network hypothesis.

PubMed

Boldogköi, Zsolt

2004-09-01

Population genetics, the mathematical theory of modern evolutionary biology, defines evolution as the alteration of the frequency of distinct gene variants (alleles) differing in fitness over the time. The major problem with this view is that in gene and protein sequences we can find little evidence concerning the molecular basis of phenotypic variance, especially those that would confer adaptive benefit to the bearers. Some novel data, however, suggest that a large amount of genetic variation exists in the regulatory region of genes within populations. In addition, comparison of homologous DNA sequences of various species shows that evolution appears to depend more strongly on gene expression than on the genes themselves. Furthermore, it has been demonstrated in several systems that genes form functional networks, whose products exhibit interrelated expression profiles. Finally, it has been found that regulatory circuits of development behave as evolutionary units. These data demonstrate that our view of evolution calls for a new synthesis. In this article I propose a novel concept, termed the selfish gene network hypothesis, which is based on an overall consideration of the above findings. The major statements of this hypothesis are as follows. (1) Instead of individual genes, gene networks (GNs) are responsible for the determination of traits and behaviors. (2) The primary source of microevolution is the intraspecific polymorphism in GNs and not the allelic variation in either the coding or the regulatory sequences of individual genes. (3) GN polymorphism is generated by the variation in the regulatory regions of the component genes and not by the variance in their coding sequences. (4) Evolution proceeds through continuous restructuring of the composition of GNs rather than fixing of specific alleles or GN variants.
Gene Expression Profiling of Gastric Cancer

PubMed Central

Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh

2015-01-01

Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks in which SPOCK1 was among the topmost networks of interacting genes. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenoacarcinoma. PMID:27030788
A novel MADS-box gene subfamily with a sister-group relationship to class B floral homeotic genes.

PubMed

Becker, A; Kaufmann, K; Freialdenhoven, A; Vincent, C; Li, M-A; Saedler, H; Theissen, G

2002-02-01

Class B floral homeotic genes specify the identity of petals and stamens during the development of angiosperm flowers. Recently, putative orthologs of these genes have been identified in different gymnosperms. Together, these genes constitute a clade, termed B genes. Here we report that diverse seed plants also contain members of a hitherto unknown sister clade of the B genes, termed B(sister) (B(s)) genes. We have isolated members of the B(s) clade from the gymnosperm Gnetum gnemon, the monocotyledonous angiosperm Zea mays and the eudicots Arabidopsis thaliana and Antirrhinum majus. In addition, MADS-box genes from the basal angiosperm Asarum europaeum and the eudicot Petunia hybrida were identified as B(s) genes. Comprehensive expression studies revealed that B(s) genes are mainly transcribed in female reproductive organs (ovules and carpel walls). This is in clear contrast to the B genes, which are predominantly expressed in male reproductive organs (and in angiosperm petals). Our data suggest that the B(s) genes played an important role during the evolution of the reproductive structures in seed plants. The establishment of distinct B and B(s) gene lineages after duplication of an ancestral gene may have accompanied the evolution of male microsporophylls and female megasporophylls 400-300 million years ago. During flower evolution, expression of B(s) genes diversified, but the focus of expression remained in female reproductive organs. Our findings imply that a clade of highly conserved close relatives of class B floral homeotic genes has been completely overlooked until recently and awaits further evaluation of its developmental and evolutionary importance. Electronic supplementary material to this paper can be obtained by using the Springer Link server located at http://dx.doi.org/10.1007/s00438-001-0615-8.
Validation of reference genes for gene expression studies in soybean aphid, Aphis glycines Matsumura

USDA-ARS?s Scientific Manuscript database

Quantitative real-time PCR (qRT-PCR) is a common tool for quantifying mRNA transcripts. To normalize results, a reference gene is mandatory. Aphis glycines is a significant soybean pest, yet gene expression and functional genomics studies are hindered by a lack of stable reference genes. We evalu...
MyGeneFriends: A Social Network Linking Genes, Genetic Diseases, and Researchers.

PubMed

Allot, Alexis; Chennen, Kirsley; Nevers, Yannis; Poidevin, Laetitia; Kress, Arnaud; Ripp, Raymond; Thompson, Julie Dawn; Poch, Olivier; Lecompte, Odile

2017-06-16

The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user's specific interests and provides an efficient way to share information with collaborators. Furthermore, the user's behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. MyGeneFriends is available at lbgi.fr/mygenefriends. ©Alexis Allot, Kirsley Chennen, Yannis
Gene duplication, silencing and expression alteration govern the molecular evolution of PRC2 genes in plants.

PubMed

Furihata, Hazuka Y; Suenaga, Kazuya; Kawanabe, Takahiro; Yoshida, Takanori; Kawabe, Akira

2016-10-13

PRC2 genes were analyzed for their number of gene duplications, d N /d S ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquired functional roles in the endosperm.
Determining Semantically Related Significant Genes.

PubMed

Taha, Kamal

2014-01-01

GO relation embodies some aspects of existence dependency. If GO term xis existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of the GO term y are usually functionally and semantically related to the genes annotated with the function of the GO term x. A large number of gene set enrichment analysis methods have been developed in recent years for analyzing gene sets enrichment. However, most of these methods overlook the structural dependencies between GO terms in GO graph by not considering the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term xcannot be existence-dependent on GO term y, if x- and y- have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiment to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.
HOX Genes in Human Lung

PubMed Central

Golpon, Heiko A.; Geraci, Mark W.; Moore, Mark D.; Miller, Heidi L.; Miller, Gary J.; Tuder, Rubin M.; Voelkel, Norbert F.

2001-01-01

HOX genes belong to the large family of homeodomain genes that function as transcription factors. Animal studies indicate that they play an essential role in lung development. We investigated the expression pattern of HOX genes in human lung tissue by using microarray and degenerate reverse transcriptase-polymerase chain reaction survey techniques. HOX genes predominantly from the 3′ end of clusters A and B were expressed in normal human adult lung and among them HOXA5 was the most abundant, followed by HOXB2 and HOXB6. In fetal (12 weeks old) and diseased lung specimens (emphysema, primary pulmonary hypertension) additional HOX genes from clusters C and D were expressed. Using in situ hybridization, transcripts for HOXA5 were predominantly found in alveolar septal and epithelial cells, both in normal and diseased lungs. A 2.5-fold increase in HOXA5 mRNA expression was demonstrated by quantitative reverse transcriptase-polymerase chain reaction in primary pulmonary hypertension lung specimens when compared to normal lung tissue. In conclusion, we demonstrate that HOX genes are selectively expressed in the human lung. Differences in the pattern of HOX gene expression exist among fetal, adult, and diseased lung specimens. The altered pattern of HOX gene expression may contribute to the development of pulmonary diseases. PMID:11238043
Gene doping: possibilities and practicalities.

PubMed

Wells, Dominic J

2009-01-01

Our ever-increasing understanding of the genetic control of cardiovascular and musculoskeletal function together with recent technical improvements in genetic manipulation generates mounting concern over the possibility of such technology being abused by athletes in their quest for improved performance. Genetic manipulation in the context of athletic performance is commonly referred to as gene doping. A review of the literature was performed to identify the genes and methodologies most likely to be used for gene doping and the technologies that might be used to identify such doping. A large number of candidate performance-enhancing genes have been identified from animal studies, many of them using transgenic mice. Only a limited number have been shown to be effective following gene transfer into adults. Those that seem most likely to be abused are genes that exert their effects locally and leave little, if any, trace in blood or urine. There is currently no evidence that gene doping has yet been undertaken in competitive athletes but the anti-doping authorities will need to remain vigilant in reviewing this rapidly emerging technology. The detection of gene doping involves some different challenges from other agents and a number of promising approaches are currently being explored. 2009 S. Karger AG, Basel

GENES IN SPORT AND DOPING

PubMed Central

Kaliszewski, P.; Majorczyk, E.; Zembroń-Łacny, A.

2013-01-01

Genes control biological processes such as muscle production of energy, mitochondria biogenesis, bone formation, erythropoiesis, angiogenesis, vasodilation, neurogenesis, etc. DNA profiling for athletes reveals genetic variations that may be associated with endurance ability, muscle performance and power exercise, tendon susceptibility to injuries and psychological aptitude. Already, over 200 genes relating to physical performance have been identified by several research groups. Athletes’ genotyping is developing as a tool for the formulation of personalized training and nutritional programmes to optimize sport training as well as for the prediction of exercise-related injuries. On the other hand, development of molecular technology and gene therapy creates a risk of non-therapeutic use of cells, genes and genetic elements to improve athletic performance. Therefore, the World Anti-Doping Agency decided to include prohibition of gene doping within their World Anti-Doping Code in 2003. In this review article, we will provide a current overview of genes for use in athletes’ genotyping and gene doping possibilities, including their development and detection techniques. PMID:24744482
Gene Electrotransfer: A Mechanistic Perspective

PubMed Central

Rosazza, Christelle; Meglic, Sasa Haberl; Zumbusch, Andreas; Rols, Marie-Pierre; Miklavcic, Damijan

2016-01-01

Gene electrotransfer is a powerful method of DNA delivery offering several medical applications, among the most promising of which are DNA vaccination and gene therapy for cancer treatment. Electroporation entails the application of electric fields to cells which then experience a local and transient change of membrane permeability. Although gene electrotransfer has been extensively studied in in vitro and in vivo environments, the mechanisms by which DNA enters and navigates through cells are not fully understood. Here we present a comprehensive review of the body of knowledge concerning gene electrotransfer that has been accumulated over the last three decades. For that purpose, after briefly reviewing the medical applications that gene electrotransfer can provide, we outline membrane electropermeabilization, a key process for the delivery of DNA and smaller molecules. Since gene electrotransfer is a multipart process, we proceed our review in describing step by step our current understanding, with particular emphasis on DNA internalization and intracellular trafficking. Finally, we turn our attention to in vivo testing and methodology for gene electrotransfer. PMID:27029943
Gene Therapy in Heart Failure.

PubMed

Fargnoli, Anthony S; Katz, Michael G; Bridges, Charles R; Hajjar, Roger J

2017-01-01

Heart failure is a significant burden to the global healthcare system and represents an underserved market for new pharmacologic strategies, especially therapies which can address root cause myocyte dysfunction. Modern drugs, surgeries, and state-of-the-art interventions are costly and do not improve survival outcome measures. Gene therapy is an attractive strategy, whereby selected gene targets and their associated regulatory mechanisms can be permanently managed therapeutically in a single treatment. This in theory could be sustainable for the patient's life. Despite the promise, however, gene therapy has numerous challenges that must be addressed together as a treatment plan comprising these key elements: myocyte physiologic target validation, gene target manipulation strategy, vector selection for the correct level of manipulation, and carefully utilizing an efficient delivery route that can be implemented in the clinic to efficiently transfer the therapy within safety limits. This chapter summarizes the key developments in cardiac gene therapy from the perspective of understanding each of these components of the treatment plan. The latest pharmacologic gene targets, gene therapy vectors, delivery routes, and strategies are reviewed.
BLISTER Regulates Polycomb-Target Genes, Represses Stress-Regulated Genes and Promotes Stress Responses in Arabidopsis thaliana.

PubMed

Kleinmanns, Julia A; Schatlowski, Nicole; Heckmann, David; Schubert, Daniel

2017-01-01

HIGHLIGHTS The PRC2 interacting protein BLISTER likely acts downstream of PRC2 to silence Polycomb target genes and is a key regulator of specific stress responses in Arabidopsis . Polycomb group (PcG) proteins are key epigenetic regulators of development. The highly conserved Polycomb repressive complex 2 (PRC2) represses thousands of target genes by trimethylating H3K27 (H3K27me3). Plant specific PcG components and functions are largely unknown, however, we previously identified the plant-specific protein BLISTER (BLI) as a PRC2 interactor. BLI regulates PcG target genes and promotes cold stress resistance. To further understand the function of BLI , we analyzed the transcriptional profile of bli-1 mutants. Approximately 40% of the up-regulated genes in bli are PcG target genes, however, bli-1 mutants did not show changes in H3K27me3 levels at all tested genes, indicating that BLI regulates PcG target genes downstream of or in parallel to PRC2. Interestingly, a significant number of BLI regulated H3K27me3 target genes is regulated by the stress hormone absciscic acid (ABA). We further reveal an overrepresentation of genes responding to abiotic stresses such as drought, high salinity, or heat stress among the up-regulated genes in bli mutants. Consistently, bli mutants showed reduced desiccation stress tolerance. We conclude that the PRC2 associated protein BLI is a key regulator of stress-responsive genes in Arabidopsis : it represses ABA-responsive PcG target genes, likely downstream of PRC2, and promotes resistance to several stresses such as cold and drought.
GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts

PubMed Central

Aubourg, Sébastien; Brunaud, Véronique; Bruyère, Clémence; Cock, Mark; Cooke, Richard; Cottet, Annick; Couloux, Arnaud; Déhais, Patrice; Deléage, Gilbert; Duclert, Aymeric; Echeverria, Manuel; Eschbach, Aimée; Falconet, Denis; Filippi, Ghislain; Gaspin, Christine; Geourjon, Christophe; Grienenberger, Jean-Michel; Houlné, Guy; Jamet, Elisabeth; Lechauve, Frédéric; Leleu, Olivier; Leroy, Philippe; Mache, Régis; Meyer, Christian; Nedjari, Hafed; Negrutiu, Ioan; Orsini, Valérie; Peyretaillade, Eric; Pommier, Cyril; Raes, Jeroen; Risler, Jean-Loup; Rivière, Stéphane; Rombauts, Stéphane; Rouzé, Pierre; Schneider, Michel; Schwob, Philippe; Small, Ian; Soumayet-Kampetenga, Ghislain; Stankovski, Darko; Toffano, Claire; Tognolli, Michael; Caboche, Michel; Lecharny, Alain

2005-01-01

Genomic projects heavily depend on genome annotations and are limited by the current deficiencies in the published predictions of gene structure and function. It follows that, improved annotation will allow better data mining of genomes, and more secure planning and design of experiments. The purpose of the GeneFarm project is to obtain homogeneous, reliable, documented and traceable annotations for Arabidopsis nuclear genes and gene products, and to enter them into an added-value database. This re-annotation project is being performed exhaustively on every member of each gene family. Performing a family-wide annotation makes the task easier and more efficient than a gene-by-gene approach since many features obtained for one gene can be extrapolated to some or all the other genes of a family. A complete annotation procedure based on the most efficient prediction tools available is being used by 16 partner laboratories, each contributing annotated families from its field of expertise. A database, named GeneFarm, and an associated user-friendly interface to query the annotations have been developed. More than 3000 genes distributed over 300 families have been annotated and are available at http://genoplante-info.infobiogen.fr/Genefarm/. Furthermore, collaboration with the Swiss Institute of Bioinformatics is underway to integrate the GeneFarm data into the protein knowledgebase Swiss-Prot. PMID:15608279
The low-recombining pericentromeric region of barley restricts gene diversity and evolution but not gene expression

PubMed Central

Baker, Katie; Bayer, Micha; Cook, Nicola; Dreißig, Steven; Dhillon, Taniya; Russell, Joanne; Hedley, Pete E; Morris, Jenny; Ramsay, Luke; Colas, Isabelle; Waugh, Robbie; Steffenson, Brian; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J

2014-01-01

The low-recombining pericentromeric region of the barley genome contains roughly a quarter of the genes of the species, embedded in low-recombining DNA that is rich in repeats and repressive chromatin signatures. We have investigated the effects of pericentromeric region residency upon the expression, diversity and evolution of these genes. We observe no significant difference in average transcript level or developmental RNA specificity between the barley pericentromeric region and the rest of the genome. In contrast, all of the evolutionary parameters studied here show evidence of compromised gene evolution in this region. First, genes within the pericentromeric region of wild barley show reduced diversity and significantly weakened purifying selection compared with the rest of the genome. Second, gene duplicates (ohnolog pairs) derived from the cereal whole-genome duplication event ca. 60MYa have been completely eliminated from the barley pericentromeric region. Third, local gene duplication in the pericentromeric region is reduced by 29% relative to the rest of the genome. Thus, the pericentromeric region of barley is a permissive environment for gene expression but has restricted gene evolution in a sizeable fraction of barley's genes. PMID:24947331
Gene expression patterns combined with bioinformatics analysis identify genes associated with cholangiocarcinoma.

PubMed

Li, Chen; Shen, Weixing; Shen, Sheng; Ai, Zhilong

2013-12-01

To explore the molecular mechanisms of cholangiocarcinoma (CC), microarray technology was used to find biomarkers for early detection and diagnosis. The gene expression profiles from 6 patients with CC and 5 normal controls were downloaded from Gene Expression Omnibus and compared. As a result, 204 differentially co-expressed genes (DCGs) in CC patients compared to normal controls were identified using a computational bioinformatics analysis. These genes were mainly involved in coenzyme metabolic process, peptidase activity and oxidation reduction. A regulatory network was constructed by mapping the DCGs to known regulation data. Four transcription factors, FOXC1, ZIC2, NKX2-2 and GCGR, were hub nodes in the network. In conclusion, this study provides a set of targets useful for future investigations into molecular biomarker studies. Copyright © 2013 Elsevier Ltd. All rights reserved.
The roles of gene duplication, gene conversion and positive selection in rodent Esp and Mup pheromone gene families with comparison to the Abp family.

PubMed

Karn, Robert C; Laukaitis, Christina M

2012-01-01

Three proteinaceous pheromone families, the androgen-binding proteins (ABPs), the exocrine-gland secreting peptides (ESPs) and the major urinary proteins (MUPs) are encoded by large gene families in the genomes of Mus musculus and Rattus norvegicus. We studied the evolutionary histories of the Mup and Esp genes and compared them with what is known about the Abp genes. Apparently gene conversion has played little if any role in the expansion of the mouse Class A and Class B Mup genes and pseudogenes, and the rat Mups. By contrast, we found evidence of extensive gene conversion in many Esp genes although not in all of them. Our studies of selection identified at least two amino acid sites in β-sheets as having evolved under positive selection in the mouse Class A and Class B MUPs and in rat MUPs. We show that selection may have acted on the ESPs by determining K(a)/K(s) for Exon 3 sequences with and without the converted sequence segment. While it appears that purifying selection acted on the ESP signal peptides, the secreted portions of the ESPs probably have undergone much more rapid evolution. When the inner gene converted fragment sequences were removed, eleven Esp paralogs were present in two or more pairs with K(a)/K(s) >1.0 and thus we propose that positive selection is detectable by this means in at least some mouse Esp paralogs. We compare and contrast the evolutionary histories of all three mouse pheromone gene families in light of their proposed functions in mouse communication.
The gene space in wheat: the complete γ-gliadin gene family from the wheat cultivar Chinese Spring.

PubMed

Anderson, Olin D; Huo, Naxin; Gu, Yong Q

2013-06-01

The complete set of unique γ-gliadin genes is described for the wheat cultivar Chinese Spring using a combination of expressed sequence tag (EST) and Roche 454 DNA sequences. Assemblies of Chinese Spring ESTs yielded 11 different γ-gliadin gene sequences. Two of the sequences encode identical polypeptides and are assumed to be the result of a recent gene duplication. One gene has a 3' coding mutation that changes the reading frame in the final eight codons. A second assembly of Chinese Spring γ-gliadin sequences was generated using Roche 454 total genomic DNA sequences. The 454 assembly confirmed the same 11 active genes as the EST assembly plus two pseudogenes not represented by ESTs. These 13 γ-gliadin sequences represent the complete unique set of γ-gliadin genes for cv Chinese Spring, although not ruled out are additional genes that are exact duplications of these 13 genes. A comparison with the ESTs of two other hexaploid cultivars (Butte 86 and Recital) finds that the most active genes are present in all three cultivars, with exceptions likely due to too few ESTs for detection in Butte 86 and Recital. A comparison of the numbers of ESTs per gene indicates differential levels of expression within the γ-gliadin gene family. Genome assignments were made for 6 of the 13 Chinese Spring γ-gliadin genes, i.e., one assignment from a match to two γ-gliadin genes found within a tetraploid wheat A genome BAC and four genes that match four distinct γ-gliadin sequences assembled from Roche 454 sequences from Aegilops tauschii, the hexaploid wheat D-genome ancestor.
Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.).

PubMed

Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey

2010-04-19

Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups.qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both geNorm-designated- and Norm
Gene therapy progress and prospects: magnetic nanoparticle-based gene delivery.

PubMed

Dobson, J

2006-02-01

The recent emphasis on the development of non-viral transfection agents for gene delivery has led to new physics and chemistry-based techniques, which take advantage of charge interactions and energetic processes. One of these techniques which shows much promise for both in vitro and in vivo transfection involves the use of biocompatible magnetic nanoparticles for gene delivery. In these systems, therapeutic or reporter genes are attached to magnetic nanoparticles, which are then focused to the target site/cells via high-field/high-gradient magnets. The technique promotes rapid transfection and, as more recent work indicates, excellent overall transfection levels as well. The advantages and difficulties associated with magnetic nanoparticle-based transfection will be discussed as will the underlying physical principles, recent studies and potential future applications.
Biodegradable nanoparticles for gene therapy technology

NASA Astrophysics Data System (ADS)

Hosseinkhani, Hossein; He, Wen-Jie; Chiang, Chiao-Hsi; Hong, Po-Da; Yu, Dah-Shyong; Domb, Abraham J.; Ou, Keng-Liang

2013-07-01

Rapid propagations in materials technology together with biology have initiated great hopes in the possibility of treating many diseases by gene therapy technology. Viral and non-viral gene carriers are currently applied for gene delivery. Non-viral technology is safe and effective for the delivery of genetic materials to cells and tissues. Non-viral systems are based on plasmid expression containing a gene encoding a therapeutic protein and synthetic biodegradable nanoparticles as a safe carrier of gene. Biodegradable nanoparticles have shown great interest in drug and gene delivery systems as they are easy to be synthesized and have no side effect in cells and tissues. This review provides a critical view of applications of biodegradable nanoparticles on gene therapy technology to enhance the localization of in vitro and in vivo and improve the function of administered genes.
Application of HSVtk suicide gene to X-SCID gene therapy: Ganciclovir treatment offsets gene corrected X-SCID B cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Uchiyama, Toru; Kumaki, Satoru; Ishikawa, Yoshinori

Recently, a serious adverse effect of uncontrolled clonal T cell proliferation due to insertional mutagenesis of retroviral vector was reported in X-SCID gene therapy clinical trial. To offset the side effect, we have incorporated a suicide gene into therapeutic retroviral vector for selective elimination of transduced cells. In this study, B-cell lines from two X-SCID patients were transduced with bicistronic retroviral vector carrying human {gamma}c chain cDNA and Herpes simplex virus thymidine kinase gene. After confirmation of functional reconstitution of the {gamma}c chain, the cells were treated with ganciclovir (GCV). The {gamma}c chain positive cells were eliminated under low concentrationmore » without cytotoxicity on untransduced cells and have not reappeared at least for 5 months. Furthermore, the {gamma}c chain transduced cells were still sensitive to GCV after five months. These results demonstrated the efficacy of the suicide gene therapy although further in vivo studies are required to assess feasibility of this approach in clinical trial.« less
Application of community phylogenetic approaches to understand gene expression: differential exploration of venom gene space in predatory marine gastropods.

PubMed

Chang, Dan; Duda, Thomas F

2014-06-05

Predatory marine gastropods of the genus Conus exhibit substantial variation in venom composition both within and among species. Apart from mechanisms associated with extensive turnover of gene families and rapid evolution of genes that encode venom components ('conotoxins'), the evolution of distinct conotoxin expression patterns is an additional source of variation that may drive interspecific differences in the utilization of species' 'venom gene space'. To determine the evolution of expression patterns of venom genes of Conus species, we evaluated the expression of A-superfamily conotoxin genes of a set of closely related Conus species by comparing recovered transcripts of A-superfamily genes that were previously identified from the genomes of these species. We modified community phylogenetics approaches to incorporate phylogenetic history and disparity of genes and their expression profiles to determine patterns of venom gene space utilization. Less than half of the A-superfamily gene repertoire of these species is expressed, and only a few orthologous genes are coexpressed among species. Species exhibit substantially distinct expression strategies, with some expressing sets of closely related loci ('under-dispersed' expression of available genes) while others express sets of more disparate genes ('over-dispersed' expression). In addition, expressed genes show higher dN/dS values than either unexpressed or ancestral genes; this implies that expression exposes genes to selection and facilitates rapid evolution of these genes. Few recent lineage-specific gene duplicates are expressed simultaneously, suggesting that expression divergence among redundant gene copies may be established shortly after gene duplication. Our study demonstrates that venom gene space is explored differentially by Conus species, a process that effectively permits the independent and rapid evolution of venoms in these species.
Apolipoprotein gene involved in lipid metabolism

DOEpatents

Rubin, Edward [Berkeley, CA; Pennacchio, Len A [Sebastopol, CA

2007-07-03

Methods and materials for studying the effects of a newly identified human gene, APOAV, and the corresponding mouse gene apoAV. The sequences of the genes are given, and transgenic animals which either contain the gene or have the endogenous gene knocked out are described. In addition, single nucleotide polymorphisms (SNPs) in the gene are described and characterized. It is demonstrated that certain SNPs are associated with diseases involving lipids and triglycerides and other metabolic diseases. These SNPs may be used alone or with SNPs from other genes to study individual risk factors. Methods for intervention in lipid diseases, including the screening of drugs to treat lipid-related or diabetic diseases are also disclosed.
[Current status of gene test market].

PubMed

Ohtani, Shinichi

2002-12-01

The technological innovation of the gene analysis makes the adaptation range of the gene test in clinical diagnosis expand. Then, gene test has popularized increasingly around the infection disease for clinical inspection. Also in the field of clinical inspection, the increase of the importance of clinical application and the inspection item new year by year have appeared with the functional analysis of a gene. Moreover, the new test method and automation analysis equipment tend to be developed by progress of gene-analysis technology, and it is going to be introduced. The spread of gene test and development of a gene test market have an important possibility of activating the present clinical inspection field.
The Natural History of Class I Primate Alcohol Dehydrogenases Includes Gene Duplication, Gene Loss, and Gene Conversion

PubMed Central

Carrigan, Matthew A.; Uryasev, Oleg; Davis, Ross P.; Zhai, LanMin; Hurley, Thomas D.; Benner, Steven A.

2012-01-01

Background Gene duplication is a source of molecular innovation throughout evolution. However, even with massive amounts of genome sequence data, correlating gene duplication with speciation and other events in natural history can be difficult. This is especially true in its most interesting cases, where rapid and multiple duplications are likely to reflect adaptation to rapidly changing environments and life styles. This may be so for Class I of alcohol dehydrogenases (ADH1s), where multiple duplications occurred in primate lineages in Old and New World monkeys (OWMs and NWMs) and hominoids. Methodology/Principal Findings To build a preferred model for the natural history of ADH1s, we determined the sequences of nine new ADH1 genes, finding for the first time multiple paralogs in various prosimians (lemurs, strepsirhines). Database mining then identified novel ADH1 paralogs in both macaque (an OWM) and marmoset (a NWM). These were used with the previously identified human paralogs to resolve controversies relating to dates of duplication and gene conversion in the ADH1 family. Central to these controversies are differences in the topologies of trees generated from exonic (coding) sequences and intronic sequences. Conclusions/Significance We provide evidence that gene conversions are the primary source of difference, using molecular clock dating of duplications and analyses of microinsertions and deletions (micro-indels). The tree topology inferred from intron sequences appear to more correctly represent the natural history of ADH1s, with the ADH1 paralogs in platyrrhines (NWMs) and catarrhines (OWMs and hominoids) having arisen by duplications shortly predating the divergence of OWMs and NWMs. We also conclude that paralogs in lemurs arose independently. Finally, we identify errors in database interpretation as the source of controversies concerning gene conversion. These analyses provide a model for the natural history of ADH1s that posits four ADH1 paralogs in
Differential expression of the eight genes of the petunia ribulose bisphosphate carboxylase small subunit multi-gene family

PubMed Central

Dean, Caroline; Elzen, Peter van den; Tamaki, Stanley; Dunsmuir, Pamela; Bedbrook, John

1985-01-01

Of the eight nuclear genes in the plant multi-gene family which encodes the small subunit (rbcS) of Petunia (Mitchell) ribulose bisphosphate carboxylase, one rbcS gene accounts for 47% of the total rbcS gene expression in petunia leaf tissue. Expression of each of five other rbcS genes is detected at levels between 2 and 23% of the total rbcS expression in leaf tissue, while expression of the remaining two rbcS genes is not detected. There is considerable variation (500-fold) in the levels of total rbcS mRNA in six organs of petunia (leaves, sepals, petals, stems, roots and stigmas/anthers). One gene, SSU301, showed the highest levels of steady-state mRNA in each of the organs examined. We discuss the differences in the steady-state mRNA levels of the individual rbcS genes in relation to their gene structure, nucleotide sequence and genomic linkage. ImagesFig. 2.Fig. 3. PMID:16453647
Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.

PubMed

Arnaiz, Olivier; Van Dijk, Erwin; Bétermier, Mireille; Lhuillier-Akakpo, Maoussi; de Vanssay, Augustin; Duharcourt, Sandra; Sallet, Erika; Gouzy, Jérôme; Sperling, Linda

2017-06-26

The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3' and 5' UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis
Evidence for homosexuality gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pool, R.

1993-07-16

A genetic analysis of 40 pairs of homosexual brothers has uncovered a region on the X chromosome that appears to contain a gene or genes for homosexuality. When analyzing the pedigrees of homosexual males, the researcheres found evidence that the trait has a higher likelihood of being passed through maternal genes. This led them to search the X chromosome for genes predisposing to homosexuality. The researchers examined the X chromosomes of pairs of homosexual brothers for regions of DNA that most or all had in common. Of the 40 sets of brothers, 33 shared a set of five markers inmore » the q28 region of the long arm of the X chromosome. The linkage has a LOD score of 4.0, which translates into a 99.5% certainty that there is a gene or genes in this area that predispose males to homosexuality. The chief researcher warns, however, that this one site cannot explain all instances of homosexuality, since there were some cases where the trait seemed to be passed paternally. And even among those brothers where there was no evidence that the trait was passed paternally, seven sets of brothers did not share the Xq28 markers. It seems likely that homosexuality arises from a variety of causes.« less

The Association of Multiple Interacting Genes with Specific Phenotypes in Rice Using Gene Coexpression Networks1[C][W][OA

PubMed Central

Ficklin, Stephen P.; Luo, Feng; Feltus, F. Alex

2010-01-01

Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes. PMID:20668062
Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress

PubMed Central

Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

2017-01-01

Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species. PMID:28429727
Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress.

PubMed

Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

2017-04-21

Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species.
A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses.

PubMed

He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

2017-03-01

Female moths synthesize species-specific sex pheromone components and release them to attract male moths, which depend on precise sex pheromone chemosensory system to locate females. Two types of genes involved in the sex pheromone biosynthesis and degradation pathways play essential roles in this important moth behavior. To understand the function of genes in the sex pheromone pathway, this study investigated the genome-wide and digital gene expression of sex pheromone biosynthesis and degradation genes in various adult tissues in the diamondback moth (DBM), Plutella xylostella, which is a notorious vegetable pest worldwide. A massive transcriptome data (at least 39.04 Gb) was generated by sequencing 6 adult tissues including male antennae, female antennae, heads, legs, abdomen and female pheromone glands from DBM by using Illumina 4000 next-generation sequencing and mapping to a published DBM genome. Bioinformatics analysis yielded a total of 89,332 unigenes among which 87 transcripts were putatively related to seven gene families in the sex pheromone biosynthesis pathway. Among these, seven [two desaturases (DES), three fatty acyl-CoA reductases (FAR) one acetyltransferase (ACT) and one alcohol dehydrogenase (AD)] were mainly expressed in the pheromone glands with likely function in the three essential sex pheromone biosynthesis steps: desaturation, reduction, and esterification. We also identified 210 odorant-degradation related genes (including sex pheromone-degradation related genes) from seven major enzyme groups. Among these genes, 100 genes are new identified and two aldehyde oxidases (AOXs), one aldehyde dehydrogenase (ALDH), five carboxyl/cholinesterases (CCEs), five UDP-glycosyltransferases (UGTs), eight cytochrome P450 (CYP) and three glutathione S-transferases (GSTs) displayed more robust expression in the antennae, and thus are proposed to participate in the degradation of sex pheromone components and plant volatiles. To date, this is the most
Nonviral Vectors for Gene Delivery

NASA Astrophysics Data System (ADS)

Baoum, Abdulgader Ahmed

2011-12-01

The development of nonviral vectors for safe and efficient gene delivery has been gaining considerable attention recently. An ideal nonviral vector must protect the gene against degradation by nuclease in the extracellular matrix, internalize the plasma membrane, escape from the endosomal compartment, unpackage the gene at some point and have no detrimental effects. In comparison to viruses, nonviral vectors are relatively easy to synthesize, less immunogenic, low in cost, and have no limitation in the size of a gene that can be delivered. Significant progress has been made in the basic science and applications of various nonviral gene delivery vectors; however, the majority of nonviral approaches are still inefficient and often toxic. To this end, two nonviral gene delivery systems using either biodegradable poly(D,L-lactide- co-glycolide) (PLG) nanoparticles or cell penetrating peptide (CPP) complexes have been designed and studied using A549 human lung epithelial cells. PLG nanoparticles were optimized for gene delivery by varying particle surface chemistry using different coating materials that adsorb to the particle surface during formation. A variety of cationic coating materials were studied and compared to more conventional surfactants used for PLG nanoparticle fabrication. Nanoparticles (˜200 nm) efficiently encapsulated plasmids encoding for luciferase (80-90%) and slowly released the same for two weeks. After a delay, moderate levels of gene expression appeared at day 5 for certain positively charged PLG particles and gene expression was maintained for at least two weeks. In contrast, gene expression mediated by polyethyleneimine (PEI) ended at day 5. PLG particles were also significantly less cytotoxic than PEI suggesting the use of these vehicles for localized, sustained gene delivery to the pulmonary epithelium. On the other hand, a more simple method to synthesize 50-200 nm complexes capable of high transfection efficiency or high gene knockdown was
ReliefSeq: A Gene-Wise Adaptive-K Nearest-Neighbor Feature Selection Tool for Finding Gene-Gene Interactions and Main Effects in mRNA-Seq Gene Expression Data

PubMed Central

McKinney, Brett A.; White, Bill C.; Grill, Diane E.; Li, Peter W.; Kennedy, Richard B.; Poland, Gregory A.; Oberg, Ann L.

2013-01-01

Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k) for each gene to optimize the Relief-F test statistics (importance scores) for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak) Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to detect both main
Screening for genes and subnetworks associated with pancreatic cancer based on the gene expression profile.

PubMed

Long, Jin; Liu, Zhe; Wu, Xingda; Xu, Yuanhong; Ge, Chunlin

2016-05-01

The present study aimed to screen for potential genes and subnetworks associated with pancreatic cancer (PC) using the gene expression profile. The expression profile GSE 16515 was downloaded from the Gene Expression Omnibus database, which included 36 PC tissue samples and 16 normal samples. Limma package in R language was used to screen differentially expressed genes (DEGs), which were grouped as up‑ and downregulated genes. Then, PFSNet was applied to perform subnetwork analysis for all the DEGs. Moreover, Gene Ontology (GO) and REACTOME pathway enrichment analysis of up‑ and downregulated genes was performed, followed by protein‑protein interaction (PPI) network construction using Search Tool for the Retrieval of Interacting Genes Search Tool for the Retrieval of Interacting Genes. In total, 1,989 DEGs including 1,461 up‑ and 528 downregulated genes were screened out. Subnetworks including pancreatic cancer in PC tissue samples and intercellular adhesion in normal samples were identified, respectively. A total of 8 significant REACTOME pathways for upregulated DEGs, such as hemostasis and cell cycle, mitotic were identified. Moreover, 4 significant REACTOME pathways for downregulated DEGs, including regulation of β‑cell development and transmembrane transport of small molecules were screened out. Additionally, DEGs with high connectivity degrees, such as CCNA2 (cyclin A2) and PBK (PDZ binding kinase), of the module in the protein‑protein interaction network were mainly enriched with cell‑division cycle. CCNA2 and PBK of the module and their relative pathway cell‑division cycle, and two subnetworks (pancreatic cancer and intercellular adhesion subnetworks) may be pivotal for further understanding of the molecular mechanism of PC.
Selection and validation of reliable housekeeping genes to evaluate Piscirickettsia salmonis gene expression.

PubMed

Flores-Herrera, Patricio; Arredondo-Zelada, Oscar; Marshall, Sergio H; Gómez, Fernando A

2018-06-01

Piscirickettsia salmonis is a highly aggressive facultative intracellular bacterium that challenges the sustainability of Chilean salmon production. Due to the limited knowledge of its biology, there is a need to identify key molecular markers that could help define the pathogenic potential of this bacterium. We think a model system should be implemented that efficiently evaluates the expression of putative bacterial markers by using validated, stable, and highly specific housekeeping genes to properly select target genes, which could lead to identifying those responsible for infection and disease induction in naturally infected fish. Here, we selected a set of validated reference or housekeeping genes for RT-qPCR expression analyses of P. salmonis under different growth and stress conditions, including an in vitro infection kinetic. After a thorough screening, we selected sdhA as the most reliable housekeeping gene able to represent stable and highly specific host reference genes for RT-qPCR-driven P. salmonis analysis. Copyright © 2018. Published by Elsevier B.V.
Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex

PubMed Central

Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo

2005-01-01

Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430
In silico analysis of miRNA-mediated gene regulation in OCA and OA genes.

PubMed

Kamaraj, Balu; Gopalakrishnan, Chandrasekhar; Purohit, Rituraj

2014-12-01

Albinism is an autosomal recessive genetic disorder due to low secretion of melanin. The oculocutaneous albinism (OCA) and ocular albinism (OA) genes are responsible for melanin production and also act as a potential targets for miRNAs. The role of miRNA is to inhibit the protein synthesis partially or completely by binding with the 3'UTR of the mRNA thus regulating gene expression. In this analysis, we predicted the genetic variation that occurred in 3'UTR of the transcript which can be a reason for low melanin production thus causing albinism. The single nucleotide polymorphisms (SNPs) in 3'UTR cause more new binding sites for miRNA which binds with mRNA which leads to inhibit the translation process either partially or completely. The SNPs in the mRNA of OCA and OA genes can create new binding sites for miRNA which may control the gene expression and lead to hypopigmentation. We have developed a computational procedure to determine the SNPs in the 3'UTR region of mRNA of OCA (TYR, OCA2, TYRP1 and SLC45A2) and OA (GPR143) genes which will be a potential cause for albinism. We identified 37 SNPs in five genes that are predicted to create 87 new binding sites on mRNA, which may lead to abrogation of the translation process. Expression analysis confirms that these genes are highly expressed in skin and eye regions. It is well supported by enrichment analysis that these genes are mainly involved in eye pigmentation and melanin biosynthesis process. The network analysis also shows how the genes are interacting and expressing in a complex network. This insight provides clue to wet-lab researches to understand the expression pattern of OCA and OA genes and binding phenomenon of mRNA and miRNA upon mutation, which is responsible for inhibition of translation process at genomic levels.
Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data.

PubMed

Hettne, Kristina M; Boorsma, André; van Dartel, Dorien A M; Goeman, Jelle J; de Jong, Esther; Piersma, Aldert H; Stierum, Rob H; Kleinjans, Jos C; Kors, Jan A

2013-01-29

Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity
Selection of reference genes for gene expression studies related to intramuscular fat deposition in Capra hircus skeletal muscle.

PubMed

Zhu, Wuzheng; Lin, Yaqiu; Liao, Honghai; Wang, Yong

2015-01-01

The identification of suitable reference genes is critical for obtaining reliable results from gene expression studies using quantitative real-time PCR (qPCR) because the expression of reference genes may vary considerably under different experimental conditions. In most cases, however, commonly used reference genes are employed in data normalization without proper validation, which may lead to incorrect data interpretation. Here, we aim to select a set of optimal reference genes for the accurate normalization of gene expression associated with intramuscular fat (IMF) deposition during development. In the present study, eight reference genes (PPIB, HMBS, RPLP0, B2M, YWHAZ, 18S, GAPDH and ACTB) were evaluated by three different algorithms (geNorm, NormFinder and BestKeeper) in two types of muscle tissues (longissimus dorsi muscle and biceps femoris muscle) across different developmental stages. All three algorithms gave similar results. PPIB and HMBS were identified as the most stable reference genes, while the commonly used reference genes 18S and GAPDH were the most variably expressed, with expression varying dramatically across different developmental stages. Furthermore, to reveal the crucial role of appropriate reference genes in obtaining a reliable result, analysis of PPARG expression was performed by normalization to the most and the least stable reference genes. The relative expression levels of PPARG normalized to the most stable reference genes greatly differed from those normalized to the least stable one. Therefore, evaluation of reference genes must be performed for a given experimental condition before the reference genes are used. PPIB and HMBS are the optimal reference genes for analysis of gene expression associated with IMF deposition in skeletal muscle during development.
[Gene deletion and functional analysis of the heptyl glycosyltransferase (waaF) gene in Vibrio parahemolyticus O-antigen cluster].

PubMed

Zhao, Feng; Meng, Songsong; Zhou, Deqing

2016-02-04

To construct heptyl glycosyltransferase gene II (waaF) gene deletion mutant of Vibrio parahaemolyticus, and explore the function of the waaF gene in Vibrio parahaemolyticus. The waaF gene deletion mutant was constructed by chitin-based transformation technology using clinical isolates, and then the growth rate, morphology and serotypes were identified. The different sources (O3, O5 and O10) waaF gene complementations were constructed through E. coli S17λpir strains conjugative transferring with Vibrio parahaemolyticus, and the function of the waaF gene was further verified by serotypes. The waaF gene deletion mutant strain was successfully constructed and it grew normally. The growth rate and morphology of mutant were similar with the wild type strains (WT), but the mutant could not occurred agglutination reaction with O antisera. The O3 and O5 sources waaF gene complementations occurred agglutination reaction with O antisera, but the O10 sources waaF gene complementations was not. The waaF gene was related with O-antigen synthesis and it was the key gene of O-antigen synthesis pathway in Vibrio parahaemolyticus. The function of different sources waaF gene were not the same.
Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

PubMed Central

Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

2014-01-01

We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782
Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

PubMed

Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

2014-01-01

We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.
Polycistronic gene expression in Aspergillus niger.

PubMed

Schuetze, Tabea; Meyer, Vera

2017-09-25

Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at
Compositional gene landscapes in vertebrates.

PubMed

Cruveiller, Stéphane; Jabbari, Kamel; Clay, Oliver; Bernardi, Giorgio

2004-05-01

The existence of a well conserved linear relationship between GC levels of genes' second and third codon positions (GC2, GC3) prompted us to focus on the landscape, or joint distribution, spanned by these two variables. In human, well curated coding sequences now cover at least 15%-30% of the estimated total gene set. Our analysis of the landscape defined by this gene set revealed not only the well documented linear crest, but also the presence of several peaks and valleys along that crest, a property that was also indicated in two other warm-blooded vertebrates represented by large gene databases, that is, mouse and chicken. GC2 is the sum of eight amino acid frequencies, whereas GC3 is linearly related to the GC level of the chromosomal region containing the gene. The landscapes therefore portray relations between proteins and the DNA environments of the genes that encode them.
Evolution of Gene Duplication in Plants.

PubMed

Panchy, Nicholas; Lehti-Shiu, Melissa; Shiu, Shin-Han

2016-08-01

Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. © 2016 American Society of Plant Biologists. All Rights Reserved.
Methylation of an alpha-foetoprotein gene intragenic site modulates gene activity.

PubMed Central

Opdecamp, K; Rivière, M; Molné, M; Szpirer, J; Szpirer, C

1992-01-01

By comparing the methylation pattern of Mspl/Hpall sites in the 5' region of the mouse alpha-foetoprotein (AFP) gene of different cells (hepatoma cells, foetal and adult liver, fibroblasts), we found a correlation between gene expression and unmethylation of a site located in the first intron of the gene. Other sites did not show this correlation. In transfection experiments of unmethylated and methylated AFP-CAT chimeric constructions, we then showed that methylation of the intronic site negatively modulates expression of CAT activity. We also found that a DNA segment centered on this site binds nuclear proteins; however methylation did not affect protein binding. Images PMID:1371343
Investigating highly replicated asthma genes as candidate genes for allergic rhinitis.

PubMed

Andiappan, Anand Kumar; Nilsson, Daniel; Halldén, Christer; Yun, Wang De; Säll, Torbjörn; Cardell, Lars Olaf; Tim, Chew Fook

2013-05-10

Asthma genetics has been extensively studied and many genes have been associated with the development or severity of this disease. In contrast, the genetic basis of allergic rhinitis (AR) has not been evaluated as extensively. It is well known that asthma is closely related with AR since a large proportion of individuals with asthma also present symptoms of AR, and patients with AR have a 5-6 fold increased risk of developing asthma. Thus, the relevance of asthma candidate genes as predisposing factors for AR is worth investigating. The present study was designed to investigate if SNPs in highly replicated asthma genes are associated with the occurrence of AR. A total of 192 SNPs from 21 asthma candidate genes reported to be associated with asthma in 6 or more unrelated studies were genotyped in a Swedish population with 246 AR patients and 431 controls. Genotypes for 429 SNPs from the same set of genes were also extracted from a Singapore Chinese genome-wide dataset which consisted of 456 AR cases and 486 controls. All SNPs were subsequently analyzed for association with AR and their influence on allergic sensitization to common allergens. A limited number of potential associations were observed and the overall pattern of P-values corresponds well to the expectations in the absence of an effect. However, in the tests of allele effects in the Chinese population the number of significant P-values exceeds the expectations. The strongest signals were found for SNPs in NPSR1 and CTLA4. In these genes, a total of nine SNPs showed P-values <0.001 with corresponding Q-values <0.05. In the NPSR1 gene some P-values were lower than the Bonferroni correction level. Reanalysis after elimination of all patients with asthmatic symptoms excluded asthma as a confounding factor in our results. Weaker indications were found for IL13 and GSTP1 with respect to sensitization to birch pollen in the Swedish population. Genetic variation in the majority of the highly replicated asthma

Association analysis of the vitamin D receptor gene, the type I collagen gene COL1A1, and the estrogen receptor gene in idiopathic osteoarthritis.

PubMed

Loughlin, J; Sinsheimer, J S; Mustafa, Z; Carr, A J; Clipsham, K; Bloomfield, V A; Chitnavis, J; Bailey, A; Sykes, B; Chapman, K

2000-03-01

Evidence has accumulated supporting a role for genes in the etiology of osteoarthritis (OA). Several candidates have been targeted as potential susceptibility loci including genes that are involved in the regulation of bone density. Genetic association analysis has suggested a role for the vitamin D receptor gene (VDR) and the estrogen receptor gene (ER) in susceptibility. Such findings must be tested in additional independent cohorts. We tested for association of these 2 genes, plus a third gene implicated in bone density, COL1A1, with idiopathic OA. A case-control cohort of 371 affected probands and 369 unaffected spouses was used. Association was tested using 4 intragenic single nucleotide polymorphisms (SNP), one each for the VDR and COL1A1 genes, and 2 for the ER gene. The VDR and ER SNP are the same SNP that have been associated with OA. All 4 SNP affect restriction enzyme sites and were genotyped using polymerase chain reaction and enzyme digestion. Allele and genotype distributions for each SNP were compared between cases and controls and analyzed using Fisher's exact test. There was no evidence of association of the VDR or the ER gene SNP to OA. There was weak evidence of association of the COL1A1 SNP in female cases (p = 0.017), reflected by a difference in the distribution of genotypes at this SNP between female cases and controls (p = 0.027). However, when corrected for multiple testing, these results were not significant. If the VDR, ER, or COL1A1 genes do encode predisposition to OA then the 4 SNP tested are not associated with major susceptibility alleles at these 3 loci.
Assessment of reference genes for reliable analysis of gene transcription by RT-qPCR in ovine leukocytes.

PubMed

Mahakapuge, T A N; Scheerlinck, J-P Y; Rojas, C A Alvarez; Every, A L; Hagen, J

2016-03-01

With the availability of genetic sequencing data, quantitative reverse transcription PCR (RT-qPCR) is increasingly being used for the quantification of gene transcription across species. Too often there is little regard to the selection of reference genes and the impact that a poor choice has on data interpretation. Indeed, RT-qPCR provides a snapshot of relative gene transcription at a given time-point, and hence is highly dependent on the stability of the transcription of the reference gene(s). Using ovine efferent lymph cells and peripheral blood mono-nuclear cells (PBMCs), the two most frequently used leukocytes in immunological studies, we have compared the stability of transcription of the most commonly used ovine reference genes: YWHAZ, RPL-13A, PGK1, B2M, GAPDH, HPRT, SDHA and ACTB. Using established algorithms for reference gene normalization "geNorm" and "Norm Finder", PGK1, GAPDH and YWHAZ were deemed the most stably transcribed genes for efferent leukocytes and PGK1, YWHAZ and SDHA were optimal in PBMCs. These genes should therefore be considered for accurate and reproducible RT-qPCR data analysis of gene transcription in sheep. Copyright © 2016. Published by Elsevier B.V.
Extending gene ontology with gene association networks.

PubMed

Peng, Jiajie; Wang, Tao; Wang, Jixuan; Wang, Yadong; Chen, Jin

2016-04-15

Gene ontology (GO) is a widely used resource to describe the attributes for gene products. However, automatic GO maintenance remains to be difficult because of the complex logical reasoning and the need of biological knowledge that are not explicitly represented in the GO. The existing studies either construct whole GO based on network data or only infer the relations between existing GO terms. None is purposed to add new terms automatically to the existing GO. We proposed a new algorithm 'GOExtender' to efficiently identify all the connected gene pairs labeled by the same parent GO terms. GOExtender is used to predict new GO terms with biological network data, and connect them to the existing GO. Evaluation tests on biological process and cellular component categories of different GO releases showed that GOExtender can extend new GO terms automatically based on the biological network. Furthermore, we applied GOExtender to the recent release of GO and discovered new GO terms with strong support from literature. Software and supplementary document are available at www.msu.edu/%7Ejinchen/GOExtender jinchen@msu.edu or ydwang@hit.edu.cn Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Expressing genes do not forget their LINEs: transposable elements and gene expression

PubMed Central

Kines, Kristine J.; Belancio, Victoria P.

2012-01-01

1. ABSTRACT Historically the accumulated mass of mammalian transposable elements (TEs), particularly those located within gene boundaries, was viewed as a genetic burden potentially detrimental to the genomic landscape. This notion has been strengthened by the discovery that transposable sequences can alter the architecture of the transcriptome, not only through insertion, but also long after the integration process is completed. Insertions previously considered harmless are now known to impact the expression of host genes via modification of the transcript quality or quantity, transcriptional interference, or by the control of pathways that affect the mRNA life-cycle. Conversely, several examples of the evolutionary advantageous impact of TEs on the host gene structure that diversified the cellular transcriptome are reported. TE-induced changes in gene expression can be tissue-or disease-specific, raising the possibility that the impact of TE sequences may vary during development, among normal cell types, and between normal and disease-affected tissues. The understanding of the rules and abundance of TE-interference with gene expression is in its infancy, and its contribution to human disease and/or evolution remains largely unexplored. PMID:22201807
A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

PubMed

Ishikawa, Akira

2017-11-27

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

PubMed

Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

2014-06-01

With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.
GeneCOST: a novel scoring-based prioritization framework for identifying disease causing genes.

PubMed

Ozer, Bugra; Sağıroğlu, Mahmut; Demirci, Hüseyin

2015-11-15

Due to the big data produced by next-generation sequencing studies, there is an evident need for methods to extract the valuable information gathered from these experiments. In this work, we propose GeneCOST, a novel scoring-based method to evaluate every gene for their disease association. Without any prior filtering and any prior knowledge, we assign a disease likelihood score to each gene in correspondence with their variations. Then, we rank all genes based on frequency, conservation, pedigree and detailed variation information to find out the causative reason of the disease state. We demonstrate the usage of GeneCOST with public and real life Mendelian disease cases including recessive, dominant, compound heterozygous and sporadic models. As a result, we were able to identify causative reason behind the disease state in top rankings of our list, proving that this novel prioritization framework provides a powerful environment for the analysis in genetic disease studies alternative to filtering-based approaches. GeneCOST software is freely available at www.igbam.bilgem.tubitak.gov.tr/en/softwares/genecost-en/index.html. buozer@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Identification of Single- and Multiple-Class Specific Signature Genes from Gene Expression Profiles by Group Marker Index

PubMed Central

Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang

2011-01-01

Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing
Functional and evolutionary correlates of gene constellations in the Drosophila melanogaster genome that deviate from the stereotypical gene architecture

PubMed Central

2010-01-01

Background The biological dimensions of genes are manifold. These include genomic properties, (e.g., X/autosomal linkage, recombination) and functional properties (e.g., expression level, tissue specificity). Multiple properties, each generally of subtle influence individually, may affect the evolution of genes or merely be (auto-)correlates. Results of multidimensional analyses may reveal the relative importance of these properties on the evolution of genes, and therefore help evaluate whether these properties should be considered during analyses. While numerous properties are now considered during studies, most work still assumes the stereotypical solitary gene as commonly depicted in textbooks. Here, we investigate the Drosophila melanogaster genome to determine whether deviations from the stereotypical gene architecture correlate with other properties of genes. Results Deviations from the stereotypical gene architecture were classified as the following gene constellations: Overlapping genes were defined as those that overlap in the 5-prime, exonic, or intronic regions. Chromatin co-clustering genes were defined as genes that co-clustered within 20 kb of transcriptional territories. If this scheme is applied the stereotypical gene emerges as a rare occurrence (7.5%), slightly varied schemes yielded between ~1%-50%. Moreover, when following our scheme, paired-overlapping genes and chromatin co-clustering genes accounted for 50.1 and 42.4% of the genes analyzed, respectively. Gene constellation was a correlate of a number of functional and evolutionary properties of genes, but its statistical effect was ~1-2 orders of magnitude lower than the effects of recombination, chromosome linkage and protein function. Analysis of datasets on male reproductive proteins showed these were biased in their representation of gene constellations and evolutionary rate Ka/Ks estimates, but these biases did not overwhelm the biologically meaningful observation of high evolutionary
CYP1A1, GCLC, AGT, AGTR1 gene-gene interactions in community-acquired pneumonia pulmonary complications.

PubMed

Salnikova, Lyubov E; Smelaya, Tamara V; Golubev, Arkadiy M; Rubanovich, Alexander V; Moroz, Viktor V

2013-11-01

This study was conducted to establish the possible contribution of functional gene polymorphisms in detoxification/oxidative stress and vascular remodeling pathways to community-acquired pneumonia (CAP) susceptibility in the case-control study (350 CAP patients, 432 control subjects) and to predisposition to the development of CAP complications in the prospective study. All subjects were genotyped for 16 polymorphic variants in the 14 genes of xenobiotics detoxification CYP1A1, AhR, GSTM1, GSTT1, ABCB1, redox-status SOD2, CAT, GCLC, and vascular homeostasis ACE, AGT, AGTR1, NOS3, MTHFR, VEGFα. Risk of pulmonary complications (PC) in the single locus analysis was associated with CYP1A1, GCLC and AGTR1 genes. Extra PC (toxic shock syndrome and myocarditis) were not associated with these genes. We evaluated gene-gene interactions using multi-factor dimensionality reduction, and cumulative gene risk score approaches. The final model which included >5 risk alleles in the CYP1A1 (rs2606345, rs4646903, rs1048943), GCLC, AGT, and AGTR1 genes was associated with pleuritis, empyema, acute respiratory distress syndrome, all PC and acute respiratory failure (ARF). We considered CYP1A1, GCLC, AGT, AGTR1 gene set using Set Distiller mode implemented in GeneDecks for discovering gene-set relations via the degree of sharing descriptors within a given gene set. N-acetylcysteine and oxygen were defined by Set Distiller as the best descriptors for the gene set associated in the present study with PC and ARF. Results of the study are in line with literature data and suggest that genetically determined oxidative stress exacerbation may contribute to the progression of lung inflammation.
Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

PubMed

Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

2004-03-01

By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.
New Gene Evolution: Little Did We Know

PubMed Central

Long, Manyuan; VanKuren, Nicholas W.; Chen, Sidi; Vibranovski, Maria D.

2014-01-01

Genes are perpetually added to and deleted from genomes during evolution. Thus, it is important to understand how new genes are formed and evolve as critical components of the genetic systems determining the biological diversity of life. Two decades of effort have shed light on the process of new gene origination, and have contributed to an emerging comprehensive picture of how new genes are added to genomes, ranging from the mechanisms that generate new gene structures to the presence of new genes in different organisms to the rates and patterns of new gene origination and the roles of new genes in phenotypic evolution. We review each of these aspects of new gene evolution, summarizing the main evidence for the origination and importance of new genes in evolution. We highlight findings showing that new genes rapidly change existing genetic systems that govern various molecular, cellular and phenotypic functions. PMID:24050177
The early stages of duplicate gene evolution

PubMed Central

Moore, Richard C.; Purugganan, Michael D.

2003-01-01

Gene duplications are one of the primary driving forces in the evolution of genomes and genetic systems. Gene duplicates account for 8–20% of the genes in eukaryotic genomes, and the rates of gene duplication are estimated at between 0.2% and 2% per gene per million years. Duplicate genes are believed to be a major mechanism for the establishment of new gene functions and the generation of evolutionary novelty, yet very little is known about the early stages of the evolution of duplicated gene pairs. It is unclear, for example, to what extent selection, rather than neutral genetic drift, drives the fixation and early evolution of duplicate loci. Analysis of recently duplicated genes in the Arabidopsis thaliana genome reveals significantly reduced species-wide levels of nucleotide polymorphisms in the progenitor and/or duplicate gene copies, suggesting that selective sweeps accompany the initial stages of the evolution of these duplicated gene pairs. Our results support recent theoretical work that indicates that fates of duplicate gene pairs may be determined in the initial phases of duplicate gene evolution and that positive selection plays a prominent role in the evolutionary dynamics of the very early histories of duplicate nuclear genes. PMID:14671323
Apoptosis Gene Information System--AGIS.

PubMed

Sakharkar, Kishore R; Clement, Marie V; Chow, Vincent T K; Pervaiz, Shazib

2006-05-01

Genes implicated in apoptosis have great relevance to biology, medicine and oncology. Here, we describe a unique resource, Apoptosis Gene Information System (AGIS) that provides data for over 2400 genes involved directly or indirectly, in apoptotic pathways of more than 350 different organisms. The organization of this information system is based on the principle of one-gene, one record. AGIS will be updated on a six monthly basis as new information becomes available. AGIS can be accessed at: http://www.cellfate.org/AGIS/.
Amplification of a Gene Related to Mammalian mdr Genes in Drug-Resistant Plasmodium falciparum

NASA Astrophysics Data System (ADS)

Wilson, Craig M.; Serrano, Adelfa E.; Wasley, Annemarie; Bogenschutz, Michael P.; Shankar, Anuraj H.; Wirth, Dyann F.

1989-06-01

The malaria parasite Plasmodium falciparum contains at least two genes related to the mammalian multiple drug resistance genes, and at least one of the P. falciparum genes is expressed at a higher level and is present in higher copy number in a strain that is resistant to multiple drugs than in a strain that is sensitive to the drugs.
Evolution of four gene families with patchy phylogenetic distributions: influx of genes into protist genomes

PubMed Central

Andersson, Jan O; Hirt, Robert P; Foster, Peter G; Roger, Andrew J

2006-01-01

Background Lateral gene transfer (LGT) in eukaryotes from non-organellar sources is a controversial subject in need of further study. Here we present gene distribution and phylogenetic analyses of the genes encoding the hybrid-cluster protein, A-type flavoprotein, glucosamine-6-phosphate isomerase, and alcohol dehydrogenase E. These four genes have a limited distribution among sequenced prokaryotic and eukaryotic genomes and were previously implicated in gene transfer events affecting eukaryotes. If our previous contention that these genes were introduced by LGT independently into the diplomonad and Entamoeba lineages were true, we expect that the number of putative transfers and the phylogenetic signal supporting LGT should be stable or increase, rather than decrease, when novel eukaryotic and prokaryotic homologs are added to the analyses. Results The addition of homologs from phagotrophic protists, including several Entamoeba species, the pelobiont Mastigamoeba balamuthi, and the parabasalid Trichomonas vaginalis, and a large quantity of sequences from genome projects resulted in an apparent increase in the number of putative transfer events affecting all three domains of life. Some of the eukaryotic transfers affect a wide range of protists, such as three divergent lineages of Amoebozoa, represented by Entamoeba, Mastigamoeba, and Dictyostelium, while other transfers only affect a limited diversity, for example only the Entamoeba lineage. These observations are consistent with a model where these genes have been introduced into protist genomes independently from various sources over a long evolutionary time. Conclusion Phylogenetic analyses of the updated datasets using more sophisticated phylogenetic methods, in combination with the gene distribution analyses, strengthened, rather than weakened, the support for LGT as an important mechanism affecting the evolution of these gene families. Thus, gene transfer seems to be an on-going evolutionary mechanism by
Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer.

PubMed

Zhou, Xionghui; Liu, Juan

2014-01-01

Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes.

PubMed

Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa

2017-08-29

Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/.
fabp4 is central to eight obesity associated genes: a functional gene network-based polymorphic study.

PubMed

Bag, Susmita; Ramaiah, Sudha; Anbarasu, Anand

2015-01-07

Network study on genes and proteins offers functional basics of the complexity of gene and protein, and its interacting partners. The gene fatty acid-binding protein 4 (fabp4) is found to be highly expressed in adipose tissue, and is one of the most abundant proteins in mature adipocytes. Our investigations on functional modules of fabp4 provide useful information on the functional genes interacting with fabp4, their biochemical properties and their regulatory functions. The present study shows that there are eight set of candidate genes: acp1, ext2, insr, lipe, ostf1, sncg, usp15, and vim that are strongly and functionally linked up with fabp4. Gene ontological analysis of network modules of fabp4 provides an explicit idea on the functional aspect of fabp4 and its interacting nodes. The hierarchal mapping on gene ontology indicates gene specific processes and functions as well as their compartmentalization in tissues. The fabp4 along with its interacting genes are involved in lipid metabolic activity and are integrated in multi-cellular processes of tissues and organs. They also have important protein/enzyme binding activity. Our study elucidated disease-associated nsSNP prediction for fabp4 and it is interesting to note that there are four rsID׳s (rs1051231, rs3204631, rs140925685 and rs141169989) with disease allelic variation (T104P, T126P, G27D and G90V respectively). On the whole, our gene network analysis presents a clear insight about the interactions and functions associated with fabp4 gene network. Copyright © 2014 Elsevier Ltd. All rights reserved.
Phylogenetics and Gene Structure Dynamics of Polygalacturonase Genes in Aspergillus and Neurospora crassa

PubMed Central

Hong, Jin-Sung; Ryu, Ki-Hyun; Kwon, Soon-Jae; Kim, Jin-Won; Kim, Kwang-Soo; Park, Kyong-Cheul

2013-01-01

Polygalacturonase (PG) gene is a typical gene family present in eukaryotes. Forty-nine PGs were mined from the genomes of Neurospora crassa and five Aspergillus species. The PGs were classified into 3 clades such as clade 1 for rhamno-PGs, clade 2 for exo-PGs and clade 3 for exo- and endo-PGs, which were further grouped into 13 sub-clades based on the polypeptide sequence similarity. In gene structure analysis, a total of 124 introns were present in 44 genes and five genes lacked introns to give an average of 2.5 introns per gene. Intron phase distribution was 64.5% for phase 0, 21.8% for phase 1, and 13.7% for phase 2, respectively. The introns varied in their sequences and their lengths ranged from 20 bp to 424 bp with an average of 65.9 bp, which is approximately half the size of introns in other fungal genes. There were 29 homologous intron blocks and 26 of those were sub-clade specific. Intron losses were counted in 18 introns in which no obvious phase preference for intron loss was observed. Eighteen introns were placed at novel positions, which is considerably higher than those of plant PGs. In an evolutionary sense both intron loss and gain must have taken place for shaping the current PGs in these fungi. Together with the small intron size, low conservation of homologous intron blocks and higher number of novel introns, PGs of fungal species seem to have recently undergone highly dynamic evolution. PMID:25288950

Some links on this page may take you to non-federal websites. Their policies may differ from this site.