structural genes: Topics by Science.gov

Sample records for structural genes

GeneBuilder: interactive in silico prediction of gene structure.

PubMed

Milanesi, L; D'Angelo, D; Rogozin, I B

1999-01-01

Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
Intron-loss evolution of hatching enzyme genes in Teleostei

PubMed Central

2010-01-01

Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321
Function does not follow form in gene regulatory circuits.

PubMed

Payne, Joshua L; Wagner, Andreas

2015-08-20

Gene regulatory circuits are to the cell what arithmetic logic units are to the chip: fundamental components of information processing that map an input onto an output. Gene regulatory circuits come in many different forms, distinct structural configurations that determine who regulates whom. Studies that have focused on the gene expression patterns (functions) of circuits with a given structure (form) have examined just a few structures or gene expression patterns. Here, we use a computational model to exhaustively characterize the gene expression patterns of nearly 17 million three-gene circuits in order to systematically explore the relationship between circuit form and function. Three main conclusions emerge. First, function does not follow form. A circuit of any one structure can have between twelve and nearly thirty thousand distinct gene expression patterns. Second, and conversely, form does not follow function. Most gene expression patterns can be realized by more than one circuit structure. And third, multifunctionality severely constrains circuit form. The number of circuit structures able to drive multiple gene expression patterns decreases rapidly with the number of these patterns. These results indicate that it is generally not possible to infer circuit function from circuit form, or vice versa.
Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments

PubMed Central

Haas, Brian J; Salzberg, Steven L; Zhu, Wei; Pertea, Mihaela; Allen, Jonathan E; Orvis, Joshua; White, Owen; Buell, C Robin; Wortman, Jennifer R

2008-01-01

EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation. PMID:18190707
Bioinformatics study of the mangrove actin genes

NASA Astrophysics Data System (ADS)

Basyuni, M.; Wasilah, M.; Sumardi

2017-01-01

This study describes the bioinformatics methods to analyze eight actin genes from mangrove plants on DDBJ/EMBL/GenBank as well as predicted the structure, composition, subcellular localization, similarity, and phylogenetic. The physical and chemical properties of eight mangroves showed variation among the genes. The percentage of the secondary structure of eight mangrove actin genes followed the order of a helix > random coil > extended chain structure for BgActl, KcActl, RsActl, and A. corniculatum Act. In contrast to this observation, the remaining actin genes were random coil > extended chain structure > a helix. This study, therefore, shown the prediction of secondary structure was performed for necessary structural information. The values of chloroplast or signal peptide or mitochondrial target were too small, indicated that no chloroplast or mitochondrial transit peptide or signal peptide of secretion pathway in mangrove actin genes. These results suggested the importance of understanding the diversity and functional of properties of the different amino acids in mangrove actin genes. To clarify the relationship among the mangrove actin gene, a phylogenetic tree was constructed. Three groups of mangrove actin genes were formed, the first group contains B. gymnorrhiza BgAct and R. stylosa RsActl. The second cluster which consists of 5 actin genes the largest group, and the last branch consist of one gene, B. sexagula Act. The present study, therefore, supported the previous results that plant actin genes form distinct clusters in the tree.
Deep transcriptome sequencing provides new insights into the structural and functional organization of the wheat genome.

PubMed

Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne

2015-02-10

Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
Gene-Transformation-Induced Changes in Chemical Functional Group Features and Molecular Structure Conformation in Alfalfa Plants Co-Expressing Lc-bHLH and C1-MYB Transcriptive Flavanoid Regulatory Genes: Effects of Single-Gene and Two-Gene Insertion.

PubMed

Heendeniya, Ravindra G; Yu, Peiqiang

2017-03-20

Alfalfa ( Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that compared to NT genotype, the presence of double genes ( Lc and C1 ) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm -1 compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided an insight into the impact of transgenes on protein and carbohydrate properties and their molecular structure's changes.
[Construction, identification and expression of three kinds of shuttle plasmids of adenovirus expression vector of hepatitis C virus structure gene].

PubMed

Cao, Yi-zhan; Hao, Chun-qiu; Feng, Zhi-hua; Zhou, Yong-xing; Li, Jin-ge; Jia, Zhan-sheng; Wang, Ping-zhong

2003-02-01

To construct three recombinant shuttle plasmids of adenovirus expression vector which can express hepatitis C virus(HCV) different structure genes(C, C+E1, C+E1+E2) in order to pack adenovirus expression vectors which can express HCV different structure gene effectively. The different HCV structure genes derived from the plasmid pBRTM/HCV1-3011 by using polymerase chain reaction (PCR) were inserted into the backward position of cytomegalovirus(CMV) immediate early promotor element of shuttle plasmid(pAd.CMV-Link.1) of adenovirus expression vector respectively, then the three recombinant plasmids (pAd.HCV-C, pAd.HCV-CE1, pAd.HCV-S) were obtained. The recombinant plasmids were identified by endonuclease, PCR and sequencing. HCV structure genes were expressed transiently with Lipofectamine 2000 coated in HepG2 cells which were confirmed by immunofluorescence and Western-Blot. Insert DNAs of the three recombinant plasmids' were confirmed to be HCV different structure genes by endonuclease, PCR and sequencing. The three recombinant plasmids can express HCV structure gene (C, C+E1, C+E1+E2) transiently in HepG2 cells which were confirmed by immunofluorescence and Western-Blot. The three recombinant shuttle plasmids of adenovirus expression vector can express HCV structure gene(C, C+E1, C+E1+E2) transiently. This should be useful to pack adenovirus expression vector which can express HCV structure genes.
Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Raymond, Amy; Lovell, Scott; Lorimer, Don

2009-12-01

With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38{alpha}), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. colimore » and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.« less
The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

PubMed

Holland, M J; Holland, J P; Thill, G P; Jackson, K A

1981-02-10

Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5- noncoding portions of these glycolytic genes.
Translation-coupling systems

DOEpatents

Pfleger, Brian; Mendez-Perez, Daniel

2013-11-05

Disclosed are systems and methods for coupling translation of a target gene to a detectable response gene. A version of the invention includes a translation-coupling cassette. The translation-coupling cassette includes a target gene, a response gene, a response-gene translation control element, and a secondary structure-forming sequence that reversibly forms a secondary structure masking the response-gene translation control element. Masking of the response-gene translation control element inhibits translation of the response gene. Full translation of the target gene results in unfolding of the secondary structure and consequent translation of the response gene. Translation of the target gene is determined by detecting presence of the response-gene protein product. The invention further includes RNA transcripts of the translation-coupling cassettes, vectors comprising the translation-coupling cassettes, hosts comprising the translation-coupling cassettes, methods of using the translation-coupling cassettes, and gene products produced with the translation-coupling cassettes.
Translation-coupling systems

DOEpatents

Pfleger, Brian; Mendez-Perez, Daniel

2015-05-19

Disclosed are systems and methods for coupling translation of a target gene to a detectable response gene. A version of the invention includes a translation-coupling cassette. The translation-coupling cassette includes a target gene, a response gene, a response-gene translation control element, and a secondary structure-forming sequence that reversibly forms a secondary structure masking the response-gene translation control element. Masking of the response-gene translation control element inhibits translation of the response gene. Full translation of the target gene results in unfolding of the secondary structure and consequent translation of the response gene. Translation of the target gene is determined by detecting presence of the response-gene protein product. The invention further includes RNA transcripts of the translation-coupling cassettes, vectors comprising the translation-coupling cassettes, hosts comprising the translation-coupling cassettes, methods of using the translation-coupling cassettes, and gene products produced with the translation-coupling cassettes.
The structural and functional connectivity of the grassland plant Lychnis flos-cuculi

PubMed Central

Aavik, T; Holderegger, R; Bolliger, J

2014-01-01

Understanding the relationship between structural and functional connectivity is essential for successful restoration and conservation management, particularly in intensely managed agricultural landscapes. We evaluated the relationship between structural and functional connectivity of the wetland plant Lychnis flos-cuculi in a fragmented agricultural landscape using landscape genetic and network approaches. First, we studied the effect of structural connectivity, such as geographic distance and various landscape elements (forest, agricultural land, settlements and ditch verges), on gene flow among populations as a measurement of functional connectivity. Second, we examined the effect of structural graph-theoretic connectivity measures on gene flow among populations and on genetic diversity within populations of L. flos-cuculi. Among landscape elements, forests hindered gene flow in L. flos-cuculi, whereas gene flow was independent of geographic distance. Among the structural graph-theoretic connectivity variables, only intrapopulation connectivity, which was based on population size, had a significant positive effect on gene flow, that is, more gene flow took place among larger populations. Unexpectedly, interpopulation connectivity of populations, which takes into account the spatial location and distance among populations, did not influence gene flow in L. flos-cuculi. However, higher observed heterozygosity and lower inbreeding was observed in populations characterised by higher structural interpopulation connectivity. This finding shows that a spatially coherent network of populations is significant for maintaining the genetic diversity of populations. Nevertheless, lack of significant relationships between gene flow and most of the structural connectivity measures suggests that structural connectivity does not necessarily correspond to functional connectivity. PMID:24253937
Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes.

PubMed

Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

2016-05-26

Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.
Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes

PubMed Central

Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki

2016-01-01

Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
Organization of Genes Required for the Oxidation of Methanol to Formaldehyde in Three Type II Methylotrophs

PubMed Central

Bastien, C.; Machlin, S.; Zhang, Y.; Donaldson, K.; Hanson, R. S.

1989-01-01

Restriction maps of genes required for the synthesis of active methanol dehydrogenase in Methylobacterium organophilum XX and Methylobacterium sp. strain AM1 have been completed and compared. In these two species of pink-pigmented, type II methylotrophs, 15 genes were identified that were required for the expression of methanol dehydrogenase activity. None of these genes were required for the synthesis of the prosthetic group of methanol dehydrogenase, pyrroloquinoline quinone. The structural gene required for the synthesis of cytochrome cL, an electron acceptor uniquely required for methanol dehydrogenase, and the genes encoding small basic peptides that copurified with methanol dehydrogenases were closely linked to the methanol dehydrogenase structural genes. A cloned 22-kilobase DNA insert from Methylsporovibrio methanica 81Z, an obligate type II methanotroph, complemented mutants that contained lesions in four genes closely linked to the methanol dehydrogenase structural genes. The methanol dehydrogenase and cytochrome cL structural genes were found to be transcribed independently in M. organophilum XX. Only two of the genes required for methanol dehydrogenase synthesis in this bacterium were found to be cotranscribed. PMID:16348074
Structural polymorphism at LCR and its role in beta-globin gene regulation.

PubMed

Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree

2010-09-01

Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting the gene expression by changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within LCR, could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding of the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or action of a DNA specific protein is key to create and maintain active chromatin domains and affect transcription of various tissue specific beta-globin genes. We, summarize here the current status of beta-globin LCR structure and function. Further structural studies at molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
Characterization of the Structural Gene Promoter of Aedes aegypti Densovirus

PubMed Central

Ward, Todd W.; Kimmick, Michael W.; Afanasiev, Boris N.; Carlson, Jonathan O.

2001-01-01

Aedes aegypti densonucleosis virus (AeDNV) has two promoters that have been shown to be active by reporter gene expression analysis (B. N. Afanasiev, Y. V. Koslov, J. O. Carlson, and B. J. Beaty, Exp. Parasitol. 79:322–339, 1994). Northern blot analysis of cells infected with AeDNV revealed two transcripts 1,200 and 3,500 nucleotides in length that are assumed to express the structural protein (VP) gene and nonstructural protein genes, respectively. Primer extension was used to map the transcriptional start site of the structural protein gene. Surprisingly, the structural protein gene transcript began at an initiator consensus sequence, CAGT, 60 nucleotides upstream from the map unit 61 TATAA sequence previously thought to define the promoter. Constructs with the β-galactosidase gene fused to the structural protein gene were used to determine elements necessary for promoter function. Deletion or mutation of the initiator sequence, CAGT, reduced protein expression by 93%, whereas mutation of the TATAA sequence at map unit 61 had little effect. An additional open reading frame was observed upstream of the structural protein gene that can express β-galactosidase at a low level (20% of that of VP fusions). Expression of the AeDNV structural protein gene was shown to be stimulated by the major nonstructural protein NS1 (Afanasiev et al., Exp. parasitol., 1994). To determine the sequences required for transactivation, expression of structural protein gene–β-galactosidase gene fusion constructs differing in AeDNV genome content was measured with and without NS1. The presence of NS1 led to an 8- to 10-fold increase in expression when either genomic end was present, compared to a 2-fold increase with a construct lacking the genomic ends. An even higher (37-fold) increase in expression occurred with both genomic ends present; however, this was in part due to template replication as shown by Southern blot analysis. These data indicate the location and importance of various elements necessary for efficient protein expression and transactivation from the structural protein gene promoter of AeDNV. PMID:11152505
Using deep RNA sequencing for the structural annotation of the laccaria bicolor mycorrhizal transcriptome.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Larsen, P. E.; Trivedi, G.; Sreedasyam, A.

2010-07-06

Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derivedmore » from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. 69% of expressed mycorrhizal JGI 'best' gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided that there is a sequenced genome and a set of gene models.« less
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

PubMed

Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

2016-02-27

In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses.

Ferritin gene organization: differences between plants and animals suggest possible kingdom-specific selective constraints.

PubMed

Proudhon, D; Wei, J; Briat, J; Theil, E C

1996-03-01

Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
[Mechanisms of endogenous drug resistance acquisition by spontaneous chromosomal gene mutation].

PubMed

Fukuda, H; Hiramatsu, K

1997-05-01

Endogenous resistance in bacteria is caused by a change or loss of function and generally genetically recessive. However, this type of resistance acquisition are now prevalent in clinical setting. Chromosomal genes that afford endogenous resistance are the genes correlated with the target of the drug, the drug inactivating enzymes, and permeability of the molecules including the antibacterial agents. Endogenous alteration of the drug target are mediated by the spontaneous mutation of their structural gene. This mutation provides much lower affinity of the drugs for the target. Gene expression of the inactivating enzymes, such as class C beta-lactamase, is generally regulated by regulatory genes. Spontaneous mutations in the regulatory genes cause constitutive enzyme production and provides the resistant to the agent which is usually stable for such enzymes. Spontaneous mutation in the structural gene gives the enzyme extra-spectrum substrate specificity, like ESBL (Extra-Spectrum-beta-Lactamase). Expression of structural genes encoding the permeability systems are also regulated by some regulatory genes. The spontaneous mutation of the regulatory genes reduce an amount of porin protein. This mutation causes much lower influx of the drug in the cell. Spontaneous mutation in promoter region of the structural gene of efflux protein was observed. This mutation raised the gene transcription and overproduced efflux protein. This protein progresses the drug efflux from the cell.
Organization of genes required for the oxidation of methanol to formaldehyde in three type II methylotrophs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bastien, C.; Machlin, S.; Zhang, Y.

Restriction maps of genes required for the synthesis of active methanol dehydrogenase in Methylobacterium organophilum XX and Methylobacterium sp. strain AM1 have been completed and compared. In these two species of pink-pigmented, type II methylotrophs, 15 genes were identified that were required for the expression of methanol dehydrogenase activity. None of these genes were required for the synthesis of the prosthetic group of methanol dehydrogenase, pyrroloquinoline quinone. The structural gene required for the synthesis of cytochrome c{sub L}, an electron acceptor uniquely required for methanol dehydrogenase, and the genes encoding small basic peptides that copurified with methanol dehydrogenases were closelymore » linked to the methanol dehydrogenase structural genes. A cloned 22-kilobase DNA insert from Methylsporovibrio methanica 81Z, an obligate type II methanotroph, complemented mutants that contained lesions in four genes closely linked to the methanol dehydrogenase structural genes. The methanol dehydrogenase and cytochrome c{sub L} structural genes were found to be transcribed independently in M. organophilum XX. Only two of the genes required for methanol dehydrogenase synthesis in this bacterium were found to be cotranscribed.« less
Robust Learning of High-dimensional Biological Networks with Bayesian Networks

NASA Astrophysics Data System (ADS)

Nägele, Andreas; Dejori, Mathäus; Stetter, Martin

Structure learning of Bayesian networks applied to gene expression data has become a potentially useful method to estimate interactions between genes. However, the NP-hardness of Bayesian network structure learning renders the reconstruction of the full genetic network with thousands of genes unfeasible. Consequently, the maximal network size is usually restricted dramatically to a small set of genes (corresponding with variables in the Bayesian network). Although this feature reduction step makes structure learning computationally tractable, on the downside, the learned structure might be adversely affected due to the introduction of missing genes. Additionally, gene expression data are usually very sparse with respect to the number of samples, i.e., the number of genes is much greater than the number of different observations. Given these problems, learning robust network features from microarray data is a challenging task. This chapter presents several approaches tackling the robustness issue in order to obtain a more reliable estimation of learned network features.
Evaluating bacterial gene-finding HMM structures as probabilistic logic programs.

PubMed

Mørk, Søren; Holmes, Ian

2012-03-01

Probabilistic logic programming offers a powerful way to describe and evaluate structured statistical models. To investigate the practicality of probabilistic logic programming for structure learning in bioinformatics, we undertook a simplified bacterial gene-finding benchmark in PRISM, a probabilistic dialect of Prolog. We evaluate Hidden Markov Model structures for bacterial protein-coding gene potential, including a simple null model structure, three structures based on existing bacterial gene finders and two novel model structures. We test standard versions as well as ADPH length modeling and three-state versions of the five model structures. The models are all represented as probabilistic logic programs and evaluated using the PRISM machine learning system in terms of statistical information criteria and gene-finding prediction accuracy, in two bacterial genomes. Neither of our implementations of the two currently most used model structures are best performing in terms of statistical information criteria or prediction performances, suggesting that better-fitting models might be achievable. The source code of all PRISM models, data and additional scripts are freely available for download at: http://github.com/somork/codonhmm. Supplementary data are available at Bioinformatics online.
Three-gene identity coefficients demonstrate that clonal reproduction promotes inbreeding and spatial relatedness in yellow-cedar, Callitropsis nootkatensis.

PubMed

Thompson, Stacey Lee; Bérubé, Yanik; Bruneau, Anne; Ritland, Kermit

2008-10-01

Asexual reproduction has the potential to promote population structuring through matings between clones as well as through limited dispersal of related progeny. Here we present an application of three-gene identity coefficients that tests whether clonal reproduction promotes inbreeding and spatial relatedness within populations. With this method, the first two genes are sampled to estimate pairwise relatedness or inbreeding, whereas the third gene is sampled from either a clone or a sexually derived individual. If three-gene coefficients are significantly greater for clones than nonclones, then clonality contributes excessively to genetic structure. First, we describe an estimator of three-gene identity and briefly evaluate its properties. We then use this estimator to test the effect of clonality on the genetic structure within populations of yellow-cedar (Callitropsis nootkatensis) using a molecular marker survey. Five microsatellite loci were genotyped for 485 trees sampled from nine populations. Our three-gene analyses show that clonal ramets promote inbreeding and spatial structure in most populations. Among-population correlations between clonal extent and genetic structure generally support these trends, yet with less statistical significance. Clones appear to contribute to genetic structure through the limited dispersal of offspring from replicated ramets of the same clonal genet, whereas this structure is likely maintained by mating among these relatives.
Occurrence of the structural enterocin A, P, B, L50B genes in enterococci of different origin.

PubMed

Strompfová, Viola; Lauková, Andrea; Simonová, Monika; Marcináková, Miroslava

2008-12-10

Enterococci are well-known producers of antimicrobial peptides--bacteriocins (enterocins) and the number of characterized enterocins has been significantly increased. Recently, enterocins are of great interest for their potential as biopreservatives in food or feed while research on enterocins as alternative antimicrobials in humans and animals is only at the beginning. The present study provides a survey about the occurrence of enterocin structural genes A, P, B, L50B in a target of 427 strains of Enterococcus faecium (368) and Enterococcus faecalis (59) species from different sources (animal isolates, food and feed) performed by PCR method. Based on our results, 234 strains possessed one or more enterocin structural gene(s). The genes of enterocin P and enterocin A were the most frequently detected structural genes among the PCR positive strains (170 and 155 strains, respectively). Different frequency of the enterocin genes occurrence was detected in strains according to their origin; the strains from horses and silage showed the highest frequency of enterocin genes presence. All possible combinations of the tested genes occurred at least twice except the combination of the gene of enterocin B and L50B which possessed neither strain. The gene of enterocin A was exclusively detected among E. faecium strains, while the gene of enterocin P, B, L50B were detected in strains of both species E. faecium and E. faecalis. In conclusion, a high-frequency and variability of enterocin structural genes exists among enterococci of different origin what offers a big possibility to find effective bacteriocin-producing strains for their application in veterinary medicine.
Functional understanding of the diverse exon-intron structures of human GPCR genes.

PubMed

Hammond, Dorothy A; Olman, Victor; Xu, Ying

2014-02-01

The GPCR genes have a variety of exon-intron structures even though their proteins are all structurally homologous. We have examined all human GPCR genes with at least two functional protein isoforms, totaling 199, aiming to gain an understanding of what may have contributed to the large diversity of the exon-intron structures of the GPCR genes. The 199 genes have a total of 808 known protein splicing isoforms with experimentally verified functions. Our analysis reveals that 1301 (80.6%) adjacent exon-exon pairs out of the total of 1,613 in the 199 genes have either exactly one exon skipped or the intron in-between retained in at least one of the 808 protein splicing isoforms. This observation has a statistical significance p-value of 2.051762 * e(-09), assuming that the observed splicing isoforms are independent of the exon-intron structures. Our interpretation of this observation is that the exon boundaries of the GPCR genes are not randomly determined; instead they may be selected to facilitate specific alternative splicing for functional purposes.
High-throughput RNA sequencing reveals structural differences of orthologous brain-expressed genes between western lowland gorillas and humans.

PubMed

Lipovich, Leonard; Hou, Zhuo-Cheng; Jia, Hui; Sinkler, Christopher; McGowen, Michael; Sterner, Kirstin N; Weckle, Amy; Sugalski, Amara B; Pipes, Lenore; Gatti, Domenico L; Mason, Christopher E; Sherwood, Chet C; Hof, Patrick R; Kuzawa, Christopher W; Grossman, Lawrence I; Goodman, Morris; Wildman, Derek E

2016-02-01

The human brain and human cognitive abilities are strikingly different from those of other great apes despite relatively modest genome sequence divergence. However, little is presently known about the interspecies divergence in gene structure and transcription that might contribute to these phenotypic differences. To date, most comparative studies of gene structure in the brain have examined humans, chimpanzees, and macaque monkeys. To add to this body of knowledge, we analyze here the brain transcriptome of the western lowland gorilla (Gorilla gorilla gorilla), an African great ape species that is phylogenetically closely related to humans, but with a brain that is approximately one-third the size. Manual transcriptome curation from a sample of the planum temporale region of the neocortex revealed 12 protein-coding genes and one noncoding-RNA gene with exons in the gorilla unmatched by public transcriptome data from the orthologous human loci. These interspecies gene structure differences accounted for a total of 134 amino acids in proteins found in the gorilla that were absent from protein products of the orthologous human genes. Proteins varying in structure between human and gorilla were involved in immunity and energy metabolism, suggesting their relevance to phenotypic differences. This gorilla neocortical transcriptome comprises an empirical, not homology- or prediction-driven, resource for orthologous gene comparisons between human and gorilla. These findings provide a unique repository of the sequences and structures of thousands of genes transcribed in the gorilla brain, pointing to candidate genes that may contribute to the traits distinguishing humans from other closely related great apes. © 2015 Wiley Periodicals, Inc.
Independent Gene Discovery and Testing

ERIC Educational Resources Information Center

Palsule, Vrushalee; Coric, Dijana; Delancy, Russell; Dunham, Heather; Melancon, Caleb; Thompson, Dennis; Toms, Jamie; White, Ashley; Shultz, Jeffry

2010-01-01

A clear understanding of basic gene structure is critical when teaching molecular genetics, the central dogma and the biological sciences. We sought to create a gene-based teaching project to improve students' understanding of gene structure and to integrate this into a research project that can be implemented by instructors at the secondary level…
Gene function prediction based on the Gene Ontology hierarchical structure.

PubMed

Cheng, Liangxi; Lin, Hongfei; Hu, Yuncui; Wang, Jian; Yang, Zhihao

2014-01-01

The information of the Gene Ontology annotation is helpful in the explanation of life science phenomena, and can provide great support for the research of the biomedical field. The use of the Gene Ontology is gradually affecting the way people store and understand bioinformatic data. To facilitate the prediction of gene functions with the aid of text mining methods and existing resources, we transform it into a multi-label top-down classification problem and develop a method that uses the hierarchical relationships in the Gene Ontology structure to relieve the quantitative imbalance of positive and negative training samples. Meanwhile the method enhances the discriminating ability of classifiers by retaining and highlighting the key training samples. Additionally, the top-down classifier based on a tree structure takes the relationship of target classes into consideration and thus solves the incompatibility between the classification results and the Gene Ontology structure. Our experiment on the Gene Ontology annotation corpus achieves an F-value performance of 50.7% (precision: 52.7% recall: 48.9%). The experimental results demonstrate that when the size of training set is small, it can be expanded via topological propagation of associated documents between the parent and child nodes in the tree structure. The top-down classification model applies to the set of texts in an ontology structure or with a hierarchical relationship.
Integrative structural annotation of de novo RNA-Seq provides an accurate reference gene set of the enormous genome of the onion (Allium cepa L.)

PubMed Central

Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil

2015-01-01

The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. PMID:25362073
De Novo Protein Structure Prediction

NASA Astrophysics Data System (ADS)

Hung, Ling-Hong; Ngan, Shing-Chung; Samudrala, Ram

An unparalleled amount of sequence data is being made available from large-scale genome sequencing efforts. The data provide a shortcut to the determination of the function of a gene of interest, as long as there is an existing sequenced gene with similar sequence and of known function. This has spurred structural genomic initiatives with the goal of determining as many protein folds as possible (Brenner and Levitt, 2000; Burley, 2000; Brenner, 2001; Heinemann et al., 2001). The purpose of this is twofold: First, the structure of a gene product can often lead to direct inference of its function. Second, since the function of a protein is dependent on its structure, direct comparison of the structures of gene products can be more sensitive than the comparison of sequences of genes for detecting homology. Presently, structural determination by crystallography and NMR techniques is still slow and expensive in terms of manpower and resources, despite attempts to automate the processes. Computer structure prediction algorithms, while not providing the accuracy of the traditional techniques, are extremely quick and inexpensive and can provide useful low-resolution data for structure comparisons (Bonneau and Baker, 2001). Given the immense number of structures which the structural genomic projects are attempting to solve, there would be a considerable gain even if the computer structure prediction approach were applicable to a subset of proteins.
Transcriptional Modulation of Genes Encoding Structural Characteristics of Differentiating Enterocytes During Development of a Polarized Epithelium In Vitro

PubMed Central

Halbleib, Jennifer M.; Sääf, Annika M.

2007-01-01

Although there is considerable evidence implicating posttranslational mechanisms in the development of epithelial cell polarity, little is known about the patterns of gene expression and transcriptional regulation during this process. We characterized the temporal program of gene expression during cell–cell adhesion–initiated polarization of human Caco-2 cells in tissue culture, which develop structural and functional polarity similar to that of enterocytes in vivo. A distinctive switch in gene expression patterns occurred upon formation of cell–cell contacts between neighboring cells. Expression of genes involved in cell proliferation was down-regulated concomitant with induction of genes necessary for functional specialization of polarized epithelial cells. Transcriptional up-regulation of these latter genes correlated with formation of important structural and functional features in enterocyte differentiation and establishment of structural and functional cell polarity; components of the apical microvilli were induced as the brush border formed during polarization; as barrier function was established, expression of tight junction transmembrane proteins peaked; transcripts encoding components of the apical, but not the basal-lateral trafficking machinery were increased during polarization. Coordinated expression of genes encoding components of functional cell structures were often observed indicating temporal control of expression and assembly of multiprotein complexes. PMID:17699590
Multiple genome alignment for identifying the core structure among moderately related microbial genomes.

PubMed

Uchiyama, Ikuo

2008-10-31

Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

PubMed Central

Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

1993-01-01

Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043
Integrative structural annotation of de novo RNA-Seq provides an accurate reference gene set of the enormous genome of the onion (Allium cepa L.).

PubMed

Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil

2015-02-01

The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica

PubMed Central

Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu

2017-01-01

GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple (Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression with ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight higher MdGRAS (MdGRAS6, 26, 28, 44, 53, 64, 107, and 122) genes were surfaced. Further quantitative reverse transcription PCR indicated that the candidate eight genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of eight genes were also investigated with various flowering related treatments (GA3, 6-BA, and sucrose) and different flowering varieties (Yanfu No. 6 and Nagafu No. 2). They all were affected by flowering-related circumstance and showed different expression level. Changes in response to these hormone or sugar related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees. PMID:28503152
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica.

PubMed

Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu

2017-01-01

GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple ( Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression with ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight higher MdGRAS ( MdGRAS6, 26, 28, 44, 53, 64, 107 , and 122 ) genes were surfaced. Further quantitative reverse transcription PCR indicated that the candidate eight genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of eight genes were also investigated with various flowering related treatments (GA 3 , 6-BA, and sucrose) and different flowering varieties (Yanfu No. 6 and Nagafu No. 2). They all were affected by flowering-related circumstance and showed different expression level. Changes in response to these hormone or sugar related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees.
The Isoforms of the p53 Protein

PubMed Central

Khoury, Marie P.; Bourdon, Jean-Christophe

2010-01-01

p53 is a transcription factor with a key role in the maintenance of genetic stability and therefore preventing cancer formation. It belongs to a family of genes composed of p53, p63, and p73. The p63 and p73 genes have a dual gene structure with an internal promoter in intron-3 and together with alternative splicing, can express 6 and 29 mRNA variants, respectively. Such a complex expression pattern had not been previously described for the p53 gene, which was not consistent with our understanding of the evolution of the p53 gene family. Consequently, we revisited the human p53 gene structure and established that it encodes nine different p53 protein isoforms because of alternative splicing, alternative promoter usage, and alternative initiation sites of translation. Therefore, the human p53 gene family (p53, p63, and p73) has a dual gene structure. We determined that the dual gene structure is conserved in Drosophila and in zebrafish p53 genes. The conservation through evolution of the dual gene structure suggests that the p53 isoforms play an important role in p53 tumor-suppressor activity. We and others have established that the p53 isoforms can regulate cell-fate outcome in response to stress, by modulating p53 transcriptional activity in a promoter and stress-dependent manner. We have also shown that the p53 isoforms are abnormally expressed in several types of human cancers, suggesting that they play an important role in cancer formation. The determination of p53 isoforms' expression may help to link clinical outcome to p53 status and to improve cancer patient treatment. PMID:20300206

An Introductory Bioinformatics Exercise to Reinforce Gene Structure and Expression and Analyze the Relationship between Gene and Protein Sequences

ERIC Educational Resources Information Center

Almeida, Craig A.; Tardiff, Daniel F.; De Luca, Jane P.

2004-01-01

We have developed an introductory bioinformatics exercise for sophomore biology and biochemistry students that reinforces the understanding of the structure of a gene and the principles and events involved in its expression. In addition, the activity illustrates the severe effect mutations in a gene sequence can have on the protein product.…
Covariance Structure Models for Gene Expression Microarray Data

ERIC Educational Resources Information Center

Xie, Jun; Bentler, Peter M.

2003-01-01

Covariance structure models are applied to gene expression data using a factor model, a path model, and their combination. The factor model is based on a few factors that capture most of the expression information. A common factor of a group of genes may represent a common protein factor for the transcript of the co-expressed genes, and hence, it…
CHARACTERIZATION OF THE ALKANE-INDUCIBLE CYTOCHROME P450 (P450ALK) GENE FROM THE YEAST CANDIDA TROPICALIS: IDENTIFICATION OF A NEW P450 FAMILY

EPA Science Inventory

The P450alk gene, which is inducible by the assimilation of alkane in Candida tropicalis, was sequenced and characterized. Structural features described in promoter and terminator regions of Saccharomyces yeast genes are present in the P450alk gene and some particular structures ...
Stable zymomonas mobilis xylose and arabinose fermenting strains

DOEpatents

Zhang, Min [Lakewood, CO; Chou, Yat-Chen [Taipei, TW

2008-04-08

The present invention briefly includes a transposon for stable insertion of foreign genes into a bacterial genome, comprising at least one operon having structural genes encoding enzymes selected from the group consisting of xylAxylB, araBAD and tal/tkt, and at least one promoter for expression of the structural genes in the bacterium, a pair of inverted insertion sequences, the operons contained inside the insertion sequences, and a transposase gene located outside of the insertion sequences. A plasmid shuttle vector for transformation of foreign genes into a bacterial genome, comprising at least one operon having structural genes encoding enzymes selected from the group consisting of xylAxylB, araBAD and tal/tkt, at least one promoter for expression of the structural genes in the bacterium, and at least two DNA fragments having homology with a gene in the bacterial genome to be transformed, is also provided.The transposon and shuttle vectors are useful in constructing significantly different Zymomonas mobilis strains, according to the present invention, which are useful in the conversion of the cellulose derived pentose sugars into fuels and chemicals, using traditional fermentation technology, because they are stable for expression in a non-selection medium.
Recombinant structures expand and contract inter and intragenic diversification at the KIR locus

PubMed Central

2013-01-01

Background The human KIR genes are arranged in at least six major gene-content haplotypes, all of which are combinations of four centromeric and two telomeric motifs. Several less frequent or minor haplotypes also exist, including insertions, deletions, and hybridization of KIR genes derived from the major haplotypes. These haplotype structures and their concomitant linkage disequilibrium among KIR genes suggest that more meaningful correlative data from studies of KIR genetics and complex disease may be achieved by measuring haplotypes of the KIR region in total. Results Towards that end, we developed a KIR haplotyping method that reports unambiguous combinations of KIR gene-content haplotypes, including both phase and copy number for each KIR. A total of 37 different gene content haplotypes were detected from 4,512 individuals and new sequence data was derived from haplotypes where the detailed structure was not previously available. Conclusions These new structures suggest a number of specific recombinant events during the course of KIR evolution, and add to an expanding diversity of potential new KIR haplotypes derived from gene duplication, deletion, and hybridization. PMID:23394822
Pseudoscorpion mitochondria show rearranged genes and genome-wide reductions of RNA gene sizes and inferred structures, yet typical nucleotide composition bias

PubMed Central

2012-01-01

Background Pseudoscorpions are chelicerates and have historically been viewed as being most closely related to solifuges, harvestmen, and scorpions. No mitochondrial genomes of pseudoscorpions have been published, but the mitochondrial genomes of some lineages of Chelicerata possess unusual features, including short rRNA genes and tRNA genes that lack sequence to encode arms of the canonical cloverleaf-shaped tRNA. Additionally, some chelicerates possess an atypical guanine-thymine nucleotide bias on the major coding strand of their mitochondrial genomes. Results We sequenced the mitochondrial genomes of two divergent taxa from the chelicerate order Pseudoscorpiones. We find that these genomes possess unusually short tRNA genes that do not encode cloverleaf-shaped tRNA structures. Indeed, in one genome, all 22 tRNA genes lack sequence to encode canonical cloverleaf structures. We also find that the large ribosomal RNA genes are substantially shorter than those of most arthropods. We inferred secondary structures of the LSU rRNAs from both pseudoscorpions, and find that they have lost multiple helices. Based on comparisons with the crystal structure of the bacterial ribosome, two of these helices were likely contact points with tRNA T-arms or D-arms as they pass through the ribosome during protein synthesis. The mitochondrial gene arrangements of both pseudoscorpions differ from the ancestral chelicerate gene arrangement. One genome is rearranged with respect to the location of protein-coding genes, the small rRNA gene, and at least 8 tRNA genes. The other genome contains 6 tRNA genes in novel locations. Most chelicerates with rearranged mitochondrial genes show a genome-wide reversal of the CA nucleotide bias typical for arthropods on their major coding strand, and instead possess a GT bias. Yet despite their extensive rearrangement, these pseudoscorpion mitochondrial genomes possess a CA bias on the major coding strand. Phylogenetic analyses of all 13 mitochondrial protein-coding gene sequences consistently yield trees that place pseudoscorpions as sister to acariform mites. Conclusion The well-supported phylogenetic placement of pseudoscorpions as sister to Acariformes differs from some previous analyses based on morphology. However, these two lineages share multiple molecular evolutionary traits, including substantial mitochondrial genome rearrangements, extensive nucleotide substitution, and loss of helices in their inferred tRNA and rRNA structures. PMID:22409411
Evolutionary implications of phylogenetic analyses of the gene transfer agent (GTA) of Rhodobacter capsulatus.

PubMed

Lang, Andrew S; Taylor, Terumi A; Beatty, J Thomas

2002-11-01

The gene transfer agent (GTA) of the a-proteobacterium Rhodobacter capsulatus is a cell-controlled genetic exchange vector. Genes that encode the GTA structure are clustered in a 15-kb region of the R. capsulatus chromosome, and some of these genes show sequence similarity to known bacteriophage head and tail genes. However, the production of GTA is controlled at the level of transcription by a cellular two-component signal transduction system. This paper describes homologues of both the GTA structural gene cluster and the GTA regulatory genes in the a-proteobacteria Rhodopseudomonas palustris, Rhodobacter sphaeroides, Caulobacter crescentus, Agrobacterium tumefaciens and Brucella melitensis. These sequences were used in a phylogenetic tree approach to examine the evolutionary relationships of selected GTA proteins to these homologues and (pro)phage proteins, which was compared to a 16S rRNA tree. The data indicate that a GTA-like element was present in a single progenitor of the extant species that contain both GTA structural cluster and regulatory gene homologues. The evolutionary relationships of GTA structural proteins to (pro)phage proteins indicated by the phylogenetic tree patterns suggest a predominantly vertical descent of GTA-like sequences in the a-proteobacteria and little past gene exchange with (pro)phages.
Characterization of an AGAMOUS gene expressed throughout development of the fleshy fruit-like structure produced by Ginkgo biloba around its seeds.

PubMed

Lovisetto, Alessandro; Baldan, Barbara; Pavanello, Anna; Casadoro, Giorgio

2015-07-16

The involvement of MADS-box genes of the AGAMOUS lineage in the formation of both flowers and fruits has been studied in detail in Angiosperms. AGAMOUS genes are expressed also in the reproductive structures of Gymnosperms, yet the demonstration of their role has been problematic because Gymnosperms are woody plants difficult to manipulate for physiological and genetic studies. Recently, it was shown that in the gymnosperm Ginkgo biloba an AGAMOUS gene was expressed throughout development and ripening of the fleshy fruit-like structures produced by this species around its seeds. Such fleshy structures are evolutionarily very important because they favor the dispersal of seeds through endozoochory. In this work a characterization of the Ginkgo gene was carried out by over-expressing it in tomato. In tomato plants ectopically expressing the Ginkgo AGAMOUS gene a macroscopic anomaly was observed only in the flower sepals. While the wild type sepals had a leaf-like appearance, the transgenic ones appeared connately adjoined at their proximal extremity and, concomitant with the development and ripening of the fruit, they became thicker and acquired a yellowish-orange color, thus indicating that they had undergone a homeotic transformation into carpel-like structures. Molecular analyses of several genes associated with either the control of ripening or the ripening syndrome in tomato fruits confirmed that the transgenic sepals behaved like ectopic fruits that could undergo some ripening, although the red color typical of the ripe tomato fruit was never achieved. The ectopic expression of the Ginkgo AGAMOUS gene in tomato caused the homeotic transformation of the transgenic sepals into carpel-like structures, and this showed that the gymnosperm gene has a genuine C function. In parallel with the ripening of fruits the related transgenic sepals became fleshy fruit-like structures that also underwent some ripening and such a result indicates that this C function gene might be involved, together with other gens, also in the development of the Ginkgo fruit-like structures. It seems thus strengthened the hypothesis that AGAMOUS MADS-box genes were recruited already in Gymnosperms for the development of the fleshy fruit habit which is evolutionarily so important for the dispersal of seeds.
The compositional transition of vertebrate genomes: an analysis of the secondary structure of the proteins encoded by human genes.

PubMed

D'Onofrio, Giuseppe; Ghosh, Tapash Chandra

2005-01-17

Fluctuations and increments of both C(3) and G(3) levels along the human coding sequences were investigated comparing two sets of Xenopus/human orthologous genes. The first set of genes shows minor differences of the GC(3) levels, the second shows considerable increments of the GC(3) levels in the human genes. In both data sets, the fluctuations of C(3) and G(3) levels along the coding sequences correlated with the secondary structures of the encoded proteins. The human genes that underwent the compositional transition showed a different increment of the C(3) and G(3) levels within and among the structural units of the proteins. The relative synonymous codon usage (RSCU) of several amino acids were also affected during the compositional transition, showing that there exists a correlation between RSCU and protein secondary structures in human genes. The importance of natural selection for the formation of isochore organization of the human genome has been discussed on the basis of these results.
Macro optical projection tomography for large scale 3D imaging of plant structures and gene activity

PubMed Central

Lee, Karen J. I.; Calder, Grant M.; Hindle, Christopher R.; Newman, Jacob L.; Robinson, Simon N.; Avondo, Jerome J. H. Y.

2017-01-01

Abstract Optical projection tomography (OPT) is a well-established method for visualising gene activity in plants and animals. However, a limitation of conventional OPT is that the specimen upper size limit precludes its application to larger structures. To address this problem we constructed a macro version called Macro OPT (M-OPT). We apply M-OPT to 3D live imaging of gene activity in growing whole plants and to visualise structural morphology in large optically cleared plant and insect specimens up to 60 mm tall and 45 mm deep. We also show how M-OPT can be used to image gene expression domains in 3D within fixed tissue and to visualise gene activity in 3D in clones of growing young whole Arabidopsis plants. A further application of M-OPT is to visualise plant-insect interactions. Thus M-OPT provides an effective 3D imaging platform that allows the study of gene activity, internal plant structures and plant-insect interactions at a macroscopic scale. PMID:28025317
Bioinformatics analysis of the predicted polyprenol reductase genes in higher plants

NASA Astrophysics Data System (ADS)

Basyuni, M.; Wati, R.

2018-03-01

The present study evaluates the bioinformatics methods to analyze twenty-four predicted polyprenol reductase genes from higher plants on GenBank as well as predicted the structure, composition, similarity, subcellular localization, and phylogenetic. The physicochemical properties of plant polyprenol showed diversity among the observed genes. The percentage of the secondary structure of plant polyprenol genes followed the ratio order of α helix > random coil > extended chain structure. The values of chloroplast but not signal peptide were too low, indicated that few chloroplast transit peptide in plant polyprenol reductase genes. The possibility of the potential transit peptide showed variation among the plant polyprenol reductase, suggested the importance of understanding the variety of peptide components of plant polyprenol genes. To clarify this finding, a phylogenetic tree was drawn. The phylogenetic tree shows several branches in the tree, suggested that plant polyprenol reductase genes grouped into divergent clusters in the tree.
Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach.

PubMed

Peng, Jiajie; Zhang, Xuanshuo; Hui, Weiwei; Lu, Junya; Li, Qianqian; Liu, Shuhui; Shang, Xuequn

2018-03-19

Gene Ontology (GO) is one of the most popular bioinformatics resources. In the past decade, Gene Ontology-based gene semantic similarity has been effectively used to model gene-to-gene interactions in multiple research areas. However, most existing semantic similarity approaches rely only on GO annotations and structure, or incorporate only local interactions in the co-functional network. This may lead to inaccurate GO-based similarity resulting from the incomplete GO topology structure and gene annotations. We present NETSIM2, a new network-based method that allows researchers to measure GO-based gene functional similarities by considering the global structure of the co-functional network with a random walk with restart (RWR)-based method, and by selecting the significant term pairs to decrease the noise information. Based on the EC number (Enzyme Commission)-based groups of yeast and Arabidopsis, evaluation test shows that NETSIM2 can enhance the accuracy of Gene Ontology-based gene functional similarity. Using NETSIM2 as an example, we found that the accuracy of semantic similarities can be significantly improved after effectively incorporating the global gene-to-gene interactions in the co-functional network, especially on the species that gene annotations in GO are far from complete.
Bacterial community and arsenic functional genes diversity in arsenic contaminated soils from different geographic locations

PubMed Central

Gu, Yunfu; D. Van Nostrand, Joy; Wu, Liyou; He, Zhili; Qin, Yujia; Zhao, Fang-Jie; Zhou, Jizhong

2017-01-01

To understand how soil microbial communities and arsenic (As) functional genes respond to soil arsenic (As) contamination, five soils contaminated with As at different levels were collected from diverse geographic locations, incubated for 54 days under flooded conditions, and examined by both MiSeq sequencing of 16S rRNA gene amplicons and functional gene microarray (GeoChip 4.0). The results showed that both bacterial community structure and As functional gene structure differed among geographical locations. The diversity of As functional genes correlated positively with the diversity of 16S rRNA genes (P< 0.05). Higher diversities of As functional genes and 16S rRNA genes were observed in the soils with higher available As. Soil pH, phosphate-extractable As, and amorphous Fe content were the most important factors in shaping the bacterial community structure and As transformation functional genes. Geographic location was also important in controlling both the bacterial community and As transformation functional potential. These findings provide insights into the variation of As transformation functional genes in soils contaminated with different levels of As at different geographic locations, and the impact of environmental As contamination on the soil bacterial community. PMID:28475654
The Gene Ontology (GO) project: structured vocabularies for molecular biology and their application to genome and expression analysis.

PubMed

Blake, Judith A; Harris, Midori A

2008-09-01

Scientists wishing to utilize genomic data have quickly come to realize the benefit of standardizing descriptions of experimental procedures and results for computer-driven information retrieval systems. The focus of the Gene Ontology project is three-fold. First, the project goal is to compile the Gene Ontologies: structured vocabularies describing domains of molecular biology. Second, the project supports the use of these structured vocabularies in the annotation of gene products. Third, the gene product-to-GO annotation sets are provided by participating groups to the public through open access to the GO database and Web resource. This unit describes the current ontologies and what is beyond the scope of the Gene Ontology project. It addresses the issue of how GO vocabularies are constructed and related to genes and gene products. It concludes with a discussion of how researchers can access, browse, and utilize the GO project in the course of their own research. Copyright 2008 by John Wiley & Sons, Inc.
Genetic connectivity among swarming sites in the wide ranging and recently declining little brown bat (Myotis lucifugus)

PubMed Central

Burns, Lynne E; Frasier, Timothy R; Broders, Hugh G

2014-01-01

Characterizing movement dynamics and spatial aspects of gene flow within a species permits inference on population structuring. As patterns of structuring are products of historical and current demographics and gene flow, assessment of structure through time can yield an understanding of evolutionary dynamics acting on populations that are necessary to inform management. Recent dramatic population declines in hibernating bats in eastern North America from white-nose syndrome have prompted the need for information on movement dynamics for multiple bat species. We characterized population genetic structure of the little brown bat, Myotis lucifugus, at swarming sites in southeastern Canada using 9 nuclear microsatellites and a 292-bp region of the mitochondrial genome. Analyses of FST, ΦST, and Bayesian clustering (STRUCTURE) found weak levels of genetic structure among swarming sites for the nuclear and mitochondrial genome (Global FST = 0.001, P < 0.05, Global ΦST = 0.045, P < 0.01, STRUCTURE K = 1) suggesting high contemporary gene flow. Hierarchical AMOVA also suggests little structuring at a regional (provincial) level. Metrics of nuclear genetic structure were not found to differ between males and females suggesting weak asymmetries in gene flow between the sexes. However, a greater degree of mitochondrial structuring does support male-biased dispersal long term. Demographic analyses were consistent with past population growth and suggest a population expansion occurred from approximately 1250 to 12,500 BP, following Pleistocene deglaciation in the region. Our study suggests high gene flow and thus a high degree of connectivity among bats that visit swarming sites whereby mainland areas of the region may be best considered as one large gene pool for management and conservation. PMID:25505539
Process and genes for expression and overexpression of active [FeFe] hydrogenases

DOEpatents

Seibert, Michael; King, Paul W; Ghirardi, Maria Lucia; Posewitz, Matthew C; Smolinski, Sharon L

2014-09-16

A process for expression of active [FeFe]-hydrogenase in a host organism that does not contain either the structural gene(s) for [FeFe]-hydrogenases and/or homologues for the maturation genes HydE, HydF and HyG, comprising: cloning the structural hydrogenase gene(s) and/or the maturation genes HydE, HydF and HydG from an organisms that contains these genes into expression plasmids; transferring the plasmids into an organism that lacks a native [FeFe]-hydrogenase or that has a disrupted [FeFe]-hydrogenase and culturing it aerobically; and inducing anaerobiosis to provide [FeFe] hydrogenase biosynthesis and H?2#191 production.
Dizeez: An Online Game for Human Gene-Disease Annotation

PubMed Central

Loguercio, Salvatore; Good, Benjamin M.; Su, Andrew I.

2013-01-01

Structured gene annotations are a foundation upon which many bioinformatics and statistical analyses are built. However the structured annotations available in public databases are a sparse representation of biological knowledge as a whole. The rate of biomedical data generation is such that centralized biocuration efforts struggle to keep up. New models for gene annotation need to be explored that expand the pace at which we are able to structure biomedical knowledge. Recently, online games have emerged as an effective way to recruit, engage and organize large numbers of volunteers to help address difficult biological challenges. For example, games have been successfully developed for protein folding (Foldit), multiple sequence alignment (Phylo) and RNA structure design (EteRNA). Here we present Dizeez, a simple online game built with the purpose of structuring knowledge of gene-disease associations. Preliminary results from game play online and at scientific conferences suggest that Dizeez is producing valid gene-disease annotations not yet present in any public database. These early results provide a basic proof of principle that online games can be successfully applied to the challenge of gene annotation. Dizeez is available at http://genegames.org. PMID:23951102
Structure of genes for dermaseptins B, antimicrobial peptides from frog skin. Exon 1-encoded prepropeptide is conserved in genes for peptides of highly different structures and activities.

PubMed

Vouille, V; Amiche, M; Nicolas, P

1997-09-01

We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.
A study of structural properties of gene network graphs for mathematical modeling of integrated mosaic gene networks.

PubMed

Petrovskaya, Olga V; Petrovskiy, Evgeny D; Lavrik, Inna N; Ivanisenko, Vladimir A

2017-04-01

Gene network modeling is one of the widely used approaches in systems biology. It allows for the study of complex genetic systems function, including so-called mosaic gene networks, which consist of functionally interacting subnetworks. We conducted a study of a mosaic gene networks modeling method based on integration of models of gene subnetworks by linear control functionals. An automatic modeling of 10,000 synthetic mosaic gene regulatory networks was carried out using computer experiments on gene knockdowns/knockouts. Structural analysis of graphs of generated mosaic gene regulatory networks has revealed that the most important factor for building accurate integrated mathematical models, among those analyzed in the study, is data on expression of genes corresponding to the vertices with high properties of centrality.
MAISTAS: a tool for automatic structural evaluation of alternative splicing products.

PubMed

Floris, Matteo; Raimondo, Domenico; Leoni, Guido; Orsini, Massimiliano; Marcatili, Paolo; Tramontano, Anna

2011-06-15

Analysis of the human genome revealed that the amount of transcribed sequence is an order of magnitude greater than the number of predicted and well-characterized genes. A sizeable fraction of these transcripts is related to alternatively spliced forms of known protein coding genes. Inspection of the alternatively spliced transcripts identified in the pilot phase of the ENCODE project has clearly shown that often their structure might substantially differ from that of other isoforms of the same gene, and therefore that they might perform unrelated functions, or that they might even not correspond to a functional protein. Identifying these cases is obviously relevant for the functional assignment of gene products and for the interpretation of the effect of variations in the corresponding proteins. Here we describe a publicly available tool that, given a gene or a protein, retrieves and analyses all its annotated isoforms, provides users with three-dimensional models of the isoform(s) of his/her interest whenever possible and automatically assesses whether homology derived structural models correspond to plausible structures. This information is clearly relevant. When the homology model of some isoforms of a gene does not seem structurally plausible, the implications are that either they assume a structure unrelated to that of the other isoforms of the same gene with presumably significant functional differences, or do not correspond to functional products. We provide indications that the second hypothesis is likely to be true for a substantial fraction of the cases. http://maistas.bioinformatica.crs4.it/.

SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

PubMed

Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2017-04-01

Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Molecular evolution and diversification of snake toxin genes, revealed by analysis of intron sequences.

PubMed

Fujimi, T J; Nakajyo, T; Nishimura, E; Ogura, E; Tsuchiya, T; Tamiya, T

2003-08-14

The genes encoding erabutoxin (short chain neurotoxin) isoforms (Ea, Eb, and Ec), LsIII (long chain neurotoxin) and a novel long chain neurotoxin pseudogene were cloned from a Laticauda semifasciata genomic library. Short and long chain neurotoxin genes were also cloned from the genome of Laticauda laticaudata, a closely related species of L. semifasciata, by PCR. A putative matrix attached region (MAR) sequence was found in the intron I of the LsIII gene. Comparative analysis of 11 structurally relevant snake toxin genes (three-finger-structure toxins) revealed the molecular evolution of these toxins. Three-finger-structure toxin genes diverged from a common ancestor through two types of evolutionary pathways (long and short types), early in the course of evolution. At a later stage of evolution in each gene, the accumulation of mutations in the exons, especially exon II, by accelerated evolution may have caused the increased diversification in their functions. It was also revealed that the putative MAR sequence found in the LsIII gene was integrated into the gene after the species-level divergence.
PIECE 2.0: an update for the plant gene structure comparison and evolution database

USDA-ARS?s Scientific Manuscript database

PIECE (Plant Intron Exon Comparision and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron-exon organization and provides valuable insights into the evolution of gene structure in ...
PRIMARY STRUCTURE OF THE P450 LANOSTEROL DEMETHYLASE GENE FROM SACCHAROMYCES CEREVISIAE

EPA Science Inventory

We have sequenced the structural gene and flanking regions for lanosterol 14 alpha-demethylase (14DM) from Saccharomyces cerevisiae. An open reading frame of 530 codons encodes a 60.7-kDa protein. When this gene is disrupted by integrative transformation, the resulting strain req...
The structure of the coding and 5'-flanking region of the type 1 iodothyronine deiodinase (dio1) gene is normal in a patient with suspected congenital dio1 deficiency.

PubMed

Toyoda, N; Kleinhaus, N; Larsen, P R

1996-06-01

We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.
Chromatin Configuration Determines Cell Responses to Hormone Stimuli | Center for Cancer Research

Cancer.gov

Ever since selective gene expression was established as the central driver of cell behavior, researchers have been working to understand the forces that control gene transcription. Aberrant gene expression can cause or promote many diseases, including cancer, and alterations in gene expression are the goal of many therapeutic agents. Recent work has focused on the potential role of chromatin structure as a contributor to gene regulation. Chromatin can exist in a tightly packed/inaccessible or loose/accessible configuration depending on the interactions between DNA and its associated proteins. Patterns of chromatin structure can differ between cell types and can also change within cells in response to certain signals. Cancer researchers are particularly interested in the role of chromatin in gene regulation because many of the genomic regions found to be associated with cancer risk are in open chromatin structures.
Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins.

PubMed

Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro

2008-01-03

The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Genome-wide characterization of the Pectate Lyase-like (PLL) genes in Brassica rapa.

PubMed

Jiang, Jingjing; Yao, Lina; Miao, Ying; Cao, Jiashu

2013-11-01

Pectate lyases (PL) depolymerize demethylated pectin (pectate, EC 4.2.2.2) by catalyzing the eliminative cleavage of α-1,4-glycosidic linked galacturonan. Pectate Lyase-like (PLL) genes are one of the largest and most complex families in plants. However, studies on the phylogeny, gene structure, and expression of PLL genes are limited. To understand the potential functions of PLL genes in plants, we characterized their intron-exon structure, phylogenetic relationships, and protein structures, and measured their expression patterns in various tissues, specifically the reproductive tissues in Brassica rapa. Sequence alignments revealed two characteristic motifs in PLL genes. The chromosome location analysis indicated that 18 of the 46 PLL genes were located in the least fractionated sub-genome (LF) of B. rapa, while 16 were located in the medium fractionated sub-genome (MF1) and 12 in the more fractionated sub-genome (MF2). Quantitative RT-PCR analysis showed that BrPLL genes were expressed in various tissues, with most of them being expressed in flowers. Detailed qRT-PCR analysis identified 11 pollen specific PLL genes and several other genes with unique spatial expression patterns. In addition, some duplicated genes showed similar expression patterns. The phylogenetic analysis identified three PLL gene subfamilies in plants, among which subfamily II might have evolved from gene neofunctionalization or subfunctionalization. Therefore, this study opens the possibility for exploring the roles of PLL genes during plant development.
Multicellular structures developing during maize microspore culture express endosperm and embryo-specific genes and show different embryogenic potentialities.

PubMed

Massonneau, Agnes; Coronado, Maria-José; Audran, Arthur; Bagniewska, Agnieszka; Mòl, Rafal; Testillano, Pilar S; Goralski, Grzegorz; Dumas, Christian; Risueño, Maria-Carmen; Matthys-Rochon, Elisabeth

2005-07-01

During maize pollen embryogenesis, a range of multicellular structures are formed. Using different approaches, the "nature" of these structures has been determined in terms of their embryogenic potential. In situ molecular identification techniques for gene transcripts and products, and a novel cell tracking system indicated the presence of embryogenic (embryo-like structures, ELS) and non-embryogenic (callus-like structures, CLS) structures that occurred for short periods within the cultures. Some multicellular structures with a compact appearance generated embryos. RT-PCR and fluorescence in situ hybridization (FISH) with confocal microscopy techniques using specific gene markers of the endosperm (ZmESR2, ZmAE3) and embryo (LTP2 and ZmOCL1, ZmOCL3) revealed "embryo" and "endosperm" potentialities in these various multicellular structures present in the cultures. The results presented here showed distinct and specific patterns of gene expression. Altogether, the results demonstrate the presence of different molecules on both embryonic and non-embryonic structures. Their possible roles are discussed in the context of a parallel between embryo/endosperm interactions in planta and embryonic and non-embryonic structure interrelations under in vitro conditions.
Gene-for-gene disease resistance: bridging insect pest and pathogen defense.

PubMed

Kaloshian, Isgouhi

2004-12-01

Active plant defense, also known as gene-for-gene resistance, is triggered when a plant resistance (R) gene recognizes the intrusion of a specific insect pest or pathogen. Activation of plant defense includes an array of physiological and transcriptional reprogramming. During the past decade, a large number of plant R genes that confer resistance to diverse group of pathogens have been cloned from a number of plant species. Based on predicted protein structures, these genes are classified into a small number of groups, indicating that structurally related R genes recognize phylogenetically distinct pathogens. An extreme example is the tomato Mi-1 gene, which confers resistance to potato aphid (Macrosiphum euphorbiae), whitefly (Bemisia tabaci), and root-knot nematodes (Meloidogyne spp.). While Mi-1 remains the only cloned insect R gene, there is evidence that gene-for-gene type of plant defense against piercing-sucking insects exists in a number of plant species.
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

PubMed Central

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-01-01

Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

PubMed

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-04-21

To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
Multilocus analyses indicate a mosaic distribution of hybrid populations in ground squirrels (genus Ictidomys)

PubMed Central

Thompson, Cody W; Anwarali Khan, Faisal Ali; Stangl, Frederick B; Baker, Robert J; Bradley, Robert D

2013-01-01

DNA sequence data from mitochondrial cytochrome-b (Cytb) and Y-linked structural maintenance of chromosomes (SmcY) genes were combined with 478 nuclear loci obtained from amplified fragment length polymorphisms (AFLP) to assess the extent of hybridization and genetic spatial structure of populations in two hybridizing species of ground squirrel (Ictidomys parvidens and Ictidomys tridecemlineatus). Based on AFLP analyses of 134 individuals from 28 populations, 10 populations were identified that possessed hybrid individuals. Overall estimates of FST values revealed strong support for population structure in the Cytb data set; however, analyses of the SmcY gene and the AFLP data indicated ongoing gene flow between species. Pairwise FST comparisons of populations were not significant for the SmcY gene; although they were significant for the Cytb gene, indicating that these populations were structured and that gene flow was minimal. Therefore, gene flow between I. parvidens and I. tridecemlineatus appeared to be restricted to populations that exhibited hybridization. In addition, the fragmented nature of the geographic landscape suggested limited gene flow between populations. As a result, the distributional pattern of interspersed parental and hybrid populations were compatible with a mosaic hybrid zone model. Because ground squirrels display female philopatry and male-biased dispersal, the ecology of these species is compatible with this hypothesis. PMID:24340186
Structure and expression of dna methyltransferase genes from apomictic and sexual Boechera species.

PubMed

Taşkin, Kemal Melik; Özbilen, Aslıhan; Sezer, Fatih; Hürkan, Kaan; Güneş, Şebnem

2017-04-01

In this study, we determined the structure of DNA methyltransferase (DNMT) genes in apomict and sexual Boechera species and investigated the expression levels during seed development. Protein and DNA sequences of diploid sexual Boechera stricta DNMT genes obtained from Phytozome 10.3 were used to identify the homologues in apomicts, Boechera holboellii and Boechera divaricarpa. Geneious R8 software was used to map the short-paired reads library of B. holboellii whole genome or B. divaricarpa transcriptome reads to the reference gene sequences. We determined three DNMT genes; for Boechera spp. METHYLTRANSFERASE1 (MET1), CHROMOMETHYLASE 3 (CMT3) and DOMAINS REARRANGED METHYLTRANSFERASE 1/2 (DRM2). We examined the structure of these genes with bioinformatic tools and compared with other DNMT genes in plants. We also examined the levels of expression in silique tissues after fertilization by semi-quantitative PCR. The structure of DNMT proteins in apomict and sexual Boechera species share common features. However, the expression levels of DNMT genes were different in apomict and sexual Boechera species. We found that DRM2 was upregulated in apomictic Boechera species after fertilization. Phylogenetic trees showed that three genes are conserved among green algae, monocotyledons and dicotyledons. Our results indicated a deregulation of DNA methylation machinery during seed development in apomicts. Copyright © 2016 Elsevier Ltd. All rights reserved.
Chromatin structure of the LCR in the human β-globin locus transcribing the adult δ- and β-globin genes.

PubMed

Kim, Seoyeon; Kim, Yea Woon; Shim, Sung Han; Kim, Chul Geun; Kim, Aeri

2012-03-01

The β-like globin genes are transcribed in a developmental stage specific fashion in erythroid cells. The specific transcription of globin genes is conferred by the locus control region (LCR), but the chromatin structure of the LCR in the human adult β-globin locus transcribing the δ- and β-globin genes is not clear. Here, we employed hybrid MEL cells that contain a human chromosome 11. The δ- and β-globin genes were highly transcribed in hybrid MEL/ch11 cells after transcriptional induction. LCR HS3 and HS2 were strongly occupied by erythroid specific transcriptional activators and co-factors in the induced locus. These HSs, but not HS4 and HS1, were in close proximity with the active globin genes as revealed by high resolution 3C experiments. The active features at HS3 were markedly established after transcriptional induction, while HS2 was in a relatively active conformation before the induction. Unexpectedly, HS1 did not show notable active features except histone hyperacetylation. Taken together, the LCR of the human β-globin locus transcribing the adult δ- and β-globin genes has HS specific chromatin structure. The structure at each HS, which is different from the locus transcribing the fetal globin genes, might relate to its role in transcribing the adult genes. Copyright © 2011 Elsevier Ltd. All rights reserved.
Three-Dimensional Gene Map of Cancer Cell Types: Structural Entropy Minimisation Principle for Defining Tumour Subtypes

PubMed Central

Li, Angsheng; Yin, Xianchen; Pan, Yicheng

2016-01-01

In this study, we propose a method for constructing cell sample networks from gene expression profiles, and a structural entropy minimisation principle for detecting natural structure of networks and for identifying cancer cell subtypes. Our method establishes a three-dimensional gene map of cancer cell types and subtypes. The identified subtypes are defined by a unique gene expression pattern, and a three-dimensional gene map is established by defining the unique gene expression pattern for each identified subtype for cancers, including acute leukaemia, lymphoma, multi-tissue, lung cancer and healthy tissue. Our three-dimensional gene map demonstrates that a true tumour type may be divided into subtypes, each defined by a unique gene expression pattern. Clinical data analyses demonstrate that most cell samples of an identified subtype share similar survival times, survival indicators and International Prognostic Index (IPI) scores and indicate that distinct subtypes identified by our algorithms exhibit different overall survival times, survival ratios and IPI scores. Our three-dimensional gene map establishes a high-definition, one-to-one map between the biologically and medically meaningful tumour subtypes and the gene expression patterns, and identifies remarkable cells that form singleton submodules. PMID:26842724
Genetics of bacteria that utilize one carbon compounds: Final report, March 1, 1982-February 29, 1988

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hanson, R.S.

Broad host range plasmid vectors useful for cloning genes from bacteria that grow on methane and methanol were constructed. We have cloned and mapped nineteen genes required for the growth of Methylobacterium organophilum strain XX on methanol. Nineteen genes were found in seven linkage groups on the M. organophilum genome and were separated by 40 kb or more. Eleven genes were required for the synthesis of methanol dehydrogenase (MDH) and were located in three unlinked gene clusters. The MDH structural gene was localized on a 2.5 kb DNA fragment. The gene was sequenced and contains a 175 bp untranslated leadermore » sequence, a signal sequence and the structural gene. MDH messenger RNA (mRNA) has a half life of approximately 20 min. and is present at approximately 2% of the cellular mRNA. The structural gene for the ..gamma.. subunit of methane monoxygenases has been cloned from Methylosporovibrio. Methane monooxygenase subunits have been purified by Prof. J. Lipscomb's laboratory and are being sequenced to construct DNA probes to identify cloned subunit genes. New facultative methylotrophic bacteria were isolated and characterized. Several amino acid auxotrophs have been isolated. 11 refs.« less
Identification and Analysis of the Biosynthetic Gene Cluster Encoding the Thiopeptide Antibiotic Cyclothiazomycin in Streptomyces hygroscopicus 10-22▿ †

PubMed Central

Wang, Jiang; Yu, Yi; Tang, Kexuan; Liu, Wen; He, Xinyi; Huang, Xi; Deng, Zixin

2010-01-01

Thiopeptide antibiotics are an important class of natural products resulting from posttranslational modifications of ribosomally synthesized peptides. Cyclothiazomycin is a typical thiopeptide antibiotic that has a unique bridged macrocyclic structure derived from an 18-amino-acid structural peptide. Here we reported cloning, sequencing, and heterologous expression of the cyclothiazomycin biosynthetic gene cluster from Streptomyces hygroscopicus 10-22. Remarkably, successful heterologous expression of a 22.7-kb gene cluster in Streptomyces lividans 1326 suggested that there is a minimum set of 15 open reading frames that includes all of the functional genes required for cyclothiazomycin production. Six genes of these genes, cltBCDEFG flanking the structural gene cltA, were predicted to encode the enzymes required for the main framework of cyclothiazomycin, and two enzymes encoded by a putative operon, cltMN, were hypothesized to participate in the tailoring step to generate the tertiary thioether, leading to the final cyclization of the bridged macrocyclic structure. This rigorous bioinformatics analysis based on heterologous expression of cyclothiazomycin resulted in an ideal biosynthetic model for us to understand the biosynthesis of thiopeptides. PMID:20154110
Microbial community functional structure in response to antibiotics in pharmaceutical wastewater treatment systems.

PubMed

Zhang, Yu; Xie, Jianping; Liu, Miaomiao; Tian, Zhe; He, Zhili; van Nostrand, Joy D; Ren, Liren; Zhou, Jizhong; Yang, Min

2013-10-15

It is widely demonstrated that antibiotics in the environment affect microbial community structure. However, direct evidence regarding the impacts of antibiotics on microbial functional structures in wastewater treatment systems is limited. Herein, a high-throughput functional gene array (GeoChip 3.0) in combination with quantitative PCR and clone libraries were used to evaluate the microbial functional structures in two biological wastewater treatment systems, which treat antibiotic production wastewater mainly containing oxytetracycline. Despite the bacteriostatic effects of antibiotics, the GeoChip detected almost all key functional gene categories, including carbon cycling, nitrogen cycling, etc., suggesting that these microbial communities were functionally diverse. Totally 749 carbon-degrading genes belonging to 40 groups (24 from bacteria and 16 from fungi) were detected. The abundance of several fungal carbon-degrading genes (e.g., glyoxal oxidase (glx), lignin peroxidase or ligninase (lip), manganese peroxidase (mnp), endochitinase, exoglucanase_genes) was significantly correlated with antibiotic concentrations (Mantel test; P < 0.05), showing that the fungal functional genes have been enhanced by the presence of antibiotics. However, from the fact that the majority of carbon-degrading genes were derived from bacteria and diverse antibiotic resistance genes were detected in bacteria, it was assumed that many bacteria could survive in the environment by acquiring antibiotic resistance and may have maintained the position as a main player in nutrient removal. Variance partitioning analysis showed that antibiotics could explain 24.4% of variations in microbial functional structure of the treatment systems. This study provides insights into the impacts of antibiotics on microbial functional structure of a unique system receiving antibiotic production wastewater, and reveals the potential importance of the cooperation between fungi and bacteria with antibiotic resistance in maintaining the stability and performance of the systems. Copyright © 2013 Elsevier Ltd. All rights reserved.
Prevalent Exon-Intron Structural Changes in the APETALA1/FRUITFULL, SEPALLATA, AGAMOUS-LIKE6, and FLOWERING LOCUS C MADS-Box Gene Subfamilies Provide New Insights into Their Evolution

PubMed Central

Yu, Xianxian; Duan, Xiaoshan; Zhang, Rui; Fu, Xuehao; Ye, Lingling; Kong, Hongzhi; Xu, Guixia; Shan, Hongyan

2016-01-01

AP1/FUL, SEP, AGL6, and FLC subfamily genes play important roles in flower development. The phylogenetic relationships among them, however, have been controversial, which impedes our understanding of the origin and functional divergence of these genes. One possible reason for the controversy may be the problems caused by changes in the exon-intron structure of genes, which, according to recent studies, may generate non-homologous sites and hamper the homology-based sequence alignment. In this study, we first performed exon-by-exon alignments of these and three outgroup subfamilies (SOC1, AG, and STK). Phylogenetic trees reconstructed based on these matrices show improved resolution and better congruence with species phylogeny. In the context of these phylogenies, we traced evolutionary changes of exon-intron structures in each subfamily. We found that structural changes have occurred frequently following gene duplication and speciation events. Notably, exons 7 and 8 (if present) suffered more structural changes than others. With the knowledge of exon-intron structural changes, we generated more reasonable alignments containing all the focal subfamilies. The resulting trees showed that the SEP subfamily is sister to the monophyletic group formed by AP1/FUL and FLC subfamily genes and that the AGL6 subfamily forms a sister group to the three abovementioned subfamilies. Based on this topology, we inferred the evolutionary history of exon-intron structural changes among different subfamilies. Particularly, we found that the eighth exon originated before the divergence of AP1/FUL, FLC, SEP, and AGL6 subfamilies and degenerated in the ancestral FLC-like gene. These results provide new insights into the origin and evolution of the AP1/FUL, FLC, SEP, and AGL6 subfamilies. PMID:27200066

Structural and transcriptional analysis of plant genes encoding the bifunctional lysine ketoglutarate reductase saccharopine dehydrogenase enzyme.

PubMed

Anderson, Olin D; Coleman-Derr, Devin; Gu, Yong Q; Heath, Sekou

2010-06-16

Among the dietary essential amino acids, the most severely limiting in the cereals is lysine. Since cereals make up half of the human diet, lysine limitation has quality/nutritional consequences. The breakdown of lysine is controlled mainly by the catabolic bifunctional enzyme lysine ketoglutarate reductase - saccharopine dehydrogenase (LKR/SDH). The LKR/SDH gene has been reported to produce transcripts for the bifunctional enzyme and separate monofunctional transcripts. In addition to lysine metabolism, this gene has been implicated in a number of metabolic and developmental pathways, which along with its production of multiple transcript types and complex exon/intron structure suggest an important node in plant metabolism. Understanding more about the LKR/SDH gene is thus interesting both from applied standpoint and for basic plant metabolism. The current report describes a wheat genomic fragment containing an LKR/SDH gene and adjacent genes. The wheat LKR/SDH genomic segment was found to originate from the A-genome of wheat, and EST analysis indicates all three LKR/SDH genes in hexaploid wheat are transcriptionally active. A comparison of a set of plant LKR/SDH genes suggests regions of greater sequence conservation likely related to critical enzymatic functions and metabolic controls. Although most plants contain only a single LKR/SDH gene per genome, poplar contains at least two functional bifunctional genes in addition to a monofunctional LKR gene. Analysis of ESTs finds evidence for monofunctional LKR transcripts in switchgrass, and monofunctional SDH transcripts in wheat, Brachypodium, and poplar. The analysis of a wheat LKR/SDH gene and comparative structural and functional analyses among available plant genes provides new information on this important gene. Both the structure of the LKR/SDH gene and the immediately adjacent genes show lineage-specific differences between monocots and dicots, and findings suggest variation in activity of LKR/SDH genes among plants. Although most plant genomes seem to contain a single conserved LKR/SDH gene per genome, poplar possesses multiple contiguous genes. A preponderance of SDH transcripts suggests the LKR region may be more rate-limiting. Only switchgrass has EST evidence for LKR monofunctional transcripts. Evidence for monofunctional SDH transcripts shows a novel intron in wheat, Brachypodium, and poplar.
Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

PubMed Central

Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio

2004-01-01

The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394
Piece2.0: an update for the pant gene structure comparison and evolution database

USDA-ARS?s Scientific Manuscript database

PIECE (Plant Intron Exon Comparison and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron–exon organization and provides valuable insights into the evolution of gene structure in pl...
Succession of microbial functional communities in response to a pilot-scale ethanol-blended fuel release throughout the plume life cycle.

PubMed

Ma, Jie; Deng, Ye; Yuan, Tong; Zhou, Jizhong; Alvarez, Pedro J J

2015-03-01

GeoChip, a comprehensive gene microarray, was used to examine changes in microbial functional gene structure throughout the 4-year life cycle of a pilot-scale ethanol blend plume, including 2-year continuous released followed by plume disappearance after source removal. Canonical correlation analysis (CCA) and Mantel tests showed that dissolved O2 (which was depleted within 5 days of initiating the release and rebounded 194 days after source removal) was the most influential environmental factor on community structure. Initially, the abundance of anaerobic BTEX degradation genes increased significantly while that of aerobic BTEX degradation genes decreased. Gene abundance for N fixation, nitrification, P utilization, sulfate reduction and S oxidation also increased, potentially changing associated biogeochemical cycle dynamics. After plume disappearance, most genes returned to pre-release abundance levels, but the final functional structure significantly differed from pre-release conditions. Overall, observed successions of functional structure reflected adaptive responses that were conducive to biodegradation of ethanol-blend releases. Copyright © 2015. Published by Elsevier Ltd.
Structural, functional and evolutionary characterization of major drought transcription factors families in maize

NASA Astrophysics Data System (ADS)

Mittal, Shikha; Banduni, Pooja; Mallikarjuna, Mallana G.; Rao, Atmakuri R.; Jain, Prashant A.; Dash, Prasanta K.; Thirunavukkarasu, Nepolean

2018-05-01

Drought is one of the major threats to maize production. In order to improve the production and to breed tolerant hybrids, understanding the genes and regulatory mechanisms during drought stress is important. Transcription factors (TFs) play a major role in gene regulation and many TFs have been identified in response to drought stress. In our experiment, a set of 15 major TF families comprising 1436 genes was structurally and functionally characterized using in-silico tools and a gene expression assay. All 1436 genes were mapped on 10 chromosome of maize. The functional annotation indicated the involvement of these genes in ABA signaling, ROS scavenging, photosynthesis, stomatal regulation, and sucrose metabolism. Duplication was identified as the primary force in divergence and expansion of TF families. Phylogenetic relationship was developed individually for each TF family as well as combined TF families. Phylogenetic analysis grouped the TF family of genes into TF-specific and mixed groups. Phylogenetic analysis of genes belonging to various TF families suggested that the origin of TFs occurred in the lineage of maize evolution. Gene structure analysis revealed that more number of genes were intron-rich as compared to intronless genes. Drought-responsive CRE’s such as ABREA, ABREB, DRE1 and DRECRTCOREAT have been identified. Expression and interaction analyses identified leaf-specific bZIP TF, GRMZM2G140355, as a potential contributor toward drought tolerance in maize. We also analyzed protein-protein interaction network of 269 drought-responsive genes belonging to different drought-related TFs. The information generated on structural and functional characteristics, expression and interaction of the drought-related TF families will be useful to decipher the drought tolerance mechanisms and to derive drought-tolerant genotypes in maize.
Gene-Silencing-Induced Changes in Carbohydrate Conformation in Relation to Bioenergy Value and Carbohydrate Subfractions in Modeled Plant (Medicago sativa) with Down-Regulation of HB12 and TT8 Transcription Factors

PubMed Central

Li, Xinxin; Hannoufa, Abdelali; Zhang, Yonggen; Yu, Peiqiang

2016-01-01

Gene silencing with RNA interference (RNAi) technology may be capable of modifying internal structure at a molecular level. This structural modification could affect biofunctions in terms of biodegradation, biochemical metabolism, and bioactive compound availability. The objectives of this study were to (1) Detect gene silencing-induced changes in carbohydrate molecular structure in an alfalfa forage (Medicago sativa spp. sativa: alfalfa) with down-regulation of genes that encode transcription factors TT8 and HB12; (2) Determine gene silencing-induced changes in nutrient bioutilization and bioavailability in the alfalfa forage (Medicago sativa); and (3) Quantify the correlation between gene silencing-induced molecular structure changes and the nutrient bioutilization and bioavailability in animals of ruminants. The experimental treatments included: T1 = Non-transgenic and no-gene silenced alfalfa forage (code “NT”); T2 = HB12-RNAi forage with HB12 gene down regulation (code “HB12”); T3 = TT8-RNAi forage with TT8 gene down regulation (code “TT8”). The HB12 and TT8 gene silencing-induced molecular structure changes were determined by non-invasive and non-destructive advanced molecular spectroscopy in a middle infrared radiation region that focused on structural, non-structural and total carbohydrate compounds. The nutrient bioutilization and bioavailability of the modified forage were determined using NRC-2001 system in terms of total digestive nutrient (TDN), truly digestible fiber (tdNDF), non-fiber carbohydrate (tdNDF), fatty acid (tdFA), crude protein (tdCP) and bioenergy profiles (digestible energy, metabolizable energy, net energy) for ruminants. The carbohydrate subfractions were evaluated using the updated CNCPS 6.0 system. The results showed that gene silencing significantly affected tdNFC (42.3 (NT) vs. 38.7 (HB12) vs. 37.4% Dry Matter (TT8); p = 0.016) and tdCP (20.8 (NT) vs. 19.4 (HB12) vs. 22.3% DM (TT8); p = 0.009). The gene-silencing also affected carbohydrate CA4 (7.4 (NT) vs. 4.2 (HB12) and 4.4% carbohydrate (CHO) (TT8), p = 0.063) and CB1 fractions (5.3 (NT) vs. 2.0 (HB12) and 2.6% CHO (TT8), p = 0.006). The correlation study showed that the structural CHO functional group peak area intensity at ca. 1315 cm−1 was significantly correlated to the TDN1x (r = −0.83, p = 0.042) and the tdNFC (r = −0.83, p = 0.042), the structural CHO functional group height intensity at ca. 1370 cm−1 was significantly correlated to the tdNDF (r = −0.87, p = 0.025). The A_Non-stCHO to A_StCHO ratio and A_Non-stCHO to A_CHO ratio were significantly correlated to the tdFA (r = 0.83–0.91, p < 0.05). As to carbohydrate fractions, both CA4 and CB1 correlated with carbohydrate spectral intensity of the H_1415 and the H_1315 (p = 0.039; p = 0.059, respectively), CB3 tended to correlate with the H_1150, H_1100 and H_1025 (p < 0.10). In conclusion, RNAi-mediated silencing of HB12 and TT8 modified not only inherent CHO molecular structure but also the biofunctions. The CHO molecular structure changes induced by RNAi gene silencing were associated with biofunctions in terms of the carbohydrate subfractions and nutrient digestion. PMID:27187372
Gene-Silencing-Induced Changes in Carbohydrate Conformation in Relation to Bioenergy Value and Carbohydrate Subfractions in Modeled Plant (Medicago sativa) with Down-Regulation of HB12 and TT8 Transcription Factors.

PubMed

Li, Xinxin; Hannoufa, Abdelali; Zhang, Yonggen; Yu, Peiqiang

2016-05-13

Gene silencing with RNA interference (RNAi) technology may be capable of modifying internal structure at a molecular level. This structural modification could affect biofunctions in terms of biodegradation, biochemical metabolism, and bioactive compound availability. The objectives of this study were to (1) Detect gene silencing-induced changes in carbohydrate molecular structure in an alfalfa forage (Medicago sativa spp. sativa: alfalfa) with down-regulation of genes that encode transcription factors TT8 and HB12; (2) Determine gene silencing-induced changes in nutrient bioutilization and bioavailability in the alfalfa forage (Medicago sativa); and (3) Quantify the correlation between gene silencing-induced molecular structure changes and the nutrient bioutilization and bioavailability in animals of ruminants. The experimental treatments included: T1 = Non-transgenic and no-gene silenced alfalfa forage (code "NT"); T2 = HB12-RNAi forage with HB12 gene down regulation (code "HB12"); T3 = TT8-RNAi forage with TT8 gene down regulation (code "TT8"). The HB12 and TT8 gene silencing-induced molecular structure changes were determined by non-invasive and non-destructive advanced molecular spectroscopy in a middle infrared radiation region that focused on structural, non-structural and total carbohydrate compounds. The nutrient bioutilization and bioavailability of the modified forage were determined using NRC-2001 system in terms of total digestive nutrient (TDN), truly digestible fiber (tdNDF), non-fiber carbohydrate (tdNDF), fatty acid (tdFA), crude protein (tdCP) and bioenergy profiles (digestible energy, metabolizable energy, net energy) for ruminants. The carbohydrate subfractions were evaluated using the updated CNCPS 6.0 system. The results showed that gene silencing significantly affected tdNFC (42.3 (NT) vs. 38.7 (HB12) vs. 37.4% Dry Matter (TT8); p = 0.016) and tdCP (20.8 (NT) vs. 19.4 (HB12) vs. 22.3% DM (TT8); p = 0.009). The gene-silencing also affected carbohydrate CA4 (7.4 (NT) vs. 4.2 (HB12) and 4.4% carbohydrate (CHO) (TT8), p = 0.063) and CB1 fractions (5.3 (NT) vs. 2.0 (HB12) and 2.6% CHO (TT8), p = 0.006). The correlation study showed that the structural CHO functional group peak area intensity at ca. 1315 cm(-1) was significantly correlated to the TDN1x (r = -0.83, p = 0.042) and the tdNFC (r = -0.83, p = 0.042), the structural CHO functional group height intensity at ca. 1370 cm(-1) was significantly correlated to the tdNDF (r = -0.87, p = 0.025). The A_Non-stCHO to A_StCHO ratio and A_Non-stCHO to A_CHO ratio were significantly correlated to the tdFA (r = 0.83-0.91, p < 0.05). As to carbohydrate fractions, both CA4 and CB1 correlated with carbohydrate spectral intensity of the H_1415 and the H_1315 (p = 0.039; p = 0.059, respectively), CB3 tended to correlate with the H_1150, H_1100 and H_1025 (p < 0.10). In conclusion, RNAi-mediated silencing of HB12 and TT8 modified not only inherent CHO molecular structure but also the biofunctions. The CHO molecular structure changes induced by RNAi gene silencing were associated with biofunctions in terms of the carbohydrate subfractions and nutrient digestion.
Canonical Genetic Signatures of the Adult Human Brain

PubMed Central

Hawrylycz, Michael; Miller, Jeremy A.; Menon, Vilas; Feng, David; Dolbeare, Tim; Guillozet-Bongaarts, Angela L.; Jegga, Anil G.; Aronow, Bruce J.; Lee, Chang-Kyu; Bernard, Amy; Glasser, Matthew F.; Dierker, Donna L.; Menche, Jörge; Szafer, Aaron; Collman, Forrest; Grange, Pascal; Berman, Kenneth A.; Mihalas, Stefan; Yao, Zizhen; Stewart, Lance; Barabási, Albert-László; Schulkin, Jay; Phillips, John; Ng, Lydia; Dang, Chinh; Haynor, David R.; Jones, Allan; Van Essen, David C.; Koch, Christof; Lein, Ed

2015-01-01

The structure and function of the human brain are highly stereotyped, implying a conserved molecular program responsible for its development, cellular structure, and function. We applied a correlation-based metric of “differential stability” (DS) to assess reproducibility of gene expression patterning across 132 structures in six individual brains, revealing meso-scale genetic organization. The highest DS genes are highly biologically relevant, with enrichment for brain-related biological annotations, disease associations, drug targets, and literature citations. Using high DS genes we identified 32 anatomically diverse and reproducible gene expression signatures, which represent distinct cell types, intracellular components, and/or associations with neurodevelopmental and neurodegenerative disorders. Genes in neuron-associated compared to non-neuronal networks showed higher preservation between human and mouse; however, many diversely-patterned genes displayed dramatic shifts in regulation between species. Finally, highly consistent transcriptional architecture in neocortex is correlated with resting state functional connectivity, suggesting a link between conserved gene expression and functionally relevant circuitry. PMID:26571460
The structure of the human interferon alpha/beta receptor gene.

PubMed

Lutfalla, G; Gardiner, K; Proudhon, D; Vielh, E; Uzé, G

1992-02-05

Using the cDNA coding for the human interferon alpha/beta receptor (IFNAR), the IFNAR gene has been physically mapped relative to the other loci of the chromosome 21q22.1 region. 32,906 base pairs covering the IFNAR gene have been cloned and sequenced. Primer extension and solution hybridization-ribonuclease protection have been used to determine that the transcription of the gene is initiated in a broad region of 20 base pairs. Some aspects of the polymorphism of the gene, including noncoding sequences, have been analyzed; some are allelic differences in the coding sequence that induce amino acid variations in the resulting protein. The exon structure of the IFNAR gene and of that of the available genes for the receptors of the cytokine/growth hormone/prolactin/interferon receptor family have been compared with the predictions for the secondary structure of those receptors. From this analysis, we postulate a common origin and propose an hypothesis for the divergence from the immunoglobulin superfamily.
Genome-wide analysis of the homeodomain-leucine zipper (HD-ZIP) gene family in peach (Prunus persica).

PubMed

Zhang, C H; Ma, R J; Shen, Z J; Sun, X; Korir, N K; Yu, M L

2014-04-08

In this study, 33 homeodomain-leucine zipper (HD-ZIP) genes were identified in peach using the HD-ZIP amino acid sequences of Arabidopsis thaliana as a probe. Based on the phylogenetic analysis and the individual gene or protein characteristics, the HD-ZIP gene family in peach can be classified into 4 subfamilies, HD-ZIP I, II, III, and IV, containing 14, 7, 4, and 8 members, respectively. The most closely related peach HD-ZIP members within the same subfamilies shared very similar gene structure in terms of either intron/exon numbers or lengths. Almost all members of the same subfamily shared common motif compositions, thereby implying that the HD-ZIP proteins within the same subfamily may have functional similarity. The 33 peach HD-ZIP genes were distributed across scaffolds 1 to 7. Although the primary structure varied among HD-ZIP family proteins, their tertiary structures were similar. The results from this study will be useful in selecting candidate genes from specific subfamilies for functional analysis.
Stationary and structural control in gene regulatory networks: basic concepts

NASA Astrophysics Data System (ADS)

Dougherty, Edward R.; Pal, Ranadip; Qian, Xiaoning; Bittner, Michael L.; Datta, Aniruddha

2010-01-01

A major reason for constructing gene regulatory networks is to use them as models for determining therapeutic intervention strategies by deriving ways of altering their long-run dynamics in such a way as to reduce the likelihood of entering undesirable states. In general, two paradigms have been taken for gene network intervention: (1) stationary external control is based on optimally altering the status of a control gene (or genes) over time to drive network dynamics; and (2) structural intervention involves an optimal one-time change of the network structure (wiring) to beneficially alter the long-run behaviour of the network. These intervention approaches have mainly been developed within the context of the probabilistic Boolean network model for gene regulation. This article reviews both types of intervention and applies them to reducing the metastatic competence of cells via intervention in a melanoma-related network.
Estimation of Dynamic Systems for Gene Regulatory Networks from Dependent Time-Course Data.

PubMed

Kim, Yoonji; Kim, Jaejik

2018-06-15

Dynamic system consisting of ordinary differential equations (ODEs) is a well-known tool for describing dynamic nature of gene regulatory networks (GRNs), and the dynamic features of GRNs are usually captured through time-course gene expression data. Owing to high-throughput technologies, time-course gene expression data have complex structures such as heteroscedasticity, correlations between genes, and time dependence. Since gene experiments typically yield highly noisy data with small sample size, for a more accurate prediction of the dynamics, the complex structures should be taken into account in ODE models. Hence, this study proposes an ODE model considering such data structures and a fast and stable estimation method for the ODE parameters based on the generalized profiling approach with data smoothing techniques. The proposed method also provides statistical inference for the ODE estimator and it is applied to a zebrafish retina cell network.
In silico identification and analysis of phytoene synthase genes in plants.

PubMed

Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J

2015-08-14

In this study, we examined phytoene synthetase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and those in higher plants were highly conserved. The PSY gene structures showed that the number of intron in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.
The cytological manifestation of gene amplification in multidrug-resistant mouse leukemia P388 sublines is correlated with amplicon content

DOE Office of Scientific and Technical Information (OSTI.GOV)

Il`inskaya, G.V.; Kopnin, B.P.; Demidova, N.S.

1995-10-01

Previously, we showed that development of multidrug resistance (MDR) in mouse P388 leukemia cells is often associated with the appearance of newly-formed chromosomelike structures that contain amplified copies of the mdr1 gene. In the present study, we compared amplicon content in P388 sublines showing different types of these structures. A strong correlation between the formation of specific acentric markers consisting of two identical arms and the absence of the sorcin gene coamplification was found. In all the sublines containing other types of chromosomelike structures, the sorcin gene is coamplified. 9 refs., 2 figs., 1 tab.
Exploring Demographic, Physical, and Historical Explanations for the Genetic Structure of Two Lineages of Greater Antillean Bats

PubMed Central

Muscarella, Robert A.; Murray, Kevin L.; Ortt, Derek; Russell, Amy L.; Fleming, Theodore H.

2011-01-01

Observed patterns of genetic structure result from the interactions of demographic, physical, and historical influences on gene flow. The particular strength of various factors in governing gene flow, however, may differ between species in biologically relevant ways. We investigated the role of demographic factors (population size and sex-biased dispersal) and physical features (geographic distance, island size and climatological winds) on patterns of genetic structure and gene flow for two lineages of Greater Antillean bats. We used microsatellite genetic data to estimate demographic characteristics, infer population genetic structure, and estimate gene flow among island populations of Erophylla sezekorni/E. bombifrons and Macrotus waterhousii (Chiroptera: Phyllostomidae). Using a landscape genetics approach, we asked if geographic distance, island size, or climatological winds mediate historical gene flow in this system. Samples from 13 islands spanning Erophylla's range clustered into five genetically distinct populations. Samples of M. waterhousii from eight islands represented eight genetically distinct populations. While we found evidence that a majority of historical gene flow between genetic populations was asymmetric for both lineages, we were not able to entirely rule out incomplete lineage sorting in generating this pattern. We found no evidence of contemporary gene flow except between two genetic populations of Erophylla. Both lineages exhibited significant isolation by geographic distance. Patterns of genetic structure and gene flow, however, were not explained by differences in relative effective population sizes, island area, sex-biased dispersal (tested only for Erophylla), or surface-level climatological winds. Gene flow among islands appears to be highly restricted, particularly for M. waterhousii, and we suggest that this species deserves increased taxonomic attention and conservation concern. PMID:21445291
Gymnosperm B-sister genes may be involved in ovule/seed development and, in some species, in the growth of fleshy fruit-like structures.

PubMed

Lovisetto, Alessandro; Guzzo, Flavia; Busatto, Nicola; Casadoro, Giorgio

2013-08-01

The evolution of seeds together with the mechanisms related to their dispersal into the environment represented a turning point in the evolution of plants. Seeds are produced by gymnosperms and angiosperms but only the latter have an ovary to be transformed into a fruit. Yet some gymnosperms produce fleshy structures attractive to animals, thus behaving like fruits from a functional point of view. The aim of this work is to increase our knowledge of possible mechanisms common to the development of both gymnosperm and angiosperm fruits. B-sister genes from two gymnosperms (Ginkgo biloba and Taxus baccata) were isolated and studied. The Ginkgo gene was also functionally characterized by ectopically expressing it in tobacco. In Ginkgo the fleshy structure derives from the outer seed integument and the B-sister gene is involved in its growth. In Taxus the fleshy structure is formed de novo as an outgrowth of the ovule peduncle, and the B-sister gene is not involved in this growth. In transgenic tobacco the Ginkgo gene has a positive role in tissue growth and confirms its importance in ovule/seed development. This study suggests that B-sister genes have a main function in ovule/seed development and a subsidiary role in the formation of fleshy fruit-like structures when the latter have an ovular origin, as occurs in Ginkgo. Thus, the 'fruit function' of B-sister genes is quite old, already being present in Gymnosperms as ancient as Ginkgoales, and is also present in Angiosperms where a B-sister gene has been shown to be involved in the formation of the Arabidopsis fruit.
Evolution of the Structure and Chromosomal Distribution of Histidine Biosynthetic Genes

NASA Astrophysics Data System (ADS)

Fani, Renato; Mori, Elena; Tamburini, Elena; Lazcano, Antonio

1998-10-01

A database of more than 100 histidine biosynthetic genes from different organisms belonging to the three primary domains has been analyzed, including those found in the now completely sequenced genomes of Haemophilus influenzae, Mycoplasma genitalium, Synechocystis sp., Methanococcus jannaschii, and Saccharomyces cerevisiae. The ubiquity of his genes suggests that it is a highly conserved pathway that was probably already present in the last common ancestor of all extant life. The chromosomal distribution of the his genes shows that the enterobacterial histidine operon structure is not the only possible organization, and that there is a diversity of gene arrays for the his pathway. Analysis of the available sequences shows that gene fusions (like those involved in the origin of the Escherichia coli and Salmonella typhimurium hisIE and hisB gene structures) are not universal. In contrast, the elongation event that led to the extant hisA gene from two homologous ancestral modules, as well as the subsequent paralogous duplication that originated hisF, appear to be irreversible and are conserved in all known organisms. The available evidence supports the hypothesis that histidine biosynthesis was assembled by a gene recruitment process.
Further mapping of the properdin deficiency gene in a Tunisian Jewish family--evidence for genetic homogeneity.

PubMed

Ash, S; Johnson, C; Shohat, M; Shohat, T; Schlesinger, M

1994-08-01

The properdin deficiency gene has been localized to Xp21.1-Xcen; however, it is not clear whether the mutation responsible for the disease co-maps exactly with the structural properdin gene. Based on a recent study on a total of six families, the gene was found linked to DXS255 (theta = 0.00). As only a few families have been studied, it is not known whether the same gene is responsible for the disease in all families. In order to better localize the disease gene in Israel, we studied a Tunisian Jewish family with properdin deficiency for linkage with various X-markers. A maximum lod score of 1.93 at theta = 0.00 was calculated with the DXS7 probe while there was one recombination with DXS255. This study helps to better localize the properdin deficiency gene to Xp11.3-p21.1 proximal to DXS255 locus and confirms that there is no indication of genetic heterogeneity. Whether the properdin structural gene (PFC) and properdin deficiency locus are one and the same await demonstration of mutations in the structural gene in patients with properdin deficiency.
Influences of the G2350A polymorphism in the ACE Gene on cardiac structure and function of ball game players

PubMed Central

2012-01-01

Background Except for the I/D polymorphism in the angiotensin I-converting enzyme (ACE) gene, there were few reports about the relationship between other genetic polymorphisms in this gene and the changes in cardiac structure and function of athletes. Thus, we investigated whether the G2350A polymorphism in the ACE gene is associated with the changes in cardiac structure and function of ball game players. Total 85 healthy ball game players were recruited in this study, and they were composed of 35 controls and 50 ball game players, respectively. Cardiac structure and function were measured by 2-D echocardiography, and the G2350A polymorphism in the ACE gene analyzed by the SNaPshot method. Results There were significant differences in left ventricular mass index (LVmassI) value among each sporting discipline studied. Especially in the athletes of basketball disciplines, indicated the highest LVmassI value than those of other sporting disciplines studied (p < 0.05). However, there were no significant association between any echocardiographic data and the G2350A polymorphism in the ACE gene in the both controls and ball game players. Conclusions Our data suggests that the G2350A polymorphism in the ACE gene may not significantly contribute to the changes in cardiac structure and function of ball game players, although sporting disciplines of ball game players may influence the changes in LVmassI value of these athletes. Further studies using a larger sample size and other genetic markers in the ACE gene will be needed. PMID:22239999
RNA-Seq analysis of yak ovary: improving yak gene structure information and mining reproduction-related genes.

PubMed

Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian

2014-09-01

RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments been the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200?4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.

How gene order is influenced by the biophysics of transcription regulation

PubMed Central

Kolesov, Grigory; Wunderlich, Zeba; Laikova, Olga N.; Gelfand, Mikhail S.; Mirny, Leonid A.

2007-01-01

What are the forces that shape the structure of prokaryotic genomes: the order of genes, their proximity, and their orientation? Coregulation and coordinated horizontal gene transfer are believed to promote the proximity of functionally related genes and the formation of operons. However, forces that influence the structure of the genome beyond the level of a single operon remain unknown. Here, we show that the biophysical mechanism by which regulatory proteins search for their sites on DNA can impose constraints on genome structure. Using simulations, we demonstrate that rapid and reliable gene regulation requires that the transcription factor (TF) gene be close to the site on DNA the TF has to bind, thus promoting the colocalization of TF genes and their targets on the genome. We use parameters that have been measured in recent experiments to estimate the relevant length and times scales of this process and demonstrate that the search for a cognate site may be prohibitively slow if a TF has a low copy number and is not colocalized. We also analyze TFs and their sites in a number of bacterial genomes, confirm that they are colocalized significantly more often than expected, and show that this observation cannot be attributed to the pressure for coregulation or formation of selfish gene clusters, thus supporting the role of the biophysical constraint in shaping the structure of prokaryotic genomes. Our results demonstrate how spatial organization can influence timing and noise in gene expression. PMID:17709750
Gene and domain duplication in the chordate Otx gene family: insights from amphioxus Otx.

PubMed

Williams, N A; Holland, P W

1998-05-01

We report the genomic organization and deduced protein sequence of a cephalochordate member of the Otx homeobox gene family (AmphiOtx) and show its probable single-copy state in the genome. We also present molecular phylogenetic analysis indicating that there was single ancestral Otx gene in the first chordates which was duplicated in the vertebrate lineage after it had split from the lineage leading to the cephalochordates. Duplication of a C-terminal protein domain has occurred specifically in the vertebrate lineage, strengthening the case for a single Otx gene in an ancestral chordate whose gene structure has been retained in an extant cephalochordate. Comparative analysis of protein sequences and published gene expression patterns suggest that the ancestral chordate Otx gene had roles in patterning the anterior mesendoderm and central nervous system. These roles were elaborated following Otx gene duplication in vertebrates, accompanied by regulatory and structural divergence, particularly of Otx1 descendant genes.
The importance of the specific Z-DNA structure and polyamines in carcinogenesis: fact or fiction.

PubMed

Juranic, Z; Kidric, M; Tomin, R; Juranić, I; Spuzić, I; Petrović, J

1991-08-01

In this work some aspects of carcinogenesis are given. The importance of the emergence of Z or H DNA structure in the gene, or in the flanking gene sequences for the gene deletion and unusual gene recombination, is discussed. Some considerations on the role of selective pressure (of polyamines, of Mg2+, of the various levels of topoisomerase II, and of ATP) in the process of oncogene amplification, are given too.
Expression of alcoholism-relevant genes in the liver are differently correlated to different parts of the brain.

PubMed

Wang, Lishi; Huang, Yue; Jiao, Yan; Chen, Hong; Cao, Yanhong; Bennett, Beth; Wang, Yongjun; Gu, Weikuan

2013-01-01

The purpose of this study is to investigate whether expression profiles of alcoholism-relevant genes in different parts of the brain are correlated differently with those in the liver. Four experiments were conducted. First, we used gene expression profiles from five parts of the brain (striatum, prefrontal cortex, nucleus accumbens, hippocampus, and cerebellum) and from liver in a population of recombinant inbred mouse strains to examine the expression association of 10 alcoholism-relevant genes. Second, we conducted the same association analysis between brain structures and the lung. Third, using five randomly selected, nonalcoholism-relevant genes, we conducted the association analysis between brain and liver. Finally, we compared the expression of 10 alcoholism-relevant genes in hippocampus and cerebellum between an alcohol preference strain and a wild-type control. We observed a difference in correlation patterns in expression levels of 10 alcoholism-relevant genes between different parts of the brain with those of liver. We then examined the association of gene expression between alcohol dehydrogenases (Adh1, Adh2, Adh5, and Adh7) and different parts of the brain. The results were similar to those of the 10 genes. Then, we found that the association of those genes between brain structures and lung was different from that of liver. Next, we found that the association patterns of five alcoholism-nonrelevant genes were different from those of 10 alcoholism-relevant genes. Finally, we found that the expression level of 10 alcohol-relevant genes is influenced more in hippocampus than in cerebellum in the alcohol preference strain. Our results show that the expression of alcoholism-relevant genes in liver is differently associated with the expression of genes in different parts of the brain. Because different structural changes in different parts of the brain in alcoholism have been reported, it is important to investigate whether those structural differences in the brains of those with alcoholism are due to the difference in the associations of gene expression between genes in liver and in different parts of the brain.
Comparing large covariance matrices under weak conditions on the dependence structure and its application to gene clustering.

PubMed

Chang, Jinyuan; Zhou, Wen; Zhou, Wen-Xin; Wang, Lan

2017-03-01

Comparing large covariance matrices has important applications in modern genomics, where scientists are often interested in understanding whether relationships (e.g., dependencies or co-regulations) among a large number of genes vary between different biological states. We propose a computationally fast procedure for testing the equality of two large covariance matrices when the dimensions of the covariance matrices are much larger than the sample sizes. A distinguishing feature of the new procedure is that it imposes no structural assumptions on the unknown covariance matrices. Hence, the test is robust with respect to various complex dependence structures that frequently arise in genomics. We prove that the proposed procedure is asymptotically valid under weak moment conditions. As an interesting application, we derive a new gene clustering algorithm which shares the same nice property of avoiding restrictive structural assumptions for high-dimensional genomics data. Using an asthma gene expression dataset, we illustrate how the new test helps compare the covariance matrices of the genes across different gene sets/pathways between the disease group and the control group, and how the gene clustering algorithm provides new insights on the way gene clustering patterns differ between the two groups. The proposed methods have been implemented in an R-package HDtest and are available on CRAN. © 2016, The International Biometric Society.
Soil-borne microbial functional structure across different land uses.

PubMed

Kuramae, Eiko E; Zhou, Jizhong Z; Kowalchuk, George A; van Veen, Johannes A

2014-01-01

Land use change alters the structure and composition of microbial communities. However, the links between environmental factors and microbial functions are not well understood. Here we interrogated the functional structure of soil microbial communities across different land uses. In a multivariate regression tree analysis of soil physicochemical properties and genes detected by functional microarrays, the main factor that explained the different microbial community functional structures was C : N ratio. C : N ratio showed a significant positive correlation with clay and soil pH. Fields with low C : N ratio had an overrepresentation of genes for carbon degradation, carbon fixation, metal reductase, and organic remediation categories, while fields with high C : N ratio had an overrepresentation of genes encoding dissimilatory sulfate reductase, methane oxidation, nitrification, and nitrogen fixation. The most abundant genes related to carbon degradation comprised bacterial and fungal cellulases; bacterial and fungal chitinases; fungal laccases; and bacterial, fungal, and oomycete polygalacturonases. The high number of genes related to organic remediation was probably driven by high phosphate content, while the high number of genes for nitrification was probably explained by high total nitrogen content. The functional gene diversity found in different soils did not group the sites accordingly to land management. Rather, the soil factors, C : N ratio, phosphate, and total N, were the main factors driving the differences in functional genes across the fields examined.
Predicting Gene Structure Changes Resulting from Genetic Variants via Exon Definition Features.

PubMed

Majoros, William H; Holt, Carson; Campbell, Michael S; Ware, Doreen; Yandell, Mark; Reddy, Timothy E

2018-04-25

Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed, and produce functional proteins. We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and noncoding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or noncoding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products, and we propose that they may commonly act as cryptic factors in disease. The software is available from geneprediction.org/SGRF. bmajoros@duke.edu. Supplementary information is available at Bioinformatics online.
Ribosome reinitiation at leader peptides increases translation of bacterial proteins.

PubMed

Korolev, Semen A; Zverkov, Oleg A; Seliverstov, Alexandr V; Lyubetsky, Vassily A

2016-04-16

Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.
Structure of the Elastin-Contractile Units in the Thoracic Aorta and How Genes That Cause Thoracic Aortic Aneurysms and Dissections Disrupt This Structure.

PubMed

Karimi, Ashkan; Milewicz, Dianna M

2016-01-01

The medial layer of the aorta confers elasticity and strength to the aortic wall and is composed of alternating layers of smooth muscle cells (SMCs) and elastic fibres. The SMC elastin-contractile unit is a structural unit that links the elastin fibres to the SMCs and is characterized by the following: (1) layers of elastin fibres that are surrounded by microfibrils; (2) microfibrils that bind to the integrin receptors in focal adhesions on the cell surface of the SMCs; and (3) SMC contractile filaments that are linked to the focal adhesions on the inner side of the membrane. The genes that are altered to cause thoracic aortic aneurysms and aortic dissections encode proteins involved in the structure or function of the SMC elastin-contractile unit. Included in this gene list are the genes encoding protein that are structural components of elastin fibres and microfibrils, FBN1, MFAP5, ELN, and FBLN4. Also included are genes that encode structural proteins in the SMC contractile unit, including ACTA2, which encodes SMC-specific α-actin and MYH11, which encodes SMC-specific myosin heavy chain, along with MYLK and PRKG1, which encode kinases that control SMC contraction. Finally, mutations in the gene encoding the protein linking integrin receptors to the contractile filaments, FLNA, also predispose to thoracic aortic disease. Thus, these data suggest that functional SMC elastin-contractile units are important for maintaining the structural integrity of the aorta. Copyright © 2016 Canadian Cardiovascular Society. Published by Elsevier Inc. All rights reserved.
Sequence- and Structure-Based Functional Annotation and Assessment of Metabolic Transporters in Aspergillus oryzae: A Representative Case Study

PubMed Central

Raethong, Nachon; Wong-ekkabut, Jirasak; Laoteng, Kobkul; Vongsangnak, Wanwipa

2016-01-01

Aspergillus oryzae is widely used for the industrial production of enzymes. In A. oryzae metabolism, transporters appear to play crucial roles in controlling the flux of molecules for energy generation, nutrients delivery, and waste elimination in the cell. While the A. oryzae genome sequence is available, transporter annotation remains limited and thus the connectivity of metabolic networks is incomplete. In this study, we developed a metabolic annotation strategy to understand the relationship between the sequence, structure, and function for annotation of A. oryzae metabolic transporters. Sequence-based analysis with manual curation showed that 58 genes of 12,096 total genes in the A. oryzae genome encoded metabolic transporters. Under consensus integrative databases, 55 unambiguous metabolic transporter genes were distributed into channels and pores (7 genes), electrochemical potential-driven transporters (33 genes), and primary active transporters (15 genes). To reveal the transporter functional role, a combination of homology modeling and molecular dynamics simulation was implemented to assess the relationship between sequence to structure and structure to function. As in the energy metabolism of A. oryzae, the H+-ATPase encoded by the AO090005000842 gene was selected as a representative case study of multilevel linkage annotation. Our developed strategy can be used for enhancing metabolic network reconstruction. PMID:27274991
Sequence- and Structure-Based Functional Annotation and Assessment of Metabolic Transporters in Aspergillus oryzae: A Representative Case Study.

PubMed

Raethong, Nachon; Wong-Ekkabut, Jirasak; Laoteng, Kobkul; Vongsangnak, Wanwipa

2016-01-01

Aspergillus oryzae is widely used for the industrial production of enzymes. In A. oryzae metabolism, transporters appear to play crucial roles in controlling the flux of molecules for energy generation, nutrients delivery, and waste elimination in the cell. While the A. oryzae genome sequence is available, transporter annotation remains limited and thus the connectivity of metabolic networks is incomplete. In this study, we developed a metabolic annotation strategy to understand the relationship between the sequence, structure, and function for annotation of A. oryzae metabolic transporters. Sequence-based analysis with manual curation showed that 58 genes of 12,096 total genes in the A. oryzae genome encoded metabolic transporters. Under consensus integrative databases, 55 unambiguous metabolic transporter genes were distributed into channels and pores (7 genes), electrochemical potential-driven transporters (33 genes), and primary active transporters (15 genes). To reveal the transporter functional role, a combination of homology modeling and molecular dynamics simulation was implemented to assess the relationship between sequence to structure and structure to function. As in the energy metabolism of A. oryzae, the H(+)-ATPase encoded by the AO090005000842 gene was selected as a representative case study of multilevel linkage annotation. Our developed strategy can be used for enhancing metabolic network reconstruction.
Causal gene identification using combinatorial V-structure search.

PubMed

Cai, Ruichu; Zhang, Zhenjie; Hao, Zhifeng

2013-07-01

With the advances of biomedical techniques in the last decade, the costs of human genomic sequencing and genomic activity monitoring are coming down rapidly. To support the huge genome-based business in the near future, researchers are eager to find killer applications based on human genome information. Causal gene identification is one of the most promising applications, which may help the potential patients to estimate the risk of certain genetic diseases and locate the target gene for further genetic therapy. Unfortunately, existing pattern recognition techniques, such as Bayesian networks, cannot be directly applied to find the accurate causal relationship between genes and diseases. This is mainly due to the insufficient number of samples and the extremely high dimensionality of the gene space. In this paper, we present the first practical solution to causal gene identification, utilizing a new combinatorial formulation over V-Structures commonly used in conventional Bayesian networks, by exploring the combinations of significant V-Structures. We prove the NP-hardness of the combinatorial search problem under a general settings on the significance measure on the V-Structures, and present a greedy algorithm to find sub-optimal results. Extensive experiments show that our proposal is both scalable and effective, particularly with interesting findings on the causal genes over real human genome data. Copyright © 2013 Elsevier Ltd. All rights reserved.
Ancestral multipartite units in light-responsive plant promoters have structural features correlating with specific phototransduction pathways.

PubMed Central

Argüello-Astorga, G R; Herrera-Estrella, L R

1996-01-01

Regulation of plant gene transcription by light is mediated by multipartite cis-regulatory units. Previous attempts to identify structural features that are common to all light-responsive elements (LREs) have been unsuccessful. To address the question of what is needed to confer photoresponsiveness to a promoter, the upstream sequences from more than 110 light-regulated plant genes were analyzed by a new, phylogenetic-structural method. As a result, 30 distinct conserved DNA module arrays (CMAs) associated with light-responsive promoter regions were identified. Several of these CMAs have remained invariant throughout the evolutionary radiation of angiosperms and are conserved between homologous genes as well as between members of different gene families. The identified CMAs share a gene superfamily-specific core that correlates with the particular phytochrome-dependent transduction pathway that controls their expression, i.e. ACCTA(A/C)C(A/C) for the cGMP-dependent phenylpropanoid metabolism-associated genes, and GATA(A/T)GR for the Ca2+/calmodulin-dependent photosynthesis-associated nuclear genes. In addition to suggesting a general model for the functional and structural organization of LREs, the data obtained in this study indicate that angiosperm LREs probably evolved from complex cis-acting elements involved in regulatory processes other than photoregulation in gymnosperms. PMID:8938415
Cationic liposome/DNA complexes: from structure to interactions with cellular membranes.

PubMed

Caracciolo, Giulio; Amenitsch, Heinz

2012-10-01

Gene-based therapeutic approaches are based upon the concept that, if a disease is caused by a mutation in a gene, then adding back the wild-type gene should restore regular function and attenuate the disease phenotype. To deliver the gene of interest, both viral and nonviral vectors are used. Viruses are efficient, but their application is impeded by detrimental side-effects. Among nonviral vectors, cationic liposomes are the most promising candidates for gene delivery. They form stable complexes with polyanionic DNA (lipoplexes). Despite several advantages over viral vectors, the transfection efficiency (TE) of lipoplexes is too low compared with those of engineered viral vectors. This is due to lack of knowledge about the interactions between complexes and cellular components. Rational design of efficient lipoplexes therefore requires deeper comprehension of the interactions between the vector and the DNA as well as the cellular pathways and mechanisms involved. The importance of the lipoplex structure in biological function is revealed in the application of synchrotron small-angle X-ray scattering in combination with functional TE measurements. According to current understanding, the structure of lipoplexes can change upon interaction with cellular membranes and such changes affect the delivery efficiency. Recently, a correlation between the mechanism of gene release from complexes, the structure, and the physical and chemical parameters of the complexes has been established. Studies aimed at correlating structure and activity of lipoplexes are reviewed herein. This is a fundamental step towards rational design of highly efficient lipid gene vectors.
Genome-wide identification, phylogenetic classification, and exon-intron structure characterisation of the tubulin and actin genes in flax (Linum usitatissimum).

PubMed

Pydiura, Nikolay; Pirko, Yaroslav; Galinousky, Dmitry; Postovoitova, Anastasiia; Yemets, Alla; Kilchevsky, Aleksandr; Blume, Yaroslav

2018-06-08

Flax (Linum usitatissimum L.) is a valuable food and fiber crop cultivated for its quality fiber and seed oil. α-, β-, γ-tubulins and actins are the main structural proteins of the cytoskeleton. α- and γ-tubulin and actin genes have not been characterized yet in the flax genome. In this study, we have identified 6 α-tubulin genes, 13 β-tubulin genes, 2 γ-tubulin genes, and 15 actin genes in the flax genome and analysed the phylogenetic relationships between flax and A. thaliana tubulin and actin genes. Six α-tubulin genes are represented by 3 paralogous pairs, among 13 β-tubulin genes 7 different isotypes can be distinguished, 6 of which are encoded by two paralogous genes each. γ-tubulin is represented by a paralogous pair of genes one of which may be not functional. Fifteen actin genes represent 7 paralogous pairs - 7 actin isotypes and a sequentially duplicated copy of one of the genes of one of the isotypes. Exon-intron structure analysis has shown intron length polymorphism within the β-tubulin genes and intron number variation among the α-tubulin gene: 3 or 4 introns are found in two or four genes, respectively. Intron positioning occurs at conservative sites, as observed in numerous other plant species. Flax actin genes show both intron length polymorphisms and variation in the number of intron that may be 2 or 3. These data will be useful to support further studies on the specificity, functioning, regulation and evolution of the flax cytoskeleton proteins. This article is protected by copyright. All rights reserved.
The secondary structure of the ets domain of human Fli-1 resembles that of the helix-turn-helix DNA-binding motif of the Escherichia coli catabolite gene activator protein.

PubMed Central

Liang, H; Olejniczak, E T; Mao, X; Nettesheim, D G; Yu, L; Thompson, C B; Fesik, S W

1994-01-01

The ets family of eukaryotic transcription factors is characterized by a conserved DNA-binding domain of approximately 85 amino acids for which the three-dimensional structure is not known. By using multidimensional NMR spectroscopy, we have determined the secondary structure of the ets domain of one member of this gene family, human Fli-1, both in the free form and in a complex with a 16-bp cognate DNA site. The secondary structure of the Fli-1 ets domain consists of three alpha-helices and a short four-stranded antiparallel beta-sheet. This secondary structure arrangement resembles that of the DNA-binding domain of the catabolite gene activator protein of Escherichia coli, as well as those of several eukaryotic DNA-binding proteins including histone H5, HNF-3/fork head, and the heat shock transcription factor. Differences in chemical shifts of backbone resonances and amide exchange rates between the DNA-bound and free forms of the Fli-1 ets domain suggest that the third helix is the DNA recognition helix, as in the catabolite gene activator protein and other structurally related proteins. These results suggest that the ets domain is structurally similar to the catabolite gene activator protein family of helix-turn-helix DNA-binding proteins. Images PMID:7972119
Automating gene library synthesis by structure-based combinatorial protein engineering: examples from plant sesquiterpene synthases.

PubMed

Dokarry, Melissa; Laurendon, Caroline; O'Maille, Paul E

2012-01-01

Structure-based combinatorial protein engineering (SCOPE) is a homology-independent recombination method to create multiple crossover gene libraries by assembling defined combinations of structural elements ranging from single mutations to domains of protein structure. SCOPE was originally inspired by DNA shuffling, which mimics recombination during meiosis, where mutations from parental genes are "shuffled" to create novel combinations in the resulting progeny. DNA shuffling utilizes sequence identity between parental genes to mediate template-switching events (the annealing and extension of one parental gene fragment on another) in PCR reassembly reactions to generate crossovers and hence recombination between parental genes. In light of the conservation of protein structure and degeneracy of sequence, SCOPE was developed to enable the "shuffling" of distantly related genes with no requirement for sequence identity. The central principle involves the use of oligonucleotides to encode for crossover regions to choreograph template-switching events during PCR assembly of gene fragments to create chimeric genes. This approach was initially developed to create libraries of hybrid DNA polymerases from distantly related parents, and later developed to create a combinatorial mutant library of sesquiterpene synthases to explore the catalytic landscapes underlying the functional divergence of related enzymes. This chapter presents a simplified protocol of SCOPE that can be integrated with different mutagenesis techniques and is suitable for automation by liquid-handling robots. Two examples are presented to illustrate the application of SCOPE to create gene libraries using plant sesquiterpene synthases as the model system. In the first example, we outline how to create an active-site library as a series of complex mixtures of diverse mutants. In the second example, we outline how to create a focused library as an array of individual clones to distil minimal combinations of functionally important mutations. Through these examples, the principles of the technique are illustrated and the suitability of automating various aspects of the procedure for given applications are discussed. Copyright © 2012 Elsevier Inc. All rights reserved.
Genome-wide identification of galactinol synthase (GolS) genes in Solanum lycopersicum and Brachypodium distachyon.

PubMed

Filiz, Ertugrul; Ozyigit, Ibrahim Ilker; Vatansever, Recep

2015-10-01

GolS genes stand as potential candidate genes for molecular breeding and/or engineering programs in order for improving abiotic stress tolerance in plant species. In this study, a total of six galactinol synthase (GolS) genes/proteins were retrieved for Solanum lycopersicum and Brachypodium distachyon. GolS protein sequences were identified to include glyco_transf_8 (PF01501) domain structure, and to have a close molecular weight (36.40-39.59kDa) and amino acid length (318-347 aa) with a slightly acidic pI (5.35-6.40). The sub-cellular location was mainly predicted as cytoplasmic. S. lycopersicum genes located on chr 1 and 2, and included one segmental duplication while genes of B. distachyon were only on chr 1 with one tandem duplication. GolS sequences were found to have well conserved motif structures. Cis-acting analysis was performed for three abiotic stress responsive elements, including ABA responsive element (ABRE), dehydration and cold responsive elements (DRE/CRT) and low-temperature responsive element (LTRE). ABRE elements were found in all GolS genes, except for SlGolS4; DRE/CRT was not detected in any GolS genes and LTRE element found in SlGolS1 and BdGolS1 genes. AU analysis in UTR and ORF regions indicated that SlGolS and BdGolS mRNAs may have a short half-life. SlGolS3 and SlGolS4 genes may generate more stable transcripts since they included AATTAAA motif for polyadenylation signal POLASIG2. Seconder structures of SlGolS proteins were well conserved than that of BdGolS. Some structural divergences were detected in 3D structures and predicted binding sites exhibited various patterns in GolS proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.
Full structure and insight into the gene cluster of the O-specific polysaccharide of Yersinia intermedia H9-36/83 (O:17).

PubMed

Sizova, Olga V; Shashkov, Alexander S; Kondakova, Anna N; Knirel, Yuriy A; Shaikhutdinova, Rima Z; Ivanov, Sergei A; Kislichkina, Angelina A; Kadnikova, Lidia A; Bogun, Aleksandr G; Dentovskaya, Svetlana V

2018-05-02

Lipopolysaccharide was isolated from bacteria Yersinia intermedia H9-36/83 (O:17) and degraded with mild acid to give an O-specific polysaccharide, which was isolated by GPC on Sephadex G-50 and studied by sugar analysis and 1D and 2D NMR spectroscopy. The polysaccharide was found to contain 3-deoxy-3-[(R)-3-hydroxybutanoylamino]-d-fucose (d-Fuc3NR3Hb) and the following structure of the heptasaccharide repeating unit was established: The structure established is consistent with the gene content of the O-antigen gene cluster. The O-polysaccharide structure and gene cluster of Y. intermedia are related to those of Hafnia alvei 1211 and Escherichia coli O:103. Copyright © 2018 Elsevier Ltd. All rights reserved.
Systems Mechanobiology: Tension-Inhibited Protein Turnover Is Sufficient to Physically Control Gene Circuits

PubMed Central

Dingal, P.C. Dave P.; Discher, Dennis E.

2014-01-01

Mechanotransduction pathways convert forces that stress and strain structures within cells into gene expression levels that impact development, homeostasis, and disease. The levels of some key structural proteins in the nucleus, cytoskeleton, or extracellular matrix have been recently reported to scale with tissue- and cell-level forces or mechanical properties such as stiffness, and so the mathematics of mechanotransduction becomes important to understand. Here, we show that if a given structural protein positively regulates its own gene expression, then stresses need only inhibit degradation of that protein to achieve stable, mechanosensitive gene expression. This basic use-it-or-lose-it module is illustrated by application to meshworks of nuclear lamin A, minifilaments of myosin II, and extracellular matrix collagen fibers—all of which possess filamentous coiled-coil/supercoiled structures. Past experiments not only suggest that tension suppresses protein degradation mediated and/or initiated by various enzymes but also that transcript levels vary with protein levels because key transcription factors are regulated by these structural proteins. Coupling between modules occurs within single cells and between cells in tissue, as illustrated during embryonic heart development where cardiac fibroblasts make collagen that cardiomyocytes contract. With few additional assumptions, the basic module has sufficient physics to control key structural genes in both development and disease. PMID:25468352

Identification of Enzyme Genes Using Chemical Structure Alignments of Substrate-Product Pairs.

PubMed

Moriya, Yuki; Yamada, Takuji; Okuda, Shujiro; Nakagawa, Zenichi; Kotera, Masaaki; Tokimatsu, Toshiaki; Kanehisa, Minoru; Goto, Susumu

2016-03-28

Although there are several databases that contain data on many metabolites and reactions in biochemical pathways, there is still a big gap in the numbers between experimentally identified enzymes and metabolites. It is supposed that many catalytic enzyme genes are still unknown. Although there are previous studies that estimate the number of candidate enzyme genes, these studies required some additional information aside from the structures of metabolites such as gene expression and order in the genome. In this study, we developed a novel method to identify a candidate enzyme gene of a reaction using the chemical structures of the substrate-product pair (reactant pair). The proposed method is based on a search for similar reactant pairs in a reference database and offers ortholog groups that possibly mediate the given reaction. We applied the proposed method to two experimentally validated reactions. As a result, we confirmed that the histidine transaminase was correctly identified. Although our method could not directly identify the asparagine oxo-acid transaminase, we successfully found the paralog gene most similar to the correct enzyme gene. We also applied our method to infer candidate enzyme genes in the mesaconate pathway. The advantage of our method lies in the prediction of possible genes for orphan enzyme reactions where any associated gene sequences are not determined yet. We believe that this approach will facilitate experimental identification of genes for orphan enzymes.
Cellulose synthase (CesA) genes in the green alga Mesotaenium caldariorum.

PubMed

Roberts, Alison W; Roberts, Eric M; Delmer, Deborah P

2002-12-01

Cellulose, a microfibrillar polysaccharide consisting of bundles of beta-1,4-glucan chains, is a major component of plant and most algal cell walls and is also synthesized by some prokaryotes. Seed plants and bacteria differ in the structures of their membrane terminal complexes that make cellulose and, in turn, control the dimensions of the microfibrils produced. They also differ in the domain structures of their CesA gene products (the catalytic subunit of cellulose synthase), which have been localized to terminal complexes and appear to help maintain terminal complex structure. Terminal complex structures in algae range from rosettes (plant-like) to linear forms (bacterium-like). Thus, algal CesA genes may reveal domains that control terminal complex assembly and microfibril structure. The CesA genes from the alga Mesotaenium caldariorum, a member of the order Zygnematales, which have rosette terminal complexes, are remarkably similar to seed plant CesAs, with deduced amino acid sequence identities of up to 59%. In addition to the putative transmembrane helices and the D-D-D-QXXRW motif shared by all known CesA gene products, M. caldariorum and seed plant CesAs share a region conserved among plants, an N-terminal zinc-binding domain, and a variable or class-specific region. This indicates that the domains that characterize seed plant CesAs arose prior to the evolution of land plants and may play a role in maintaining the structures of rosette terminal complexes. The CesA genes identified in M. caldariorum are the first reported for any eukaryotic alga and will provide a basis for analyzing the CesA genes of algae with different types of terminal complexes.
Innovative approaches to the use of polyamines for DNA nanoparticle preparation for gene therapy.

PubMed

Vijayanathan, Veena; Agostinelli, Enzo; Thomas, Thresia; Thomas, T J

2014-03-01

Advances in genomic technologies, such as next generation sequencing and disease specific gene targeting through anti-sense, anti-gene, siRNA and microRNA approaches require the transport of nucleic acid drugs through the cell membrane. Membrane transport of DNA/RNA drugs is an inefficient process, and the mechanism(s) by which this process occurs is not clear. A pre-requisite for effective transport of DNA and RNA in cells is their condensation to nanoparticles of ~100 nm size. Although viral vectors are effective in gene therapy, the immune response elicited by viral proteins poses a major challenge. Multivalent cations, such as natural polyamines are excellent promoters of DNA/RNA condensation to nanoparticles. During the past 20 years, our laboratory has synthesized and tested several analogs of the natural polyamine, spermine, for their efficacy to provoke DNA condensation to nanoparticles. We determined the thermodynamics of polyamine-mediated DNA condensation, measured the structural specificity effects of polyamine analogs in facilitating the cellular uptake of oligonucleotides, and evaluated the gene silencing activity of DNA nanoparticles in breast cancer cells. Polyamine-complexed oligonucleotides showed a synergistic effect on target gene inhibition at the mRNA level compared to the use of polyamines and oligonucleotides as single agents. Ionic and structural specificity effects were evident in DNA condensation and cellular transportation effects of polyamines. In condensed DNA structures, correlation exists between the attractive and repulsive forces with structurally different polyamines and cobalt hexamine, indicating the existence of a common force in stabilizing the condensed structures. Future studies aimed at defining the mechanism(s) of DNA compaction and structural features of DNA nanoparticles might aid in the development of novel gene delivery vehicles.
Full expression of Bacillus anthracis toxin gene in the presence of bicarbonate requires a 2.7-kb-long atxA mRNA that contains a terminator structure.

PubMed

Bertin, Marine; Château, Alice; Fouet, Agnès

2010-05-01

Bacillus anthracis toxin gene expression requires AtxA, a virulence regulator that also activates capsule gene transcription and controls expression of more than a hundred genes. Here we report that atxA mRNA is 2.7-kb-long and ends, after a 500 nt-long 3' untranslated region, with a stem loop structure followed by a run of U's. The presence of this structure stabilizes atxA mRNA and is necessary for AtxA maximal accumulation, full expression of the PA toxin gene, pagA and optimal PA accumulation. This structure displays terminator activity independently of its orientation when cloned between an inducible promoter and a reporter gene. The 3.6-kb-long DNA fragment carrying both AtxA promoters and the terminator is sufficient for full expression of pagA in the presence of bicarbonate. No pXO1-encoded element other than the DNA fragment encompassing the 2.7 kb atxA transcript and the pagA promoter is required for bicarbonate induction of pagA transcription. (c) 2010 Elsevier Masson SAS. All rights reserved.
Anchoring of Heterochromatin to the Nuclear Lamina Reinforces Dosage Compensation-Mediated Gene Repression.

PubMed

Snyder, Martha J; Lau, Alyssa C; Brouhard, Elizabeth A; Davis, Michael B; Jiang, Jianhao; Sifuentes, Margarita H; Csankovszki, Györgyi

2016-09-01

Higher order chromosome structure and nuclear architecture can have profound effects on gene regulation. We analyzed how compartmentalizing the genome by tethering heterochromatic regions to the nuclear lamina affects dosage compensation in the nematode C. elegans. In this organism, the dosage compensation complex (DCC) binds both X chromosomes of hermaphrodites to repress transcription two-fold, thus balancing gene expression between XX hermaphrodites and XO males. X chromosome structure is disrupted by mutations in DCC subunits. Using X chromosome paint fluorescence microscopy, we found that X chromosome structure and subnuclear localization are also disrupted when the mechanisms that anchor heterochromatin to the nuclear lamina are defective. Strikingly, the heterochromatic left end of the X chromosome is less affected than the gene-rich middle region, which lacks heterochromatic anchors. These changes in X chromosome structure and subnuclear localization are accompanied by small, but significant levels of derepression of X-linked genes as measured by RNA-seq, without any observable defects in DCC localization and DCC-mediated changes in histone modifications. We propose a model in which heterochromatic tethers on the left arm of the X cooperate with the DCC to compact and peripherally relocate the X chromosomes, contributing to gene repression.
Anchoring of Heterochromatin to the Nuclear Lamina Reinforces Dosage Compensation-Mediated Gene Repression

PubMed Central

Brouhard, Elizabeth A.; Jiang, Jianhao; Sifuentes, Margarita H.

2016-01-01

Higher order chromosome structure and nuclear architecture can have profound effects on gene regulation. We analyzed how compartmentalizing the genome by tethering heterochromatic regions to the nuclear lamina affects dosage compensation in the nematode C. elegans. In this organism, the dosage compensation complex (DCC) binds both X chromosomes of hermaphrodites to repress transcription two-fold, thus balancing gene expression between XX hermaphrodites and XO males. X chromosome structure is disrupted by mutations in DCC subunits. Using X chromosome paint fluorescence microscopy, we found that X chromosome structure and subnuclear localization are also disrupted when the mechanisms that anchor heterochromatin to the nuclear lamina are defective. Strikingly, the heterochromatic left end of the X chromosome is less affected than the gene-rich middle region, which lacks heterochromatic anchors. These changes in X chromosome structure and subnuclear localization are accompanied by small, but significant levels of derepression of X-linked genes as measured by RNA-seq, without any observable defects in DCC localization and DCC-mediated changes in histone modifications. We propose a model in which heterochromatic tethers on the left arm of the X cooperate with the DCC to compact and peripherally relocate the X chromosomes, contributing to gene repression. PMID:27690361
Computational genetic neuroanatomy of the developing mouse brain: dimensionality reduction, visualization, and clustering.

PubMed

Ji, Shuiwang

2013-07-11

The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship.
Mg2+-induced DNA compaction, condensation, and phase separation in gene delivery vehicles based on zwitterionic phospholipids: a dynamic light scattering and surface-enhanced Raman spectroscopic study.

PubMed

Süleymanoğlu, Erhan

2017-12-01

Despite the significant efforts towards applying improved non-destructive and label-free measurements of biomolecular structures of lipid-based gene delivery vectors, little is achieved in terms of their structural relevance in gene transfections. Better understanding of structure-activity relationships of lipid-DNA complexes and their gene expression efficiencies thus becomes an essential issue. Raman scattering offers a complimentary measurement technique for following the structural transitions of both DNA and lipid vesicles employed for their transfer. This work describes the use of SERS coupled with light scattering approaches for deciphering the bioelectrochemical phase formations between nucleic acids and lipid vesicles within lipoplexes and their surface parameters that could influence both the uptake of non-viral gene carriers and the endocytic routes of interacting cells. As promising non-viral alternatives of currently employed risky viral systems or highly cytotoxic cationic liposomes, complexations of both nucleic acids and zwitterionic lipids in the presence of Mg 2+ were studied applying colloidal Ag nanoparticles. It is shown that the results could be employed in further conformational characterizations of similar polyelectrolyte gene delivery systems.
Nucleosome Positioning and NDR Structure at RNA Polymerase III Promoters

NASA Astrophysics Data System (ADS)

Helbo, Alexandra Søgaard; Lay, Fides D.; Jones, Peter A.; Liang, Gangning; Grønbæk, Kirsten

2017-02-01

Chromatin is structurally involved in the transcriptional regulation of all genes. While the nucleosome positioning at RNA polymerase II (pol II) promoters has been extensively studied, less is known about the chromatin structure at pol III promoters in human cells. We use a high-resolution analysis to show substantial differences in chromatin structure of pol II and pol III promoters, and between subtypes of pol III genes. Notably, the nucleosome depleted region at the transcription start site of pol III genes extends past the termination sequences, resulting in nucleosome free gene bodies. The +1 nucleosome is located further downstream than at pol II genes and furthermore displays weak positioning. The variable position of the +1 location is seen not only within individual cell populations and between cell types, but also between different pol III promoter subtypes, suggesting that the +1 nucleosome may be involved in the transcriptional regulation of pol III genes. We find that expression and DNA methylation patterns correlate with distinct accessibility patterns, where DNA methylation associates with the silencing and inaccessibility at promoters. Taken together, this study provides the first high-resolution map of nucleosome positioning and occupancy at human pol III promoters at specific loci and genome wide.
Data on the genome-wide identification of CNL R-genes in Setaria italica (L.) P. Beauv.

PubMed

Andersen, Ethan J; Nepal, Madhav P

2017-08-01

We report data associated with the identification of 242 disease resistance genes (R-genes) in the genome of Setaria italica as presented in "Genetic diversity of disease resistance genes in foxtail millet ( Setaria italica L.)" (Andersen and Nepal, 2017) [1]. Our data describe the structure and evolution of the Coiled-coil, Nucleotide-binding site, Leucine-rich repeat (CNL) R-genes in foxtail millet. The CNL genes were identified through rigorous extraction and analysis of recently available plant genome sequences using cutting-edge analytical software. Data visualization includes gene structure diagrams, chromosomal syntenic maps, a chromosomal density plot, and a maximum-likelihood phylogenetic tree comparing Sorghum bicolor , Panicum virgatum , Setaria italica , and Arabidopsis thaliana . Compilation of InterProScan annotations, Gene Ontology (GO) annotations, and Basic Local Alignment Search Tool (BLAST) results for the 242 R-genes identified in the foxtail millet genome are also included in tabular format.
Relationships between Gene Structure and Genome Instability in Flowering Plants.

PubMed

Bennetzen, Jeffrey L; Wang, Xuewen

2018-03-05

Flowering plant (angiosperm) genomes are exceptional in their variability with respect to genome size, ploidy, chromosome number, gene content, and gene arrangement. Gene movement, although observed in some of the earliest plant genome comparisons, has been relatively underinvestigated. We present herein a description of several interesting properties of plant gene and genome structure that are pertinent to the successful movement of a gene to a new location. These considerations lead us to propose a model that can explain the frequent success of plant gene mobility, namely that Small Insulated Genes Move Around (SIGMAR). The SIGMAR model is then compared with known processes for gene mobilization, and predictions of the SIGMAR model are formulated to encourage future experimentation. The overall results indicate that the frequent gene movement in angiosperm genomes is partly an outcome of the unusual properties of angiosperm genes, especially their small size and insulation from epigenetic silencing. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Bipartite Structure of the 5s Ribosomal Gene Family in a Drosophila Melanogaster Strain, and Its Evolutionary Implications

PubMed Central

Samson, M. L.; Wegnez, M.

1988-01-01

Knowledge of multigenic family organization should provide insight into their mode of evolution. Accordingly, we characterized the 5S ribosomal gene family in the Drosophila melanogaster strain ry(506). The 5S genes in this strain display a striking HindIII restriction difference compared to the ``standard'' D. melanogaster 5S genes. The sequence of three ry(506) 5S genes was determined. We show that the HindIII restriction site heterogeneity within the ry(506) 5S family most probably results from the same point mutation, suggesting that a single 5S variant was propagated into the 5S cluster of this strain. Furthermore, we demonstrate that the structural organization of the 5S genes in ry(506) is a bipartite structure, i.e., that about 40% of the 5S genes constitute a HindIII(+)/HindIII(-) mixed cluster, while those remaining constitute an homogeneous HindIII(-) cluster. The events which might lead to such an heterogeneous pattern are discussed from an evolutionary point of view. PMID:2835290
Cloning and characterization of a Candida albicans maltase gene involved in sucrose utilization.

PubMed Central

Geber, A; Williamson, P R; Rex, J H; Sweeney, E C; Bennett, J E

1992-01-01

In order to isolate the structural gene involved in sucrose utilization, we screened a sucrose-induced Candida albicans cDNA library for clones expressing alpha-glucosidase activity. The C. albicans maltase structural gene (CAMAL2) was isolated. No other clones expressing alpha-glucosidase activity. were detected. A genomic CAMAL2 clone was obtained by screening a size-selected genomic library with the cDNA clone. DNA sequence analysis reveals that CAMAL2 encodes a 570-amino-acid protein which shares 50% identity with the maltase structural gene (MAL62) of Saccharomyces carlsbergensis. The substrate specificity of the recombinant protein purified from Escherichia coli identifies the enzyme as a maltase. Northern (RNA) analysis reveals that transcription of CAMAL2 is induced by maltose and sucrose and repressed by glucose. These results suggest that assimilation of sucrose in C. albicans relies on an inducible maltase enzyme. The family of genes controlling sucrose utilization in C. albicans shares similarities with the MAL gene family of Saccharomyces cerevisiae and provides a model system for studying gene regulation in this pathogenic yeast. Images PMID:1400249
A new fractionation assay, based on the size of formaldehyde-crosslinked, mildly sheared chromatin, delineates the chromatin structure at promoter regions

PubMed Central

Ishihara, Satoru; Varma, Rajat; Schwartz, Ronald H.

2010-01-01

To explore the higher order structure of transcribable chromatin in vivo, its local configuration was assessed through the accessibility of the chromatin to crosslinking with formaldehyde. The application of crosslinked and mildly sheared chromatin to sedimentation velocity centrifugation followed by size-fractionation of the DNA enabled us to biochemically distinguish between chromatin with heavily versus sparsely crosslinkable structures. The separated fractions showed a good correlation with gene expression profiles. Genes with poor crosslinking around the promoter region were actively transcribed, while transcripts were hardly detected from genes with extensive crosslinking in their promoter regions. For the inducible gene, Il2, the distribution of the promoter shifted in the gradient following T-cell receptor stimulation, consistent with a change in structure at this locus during activation. The kinetics of this switch preceded the chromatin change observed in a DNase I accessibility assay. Thus, this new chromatin fractionation technique has revealed a change in chromatin structure that has not been previously characterized. PMID:20371521
Investigation of mRNA quadruplex formation in Escherichia coli.

PubMed

Wieland, Markus; Hartig, Jörg S

2009-01-01

The protocol presented here allows for the investigation of the formation of unusual nucleic acid structures in the 5'-untranslated region (UTR) of bacteria by correlating gene expression levels to the in vitro stability of the respective structure. In particular, we describe the introduction of G-quadruplex forming sequences close to the ribosome-binding site (RBS) on the mRNA of a reporter gene and the subsequent read-out of the expression levels. Insertion of a stable secondary structure results in the cloaking of RBS and eventually reduced gene expression levels. The structures and stability of the introduced sequences are further characterized by circular dichroism (CD) spectroscopy and thermal melting experiments. The extent of inhibition is then correlated to the stability of the respective quadruplex structure, allowing judgement of whether factors other than thermodynamic stability affect the formation of a given quadruplex sequence in vivo. Measuring gene expression levels takes 2 d including cloning; CD experiments take 5 hours per experiment.
Characterization and engineering of the biosynthesis gene cluster for antitumor macrolides PM100117 and PM100118 from a marine actinobacteria: generation of a novel improved derivative.

PubMed

Salcedo, Raúl García; Olano, Carlos; Gómez, Cristina; Fernández, Rogelio; Braña, Alfredo F; Méndez, Carmen; de la Calle, Fernando; Salas, José A

2016-02-22

PM100117 and PM100118 are glycosylated polyketides with remarkable antitumor activity, which derive from the marine symbiotic actinobacteria Streptomyces caniferus GUA-06-05-006A. Structurally, PM100117 and PM100118 are composed of a macrocyclic lactone, three deoxysugar units and a naphthoquinone (NQ) chromophore that shows a clear structural similarity to menaquinone. Whole-genome sequencing of S. caniferus GUA-06-05-006A has enabled the identification of PM100117 and PM100118 biosynthesis gene cluster, which has been characterized on the basis of bioinformatics and genetic engineering data. The product of four genes shows high identity to proteins involved in the biosynthesis of menaquinone via futalosine. Deletion of one of these genes led to a decay in PM100117 and PM100118 production, and to the accumulation of several derivatives lacking NQ. Likewise, five additional genes have been genetically characterized to be involved in the biosynthesis of this moiety. Moreover, the generation of a mutant in a gene coding for a putative cytochrome P450 has led to the production of PM100117 and PM100118 structural analogues showing an enhanced in vitro cytotoxic activity relative to the parental products. Although a number of compounds structurally related to PM100117 and PM100118 has been discovered, this is, to our knowledge, the first insight reported into their biosynthesis. The structural resemblance of the NQ moiety to menaquinone, and the presence in the cluster of four putative menaquinone biosynthetic genes, suggests a connection between the biosynthesis pathways of both compounds. The availability of the PM100117 and PM100118 biosynthetic gene cluster will surely pave a way to the combinatorial engineering of more derivatives.
Multi-step formation, evolution, and functionalization of new cytoplasmic male sterility genes in the plant mitochondrial genomes

PubMed Central

Tang, Huiwu; Zheng, Xingmei; Li, Chuliang; Xie, Xianrong; Chen, Yuanling; Chen, Letian; Zhao, Xiucai; Zheng, Huiqi; Zhou, Jiajian; Ye, Shan; Guo, Jingxin; Liu, Yao-Guang

2017-01-01

New gene origination is a major source of genomic innovations that confer phenotypic changes and biological diversity. Generation of new mitochondrial genes in plants may cause cytoplasmic male sterility (CMS), which can promote outcrossing and increase fitness. However, how mitochondrial genes originate and evolve in structure and function remains unclear. The rice Wild Abortive type of CMS is conferred by the mitochondrial gene WA352c (previously named WA352) and has been widely exploited in hybrid rice breeding. Here, we reconstruct the evolutionary trajectory of WA352c by the identification and analyses of 11 mitochondrial genomic recombinant structures related to WA352c in wild and cultivated rice. We deduce that these structures arose through multiple rearrangements among conserved mitochondrial sequences in the mitochondrial genome of the wild rice Oryza rufipogon, coupled with substoichiometric shifting and sequence variation. We identify two expressed but nonfunctional protogenes among these structures, and show that they could evolve into functional CMS genes via sequence variations that could relieve the self-inhibitory potential of the proteins. These sequence changes would endow the proteins the ability to interact with the nucleus-encoded mitochondrial protein COX11, resulting in premature programmed cell death in the anther tapetum and male sterility. Furthermore, we show that the sequences that encode the COX11-interaction domains in these WA352c-related genes have experienced purifying selection during evolution. We propose a model for the formation and evolution of new CMS genes via a “multi-recombination/protogene formation/functionalization” mechanism involving gradual variations in the structure, sequence, copy number, and function. PMID:27725674
Genetics of bacteria that oxidize one-carbon compounds. Progress report, March 1, 1991--June 30, 1993

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hanson, R.S.

In the past several years researchers have identified at least 20 genes whose products were required for the oxidation of methanol to formaldehyde in three different facultative methylotrophic bacteria. These genes include structural genes for a cytochrome c{sub L} (mox G) and is a specific electron acceptor for methanol dehydrogenase (MDH), and the two structural genes that encode the large subunit (mox F) and smaller subunit (mox I) of MDH. Other genes are required for the synthesis of the prosthetic group of MDH, Pyrroloquinoline quinone (PQQ), and proteins required for assembly of the active MDH in the periplasm. Three genesmore » are believed to be required for incorporation of calcium into the MDH tetramer. The principal investigator`s group has studied the regulation of methanol oxidation in the pink-pigmented-facultative methylotroph Methylobacterium organophilum XX. The authors have mapped several genes and have sequenced the mox F gene and sequences upstream of mox F. The authors had tentatively identified several genes required for the transcription of the MDH structural genes in three methylotrophs. In the previous proposal, the P.I. proposed to establish an in-vitro transcription/translation system to study the function of the regulatory gene products. Further studies demonstrated that the regulation of transcription of these genes was far more complex than imagined at that time and the research plan was modified to determine the number and function of the regulatory genes using genetic approaches.« less
Herbicide targets and detoxification proteins in sugarcane: from gene assembly to structure modelling.

PubMed

Lloyd Evans, Dyfed; Joshi, Shailesh Vinay

2017-07-01

In a genome context, sugarcane is a classic orphan crop, in that no genome and only very few genes have been assembled. We have devised a novel exome assembly methodology that has allowed us to assemble and characterize 49 genes that serve as herbicide targets, safener interacting proteins, and members of herbicide detoxification pathways within the sugarcane genome. We have structurally modelled the products of each of these genes, as well as determining allelic, genomic, and RNA-Seq based polymorphisms for each gene. This study provides the largest collection of sugarcane structures modelled to date. We demonstrate that sugarcane genes are highly polymorphic, revealing that each genotype is evolving both uniquely and independently. In addition, we present an exome assembly system for orphan crops that can be executed on commodity infrastructure, making exome assembly practical for any group. In terms of knowledge about herbicide modes of action and detoxification, we have advanced sugarcane from a crop where no information about any herbicide-associated gene was available to the situation where sugarcane is now a species with the single largest collection of known and annotated herbicide-associated genes.
Simulation of gene evolution under directional mutational pressure

NASA Astrophysics Data System (ADS)

Dudkiewicz, Małgorzata; Mackiewicz, Paweł; Kowalczuk, Maria; Mackiewicz, Dorota; Nowicka, Aleksandra; Polak, Natalia; Smolarczyk, Kamila; Banaszak, Joanna; R. Dudek, Mirosław; Cebrat, Stanisław

2004-05-01

The two main mechanisms generating the genetic diversity, mutation and recombination, have random character but they are biased which has an effect on the generation of asymmetry in the bacterial chromosome structure and in the protein coding sequences. Thus, like in a case of two chiral molecules-the two possible orientations of a gene in relation to the topology of a chromosome are not equivalent. Assuming that the sequence of a gene may oscillate only between certain limits of its structural composition means that the gene could be forced out of these limits by the directional mutation pressure, in the course of evolution. The probability of the event depends on the time the gene stays under the same mutation pressure. Inversion of the gene changes the directional mutational pressure to the reciprocal one and hence it changes the distance of the gene to its lower and upper bound of the structural tolerance. Using Monte Carlo methods we were able to simulate the evolution of genes under experimentally found mutational pressure, assuming simple mechanisms of selection. We found that the mutation and recombination should work in accordance to lower their negative effects on the function of the products of coding sequences.

Do Structural Missense Variants in the ATM Gene Found in Women With Breast Cancer Cause Breast Cancer in Knock-in Mouse Strains?

DTIC Science & Technology

2006-04-01

W81XWH-05-1-0282 TITLE: Do Structural Missense Variants in the ATM Gene Found in Women with Breast Cancer Cause Breast Cancer in "Knock-in...5a. CONTRACT NUMBER Do Structural Missense Variants in the ATM Gene Found in Women with Breast Cancer Cause Breast Cancer in "Knock-in" Mouse...human cohort-specific missense mutations will develop breast cancer with dominant inheritance in a subset of animals. It also is hypothesized that
Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs

PubMed Central

Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv

2010-01-01

RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462
Linking structural biology with genome research: Beamlines for the Berlin ``Protein Structure Factory'' initiative

NASA Astrophysics Data System (ADS)

Illing, Gerd; Saenger, Wolfram; Heinemann, Udo

2000-06-01

The Protein Structure Factory will be established to characterize proteins encoded by human genes or cDNAs, which will be selected by criteria of potential structural novelty or medical or biotechnological usefulness. It represents an integrative approach to structure analysis combining bioinformatics techniques, automated gene expression and purification of gene products, generation of a biophysical fingerprint of the proteins and the determination of their three-dimensional structures either by NMR spectroscopy or by X-ray diffraction. The use of synchrotron radiation will be crucial to the Protein Structure Factory: high brilliance and tunable wavelengths are prerequisites for fast data collection, the use of small crystals and multiwavelength anomalous diffraction (MAD) phasing. With the opening of BESSY II, direct access to a third-generation XUV storage ring source with excellent conditions is available nearby. An insertion device with two MAD beamlines and one constant energy station will be set up until 2001.
Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

PubMed

Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

2012-01-01

Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
An RNAi Screen for Genes Involved in Nanoscale Protrusion Formation on Corneal Lens in Drosophila melanogaster.

PubMed

Minami, Ryunosuke; Sato, Chiaki; Yamahama, Yumi; Kubo, Hideo; Hariyama, Takahiko; Kimura, Ken-Ichi

2016-12-01

The "moth-eye" structure, which is observed on the surface of corneal lens in several insects, supports anti-reflective and self-cleaning functions due to nanoscale protrusions known as corneal nipples. Although the morphology and function of the "moth-eye" structure, are relatively well studied, the mechanism of protrusion formation from cell-secreted substances is unknown. In Drosophila melanogaster, a compound eye consists of approximately 800 facets, the surface of which is formed by the corneal lens with nanoscale protrusions. In the present study, we sought to identify genes involved in "moth-eye" structure, formation in order to elucidate the developmental mechanism of the protrusions in Drosophila. We re-examined the aberrant patterns in classical glossy-eye mutants by scanning electron microscope and classified the aberrant patterns into groups. Next, we screened genes encoding putative structural cuticular proteins and genes involved in cuticular formation using eye specific RNAi silencing methods combined with the Gal4/UAS expression system. We identified 12 of 100 candidate genes, such as cuticular proteins family genes (Cuticular protein 23B and Cuticular protein 49Ah), cuticle secretion-related genes (Syntaxin 1A and Sec61 ββ subunit), ecdysone signaling and biosynthesis-related genes (Ecdysone receptor, Blimp-1, and shroud), and genes involved in cell polarity/cell architecture (Actin 5C, shotgun, armadillo, discs large1, and coracle). Although some of the genes we identified may affect corneal protrusion formation indirectly through general patterning defects in eye formation, these initial findings have encouraged us to more systematically explore the precise mechanisms underlying the formation of nanoscale protrusions in Drosophila.
Adaptive expansion of the maize maternally expressed gene (Meg) family involves changes in expression patterns and protein secondary structures of its members

PubMed Central

2014-01-01

Background The Maternally expressed gene (Meg) family is a locally-duplicated gene family of maize which encodes cysteine-rich proteins (CRPs). The founding member of the family, Meg1, is required for normal development of the basal endosperm transfer cell layer (BETL) and is involved in the allocation of maternal nutrients to growing seeds. Despite the important roles of Meg1 in maize seed development, the evolutionary history of the Meg cluster and the activities of the duplicate genes are not understood. Results In maize, the Meg gene cluster resides in a 2.3 Mb-long genomic region that exhibits many features of non-centromeric heterochromatin. Using phylogenetic reconstruction and syntenic alignments, we identified the pedigree of the Meg family, in which 11 of its 13 members arose in maize after allotetraploidization ~4.8 mya. Phylogenetic and population-genetic analyses identified possible signatures suggesting recent positive selection in Meg homologs. Structural analyses of the Meg proteins indicated potentially adaptive changes in secondary structure from α-helix to β-strand during the expansion. Transcriptomic analysis of the maize endosperm indicated that 6 Meg genes are selectively activated in the BETL, and younger Meg genes are more active than older ones. In endosperms from B73 by Mo17 reciprocal crosses, most Meg genes did not display parent-specific expression patterns. Conclusions Recently-duplicated Meg genes have different protein secondary structures, and their expressions in the BETL dominate over those of older members. Together with the signs of positive selections in the young Meg genes, these results suggest that the expansion of the Meg family involves potentially adaptive transitions in which new members with novel functions prevailed over older members. PMID:25084677
Comparative and Evolutionary Analysis of Grass Pollen Allergens Using Brachypodium distachyon as a Model System.

PubMed

Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan

2017-01-01

Comparative genomics have facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from the model to non-model organisms and insights can be gained in the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression profiles were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma originated as a pollen specific orphan gene in a common grass ancestor of Brachypodium and Triticiae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack or contain reduced number of introns. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins etc) for the first time in the model plant Brachypodium. Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species.
Comparative Analysis of Syntenic Genes in Grass Genomes Reveals Accelerated Rates of Gene Structure and Coding Sequence Evolution in Polyploid Wheat1[W][OA

PubMed Central

Akhunov, Eduard D.; Sehgal, Sunish; Liang, Hanquan; Wang, Shichen; Akhunova, Alina R.; Kaur, Gaganpreet; Li, Wanlong; Forrest, Kerrie L.; See, Deven; Šimková, Hana; Ma, Yaqin; Hayden, Matthew J.; Luo, Mingcheng; Faris, Justin D.; Doležel, Jaroslav; Gill, Bikram S.

2013-01-01

Cycles of whole-genome duplication (WGD) and diploidization are hallmarks of eukaryotic genome evolution and speciation. Polyploid wheat (Triticum aestivum) has had a massive increase in genome size largely due to recent WGDs. How these processes may impact the dynamics of gene evolution was studied by comparing the patterns of gene structure changes, alternative splicing (AS), and codon substitution rates among wheat and model grass genomes. In orthologous gene sets, significantly more acquired and lost exonic sequences were detected in wheat than in model grasses. In wheat, 35% of these gene structure rearrangements resulted in frame-shift mutations and premature termination codons. An increased codon mutation rate in the wheat lineage compared with Brachypodium distachyon was found for 17% of orthologs. The discovery of premature termination codons in 38% of expressed genes was consistent with ongoing pseudogenization of the wheat genome. The rates of AS within the individual wheat subgenomes (21%–25%) were similar to diploid plants. However, we uncovered a high level of AS pattern divergence between the duplicated homeologous copies of genes. Our results are consistent with the accelerated accumulation of AS isoforms, nonsynonymous mutations, and gene structure rearrangements in the wheat lineage, likely due to genetic redundancy created by WGDs. Whereas these processes mostly contribute to the degeneration of a duplicated genome and its diploidization, they have the potential to facilitate the origin of new functional variations, which, upon selection in the evolutionary lineage, may play an important role in the origin of novel traits. PMID:23124323
Cooperation and coexpression: How coexpression networks shift in response to multiple mutualists.

PubMed

Palakurty, Sathvik X; Stinchcombe, John R; Afkhami, Michelle E

2018-04-01

A mechanistic understanding of community ecology requires tackling the nonadditive effects of multispecies interactions, a challenge that necessitates integration of ecological and molecular complexity-namely moving beyond pairwise ecological interaction studies and the "gene at a time" approach to mechanism. Here, we investigate the consequences of multispecies mutualisms for the structure and function of genomewide differential coexpression networks for the first time, using the tractable and ecologically important interaction between legume Medicago truncatula, rhizobia and mycorrhizal fungi. First, we found that genes whose expression is affected nonadditively by multiple mutualists are more highly connected in gene networks than expected by chance and had 94% greater network centrality than genes showing additive effects, suggesting that nonadditive genes may be key players in the widespread transcriptomic responses to multispecies symbioses. Second, multispecies mutualisms substantially changed coexpression network structure of 18 modules of host plant genes and 22 modules of the fungal symbionts' genes, indicating that third-party mutualists can cause significant rewiring of plant and fungal molecular networks. Third, we found that 60% of the coexpressed gene sets that explained variation in plant performance had coexpression structures that were altered by interactive effects of rhizobia and fungi. Finally, an "across-symbiosis" approach identified sets of plant and mycorrhizal genes whose coexpression structure was unique to the multiple mutualist context and suggested coupled responses across the plant-mycorrhizal interaction to rhizobial mutualists. Taken together, these results show multispecies mutualisms have substantial effects on the molecular interactions in host plants, microbes and across symbiotic boundaries. © 2018 John Wiley & Sons Ltd.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

PubMed

Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2015-01-01

Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Evolution of GHF5 endoglucanase gene structure in plant-parasitic nematodes: no evidence for an early domain shuffling event.

PubMed

Kyndt, Tina; Haegeman, Annelies; Gheysen, Godelieve

2008-11-03

Endo-1,4-beta-glucanases or cellulases from the glycosyl hydrolase family 5 (GHF5) have been found in numerous bacteria and fungi, and recently also in higher eukaryotes, particularly in plant-parasitic nematodes (PPN). The origin of these genes has been attributed to horizontal gene transfer from bacteria, although there still is a lot of uncertainty about the origin and structure of the ancestral GHF5 PPN endoglucanase. It is not clear whether this ancestral endoglucanase consisted of the whole gene cassette, containing a catalytic domain and a carbohydrate-binding module (CBM, type 2 in PPN and bacteria) or only of the catalytic domain while the CBM2 was retrieved by domain shuffling later in evolution. Previous studies on the evolution of these genes have focused primarily on data of sedentary nematodes, while in this study, extra data from migratory nematodes were included. Two new endoglucanases from the migratory nematodes Pratylenchus coffeae and Ditylenchus africanus were included in this study. The latter one is the first gene isolated from a PPN of a different superfamily (Sphaerularioidea); all previously known nematode endoglucanases belong to the superfamily Tylenchoidea (order Rhabditida). Phylogenetic analyses were conducted with the PPN GHF5 endoglucanases and homologous endoglucanases from bacterial and other eukaryotic lineages such as beetles, fungi and plants. No statistical incongruence between the phylogenetic trees deduced from the catalytic domain and the CBM2 was found, which could suggest that both domains have evolved together. Furthermore, based on gene structure data, we inferred a model for the evolution of the GHF5 endoglucanase gene structure in plant-parasitic nematodes. Our data confirm a close relationship between Pratylenchus spp. and the root knot nematodes, while some Radopholus similis endoglucanases are more similar to cyst nematode genes. We conclude that the ancestral PPN GHF5 endoglucanase gene most probably consisted of the whole gene cassette, i.e. the GHF5 catalytic domain and the CBM2, rather than that it evolved by domain shuffling. Our evolutionary model for the gene structure in PPN GHF5 endoglucanases implies the occurrence of an early duplication event, and more recent gene duplications at genus or species level.
Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

PubMed

Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

2013-03-21

Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group, we provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.
Comparative analysis of CDPK family in maize, Arabidopsis, rice and sorghum revealed potential targets for drought tolerance improvement

NASA Astrophysics Data System (ADS)

Mittal, Shikha; Mallikarjuna, Mallana Gowdra; Rao, Atmakuri R.; Jain, Prashant A.; Dash, Prasanta K.; Thirunavukkarasu, Nepolean

2017-12-01

Calcium dependent protein kinases (CDPKs) play major role in regulation of plant growth and development in response to various stresses including drought. A set of 32 CDPK genes identified in maize were further used for searching of orthologs in the model plant Arabidopsis (72) and major food crops such as rice (78) and sorghum (91). We comprehensively investigated the phylogenetic relationship, annotations, gene duplications, gene structure, divergence time, 3-D protein structures and tissue-specific drought induced expression of CDPK genes in all four species. Variation in intron frequency among these species likely contributed to the functional diversity of CDPK genes to various stress responses. Protein kinase and protein kinase C phosphorylation site domains were the most conserved motifs identified in all species. Four groups were identified from the sequence-based phylogenetic analysis, in which maize CDPKs were clustered in group III. The time of divergence (Ka/Ks) analysis revealed that the CDPKs were evolved through stabilizing selection. Expression data showed that the CDPK genes were highly expressed in leaf of maize, rice, and sorghum whereas in Arabidopsis the maximum expression was observed in root. 3-D protein structure were predicted for the nine genes (Arabidopsis: 2, maize: 2, rice: 3 and sorghum: 2) showing differential expression in at least three species. The predicted 3-D structures were further evaluated and validated by Ramachandran plot, ANOLEA, ProSA and Verify-3D. The superimposed 3-D structure of drought-related orthologous proteins retained similar folding pattern owing to their conserved nature. Functional annotation revealed the involvement of CDPK genes in various pathways such as osmotic homeostasis, cell protection and root growth. The interactions of CDPK genes in various pathways play crucial role in imparting drought tolerance through different ABA and MAPK signalling cascades. Our studies suggest that these selected candidate genes could be targeted in development of drought tolerant cultivars in maize, rice and sorghum through appropriate breeding approaches. Our comparative experiments of CDPK genes could also be extended in the drought stress breeding programmes of the related species.
Atypical Genetic Locus Associated with Constitutive Production of Enterocin B by Enterococcus faecium BFE 900

PubMed Central

Franz, Charles M. A. P.; Worobo, Randy W.; Quadri, Luis E. N.; Schillinger, Ulrich; Holzapfel, Wilhelm H.; Vederas, John C.; Stiles, Michael E.

1999-01-01

A purified bacteriocin produced by Enterococcus faecium BFE 900 isolated from black olives was shown by Edman degradation and mass spectrometric analyses to be identical to enterocin B produced by E. faecium T136 from meat (P. Casaus, T. Nilsen, L. M. Cintas, I. F. Nes, P. E. Hernández, and H. Holo, Microbiology 143:2287–2294, 1997). The structural gene was located on a 2.2-kb HindIII fragment and a 12.0-kb EcoRI chromosomal fragment. The genetic characteristics and production of EntB by E. faecium BFE 900 differed from that described so far by the presence of a conserved sequence like a regulatory box upstream of the EntB gene, and its production was constitutive and not regulated. The 2.2-kb chromosomal fragment contained the hitherto undetected immunity gene for EntB in an atypical orientation that is the reverse of that of the structural gene. Typical transport and other genes associated with bacteriocin production were not detected on the 12.0-kb chromosomal fragment containing the EntB structural gene. This makes the EntB genetic system different from most other bacteriocin systems, where transport and possible regulatory genes are clustered. EntB was subcloned and expressed by the dedicated secretion machinery of Carnobacterium piscicola LV17A. The structural gene was amplified by PCR, fused to the divergicin A signal peptide, and expressed by the general secretory pathway in Enterococcus faecalis ATCC 19433. PMID:10224016
The Escherichia coli supX locus is topA, the structural gene for DNA topoisomerase I.

PubMed Central

Margolin, P; Zumstein, L; Sternglanz, R; Wang, J C

1985-01-01

Mutations in the supX locus, which result in the absence of DNA topoisomerase I enzyme activity in both Salmonella typhimurium and Escherichia coli, are all selected as suppressors of the leu-500 promoter mutation in S. typhimurium. To determine whether the supX locus is the structural gene topA for the DNA topoisomerase I enzyme or is a positive-acting regulator/activator gene for a nearby topA structural gene, nonsense mutations were selected in the E. coli supX gene carried on an F' episome in S. typhimurium cells. The cysB-topA region of the episomes with nonsense-mutant supX alleles were then cloned onto plasmid pBR322 and transformed into E. coli cells lacking a chromosomal supX gene. Three such E. coli strains, each carrying cloned DNA from episomes with different nonsense-mutant supX alleles, all lacked DNA topoisomerase I activity but expressed antigenic determinants specific to the enzyme; control cells lacked both enzyme activity and antigenic determinants. Maxicell studies of plasmid-coded proteins demonstrated the absence of the DNA topoisomerase I protein (100 kDa) in the three strains but the appearance of a new smaller peptide in each (36, 47, and 64 kDa). These new peptides must represent fragments of the enzyme resulting from translation termination at the supX nonsense codons and confirm the interpretation that the supX gene is topA, the structural gene for DNA topoisomerase I. Images PMID:2991925
Systematic analysis of viral genes responsible for differential virulence between American and Australian West Nile virus strains.

PubMed

Setoh, Yin Xiang; Prow, Natalie A; Rawle, Daniel J; Tan, Cindy Si En; Edmonds, Judith H; Hall, Roy A; Khromykh, Alexander A

2015-06-01

A variant Australian West Nile virus (WNV) strain, WNVNSW2011, emerged in 2011 causing an unprecedented outbreak of encephalitis in horses in south-eastern Australia. However, no human cases associated with this strain have yet been reported. Studies using mouse models for WNV pathogenesis showed that WNVNSW2011 was less virulent than the human-pathogenic American strain of WNV, New York 99 (WNVNY99). To identify viral genes and mutations responsible for the difference in virulence between WNVNSW2011 and WNVNY99 strains, we constructed chimeric viruses with substitution of large genomic regions coding for the structural genes, non-structural genes and untranslated regions, as well as seven individual non-structural gene chimeras, using a modified circular polymerase extension cloning method. Our results showed that the complete non-structural region of WNVNSW2011, when substituted with that of WNVNY99, significantly enhanced viral replication and the ability to suppress type I IFN response in cells, resulting in higher virulence in mice. Analysis of the individual non-structural gene chimeras showed a predominant contribution of WNVNY99 NS3 to increased virus replication and evasion of IFN response in cells, and to virulence in mice. Other WNVNY99 non-structural proteins (NS2A, NS4B and NS5) were shown to contribute to the modulation of IFN response. Thus a combination of non-structural proteins, likely NS2A, NS3, NS4B and NS5, is primarily responsible for the difference in virulence between WNVNSW2011 and WNVNY99 strains, and accumulative mutations within these proteins would likely be required for the Australian WNVNSW2011 strain to become significantly more virulent. © 2015 The Authors.
Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster

PubMed Central

Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan

2002-01-01

Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380
Evolution of the dispersed SUC gene family of Saccharomyces by rearrangements of chromosome telomeres.

PubMed Central

Carlson, M; Celenza, J L; Eng, F J

1985-01-01

The SUC gene family of Saccharomyces contains six structural genes for invertase (SUC1 through SUC5 and SUC7) which are located on different chromosomes. Most yeast strains do not carry all six SUC genes and instead carry natural negative (suc0) alleles at some or all SUC loci. We determined the physical structures of SUC and suc0 loci. Except for SUC2, which is an unusual member of the family, all of the SUC genes are located very close to telomeres and are flanked by homologous sequences. On the centromere-proximal side of the gene, the conserved region contains X sequences, which are sequences found adjacent to telomeres (C. S. M. Chan and B.-K. Tye, Cell 33:563-573, 1983). On the other side of the gene, the homology includes about 4 kilobases of flanking sequence and then extends into a Y' element, which is an element often found distal to the X sequence at telomeres (Chan and Tye, Cell 33:563-573, 1983). Thus, these SUC genes and flanking sequences are embedded in telomere-adjacent sequences. Chromosomes carrying suc0 alleles (except suc20) lack SUC structural genes and portions of the conserved flanking sequences. The results indicate that the dispersal of SUC genes to different chromosomes occurred by rearrangements of chromosome telomeres. Images PMID:3018485
Structural and functional analysis of mouse Msx1 gene promoter: sequence conservation with human MSX1 promoter points at potential regulatory elements.

PubMed

Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E

1998-06-01

Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.
Isolation and sequencing of the gene encoding Sp23, a structural protein of spermatophore of the mealworm beetle, Tenebrio molitor.

PubMed

Feng, X; Happ, G M

1996-11-14

The cDNA for Sp23, a structural protein of the spermatophore of Tenebrio molitor, had been previously cloned and characterized (Paesen, G.C., Schwartz, M.B., Peferoen, M., Weyda, F. and Happ, G.M. (1992a) Amino acid sequence of Sp23, a structure protein of the spermatophore of the mealworm beetle, Tenebrio molitor. J. Biol. Chem. 257, 18852-18857). Using the labeled cDNA for Sp23 as a probe to screen a library of genomic DNA from Tenebrio molitor, we isolated a genomic clone for Sp23. A 5373-base pair (bp) restriction fragment containing the Sp23 gene was sequenced. The coding region is separated by a 55-bp intron which is located close to the translation start site. Three putative ecdysone response elements (EcRE) are identified in the 5' flanking region of the Sp23 gene. Comparison of the flanking regions of the Sp23 gene with those of the D-protein gene expressed in the accessory glands of Tenebrio reveals similar sequences present in the flanking regions of the two genes. The genomic organization of the coding region of the Sp23 gene shares similarities with that of the D-protein gene, three Drosophila accessory gland genes and two Drosophila 20-OH ecdysone-responsive genes.

Structure of the human CD97 gene: Exon shuffling has generated a new type of seven-span transmembrane molecule related to the secretin receptor superfamily

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hamann, J.; Van Lier, R.A.W.; Hartmann, E.

1996-02-15

This article reports on the structure and genetic mapping of the human CD97 gene, a homologue to the secretin receptor superfamily of cell surface proteins. The detailed organization of the gene, which maps to the short arm of chromosome 19, is given. 18 refs., 1 fig., 1 tab.
Development of studies of TPO gene and its application in nuclear medicine.

PubMed

Xing, Y; Kuang, A

2003-08-01

Thyroperoxidase (TPO) is a glycosylated protein bound to the apical plasma membrane of thyrocytes. It is the key enzyme in the synthesis of thyroid hormones. Its gene structure and transcriptional regulation have been studied in detail. This article reviews the structure, function and transcriptional regulation of the TPO gene, and the relationship between TPO, thyroid diseases and radioactive iodide therapy.
Limited family structure and BRCA gene mutation status in single cases of breast cancer.

PubMed

Weitzel, Jeffrey N; Lagos, Veronica I; Cullinane, Carey A; Gambol, Patricia J; Culver, Julie O; Blazer, Kathleen R; Palomares, Melanie R; Lowstuter, Katrina J; MacDonald, Deborah J

2007-06-20

An autosomal dominant pattern of hereditary breast cancer may be masked by small family size or transmission through males given sex-limited expression. To determine if BRCA gene mutations are more prevalent among single cases of early onset breast cancer in families with limited vs adequate family structure than would be predicted by currently available probability models. A total of 1543 women seen at US high-risk clinics for genetic cancer risk assessment and BRCA gene testing were enrolled in a prospective registry study between April 1997 and February 2007. Three hundred six of these women had breast cancer before age 50 years and no first- or second-degree relatives with breast or ovarian cancers. The main outcome measure was whether family structure, assessed from multigenerational pedigrees, predicts BRCA gene mutation status. Limited family structure was defined as fewer than 2 first- or second-degree female relatives surviving beyond age 45 years in either lineage. Family structure effect and mutation probability by the Couch, Myriad, and BRCAPRO models were assessed with stepwise multiple logistic regression. Model sensitivity and specificity were determined and receiver operating characteristic curves were generated. Family structure was limited in 153 cases (50%). BRCA gene mutations were detected in 13.7% of participants with limited vs 5.2% with adequate family structure. Family structure was a significant predictor of mutation status (odds ratio, 2.8; 95% confidence interval, 1.19-6.73; P = .02). Although none of the models performed well, receiver operating characteristic analysis indicated that modification of BRCAPRO output by a corrective probability index accounting for family structure was the most accurate BRCA gene mutation status predictor (area under the curve, 0.72; 95% confidence interval, 0.63-0.81; P<.001) for single cases of breast cancer. Family structure can affect the accuracy of mutation probability models. Genetic testing guidelines may need to be more inclusive for single cases of breast cancer when the family structure is limited and probability models need to be recreated using limited family history as an actual variable.
Analysis of the murine Dtk gene identifies conservation of genomic structure within a new receptor tyrosine kinase subfamily

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lewis, P.M.; Crosier, K.E.; Crosier, P.S.

The receptor tyrosine kinase Dtk/Tyro 3/Sky/rse/brt/tif is a member of a new subfamily of receptors that also includes Axl/Ufo/Ark and Eyk/Mer. These receptors are characterized by the presence of two immunoglobulin-like loops and two fibronectin type III repeats in their extracellular domains. The structure of the murine Dtk gene has been determined. The gene consists of 21 exons that are distributed over 21 kb of genomic DNA. An isoform of Dtk is generated by differential splicing of exons from the 5{prime} region of the gene. The overall genomic structure of Dtk is virtually identical to that determined for the humanmore » UFO gene. This particular genomic organization is likely to have been duplicated and closely maintained throughout evolution. 38 refs., 3 figs., 1 tab.« less
Safety Evaluation of Enterocin Producer Enterococcus sp. Strains Isolated from Traditional Turkish Cheeses.

PubMed

Avcı, Mine; Özden Tuncer, Banu

2017-07-06

The purpose of this study was to determine the antimicrobial activity and occurrence of bacteriocin structural genes in Enterococcus spp. isolated from different cheeses and also investigate some of their virulence factors. Enterococcus strains were isolated from 33 different cheeses. Enterococcus faecium (6 strains) and Enterococcus faecalis (5 strains) enterocin-producing strains were identified by 16S rDNA analyses. Structural genes entA, entB, entP and entX were detected in some isolates. Multiple enterocin structural genes were found in 7 strains. None of the tested enterococci demonstrated anyβ-haemolytic activity and only one strain had gelatinase activity. Six strains showed multiple antibiotic resistance patterns and in addition, vanA and several virulence genes were detected in many strains. Only E. faecalis MBE1-9 showed tyrosine decarboxylase activity and tdc gene was detected only in this strain.
Improvisation in evolution of genes and genomes: whose structure is it anyway?

PubMed

Shakhnovich, Boris E; Shakhnovich, Eugene I

2008-06-01

Significant progress has been made in recent years in a variety of seemingly unrelated fields such as sequencing, protein structure prediction, and high-throughput transcriptomics and metabolomics. At the same time, new microscopic models have been developed that made it possible to analyze the evolution of genes and genomes from first principles. The results from these efforts enable, for the first time, a comprehensive insight into the evolution of complex systems and organisms on all scales--from sequences to organisms and populations. Every newly sequenced genome uncovers new genes, families, and folds. Where do these new genes come from? How do gene duplication and subsequent divergence of sequence and structure affect the fitness of the organism? What role does regulation play in the evolution of proteins and folds? Emerging synergism between data and modeling provides first robust answers to these questions.
Relationships among msx gene structure and function in zebrafish and other vertebrates.

PubMed

Ekker, M; Akimenko, M A; Allende, M L; Smith, R; Drouin, G; Langille, R M; Weinberg, E S; Westerfield, M

1997-10-01

The zebrafish genome contains at least five msx homeobox genes, msxA, msxB, msxC, msxD, and the newly isolated msxE. Although these genes share structural features common to all Msx genes, phylogenetic analyses of protein sequences indicate that the msx genes from zebrafish are not orthologous to the Msx1 and Msx2 genes of mammals, birds, and amphibians. The zebrafish msxB and msxC are more closely related to each other and to the mouse Msx3. Similarly, although the combinatorial expression of the zebrafish msx genes in the embryonic dorsal neuroectoderm, visceral arches, fins, and sensory organs suggests functional similarities with the Msx genes of other vertebrates, differences in the expression patterns preclude precise assignment of orthological relationships. Distinct duplication events may have given rise to the msx genes of modern fish and other vertebrate lineages whereas many aspects of msx gene functions during embryonic development have been preserved.
DNA breaks and chromatin structural changes enhance the transcription of autoimmune regulator target genes.

PubMed

Guha, Mithu; Saare, Mario; Maslovskaja, Julia; Kisand, Kai; Liiv, Ingrid; Haljasorg, Uku; Tasa, Tõnis; Metspalu, Andres; Milani, Lili; Peterson, Pärt

2017-04-21

The autoimmune regulator (AIRE) protein is the key factor in thymic negative selection of autoreactive T cells by promoting the ectopic expression of tissue-specific genes in the thymic medullary epithelium. Mutations in AIRE cause a monogenic autoimmune disease called autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy. AIRE has been shown to promote DNA breaks via its interaction with topoisomerase 2 (TOP2). In this study, we investigated topoisomerase-induced DNA breaks and chromatin structural alterations in conjunction with AIRE-dependent gene expression. Using RNA sequencing, we found that inhibition of TOP2 religation activity by etoposide in AIRE-expressing cells had a synergistic effect on genes with low expression levels. AIRE-mediated transcription was not only enhanced by TOP2 inhibition but also by the TOP1 inhibitor camptothecin. The transcriptional activation was associated with structural rearrangements in chromatin, notably the accumulation of γH2AX and the exchange of histone H1 with HMGB1 at AIRE target gene promoters. In addition, we found the transcriptional up-regulation to co-occur with the chromatin structural changes within the genomic cluster of carcinoembryonic antigen-like cellular adhesion molecule genes. Overall, our results suggest that the presence of AIRE can trigger molecular events leading to an altered chromatin landscape and the enhanced transcription of low-expressed genes. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
DNA breaks and chromatin structural changes enhance the transcription of autoimmune regulator target genes

PubMed Central

Guha, Mithu; Saare, Mario; Maslovskaja, Julia; Kisand, Kai; Liiv, Ingrid; Haljasorg, Uku; Tasa, Tõnis; Metspalu, Andres; Milani, Lili; Peterson, Pärt

2017-01-01

The autoimmune regulator (AIRE) protein is the key factor in thymic negative selection of autoreactive T cells by promoting the ectopic expression of tissue-specific genes in the thymic medullary epithelium. Mutations in AIRE cause a monogenic autoimmune disease called autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy. AIRE has been shown to promote DNA breaks via its interaction with topoisomerase 2 (TOP2). In this study, we investigated topoisomerase-induced DNA breaks and chromatin structural alterations in conjunction with AIRE-dependent gene expression. Using RNA sequencing, we found that inhibition of TOP2 religation activity by etoposide in AIRE-expressing cells had a synergistic effect on genes with low expression levels. AIRE-mediated transcription was not only enhanced by TOP2 inhibition but also by the TOP1 inhibitor camptothecin. The transcriptional activation was associated with structural rearrangements in chromatin, notably the accumulation of γH2AX and the exchange of histone H1 with HMGB1 at AIRE target gene promoters. In addition, we found the transcriptional up-regulation to co-occur with the chromatin structural changes within the genomic cluster of carcinoembryonic antigen-like cellular adhesion molecule genes. Overall, our results suggest that the presence of AIRE can trigger molecular events leading to an altered chromatin landscape and the enhanced transcription of low-expressed genes. PMID:28242760
Improved systematic tRNA gene annotation allows new insights into the evolution of mitochondrial tRNA structures and into the mechanisms of mitochondrial genome rearrangements

PubMed Central

Jühling, Frank; Pütz, Joern; Bernt, Matthias; Donath, Alexander; Middendorf, Martin; Florentz, Catherine; Stadler, Peter F.

2012-01-01

Transfer RNAs (tRNAs) are present in all types of cells as well as in organelles. tRNAs of animal mitochondria show a low level of primary sequence conservation and exhibit ‘bizarre’ secondary structures, lacking complete domains of the common cloverleaf. Such sequences are hard to detect and hence frequently missed in computational analyses and mitochondrial genome annotation. Here, we introduce an automatic annotation procedure for mitochondrial tRNA genes in Metazoa based on sequence and structural information in manually curated covariance models. The method, applied to re-annotate 1876 available metazoan mitochondrial RefSeq genomes, allows to distinguish between remaining functional genes and degrading ‘pseudogenes’, even at early stages of divergence. The subsequent analysis of a comprehensive set of mitochondrial tRNA genes gives new insights into the evolution of structures of mitochondrial tRNA sequences as well as into the mechanisms of genome rearrangements. We find frequent losses of tRNA genes concentrated in basal Metazoa, frequent independent losses of individual parts of tRNA genes, particularly in Arthropoda, and wide-spread conserved overlaps of tRNAs in opposite reading direction. Direct evidence for several recent Tandem Duplication-Random Loss events is gained, demonstrating that this mechanism has an impact on the appearance of new mitochondrial gene orders. PMID:22139921
Genetic variations in the serotonergic system contribute to amygdala volume in humans.

PubMed

Li, Jin; Chen, Chunhui; Wu, Karen; Zhang, Mingxia; Zhu, Bi; Chen, Chuansheng; Moyzis, Robert K; Dong, Qi

2015-01-01

The amygdala plays a critical role in emotion processing and psychiatric disorders associated with emotion dysfunction. Accumulating evidence suggests that amygdala structure is modulated by serotonin-related genes. However, there is a gap between the small contributions of single loci (less than 1%) and the reported 63-65% heritability of amygdala structure. To understand the "missing heritability," we systematically explored the contribution of serotonin genes on amygdala structure at the gene set level. The present study of 417 healthy Chinese volunteers examined 129 representative polymorphisms in genes from multiple biological mechanisms in the regulation of serotonin neurotransmission. A system-level approach using multiple regression analyses identified that nine SNPs collectively accounted for approximately 8% of the variance in amygdala volume. Permutation analyses showed that the probability of obtaining these findings by chance was low (p = 0.043, permuted for 1000 times). Findings showed that serotonin genes contribute moderately to individual differences in amygdala volume in a healthy Chinese sample. These results indicate that the system-level approach can help us to understand the genetic basis of a complex trait such as amygdala structure.
VizPrimer: a web server for visualized PCR primer design based on known gene structure.

PubMed

Zhou, Yang; Qu, Wubin; Lu, Yiming; Zhang, Yanchun; Wang, Xiaolei; Zhao, Dongsheng; Yang, Yi; Zhang, Chenggang

2011-12-15

The visualization of gene structure plays an important role in polymerase chain reaction (PCR) primer design, especially for eukaryotic genes with a number of splice variants that users need to distinguish between via PCR. Here, we describe a visualized web server for primer design named VizPrimer. It utilizes the new information technology (IT) tools, HTML5 to display gene structure and JavaScript to interact with the users. In VizPrimer, the users can focus their attention on the gene structure and primer design strategy, without wasting time calculating the exon positions of splice variants or manually configuring complicated parameters. In addition, VizPrimer is also suitable for the design of PCR primers for amplifying open reading frames and detecting single nucleotide polymorphisms (SNPs). VizPrimer is freely available at http://biocompute.bmi.ac.cn/CZlab/VizPrimer/. The web server supported browsers: Chrome (≥5.0), Firefox (≥3.0), Safari (≥4.0) and Opera (≥10.0). zhangcg@bmi.ac.cn; yangyi528@vip.sina.com.
RPG: the Ribosomal Protein Gene database.

PubMed

Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

2004-01-01

RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes.
RPG: the Ribosomal Protein Gene database

PubMed Central

Nakao, Akihiro; Yoshihama, Maki; Kenmochi, Naoya

2004-01-01

RPG (http://ribosome.miyazaki-med.ac.jp/) is a new database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli. Users can search the database by gene name and organism. Each record includes sequences (genomic, cDNA and amino acid sequences), intron/exon structures, genomic locations and information about orthologs. In addition, users can view and compare the gene structures of the above organisms and make multiple amino acid sequence alignments. RPG also provides information on small nucleolar RNAs (snoRNAs) that are encoded in the introns of RP genes. PMID:14681386
Genome-Wide Analysis of the NADK Gene Family in Plants

PubMed Central

Li, Wen-Yan; Wang, Xiang; Li, Ri; Li, Wen-Qiang; Chen, Kun-Ming

2014-01-01

Background NAD(H) kinase (NADK) is the key enzyme that catalyzes de novo synthesis of NADP(H) from NAD(H) for NADP(H)-based metabolic pathways. In plants, NADKs form functional subfamilies. Studies of these families in Arabidopsis thaliana indicate that they have undergone considerable evolutionary selection; however, the detailed evolutionary history and functions of the various NADKs in plants are not clearly understood. Principal Findings We performed a comparative genomic analysis that identified 74 NADK gene homologs from 24 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots and eudicots. Phylogenetic and structural analysis classified these NADK genes into four well-conserved subfamilies with considerable variety in the domain organization and gene structure among subfamily members. In addition to the typical NAD_kinase domain, additional domains, such as adenylate kinase, dual-specificity phosphatase, and protein tyrosine phosphatase catalytic domains, were found in subfamily II. Interestingly, NADKs in subfamily III exhibited low sequence similarity (∼30%) in the kinase domain within the subfamily and with the other subfamilies. These observations suggest that gene fusion and exon shuffling may have occurred after gene duplication, leading to specific domain organization seen in subfamilies II and III, respectively. Further analysis of the exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures, during the process of structural evolution of NADK family genes. Finally, both available global microarray data analysis and qRT-RCR experiments revealed that the NADK genes in Arabidopsis and Oryza sativa show different expression patterns in different developmental stages and under several different abiotic/biotic stresses and hormone treatments, underscoring the functional diversity and functional divergence of the NADK family in plants. Conclusions These findings will facilitate further studies of the NADK family and provide valuable information for functional validation of this family in plants. PMID:24968225
Scale-dependent genetic structure of the Idaho giant salamander (Dicamptodon aterrimus) in stream networks

Treesearch

Lindy B. Mullen; H. Arthur Woods; Michael K. Schwartz; Adam J. Sepulveda; Winsor H. Lowe

2010-01-01

The network architecture of streams and rivers constrains evolutionary, demographic and ecological processes of freshwater organisms. This consistent architecture also makes stream networks useful for testing general models of population genetic structure and the scaling of gene flow. We examined genetic structure and gene flow in the facultatively paedomorphic Idaho...
The structure and evolution of angiosperm nuclear genomes.

PubMed

Bennetzen, J L

1998-04-01

Despite several decades of investigation, the organization of angiosperm genomes remained largely unknown until very recently. Data describing the sequence composition of large segments of genomes, covering hundreds of kilobases of contiguous sequence, have only become available in the past two years. Recent results indicate commonalities in the characteristics of many plant genomes, including in the structure of chromosomal components like telomeres and centromeres, and in the order and content of genes. Major differences between angiosperms have been associated mainly with repetitive DNAs, both gene families and mobile elements. Intriguing new studies have begun to characterize the dynamic three-dimensional structures of chromosomes and chromatin, and the relationship between genome structure and co-ordinated gene function.
Phylogenetic analysis and protein structure modelling identifies distinct Ca(2+)/Cation antiporters and conservation of gene family structure within Arabidopsis and rice species.

PubMed

Pittman, Jon K; Hirschi, Kendal D

2016-12-01

The Ca(2+)/Cation Antiporter (CaCA) superfamily is an ancient and widespread family of ion-coupled cation transporters found in nearly all kingdoms of life. In animals, K(+)-dependent and K(+)-indendent Na(+)/Ca(2+) exchangers (NCKX and NCX) are important CaCA members. Recently it was proposed that all rice and Arabidopsis CaCA proteins should be classified as NCX proteins. Here we performed phylogenetic analysis of CaCA genes and protein structure homology modelling to further characterise members of this transporter superfamily. Phylogenetic analysis of rice and Arabidopsis CaCAs in comparison with selected CaCA members from non-plant species demonstrated that these genes form clearly distinct families, with the H(+)/Cation exchanger (CAX) and cation/Ca(2+) exchanger (CCX) families dominant in higher plants but the NCKX and NCX families absent. NCX-related Mg(2+)/H(+) exchanger (MHX) and CAX-related Na(+)/Ca(2+) exchanger-like (NCL) proteins are instead present. Analysis of genomes of ten closely-related rice species and four Arabidopsis-related species found that CaCA gene family structures are highly conserved within related plants, apart from minor variation. Protein structures were modelled for OsCAX1a and OsMHX1. Despite exhibiting broad structural conservation, there are clear structural differences observed between the different CaCA types. Members of the CaCA superfamily form clearly distinct families with different phylogenetic, structural and functional characteristics, and therefore should not be simply classified as NCX proteins, which should remain as a separate gene family.
Conservation of Pax gene expression in ectodermal placodes of the lamprey

NASA Technical Reports Server (NTRS)

McCauley, David W.; Bronner-Fraser, Marianne

2002-01-01

Ectodermal placodes contribute to the cranial ganglia and sense organs of the head and, together with neural crest cells, represent defining features of the vertebrate embryo. The identity of different placodes appears to be specified in part by the expression of different Pax genes, with Pax-3/7 class genes being expressed in the trigeminal placode of mice, chick, frogs and fish, and Pax-2/5/8 class genes expressed in the otic placode. Here, we present the cloning and expression pattern of lamprey Pax-7 and Pax-2, which mark the trigeminal and otic placodes, respectively, as well as other structures characteristic of vertebrate Pax genes. These results suggest conservation of Pax genes and placodal structures in basal and derived vertebrates.
An integrated map of structural variation in 2,504 human genomes.

PubMed

Sudmant, Peter H; Rausch, Tobias; Gardner, Eugene J; Handsaker, Robert E; Abyzov, Alexej; Huddleston, John; Zhang, Yan; Ye, Kai; Jun, Goo; Fritz, Markus Hsi-Yang; Konkel, Miriam K; Malhotra, Ankit; Stütz, Adrian M; Shi, Xinghua; Casale, Francesco Paolo; Chen, Jieming; Hormozdiari, Fereydoun; Dayama, Gargi; Chen, Ken; Malig, Maika; Chaisson, Mark J P; Walter, Klaudia; Meiers, Sascha; Kashin, Seva; Garrison, Erik; Auton, Adam; Lam, Hugo Y K; Mu, Xinmeng Jasmine; Alkan, Can; Antaki, Danny; Bae, Taejeong; Cerveira, Eliza; Chines, Peter; Chong, Zechen; Clarke, Laura; Dal, Elif; Ding, Li; Emery, Sarah; Fan, Xian; Gujral, Madhusudan; Kahveci, Fatma; Kidd, Jeffrey M; Kong, Yu; Lameijer, Eric-Wubbo; McCarthy, Shane; Flicek, Paul; Gibbs, Richard A; Marth, Gabor; Mason, Christopher E; Menelaou, Androniki; Muzny, Donna M; Nelson, Bradley J; Noor, Amina; Parrish, Nicholas F; Pendleton, Matthew; Quitadamo, Andrew; Raeder, Benjamin; Schadt, Eric E; Romanovitch, Mallory; Schlattl, Andreas; Sebra, Robert; Shabalin, Andrey A; Untergasser, Andreas; Walker, Jerilyn A; Wang, Min; Yu, Fuli; Zhang, Chengsheng; Zhang, Jing; Zheng-Bradley, Xiangqun; Zhou, Wanding; Zichner, Thomas; Sebat, Jonathan; Batzer, Mark A; McCarroll, Steven A; Mills, Ryan E; Gerstein, Mark B; Bashir, Ali; Stegle, Oliver; Devine, Scott E; Lee, Charles; Eichler, Evan E; Korbel, Jan O

2015-10-01

Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.

The WRKY Transcription Factor Genes in Lotus japonicus.

PubMed

Song, Hui; Wang, Pengfei; Nan, Zhibiao; Wang, Xingjun

2014-01-01

WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (LjWRKY) genes can be classified into three groups (I-III). Investigations of gene copy number and gene clusters indicate that only one gene duplication event occurred on chromosome 4 and no clustered genes were detected on chromosomes 3 or 6. Researchers previously believed that group II and III WRKY domains were derived from the C-terminal WRKY domain of group I. Our results suggest that some WRKY genes in group II originated from the N-terminal domain of group I WRKY genes. Additional evidence to support this hypothesis was obtained by Medicago truncatula WRKY (MtWRKY) protein motif analysis. We found that LjWRKY and MtWRKY group III genes are under purifying selection, suggesting that WRKY genes will become increasingly structured and functionally conserved.
Supervised group Lasso with applications to microarray data analysis

PubMed Central

Ma, Shuangge; Song, Xiao; Huang, Jian

2007-01-01

Background A tremendous amount of efforts have been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods. PMID:17316436
Organization of the SUC gene family in Saccharomyces.

PubMed Central

Carlson, M; Botstein, D

1983-01-01

The SUC gene family of yeast (Saccharomyces) includes six structural genes for invertase (SUC1 through SUC5 and SUC7) found at unlinked chromosomal loci. A given yeast strain does not usually carry SUC+ alleles at all six loci; the natural negative alleles are called suc0 alleles. Cloned SUC2 DNA probes were used to investigate the physical structure of the SUC gene family in laboratory strains, commercial wine strains, and different Saccharomyces species. The active SUC+ genes are homologous. The suc0 allele at the SUC2 locus (suc2(0) in some strains is a silent gene or pseudogene. Other SUC loci carrying suc0 alleles appear to lack SUC DNA sequences. These findings imply that SUC genes have transposed to different chromosomal locations in closely related Saccharomyces strains. Images PMID:6843548
Fine-structure mapping of the firA gene, a locus involved in the phenotypic expression of rifampin resistance in Escherichia.

PubMed

Lathe, R

1977-09-01

The firA (Ts)200 mutation not only eliminates the resistance to rifampin of certain genetically resistant strains, but, moreover, renders ribonucleic acid synthesis thermolabile. The firA gene has been mapped by P1 tranduction and is located extremely close to the structural gene for deoxyribonucleic acid polymerase III at 4 min on the Escherichia coli linkage map.
Computational genetic neuroanatomy of the developing mouse brain: dimensionality reduction, visualization, and clustering

PubMed Central

2013-01-01

Background The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development. Results In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in situ hybridization gene expression data to a two-dimensional space by preserving both the global and the local structures. Our results show that the developing brain anatomy is largely preserved in the reduced gene expression space. To provide a quantitative analysis, we cluster the reduced data into groups and measure the consistency with neuroanatomy at multiple levels. Our results show that the clusters in the low-dimensional space are more consistent with neuroanatomy than those in the original space. Conclusions Gene expression patterns and developing brain anatomy are closely related. Dimensionality reduction and visual exploration facilitate the study of this relationship. PMID:23845024
Construction of ontology augmented networks for protein complex prediction.

PubMed

Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian

2013-01-01

Protein complexes are of great importance in understanding the principles of cellular organization and function. The increase in available protein-protein interaction data, gene ontology and other resources make it possible to develop computational methods for protein complex prediction. Most existing methods focus mainly on the topological structure of protein-protein interaction networks, and largely ignore the gene ontology annotation information. In this article, we constructed ontology augmented networks with protein-protein interaction data and gene ontology, which effectively unified the topological structure of protein-protein interaction networks and the similarity of gene ontology annotations into unified distance measures. After constructing ontology augmented networks, a novel method (clustering based on ontology augmented networks) was proposed to predict protein complexes, which was capable of taking into account the topological structure of the protein-protein interaction network, as well as the similarity of gene ontology annotations. Our method was applied to two different yeast protein-protein interaction datasets and predicted many well-known complexes. The experimental results showed that (i) ontology augmented networks and the unified distance measure can effectively combine the structure closeness and gene ontology annotation similarity; (ii) our method is valuable in predicting protein complexes and has higher F1 and accuracy compared to other competing methods.
Linking Microbial Community and Catabolic Gene Structures during the Adaptation of Three Contaminated Soils under Continuous Long-Term Pollutant Stress

PubMed Central

Lima-Morales, Daiana; Jáuregui, Ruy; Camarinha-Silva, Amelia; Geffers, Robert; Vilchez-Vargas, Ramiro

2016-01-01

Three types of contaminated soil from three geographically different areas were subjected to a constant supply of benzene or benzene/toluene/ethylbenzene/xylenes (BTEX) for a period of 3 months. Different from the soil from Brazil (BRA) and Switzerland (SUI), the Czech Republic (CZE) soil which was previously subjected to intensive in situ bioremediation displayed only negligible changes in community structure. BRA and SUI soil samples showed a clear succession of phylotypes. A rapid response to benzene stress was observed, whereas the response to BTEX pollution was significantly slower. After extended incubation, actinobacterial phylotypes increased in relative abundance, indicating their superior fitness to pollution stress. Commonalities but also differences in the phylotypes were observed. Catabolic gene surveys confirmed the enrichment of actinobacteria by identifying the increase of actinobacterial genes involved in the degradation of pollutants. Proteobacterial phylotypes increased in relative abundance in SUI microcosms after short-term stress with benzene, and catabolic gene surveys indicated enriched metabolic routes. Interestingly, CZE soil, despite staying constant in community structure, showed a change in the catabolic gene structure. This indicates that a highly adapted community, which had to adjust its gene pool to meet novel challenges, has been enriched. PMID:26850298
Modification of the Campylobacter jejuni flagellin glycan by the product of the Cj1295 homopolymeric-tract-containing gene

PubMed Central

Hitchen, Paul; Brzostek, Joanna; Panico, Maria; Butler, Jonathan A.; Morris, Howard R.; Dell, Anne; Linton, Dennis

2010-01-01

The Campylobacter jejuni flagellin protein is O-glycosylated with structural analogues of the nine-carbon sugar pseudaminic acid. The most common modifications in the C. jejuni 81-176 strain are the 5,7-di-N-acetylated derivative (Pse5Ac7Ac) and an acetamidino-substituted version (Pse5Am7Ac). Other structures detected include O-acetylated and N-acetylglutamine-substituted derivatives (Pse5Am7Ac8OAc and Pse5Am7Ac8GlnNAc, respectively). Recently, a derivative of pseudaminic acid modified with a di-O-methylglyceroyl group was detected in C. jejuni NCTC 11168 strain. The gene products required for Pse5Ac7Ac biosynthesis have been characterized, but those genes involved in generating other structures have not. We have demonstrated that the mobility of the NCTC 11168 flagellin protein in SDS-PAGE gels can vary spontaneously and we investigated the role of single nucleotide repeats or homopolymeric-tract-containing genes from the flagellin glycosylation locus in this process. One such gene, Cj1295, was shown to be responsible for structural changes in the flagellin glycoprotein. Mass spectrometry demonstrated that the Cj1295 gene is required for glycosylation with the di-O-methylglyceroyl-modified version of pseudaminic acid. PMID:20338909
Low Genetic Diversity and Low Gene Flow Corresponded to a Weak Genetic Structure of Ruddy-Breasted Crake (Porzana fusca) in China.

PubMed

Zhu, Chaoying; Chen, Peng; Han, Yuqing; Ruan, Luzhang

2018-05-12

The Ruddy-breasted Crake (Porzana fusca) is an extremely poorly known species. Although it is not listed as globally endangered, in recent years, with the interference of climate change and human activities, its habitat is rapidly disappearing and its populations have been shrinking. There are two different life history traits for Ruddy-breasted Crake in China, i.e., non-migratory population in the south and migratory population in the north of China. In this study, mitochondrial control sequences and microsatellite datasets of 88 individuals sampled from 8 sites were applied to analyze their genetic diversity, genetic differentiation, and genetic structure. Our results indicated that low genetic diversity and genetic differentiation exit in most populations. The neutrality test suggested significantly negative Fu's Fs value, which, in combination with detection of the mismatch distribution, indicated that population expansion occurred in the interglacier approximately 98,000 years ago, and the time of the most recent common ancestor (TMRCA) was estimated to about 202,705 years ago. Gene flow analysis implied that the gene flow was low, but gene exchange was frequent among adjacent populations. Both phylogenetic and STRUCTURE analyses implied weak genetic structure. In general, the genetic diversity, gene flow, and genetic structure of Ruddy-breasted Crake were low.
Discovering perturbation of modular structure in HIV progression by integrating multiple data sources through non-negative matrix factorization.

PubMed

Ray, Sumanta; Maulik, Ujjwal

2016-12-20

Detecting perturbation in modular structure during HIV-1 disease progression is an important step to understand stage specific infection pattern of HIV-1 virus in human cell. In this article, we proposed a novel methodology on integration of multiple biological information to identify such disruption in human gene module during different stages of HIV-1 infection. We integrate three different biological information: gene expression information, protein-protein interaction information and gene ontology information in single gene meta-module, through non negative matrix factorization (NMF). As the identified metamodules inherit those information so, detecting perturbation of these, reflects the changes in expression pattern, in PPI structure and in functional similarity of genes during the infection progression. To integrate modules of different data sources into strong meta-modules, NMF based clustering is utilized here. Perturbation in meta-modular structure is identified by investigating the topological and intramodular properties and putting rank to those meta-modules using a rank aggregation algorithm. We have also analyzed the preservation structure of significant GO terms in which the human proteins of the meta-modules participate. Moreover, we have performed an analysis to show the change of coregulation pattern of identified transcription factors (TFs) over the HIV progression stages.
Novel mechanism of conjoined gene formation in the human genome.

PubMed

Kim, Ryong Nam; Kim, Aeri; Choi, Sang-Haeng; Kim, Dae-Soo; Nam, Seong-Hyeuk; Kim, Dae-Won; Kim, Dong-Wook; Kang, Aram; Kim, Min-Young; Park, Kun-Hyang; Yoon, Byoung-Ha; Lee, Kang Seon; Park, Hong-Seog

2012-03-01

Recently, conjoined genes (CGs) have emerged as important genetic factors necessary for understanding the human genome. However, their formation mechanism and precise structures have remained mysterious. Based on a detailed structural analysis of 57 human CG transcript variants (CGTVs, discovered in this study) and all (833) known CGs in the human genome, we discovered that the poly(A) signal site from the upstream parent gene region is completely removed via the skipping or truncation of the final exon; consequently, CG transcription is terminated at the poly(A) signal site of the downstream parent gene. This result led us to propose a novel mechanism of CG formation: the complete removal of the poly(A) signal site from the upstream parent gene is a prerequisite for the CG transcriptional machinery to continue transcribing uninterrupted into the intergenic region and downstream parent gene. The removal of the poly(A) signal sequence from the upstream gene region appears to be caused by a deletion or truncation mutation in the human genome rather than post-transcriptional trans-splicing events. With respect to the characteristics of CG sequence structures, we found that intergenic regions are hot spots for novel exon creation during CGTV formation and that exons farther from the intergenic regions are more highly conserved in the CGTVs. Interestingly, many novel exons newly created within the intergenic and intragenic regions originated from transposable element sequences. Additionally, the CGTVs showed tumor tissue-biased expression. In conclusion, our study provides novel insights into the CG formation mechanism and expands the present concepts of the genetic structural landscape, gene regulation, and gene formation mechanisms in the human genome.
A gene expression resource generated by genome-wide lacZ profiling in the mouse

PubMed Central

Tuck, Elizabeth; Estabel, Jeanne; Oellrich, Anika; Maguire, Anna Karin; Adissu, Hibret A.; Souter, Luke; Siragher, Emma; Lillistone, Charlotte; Green, Angela L.; Wardle-Jones, Hannah; Carragher, Damian M.; Karp, Natasha A.; Smedley, Damian; Adams, Niels C.; Bussell, James N.; Adams, David J.; Ramírez-Solis, Ramiro; Steel, Karen P.; Galli, Antonella; White, Jacqueline K.

2015-01-01

ABSTRACT Knowledge of the expression profile of a gene is a critical piece of information required to build an understanding of the normal and essential functions of that gene and any role it may play in the development or progression of disease. High-throughput, large-scale efforts are on-going internationally to characterise reporter-tagged knockout mouse lines. As part of that effort, we report an open access adult mouse expression resource, in which the expression profile of 424 genes has been assessed in up to 47 different organs, tissues and sub-structures using a lacZ reporter gene. Many specific and informative expression patterns were noted. Expression was most commonly observed in the testis and brain and was most restricted in white adipose tissue and mammary gland. Over half of the assessed genes presented with an absent or localised expression pattern (categorised as 0-10 positive structures). A link between complexity of expression profile and viability of homozygous null animals was observed; inactivation of genes expressed in ≥21 structures was more likely to result in reduced viability by postnatal day 14 compared with more restricted expression profiles. For validation purposes, this mouse expression resource was compared with Bgee, a federated composite of RNA-based expression data sets. Strong agreement was observed, indicating a high degree of specificity in our data. Furthermore, there were 1207 observations of expression of a particular gene in an anatomical structure where Bgee had no data, indicating a large amount of novelty in our data set. Examples of expression data corroborating and extending genotype-phenotype associations and supporting disease gene candidacy are presented to demonstrate the potential of this powerful resource. PMID:26398943
Structure, Expression, Chromosomal Location and Product of the Gene Encoding Adh2 in Petunia

PubMed Central

Gregerson, R. G.; Cameron, L.; McLean, M.; Dennis, P.; Strommer, J.

1993-01-01

In most higher plants the genes encoding alcohol dehydrogenase comprise a small gene family, usually with two members. The Adh1 gene of Petunia has been cloned and analyzed, but a second identifiable gene was not recovered from any of three genomic libraries. We have therefore employed the polymerase chain reaction to obtain the major portion of a second Adh gene. From sequence, mapping and northern data we conclude this gene encodes ADH2, the major anaerobically inducible Adh gene of Petunia. The availability of both Adh1 and Adh2 from Petunia has permitted us to compare their structures and patterns of expression to those of the well-studied Adh genes of maize, of which one is highly expressed developmentally, while both are induced in response to hypoxia. Despite their evolutionary distance, evidenced by deduced amino acid sequence as well as taxonomic classification, the pairs of genes are regulated in strikingly similar ways in maize and Petunia. Our findings suggest a significant biological basis for the regulatory strategy employed by these distant species for differential expression of multiple Adh genes. PMID:8096485
Long-Range Chromosome Interactions Mediated by Cohesin Shape Circadian Gene Expression

PubMed Central

Xu, Yichi; Guo, Weimin; Li, Ping; Zhang, Yan; Zhao, Meng; Fan, Zenghua; Zhao, Zhihu; Yan, Jun

2016-01-01

Mammalian circadian rhythm is established by the negative feedback loops consisting of a set of clock genes, which lead to the circadian expression of thousands of downstream genes in vivo. As genome-wide transcription is organized under the high-order chromosome structure, it is largely uncharted how circadian gene expression is influenced by chromosome architecture. We focus on the function of chromatin structure proteins cohesin as well as CTCF (CCCTC-binding factor) in circadian rhythm. Using circular chromosome conformation capture sequencing, we systematically examined the interacting loci of a Bmal1-bound super-enhancer upstream of a clock gene Nr1d1 in mouse liver. These interactions are largely stable in the circadian cycle and cohesin binding sites are enriched in the interactome. Global analysis showed that cohesin-CTCF co-binding sites tend to insulate the phases of circadian oscillating genes while cohesin-non-CTCF sites are associated with high circadian rhythmicity of transcription. A model integrating the effects of cohesin and CTCF markedly improved the mechanistic understanding of circadian gene expression. Further experiments in cohesin knockout cells demonstrated that cohesin is required at least in part for driving the circadian gene expression by facilitating the enhancer-promoter looping. This study provided a novel insight into the relationship between circadian transcriptome and the high-order chromosome structure. PMID:27135601
A framework for scalable parameter estimation of gene circuit models using structural information.

PubMed

Kuwahara, Hiroyuki; Fan, Ming; Wang, Suojin; Gao, Xin

2013-07-01

Systematic and scalable parameter estimation is a key to construct complex gene regulatory models and to ultimately facilitate an integrative systems biology approach to quantitatively understand the molecular mechanisms underpinning gene regulation. Here, we report a novel framework for efficient and scalable parameter estimation that focuses specifically on modeling of gene circuits. Exploiting the structure commonly found in gene circuit models, this framework decomposes a system of coupled rate equations into individual ones and efficiently integrates them separately to reconstruct the mean time evolution of the gene products. The accuracy of the parameter estimates is refined by iteratively increasing the accuracy of numerical integration using the model structure. As a case study, we applied our framework to four gene circuit models with complex dynamics based on three synthetic datasets and one time series microarray data set. We compared our framework to three state-of-the-art parameter estimation methods and found that our approach consistently generated higher quality parameter solutions efficiently. Although many general-purpose parameter estimation methods have been applied for modeling of gene circuits, our results suggest that the use of more tailored approaches to use domain-specific information may be a key to reverse engineering of complex biological systems. http://sfb.kaust.edu.sa/Pages/Software.aspx. Supplementary data are available at Bioinformatics online.
Menzerath-Altmann law in mammalian exons reflects the dynamics of gene structure evolution.

PubMed

Nikolaou, Christoforos

2014-12-01

Genomic sequences exhibit self-organization properties at various hierarchical levels. One such is the gene structure of higher eukaryotes with its complex exon/intron arrangement. Exon sizes and exon numbers in genes have been shown to conform to a law derived from statistical linguistics and formulated by Menzerath and Altmann, according to which the mean size of the constituents of an entity is inversely related to the number of these constituents. We herein perform a detailed analysis of this property in the complete exon set of the mouse genome in correlation to the sequence conservation of each exon and the transcriptional complexity of each gene locus. We show that extensive linear fits, representative of accordance to Menzerath-Altmann law are restricted to a particular subset of genes that are formed by exons under low or intermediate sequence constraints and have a small number of alternative transcripts. Based on this observation we propose a hypothesis for the law of Menzerath-Altmann in mammalian genes being predominantly due to genes that are more versatile in function and thus, more prone to undergo changes in their structure. To this end we demonstrate one test case where gene categories of different functionality also show differences in the extent of conformity to Menzerath-Altmann law. Copyright © 2014 Elsevier Ltd. All rights reserved.
Bioinformatics analysis of the oxidosqualene cyclase gene and the amino acid sequence in mangrove plants

NASA Astrophysics Data System (ADS)

Basyuni, M.; Wati, R.

2017-01-01

This study described the bioinformatics methods to analyze seven oxidosqualene cyclase (OSC) genes from mangrove plants on DDBJ/EMBL/GenBank as well as predicted the structure, composition, similarity, subcellular localization and phylogenetic. The physical and chemical properties of seven mangrove OSC showed variation among the genes. The percentage of the secondary structure of seven mangrove OSC genes followed the order of a helix > random coil > extended chain structure. The values of chloroplast or signal peptide were too low, indicated that no chloroplast transit peptide or signal peptide of secretion pathway in mangrove OSC genes. The target peptide value of mitochondria varied from 0.163 to 0.430, indicated it was possible to exist. These results suggested the importance of understanding the diversity and functional of properties of the different amino acids in mangrove OSC genes. To clarify the relationship among the mangrove OSC gene, a phylogenetic tree was constructed. The phylogenetic tree shows that there are three clusters, Kandelia KcMS join with Bruguiera BgLUS, Rhizophora RsM1 was close to Bruguiera BgbAS, and Rhizophora RcCAS join with Kandelia KcCAS. The present study, therefore, supported the previous results that plant OSC genes form distinct clusters in the tree.
Genetic Organization of the Chromosome Region Surrounding mecA in Clinical Staphylococcal Strains: Role of IS431-Mediated mecI Deletion in Expression of Resistance in mecA-Carrying, Low-Level Methicillin- Resistant Staphylococcus haemolyticus

PubMed Central

Katayama, Yuki; Ito, Teruyo; Hiramatsu, Keiichi

2001-01-01

We report on the structural diversity of mecA gene complexes carried by 38 methicillin-resistant Staphylococcus aureus and 91 methicillin-resistant coagulase-negative Staphylococcus strains of seven different species with a special reference to its correlation with phenotypic expression of methicillin resistance. The most prevalent and widely disseminated mec complex had the structure mecI-mecR1-mecA-IS431R (or IS431mec), designated the class A mecA gene complex. In contrast, in S. haemolyticus, mecA was bracketed by two copies of IS431, forming the structure IS431L-mecA-IS431R. Of the 38 S. haemolyticus strains, 5 had low-level methicillin resistance (MIC, 1 to 4 mg/liter) and characteristic heterogeneous methicillin resistance as judged by population analysis. In these five strains, IS431L was located to the left of an intact mecI gene, forming the structure IS431L-class A mecA-gene complex. In other S. haemolyticus strains, IS431L was associated with the deletion of mecI and mecR1, forming the structure IS431L-ΔmecR1-mecA-IS431mec, designated the class C mecA gene complex. Mutants with the class C mecA gene complex were obtained in vitro by selecting strain SH621, containing the IS431L-class A mecA gene complex with low concentrations of methicillin (1 and 3 mg/liter). The mutants had intermediate level of methicillin resistance (MIC, 16 to 64 mg/liter). The mecA gene transcription was shown to be derepressed in a representative mutant strain, SH621-37. Our study indicated that the mecI-encoded repressor function is responsible for the low-level methicillin resistance of some S. haemolyticus clinical strains and that the IS431-mediated mecI gene deletion causes the expression of methicillin resistance through the derepression of mecA gene transcription. PMID:11408208
Transcriptome-Wide Analysis of UTRs in Non-Small Cell Lung Cancer Reveals Cancer-Related Genes with SNV-Induced Changes on RNA Secondary Structure and miRNA Target Sites

PubMed Central

Novotny, Peter; Tang, Xiaojia; Kalari, Krishna R.; Gorodkin, Jan

2014-01-01

Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways. PMID:24416147
Transcriptome-wide analysis of UTRs in non-small cell lung cancer reveals cancer-related genes with SNV-induced changes on RNA secondary structure and miRNA target sites.

PubMed

Sabarinathan, Radhakrishnan; Wenzel, Anne; Novotny, Peter; Tang, Xiaojia; Kalari, Krishna R; Gorodkin, Jan

2014-01-01

Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways.

Harnessing the complexity of gene expression data from cancer: from single gene to structural pathway methods

PubMed Central

2012-01-01

High-dimensional gene expression data provide a rich source of information because they capture the expression level of genes in dynamic states that reflect the biological functioning of a cell. For this reason, such data are suitable to reveal systems related properties inside a cell, e.g., in order to elucidate molecular mechanisms of complex diseases like breast or prostate cancer. However, this is not only strongly dependent on the sample size and the correlation structure of a data set, but also on the statistical hypotheses tested. Many different approaches have been developed over the years to analyze gene expression data to (I) identify changes in single genes, (II) identify changes in gene sets or pathways, and (III) identify changes in the correlation structure in pathways. In this paper, we review statistical methods for all three types of approaches, including subtypes, in the context of cancer data and provide links to software implementations and tools and address also the general problem of multiple hypotheses testing. Further, we provide recommendations for the selection of such analysis methods. Reviewers This article was reviewed by Arcady Mushegian, Byung-Soo Kim and Joel Bader. PMID:23227854
Reverse engineering the gap gene network of Drosophila melanogaster.

PubMed

Perkins, Theodore J; Jaeger, Johannes; Reinitz, John; Glass, Leon

2006-05-01

A fundamental problem in functional genomics is to determine the structure and dynamics of genetic networks based on expression data. We describe a new strategy for solving this problem and apply it to recently published data on early Drosophila melanogaster development. Our method is orders of magnitude faster than current fitting methods and allows us to fit different types of rules for expressing regulatory relationships. Specifically, we use our approach to fit models using a smooth nonlinear formalism for modeling gene regulation (gene circuits) as well as models using logical rules based on activation and repression thresholds for transcription factors. Our technique also allows us to infer regulatory relationships de novo or to test network structures suggested by the literature. We fit a series of models to test several outstanding questions about gap gene regulation, including regulation of and by hunchback and the role of autoactivation. Based on our modeling results and validation against the experimental literature, we propose a revised network structure for the gap gene system. Interestingly, some relationships in standard textbook models of gap gene regulation appear to be unnecessary for or even inconsistent with the details of gap gene expression during wild-type development.
Genomic structure of two ras family genes in the slime mold Physarum polycephalum.

PubMed

Trzcińska-Danielewicz, Joanna; Kozlowski, Piotr; Gierdal, Katarzyna; Wiejak, Jolanta; Jagielski, Adam; Toczko, Kazimierz; Fronk, Jan

2002-08-01

Genomic structure of two Physarum polycephalum ras family genes, Ppras2 and Pprap1, has been determined, including the upstream region of the latter. The genes are interrupted by three and four introns, respectively. The first intron of Ppras2 has the same location within the coding sequence as the first intron in another ras homolog from this organism, Ppras1 [Trzcińska-Danielewicz, J., Kozlowski, P., and Toczko, K. (1996). "Cloning and genomic sequence of the Physarum polycephalum Ppras1 gene, a homologue of the ras protooncogene", Gene 169, pp. 143-144]. All introns, ranging from 53 to ca. 460 base pairs, have the canonical 5' and 3' ends, are greatly enriched in pyrimidines in the coding strand and have frequent pyrimidines-only tracts. These latter features seem to be responsible for the difficulties in cloning and sequencing of parts of these genes. Short sequences shared with P. polycephalum transposon-like repeats are common in the introns, indicating a possible role of transposition in intron evolution. In all three ras family genes phase zero introns are located mostly between sequences coding for regular protein secondary structure elements.
Structure-related clustering of gene expression fingerprints of thp-1 cells exposed to smaller polycyclic aromatic hydrocarbons.

PubMed

Wan, B; Yarbrough, J W; Schultz, T W

2008-01-01

This study was undertaken to test the hypothesis that structurally similar PAHs induce similar gene expression profiles. THP-1 cells were exposed to a series of 12 selected PAHs at 50 microM for 24 hours and gene expressions profiles were analyzed using both unsupervised and supervised methods. Clustering analysis of gene expression profiles revealed that the 12 tested chemicals were grouped into five clusters. Within each cluster, the gene expression profiles are more similar to each other than to the ones outside the cluster. One-methylanthracene and 1-methylfluorene were found to have the most similar profiles; dibenzothiophene and dibenzofuran were found to share common profiles with fluorine. As expression pattern comparisons were expanded, similarity in genomic fingerprint dropped off dramatically. Prediction analysis of microarrays (PAM) based on the clustering pattern generated 49 predictor genes that can be used for sample discrimination. Moreover, a significant analysis of Microarrays (SAM) identified 598 genes being modulated by tested chemicals with a variety of biological processes, such as cell cycle, metabolism, and protein binding and KEGG pathways being significantly (p < 0.05) affected. It is feasible to distinguish structurally different PAHs based on their genomic fingerprints, which are mechanism based.
A Roadmap for Functional Structural Variants in the Soybean Genome

PubMed Central

Anderson, Justin E.; Kantar, Michael B.; Kono, Thomas Y.; Fu, Fengli; Stec, Adrian O.; Song, Qijian; Cregan, Perry B.; Specht, James E.; Diers, Brian W.; Cannon, Steven B.; McHale, Leah K.; Stupar, Robert M.

2014-01-01

Gene structural variation (SV) has recently emerged as a key genetic mechanism underlying several important phenotypic traits in crop species. We screened a panel of 41 soybean (Glycine max) accessions serving as parents in a soybean nested association mapping population for deletions and duplications in more than 53,000 gene models. Array hybridization and whole genome resequencing methods were used as complementary technologies to identify SV in 1528 genes, or approximately 2.8%, of the soybean gene models. Although SV occurs throughout the genome, SV enrichment was noted in families of biotic defense response genes. Among accessions, SV was nearly eightfold less frequent for gene models that have retained paralogs since the last whole genome duplication event, compared with genes that have not retained paralogs. Increases in gene copy number, similar to that described at the Rhg1 resistance locus, account for approximately one-fourth of the genic SV events. This assessment of soybean SV occurrence presents a target list of genes potentially responsible for rapidly evolving and/or adaptive traits. PMID:24855315
Differential accumulation of nif structural gene mRNA in Azotobacter vinelandii.

PubMed

Hamilton, Trinity L; Jacobson, Marty; Ludwig, Marcus; Boyd, Eric S; Bryant, Donald A; Dean, Dennis R; Peters, John W

2011-09-01

Northern analysis was employed to investigate mRNA produced by mutant strains of Azotobacter vinelandii with defined deletions in the nif structural genes and in the intergenic noncoding regions. The results indicate that intergenic RNA secondary structures effect the differential accumulation of transcripts, supporting the high Fe protein-to-MoFe protein ratio required for optimal diazotrophic growth.
Ol-Prx 3, a member of an additional class of homeobox genes, is unimodally expressed in several domains of the developing and adult central nervous system of the medaka (Oryzias latipes)

PubMed Central

Joly, Jean-Stephane; Bourrat, Franck; Nguyen, Van; Chourrout, Daniel

1997-01-01

Large-scale genetic screens for mutations affecting early neurogenesis of vertebrates have recently been performed with an aquarium fish, the zebrafish. Later stages of neural morphogenesis have attracted less attention in small fish species, partly because of the lack of molecular markers of developing structures that may facilitate the detection of discrete structural alterations. In this context, we report the characterization of Ol-Prx 3 (Oryzias latipes-Prx 3). This gene was isolated in the course of a large-scale screen for brain cDNAs containing a highly conserved DNA binding region, the homeobox helix-three. Sequence analysis revealed that this gene belongs to another class of homeobox genes, together with a previously isolated mouse ortholog, called OG-12 [Rovescalli, A. C., Asoh, S. & Nirenberg, M. (1996) Proc. Natl. Acad. Sci. USA 93, 10691–10696] and with the human SHOX gene [Rao, E., Weiss, B., Fukami, M., Rump, A., Niesler, B., et al. (1997) Nat. Genet. 16, 54–62], thought to be involved in the short-stature phenotype of Turner syndrome patients. These three genes exhibit a moderate level of identity in the homeobox with the other genes of the paired-related (PRX) gene family. Ol-Prx 3, as well as the PRX genes, are expressed in various cartilaginous structures of head and limbs. These genes might thus be involved in common regulatory pathways during the morphogenesis of these structures. Moreover, this paper reports a complex and monophasic pattern of Ol-Prx 3 expression in the central nervous system, which differs markedly from the patterns reported for the PRX genes, Prx 3 excluded: this gene begins to be expressed in a variety of central nervous system territories at late neurula stage. Strikingly, it remains turned on in some of the derivatives of each territory during the entire life of the fish. We hope this work will thus help identify common features for the PRX 3 family of homeobox genes. PMID:9371787
Phylogenetics and evolution of Trx SET genes in fully sequenced land plants.

PubMed

Zhu, Xinyu; Chen, Caoyi; Wang, Baohua

2012-04-01

Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we made a comprehensive evolutionary analysis on this gene family from 13 major representatives of green plants. A novel clustering (here named as cpTrx clade), which included the III-1, III-2, and III-4 orthologous groups, previously resolved was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were early integrated into primordial SET-PostSET domain organization of cpTrx clade. We suggested that the PostSET domain was lost in some members of III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from the Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.
Ornithorhynchus anatinus (platypus) links the evolution of immunoglobulin genes in eutherian mammals and nonmammalian tetrapods.

PubMed

Zhao, Yaofeng; Cui, Huiting; Whittington, Camilla M; Wei, Zhiguo; Zhang, Xiaofeng; Zhang, Ziding; Yu, Li; Ren, Liming; Hu, Xiaoxiang; Zhang, Yaping; Hellman, Lars; Belov, Katherine; Li, Ning; Hammarström, Lennart

2009-09-01

The evolutionary origins of mammalian immunoglobulin H chain isotypes (IgM, IgD, IgG, IgE, and IgA) are still incompletely understood as these isotypes differ considerably in structure and number from their counterparts in nonmammalian tetrapods. We report in this study that the platypus (Ornithorhynchus anatinus) Ig H chain constant region gene locus contains eight Ig encoding genes, which are arranged in an mu-delta-omicron-gamma2-gamma1-alpha1-epsilon-alpha2 order, spanning a total of approximately 200 kb DNA, encoding six distinct isotypes. The omicron (omicron for Ornithorhynchus) gene encodes a novel Ig H chain isotype that consists of four constant region domains and a hinge, and is structurally different from any of the five known mammalian Ig classes. This gene is phylogenetically related to upsilon (epsilon) and gamma, and thus appears to be a structural intermediate between these two genes. The platypus delta gene encodes ten heavy chain constant region domains, lacks a hinge region and is similar to IgD in amphibians and fish, but strikingly different from that in eutherian mammals. The platypus Ig H chain isotype repertoire thus shows a unique combination of genes that share similarity both to those of nonmammalian tetrapods and eutherian animals and demonstrates how phylogenetically informative species can be used to reconstruct the evolutionary history of functionally important genes.
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.

PubMed

Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J

2018-06-27

The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Ubiquitous and gene-specific regulatory 5' sequences in a sea urchin histone DNA clone coding for histone protein variants.

PubMed Central

Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L

1980-01-01

The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. Images PMID:7443547
Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes.

PubMed

Milbury, Coren A; Lee, Jung C; Cannone, Jamie J; Gaffney, Patrick M; Gutell, Robin R

2010-09-02

Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters. In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters C. virginica, C. gigas, and C. hongkongensis. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes. Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.
Genome-Wide Analysis of the Sucrose Synthase Gene Family in Grape (Vitis vinifera): Structure, Evolution, and Expression Profiles

PubMed Central

Zhu, Xudong; Wang, Mengqi; Li, Xiaopeng; Jiu, Songtao; Wang, Chen; Fang, Jinggui

2017-01-01

Sucrose synthase (SS) is widely considered as the key enzyme involved in the plant sugar metabolism that is critical to plant growth and development, especially quality of the fruit. The members of SS gene family have been identified and characterized in multiple plant genomes. However, detailed information about this gene family is lacking in grapevine (Vitis vinifera L.). In this study, we performed a systematic analysis of the grape (V. vinifera) genome and reported that there are five SS genes (VvSS1–5) in the grape genome. Comparison of the structures of grape SS genes showed high structural conservation of grape SS genes, resulting from the selection pressures during the evolutionary process. The segmental duplication of grape SS genes contributed to this gene family expansion. The syntenic analyses between grape and soybean (Glycine max) demonstrated that these genes located in corresponding syntenic blocks arose before the divergence of grape and soybean. Phylogenetic analysis revealed distinct evolutionary paths for the grape SS genes. VvSS1/VvSS5, VvSS2/VvSS3 and VvSS4 originated from three ancient SS genes, which were generated by duplication events before the split of monocots and eudicots. Bioinformatics analysis of publicly available microarray data, which was validated by quantitative real-time reverse transcription PCR (qRT-PCR), revealed distinct temporal and spatial expression patterns of VvSS genes in various tissues, organs and developmental stages, as well as in response to biotic and abiotic stresses. Taken together, our results will be beneficial for further investigations into the functions of SS gene in the processes of grape resistance to environmental stresses. PMID:28350372
Community structure and function of planktonic Crenarchaeota: changes with depth in the South China Sea.

PubMed

Hu, Anyi; Jiao, Nianzhi; Zhang, Chuanlun L

2011-10-01

Marine Crenarchaeota represent a widespread and abundant microbial group in marine ecosystems. Here, we investigated the abundance, diversity, and distribution of planktonic Crenarchaeota in the epi-, meso-, and bathypelagic zones at three stations in the South China Sea (SCS) by analysis of crenarchaeal 16S rRNA gene, ammonia monooxygenase gene amoA involved in ammonia oxidation, and biotin carboxylase gene accA putatively involved in archaeal CO(2) fixation. Quantitative PCR analyses indicated that crenarchaeal amoA and accA gene abundances varied similarly with archaeal and crenarchaeal 16S rRNA gene abundances at all stations, except that crenarchaeal accA genes were almost absent in the epipelagic zone. Ratios of the crenarchaeal amoA gene to 16S rRNA gene abundances decreased ~2.6 times from the epi- to bathypelagic zones, whereas the ratios of crenarchaeal accA gene to marine group I crenarchaeal 16S rRNA gene or to crenarchaeal amoA gene abundances increased with depth, suggesting that the metabolism of Crenarchaeota may change from the epi- to meso- or bathypelagic zones. Denaturing gradient gel electrophoresis profiling of the 16S rRNA genes revealed depth partitioning in archaeal community structures. Clone libraries of crenarchaeal amoA and accA genes showed two clusters: the "shallow" cluster was exclusively derived from epipelagic water and the "deep" cluster was from meso- and/or bathypelagic waters, suggesting that niche partitioning may take place between the shallow and deep marine Crenarchaeota. Overall, our results show strong depth partitioning of crenarchaeal populations in the SCS and suggest a shift in their community structure and ecological function with increasing depth.
Jumping Genes: The Transposable DNAs of Bacteria.

ERIC Educational Resources Information Center

Berg, Claire M.; Berg, Douglas E.

1984-01-01

Transposons are transposable elements that carry genes for antibiotic resistance. Provides background information on the structure and organization of these "jumping genes" in bacteria. Also describes the use of transposons in tagging genes and lists pertinent references and resource materials. (DH)
Dynamic Succession of Groundwater Functional Microbial Communities in Response to Emulsified Vegetable Oil Amendment during Sustained In Situ U(VI) Reduction.

PubMed

Zhang, Ping; Wu, Wei-Min; Van Nostrand, Joy D; Deng, Ye; He, Zhili; Gihring, Thomas; Zhang, Gengxin; Schadt, Chris W; Watson, David; Jardine, Phil; Criddle, Craig S; Brooks, Scott; Marsh, Terence L; Tiedje, James M; Arkin, Adam P; Zhou, Jizhong

2015-06-15

A pilot-scale field experiment demonstrated that a one-time amendment of emulsified vegetable oil (EVO) reduced groundwater U(VI) concentrations for 1 year in a fast-flowing aquifer. However, little is known about how EVO amendment stimulates the functional gene composition, structure, and dynamics of groundwater microbial communities toward prolonged U(VI) reduction. In this study, we hypothesized that EVO amendment would shift the functional gene composition and structure of groundwater microbial communities and stimulate key functional genes/groups involved in EVO biodegradation and reduction of electron acceptors in the aquifer. To test these hypotheses, groundwater microbial communities after EVO amendment were analyzed using a comprehensive functional gene microarray. Our results showed that EVO amendment stimulated sequential shifts in the functional composition and structure of groundwater microbial communities. Particularly, the relative abundance of key functional genes/groups involved in EVO biodegradation and the reduction of NO3 (-), Mn(IV), Fe(III), U(VI), and SO4 (2-) significantly increased, especially during the active U(VI) reduction period. The relative abundance for some of these key functional genes/groups remained elevated over 9 months. Montel tests suggested that the dynamics in the abundance, composition, and structure of these key functional genes/groups were significantly correlated with groundwater concentrations of acetate, NO3 (-), Mn(II), Fe(II), U(VI), and SO4 (2-). Our results suggest that EVO amendment stimulated dynamic succession of key functional microbial communities. This study improves our understanding of the composition, structure, and function changes needed for groundwater microbial communities to sustain a long-term U(VI) reduction. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Dynamic Succession of Groundwater Functional Microbial Communities in Response to Emulsified Vegetable Oil Amendment during Sustained In Situ U(VI) Reduction

PubMed Central

Zhang, Ping; Wu, Wei-Min; Van Nostrand, Joy D.; Deng, Ye; He, Zhili; Gihring, Thomas; Zhang, Gengxin; Schadt, Chris W.; Watson, David; Jardine, Phil; Criddle, Craig S.; Brooks, Scott; Marsh, Terence L.; Tiedje, James M.; Arkin, Adam P.

2015-01-01

A pilot-scale field experiment demonstrated that a one-time amendment of emulsified vegetable oil (EVO) reduced groundwater U(VI) concentrations for 1 year in a fast-flowing aquifer. However, little is known about how EVO amendment stimulates the functional gene composition, structure, and dynamics of groundwater microbial communities toward prolonged U(VI) reduction. In this study, we hypothesized that EVO amendment would shift the functional gene composition and structure of groundwater microbial communities and stimulate key functional genes/groups involved in EVO biodegradation and reduction of electron acceptors in the aquifer. To test these hypotheses, groundwater microbial communities after EVO amendment were analyzed using a comprehensive functional gene microarray. Our results showed that EVO amendment stimulated sequential shifts in the functional composition and structure of groundwater microbial communities. Particularly, the relative abundance of key functional genes/groups involved in EVO biodegradation and the reduction of NO3−, Mn(IV), Fe(III), U(VI), and SO42− significantly increased, especially during the active U(VI) reduction period. The relative abundance for some of these key functional genes/groups remained elevated over 9 months. Montel tests suggested that the dynamics in the abundance, composition, and structure of these key functional genes/groups were significantly correlated with groundwater concentrations of acetate, NO3−, Mn(II), Fe(II), U(VI), and SO42−. Our results suggest that EVO amendment stimulated dynamic succession of key functional microbial communities. This study improves our understanding of the composition, structure, and function changes needed for groundwater microbial communities to sustain a long-term U(VI) reduction. PMID:25862231
Dynamic Succession of Groundwater Functional Microbial Communities in Response to Emulsified Vegetable Oil Amendment during Sustained In Situ U(VI) Reduction

DOE PAGES

Zhang, Ping; Wu, Wei-Min; Van Nostrand, Joy D.; ...

2015-04-10

A pilot-scale field experiment demonstrated that a one-time amendment of emulsified vegetable oil (EVO) reduced groundwater U(VI) concentrations for 1 year in a fast-flowing aquifer. However, little is known about how EVO amendment stimulates the functional gene composition, structure, and dynamics of groundwater microbial communities toward prolonged U(VI) reduction. In this paper, we hypothesized that EVO amendment would shift the functional gene composition and structure of groundwater microbial communities and stimulate key functional genes/groups involved in EVO biodegradation and reduction of electron acceptors in the aquifer. To test these hypotheses, groundwater microbial communities after EVO amendment were analyzed using amore » comprehensive functional gene microarray. Our results showed that EVO amendment stimulated sequential shifts in the functional composition and structure of groundwater microbial communities. Particularly, the relative abundance of key functional genes/groups involved in EVO biodegradation and the reduction of NO 3 -, Mn(IV), Fe(III), U(VI), and SO 4 2- significantly increased, especially during the active U(VI) reduction period. The relative abundance for some of these key functional genes/groups remained elevated over 9 months. Montel tests suggested that the dynamics in the abundance, composition, and structure of these key functional genes/groups were significantly correlated with groundwater concentrations of acetate, NO 3 -, Mn(II), Fe(II), U(VI), and SO 4 2-. Our results suggest that EVO amendment stimulated dynamic succession of key functional microbial communities. Finally, this study improves our understanding of the composition, structure, and function changes needed for groundwater microbial communities to sustain a long-term U(VI) reduction.« less
Dynamic Succession of Groundwater Functional Microbial Communities in Response to Emulsified Vegetable Oil Amendment during Sustained In Situ U(VI) Reduction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Ping; Wu, Wei-Min; Van Nostrand, Joy D.

A pilot-scale field experiment demonstrated that a one-time amendment of emulsified vegetable oil (EVO) reduced groundwater U(VI) concentrations for 1 year in a fast-flowing aquifer. However, little is known about how EVO amendment stimulates the functional gene composition, structure, and dynamics of groundwater microbial communities toward prolonged U(VI) reduction. In this paper, we hypothesized that EVO amendment would shift the functional gene composition and structure of groundwater microbial communities and stimulate key functional genes/groups involved in EVO biodegradation and reduction of electron acceptors in the aquifer. To test these hypotheses, groundwater microbial communities after EVO amendment were analyzed using amore » comprehensive functional gene microarray. Our results showed that EVO amendment stimulated sequential shifts in the functional composition and structure of groundwater microbial communities. Particularly, the relative abundance of key functional genes/groups involved in EVO biodegradation and the reduction of NO 3 -, Mn(IV), Fe(III), U(VI), and SO 4 2- significantly increased, especially during the active U(VI) reduction period. The relative abundance for some of these key functional genes/groups remained elevated over 9 months. Montel tests suggested that the dynamics in the abundance, composition, and structure of these key functional genes/groups were significantly correlated with groundwater concentrations of acetate, NO 3 -, Mn(II), Fe(II), U(VI), and SO 4 2-. Our results suggest that EVO amendment stimulated dynamic succession of key functional microbial communities. Finally, this study improves our understanding of the composition, structure, and function changes needed for groundwater microbial communities to sustain a long-term U(VI) reduction.« less
Responses of the functional structure of soil microbial community to livestock grazing in the Tibetan alpine grassland.

PubMed

Yang, Yunfeng; Wu, Linwei; Lin, Qiaoyan; Yuan, Mengting; Xu, Depeng; Yu, Hao; Hu, Yigang; Duan, Jichuang; Li, Xiangzhen; He, Zhili; Xue, Kai; van Nostrand, Joy; Wang, Shiping; Zhou, Jizhong

2013-02-01

Microbes play key roles in various biogeochemical processes, including carbon (C) and nitrogen (N) cycling. However, changes of microbial community at the functional gene level by livestock grazing, which is a global land-use activity, remain unclear. Here we use a functional gene array, GeoChip 4.0, to examine the effects of free livestock grazing on the microbial community at an experimental site of Tibet, a region known to be very sensitive to anthropogenic perturbation and global warming. Our results showed that grazing changed microbial community functional structure, in addition to aboveground vegetation and soil geochemical properties. Further statistical tests showed that microbial community functional structures were closely correlated with environmental variables, and variations in microbial community functional structures were mainly controlled by aboveground vegetation, soil C/N ratio, and NH4 (+) -N. In-depth examination of N cycling genes showed that abundances of N mineralization and nitrification genes were increased at grazed sites, but denitrification and N-reduction genes were decreased, suggesting that functional potentials of relevant bioprocesses were changed. Meanwhile, abundances of genes involved in methane cycling, C fixation, and degradation were decreased, which might be caused by vegetation removal and hence decrease in litter accumulation at grazed sites. In contrast, abundances of virulence, stress, and antibiotics resistance genes were increased because of the presence of livestock. In conclusion, these results indicated that soil microbial community functional structure was very sensitive to the impact of livestock grazing and revealed microbial functional potentials in regulating soil N and C cycling, supporting the necessity to include microbial components in evaluating the consequence of land-use and/or climate changes. © 2012 Blackwell Publishing Ltd.

Assessment of Bacterial bph Gene in Amazonian Dark Earth and Their Adjacent Soils

PubMed Central

Brossi, Maria Julia de Lima; Mendes, Lucas William; Germano, Mariana Gomes; Lima, Amanda Barbosa; Tsai, Siu Mui

2014-01-01

Amazonian Anthrosols are known to harbour distinct and highly diverse microbial communities. As most of the current assessments of these communities are based on taxonomic profiles, the functional gene structure of these communities, such as those responsible for key steps in the carbon cycle, mostly remain elusive. To gain insights into the diversity of catabolic genes involved in the degradation of hydrocarbons in anthropogenic horizons, we analysed the bacterial bph gene community structure, composition and abundance using T-RFLP, 454-pyrosequencing and quantitative PCR essays, respectively. Soil samples were collected in two Brazilian Amazon Dark Earth (ADE) sites and at their corresponding non-anthropogenic adjacent soils (ADJ), under two different land use systems, secondary forest (SF) and manioc cultivation (M). Redundancy analysis of T-RFLP data revealed differences in bph gene structure according to both soil type and land use. Chemical properties of ADE soils, such as high organic carbon and organic matter, as well as effective cation exchange capacity and pH, were significantly correlated with the structure of bph communities. Also, the taxonomic affiliation of bph gene sequences revealed the segregation of community composition according to the soil type. Sequences at ADE sites were mostly affiliated to aromatic hydrocarbon degraders belonging to the genera Streptomyces, Sphingomonas, Rhodococcus, Mycobacterium, Conexibacter and Burkholderia. In both land use sites, shannon's diversity indices based on the bph gene data were higher in ADE than ADJ soils. Collectively, our findings provide evidence that specific properties in ADE soils shape the structure and composition of bph communities. These results provide a basis for further investigations focusing on the bio-exploration of novel enzymes with potential use in the biotechnology/biodegradation industry. PMID:24927167
Assessment of bacterial bph gene in Amazonian dark earth and their adjacent soils.

PubMed

Brossi, Maria Julia de Lima; Mendes, Lucas William; Germano, Mariana Gomes; Lima, Amanda Barbosa; Tsai, Siu Mui

2014-01-01

Amazonian Anthrosols are known to harbour distinct and highly diverse microbial communities. As most of the current assessments of these communities are based on taxonomic profiles, the functional gene structure of these communities, such as those responsible for key steps in the carbon cycle, mostly remain elusive. To gain insights into the diversity of catabolic genes involved in the degradation of hydrocarbons in anthropogenic horizons, we analysed the bacterial bph gene community structure, composition and abundance using T-RFLP, 454-pyrosequencing and quantitative PCR essays, respectively. Soil samples were collected in two Brazilian Amazon Dark Earth (ADE) sites and at their corresponding non-anthropogenic adjacent soils (ADJ), under two different land use systems, secondary forest (SF) and manioc cultivation (M). Redundancy analysis of T-RFLP data revealed differences in bph gene structure according to both soil type and land use. Chemical properties of ADE soils, such as high organic carbon and organic matter, as well as effective cation exchange capacity and pH, were significantly correlated with the structure of bph communities. Also, the taxonomic affiliation of bph gene sequences revealed the segregation of community composition according to the soil type. Sequences at ADE sites were mostly affiliated to aromatic hydrocarbon degraders belonging to the genera Streptomyces, Sphingomonas, Rhodococcus, Mycobacterium, Conexibacter and Burkholderia. In both land use sites, shannon's diversity indices based on the bph gene data were higher in ADE than ADJ soils. Collectively, our findings provide evidence that specific properties in ADE soils shape the structure and composition of bph communities. These results provide a basis for further investigations focusing on the bio-exploration of novel enzymes with potential use in the biotechnology/biodegradation industry.
Integration of the blaNDM-1 carbapenemase gene into Proteus genomic island 1 (PGI1-PmPEL) in a Proteus mirabilis clinical isolate.

PubMed

Girlich, Delphine; Dortet, Laurent; Poirel, Laurent; Nordmann, Patrice

2015-01-01

To decipher the mechanisms and their associated genetic determinants responsible for β-lactam resistance in a Proteus mirabilis clinical isolate. The entire genetic structure surrounding the β-lactam resistance genes was characterized by PCR, gene walking and DNA sequencing. Genes encoding the carbapenemase NDM-1 and the ESBL VEB-6 were located in a 38.5 kb MDR structure, which itself was inserted into a new variant of the Proteus genomic island 1 (PGI1). This new PGI1-PmPEL variant of 64.4 kb was chromosomally located, as an external circular form in the P. mirabilis isolate, suggesting potential mobility. This is the first known description of the bla(NDM-1) gene in a genomic island structure, which might further enhance the spread of the bla(NDM-1) carbapenemase gene among enteric pathogens. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Characterization of embryo-specific genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

1989-01-01

The objective of the proposed research is to characterize the structure and function of a set of genes whose expression is regulated in embryo development, and that is not expressed in mature tissues -- the embryonic genes. In the last two years, using cDNA clones, we have isolated 22 cDNA clones, and characterized the expression pattern of their corresponding RNA. At least 4 cDNA clones detect RNAs of embryonic genes. These cDNA clones detect RNAs expressed in somatic as well as zygotic embryos of carrot. Using the cDNA clones, we screened the genomic library of carrot embryo DNA, and isolatedmore » genomic clones for three genes. The structure and function of two genes DC 8 and DC 59 have been characterized and are reported in this paper.« less
The Origins and Evolution of the p53 Family of Genes

PubMed Central

Belyi, Vladimir A.; Ak, Prashanth; Markert, Elke; Wang, Haijian; Hu, Wenwei; Puzio-Kuter, Anna; Levine, Arnold J.

2010-01-01

A common ancestor to the three p53 family members of human genes p53, p63, and p73 is first detected in the evolution of modern‐day sea anemones, in which both structurally and functionally it acts to protect the germ line from genomic instabilities in response to stresses. This p63/p73 common ancestor gene is found in almost all invertebrates and first duplicates to produce a p53 gene and a p63/p73 ancestor in cartilaginous fish. Bony fish contain all three genes, p53, p63, and p73, and the functions of these three transcription factors diversify in the higher vertebrates. Thus, this gene family has preserved its structural features and functional activities for over one billion years of evolution. PMID:20516129
Transcript analysis of the extended hyp-operon in the cyanobacteria Nostoc sp. strain PCC 7120 and Nostoc punctiforme ATCC 29133

PubMed Central

2011-01-01

Background Cyanobacteria harbor two [NiFe]-type hydrogenases consisting of a large and a small subunit, the Hup- and Hox-hydrogenase, respectively. Insertion of ligands and correct folding of nickel-iron hydrogenases require assistance of accessory maturation proteins (encoded by the hyp-genes). The intergenic region between the structural genes encoding the uptake hydrogenase (hupSL) and the accessory maturation proteins (hyp genes) in the cyanobacteria Nostoc PCC 7120 and N. punctiforme were analysed using molecular methods. Findings The five ORFs, located in between the uptake hydrogenase structural genes and the hyp-genes, can form a transcript with the hyp-genes. An identical genomic localization of these ORFs are found in other filamentous, N2-fixing cyanobacterial strains. In N. punctiforme and Nostoc PCC 7120 the ORFs upstream of the hyp-genes showed similar transcript level profiles as hupS (hydrogenase structural gene), nifD (nitrogenase structural gene), hypC and hypF (accessory hydrogenase maturation genes) after nitrogen depletion. In silico analyzes showed that these ORFs in N. punctiforme harbor the same conserved regions as their homologues in Nostoc PCC 7120 and that they, like their homologues in Nostoc PCC 7120, can be transcribed together with the hyp-genes forming a larger extended hyp-operon. DNA binding studies showed interactions of the transcriptional regulators CalA and CalB to the promoter regions of the extended hyp-operon in N. punctiforme and Nostoc PCC 7120. Conclusions The five ORFs upstream of the hyp-genes in several filamentous N2-fixing cyanobacteria have an identical genomic localization, in between the genes encoding the uptake hydrogenase and the maturation protein genes. In N. punctiforme and Nostoc PCC 7120 they are transcribed as one operon and may form transcripts together with the hyp-genes. The expression pattern of the five ORFs within the extended hyp-operon in both Nostoc punctiforme and Nostoc PCC 7120 is similar to the expression patterns of hupS, nifD, hypF and hypC. CalA, a known transcription factor, interacts with the promoter region between hupSL and the five ORFs in the extended hyp-operon in both Nostoc strains. PMID:21672234
Relative importance of pollen and seed dispersal across a Neotropical mountain landscape for an epiphytic orchid.

PubMed

Kartzinel, Tyler R; Shefferson, Richard P; Trapnell, Dorset W

2013-12-01

Populations of many species are isolated within narrow elevation bands of Neotropical mountain habitat, and how well dispersal maintains genetic connectivity is unknown. We asked whether genetic structure of an epiphytic orchid, Epidendrum firmum, corresponds to gaps between Costa Rican mountain ranges, and how these gaps influence pollen and seed flow. We predicted that significant genetic structure exists among mountain ranges due to different colonization histories and limited gene flow. Furthermore, we predicted that pollen movement contributes more to gene flow than seeds because seeds are released into strong winds perpendicular to the narrow northwest-southeast species distribution, while the likely pollinators are strong fliers. Individuals from 12 populations and three mountain ranges were genotyped with nuclear microsatellites (nDNA) and chloroplast sequences (cpDNA). Genetic diversity was high for both markers, while nDNA genetic structure was low (FSTn = 0.020) and cpDNA structure was moderate (FSTc = 0.443). Significant cpDNA barriers occurred within and among mountain ranges, but nDNA barriers were not significant after accounting for geographic distance. Consistent with these contrasting patterns of genetic structure, pollen contributes substantially more to gene flow among populations than seed (mp /ms = 46). Pollinators mediated extensive gene flow, eroding nDNA colonization footprints, while seed flow was comparatively limited, possibly due to directional prevailing winds across linearly distributed populations. Dispersal traits alone may not accurately inform predictions about gene flow or genetic structure, supporting the need for research into the potentially crucial role of pollinators and landscape context in gene flow among isolated populations. © 2013 John Wiley & Sons Ltd.
Eyeing the Cyr61/CTGF/NOV (CCN) group of genes in development and diseases: highlights of their structural likenesses and functional dissimilarities.

PubMed

Krupska, Izabela; Bruford, Elspeth A; Chaqour, Brahim

2015-09-23

"CCN" is an acronym referring to the first letter of each of the first three members of this original group of mammalian functionally and phylogenetically distinct extracellular matrix (ECM) proteins [i.e., cysteine-rich 61 (CYR61), connective tissue growth factor (CTGF), and nephroblastoma-overexpressed (NOV)]. Although "CCN" genes are unlikely to have arisen from a common ancestral gene, their encoded proteins share multimodular structures in which most cysteine residues are strictly conserved in their positions within several structural motifs. The CCN genes can be subdivided into members developmentally indispensable for embryonic viability (e.g., CCN1, 2 and 5), each assuming unique tissue-specific functions, and members not essential for embryonic development (e.g., CCN3, 4 and 6), probably due to a balance of functional redundancy and specialization during evolution. The temporo-spatial regulation of the CCN genes and the structural information contained within the sequences of their encoded proteins reflect diversity in their context and tissue-specific functions. Genetic association studies and experimental anomalies, replicated in various animal models, have shown that altered CCN gene structure or expression is associated with "injury" stimuli--whether mechanical (e.g., trauma, shear stress) or chemical (e.g., ischemia, hyperglycemia, hyperlipidemia, inflammation). Consequently, increased organ-specific susceptibility to structural damages ensues. These data underscore the critical functions of CCN proteins in the dynamics of tissue repair and regeneration and in the compensatory responses preceding organ failure. A better understanding of the regulation and mode of action of each CCN member will be useful in developing specific gain- or loss-of-function strategies for therapeutic purposes.
Fine-structure mapping of the firA gene, a locus involved in the phenotypic expression of rifampin resistance in Escherichia.

PubMed Central

Lathe, R

1977-01-01

The firA (Ts)200 mutation not only eliminates the resistance to rifampin of certain genetically resistant strains, but, moreover, renders ribonucleic acid synthesis thermolabile. The firA gene has been mapped by P1 tranduction and is located extremely close to the structural gene for deoxyribonucleic acid polymerase III at 4 min on the Escherichia coli linkage map. PMID:330494
JPRS Report, Science and Technology USSR: Life Sciences.

DTIC Science & Technology

1990-07-16

4 1 VETERINARY MEDICINE Primary Structure of RNA Polymerase Gene of Foot-and-Mouth Disease Virus ( FMDV ...neering were used to obtain cDNA corresponding to the Primary Structure of RNA Polymerase Gene of RNA polymerase gene to FMDV A 2 2 , with a map of the...Foot-and-Mouth Disease Virus ( FMDV ) A22 primary nucleotide sequence of the cDNA provided. 18400538F Moscow BIOORGANICHESKA YA Analysis of the data
Structural, evolutionary and genetic analysis of the histidine biosynthetic "core" in the genus Burkholderia.

PubMed

Papaleo, Maria Cristiana; Russo, Edda; Fondi, Marco; Emiliani, Giovanni; Frandi, Antonio; Brilli, Matteo; Pastorelli, Roberta; Fani, Renato

2009-12-01

In this work a detailed analysis of the structure, the expression and the organization of his genes belonging to the core of histidine biosynthesis (hisBHAF) in 40 newly determined and 13 available sequences of Burkholderia strains was carried out. Data obtained revealed a strong conservation of the structure and organization of these genes through the entire genus. The phylogenetic analysis showed the monophyletic origin of this gene cluster and indicated that it did not undergo horizontal gene transfer events. The analysis of the intergenic regions, based on the substitution rate, entropy plot and bendability suggested the existence of a putative transcription promoter upstream of hisB, that was supported by the genetic analysis that showed that this cluster was able to complement Escherichia colihisA, hisB, and hisF mutations. Moreover, a preliminary transcriptional analysis and the analysis of microarray data revealed that the expression of the his core was constitutive. These findings are in agreement with the fact that the entire Burkholderiahis operon is heterogeneous, in that it contains "alien" genes apparently not involved in histidine biosynthesis. Besides, they also support the idea that the proteobacterial his operon was piece-wisely assembled, i.e. through accretion of smaller units containing only some of the genes (eventually together with their own promoters) involved in this biosynthetic route. The correlation existing between the structure, organization and regulation of his "core" genes and the function(s) they perform in cellular metabolism is discussed.
Homology of aspartyl- and lysyl-tRNA synthetases.

PubMed Central

Gampel, A; Tzagoloff, A

1989-01-01

The yeast nuclear gene MSD1 coding for mitochondrial aspartyl-tRNA synthetase has been cloned and sequenced. The identity of the gene is confirmed by the following evidence. (i) The primary structure of the protein derived from the gene sequence is similar to that of the yeast cytoplasmic aspartyl-tRNA synthetase. (ii) In situ disruption of MSD1 in a respiratory-competent haploid strain of yeast induces a pleiotropic phenotype consistent with a lesion in mitochondrial protein synthesis. (iii) Mitochondria from a mutant with a disrupted chromosomal copy of MSD1 are unable to acylate mitochondrial aspartyl-tRNA. The primary structures of the cytoplasmic and mitochondrial aspartyl-tRNA synthetases are similar to the yeast cytoplasmic lysyl-tRNA synthetase, suggesting that the two types of synthetases may have a common evolutionary origin. Searches of the current protein banks also have revealed a high degree of sequence similarity of the lysyl-tRNA synthetase to the product of the Escherichia coli herC gene and to the partial sequence of a protein encoded by an unidentified reading frame located adjacent to the E. coli frdA gene. Based on the sequence similarities and the map positions of the herC and frdA loci, we propose herC to be the structural gene of the constitutively expressed lysyl-tRNA synthetase of E. coli and the unidentified reading frame to be the structural gene of the heat-inducible lysyl-tRNA synthetase. Images PMID:2668951
The structure of a gene co-expression network reveals biological functions underlying eQTLs.

PubMed

Villa-Vialaneix, Nathalie; Liaubet, Laurence; Laurent, Thibault; Cherel, Pierre; Gamot, Adrien; SanCristobal, Magali

2013-01-01

What are the commonalities between genes, whose expression level is partially controlled by eQTL, especially with regard to biological functions? Moreover, how are these genes related to a phenotype of interest? These issues are particularly difficult to address when the genome annotation is incomplete, as is the case for mammalian species. Moreover, the direct link between gene expression and a phenotype of interest may be weak, and thus difficult to handle. In this framework, the use of a co-expression network has proven useful: it is a robust approach for modeling a complex system of genetic regulations, and to infer knowledge for yet unknown genes. In this article, a case study was conducted with a mammalian species. It showed that the use of a co-expression network based on partial correlation, combined with a relevant clustering of nodes, leads to an enrichment of biological functions of around 83%. Moreover, the use of a spatial statistics approach allowed us to superimpose additional information related to a phenotype; this lead to highlighting specific genes or gene clusters that are related to the network structure and the phenotype. Three main results are worth noting: first, key genes were highlighted as a potential focus for forthcoming biological experiments; second, a set of biological functions, which support a list of genes under partial eQTL control, was set up by an overview of the global structure of the gene expression network; third, pH was found correlated with gene clusters, and then with related biological functions, as a result of a spatial analysis of the network topology.
Genetic mechanisms involved in the evolution of the cephalopod camera eye revealed by transcriptomic and developmental studies

PubMed Central

2011-01-01

Background Coleoid cephalopods (squids and octopuses) have evolved a camera eye, the structure of which is very similar to that found in vertebrates and which is considered a classic example of convergent evolution. Other molluscs, however, possess mirror, pin-hole, or compound eyes, all of which differ from the camera eye in the degree of complexity of the eye structures and neurons participating in the visual circuit. Therefore, genes expressed in the cephalopod eye after divergence from the common molluscan ancestor could be involved in eye evolution through association with the acquisition of new structural components. To clarify the genetic mechanisms that contributed to the evolution of the cephalopod camera eye, we applied comprehensive transcriptomic analysis and conducted developmental validation of candidate genes involved in coleoid cephalopod eye evolution. Results We compared gene expression in the eyes of 6 molluscan (3 cephalopod and 3 non-cephalopod) species and selected 5,707 genes as cephalopod camera eye-specific candidate genes on the basis of homology searches against 3 molluscan species without camera eyes. First, we confirmed the expression of these 5,707 genes in the cephalopod camera eye formation processes by developmental array analysis. Second, using molecular evolutionary (dN/dS) analysis to detect positive selection in the cephalopod lineage, we identified 156 of these genes in which functions appeared to have changed after the divergence of cephalopods from the molluscan ancestor and which contributed to structural and functional diversification. Third, we selected 1,571 genes, expressed in the camera eyes of both cephalopods and vertebrates, which could have independently acquired a function related to eye development at the expression level. Finally, as experimental validation, we identified three functionally novel cephalopod camera eye genes related to optic lobe formation in cephalopods by in situ hybridization analysis of embryonic pygmy squid. Conclusion We identified 156 genes positively selected in the cephalopod lineage and 1,571 genes commonly found in the cephalopod and vertebrate camera eyes from the analysis of cephalopod camera eye specificity at the expression level. Experimental validation showed that the cephalopod camera eye-specific candidate genes include those expressed in the outer part of the optic lobes, which unique to coleoid cephalopods. The results of this study suggest that changes in gene expression and in the primary structure of proteins (through positive selection) from those in the common molluscan ancestor could have contributed, at least in part, to cephalopod camera eye acquisition. PMID:21702923
The Crystal Structure of a Quercetin 2,3-Dioxygenase from Bacillus subtilis Suggests Modulation of Enzyme Activity by a Change in the Metal Ion at the Active Site(s)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gopal, B.; Madan, Lalima L.; Betz, Stephen F.

2010-11-10

Common structural motifs, such as the cupin domains, are found in enzymes performing different biochemical functions while retaining a similar active site configuration and structural scaffold. The soil bacterium Bacillus subtilis has 20 cupin genes (0.5% of the total genome) with up to 14% of its genes in the form of doublets, thus making it an attractive system for studying the effects of gene duplication. There are four bicupins in B. subtilis encoded by the genes yvrK, yoaN, yxaG, and ywfC. The gene products of yvrK and yoaN function as oxalate decarboxylases with a manganese ion at the active site(s),more » whereas YwfC is a bacitracin synthetase. Here we present the crystal structure of YxaG, a novel iron-containing quercetin 2,3-dioxygenase with one active site in each cupin domain. Yxag is a dimer, both in solution and in the crystal. The crystal structure shows that the coordination geometry of the Fe ion is different in the two active sites of YxaG. Replacement of the iron at the active site with other metal ions suggests modulation of enzymatic activity in accordance with the Irving-Williams observation on the stability of metal ion complexes. This observation, along with a comparison with the crystal structure of YvrK determined recently, has allowed for a detailed structure-function analysis of the active site, providing clues to the diversification of function in the bicupin family of proteins.« less
How the Sequence of a Gene Specifies Structural Symmetry in Proteins

PubMed Central

Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

2015-01-01

Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
The effects of graded levels of calorie restriction: VII. Topological rearrangement of hypothalamic aging networks.

PubMed

Derous, Davina; Mitchell, Sharon E; Green, Cara L; Wang, Yingchun; Han, Jing Dong J; Chen, Luonan; Promislow, Daniel E L; Lusseau, David; Speakman, John R; Douglas, Alex

2016-05-01

Connectivity in a gene-gene network declines with age, typically within gene clusters. We explored the effect of short-term (3 months) graded calorie restriction (CR) (up to 40 %) on network structure of aging-associated genes in the murine hypothalamus by using conditional mutual information. The networks showed a topological rearrangement when exposed to graded CR with a higher relative within cluster connectivity at 40CR. We observed changes in gene centrality concordant with changes in CR level, with Ppargc1a, and Ppt1 having increased centrality and Etfdh, Traf3 and Abcc1 decreased centrality as CR increased. This change in gene centrality in a graded manner with CR, occurred in the absence of parallel changes in gene expression levels. This study emphasizes the importance of augmenting traditional differential gene expression analyses to better understand structural changes in the transcriptome. Overall our results suggested that CR induced changes in centrality of biological relevant genes that play an important role in preventing the age-associated loss of network integrity irrespective of their gene expression levels.
The effects of graded levels of calorie restriction: VII. Topological rearrangement of hypothalamic aging networks

PubMed Central

Derous, Davina; Mitchell, Sharon E.; Green, Cara L.; Wang, Yingchun; Han, Jing Dong J.; Chen, Luonan; Promislow, Daniel E.L.; Lusseau, David; Speakman, John R.; Douglas, Alex

2016-01-01

Connectivity in a gene-gene network declines with age, typically within gene clusters. We explored the effect of short-term (3 months) graded calorie restriction (CR) (up to 40 %) on network structure of aging-associated genes in the murine hypothalamus by using conditional mutual information. The networks showed a topological rearrangement when exposed to graded CR with a higher relative within cluster connectivity at 40CR. We observed changes in gene centrality concordant with changes in CR level, with Ppargc1a, and Ppt1 having increased centrality and Etfdh, Traf3 and Abcc1 decreased centrality as CR increased. This change in gene centrality in a graded manner with CR, occurred in the absence of parallel changes in gene expression levels. This study emphasizes the importance of augmenting traditional differential gene expression analyses to better understand structural changes in the transcriptome. Overall our results suggested that CR induced changes in centrality of biological relevant genes that play an important role in preventing the age-associated loss of network integrity irrespective of their gene expression levels. PMID:27115072
Genome-wide Analyses of the Structural Gene Families Involved in the Legume-specific 5-Deoxyisoflavonoid Biosynthesis of Lotus japonicus

PubMed Central

Shimada, Norimoto; Sato, Shusei; Akashi, Tomoyoshi; Nakamura, Yasukazu; Tabata, Satoshi; Ayabe, Shin-ichi; Aoki, Toshio

2007-01-01

Abstract A model legume Lotus japonicus (Regel) K. Larsen is one of the subjects of genome sequencing and functional genomics programs. In the course of targeted approaches to the legume genomics, we analyzed the genes encoding enzymes involved in the biosynthesis of the legume-specific 5-deoxyisoflavonoid of L. japonicus, which produces isoflavan phytoalexins on elicitor treatment. The paralogous biosynthetic genes were assigned as comprehensively as possible by biochemical experiments, similarity searches, comparison of the gene structures, and phylogenetic analyses. Among the 10 biosynthetic genes investigated, six comprise multigene families, and in many cases they form gene clusters in the chromosomes. Semi-quantitative reverse transcriptase–PCR analyses showed coordinate up-regulation of most of the genes during phytoalexin induction and complex accumulation patterns of the transcripts in different organs. Some paralogous genes exhibited similar expression specificities, suggesting their genetic redundancy. The molecular evolution of the biosynthetic genes is discussed. The results presented here provide reliable annotations of the genes and genetic markers for comparative and functional genomics of leguminous plants. PMID:17452423
Mosaic Origins of a Complex Chimeric Mitochondrial Gene in Silene vulgaris

PubMed Central

Storchova, Helena; Müller, Karel; Lau, Steffen; Olson, Matthew S.

2012-01-01

Chimeric genes are significant sources of evolutionary innovation that are normally created when portions of two or more protein coding regions fuse to form a new open reading frame. In plant mitochondria astonishingly high numbers of different novel chimeric genes have been reported, where they are generated through processes of rearrangement and recombination. Nonetheless, because most studies do not find or report nucleotide variation within the same chimeric gene, evolution after the origination of these chimeric genes remains unstudied. Here we identify two alleles of a complex chimera in Silene vulgaris that are divergent in nucleotide sequence, genomic position relative to other mitochondrial genes, and expression patterns. Structural patterns suggest a history partially influenced by gene conversion between the chimeric gene and functional copies of subunit 1 of the mitochondrial ATP synthase gene (atp1). We identified small repeat structures within the chimeras that are likely recombination sites allowing generation of the chimera. These results establish the potential for chimeric gene divergence in different plant mitochondrial lineages within the same species. This result contrasts with the absence of diversity within mitochondrial chimeras found in crop species. PMID:22383961

Microbial community structure in fermentation process of Shaoxing rice wine by Illumina-based metagenomic sequencing.

PubMed

Xie, Guangfa; Wang, Lan; Gao, Qikang; Yu, Wenjing; Hong, Xutao; Zhao, Lingyun; Zou, Huijun

2013-09-01

To understand the role of the community structure of microbes in the environment in the fermentation of Shaoxing rice wine, samples collected from a wine factory were subjected to Illumina-based metagenomic sequencing. De novo assembly of the sequencing reads allowed the characterisation of more than 23 thousand microbial genes derived from 1.7 and 1.88 Gbp of sequences from two samples fermented for 5 and 30 days respectively. The microbial community structure at different fermentation times of Shaoxing rice wine was revealed, showing the different roles of the microbiota in the fermentation process of Shaoxing rice wine. The gene function of both samples was also studied in the COG database, with most genes belonging to category S (function unknown), category E (amino acid transport and metabolism) and unclassified group. The results show that both the microbial community structure and gene function composition change greatly at different time points of Shaoxing rice wine fermentation. © 2013 Society of Chemical Industry.
Structure and genetic diversity of natural populations of Morus alba in the trans-Himalayan Ladakh region.

PubMed

Bajpai, Prabodh K; Warghat, Ashish R; Sharma, Ram Kumar; Yadav, Ashish; Thakur, Anil K; Srivastava, Ravi B; Stobdan, Tsering

2014-04-01

Sequence-related amplified polymorphism markers were used to assess the genetic structure in three natural populations of Morus alba from trans-Himalaya. Multilocation sampling was conducted across 14 collection sites. The overall genetic diversity estimates were high: percentage polymorphic loci 89.66%, Nei's gene diversity 0.2286, and Shannon's information index 0.2175. At a regional level, partitioning of variability assessed using analysis of molecular variance (AMOVA), revealed 80% variation within and 20% among collection sites. Pattern appeared in STRUCTURE, BARRIER, and AMOVA, clearly demonstrating gene flow between the Indus and Suru populations and a geographic barrier between the Indus-Suru and Nubra populations, which effectively hinders gene flow. The results showed significant genetic differentiation, population structure, high to restricted gene flow, and high genetic diversity. The assumption that samples collected from the three valleys represent three different populations does not hold true. The fragmentation present in trans-Himalaya was more natural and less anthropogenic.
Cationic niosomes an effective gene carrier composed of novel spermine-derivative cationic lipids: effect of central core structures.

PubMed

Opanasopit, Praneet; Leksantikul, Lalita; Niyomtham, Nattisa; Rojanarata, Theerasak; Ngawhirunpat, Tanasait; Yingyongnarongkul, Boon-Ek

2017-05-01

Cationic niosomes formulated from Span 20, cholesterol (Chol) and novel spermine-based cationic lipids of multiple central core structures (di(oxyethyl)amino, di(oxyethyl)amino carboxy, 3-amino-1,2-dioxypropyl and 2-amino-1,3-dioxypropyl) were successfully prepared for improving transfection efficiency in vitro. The niosomes composed of spermine cationic lipid with central core structure of di(oxyethyl)amino revealed the highest gene transfection efficiency. To investigate the factors affecting gene transfection and cell viability including differences in the central core structures of cationic lipids, the composition of vesicles, molar ratio of cationic lipids in formulations and the weight ratio of niosomes to DNA. Cationic niosomes composed of nonionic surfactants (Span20), cholesterol and spermine-based cationic lipids of multiple central core structures were formulated. Gene transfection and cell viability were evaluated on a human cervical carcinoma cell line (HeLa cells) using pDNA encoding green fluorescent protein (pEGFP-C2). The morphology, size and charge were also characterized. High transfection efficiency was obtained from cationic niosomes composed of Span20:Chol:cationic lipid at the molar ratio of 2.5:2.5:0.5 mM. Cationic lipids with di(oxyethyl)amino as a central core structure exhibited highest transfection efficiency. In addition, there was also no serum effect on transfection efficiency. These novel cationic niosomes may constitute a good alternative carrier for gene transfection.
Genetic variations in the serotonergic system contribute to amygdala volume in humans

PubMed Central

Li, Jin; Chen, Chunhui; Wu, Karen; Zhang, Mingxia; Zhu, Bi; Chen, Chuansheng; Moyzis, Robert K.; Dong, Qi

2015-01-01

The amygdala plays a critical role in emotion processing and psychiatric disorders associated with emotion dysfunction. Accumulating evidence suggests that amygdala structure is modulated by serotonin-related genes. However, there is a gap between the small contributions of single loci (less than 1%) and the reported 63–65% heritability of amygdala structure. To understand the “missing heritability,” we systematically explored the contribution of serotonin genes on amygdala structure at the gene set level. The present study of 417 healthy Chinese volunteers examined 129 representative polymorphisms in genes from multiple biological mechanisms in the regulation of serotonin neurotransmission. A system-level approach using multiple regression analyses identified that nine SNPs collectively accounted for approximately 8% of the variance in amygdala volume. Permutation analyses showed that the probability of obtaining these findings by chance was low (p = 0.043, permuted for 1000 times). Findings showed that serotonin genes contribute moderately to individual differences in amygdala volume in a healthy Chinese sample. These results indicate that the system-level approach can help us to understand the genetic basis of a complex trait such as amygdala structure. PMID:26500508
Structural basis for regulation of rhizobial nodulation and symbiosis gene expression by the regulatory protein NolR.

PubMed

Lee, Soon Goo; Krishnan, Hari B; Jez, Joseph M

2014-04-29

The symbiosis between rhizobial microbes and host plants involves the coordinated expression of multiple genes, which leads to nodule formation and nitrogen fixation. As part of the transcriptional machinery for nodulation and symbiosis across a range of Rhizobium, NolR serves as a global regulatory protein. Here, we present the X-ray crystal structures of NolR in the unliganded form and complexed with two different 22-base pair (bp) double-stranded operator sequences (oligos AT and AA). Structural and biochemical analysis of NolR reveals protein-DNA interactions with an asymmetric operator site and defines a mechanism for conformational switching of a key residue (Gln56) to accommodate variation in target DNA sequences from diverse rhizobial genes for nodulation and symbiosis. This conformational switching alters the energetic contributions to DNA binding without changes in affinity for the target sequence. Two possible models for the role of NolR in the regulation of different nodulation and symbiosis genes are proposed. To our knowledge, these studies provide the first structural insight on the regulation of genes involved in the agriculturally and ecologically important symbiosis of microbes and plants that leads to nodule formation and nitrogen fixation.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa

PubMed Central

2015-01-01

Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
Zebrafish hox paralogue group 2 genes function redundantly as selector genes to pattern the second pharyngeal arch.

PubMed

Hunter, Michael P; Prince, Victoria E

2002-07-15

The pharyngeal arches are one of the defining features of the vertebrates, with the first arch forming the mandibles of the jaw and the second forming jaw support structures. The cartilaginous elements of each arch are formed from separate migratory neural crest cell streams, which derive from the dorsal aspect of the neural tube. The second and more posterior crest streams are characterized by specific Hox gene expression. The zebrafish has a larger overall number of Hox genes than the tetrapod vertebrates, as the result of a duplication event in its lineage. However, in both zebrafish and mouse, there are just two members of Hox paralogue group 2 (PG2): Hoxa2 and Hoxb2. Here, we show that morpholino-mediated "knock-down" of both zebrafish Hox PG2 genes results in major defects in second pharyngeal arch cartilages, involving replacement of ventral elements with a mirror-image duplication of first arch structures, and accompanying changes to pharyngeal musculature. In the mouse, null mutants of Hoxa2 have revealed that this single Hox gene is required for normal second arch patterning. By contrast, loss-of-function of either zebrafish Hox PG2 gene individually has no phenotypic consequence, showing that these two genes function redundantly to confer proper pattern to the second pharyngeal arch. We have also used hoxb1a mis-expression to induce localized ectopic expression of zebrafish Hox PG2 genes in the first arch; using this strategy, we find that ectopic expression of either Hox PG2 gene can confer second arch identity onto first arch structures, suggesting that the zebrafish Hox PG2 genes act as "selector genes." 2002 Elsevier Science (USA).
Comparative symbiotic plasmid analysis indicates that symbiosis gene ancestor type affects plasmid genetic evolution.

PubMed

Wang, X; Zhao, L; Zhang, L; Wu, Y; Chou, M; Wei, G

2018-07-01

Rhizobial symbiotic plasmids play vital roles in mutualistic symbiosis with legume plants by executing the functions of nodulation and nitrogen fixation. To explore the gene composition and genetic constitution of rhizobial symbiotic plasmids, comparison analyses of 24 rhizobial symbiotic plasmids derived from four rhizobial genera was carried out. Results illustrated that rhizobial symbiotic plasmids had higher proportion of functional genes participating in amino acid transport and metabolism, replication; recombination and repair; carbohydrate transport and metabolism; energy production and conversion and transcription. Mesorhizobium amorphae CCNWGS0123 symbiotic plasmid - pM0123d had similar gene composition with pR899b and pSNGR234a. All symbiotic plasmids shared 13 orthologous genes, including five nod and eight nif/fix genes which participate in the rhizobia-legume symbiosis process. These plasmids contained nod genes from four ancestors and fix genes from six ancestors. The ancestral type of pM0123d nod genes was similar with that of Rhizobium etli plasmids, while the ancestral type of pM0123d fix genes was same as that of pM7653Rb. The phylogenetic trees constructed based on nodCIJ and fixABC displayed different topological structures mainly due to nodCIJ and fixABC ancestral type discordance. The study presents valuable insights into mosaic structures and the evolution of rhizobial symbiotic plasmids. This study compared 24 rhizobial symbiotic plasmids that included four genera and 11 species, illuminating the functional gene composition and symbiosis gene ancestor types of symbiotic plasmids from higher taxonomy. It provides valuable insights into mosaic structures and the evolution of symbiotic plasmids. © 2018 The Society for Applied Microbiology.
Hormone-induced modifications of the chromatin structure surrounding upstream regulatory regions conserved between the mouse and rabbit whey acidic protein genes.

PubMed Central

Millot, Benjamin; Montoliu, Lluís; Fontaine, Marie-Louise; Mata, Teresa; Devinoy, Eve

2003-01-01

The upstream regulatory regions of the mouse and rabbit whey acidic protein (WAP) genes have been used extensively to target the efficient expression of foreign genes into the mammary gland of transgenic animals. Therefore both regions have been studied to elucidate fully the mechanisms controlling WAP gene expression. Three DNase I-hypersensitive sites (HSS0, HSS1 and HSS2) have been described upstream of the rabbit WAP gene in the lactating mammary gland and correspond to important regulatory regions. These sites are surrounded by variable chromatin structures during mammary-gland development. In the present study, we describe the upstream sequence of the mouse WAP gene. Analysis of genomic sequences shows that the mouse WAP gene is situated between two widely expressed genes (Cpr2 and Ramp3). We show that the hypersensitive sites found upstream of the rabbit WAP gene are also detected in the mouse WAP gene. Further, they encompass functional signal transducer and activator of transcription 5-binding sites, as has been observed in the rabbit. A new hypersensitive site (HSS3), not specific to the mammary gland, was mapped 8 kb upstream of the rabbit WAP gene. Unlike the three HSSs described above, HSS3 is also detected in the liver, but similar to HSS1, it does not depend on lactogenic hormone treatments during cell culture. The region surrounding HSS3 encompasses a potential matrix attachment region, which is also conserved upstream of the mouse WAP gene and contains a functional transcription factor Ets-1 (E26 transformation-specific-1)-binding site. Finally, we demonstrate for the first time that variations in the chromatin structure are dependent on prolactin alone. PMID:12580766
The WRKY Transcription Factor Genes in Lotus japonicus

PubMed Central

Wang, Pengfei; Wang, Xingjun

2014-01-01

WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (LjWRKY) genes can be classified into three groups (I–III). Investigations of gene copy number and gene clusters indicate that only one gene duplication event occurred on chromosome 4 and no clustered genes were detected on chromosomes 3 or 6. Researchers previously believed that group II and III WRKY domains were derived from the C-terminal WRKY domain of group I. Our results suggest that some WRKY genes in group II originated from the N-terminal domain of group I WRKY genes. Additional evidence to support this hypothesis was obtained by Medicago truncatula WRKY (MtWRKY) protein motif analysis. We found that LjWRKY and MtWRKY group III genes are under purifying selection, suggesting that WRKY genes will become increasingly structured and functionally conserved. PMID:24745006
Comparative and Evolutionary Analysis of Grass Pollen Allergens Using Brachypodium distachyon as a Model System

PubMed Central

Sharma, Akanksha; Sharma, Niharika; Bhalla, Prem; Singh, Mohan

2017-01-01

Comparative genomics have facilitated the mining of biological information from a genome sequence, through the detection of similarities and differences with genomes of closely or more distantly related species. By using such comparative approaches, knowledge can be transferred from the model to non-model organisms and insights can be gained in the structural and evolutionary patterns of specific genes. In the absence of sequenced genomes for allergenic grasses, this study was aimed at understanding the structure, organisation and expression profiles of grass pollen allergens using the genomic data from Brachypodium distachyon as it is phylogenetically related to the allergenic grasses. Combining genomic data with the anther RNA-Seq dataset revealed 24 pollen allergen genes belonging to eight allergen groups mapping on the five chromosomes in B. distachyon. High levels of anther-specific expression profiles were observed for the 24 identified putative allergen-encoding genes in Brachypodium. The genomic evidence suggests that gene encoding the group 5 allergen, the most potent trigger of hay fever and allergic asthma originated as a pollen specific orphan gene in a common grass ancestor of Brachypodium and Triticiae clades. Gene structure analysis showed that the putative allergen-encoding genes in Brachypodium either lack or contain reduced number of introns. Promoter analysis of the identified Brachypodium genes revealed the presence of specific cis-regulatory sequences likely responsible for high anther/pollen-specific expression. With the identification of putative allergen-encoding genes in Brachypodium, this study has also described some important plant gene families (e.g. expansin superfamily, EF-Hand family, profilins etc) for the first time in the model plant Brachypodium. Altogether, the present study provides new insights into structural characterization and evolution of pollen allergens and will further serve as a base for their functional characterization in related grass species. PMID:28103252
Phylogenetics and evolution of Su(var)3-9 SET genes in land plants: rapid diversification in structure and function.

PubMed

Zhu, Xinyu; Ma, Hong; Chen, Zhiduan

2011-03-09

Plants contain numerous Su(var)3-9 homologues (SUVH) and related (SUVR) genes, some of which await functional characterization. Although there have been studies on the evolution of plant Su(var)3-9 SET genes, a systematic evolutionary study including major land plant groups has not been reported. Large-scale phylogenetic and evolutionary analyses can help to elucidate the underlying molecular mechanisms and contribute to improve genome annotation. Putative orthologs of plant Su(var)3-9 SET protein sequences were retrieved from major representatives of land plants. A novel clustering that included most members analyzed, henceforth referred to as core Su(var)3-9 homologues and related (cSUVHR) gene clade, was identified as well as all orthologous groups previously identified. Our analysis showed that plant Su(var)3-9 SET proteins possessed a variety of domain organizations, and can be classified into five types and ten subtypes. Plant Su(var)3-9 SET genes also exhibit a wide range of gene structures among different paralogs within a family, even in the regions encoding conserved PreSET and SET domains. We also found that the majority of SUVH members were intronless and formed three subclades within the SUVH clade. A detailed phylogenetic analysis of the plant Su(var)3-9 SET genes was performed. A novel deep phylogenetic relationship including most plant Su(var)3-9 SET genes was identified. Additional domains such as SAR, ZnF_C2H2 and WIYLD were early integrated into primordial PreSET/SET/PostSET domain organization. At least three classes of gene structures had been formed before the divergence of Physcomitrella patens (moss) from other land plants. One or multiple retroposition events might have occurred among SUVH genes with the donor genes leading to the V-2 orthologous group. The structural differences among evolutionary groups of plant Su(var)3-9 SET genes with different functions were described, contributing to the design of further experimental studies.
Scale-dependent genetic structure of the Idaho giant salamander (Dicamptodon aterrimus) in stream networks.

PubMed

Mullen, Lindy B; Arthur Woods, H; Schwartz, Michael K; Sepulveda, Adam J; Lowe, Winsor H

2010-03-01

The network architecture of streams and rivers constrains evolutionary, demographic and ecological processes of freshwater organisms. This consistent architecture also makes stream networks useful for testing general models of population genetic structure and the scaling of gene flow. We examined genetic structure and gene flow in the facultatively paedomorphic Idaho giant salamander, Dicamptodon aterrimus, in stream networks of Idaho and Montana, USA. We used microsatellite data to test population structure models by (i) examining hierarchical partitioning of genetic variation in stream networks; and (ii) testing for genetic isolation by distance along stream corridors vs. overland pathways. Replicated sampling of streams within catchments within three river basins revealed that hierarchical scale had strong effects on genetic structure and gene flow. amova identified significant structure at all hierarchical scales (among streams, among catchments, among basins), but divergence among catchments had the greatest structural influence. Isolation by distance was detected within catchments, and in-stream distance was a strong predictor of genetic divergence. Patterns of genetic divergence suggest that differentiation among streams within catchments was driven by limited migration, consistent with a stream hierarchy model of population structure. However, there was no evidence of migration among catchments within basins, or among basins, indicating that gene flow only counters the effects of genetic drift at smaller scales (within rather than among catchments). These results show the strong influence of stream networks on population structure and genetic divergence of a salamander, with contrasting effects at different hierarchical scales.
Population structure and landscape genetics of two endangered frog species of genus Odorrana: different scenarios on two islands

PubMed Central

Igawa, T; Oumi, S; Katsuren, S; Sumida, M

2013-01-01

Isolation by distance and landscape connectivity are fundamental factors underlying speciation and evolution. To understand how landscapes affect gene flow and shape population structures, island species provide intrinsic study objects. We investigated the effects of landscapes on the population structure of the endangered frog species, Odorrana ishikawae and O. splendida, which each inhabit an island in southwest Japan. This was done by examining population structure, gene flow and demographic history of each species by analyzing 12 microsatellite loci and exploring causal environmental factors through ecological niche modeling (ENM) and the cost-distance approach. Our results revealed that the limited gene flow and multiple-population structure in O. splendida and the single-population structure in O. ishikawae were maintained after divergence of the species through ancient vicariance between islands. We found that genetic distance correlated with geographic distance between populations of both species. Our landscape genetic analysis revealed that the connectivity of suitable habitats influences gene flow and leads to the formation of specific population structures. In particular, different degrees of topographical complexity between islands are the major determining factor for shaping contrasting population structures of two species. In conclusion, our results illustrate the diversification mechanism of organisms through the interaction with space and environment. Our results also present an ENM approach for identifying the key factors affecting demographic history and population structures of target species, especially endangered species. PMID:22990312
Single master regulatory gene coordinates the evolution and development of butterfly color and iridescence

PubMed Central

Zhang, Linlin

2017-01-01

The optix gene has been implicated in butterfly wing pattern adaptation by genetic association, mapping, and expression studies. The actual developmental function of this gene has remained unclear, however. Here we used CRISPR/Cas9 genome editing to show that optix plays a fundamental role in nymphalid butterfly wing pattern development, where it is required for determination of all chromatic coloration. optix knockouts in four species show complete replacement of color pigments with melanins, with corresponding changes in pigment-related gene expression, resulting in black and gray butterflies. We also show that optix simultaneously acts as a switch gene for blue structural iridescence in some butterflies, demonstrating simple regulatory coordination of structural and pigmentary coloration. Remarkably, these optix knockouts phenocopy the recurring “black and blue” wing pattern archetype that has arisen on many independent occasions in butterflies. Here we demonstrate a simple genetic basis for structural coloration, and show that optix plays a deeply conserved role in butterfly wing pattern development. PMID:28923944
Single master regulatory gene coordinates the evolution and development of butterfly color and iridescence.

PubMed

Zhang, Linlin; Mazo-Vargas, Anyi; Reed, Robert D

2017-10-03

The optix gene has been implicated in butterfly wing pattern adaptation by genetic association, mapping, and expression studies. The actual developmental function of this gene has remained unclear, however. Here we used CRISPR/Cas9 genome editing to show that optix plays a fundamental role in nymphalid butterfly wing pattern development, where it is required for determination of all chromatic coloration. optix knockouts in four species show complete replacement of color pigments with melanins, with corresponding changes in pigment-related gene expression, resulting in black and gray butterflies. We also show that optix simultaneously acts as a switch gene for blue structural iridescence in some butterflies, demonstrating simple regulatory coordination of structural and pigmentary coloration. Remarkably, these optix knockouts phenocopy the recurring "black and blue" wing pattern archetype that has arisen on many independent occasions in butterflies. Here we demonstrate a simple genetic basis for structural coloration, and show that optix plays a deeply conserved role in butterfly wing pattern development.
Influence of putative exopolysaccharide genes on Pseudomonas putida KT2440 biofilm stability.

PubMed

Nilsson, Martin; Chiang, Wen-Chi; Fazli, Mustafa; Gjermansen, Morten; Givskov, Michael; Tolker-Nielsen, Tim

2011-05-01

We report a study of the role of putative exopolysaccharide gene clusters in the formation and stability of Pseudomonas putida KT2440 biofilm. Two novel putative exopolysaccharide gene clusters, pea and peb, were identified, and evidence is provided that they encode products that stabilize P. putida KT2440 biofilm. The gene clusters alg and bcs, which code for proteins mediating alginate and cellulose biosynthesis, were found to play minor roles in P. putida KT2440 biofilm formation and stability under the conditions tested. A P. putida KT2440 derivative devoid of any identifiable exopolysaccharide genes was found to form biofilm with a structure similar to wild-type biofilm, but with a stability lower than that of wild-type biofilm. Based on our data, we suggest that the formation of structured P. putida KT2440 biofilm can occur in the absence of exopolysaccharides; however, exopolysaccharides play a role as structural stabilizers. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Glycosyltransferase Gene Expression Profiles Classify Cancer Types and Propose Prognostic Subtypes

NASA Astrophysics Data System (ADS)

Ashkani, Jahanshah; Naidoo, Kevin J.

2016-05-01

Aberrant glycosylation in tumours stem from altered glycosyltransferase (GT) gene expression but can the expression profiles of these signature genes be used to classify cancer types and lead to cancer subtype discovery? The differential structural changes to cellular glycan structures are predominantly regulated by the expression patterns of GT genes and are a hallmark of neoplastic cell metamorphoses. We found that the expression of 210 GT genes taken from 1893 cancer patient samples in The Cancer Genome Atlas (TCGA) microarray data are able to classify six cancers; breast, ovarian, glioblastoma, kidney, colon and lung. The GT gene expression profiles are used to develop cancer classifiers and propose subtypes. The subclassification of breast cancer solid tumour samples illustrates the discovery of subgroups from GT genes that match well against basal-like and HER2-enriched subtypes and correlates to clinical, mutation and survival data. This cancer type glycosyltransferase gene signature finding provides foundational evidence for the centrality of glycosylation in cancer.
Genome-wide analysis of TCP family in tobacco.

PubMed

Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

2016-05-23

The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.
Evidence for cryptic northern refugia in the last glacial period in Cryptomeria japonica

PubMed Central

Kimura, Megumi K.; Uchiyama, Kentaro; Nakao, Katsuhiro; Moriguchi, Yoshinari; San Jose-Maldia, Lerma; Tsumura, Yoshihiko

2014-01-01

Background and Aims Distribution shifts and natural selection during past climatic changes are important factors in determining the genetic structure of forest species. In particular, climatic fluctuations during the Quaternary appear to have caused changes in the distribution ranges of plants, and thus strongly affected their genetic structure. This study was undertaken to identify the responses of the conifer Cryptomeria japonica, endemic to the Japanese Archipelago, to past climatic changes using a combination of phylogeography and species distribution modelling (SDM) methods. Specifically, this study focused on the locations of refugia during the last glacial maximum (LGM). Methods Genetic diversity and structure were examined using 20 microsatellite markers in 37 populations of C. japonica. The locations of glacial refugia were assessed using STRUCTURE analysis, and potential habitats under current and past climate conditions were predicted using SDM. The process of genetic divergence was also examined using the approximate Bayesian computation procedure (ABC) in DIY ABC to test the divergence time between the gene pools detected by the STRUCTURE analysis. Key Results STRUCTURE analysis identified four gene pools: northern Tohoku district; from Chubu to Chugoku district; from Tohoku to Shikoku district on the Pacific Ocean side of the Archipelago; and Yakushima Island. DIY ABC analysis indicated that the four gene pools diverged at the same time before the LGM. SDM also indicated potential northern cryptic refugia. Conclusions The combined evidence from microsatellites and SDM clearly indicates that climatic changes have shaped the genetic structure of C. japonica. The gene pool detected in northern Tohoku district is likely to have been established by cryptic northern refugia on the coast of the Japan Sea to the west of the Archipelago. The gene pool in Yakushima Island can probably be explained simply by long-term isolation from the other gene pools since the LGM. These results are supported by those of SDM and the predicted divergence time determined using ABC analysis. PMID:25355521

Crystal structures of OrfX2 and P47 from a Botulinum neurotoxin OrfX-type gene cluster.

PubMed

Gustafsson, Robert; Berntsson, Ronnie P-A; Martínez-Carranza, Markel; El Tekle, Geniver; Odegrip, Richard; Johnson, Eric A; Stenmark, Pål

2017-11-01

Botulinum neurotoxins are highly toxic substances and are all encoded together with one of two alternative gene clusters, the HA or the OrfX gene cluster. Very little is known about the function and structure of the proteins encoded in the OrfX gene cluster, which in addition to the toxin contains five proteins (OrfX1, OrfX2, OrfX3, P47, and NTNH). We here present the structures of OrfX2 and P47, solved to 2.1 and 1.8 Å, respectively. We show that they belong to the TULIP protein superfamily, which are often involved in lipid binding. OrfX1 and OrfX2 were both found to bind phosphatidylinositol lipids. © 2017 Federation of European Biochemical Societies.
Conservation of mRNA secondary structures may filter out mutations in Escherichia coli evolution

PubMed Central

Chursov, Andrey; Frishman, Dmitrij; Shneider, Alexander

2013-01-01

Recent reports indicate that mutations in viral genomes tend to preserve RNA secondary structure, and those mutations that disrupt secondary structural elements may reduce gene expression levels, thereby serving as a functional knockout. In this article, we explore the conservation of secondary structures of mRNA coding regions, a previously unknown factor in bacterial evolution, by comparing the structural consequences of mutations in essential and nonessential Escherichia coli genes accumulated over 40 000 generations in the course of the ‘long-term evolution experiment’. We monitored the extent to which mutations influence minimum free energy (MFE) values, assuming that a substantial change in MFE is indicative of structural perturbation. Our principal finding is that purifying selection tends to eliminate those mutations in essential genes that lead to greater changes of MFE values and, therefore, may be more disruptive for the corresponding mRNA secondary structures. This effect implies that synonymous mutations disrupting mRNA secondary structures may directly affect the fitness of the organism. These results demonstrate that the need to maintain intact mRNA structures imposes additional evolutionary constraints on bacterial genomes, which go beyond preservation of structure and function of the encoded proteins. PMID:23783573
Human Chromosome 7: DNA Sequence and Biology

PubMed Central

Scherer, Stephen W.; Cheung, Joseph; MacDonald, Jeffrey R.; Osborne, Lucy R.; Nakabayashi, Kazuhiko; Herbrick, Jo-Anne; Carson, Andrew R.; Parker-Katiraee, Layla; Skaug, Jennifer; Khaja, Razi; Zhang, Junjun; Hudek, Alexander K.; Li, Martin; Haddad, May; Duggan, Gavin E.; Fernandez, Bridget A.; Kanematsu, Emiko; Gentles, Simone; Christopoulos, Constantine C.; Choufani, Sanaa; Kwasnicka, Dorota; Zheng, Xiangqun H.; Lai, Zhongwu; Nusskern, Deborah; Zhang, Qing; Gu, Zhiping; Lu, Fu; Zeesman, Susan; Nowaczyk, Malgorzata J.; Teshima, Ikuko; Chitayat, David; Shuman, Cheryl; Weksberg, Rosanna; Zackai, Elaine H.; Grebe, Theresa A.; Cox, Sarah R.; Kirkpatrick, Susan J.; Rahman, Nazneen; Friedman, Jan M.; Heng, Henry H. Q.; Pelicci, Pier Giuseppe; Lo-Coco, Francesco; Belloni, Elena; Shaffer, Lisa G.; Pober, Barbara; Morton, Cynthia C.; Gusella, James F.; Bruns, Gail A. P.; Korf, Bruce R.; Quade, Bradley J.; Ligon, Azra H.; Ferguson, Heather; Higgins, Anne W.; Leach, Natalia T.; Herrick, Steven R.; Lemyre, Emmanuelle; Farra, Chantal G.; Kim, Hyung-Goo; Summers, Anne M.; Gripp, Karen W.; Roberts, Wendy; Szatmari, Peter; Winsor, Elizabeth J. T.; Grzeschik, Karl-Heinz; Teebi, Ahmed; Minassian, Berge A.; Kere, Juha; Armengol, Lluis; Pujana, Miguel Angel; Estivill, Xavier; Wilson, Michael D.; Koop, Ben F.; Tosi, Sabrina; Moore, Gudrun E.; Boright, Andrew P.; Zlotorynski, Eitan; Kerem, Batsheva; Kroisel, Peter M.; Petek, Erwin; Oscier, David G.; Mould, Sarah J.; Döhner, Hartmut; Döhner, Konstanze; Rommens, Johanna M.; Vincent, John B.; Venter, J. Craig; Li, Peter W.; Mural, Richard J.; Adams, Mark D.; Tsui, Lap-Chee

2010-01-01

DNA sequence and annotation of the entire human chromosome 7, encompassing nearly 158 million nucleotides of DNA and 1917 gene structures, are presented. To generate a higher order description, additional structural features such as imprinted genes, fragile sites, and segmental duplications were integrated at the level of the DNA sequence with medical genetic data, including 440 chromosome rearrangement breakpoints associated with disease. This approach enabled the discovery of candidate genes for developmental diseases including autism. PMID:12690205
Population genetic structure in migratory sandhill cranes and the role of Pleistocene glaciations.

PubMed

Jones, Kenneth L; Krapu, Gary L; Brandt, David A; Ashley, Mary V

2005-08-01

Previous studies of migratory sandhill cranes (Grus canadensis) have made significant progress explaining evolution of this group at the species scale, but have been unsuccessful in explaining the geographically partitioned variation in morphology seen on the population scale. The objectives of this study were to assess the population structure and gene flow patterns among migratory sandhill cranes using microsatellite DNA genotypes and mitochondrial DNA haplotypes of a large sample of individuals across three populations. In particular, we were interested in evaluating the roles of Pleistocene glaciation events and postglaciation gene flow in shaping the present-day population structure. Our results indicate substantial gene flow across regions of the Midcontinental population that are geographically adjacent, suggesting that gene flow for most of the region follows an isolation-by-distance model. Male-mediated gene flow and strong female philopatry may explain the differing patterns of nuclear and mitochondrial variation. Taken in context with precise geographical information on breeding locations, the morphologic and microsatellite DNA variation shows a gradation from the Arctic-nesting subspecies G. c. canadensis to the nonArctic subspecies G. c. tabida. Analogous to other Arctic-nesting birds, it is probable that the population structure seen in Midcontinental sandhill cranes reflects the result of postglacial secondary contact. Our data suggest that subspecies of migratory sandhills experience significant gene flow and therefore do not represent distinct and independent genetic entities.
Robust Gaussian Graphical Modeling via l1 Penalization

PubMed Central

Sun, Hokeun; Li, Hongzhe

2012-01-01

Summary Gaussian graphical models have been widely used as an effective method for studying the conditional independency structure among genes and for constructing genetic networks. However, gene expression data typically have heavier tails or more outlying observations than the standard Gaussian distribution. Such outliers in gene expression data can lead to wrong inference on the dependency structure among the genes. We propose a l1 penalized estimation procedure for the sparse Gaussian graphical models that is robustified against possible outliers. The likelihood function is weighted according to how the observation is deviated, where the deviation of the observation is measured based on its own likelihood. An efficient computational algorithm based on the coordinate gradient descent method is developed to obtain the minimizer of the negative penalized robustified-likelihood, where nonzero elements of the concentration matrix represents the graphical links among the genes. After the graphical structure is obtained, we re-estimate the positive definite concentration matrix using an iterative proportional fitting algorithm. Through simulations, we demonstrate that the proposed robust method performs much better than the graphical Lasso for the Gaussian graphical models in terms of both graph structure selection and estimation when outliers are present. We apply the robust estimation procedure to an analysis of yeast gene expression data and show that the resulting graph has better biological interpretation than that obtained from the graphical Lasso. PMID:23020775
Bioinformatics analyses of Shigella CRISPR structure and spacer classification.

PubMed

Wang, Pengfei; Zhang, Bing; Duan, Guangcai; Wang, Yingfang; Hong, Lijuan; Wang, Linlin; Guo, Xiangjiao; Xi, Yuanlin; Yang, Haiyan

2016-03-01

Clustered regularly interspaced short palindromic repeats (CRISPR) are inheritable genetic elements of a variety of archaea and bacteria and indicative of the bacterial ecological adaptation, conferring acquired immunity against invading foreign nucleic acids. Shigella is an important pathogen for anthroponosis. This study aimed to analyze the features of Shigella CRISPR structure and classify the spacers through bioinformatics approach. Among 107 Shigella, 434 CRISPR structure loci were identified with two to seven loci in different strains. CRISPR-Q1, CRISPR-Q4 and CRISPR-Q5 were widely distributed in Shigella strains. Comparison of the first and last repeats of CRISPR1, CRISPR2 and CRISPR3 revealed several base variants and different stem-loop structures. A total of 259 cas genes were found among these 107 Shigella strains. The cas gene deletions were discovered in 88 strains. However, there is one strain that does not contain cas gene. Intact clusters of cas genes were found in 19 strains. From comprehensive analysis of sequence signature and BLAST and CRISPRTarget score, the 708 spacers were classified into three subtypes: Type I, Type II and Type III. Of them, Type I spacer referred to those linked with one gene segment, Type II spacer linked with two or more different gene segments, and Type III spacer undefined. This study examined the diversity of CRISPR/cas system in Shigella strains, demonstrated the main features of CRISPR structure and spacer classification, which provided critical information for elucidation of the mechanisms of spacer formation and exploration of the role the spacers play in the function of the CRISPR/cas system.
Association of BMPR-1B and GDF9 genes polymorphisms and secondary protein structure changes with reproduction traits in Mehraban ewes.

PubMed

Abdoli, R; Zamani, P; Deljou, A; Rezvan, H

2013-07-25

BMPR-1B and GDF9 genes are well known due to their important effects on litter size and mechanisms controlling ovulation rate in sheep. In the present study, polymorphisms of BMPR-1B gene exon 8 and GDF9 gene exon 1 were detected by single strand conformational polymorphism (SSCP) analysis and DNA sequencing methods in 100 Mehraban ewes. The PCR reaction forced to amplify 140 and 380-bp fragments of BMPR-1B and GDF9 genes, respectively. Two single nucleotide polymorphisms (SNPS) were identified in two different SSCP patterns of BMPR-1B gene (CC and CA genotypes) that deduced one amino acid exchange. Also, two SNPS were identified in three different SSCP patterns of GDF9 gene (AA, AG and GG genotypes) that deduced one amino acid exchanges. Two different secondary structures of protein were predicted for BMPR-1B exon 8, but the secondary protein structures predicted for GDF9 exon 1 were similar together. The evaluation of the associations between the SSCP patterns and the protein structure changes with reproduction traits showed that BMPR-1B exon 8 genotypes have significant effects on some of reproduction traits but the GDF9 genotypes did not have any significant effect. The CA genotype of BMPR-1B exon 8 had a significant positive effect on reproduction performance and could be considered as an important and new mutation, affecting the ewes reproduction performance. Marker assisted selection using BMPR-IB gene could be noticed to improve the reproduction traits in Mehraban sheep. Copyright © 2013 Elsevier B.V. All rights reserved.
Insights into soybean transcriptome reconfiguration under hypoxic stress: Functional, regulatory, structural, and compositional characterization.

PubMed

Nakayama, Thiago J; Rodrigues, Fabiana A; Neumaier, Norman; Marcolino-Gomes, Juliana; Molinari, Hugo B C; Santiago, Thaís R; Formighieri, Eduardo F; Basso, Marcos F; Farias, José R B; Emygdio, Beatriz M; de Oliveira, Ana C B; Campos, Ângela D; Borém, Aluízio; Harmon, Frank G; Mertz-Henning, Liliane M; Nepomuceno, Alexandre L

2017-01-01

Soybean (Glycine max) is one of the major crops worldwide and flooding stress affects the production and expansion of cultivated areas. Oxygen is essential for mitochondrial aerobic respiration to supply the energy demand of plant cells. Because oxygen diffusion in water is 10,000 times lower than in air, partial (hypoxic) or total (anoxic) oxygen deficiency is important component of flooding. Even when oxygen is externally available, oxygen deficiency frequently occurs in bulky, dense or metabolically active tissues such as phloem, meristems, seeds, and fruits. In this study, we analyzed conserved and divergent root transcriptional responses between flood-tolerant Embrapa 45 and flood-sensitive BR 4 soybean cultivars under hypoxic stress conditions with RNA-seq. To understand how soybean genes evolve and respond to hypoxia, stable and differentially expressed genes were characterized structurally and compositionally comparing its mechanistic relationship. Between cultivars, Embrapa 45 showed less up- and more down-regulated genes, and stronger induction of phosphoglucomutase (Glyma05g34790), unknown protein related to N-terminal protein myristoylation (Glyma06g03430), protein suppressor of phyA-105 (Glyma06g37080), and fibrillin (Glyma10g32620). RNA-seq and qRT-PCR analysis of non-symbiotic hemoglobin (Glyma11g12980) indicated divergence in gene structure between cultivars. Transcriptional changes for genes in amino acids and derivative metabolic process suggest involvement of amino acids metabolism in tRNA modifications, translation accuracy/efficiency, and endoplasmic reticulum stress in both cultivars under hypoxia. Gene groups differed in promoter TATA box, ABREs (ABA-responsive elements), and CRT/DREs (C-repeat/dehydration-responsive elements) frequency. Gene groups also differed in structure, composition, and codon usage, indicating biological significances. Additional data suggests that cis-acting ABRE elements can mediate gene expression independent of ABA in soybean roots under hypoxia.
Segmental Duplication, Microinversion, and Gene Loss Associated with a Complex Inversion Breakpoint Region in Drosophila

PubMed Central

Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo

2012-01-01

Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714
Evolution of a horizontally acquired legume gene, albumin 1, in the parasitic plant Phelipanche aegyptiaca and related species.

PubMed

Zhang, Yeting; Fernandez-Aparicio, Monica; Wafula, Eric K; Das, Malay; Jiao, Yuannian; Wickett, Norman J; Honaas, Loren A; Ralph, Paula E; Wojciechowski, Martin F; Timko, Michael P; Yoder, John I; Westwood, James H; Depamphilis, Claude W

2013-02-20

Parasitic plants, represented by several thousand species of angiosperms, use modified structures known as haustoria to tap into photosynthetic host plants and extract nutrients and water. As a result of their direct plant-plant connections with their host plant, parasitic plants have special opportunities for horizontal gene transfer, the nonsexual transmission of genetic material across species boundaries. There is increasing evidence that parasitic plants have served as recipients and donors of horizontal gene transfer (HGT), but the long-term impacts of eukaryotic HGT in parasitic plants are largely unknown. Here we show that a gene encoding albumin 1 KNOTTIN-like protein, closely related to the albumin 1 genes only known from papilionoid legumes, where they serve dual roles as food storage and insect toxin, was found in Phelipanche aegyptiaca and related parasitic species of family Orobanchaceae, and was likely acquired by a Phelipanche ancestor via HGT from a legume host based on phylogenetic analyses. The KNOTTINs are well known for their unique "disulfide through disulfide knot" structure and have been extensively studied in various contexts, including drug design. Genomic sequences from nine related parasite species were obtained, and 3D protein structure simulation tests and evolutionary constraint analyses were performed. The parasite gene we identified here retains the intron structure, six highly conserved cysteine residues necessary to form a KNOTTIN protein, and displays levels of purifying selection like those seen in legumes. The albumin 1 xenogene has evolved through >150 speciation events over ca. 16 million years, forming a small family of differentially expressed genes that may confer novel functions in the parasites. Moreover, further data show that a distantly related parasitic plant, Cuscuta, obtained two copies of albumin 1 KNOTTIN-like genes from legumes through a separate HGT event, suggesting that legume KNOTTIN structures have been repeatedly co-opted by parasitic plants. The HGT-derived albumins in Phelipanche represent a novel example of how plants can acquire genes from other plants via HGT that then go on to duplicate, evolve, and retain the specialized features required to perform a unique host-derived function.
Genetics Home Reference: steatocystoma multiplex

MedlinePlus

... Genetic Changes Steatocystoma multiplex can be caused by mutations in the KRT17 gene. This gene provides instructions ... skin, nails, and other tissues. The KRT17 gene mutations that cause steatocystoma multiplex alter the structure of ...
Comparative Reannotation of 21 Aspergillus Genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Salamov, Asaf; Riley, Robert; Kuo, Alan

2013-03-08

We used comparative gene modeling to reannotate 21 Aspergillus genomes. Initial automatic annotation of individual genomes may contain some errors of different nature, e.g. missing genes, incorrect exon-intron structures, 'chimeras', which fuse 2 or more real genes or alternatively splitting some real genes into 2 or more models. The main premise behind the comparative modeling approach is that for closely related genomes most orthologous families have the same conserved gene structure. The algorithm maps all gene models predicted in each individual Aspergillus genome to the other genomes and, for each locus, selects from potentially many competing models, the one whichmore » most closely resembles the orthologous genes from other genomes. This procedure is iterated until no further change in gene models is observed. For Aspergillus genomes we predicted in total 4503 new gene models ( ~;;2percent per genome), supported by comparative analysis, additionally correcting ~;;18percent of old gene models. This resulted in a total of 4065 more genes with annotated PFAM domains (~;;3percent increase per genome). Analysis of a few genomes with EST/transcriptomics data shows that the new annotation sets also have a higher number of EST-supported splice sites at exon-intron boundaries.« less
Large-Scale Collection and Analysis of Full-Length cDNAs from Brachypodium distachyon and Integration with Pooideae Sequence Resources

PubMed Central

Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Takahashi, Fuminori; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

2013-01-01

A comprehensive collection of full-length cDNAs is essential for correct structural gene annotation and functional analyses of genes. We constructed a mixed full-length cDNA library from 21 different tissues of Brachypodium distachyon Bd21, and obtained 78,163 high quality expressed sequence tags (ESTs) from both ends of ca. 40,000 clones (including 16,079 contigs). We updated gene structure annotations of Brachypodium genes based on full-length cDNA sequences in comparison with the latest publicly available annotations. About 10,000 non-redundant gene models were supported by full-length cDNAs; ca. 6,000 showed some transcription unit modifications. We also found ca. 580 novel gene models, including 362 newly identified in Bd21. Using the updated transcription start sites, we searched a total of 580 plant cis-motifs in the −3 kb promoter regions and determined a genome-wide Brachypodium promoter architecture. Furthermore, we integrated the Brachypodium full-length cDNAs and updated gene structures with available sequence resources in wheat and barley in a web-accessible database, the RIKEN Brachypodium FL cDNA database. The database represents a “one-stop” information resource for all genomic information in the Pooideae, facilitating functional analysis of genes in this model grass plant and seamless knowledge transfer to the Triticeae crops. PMID:24130698
Orgyia pseudotsugata baculovirus p10 and polyhedron envelope protein genes: analysis of their relative expression levels and role in polyhedron structure.

PubMed

Gross, C H; Russell, R L; Rohrmann, G F

1994-05-01

To investigate the regulation of p10 and polyhedron envelope protein (PEP) gene expression and their role in polyhedron development, Orgyia pseudotsugata multinucleocapsid nuclear polyhedrosis viruses lacking these genes were constructed. Recombinant viruses were produced, in which the p10 gene, the PEP gene or both genes were disrupted with the beta-glucuronidase (GUS) or beta-galactosidase (lacZ) genes. GUS activity under the control of the PEP protein promoter was observed later in infection and its maximal expression was less than 10% the level for p10 promoter-GUS constructs. Tissues from O. pseudotsugata larvae infected with these recombinants were examined by electron microscopy. Cells from insects infected with the p10- viruses lacked p10-associated fibrillar structures, but fragments of polyhedron envelope-like structures were observed on the surface of some polyhedra. Immunogold labelling of cells infected with the p10-GUS+ virus with an antibody directed against PEP showed that the PEP was concentrated at the surface of polyhedra. Although polyhedra produced by p10 and PEP gene deletion mutants demonstrated what appeared to be a polyhedron envelope by transmission electron microscopy, scanning electron microscopy showed that they had irregular, pitted surfaces that were different from wild-type polyhedra. These data suggested that both p10 and PEP are important for the proper formation of the periphery of polyhedra.
Plastid and mitochondrial genomes of Coccophora langsdorfii (Fucales, Phaeophyceae) and the utility of molecular markers

PubMed Central

Graf, Louis; Kim, Yae Jin; Cho, Ga Youn; Miller, Kathy Ann

2017-01-01

Coccophora langsdorfii (Turner) Greville (Fucales) is an intertidal brown alga that is endemic to Northeast Asia and increasingly endangered by habitat loss and climate change. We sequenced the complete circular plastid and mitochondrial genomes of C. langsdorfii. The circular plastid genome is 124,450 bp and contains 139 protein-coding, 28 tRNA and 6 rRNA genes. The circular mitochondrial genome is 35,660 bp and contains 38 protein-coding, 25 tRNA and 3 rRNA genes. The structure and gene content of the C. langsdorfii plastid genome is similar to those of other species in the Fucales. The plastid genomes of brown algae in other orders share similar gene content but exhibit large structural recombination. The large in-frame insert in the cox2 gene in the mitochondrial genome of C. langsdorfii is typical of other brown algae. We explored the effect of this insertion on the structure and function of the cox2 protein. We estimated the usefulness of 135 plastid genes and 35 mitochondrial genes for developing molecular markers. This study shows that 29 organellar genes will prove efficient for resolving brown algal phylogeny. In addition, we propose a new molecular marker suitable for the study of intraspecific genetic diversity that should be tested in a large survey of populations of C. langsdorfii. PMID:29095864
Myc-induced anchorage of the rDNA IGS region to nucleolar matrix modulates growth-stimulated changes in higher-order rDNA architecture

PubMed Central

Shiue, Chiou-Nan; Nematollahi-Mahani, Amir; Wright, Anthony P.H.

2014-01-01

Chromatin domain organization and the compartmentalized distribution of chromosomal regions are essential for packaging of deoxyribonucleic acid (DNA) in the eukaryotic nucleus as well as regulated gene expression. Nucleoli are the most prominent morphological structures of cell nuclei and nucleolar organization is coupled to cell growth. It has been shown that nuclear scaffold/matrix attachment regions often define the base of looped chromosomal domains in vivo and that they are thereby critical for correct chromosome architecture and gene expression. Here, we show regulated organization of mammalian ribosomal ribonucleic acid genes into distinct chromatin loops by tethering to nucleolar matrix via the non-transcribed inter-genic spacer region of the ribosomal DNA (rDNA). The rDNA gene loop structures are induced specifically upon growth stimulation and are dependent on the activity of the c-Myc protein. Matrix-attached rDNA genes are hypomethylated at the promoter and are thus available for transcriptional activation. rDNA genes silenced by methylation are not recruited to the matrix. c-Myc, which has been shown to induce rDNA transcription directly, is physically associated with rDNA gene looping structures and the intergenic spacer sequence in growing cells. Such a role of Myc proteins in gene activation has not been reported previously. PMID:24609384
Myc-induced anchorage of the rDNA IGS region to nucleolar matrix modulates growth-stimulated changes in higher-order rDNA architecture.

PubMed

Shiue, Chiou-Nan; Nematollahi-Mahani, Amir; Wright, Anthony P H

2014-05-01

Chromatin domain organization and the compartmentalized distribution of chromosomal regions are essential for packaging of deoxyribonucleic acid (DNA) in the eukaryotic nucleus as well as regulated gene expression. Nucleoli are the most prominent morphological structures of cell nuclei and nucleolar organization is coupled to cell growth. It has been shown that nuclear scaffold/matrix attachment regions often define the base of looped chromosomal domains in vivo and that they are thereby critical for correct chromosome architecture and gene expression. Here, we show regulated organization of mammalian ribosomal ribonucleic acid genes into distinct chromatin loops by tethering to nucleolar matrix via the non-transcribed inter-genic spacer region of the ribosomal DNA (rDNA). The rDNA gene loop structures are induced specifically upon growth stimulation and are dependent on the activity of the c-Myc protein. Matrix-attached rDNA genes are hypomethylated at the promoter and are thus available for transcriptional activation. rDNA genes silenced by methylation are not recruited to the matrix. c-Myc, which has been shown to induce rDNA transcription directly, is physically associated with rDNA gene looping structures and the intergenic spacer sequence in growing cells. Such a role of Myc proteins in gene activation has not been reported previously. © 2014 The Author(s). Published by Oxford University Press [on behalf of Nucleic Acids Research].
Structural characteristics of ScBx genes controlling the biosynthesis of hydroxamic acids in rye (Secale cereale L.).

PubMed

Bakera, Beata; Makowska, Bogna; Groszyk, Jolanta; Niziołek, Michał; Orczyk, Wacław; Bolibok-Brągoszewska, Hanna; Hromada-Judycka, Aneta; Rakoczy-Trojanowska, Monika

2015-08-01

Benzoxazinoids (BX) are major secondary metabolites of gramineous plants that play an important role in disease resistance and allelopathy. They also have many other unique properties including anti-bacterial and anti-fungal activity, and the ability to reduce alfa-amylase activity. The biosynthesis and modification of BX are controlled by the genes Bx1 ÷ Bx10, GT and glu, and the majority of these Bx genes have been mapped in maize, wheat and rye. However, the genetic basis of BX biosynthesis remains largely uncharacterized apart from some data from maize and wheat. The aim of this study was to isolate, sequence and characterize five genes (ScBx1, ScBx2, ScBx3, ScBx4 and ScBx5) encoding enzymes involved in the synthesis of DIBOA, an important defense compound of rye. Using a modified 3D procedure of BAC library screening, seven BAC clones containing all of the ScBx genes were isolated and sequenced. Bioinformatic analyses of the resulting contigs were used to examine the structure and other features of these genes, including their promoters, introns and 3'UTRs. Comparative analysis showed that the ScBx genes are similar to those of other Poaceae species, especially to the TaBx genes. The polymorphisms present both in the coding sequences and non-coding regions of ScBx in relation to other Bx genes are predicted to have an impact on the expression, structure and properties of the encoded proteins.
Genome-wide identification and evolutionary analysis of algal LPAT genes involved in TAG biosynthesis using bioinformatic approaches.

PubMed

Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar

2014-12-01

Lysophosphatidyl acyltransferase (LPAT) is one of the major triacylglycerol synthesis enzymes, controlling the metabolic flow of lysophosphatidic acid to phosphatidic acid. Experimental studies in Arabidopsis have shown that LPAT activity is exhibited primarily by three distinct isoforms, namely the plastid-located LPAT1, the endoplasmic reticulum-located LPAT2, and the soluble isoform of LPAT (solLPAT). In this study, 24 putative genes representing all LPAT isoforms were identified from the analysis of 11 complete genomes including green algae, red algae, diatoms and higher plants. We observed LPAT1 and solLPAT genes to be ubiquitously present in nearly all genomes examined, whereas LPAT2 genes to have evolved more recently in the plant lineage. Phylogenetic analysis indicated that LPAT1, LPAT2 and solLPAT have convergently evolved through separate evolutionary paths and belong to three different gene families, which was further evidenced by their wide divergence at gene structure and sequence level. The genome distribution supports the hypothesis that each gene encoding a LPAT is not duplicated. Mapping of exon-intron structure of LPAT genes to the domain structure of proteins across different algal and plant species indicates that exon shuffling plays no role in the evolution of LPAT genes. Besides the previously defined motifs, several conserved consensus sequences were discovered which could be useful to distinguish different LPAT isoforms. Taken together, this study will enable the generation of experimental approximations to better understand the functional role of algal LPAT in lipid accumulation.
Identification of miRNA-Mediated Core Gene Module for Glioma Patient Prediction by Integrating High-Throughput miRNA, mRNA Expression and Pathway Structure

PubMed Central

Han, Junwei; Shang, Desi; Zhang, Yunpeng; Zhang, Wei; Yao, Qianlan; Han, Lei; Xu, Yanjun; Yan, Wei; Bao, Zhaoshi; You, Gan; Jiang, Tao; Kang, Chunsheng; Li, Xia

2014-01-01

The prognosis of glioma patients is usually poor, especially in patients with glioblastoma (World Health Organization (WHO) grade IV). The regulatory functions of microRNA (miRNA) on genes have important implications in glioma cell survival. However, there are not many studies that have investigated glioma survival by integrating miRNAs and genes while also considering pathway structure. In this study, we performed sample-matched miRNA and mRNA expression profilings to systematically analyze glioma patient survival. During this analytical process, we developed pathway-based random walk to identify a glioma core miRNA-gene module, simultaneously considering pathway structure information and multi-level involvement of miRNAs and genes. The core miRNA-gene module we identified was comprised of four apparent sub-modules; all four sub-modules displayed a significant correlation with patient survival in the testing set (P-values≤0.001). Notably, one sub-module that consisted of 6 miRNAs and 26 genes also correlated with survival time in the high-grade subgroup (WHO grade III and IV), P-value = 0.0062. Furthermore, the 26-gene expression signature from this sub-module had robust predictive power in four independent, publicly available glioma datasets. Our findings suggested that the expression signatures, which were identified by integration of miRNA and gene level, were closely associated with overall survival among the glioma patients with various grades. PMID:24809850

Sulfamethoxazole and COD increase abundance of sulfonamide resistance genes and change bacterial community structures within sequencing batch reactors.

PubMed

Guo, Xueping; Pang, Weihai; Dou, Chunling; Yin, Daqiang

2017-05-01

The abundant microbial community in biological treatment processes in wastewater treatment plants (WWTPs) may potentially enhance the horizontal gene transfer of antibiotic resistance genes with the presence of antibiotics. A lab-scale sequencing batch reactor was designed to investigate response of sulfonamide resistance genes (sulI, sulII) and bacterial communities to various concentrations of sulfamethoxazole (SMX) and chemical oxygen demand (COD) of wastewater. The SMX concentrations (0.001 mg/L, 0.1 mg/L and 10 mg/L) decreased with treatment time and higher SMX level was more difficult to remove. The presence of SMX also significantly reduced the removal efficiency of ammonia nitrogen, affecting the normal function of WWTPs. All three concentrations of SMX raised both sulI and sulII genes with higher concentrations exhibiting greater increases. The abundance of sul genes was positive correlated with treatment time and followed the second-order reaction kinetic model. Interestingly, these two genes have rather similar activity. SulI and sulII gene abundance also performed similar response to COD. Simpson index and Shannon-Weiner index did not show changes in the microbial community diversity. However, the 16S rRNA gene cloning and sequencing results showed the bacterial community structures varied during different stages. The results demonstrated that influent antibiotics into WWTPs may facilitate selection of ARGs and affect the wastewater conventional treatment as well as the bacteria community structures. Copyright © 2017 Elsevier Ltd. All rights reserved.
Structural Diversification of Lyngbyatoxin A by Host-Dependent Heterologous Expression of the tleABC Biosynthetic Gene Cluster.

PubMed

Zhang, Lihan; Hoshino, Shotaro; Awakawa, Takayoshi; Wakimoto, Toshiyuki; Abe, Ikuro

2016-08-03

Natural products have enormous structural diversity, yet little is known about how such diversity is achieved in nature. Here we report the structural diversification of a cyanotoxin-lyngbyatoxin A-and its biosynthetic intermediates by heterologous expression of the Streptomyces-derived tleABC biosynthetic gene cluster in three different Streptomyces hosts: S. lividans, S. albus, and S. avermitilis. Notably, the isolated lyngbyatoxin derivatives, including four new natural products, were biosynthesized by crosstalk between the heterologous tleABC gene cluster and the endogenous host enzymes. The simple strategy described here has expanded the structural diversity of lyngbyatoxin A and its biosynthetic intermediates, and provides opportunities for investigation of the currently underestimated hidden biosynthetic crosstalk. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Ablation of Mrds1/Ofcc1 Induces Hyper-γ-Glutamyl Transpeptidasemia without Abnormal Head Development and Schizophrenia-Relevant Behaviors in Mice

PubMed Central

Ohnishi, Tetsuo; Yamada, Kazuo; Watanabe, Akiko; Ohba, Hisako; Sakaguchi, Toru; Honma, Yota; Iwayama, Yoshimi; Toyota, Tomoko; Maekawa, Motoko; Watanabe, Kazutada; Detera-Wadleigh, Sevilla D.; Wakana, Shigeharu; Yoshikawa, Takeo

2011-01-01

Mutations in the Opo gene result in eye malformation in medaka fish. The human ortholog of this gene, MRDS1/OFCC1, is a potentially causal gene for orofacial cleft, as well as a susceptibility gene for schizophrenia, a devastating mental illness. Based on this evidence, we hypothesized that this gene could perform crucial functions in the development of head and brain structures in vertebrates. To test this hypothesis, we created Mrds1/Ofcc1-null mice. Mice were examined thoroughly using an abnormality screening system referred to as “the Japan Mouse Clinic”. No malformations of the head structure, eye or other parts of the body were apparent in these knockout mice. However, the mutant mice showed a marked increase in serum γ-glutamyl transpeptidase (GGT), a marker for liver damage, but no abnormalities in other liver-related measurements. We also performed a family-based association study on the gene in schizophrenia samples of Japanese origin. We found five single nucleotide polymorphisms (SNPs) located across the gene that showed significant transmission distortion, supporting a prior report of association in a Caucasian cohort. However, the knockout mice showed no behavioral phenotypes relevant to schizophrenia. In conclusion, disruption of the Mrds1/Ofcc1 gene elicits asymptomatic hyper-γ-glutamyl-transpeptidasemia in mice. However, there were no phenotypes to support a role for the gene in the development of eye and craniofacial structures in vertebrates. These results prompt further examination of the gene, including its putative contribution to hyper-γ-glutamyl transpeptidasemia and schizophrenia. PMID:22242126
Analysis of flavonoids and the flavonoid structural genes in brown fiber of upland cotton.

PubMed

Feng, Hongjie; Tian, Xinhui; Liu, Yongchang; Li, Yanjun; Zhang, Xinyu; Jones, Brian Joseph; Sun, Yuqiang; Sun, Jie

2013-01-01

As a result of changing consumer preferences, cotton (Gossypium Hirsutum L.) from varieties with naturally colored fibers is becoming increasingly sought after in the textile industry. The molecular mechanisms leading to colored fiber development are still largely unknown, although it is expected that the color is derived from flavanoids. Firstly, four key genes of the flavonoid biosynthetic pathway in cotton (GhC4H, GhCHS, GhF3'H, and GhF3'5'H) were cloned and studied their expression profiles during the development of brown- and white cotton fibers by QRT-PCR. And then, the concentrations of four components of the flavonoid biosynthetic pathway, naringenin, quercetin, kaempferol and myricetin in brown- and white fibers were analyzed at different developmental stages by HPLC. The predicted proteins of the four flavonoid structural genes corresponding to these genes exhibit strong sequence similarity to their counterparts in various plant species. Transcript levels for all four genes were considerably higher in developing brown fibers than in white fibers from a near isogenic line (NIL). The contents of four flavonoids (naringenin, quercetin, kaempferol and myricetin) were significantly higher in brown than in white fibers and corresponding to the biosynthetic gene expression levels. Flavonoid structural gene expression and flavonoid metabolism are important in the development of pigmentation in brown cotton fibers.
Enhancing biological relevance of a weighted gene co-expression network for functional module identification.

PubMed

Prom-On, Santitham; Chanthaphan, Atthawut; Chan, Jonathan Hoyin; Meechai, Asawin

2011-02-01

Relationships among gene expression levels may be associated with the mechanisms of the disease. While identifying a direct association such as a difference in expression levels between case and control groups links genes to disease mechanisms, uncovering an indirect association in the form of a network structure may help reveal the underlying functional module associated with the disease under scrutiny. This paper presents a method to improve the biological relevance in functional module identification from the gene expression microarray data by enhancing the structure of a weighted gene co-expression network using minimum spanning tree. The enhanced network, which is called a backbone network, contains only the essential structural information to represent the gene co-expression network. The entire backbone network is decoupled into a number of coherent sub-networks, and then the functional modules are reconstructed from these sub-networks to ensure minimum redundancy. The method was tested with a simulated gene expression dataset and case-control expression datasets of autism spectrum disorder and colorectal cancer studies. The results indicate that the proposed method can accurately identify clusters in the simulated dataset, and the functional modules of the backbone network are more biologically relevant than those obtained from the original approach.
Genomic structure of rat 3alpha-hydroxysteroid/dihydrodiol dehydrogenase (3alpha-HSD/DD, AKR1C9).

PubMed

Lin, H K; Hung, C F; Moore, M; Penning, T M

1999-11-01

Rat liver 3alpha-hydroxysteroid/dihydrodiol dehydrogenase (3alpha-HSD/DD) is a member of the aldo-keto reductase (AKR) superfamily. It is involved in the inactivation of steroid hormones and the metabolic activation of polycyclic aromatic hydrocarbons (PAH) by converting trans-dihydrodiols into reactive and redox-active o-quinones. The structure of the 5'-flanking region of the gene and factors involved in the constitutive and regulated expression of this gene have been reported [H.-K. Lin, T.M. Penning, Cloning, sequencing, and functional analysis of the 5'-flanking region of the rat 3alpha-hydroxysteroid/dihydrodiol dehydrogenase gene, Cancer Res. 55 (1995) 4105-4113]. We now describe the complete genomic structure of the rat type 1 3alpha-HSD/DD gene. Charon 4A and P1 genomic clones contained at least three rat genes (type 1, type 2 and type 3 3alpha-HSD/DD) each of which encoded for the same open reading frame (ORF) but differed in their exon-intron organization. 5'-RACE confirmed that the type 1 3alpha-HSD/DD gene encodes for the dominant transcript in rat liver and it was the regulation of this gene that was previously studied. The rat type 1 3alpha-HSD/DD gene is 30 kb in length and consists of nine exons and eight introns. Exon 9 encodes +931 to 966 bp of the ORF and the 1292 bp 3'-UTR implicated in mRNA stability. This genomic structure is nearly identical to the homologous human genes, type 1 3alpha-HSD (chlordecone reductase/DD4, AKR1C4), type 2 3alpha-HSD (AKR1C3) and type 3 3alpha-HSD (bile-acid binding protein, AKR1C2) genes. Three different cDNA's containing identical ORFs for 3alpha-HSD have been reported suggesting that all three genes may be expressed in rat liver. Using 5' primers corresponding to the 5'-UTR's of the three different cDNA's only one PCR fragment was obtained and corresponded to the type 1 3alpha-HSD/DD gene. These data suggested that the type 2 and type 3 3alpha-HSD/DD genes are not abundantly expressed in rat liver. It is unknown whether the type 2 and type 3 3alpha-HSD/DD genes represent pseudo-genes or whether they represent genes that are differentially expressed in other rat tissues.
[Hsp70 Genes of the Megaphragma amalphitanum (Hymenoptera: Trichogrammatidae) Parasitic Wasp].

PubMed

Chuvakova, L N; Sharko, F S; Nedoluzhko, A V; Polilov, A A; Prokhorchuk, E B; Skryabin, K G; Evgen'ev, M B

2017-01-01

Miniaturization is an evolutionary process that is widely represented in both invertebrates and vertebrates. Miniaturization frequently affects not only the size of the organism and its constituent cells, but also changes the genome structure and functioning. The structure of the main heat shock genes (hsp70 and hsp83) was studied in one of the smallest insects, the Megaphragma amalphitanum (Hymenoptera: Trichogrammatidae) parasitic wasp, which is comparable in size with unicellular organisms. An analysis of the sequenced genome has detected six genes that relate to the hsp70 family, some of which are apparently induced upon heat shock. Both induced and constitutively expressed hsp70 genes contain a large number of introns, which is not typical for the genes of this family. Moreover, none of the found genes form clusters, and they are all very heterogeneous (individual copies are only 75-85% identical), which indicates the absence of gene conversion, which provides the identity of genes of this family in Drosophila and other organisms. Two hsp83 genes, one of which contains an intron, have also been found in the M. amalphitanum genome.
High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE

USDA-ARS?s Scientific Manuscript database

We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (“Assessing Changes to Exons”) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detect...
DSSTox chemical-index files for exposure-related experiments in ArrayExpress and Gene Expression Omnibus: enabling toxico-chemogenomics data linkages

EPA Science Inventory

The Distributed Structure-Searchable Toxicity (DSSTox) ARYEXP and GEOGSE files are newly published, structure-annotated files of the chemical-associated and chemical exposure-related summary experimental content contained in the ArrayExpress Repository and Gene Expression Omnibus...
Chromosome structures: reduction of certain problems with unequal gene content and gene paralogs to integer linear programming.

PubMed

Lyubetsky, Vassily; Gershgorin, Roman; Gorbunov, Konstantin

2017-12-06

Chromosome structure is a very limited model of the genome including the information about its chromosomes such as their linear or circular organization, the order of genes on them, and the DNA strand encoding a gene. Gene lengths, nucleotide composition, and intergenic regions are ignored. Although highly incomplete, such structure can be used in many cases, e.g., to reconstruct phylogeny and evolutionary events, to identify gene synteny, regulatory elements and promoters (considering highly conserved elements), etc. Three problems are considered; all assume unequal gene content and the presence of gene paralogs. The distance problem is to determine the minimum number of operations required to transform one chromosome structure into another and the corresponding transformation itself including the identification of paralogs in two structures. We use the DCJ model which is one of the most studied combinatorial rearrangement models. Double-, sesqui-, and single-operations as well as deletion and insertion of a chromosome region are considered in the model; the single ones comprise cut and join. In the reconstruction problem, a phylogenetic tree with chromosome structures in the leaves is given. It is necessary to assign the structures to inner nodes of the tree to minimize the sum of distances between terminal structures of each edge and to identify the mutual paralogs in a fairly large set of structures. A linear algorithm is known for the distance problem without paralogs, while the presence of paralogs makes it NP-hard. If paralogs are allowed but the insertion and deletion operations are missing (and special constraints are imposed), the reduction of the distance problem to integer linear programming is known. Apparently, the reconstruction problem is NP-hard even in the absence of paralogs. The problem of contigs is to find the optimal arrangements for each given set of contigs, which also includes the mutual identification of paralogs. We proved that these problems can be reduced to integer linear programming formulations, which allows an algorithm to redefine the problems to implement a very special case of the integer linear programming tool. The results were tested on synthetic and biological samples. Three well-known problems were reduced to a very special case of integer linear programming, which is a new method of their solutions. Integer linear programming is clearly among the main computational methods and, as generally accepted, is fast on average; in particular, computation systems specifically targeted at it are available. The challenges are to reduce the size of the corresponding integer linear programming formulations and to incorporate a more detailed biological concept in our model of the reconstruction.
Genome-Wide Identification of the Alba Gene Family in Plants and Stress-Responsive Expression of the Rice Alba Genes.

PubMed

Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan

2018-03-28

Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa , Zea mays , Sorghum bicolor , Cicer arietinum , and Vitis vinifera , and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii , Physcomitrella patens , and Amborella trichopoda , revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice ( OsAlba ), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
Products of three accessory genes, pilB, pilC, and pilD, are required for biogenesis of Pseudomonas aeruginosa pili.

PubMed Central

Nunn, D; Bergman, S; Lory, S

1990-01-01

The polar pili of Pseudomonas aeruginosa are composed of monomers of the pilin structural subunits. The biogenesis of pili involves the synthesis of pilin precursor, cleavage of a six-amino-acid leader peptide, membrane translocation, and assembly of monomers into a filamentous structure extending from the bacterial surface. This report describes three novel genes necessary for the formation of pili. DNA sequences adjacent to pilA, the pilin structural gene, were cloned and mutagenized with transposon Tn5. Each of the insertions were introduced into the chromosome of P. aeruginosa PAK by gene replacement. The effect of the Tn5 insertions in the bacterial chromosome on pilus assembly was assessed by electron microscopy and sensitivity of mutants to a pilus-specific bacteriophage. The resultant mutants were also tested for synthesis and membrane localization of the pilin antigen in order to define the genes required for maturation, export, and assembly of pilin. A 4.0-kilobase-pair region of DNA adjacent to the pilin structural gene was found to be essential for formation of pili. This region was sequenced and found to contain three open reading frames coding for 62-, 38- to 45-, and 28- to 32-kilodalton proteins (pilB, pilC, and pilD, respectively). Three proteins of similar molecular weight were expressed in Escherichia coli from the 4.0-kilobase-pair fragment flanking pilA with use of a T7 promoter-polymerase expression system. The results of the analyses of the three genes and the implications for pilin assembly and maturation are discussed. Images PMID:1971619
New Implications on Genomic Adaptation Derived from the Helicobacter pylori Genome Comparison

PubMed Central

Lara-Ramírez, Edgar Eduardo; Segura-Cabrera, Aldo; Guo, Xianwu; Yu, Gongxin; García-Pérez, Carlos Armando; Rodríguez-Pérez, Mario A.

2011-01-01

Background Helicobacter pylori has a reduced genome and lives in a tough environment for long-term persistence. It evolved with its particular characteristics for biological adaptation. Because several H. pylori genome sequences are available, comparative analysis could help to better understand genomic adaptation of this particular bacterium. Principal Findings We analyzed nine H. pylori genomes with emphasis on microevolution from a different perspective. Inversion was an important factor to shape the genome structure. Illegitimate recombination not only led to genomic inversion but also inverted fragment duplication, both of which contributed to the creation of new genes and gene family, and further, homological recombination contributed to events of inversion. Based on the information of genomic rearrangement, the first genome scaffold structure of H. pylori last common ancestor was produced. The core genome consists of 1186 genes, of which 22 genes could particularly adapt to human stomach niche. H. pylori contains high proportion of pseudogenes whose genesis was principally caused by homopolynucleotide (HPN) mutations. Such mutations are reversible and facilitate the control of gene expression through the change of DNA structure. The reversible mutations and a quasi-panmictic feature could allow such genes or gene fragments frequently transferred within or between populations. Hence, pseudogenes could be a reservoir of adaptation materials and the HPN mutations could be favorable to H. pylori adaptation, leading to HPN accumulation on the genomes, which corresponds to a special feature of Helicobacter species: extremely high HPN composition of genome. Conclusion Our research demonstrated that both genome content and structure of H. pylori have been highly adapted to its particular life style. PMID:21387011
Sieve element occlusion (SEO) genes encode structural phloem proteins involved in wound sealing of the phloem.

PubMed

Ernst, Antonia M; Jekat, Stephan B; Zielonka, Sascia; Müller, Boje; Neumann, Ulla; Rüping, Boris; Twyman, Richard M; Krzyzanek, Vladislav; Prüfer, Dirk; Noll, Gundula A

2012-07-10

The sieve element occlusion (SEO) gene family originally was delimited to genes encoding structural components of forisomes, which are specialized crystalloid phloem proteins found solely in the Fabaceae. More recently, SEO genes discovered in various non-Fabaceae plants were proposed to encode the common phloem proteins (P-proteins) that plug sieve plates after wounding. We carried out a comprehensive characterization of two tobacco (Nicotiana tabacum) SEO genes (NtSEO). Reporter genes controlled by the NtSEO promoters were expressed specifically in immature sieve elements, and GFP-SEO fusion proteins formed parietal agglomerates in intact sieve elements as well as sieve plate plugs after wounding. NtSEO proteins with and without fluorescent protein tags formed agglomerates similar in structure to native P-protein bodies when transiently coexpressed in Nicotiana benthamiana, and the analysis of these protein complexes by electron microscopy revealed ultrastructural features resembling those of native P-proteins. NtSEO-RNA interference lines were essentially devoid of P-protein structures and lost photoassimilates more rapidly after injury than control plants, thus confirming the role of P-proteins in sieve tube sealing. We therefore provide direct evidence that SEO genes in tobacco encode P-protein subunits that affect translocation. We also found that peptides recently identified in fascicular phloem P-protein plugs from squash (Cucurbita maxima) represent cucurbit members of the SEO family. Our results therefore suggest a common evolutionary origin for P-proteins found in the sieve elements of all dicotyledonous plants and demonstrate the exceptional status of extrafascicular P-proteins in cucurbits.
Cyclen-based double-tailed lipids for DNA delivery: Synthesis and the effect of linking group structures.

PubMed

Zhang, Yi-Mei; Chang, De-Chun; Zhang, Ji; Liu, Yan-Hong; Yu, Xiao-Qi

2015-09-01

The gene transfection efficiency (TE) of cationic lipids is largely influenced by the lipid structure. Six novel 1, 4, 7, 10-tetraazacyclododecane (cyclen)-based cationic lipids L1-L6, which contain double oleyl as hydrophobic tails, were designed and synthesized. The difference between these lipids is their diverse backbone. Liposomes prepared by the lipids and DOPE showed good DNA affinity, and full DNA condensation could be achieved at N/P of 4 to form lipoplexes with proper size and zeta-potentials for gene transfection. Structure-activity relationship of these lipids as non-viral gene delivery vectors was investigated. It was found that minor backbone structural variations, including linking group and the structural symmetry would affect the TE. The diethylenetriamine derived lipid L4 containing amide linking bonds gave the best TE, which was several times higher than commercially available transfection reagent lipofectamine 2000. Besides, these lipids exhibited low cytotoxicity, suggesting their good biocompatibility. Results reveal that such type of cationic lipids might be promising non-viral gene vectors, and also afford us clues for the design of novel vectors with higher TE and biocompatibility. Copyright © 2015 Elsevier Ltd. All rights reserved.
Mitochondrial genome of the African lion Panthera leo leo.

PubMed

Ma, Yue-ping; Wang, Shuo

2015-01-01

In this study, the complete mitochondrial genome sequence of the African lion P. leo leo was reported. The total length of the mitogenome was 17,054 bp. It contained the typical mitochondrial structure, including 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region; 21 of the tRNA genes folded into typical cloverleaf secondary structure except for tRNASe. The overall composition of the mitogenome was A (32.0%), G (14.5%), C (26.5%) and T (27.0%). The new sequence will provide molecular genetic information for conservation genetics study of this important large carnivore.
Muscle Research and Gene Ontology: New standards for improved data integration

PubMed Central

Feltrin, Erika; Campanaro, Stefano; Diehl, Alexander D; Ehler, Elisabeth; Faulkner, Georgine; Fordham, Jennifer; Gardin, Chiara; Harris, Midori; Hill, David; Knoell, Ralph; Laveder, Paolo; Mittempergher, Lorenza; Nori, Alessandra; Reggiani, Carlo; Sorrentino, Vincenzo; Volpe, Pompeo; Zara, Ivano; Valle, Giorgio; Deegan née Clark, Jennifer

2009-01-01

Background The Gene Ontology Project provides structured controlled vocabularies for molecular biology that can be used for the functional annotation of genes and gene products. In a collaboration between the Gene Ontology (GO) Consortium and the muscle biology community, we have made large-scale additions to the GO biological process and cellular component ontologies. The main focus of this ontology development work concerns skeletal muscle, with specific consideration given to the processes of muscle contraction, plasticity, development, and regeneration, and to the sarcomere and membrane-delimited compartments. Our aims were to update the existing structure to reflect current knowledge, and to resolve, in an accommodating manner, the ambiguity in the language used by the community. Results The updated muscle terminologies have been incorporated into the GO. There are now 159 new terms covering critical research areas, and 57 existing terms have been improved and reorganized to follow their usage in muscle literature. Conclusion The revised GO structure should improve the interpretation of data from high-throughput (e.g. microarray and proteomic) experiments in the area of muscle science and muscle disease. We actively encourage community feedback on, and gene product annotation with these new terms. Please visit the Muscle Community Annotation Wiki . PMID:19178689
COGNATE: comparative gene annotation characterizer.

PubMed

Wilbrandt, Jeanne; Misof, Bernhard; Niehuis, Oliver

2017-07-17

The comparison of gene and genome structures across species has the potential to reveal major trends of genome evolution. However, such a comparative approach is currently hampered by a lack of standardization (e.g., Elliott TA, Gregory TR, Philos Trans Royal Soc B: Biol Sci 370:20140331, 2015). For example, testing the hypothesis that the total amount of coding sequences is a reliable measure of potential proteome diversity (Wang M, Kurland CG, Caetano-Anollés G, PNAS 108:11954, 2011) requires the application of standardized definitions of coding sequence and genes to create both comparable and comprehensive data sets and corresponding summary statistics. However, such standard definitions either do not exist or are not consistently applied. These circumstances call for a standard at the descriptive level using a minimum of parameters as well as an undeviating use of standardized terms, and for software that infers the required data under these strict definitions. The acquisition of a comprehensive, descriptive, and standardized set of parameters and summary statistics for genome publications and further analyses can thus greatly benefit from the availability of an easy to use standard tool. We developed a new open-source command-line tool, COGNATE (Comparative Gene Annotation Characterizer), which uses a given genome assembly and its annotation of protein-coding genes for a detailed description of the respective gene and genome structure parameters. Additionally, we revised the standard definitions of gene and genome structures and provide the definitions used by COGNATE as a working draft suggestion for further reference. Complete parameter lists and summary statistics are inferred using this set of definitions to allow down-stream analyses and to provide an overview of the genome and gene repertoire characteristics. COGNATE is written in Perl and freely available at the ZFMK homepage ( https://www.zfmk.de/en/COGNATE ) and on github ( https://github.com/ZFMK/COGNATE ). The tool COGNATE allows comparing genome assemblies and structural elements on multiples levels (e.g., scaffold or contig sequence, gene). It clearly enhances comparability between analyses. Thus, COGNATE can provide the important standardization of both genome and gene structure parameter disclosure as well as data acquisition for future comparative analyses. With the establishment of comprehensive descriptive standards and the extensive availability of genomes, an encompassing database will become possible.
Systematic analysis of human kinase genes: a large number of genes and alternative splicing events result in functional and structural diversity

PubMed Central

Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni

2005-01-01

Background Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape, apoptosys. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinase only after computational studies. Results A systematic in silico study performed on the human genome, led to the identification of 5 genes, on chromosome 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases from NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinase may code for alternately spliced forms or be incorrectly annotated. An InterProScan automated analysis was perfomed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices, and the cystein propensity to participate into a disulfide bridge. Conclusion The predicted human kinome was extended by identifiying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level. Structural analysis of kinase proteins domains as defined in multiple sources together with transmembrane alpha helices and signal peptide prediction provides hints to function assignment. The results of the human kinome analysis are collected in the KinWeb database, available for browsing and searching over the internet, where all results from the comparative analysis and the gene structure annotation are made available, alongside the domain information. Kinases may be searched by domain combinations and the relative genes may be viewed in a graphic browser at various level of magnification up to gene organization on the full chromosome set. PMID:16351747
Revolting Developments in Our Understanding of the Organization of the Eukaryotic Genome.

ERIC Educational Resources Information Center

Krider, Hallie M.

1984-01-01

Various typs of DNA are discussed. Areas considered include highly repetitive and satellite sequences, genes encoding, ribosomal RNA, histone protein genes, and dispersed repeated genes that jump. Regulated genetic misbehavior, structure and use of unique genes, and higher order complexities of chromosomes are also discussed. (JN)

Structure, Function, Interaction, Co-evolution of Rice Blast Resistance Genes

USDA-ARS?s Scientific Manuscript database

Rice blast disease caused by the fungal pathogen Magnaporthe oryzae is one of the most destructive rice diseases worldwide. Resistance (R) genes to blast encode proteins that detect pathogen signaling molecules encoded by M. oryzae avirulence (AVR) genes. R genes can be a single or a member of clu...
Analysis of co-evolving genes in campylobacter jejuni and C. coli

USDA-ARS?s Scientific Manuscript database

Background: The population structure of Campylobacter has been frequently studied by MLST, for which fragments of housekeeping genes are compared. We wished to determine if the used MLST genes are representative of the complete genome. Methods: A set of 1029 core gene families (CGF) was identifie...
TINAGL1 and B3GALNT1 are potential therapy target genes to suppress metastasis in non-small cell lung cancer

PubMed Central

2014-01-01

Background Non-small cell lung cancer (NSCLC) remains lethal despite the development of numerous drug therapy technologies. About 85% to 90% of lung cancers are NSCLC and the 5-year survival rate is at best still below 50%. Thus, it is important to find drugable target genes for NSCLC to develop an effective therapy for NSCLC. Results Integrated analysis of publically available gene expression and promoter methylation patterns of two highly aggressive NSCLC cell lines generated by in vivo selection was performed. We selected eleven critical genes that may mediate metastasis using recently proposed principal component analysis based unsupervised feature extraction. The eleven selected genes were significantly related to cancer diagnosis. The tertiary protein structure of the selected genes was inferred by Full Automatic Modeling System, a profile-based protein structure inference software, to determine protein functions and to specify genes that could be potential drug targets. Conclusions We identified eleven potentially critical genes that may mediate NSCLC metastasis using bioinformatic analysis of publically available data sets. These genes are potential target genes for the therapy of NSCLC. Among the eleven genes, TINAGL1 and B3GALNT1 are possible candidates for drug compounds that inhibit their gene expression. PMID:25521548
A curated catalog of canine and equine keratin genes

PubMed Central

Pujar, Shashikant; McGarvey, Kelly M.; Welle, Monika; Galichet, Arnaud; Müller, Eliane J.; Pruitt, Kim D.; Leeb, Tosso

2017-01-01

Keratins represent a large protein family with essential structural and functional roles in epithelial cells of skin, hair follicles, and other organs. During evolution the genes encoding keratins have undergone multiple rounds of duplication and humans have two clusters with a total of 55 functional keratin genes in their genomes. Due to the high similarity between different keratin paralogs and species-specific differences in gene content, the currently available keratin gene annotation in species with draft genome assemblies such as dog and horse is still imperfect. We compared the National Center for Biotechnology Information (NCBI) (dog annotation release 103, horse annotation release 101) and Ensembl (release 87) gene predictions for the canine and equine keratin gene clusters to RNA-seq data that were generated from adult skin of five dogs and two horses and from adult hair follicle tissue of one dog. Taking into consideration the knowledge on the conserved exon/intron structure of keratin genes, we annotated 61 putatively functional keratin genes in both the dog and horse, respectively. Subsequently, curators in the RefSeq group at NCBI reviewed their annotation of keratin genes in the dog and horse genomes (Annotation Release 104 and Annotation Release 102, respectively) and updated annotation and gene nomenclature of several keratin genes. The updates are now available in the NCBI Gene database (https://www.ncbi.nlm.nih.gov/gene). PMID:28846680
GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts

PubMed Central

Aubourg, Sébastien; Brunaud, Véronique; Bruyère, Clémence; Cock, Mark; Cooke, Richard; Cottet, Annick; Couloux, Arnaud; Déhais, Patrice; Deléage, Gilbert; Duclert, Aymeric; Echeverria, Manuel; Eschbach, Aimée; Falconet, Denis; Filippi, Ghislain; Gaspin, Christine; Geourjon, Christophe; Grienenberger, Jean-Michel; Houlné, Guy; Jamet, Elisabeth; Lechauve, Frédéric; Leleu, Olivier; Leroy, Philippe; Mache, Régis; Meyer, Christian; Nedjari, Hafed; Negrutiu, Ioan; Orsini, Valérie; Peyretaillade, Eric; Pommier, Cyril; Raes, Jeroen; Risler, Jean-Loup; Rivière, Stéphane; Rombauts, Stéphane; Rouzé, Pierre; Schneider, Michel; Schwob, Philippe; Small, Ian; Soumayet-Kampetenga, Ghislain; Stankovski, Darko; Toffano, Claire; Tognolli, Michael; Caboche, Michel; Lecharny, Alain

2005-01-01

Genomic projects heavily depend on genome annotations and are limited by the current deficiencies in the published predictions of gene structure and function. It follows that, improved annotation will allow better data mining of genomes, and more secure planning and design of experiments. The purpose of the GeneFarm project is to obtain homogeneous, reliable, documented and traceable annotations for Arabidopsis nuclear genes and gene products, and to enter them into an added-value database. This re-annotation project is being performed exhaustively on every member of each gene family. Performing a family-wide annotation makes the task easier and more efficient than a gene-by-gene approach since many features obtained for one gene can be extrapolated to some or all the other genes of a family. A complete annotation procedure based on the most efficient prediction tools available is being used by 16 partner laboratories, each contributing annotated families from its field of expertise. A database, named GeneFarm, and an associated user-friendly interface to query the annotations have been developed. More than 3000 genes distributed over 300 families have been annotated and are available at http://genoplante-info.infobiogen.fr/Genefarm/. Furthermore, collaboration with the Swiss Institute of Bioinformatics is underway to integrate the GeneFarm data into the protein knowledgebase Swiss-Prot. PMID:15608279
Efficient Reverse-Engineering of a Developmental Gene Regulatory Network

PubMed Central

Cicin-Sain, Damjan; Ashyraliyev, Maksat; Jaeger, Johannes

2012-01-01

Understanding the complex regulatory networks underlying development and evolution of multi-cellular organisms is a major problem in biology. Computational models can be used as tools to extract the regulatory structure and dynamics of such networks from gene expression data. This approach is called reverse engineering. It has been successfully applied to many gene networks in various biological systems. However, to reconstitute the structure and non-linear dynamics of a developmental gene network in its spatial context remains a considerable challenge. Here, we address this challenge using a case study: the gap gene network involved in segment determination during early development of Drosophila melanogaster. A major problem for reverse-engineering pattern-forming networks is the significant amount of time and effort required to acquire and quantify spatial gene expression data. We have developed a simplified data processing pipeline that considerably increases the throughput of the method, but results in data of reduced accuracy compared to those previously used for gap gene network inference. We demonstrate that we can infer the correct network structure using our reduced data set, and investigate minimal data requirements for successful reverse engineering. Our results show that timing and position of expression domain boundaries are the crucial features for determining regulatory network structure from data, while it is less important to precisely measure expression levels. Based on this, we define minimal data requirements for gap gene network inference. Our results demonstrate the feasibility of reverse-engineering with much reduced experimental effort. This enables more widespread use of the method in different developmental contexts and organisms. Such systematic application of data-driven models to real-world networks has enormous potential. Only the quantitative investigation of a large number of developmental gene regulatory networks will allow us to discover whether there are rules or regularities governing development and evolution of complex multi-cellular organisms. PMID:22807664
Genetic Structure of Avian Influenza Viruses from Ducks of the Atlantic Flyway of North America

PubMed Central

Huang, Yanyan; Wille, Michelle; Dobbin, Ashley; Walzthöni, Natasha M.; Robertson, Gregory J.; Ojkic, Davor; Whitney, Hugh; Lang, Andrew S.

2014-01-01

Wild birds, including waterfowl such as ducks, are reservoir hosts of influenza A viruses. Despite the increased number of avian influenza virus (AIV) genome sequences available, our understanding of AIV genetic structure and transmission through space and time in waterfowl in North America is still limited. In particular, AIVs in ducks of the Atlantic flyway of North America have not been thoroughly investigated. To begin to address this gap, we analyzed 109 AIV genome sequences from ducks in the Atlantic flyway to determine their genetic structure and to document the extent of gene flow in the context of sequences from other locations and other avian and mammalian host groups. The analyses included 25 AIVs from ducks from Newfoundland, Canada, from 2008–2011 and 84 available reference duck AIVs from the Atlantic flyway from 2006–2011. A vast diversity of viral genes and genomes was identified in the 109 viruses. The genetic structure differed amongst the 8 viral segments with predominant single lineages found for the PB2, PB1 and M segments, increased diversity found for the PA, NP and NS segments (2, 3 and 3 lineages, respectively), and the highest diversity found for the HA and NA segments (12 and 9 lineages, respectively). Identification of inter-hemispheric transmissions was rare with only 2% of the genes of Eurasian origin. Virus transmission between ducks and other bird groups was investigated, with 57.3% of the genes having highly similar (≥99% nucleotide identity) genes detected in birds other than ducks. Transmission between North American flyways has been frequent and 75.8% of the genes were highly similar to genes found in other North American flyways. However, the duck AIV genes did display spatial distribution bias, which was demonstrated by the different population sizes of specific viral genes in one or two neighbouring flyways compared to more distant flyways. PMID:24498009
Population structuring of multi-copy, antigen-encoding genes in Plasmodium falciparum

PubMed Central

Artzy-Randrup, Yael; Rorick, Mary M; Day, Karen; Chen, Donald; Dobson, Andrew P; Pascual, Mercedes

2012-01-01

The coexistence of multiple independently circulating strains in pathogen populations that undergo sexual recombination is a central question of epidemiology with profound implications for control. An agent-based model is developed that extends earlier ‘strain theory’ by addressing the var gene family of Plasmodium falciparum. The model explicitly considers the extensive diversity of multi-copy genes that undergo antigenic variation via sequential, mutually exclusive expression. It tracks the dynamics of all unique var repertoires in a population of hosts, and shows that even under high levels of sexual recombination, strain competition mediated through cross-immunity structures the parasite population into a subset of coexisting dominant repertoires of var genes whose degree of antigenic overlap depends on transmission intensity. Empirical comparison of patterns of genetic variation at antigenic and neutral sites supports this role for immune selection in structuring parasite diversity. DOI: http://dx.doi.org/10.7554/eLife.00093.001 PMID:23251784
Deletion Analysis of the Tumorous-Head (tuh–3) Gene in DROSOPHILA MELANOGASTER

PubMed Central

Kuhn, David T.; Woods, Daniel F.; Andrew, Deborah J.

1981-01-01

In the presence of the naturally occurring maternal-effect alleles tuh-1h or tuh-1g, the tuh-3 mutant gene can cause the tumorous-head trait or the sac-testis trait. The tuh-3 gene functions as a semidominant in the presence of the tuh-1h maternal effect. Eye-antennal structures are replaced by posterior abdominal tergites and genital structures. If tuh-1h is replaced by its naturally occurring allele tuh-1g, tuh-3 functions as a recessive hypomorph and the defect switches from anterior to posterior structures, with a male genital-disc defect appearing with variable penetrance. Function and regulation of tuh-3+ may better be understood in light of the cytological localization of tuh-3 either adjacent to or as part of the bithorax complex. The tuh-3+ gene product appears to be essential for normal development, at least in the posterior end of the embryo. PMID:6804305
The Sucrose Synthase Gene Family in Chinese Pear (Pyrus bretschneideri Rehd.): Structure, Expression, and Evolution.

PubMed

Abdullah, Muhammad; Cao, Yungpeng; Cheng, Xi; Meng, Dandan; Chen, Yu; Shakoor, Awais; Gao, Junshan; Cai, Yongping

2018-05-11

Sucrose synthase (SS) is a key enzyme involved in sucrose metabolism that is critical in plant growth and development, and particularly quality of the fruit. Sucrose synthase gene families have been identified and characterized in plants various plants such as tobacco, grape, rice, and Arabidopsis . However, there is still lack of detailed information about sucrose synthase gene in pear. In the present study, we performed a systematic analysis of the pear ( Pyrus bretschneideri Rehd.) genome and reported 30 sucrose synthase genes. Subsequently, gene structure, phylogenetic relationship, chromosomal localization, gene duplications, promoter regions, collinearity, RNA-Seq data and qRT-PCR were conducted on these sucrose synthase genes. The transcript analysis revealed that 10 PbSSs genes (30%) were especially expressed in pear fruit development. Additionally, qRT-PCR analysis verified the RNA-seq data and shown that PbSS30 , PbSS24 , and PbSS15 have a potential role in the pear fruit development stages. This study provides important insights into the evolution of sucrose synthase gene family in pear and will provide assistance for further investigation of sucrose synthase genes functions in the process of fruit development, fruit quality and resistance to environmental stresses.
Genome structure drives patterns of gene family evolution in ciliates, a case study using Chilodonella uncinata (Protista, Ciliophora, Phyllopharyngea)

PubMed Central

Gao, Feng; Song, Weibo; Katz, Laura A.

2014-01-01

In most lineages, diversity among gene family members results from gene duplication followed by sequence divergence. Because of the genome rearrangements during the development of somatic nuclei, gene family evolution in ciliates involves more complex processes. Previous work on the ciliate Chilodonella uncinata revealed that macronuclear β-tubulin gene family members are generated by alternative processing, in which germline regions are alternatively used in multiple macronuclear chromosomes. To further study genome evolution in this ciliate, we analyzed its transcriptome and found that: 1) alternative processing is extensive among gene families; and 2) such gene families are likely to be C. uncinata-specific. We characterized additional macronuclear and micronuclear copies of one candidate alternatively processed gene family -- a protein kinase domain containing protein (PKc) -- from two C. uncinata strains. Analysis of the PKc sequences reveals: 1) multiple PKc gene family members in the macronucleus share some identical regions flanked by divergent regions; and 2) the shared identical regions are processed from a single micronuclear chromosome. We discuss analogous processes in lineages across the eukaryotic tree of life to provide further insights on the impact of genome structure on gene family evolution in eukaryotes. PMID:24749903
The X chromosome in space.

PubMed

Jégu, Teddy; Aeby, Eric; Lee, Jeannie T

2017-06-01

Extensive 3D folding is required to package a genome into the tiny nuclear space, and this packaging must be compatible with proper gene expression. Thus, in the well-hierarchized nucleus, chromosomes occupy discrete territories and adopt specific 3D organizational structures that facilitate interactions between regulatory elements for gene expression. The mammalian X chromosome exemplifies this structure-function relationship. Recent studies have shown that, upon X-chromosome inactivation, active and inactive X chromosomes localize to different subnuclear positions and adopt distinct chromosomal architectures that reflect their activity states. Here, we review the roles of long non-coding RNAs, chromosomal organizational structures and the subnuclear localization of chromosomes as they relate to X-linked gene expression.
A new yeast gene with a myosin-like heptad repeat structure.

PubMed

Kölling, R; Nguyen, T; Chen, E Y; Botstein, D

1993-03-01

We isolated a gene encoding a 218 kDa myosin-like protein from Saccharomyces cerevisiae using a monoclonal antibody directed against human platelet myosin as a probe. The protein sequence encoded by the MLP1 gene (for myosin-like protein) contains extensive stretches of a heptad-repeat pattern suggesting that the protein can form coiled coils typical of myosins. Immunolocalization experiments using affinity-purified antibodies raised against a TrpE-MLP1 fusion protein showed a dot-like structure adjacent to the nucleus in yeast cells bearing the MLP1 gene on a multicopy plasmid. In mouse epithelial cells the yeast anti-MLP1 antibodies stained the nucleus. Mutants bearing disruptions of the MLP1 gene were viable, but more sensitive to ultraviolet light than wild-type strains, suggesting an involvement of MLP1 in DNA repair. The MLP1 gene was mapped to chromosome 11, 25 cM from met1.
Multiple Vibrio fischeri genes are involved in biofilm formation and host colonization

PubMed Central

Chavez-Dozal, Alba; Hogan, David; Gorman, Clayton; Quintanal-Villalonga, Alvaro; Nishiguchi, Michele K.

2012-01-01

Biofilms are increasingly recognized as the predominant form for survival in the environment for most bacteria. The successful colonization of Vibrio fischeri in its squid host Euprymna tasmanica, involves complex microbe-host interactions mediated by specific genes that are essential for biofilm formation and colonization. In the present investigation, structural and regulatory genes were selected to study their role in biofilm formation and host colonization. We have mutated several genes (pilT, pilU, flgF, motY, ibpA and mifB) by an insertional inactivation strategy. Results demonstrate that structural genes responsible for synthesis of type IV pili and flagella are crucial for biofilm formation and host infection. Moreover, regulatory genes affect colony aggregation by various mechanisms including alteration of synthesis of transcriptional factors and regulation of extracellular polysaccharide production. These results reflect the significance of how genetic alterations influence communal behavior, which is important in understanding symbiotic relationships. PMID:22486781
Genome-wide identification and characterization of Glyceraldehyde-3-phosphate dehydrogenase genes family in wheat (Triticum aestivum).

PubMed

Zeng, Lingfeng; Deng, Rong; Guo, Ziping; Yang, Shushen; Deng, Xiping

2016-03-16

Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is a central enzyme in glycolysi, we performed genome-wide identification of GAPDH genes in wheat and analyzed their structural characteristics and expression patterns under abiotic stress in wheat. A total of 22 GAPDH genes were identified in wheat cv. Chinese spring; the phylogenetic and structure analysis showed that these GAPDH genes could be divided into four distinct subfamilies. The expression profiles of GAPDH genes showed tissue specificity all over plant development stages. The qRT-PCR results revealed that wheat GAPDHs were involved in several abiotic stress response. Wheat carried 22 GAPDH genes, representing four types of plant GAPDHs (gapA/B, gapC, gapCp and gapN). Whole genome duplication and segmental duplication might account for the expansion of wheat GAPDHs. Expression analysis implied that GAPDHs play roles in plants abiotic stress tolerance.
CATH-Gene3D: Generation of the Resource and Its Use in Obtaining Structural and Functional Annotations for Protein Sequences.

PubMed

Dawson, Natalie L; Sillitoe, Ian; Lees, Jonathan G; Lam, Su Datt; Orengo, Christine A

2017-01-01

This chapter describes the generation of the data in the CATH-Gene3D online resource and how it can be used to study protein domains and their evolutionary relationships. Methods will be presented for: comparing protein structures, recognizing homologs, predicting domain structures within protein sequences, and subclassifying superfamilies into functionally pure families, together with a guide on using the webpages.
Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

PubMed

Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.
Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

PubMed Central

Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417
Functional gene array-based analysis of microbial community structure in groundwaters with a gradient of contaminant levels

DOE Office of Scientific and Technical Information (OSTI.GOV)

Waldron, P.J.; Wu, L.; Van Nostrand, J.D.

2009-06-15

To understand how contaminants affect microbial community diversity, heterogeneity, and functional structure, six groundwater monitoring wells from the Field Research Center of the U.S. Department of Energy Environmental Remediation Science Program (ERSP; Oak Ridge, TN), with a wide range of pH, nitrate, and heavy metal contamination were investigated. DNA from the groundwater community was analyzed with a functional gene array containing 2006 probes to detect genes involved in metal resistance, sulfate reduction, organic contaminant degradation, and carbon and nitrogen cycling. Microbial diversity decreased in relation to the contamination levels of the wells. Highly contaminated wells had lower gene diversity butmore » greater signal intensity than the pristine well. The microbial composition was heterogeneous, with 17-70% overlap between different wells. Metal-resistant and metal-reducing microorganisms were detected in both contaminated and pristine wells, suggesting the potential for successful bioremediation of metal-contaminated groundwaters. In addition, results of Mantel tests and canonical correspondence analysis indicate that nitrate, sulfate, pH, uranium, and technetium have a significant (p < 0.05) effect on microbial community structure. This study provides an overall picture of microbial community structure in contaminated environments with functional gene arrays by showing that diversity and heterogeneity can vary greatly in relation to contamination.« less
The complete mitochondrial genome and its remarkable secondary structure for a stonefly Acroneuria hainana Wu (Insecta: Plecoptera, Perlidae).

PubMed

Huang, Mingchao; Wang, Yuyu; Liu, Xingyue; Li, Weihai; Kang, Zehui; Wang, Kai; Li, Xuankun; Yang, Ding

2015-02-15

The Plecoptera (stoneflies) is a hemimetabolous order of insects, whose larvae are usually used as indicators for fresh water biomonitoring. Herein, we describe the complete mitochondrial (mt) genome of a stonefly species, namely Acroneuria hainana Wu belonging to the family Perlidae. This mt genome contains 13 PCGs, 22 tRNA-coding genes and 2 rRNA-coding genes that are conserved in most insect mt genomes, and it also has the identical gene order with the insect ancestral gene order. However, there are three special initiation codons of ND1, ND5 and COI in PCGs: TTG, GTG and CGA, coding for L, V and R, respectively. Additionally, the 899-bp control region, with 73.30% A+T content, has two long repeated sequences which are found at the 3'-end closing to the tRNA(Ile) gene. Both of them can be folded into a stem-loop structure, whose adjacent upstream and downstream sequences can be also folded into stem-loop structures. It is presumed that the four special structures in series could be associated with the D-loop replication. It might be able to adjust the replication speed of two replicate directions. Copyright © 2014 Elsevier B.V. All rights reserved.

Integrative analysis of transcriptomic and metabolomic data via sparse canonical correlation analysis with incorporation of biological information.

PubMed

Safo, Sandra E; Li, Shuzhao; Long, Qi

2018-03-01

Integrative analysis of high dimensional omics data is becoming increasingly popular. At the same time, incorporating known functional relationships among variables in analysis of omics data has been shown to help elucidate underlying mechanisms for complex diseases. In this article, our goal is to assess association between transcriptomic and metabolomic data from a Predictive Health Institute (PHI) study that includes healthy adults at a high risk of developing cardiovascular diseases. Adopting a strategy that is both data-driven and knowledge-based, we develop statistical methods for sparse canonical correlation analysis (CCA) with incorporation of known biological information. Our proposed methods use prior network structural information among genes and among metabolites to guide selection of relevant genes and metabolites in sparse CCA, providing insight on the molecular underpinning of cardiovascular disease. Our simulations demonstrate that the structured sparse CCA methods outperform several existing sparse CCA methods in selecting relevant genes and metabolites when structural information is informative and are robust to mis-specified structural information. Our analysis of the PHI study reveals that a number of gene and metabolic pathways including some known to be associated with cardiovascular diseases are enriched in the set of genes and metabolites selected by our proposed approach. © 2017, The International Biometric Society.
Web application for automatic prediction of gene translation elongation efficiency.

PubMed

Sokolov, Vladimir; Zuraev, Bulat; Lashin, Sergei; Matushkin, Yury

2015-09-03

Expression efficiency is one of the major characteristics describing genes in various modern investigations. Expression efficiency of genes is regulated at various stages: transcription, translation, posttranslational protein modification and others. In this study, a special EloE (Elongation Efficiency) web application is described. The EloE sorts the organism's genes in a descend order on their theoretical rate of the elongation stage of translation based on the analysis of their nucleotide sequences. Obtained theoretical data have a significant correlation with available experimental data of gene expression in various organisms. In addition, the program identifies preferential codons in organism's genes and defines distribution of potential secondary structures energy in 5´ and 3´ regions of mRNA. The EloE can be useful in preliminary estimation of translation elongation efficiency for genes for which experimental data are not available yet. Some results can be used, for instance, in other programs modeling artificial genetic structures in genetically engineered experiments.
Retinal dystrophies, genomic applications in diagnosis and prospects for therapy

PubMed Central

Nash, Benjamin M.; Wright, Dale C.; Grigg, John R.; Bennetts, Bruce

2015-01-01

Retinal dystrophies (RDs) are degenerative diseases of the retina which have marked clinical and genetic heterogeneity. Common presentations among these disorders include night or colour blindness, tunnel vision and subsequent progression to complete blindness. The known causative disease genes have a variety of developmental and functional roles with mutations in more than 120 genes shown to be responsible for the phenotypes. In addition, mutations within the same gene have been shown to cause different disease phenotypes, even amongst affected individuals within the same family highlighting further levels of complexity. The known disease genes encode proteins involved in retinal cellular structures, phototransduction, the visual cycle, and photoreceptor structure or gene regulation. This review aims to demonstrate the high degree of genetic complexity in both the causative disease genes and their associated phenotypes, highlighting the more common clinical manifestation of retinitis pigmentosa (RP). The review also provides insight to recent advances in genomic molecular diagnosis and gene and cell-based therapies for the RDs. PMID:26835369
The organization of the fuc regulon specifying L-fucose dissimilation in Escherichia coli K12 as determined by gene cloning.

PubMed

Chen, Y M; Zhu, Y; Lin, E C

1987-12-01

In Escherichia coli the six known genes specifying the utilization of L-fucose as carbon and energy source cluster at 60.2 min and constitute a regulon. These genes include fucP (encoding L-fucose permease), fucI (encoding L-fucose isomerase), fucK (encoding L-fuculose kinase), fucA (encoding L-fuculose 1-phosphate aldolase), fucO (encoding L-1,2-propanediol oxidoreductase), and fucR (encoding the regulatory protein). In this study the fuc genes were cloned and their positions on the chromosome were established by restriction endonuclease and complementation analyses. Clockwise, the gene order is: fucO-fucA-fucP-fucI-fucK-fucR. The operons comprising the structural genes and the direction of transcription were determined by complementation analysis and Southern blot hybridization. The fucPIK and fucA operons are transcribed clockwise. The fucO operon is transcribed counterclockwise. The fucR gene product activates the three structural operons in trans.
Cloning and bioinformatic analysis of lovastatin biosynthesis regulatory gene lovE.

PubMed

Huang, Xin; Li, Hao-ming

2009-08-05

Lovastatin is an effective drug for treatment of hyperlipidemia. This study aimed to clone lovastatin biosynthesis regulatory gene lovE and analyze the structure and function of its encoding protein. According to the lovastatin synthase gene sequence from genebank, primers were designed to amplify and clone the lovastatin biosynthesis regulatory gene lovE from Aspergillus terrus genomic DNA. Bioinformatic analysis of lovE and its encoding animo acid sequence was performed through internet resources and software like DNAMAN. Target fragment lovE, almost 1500 bp in length, was amplified from Aspergillus terrus genomic DNA and the secondary and three-dimensional structures of LovE protein were predicted. In the lovastatin biosynthesis process lovE is a regulatory gene and LovE protein is a GAL4-like transcriptional factor.
The O-antigen structure of bacterium Comamonas aquatica CJG.

PubMed

Wang, Xiqian; Kondakova, Anna N; Zhu, Yutong; Knirel, Yuriy A; Han, Aidong

2017-11-01

Genus Comamonas is a group of bacteria that are able to degrade a variety of environmental waste. Comamonas aquatica CJG (C. aquatica) in this genus is able to absorb low-density lipoprotein but not high-density lipoprotein of human serum. Using 1 H and 13 C NMR spectroscopy, we found that the O-polysaccharide (O-antigen) of this bacterium is comprised of a disaccharide repeat (O-unit) of d-glucose and 2-O-acetyl-l-rhamnose, which is shared by Serratia marcescens O6. The O-antigen gene cluster of C. aquatica, which is located between coaX and tnp4 genes, contains rhamnose synthesis genes, glycosyl and acetyl transferase genes, and ATP-binding cassette transporter genes, and therefore is consistent with the O-antigen structure determined here.
A novel role for the Bombyx Slbo homologue, BmC/EBP, in insect choriogenesis.

PubMed

Sourmeli, S; Papantonis, A; Lecanidou, R

2005-11-18

One previously unidentified cDNA clone coding for a C/EBP factor, BmC/EBP, was isolated from Bombyx mori follicular cells. This is the first time that a C/EBP factor has been isolated and characterized in Lepidoptera. We provide information concerning structural features and developmental specificity, as well as in vitro interaction properties with chorion gene promoter modules. BmC/EBP was capable of effectively recognizing homologous binding sites from chorion gene promoters derived from flies and other moths, despite significant diversity of chorion structure, gene organization, and gene expression profiles. We propose that the relative concentration of BmC/EBP, in relation to its differential binding affinity for promoter cis-elements, results in activation or repression of silkmoth chorion gene expression.
Landscape heterogeneity predicts gene flow in a widespread polymorphic bumble bee, Bombus bifarius (Hymentoptera: Apidae).

USDA-ARS?s Scientific Manuscript database

Bombus bifarius is a widespread bumble bee that occurs in montane regions of western North America. This species has several major color polymorphisms, and shows evidence of genetic structuring among regional populations. We test whether this structure is evidence for discrete gene flow barriers tha...
qPCR for Second Year Undergraduates: A Short, Structured Inquiry to Illustrate Differential Gene Expression

ERIC Educational Resources Information Center

McCauslin, Christine Seitz; Gunn, Kathryn Elaine; Pirone, Dana; Staiger, Jennifer

2015-01-01

We describe a structured inquiry laboratory exercise that examines transcriptional regulation of the "NOS2" gene under conditions that simulate the inflammatory response in macrophages. Using quantitative PCR and the comparative C[subscript T] method, students are able determine whether transcriptional activation of "NOS2"…
Spatial scaling and multi-model inference in landscape genetics: Martes americana in northern Idaho

Treesearch

Tzeidle N. Wasserman; Samuel A. Cushman; Michael K. Schwartz; David O. Wallin

2010-01-01

Individual-based analyses relating landscape structure to genetic distances across complex landscapes enable rigorous evaluation of multiple alternative hypotheses linking landscape structure to gene flow. We utilize two extensions to increase the rigor of the individual-based causal modeling approach to inferring relationships between landscape patterns and gene flow...
Detecting the existence of gene flow between Spanish and North African goats through a coalescent approach.

PubMed

Martínez, Amparo; Manunza, Arianna; Delgado, Juan Vicente; Landi, Vincenzo; Adebambo, Ayotunde; Ismaila, Muritala; Capote, Juan; El Ouni, Mabrouk; Elbeltagy, Ahmed; Abushady, Asmaa M; Galal, Salah; Ferrando, Ainhoa; Gómez, Mariano; Pons, Agueda; Badaoui, Bouabid; Jordana, Jordi; Vidal, Oriol; Amills, Marcel

2016-12-14

Human-driven migrations are one of the main processes shaping the genetic diversity and population structure of domestic species. However, their magnitude and direction have been rarely analysed in a statistical framework. We aimed to estimate the impact of migration on the population structure of Spanish and African goats. To achieve this goal, we analysed a dataset of 1,472 individuals typed with 23 microsatellites. Population structure of African and Spanish goats was moderate (mean F ST = 0.07), with the exception of the Canarian and South African breeds that displayed a significant differentiation when compared to goats from North Africa and Nigeria. Measurement of gene flow with Migrate-n and IMa coalescent genealogy samplers supported the existence of a bidirectional gene flow between African and Spanish goats. Moreover, IMa estimates of the effective number of migrants were remarkably lower than those calculated with Migrate-n and classical approaches. Such discrepancies suggest that recent divergence, rather than extensive gene flow, is the main cause of the weak population structure observed in caprine breeds.
Detecting the existence of gene flow between Spanish and North African goats through a coalescent approach

PubMed Central

Martínez, Amparo; Manunza, Arianna; Delgado, Juan Vicente; Landi, Vincenzo; Adebambo, Ayotunde; Ismaila, Muritala; Capote, Juan; El Ouni, Mabrouk; Elbeltagy, Ahmed; Abushady, Asmaa M.; Galal, Salah; Ferrando, Ainhoa; Gómez, Mariano; Pons, Agueda; Badaoui, Bouabid; Jordana, Jordi; Vidal, Oriol; Amills, Marcel

2016-01-01

Human-driven migrations are one of the main processes shaping the genetic diversity and population structure of domestic species. However, their magnitude and direction have been rarely analysed in a statistical framework. We aimed to estimate the impact of migration on the population structure of Spanish and African goats. To achieve this goal, we analysed a dataset of 1,472 individuals typed with 23 microsatellites. Population structure of African and Spanish goats was moderate (mean FST = 0.07), with the exception of the Canarian and South African breeds that displayed a significant differentiation when compared to goats from North Africa and Nigeria. Measurement of gene flow with Migrate-n and IMa coalescent genealogy samplers supported the existence of a bidirectional gene flow between African and Spanish goats. Moreover, IMa estimates of the effective number of migrants were remarkably lower than those calculated with Migrate-n and classical approaches. Such discrepancies suggest that recent divergence, rather than extensive gene flow, is the main cause of the weak population structure observed in caprine breeds. PMID:27966592
Mixing and Matching Siderophore Clusters: Structure and Biosynthesis of Serratiochelins from Serratia sp. V4

PubMed Central

2012-01-01

Interrogation of the evolutionary history underlying the remarkable structures and biological activities of natural products has been complicated by not knowing the functions they have evolved to fulfill. Siderophores—soluble, low molecular weight compounds—have an easily understood and measured function: acquiring iron from the environment. Bacteria engage in a fierce competition to acquire iron, which rewards the production of siderophores that bind iron tightly and cannot be used or pirated by competitors. The structures and biosyntheses of “odd” siderophores can reveal the evolutionary strategy that led to their creation. We report a new Serratia strain that produces serratiochelin and an analog of serratiochelin. A genetic approach located the serratiochelin gene cluster, and targeted mutations in several genes implicated in serratiochelin biosynthesis were generated. Bioinformatic analyses and mutagenesis results demonstrate that genes from two well-known siderophore clusters, the Escherichia coli enterobactin cluster and the Vibrio cholera vibriobactin cluster, were shuffled to produce a new siderophore biosynthetic pathway. These results highlight how modular siderophore gene clusters can be mixed and matched during evolution to generate structural diversity in siderophores. PMID:22830960
Mixing and matching siderophore clusters: structure and biosynthesis of serratiochelins from Serratia sp. V4.

PubMed

Seyedsayamdost, Mohammad R; Cleto, Sara; Carr, Gavin; Vlamakis, Hera; João Vieira, Maria; Kolter, Roberto; Clardy, Jon

2012-08-22

Interrogation of the evolutionary history underlying the remarkable structures and biological activities of natural products has been complicated by not knowing the functions they have evolved to fulfill. Siderophores-soluble, low molecular weight compounds-have an easily understood and measured function: acquiring iron from the environment. Bacteria engage in a fierce competition to acquire iron, which rewards the production of siderophores that bind iron tightly and cannot be used or pirated by competitors. The structures and biosyntheses of "odd" siderophores can reveal the evolutionary strategy that led to their creation. We report a new Serratia strain that produces serratiochelin and an analog of serratiochelin. A genetic approach located the serratiochelin gene cluster, and targeted mutations in several genes implicated in serratiochelin biosynthesis were generated. Bioinformatic analyses and mutagenesis results demonstrate that genes from two well-known siderophore clusters, the Escherichia coli enterobactin cluster and the Vibrio cholera vibriobactin cluster, were shuffled to produce a new siderophore biosynthetic pathway. These results highlight how modular siderophore gene clusters can be mixed and matched during evolution to generate structural diversity in siderophores.
Genetic and Biochemical Map for the Biosynthesis of Occidiofungin, an Antifungal Produced by Burkholderia contaminans Strain MS14 ▿†

PubMed Central

Gu, Ganyu; Smith, Leif; Liu, Aixin; Lu, Shi-En

2011-01-01

A striking feature of Burkholderia contaminans strain MS14 is the production of a glycolipopeptide named occidiofungin. Occidiofungin has a broad range of antifungal activities against plant and animal pathogens. In this study, a complete covalent structure characterization and identification of the whole genomic DNA region for the occidiofungin gene (ocf) cluster are described. Discovery of the presence of 2,4-diaminobutyric acid and 3-chloro-β-hydroxytyrosine and elucidation of the structure of a novel C18 fatty amino acid residue have been achieved. In addition, seven additional putative open reading frames (the genes from ocfI to ocfN [ocfI-N] and ORF16) were identified. Transcription of all the putative genes ocfI-N identified in the region except ORF16 was regulated by both ambR1 and ambR2. Elucidation of the structure and the ocf gene cluster provides insight into the biosynthesis of occidiofungin and promotes future aims at understanding the biosynthetic machinery. This work provides new avenues for optimizing the production and synthesis of structural analogs of occidiofungin. PMID:21742901
SeMPI: a genome-based secondary metabolite prediction and identification web server.

PubMed

Zierep, Paul F; Padilla, Natàlia; Yonchev, Dimitar G; Telukunta, Kiran K; Klementz, Dennis; Günther, Stefan

2017-07-03

The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for an efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not been identified yet. Even though genome mining tools have become significantly more efficient in the identification of biosynthetic gene clusters, structural elucidation of the actual secondary metabolite is still challenging, especially due to as yet unpredictable post-modifications. Here, we introduce SeMPI, a web server providing a prediction and identification pipeline for natural products synthesized by polyketide synthases of type I modular. In order to limit the possible structures of PKS products and to include putative tailoring reactions, a structural comparison with annotated natural products was introduced. Furthermore, a benchmark was designed based on 40 gene clusters with annotated PKS products. The web server of the pipeline (SeMPI) is freely available at: http://www.pharmaceutical-bioinformatics.de/sempi. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural features based genome-wide characterization and prediction of nucleosome organization

PubMed Central

2012-01-01

Background Nucleosome distribution along chromatin dictates genomic DNA accessibility and thus profoundly influences gene expression. However, the underlying mechanism of nucleosome formation remains elusive. Here, taking a structural perspective, we systematically explored nucleosome formation potential of genomic sequences and the effect on chromatin organization and gene expression in S. cerevisiae. Results We analyzed twelve structural features related to flexibility, curvature and energy of DNA sequences. The results showed that some structural features such as DNA denaturation, DNA-bending stiffness, Stacking energy, Z-DNA, Propeller twist and free energy, were highly correlated with in vitro and in vivo nucleosome occupancy. Specifically, they can be classified into two classes, one positively and the other negatively correlated with nucleosome occupancy. These two kinds of structural features facilitated nucleosome binding in centromere regions and repressed nucleosome formation in the promoter regions of protein-coding genes to mediate transcriptional regulation. Based on these analyses, we integrated all twelve structural features in a model to predict more accurately nucleosome occupancy in vivo than the existing methods that mainly depend on sequence compositional features. Furthermore, we developed a novel approach, named DLaNe, that located nucleosomes by detecting peaks of structural profiles, and built a meta predictor to integrate information from different structural features. As a comparison, we also constructed a hidden Markov model (HMM) to locate nucleosomes based on the profiles of these structural features. The result showed that the meta DLaNe and HMM-based method performed better than the existing methods, demonstrating the power of these structural features in predicting nucleosome positions. Conclusions Our analysis revealed that DNA structures significantly contribute to nucleosome organization and influence chromatin structure and gene expression regulation. The results indicated that our proposed methods are effective in predicting nucleosome occupancy and positions and that these structural features are highly predictive of nucleosome organization. The implementation of our DLaNe method based on structural features is available online. PMID:22449207
Comparative expression of the four enamel matrix protein genes, amelogenin, ameloblastin, enamelin and amelotin during amelogenesis in the lizard Anolis carolinensis.

PubMed

Gasse, Barbara; Sire, Jean-Yves

2015-01-01

In a recent study, we have demonstrated that amelotin (AMTN) gene structure and its expression during amelogenesis have changed during tetrapod evolution. Indeed, this gene is expressed throughout enamel matrix deposition and maturation in non-mammalian tetrapods, while in mammals its expression is restricted to the transition and maturation stages of amelogenesis. Previous studies of amelogenin (AMEL) gene expression in a lizard and a salamander have shown similar expression pattern to that in mammals, but to our knowledge there are no data regarding ameloblastin (AMBN) and enamelin (ENAM) expression in non-mammalian tetrapods. The present study aims to look at, and compare, the structure and expression of four enamel matrix protein genes, AMEL, AMBN, ENAM and AMTN during amelogenesis in the lizard Anolis carolinensis. We provide the full-length cDNA sequence of A. carolinensis AMEL and AMBN, and show for the first time the expression of ENAM and AMBN in a non-mammalian species. During amelogenesis in A. carolinensis, AMEL, AMBN and ENAM expression in ameloblasts is similar to that described in mammals. It is noteworthy that AMEL and AMBN expression is also found in odontoblasts. Our findings indicate that AMTN is the only enamel matrix protein gene that is differentially expressed in ameloblasts between mammals and sauropsids. Changes in AMTN structure and expression could be the key to explain the structural differences between mammalian and reptilian enamel, i.e. prismatic versus non-prismatic.
The complete mitochondrial genome of the pink stem borer, Sesamia inferens, in comparison with four other Noctuid moths.

PubMed

Chai, Huan-Na; Du, Yu-Zhou

2012-01-01

The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif "ATAGA" followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite "(AT)(7)", without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae.
The Complete Mitochondrial Genome of the Pink Stem Borer, Sesamia inferens, in Comparison with Four Other Noctuid Moths

PubMed Central

Chai, Huan-Na; Du, Yu-Zhou

2012-01-01

The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif “ATAGA” followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite “(AT)7”, without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae. PMID:22949858

Genome-wide Identification and analysis of the stress-resistance function of the TPS (Trehalose-6-Phosphate Synthase) gene family in cotton.

PubMed

Mu, Min; Lu, Xu-Ke; Wang, Jun-Juan; Wang, De-Long; Yin, Zu-Jun; Wang, Shuai; Fan, Wei-Li; Ye, Wu-Wei

2016-03-18

Trehalose (a-D-glucopyranosyl a-D-glucopyranoside) is a nonreducing disaccharide and is widely distributed in bacteria, fungi, algae, plants and invertebrates. In the study, the identification of trehalose-6-phosphate synthase (TPS) genes stress-related in cotton, and the genetic structure analysis and molecular evolution analysis of TPSs were conducted with bioinformatics methods, which could lay a foundation for further research of TPS functions in cotton. The genome information of Gossypium raimondii (group D), G. arboreum L. (group A), and G. hirsutum L. (group AD) was used in the study. Fifty-three TPSs were identified comprising 15 genes in group D, 14 in group A, and 24 in group AD. Bioinformatics methods were used to analyze the genetic structure and molecular evolution of TPSs. Real-time PCR analysis was performed to investigate the expression patterns of gene family members. All TPS family members in cotton can be divided into two subfamilies: Class I and Class II. The similarity of the TPS sequence is high within the same species and close within their family relatives. The genetic structures of two TPS subfamily members are different, with more introns and a more complicated gene structure in Class I. There is a TPS domain(Glyco transf_20) at the N-terminal in all TPS family members and a TPP domain(Trehalose_PPase) at the C-terminal in all except GrTPS6, GhTPS4, and GhTPS9. All Class II members contain a UDP-forming domain. The responses to environmental stresses showed that stresses could induce the expression of TPSs but the expression patterns vary with different stresses. The distribution of TPSs varies with different species but is relatively uniform on chromosomes. Genetic structure varies with different gene members, and expression levels vary with different stresses and exhibit tissue specificity. The upregulated genes in upland cotton TM-1 is significantly more than that in G. raimondii and G. arboreum L. Shixiya 1.
Next generation haplotyping to decipher nuclear genomic interspecific admixture in Citrus species: analysis of chromosome 2.

PubMed

Curk, Franck; Ancillo, Gema; Garcia-Lor, Andres; Luro, François; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Navarro, Luis; Ollitrault, Patrick

2014-12-29

The most economically important Citrus species originated by natural interspecific hybridization between four ancestral taxa (Citrus reticulata, Citrus maxima, Citrus medica, and Citrus micrantha) and from limited subsequent interspecific recombination as a result of apomixis and vegetative propagation. Such reticulate evolution coupled with vegetative propagation results in mosaic genomes with large chromosome fragments from the basic taxa in frequent interspecific heterozygosity. Modern breeding of these species is hampered by their complex heterozygous genomic structures that determine species phenotype and are broken by sexual hybridisation. Nevertheless, a large amount of diversity is present in the citrus gene pool, and breeding to allow inclusion of desirable traits is of paramount importance. However, the efficient mobilization of citrus biodiversity in innovative breeding schemes requires previous understanding of Citrus origins and genomic structures. Haplotyping of multiple gene fragments along the whole genome is a powerful approach to reveal the admixture genomic structure of current species and to resolve the evolutionary history of the gene pools. In this study, the efficiency of parallel sequencing with 454 methodology to decipher the hybrid structure of modern citrus species was assessed by analysis of 16 gene fragments on chromosome 2. 454 amplicon libraries were established using the Fluidigm array system for 48 genotypes and 16 gene fragments from chromosome 2. Haplotypes were established from the reads of each accession and phylogenetic analyses were performed using the haplotypic data for each gene fragment. The length of 454 reads and the level of differentiation between the ancestral taxa of modern citrus allowed efficient haplotype phylogenetic assignations for 12 of the 16 gene fragments. The analysis of the mixed genomic structure of modern species and cultivars (i) revealed C. maxima introgressions in modern mandarins, (ii) was consistent with previous hypotheses regarding the origin of secondary species, and (iii) provided a new picture of the evolution of chromosome 2. 454 sequencing was an efficient strategy to establish haplotypes with significant phylogenetic assignations in Citrus, providing a new picture of the mixed structure on chromosome 2 in 48 citrus genotypes.
Structure and variation of the mitochondrial genome of fishes.

PubMed

Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi

2016-09-07

The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.
SUC1 gene of Saccharomyces: a structural gene for the large (glycoprotein) and small (carbohydrate-free) forms of invertase.

PubMed Central

Rodriguez, L; Lampen, J O; MacKay, V L

1981-01-01

Saccharomyces cerevisiae revertant strain D10-ER1 has been shown to contain thermosensitive forms of the large (glycoprotein) and small (carbohydrate-free) invertases and a very low level of the small enzyme, along with a wild-type level of the large form (T. Mizunaga et al., Mol. Cell. Biol. 1:460-468, 1981). These characteristics cosegregated in crosses of the revertant strain with wild-type sucrose-fermenting (SUC1) or nonfermenting (suc0) strains. In addition, there is tight linkage between sucrose and maltose fermentation in revertant D10-ER1 (characteristic of the SUC1 and MAL1 genes). From this we infer that a single reversion event is responsible for the several changes observed in D10-ER1, and that this mutation maps within or very close to the SUC1 gene present in the ancestor strain 4059-358D. The revertant SUC1 allele in D10-ER1 (termed SUC1-R1) was expressed independently of the wild-type SUC1 gene when both were present in diploid cells. Diploids carrying only the wild-type or the mutant genes synthesized invertases with the characteristics of the parental Suc+ haploids. The possibility that a modifier gene was responsible for the alterations in the invertases of revertant D10-ER1 was ruled out by appropriate crosses. We conclude that SUC1 is a structural gene that codes for both the large and the small forms of invertase and suggest that SUC2 through SUC5 are structural genes as well. PMID:6765604
Analysis of the Highly Diverse Gene Borders in Ebola Virus Reveals a Distinct Mechanism of Transcriptional Regulation

PubMed Central

Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki

2014-01-01

ABSTRACT Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. IMPORTANCE Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3′ end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. PMID:25142600
Analysis of the highly diverse gene borders in Ebola virus reveals a distinct mechanism of transcriptional regulation.

PubMed

Brauburger, Kristina; Boehmann, Yannik; Tsuda, Yoshimi; Hoenen, Thomas; Olejnik, Judith; Schümann, Michael; Ebihara, Hideki; Mühlberger, Elke

2014-11-01

Ebola virus (EBOV) belongs to the group of nonsegmented negative-sense RNA viruses. The seven EBOV genes are separated by variable gene borders, including short (4- or 5-nucleotide) intergenic regions (IRs), a single long (144-nucleotide) IR, and gene overlaps, where the neighboring gene end and start signals share five conserved nucleotides. The unique structure of the gene overlaps and the presence of a single long IR are conserved among all filoviruses. Here, we sought to determine the impact of the EBOV gene borders during viral transcription. We show that readthrough mRNA synthesis occurs in EBOV-infected cells irrespective of the structure of the gene border, indicating that the gene overlaps do not promote recognition of the gene end signal. However, two consecutive gene end signals at the VP24 gene might improve termination at the VP24-L gene border, ensuring efficient L gene expression. We further demonstrate that the long IR is not essential for but regulates transcription reinitiation in a length-dependent but sequence-independent manner. Mutational analysis of bicistronic minigenomes and recombinant EBOVs showed no direct correlation between IR length and reinitiation rates but demonstrated that specific IR lengths not found naturally in filoviruses profoundly inhibit downstream gene expression. Intriguingly, although truncation of the 144-nucleotide-long IR to 5 nucleotides did not substantially affect EBOV transcription, it led to a significant reduction of viral growth. Our current understanding of EBOV transcription regulation is limited due to the requirement for high-containment conditions to study this highly pathogenic virus. EBOV is thought to share many mechanistic features with well-analyzed prototype nonsegmented negative-sense RNA viruses. A single polymerase entry site at the 3' end of the genome determines that transcription of the genes is mainly controlled by gene order and cis-acting signals found at the gene borders. Here, we examined the regulatory role of the structurally unique EBOV gene borders during viral transcription. Our data suggest that transcriptional regulation in EBOV is highly complex and differs from that in prototype viruses and further the understanding of this most fundamental process in the filovirus replication cycle. Moreover, our results with recombinant EBOVs suggest a novel role of the long IR found in all filovirus genomes during the viral replication cycle. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Microbial Functional Gene Diversity with a Shift of Subsurface Redox Conditions during In Situ Uranium Reduction

PubMed Central

Liang, Yuting; Van Nostrand, Joy D.; N′Guessan, Lucie A.; Peacock, Aaron D.; Deng, Ye; Long, Philip E.; Resch, C. Tom; Wu, Liyou; He, Zhili; Li, Guanghe; Hazen, Terry C.; Lovley, Derek R.

2012-01-01

To better understand the microbial functional diversity changes with subsurface redox conditions during in situ uranium bioremediation, key functional genes were studied with GeoChip, a comprehensive functional gene microarray, in field experiments at a uranium mill tailings remedial action (UMTRA) site (Rifle, CO). The results indicated that functional microbial communities altered with a shift in the dominant metabolic process, as documented by hierarchical cluster and ordination analyses of all detected functional genes. The abundance of dsrAB genes (dissimilatory sulfite reductase genes) and methane generation-related mcr genes (methyl coenzyme M reductase coding genes) increased when redox conditions shifted from Fe-reducing to sulfate-reducing conditions. The cytochrome genes detected were primarily from Geobacter sp. and decreased with lower subsurface redox conditions. Statistical analysis of environmental parameters and functional genes indicated that acetate, U(VI), and redox potential (Eh) were the most significant geochemical variables linked to microbial functional gene structures, and changes in microbial functional diversity were strongly related to the dominant terminal electron-accepting process following acetate addition. The study indicates that the microbial functional genes clearly reflect the in situ redox conditions and the dominant microbial processes, which in turn influence uranium bioreduction. Microbial functional genes thus could be very useful for tracking microbial community structure and dynamics during bioremediation. PMID:22327592
Multi-label literature classification based on the Gene Ontology graph.

PubMed

Jin, Bo; Muller, Brian; Zhai, Chengxiang; Lu, Xinghua

2008-12-08

The Gene Ontology is a controlled vocabulary for representing knowledge related to genes and proteins in a computable form. The current effort of manually annotating proteins with the Gene Ontology is outpaced by the rate of accumulation of biomedical knowledge in literature, which urges the development of text mining approaches to facilitate the process by automatically extracting the Gene Ontology annotation from literature. The task is usually cast as a text classification problem, and contemporary methods are confronted with unbalanced training data and the difficulties associated with multi-label classification. In this research, we investigated the methods of enhancing automatic multi-label classification of biomedical literature by utilizing the structure of the Gene Ontology graph. We have studied three graph-based multi-label classification algorithms, including a novel stochastic algorithm and two top-down hierarchical classification methods for multi-label literature classification. We systematically evaluated and compared these graph-based classification algorithms to a conventional flat multi-label algorithm. The results indicate that, through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods can significantly improve predictions of the Gene Ontology terms implied by the analyzed text. Furthermore, the graph-based multi-label classifiers are capable of suggesting Gene Ontology annotations (to curators) that are closely related to the true annotations even if they fail to predict the true ones directly. A software package implementing the studied algorithms is available for the research community. Through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods have better potential than the conventional flat multi-label classification approach to facilitate protein annotation based on the literature.
Temporal changes in soil bacterial and archaeal communities with different fertilizers in tea orchards.

PubMed

Wang, Hua; Yang, Shao-hui; Yang, Jing-ping; Lv, Ya-min; Zhao, Xing; Pang, Ji-liang

2014-11-01

It is important to understand the effects of temporal changes in microbial communities in the acidic soils of tea orchards with different fertilizers. A field experiment involving organic fertilizer (OF), chemical fertilizer (CF), and unfertilized control (CK) treatments was arranged to analyze the temporal changes in the bacterial and archaeal communities at bimonthly intervals based on the 16S ribosomal RNA (rRNA) gene using terminal restriction fragment length polymorphism (T-RFLP) profiling. The abundances of total bacteria, total archaea, and selected functional genes (bacterial and archaeal amoA, bacterial narG, nirK, nirS, and nosZ) were determined by quantitative polymerase chain reaction (qPCR). The results indicate that the structures of bacterial and archaeal communities varied significantly with time and fertilization based on changes in the relative abundance of dominant T-RFs. The abundancy of the detected genes changed with time. The total bacteria, total archaea, and archaeal amoA were less abundant in July. The bacterial amoA and denitrifying genes were less abundant in September, except the nirK gene. The OF treatment increased the abundance of the observed genes, while the CF treatment had little influence on them. The soil temperature significantly affected the bacterial and archaeal community structures. The soil moisture was significantly correlated with the abundance of denitrifying genes. Of the soil chemical properties, soil organic carbon was the most important factor and was significantly correlated with the abundance of the detected genes, except the nirK gene. Overall, this study demonstrated the effects of both temporal alteration and organic fertilizer on the structures of microbial communities and the abundance of genes involved in the nitrogen cycle.
Temporal changes in soil bacterial and archaeal communities with different fertilizers in tea orchards* #

PubMed Central

Wang, Hua; Yang, Shao-hui; Yang, Jing-ping; Lv, Ya-min; Zhao, Xing; Pang, Ji-liang

2014-01-01

It is important to understand the effects of temporal changes in microbial communities in the acidic soils of tea orchards with different fertilizers. A field experiment involving organic fertilizer (OF), chemical fertilizer (CF), and unfertilized control (CK) treatments was arranged to analyze the temporal changes in the bacterial and archaeal communities at bimonthly intervals based on the 16S ribosomal RNA (rRNA) gene using terminal restriction fragment length polymorphism (T-RFLP) profiling. The abundances of total bacteria, total archaea, and selected functional genes (bacterial and archaeal amoA, bacterial narG, nirK, nirS, and nosZ) were determined by quantitative polymerase chain reaction (qPCR). The results indicate that the structures of bacterial and archaeal communities varied significantly with time and fertilization based on changes in the relative abundance of dominant T-RFs. The abundancy of the detected genes changed with time. The total bacteria, total archaea, and archaeal amoA were less abundant in July. The bacterial amoA and denitrifying genes were less abundant in September, except the nirK gene. The OF treatment increased the abundance of the observed genes, while the CF treatment had little influence on them. The soil temperature significantly affected the bacterial and archaeal community structures. The soil moisture was significantly correlated with the abundance of denitrifying genes. Of the soil chemical properties, soil organic carbon was the most important factor and was significantly correlated with the abundance of the detected genes, except the nirK gene. Overall, this study demonstrated the effects of both temporal alteration and organic fertilizer on the structures of microbial communities and the abundance of genes involved in the nitrogen cycle. PMID:25367788
Structure of the 5' region of the Hst70 gene transcription unit: presence of an intron and multiple transcription initiation sites.

PubMed Central

Scieglinska, D; Widłak, W; Konopka, W; Poutanen, M; Rahman, N; Huhtaniemi, I; Krawczyk, Z

2001-01-01

The rat Hst70 gene and its mouse counterpart Hsp70.2 belong to the family of Hsp70 heat shock genes and are specifically expressed in male germ cells. Previous studies regarding the structure of the 5' region of the transcription unit of these genes as well as localization of the 'cis' elements conferring their testis-specific expression gave contradictory results [Widlak, Markkula, Krawczyk, Kananen and Huhtaniemi (1995) Biochim. Biophys. Acta 1264, 191-200; Dix, Rosario-Herrle, Gotoh, Mori, Goulding, Barret and Eddy (1996) Dev. Biol. 174, 310-321]. In the present paper we solve these controversies and show that the 5' untranslated region (UTR) of the Hst70 gene contains an intron which is localized similar to that of the mouse Hsp70.2 gene. Reverse transcriptase-mediated PCR, Northern blotting and RNase protection analysis revealed that the transcription initiation of both genes starts at two main distant sites, and one of them is localized within the intron. As a result two populations of Hst70 gene transcripts with similar sizes but different 5' UTR structures can be detected in total testicular RNA. Functional analysis of the Hst70 gene promoter in transgenic mice and transient transfection assays proved that the DNA fragment of approx. 360 bp localized upstream of the ATG transcription start codon is the minimal promoter required for testis-specific expression of the HST70/chloramphenicol acetyltransferase transgene. These experiments also suggest that the expression of the gene may depend on 'cis' regulatory elements localized within exon 1 and the intron sequences. PMID:11563976
Combining laser microdissection and RNA-seq to chart the transcriptional landscape of fungal development

PubMed Central

2012-01-01

Background During sexual development, filamentous ascomycetes form complex, three-dimensional fruiting bodies for the protection and dispersal of sexual spores. Fruiting bodies contain a number of cell types not found in vegetative mycelium, and these morphological differences are thought to be mediated by changes in gene expression. However, little is known about the spatial distribution of gene expression in fungal development. Here, we used laser microdissection (LM) and RNA-seq to determine gene expression patterns in young fruiting bodies (protoperithecia) and non-reproductive mycelia of the ascomycete Sordaria macrospora. Results Quantitative analysis showed major differences in the gene expression patterns between protoperithecia and total mycelium. Among the genes strongly up-regulated in protoperithecia were the pheromone precursor genes ppg1 and ppg2. The up-regulation was confirmed by fluorescence microscopy of egfp expression under the control of ppg1 regulatory sequences. RNA-seq analysis of protoperithecia from the sterile mutant pro1 showed that many genes that are differentially regulated in these structures are under the genetic control of transcription factor PRO1. Conclusions We have generated transcriptional profiles of young fungal sexual structures using a combination of LM and RNA-seq. This allowed a high spatial resolution and sensitivity, and yielded a detailed picture of gene expression during development. Our data revealed significant differences in gene expression between protoperithecia and non-reproductive mycelia, and showed that the transcription factor PRO1 is involved in the regulation of many genes expressed specifically in sexual structures. The LM/RNA-seq approach will also be relevant to other eukaryotic systems in which multicellular development is investigated. PMID:23016559
Diversity in copy number and structure of a silkworm morphogenetic gene as a result of domestication.

PubMed

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-03-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. © 2011 by the Genetics Society of America
Diversity in Copy Number and Structure of a Silkworm Morphogenetic Gene as a Result of Domestication

PubMed Central

Sakudoh, Takashi; Nakashima, Takeharu; Kuroki, Yoko; Fujiyama, Asao; Kohara, Yuji; Honda, Naoko; Fujimoto, Hirofumi; Shimada, Toru; Nakagaki, Masao; Banno, Yutaka; Tsuchida, Kozo

2011-01-01

The carotenoid-binding protein (CBP) of the domesticated silkworm, Bombyx mori, a major determinant of cocoon color, is likely to have been substantially influenced by domestication of this species. We analyzed the structure of the CBP gene in multiple strains of B. mori, in multiple individuals of the wild silkworm, B. mandarina (the putative wild ancestor of B. mori), and in a number of other lepidopterans. We found the CBP gene copy number in genomic DNA to vary widely among B. mori strains, ranging from 1 to 20. The copies of CBP are of several types, based on the presence of a retrotransposon or partial deletion of the coding sequence. In contrast to B. mori, B. mandarina was found to possess a single copy of CBP without the retrotransposon insertion, regardless of habitat. Several other lepidopterans were found to contain sequences homologous to CBP, revealing that this gene is evolutionarily conserved in the lepidopteran lineage. Thus, domestication can generate significant diversity of gene copy number and structure over a relatively short evolutionary time. PMID:21242537
Ketide Synthase (KS) Domain Prediction and Analysis of Iterative Type II PKS Gene in Marine Sponge-Associated Actinobacteria Producing Biosurfactants and Antimicrobial Agents

PubMed Central

Selvin, Joseph; Sathiyanarayanan, Ganesan; Lipton, Anuj N.; Al-Dhabi, Naif Abdullah; Valan Arasu, Mariadhas; Kiran, George S.

2016-01-01

The important biological macromolecules, such as lipopeptide and glycolipid biosurfactant producing marine actinobacteria were analyzed and their potential linkage between type II polyketide synthase (PKS) genes was explored. A unique feature of type II PKS genes is their high amino acid (AA) sequence homology and conserved gene organization. These enzymes mediate the biosynthesis of polyketide natural products with enormous structural complexity and chemical nature by combinatorial use of various domains. Therefore, deciphering the order of AA sequence encoded by PKS domains tailored the chemical structure of polyketide analogs still remains a great challenge. The present work deals with an in vitro and in silico analysis of PKS type II genes from five actinobacterial species to correlate KS domain architecture and structural features. Our present analysis reveals the unique protein domain organization of iterative type II PKS and KS domain of marine actinobacteria. The findings of this study would have implications in metabolic pathway reconstruction and design of semi-synthetic genomes to achieve rational design of novel natural products. PMID:26903957
K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries.

PubMed

Jangid, Kamlesh; Kao, Ming-Hung; Lahamge, Aishwarya; Williams, Mark A; Rathbun, Stephen L; Whitman, William B

2016-01-01

K-shuff is a new algorithm for comparing the similarity of gene sequence libraries, providing measures of the structural and compositional diversity as well as the significance of the differences between these measures. Inspired by Ripley's K-function for spatial point pattern analysis, the Intra K-function or IKF measures the structural diversity, including both the richness and overall similarity of the sequences, within a library. The Cross K-function or CKF measures the compositional diversity between gene libraries, reflecting both the number of OTUs shared as well as the overall similarity in OTUs. A Monte Carlo testing procedure then enables statistical evaluation of both the structural and compositional diversity between gene libraries. For 16S rRNA gene libraries from complex bacterial communities such as those found in seawater, salt marsh sediments, and soils, K-shuff yields reproducible estimates of structural and compositional diversity with libraries greater than 50 sequences. Similarly, for pyrosequencing libraries generated from a glacial retreat chronosequence and Illumina® libraries generated from US homes, K-shuff required >300 and 100 sequences per sample, respectively. Power analyses demonstrated that K-shuff is sensitive to small differences in Sanger or Illumina® libraries. This extra sensitivity of K-shuff enabled examination of compositional differences at much deeper taxonomic levels, such as within abundant OTUs. This is especially useful when comparing communities that are compositionally very similar but functionally different. K-shuff will therefore prove beneficial for conventional microbiome analysis as well as specific hypothesis testing.
Purification and Biochemical Characterization of Mutacin I from the Group I Strain of Streptococcus mutans, CH43, and Genetic Analysis of Mutacin I Biosynthesis Genes

PubMed Central

Qi, Fengxia; Chen, Ping; Caufield, Page W.

2000-01-01

Previously, we reported isolation and characterization of mutacin III and genetic analysis of mutacin III biosynthesis genes from the group III strain of Streptococcus mutans, UA787 (F. Qi, P. Chen, and P. W. Caufield, Appl. Environ. Microbiol. 65:3880–3887, 1999). During the same process of isolating the mutacin III structural gene, we also cloned the structural gene for mutacin I. In this report, we present purification and biochemical characterization of mutacin I from the group I strain CH43 and compare mutacin I and mutacin III biosynthesis genes. The mutacin I biosynthesis gene locus consists of 14 genes in the order mutR, -A, -A′, -B, -C, -D, -P, -T, -F, -E, -G, orfX, orfY, orfZ. mutA is the structural gene for mutacin I, while mutA′ is not required for mutacin I activity. DNA and protein sequence analysis revealed that mutacins I and III are homologous to each other, possibly arising from a common ancestor. The mature mutacin I is 24 amino acids in size and has a molecular mass of 2,364 Da. Ethanethiol modification and peptide sequencing of mutacin I revealed that it contains six dehydrated serines, four of which are probably involved with thioether bridge formation. Comparison of the primary sequence of mutacin I with that of mutacin III and epidermin suggests that mutacin I likely has the same bridging pattern as epidermin. PMID:10919773
Purification and biochemical characterization of mutacin I from the group I strain of Streptococcus mutans, CH43, and genetic analysis of mutacin I biosynthesis genes.

PubMed

Qi, F; Chen, P; Caufield, P W

2000-08-01

Previously, we reported isolation and characterization of mutacin III and genetic analysis of mutacin III biosynthesis genes from the group III strain of Streptococcus mutans, UA787 (F. Qi, P. Chen, and P. W. Caufield, Appl. Environ. Microbiol. 65:3880-3887, 1999). During the same process of isolating the mutacin III structural gene, we also cloned the structural gene for mutacin I. In this report, we present purification and biochemical characterization of mutacin I from the group I strain CH43 and compare mutacin I and mutacin III biosynthesis genes. The mutacin I biosynthesis gene locus consists of 14 genes in the order mutR, -A, -A', -B, -C, -D, -P, -T, -F, -E, -G, orfX, orfY, orfZ. mutA is the structural gene for mutacin I, while mutA' is not required for mutacin I activity. DNA and protein sequence analysis revealed that mutacins I and III are homologous to each other, possibly arising from a common ancestor. The mature mutacin I is 24 amino acids in size and has a molecular mass of 2, 364 Da. Ethanethiol modification and peptide sequencing of mutacin I revealed that it contains six dehydrated serines, four of which are probably involved with thioether bridge formation. Comparison of the primary sequence of mutacin I with that of mutacin III and epidermin suggests that mutacin I likely has the same bridging pattern as epidermin.
Long-term balanced fertilization increases the soil microbial functional diversity in a phosphorus-limited paddy soil.

PubMed

Su, Jian-Qiang; Ding, Long-Jun; Xue, Kai; Yao, Huai-Ying; Quensen, John; Bai, Shi-Jie; Wei, Wen-Xue; Wu, Jin-Shui; Zhou, Jizhong; Tiedje, James M; Zhu, Yong-Guan

2015-01-01

The influence of long-term chemical fertilization on soil microbial communities has been one of the frontier topics of agricultural and environmental sciences and is critical for linking soil microbial flora with soil functions. In this study, 16S rRNA gene pyrosequencing and a functional gene array, geochip 4.0, were used to investigate the shifts in microbial composition and functional gene structure in paddy soils with different fertilization treatments over a 22-year period. These included a control without fertilizers; chemical nitrogen fertilizer (N); N and phosphate (NP); N and potassium (NK); and N, P and K (NPK). Based on 16S rRNA gene data, both species evenness and key genera were affected by P fertilization. Functional gene array-based analysis revealed that long-term fertilization significantly changed the overall microbial functional structures. Chemical fertilization significantly increased the diversity and abundance of most genes involved in C, N, P and S cycling, especially for the treatments NK and NPK. Significant correlations were found among functional gene structure and abundance, related soil enzymatic activities and rice yield, suggesting that a fertilizer-induced shift in the microbial community may accelerate the nutrient turnover in soil, which in turn influenced rice growth. The effect of N fertilization on soil microbial functional genes was mitigated by the addition of P fertilizer in this P-limited paddy soil, suggesting that balanced chemical fertilization is beneficial to the soil microbial community and its functions. © 2014 John Wiley & Sons Ltd.
Gene structure, phylogeny and expression profile of the sucrose synthase gene family in cacao (Theobroma cacao L.).

PubMed

Li, Fupeng; Hao, Chaoyun; Yan, Lin; Wu, Baoduo; Qin, Xiaowei; Lai, Jianxiong; Song, Yinghui

2015-09-01

In higher plants, sucrose synthase (Sus, EC 2.4.1.13) is widely considered as a key enzyme involved in sucrose metabolism. Although, several paralogous genes encoding different isozymes of Sus have been identified and characterized in multiple plant genomes, to date detailed information about the Sus genes is lacking for cacao. This study reports the identification of six novel Sus genes from economically important cacao tree. Analyses of the gene structure and phylogeny of the Sus genes demonstrated evolutionary conservation in the Sus family across cacao and other plant species. The expression of cacao Sus genes was investigated via real-time PCR in various tissues, different developmental phases of leaf, flower bud and pod. The Sus genes exhibited distinct but partially redundant expression profiles in cacao, with TcSus1, TcSus5 and TcSus6, being the predominant genes in the bark with phloem, TcSus2 predominantly expressing in the seed during the stereotype stage. TcSus3 and TcSus4 were significantly detected more in the pod husk and seed coat along the pod development, and showed development dependent expression profiles in the cacao pod. These results provide new insights into the evolution, and basic information that will assist in elucidating the functions of cacao Sus gene family.

Multifunctionality and diversity of GDSL esterase/lipase gene family in rice (Oryza sativa L. japonica) genome: new insights from bioinformatics analysis

PubMed Central

2012-01-01

Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
Comparative genomic analysis of the MHC: the evolution of class I duplication blocks, diversity and complexity from shark to man.

PubMed

Kulski, Jerzy K; Shiina, Takashi; Anzai, Tatsuya; Kohara, Sakae; Inoko, Hidetoshi

2002-12-01

The major histocompatibility complex (MHC) genomic region is composed of a group of linked genes involved functionally with the adaptive and innate immune systems. The class I and class II genes are intrinsic features of the MHC and have been found in all the jawed vertebrates studied so far. The MHC genomic regions of the human and the chicken (B locus) have been fully sequenced and mapped, and the mouse MHC sequence is almost finished. Information on the MHC genomic structures (size, complexity, genic and intergenic composition and organization, gene order and number) of other vertebrates is largely limited or nonexistent. Therefore, we are mapping, sequencing and analyzing the MHC genomic regions of different human haplotypes and at least eight nonhuman species. Here, we review our progress with these sequences and compare the human MHC structure with that of the nonhuman primates (chimpanzee and rhesus macaque), other mammals (pigs, mice and rats) and nonmammalian vertebrates such as birds (chicken and quail), bony fish (medaka, pufferfish and zebrafish) and cartilaginous fish (nurse shark). This comparison reveals a complex MHC structure for mammals and a relatively simpler design for nonmammalian animals with a hypothetical prototypic structure for the shark. In the mammalian MHC, there are two to five different class I duplication blocks embedded within a framework of conserved nonclass I and/or nonclass II genes. With a few exceptions, the class I framework genes are absent from the MHC of birds, bony fish and sharks. Comparative genomics of the MHC reveal a highly plastic region with major structural differences between the mammalian and nonmammalian vertebrates. Additional genomic data are needed on animals of the reptilia, crocodilia and marsupial classes to find the origins of the class I framework genes and examples of structures that may be intermediate between the simple and complex MHC organizations of birds and mammals, respectively.
Combinatorial explosion in model gene networks

NASA Astrophysics Data System (ADS)

Edwards, R.; Glass, L.

2000-09-01

The explosive growth in knowledge of the genome of humans and other organisms leaves open the question of how the functioning of genes in interacting networks is coordinated for orderly activity. One approach to this problem is to study mathematical properties of abstract network models that capture the logical structures of gene networks. The principal issue is to understand how particular patterns of activity can result from particular network structures, and what types of behavior are possible. We study idealized models in which the logical structure of the network is explicitly represented by Boolean functions that can be represented by directed graphs on n-cubes, but which are continuous in time and described by differential equations, rather than being updated synchronously via a discrete clock. The equations are piecewise linear, which allows significant analysis and facilitates rapid integration along trajectories. We first give a combinatorial solution to the question of how many distinct logical structures exist for n-dimensional networks, showing that the number increases very rapidly with n. We then outline analytic methods that can be used to establish the existence, stability and periods of periodic orbits corresponding to particular cycles on the n-cube. We use these methods to confirm the existence of limit cycles discovered in a sample of a million randomly generated structures of networks of 4 genes. Even with only 4 genes, at least several hundred different patterns of stable periodic behavior are possible, many of them surprisingly complex. We discuss ways of further classifying these periodic behaviors, showing that small mutations (reversal of one or a few edges on the n-cube) need not destroy the stability of a limit cycle. Although these networks are very simple as models of gene networks, their mathematical transparency reveals relationships between structure and behavior, they suggest that the possibilities for orderly dynamics in such networks are extremely rich and they offer novel ways to think about how mutations can alter dynamics.
Combinatorial explosion in model gene networks.

PubMed

Edwards, R.; Glass, L.

2000-09-01

The explosive growth in knowledge of the genome of humans and other organisms leaves open the question of how the functioning of genes in interacting networks is coordinated for orderly activity. One approach to this problem is to study mathematical properties of abstract network models that capture the logical structures of gene networks. The principal issue is to understand how particular patterns of activity can result from particular network structures, and what types of behavior are possible. We study idealized models in which the logical structure of the network is explicitly represented by Boolean functions that can be represented by directed graphs on n-cubes, but which are continuous in time and described by differential equations, rather than being updated synchronously via a discrete clock. The equations are piecewise linear, which allows significant analysis and facilitates rapid integration along trajectories. We first give a combinatorial solution to the question of how many distinct logical structures exist for n-dimensional networks, showing that the number increases very rapidly with n. We then outline analytic methods that can be used to establish the existence, stability and periods of periodic orbits corresponding to particular cycles on the n-cube. We use these methods to confirm the existence of limit cycles discovered in a sample of a million randomly generated structures of networks of 4 genes. Even with only 4 genes, at least several hundred different patterns of stable periodic behavior are possible, many of them surprisingly complex. We discuss ways of further classifying these periodic behaviors, showing that small mutations (reversal of one or a few edges on the n-cube) need not destroy the stability of a limit cycle. Although these networks are very simple as models of gene networks, their mathematical transparency reveals relationships between structure and behavior, they suggest that the possibilities for orderly dynamics in such networks are extremely rich and they offer novel ways to think about how mutations can alter dynamics. (c) 2000 American Institute of Physics.
Extreme Mutation Tolerance: Nearly Half of the Archaeal Fusellovirus Sulfolobus Spindle-Shaped Virus 1 Genes Are Not Required for Virus Function, Including the Minor Capsid Protein Gene vp3

PubMed Central

Iverson, Eric A.; Goodman, David A.; Gorchels, Madeline E.

2017-01-01

ABSTRACT Viruses infecting the Archaea harbor a tremendous amount of genetic diversity. This is especially true for the spindle-shaped viruses of the family Fuselloviridae, where >90% of the viral genes do not have detectable homologs in public databases. This significantly limits our ability to elucidate the role of viral proteins in the infection cycle. To address this, we have developed genetic techniques to study the well-characterized fusellovirus Sulfolobus spindle-shaped virus 1 (SSV1), which infects Sulfolobus solfataricus in volcanic hot springs at 80°C and pH 3. Here, we present a new comparative genome analysis and a thorough genetic analysis of SSV1 using both specific and random mutagenesis and thereby generate mutations in all open reading frames. We demonstrate that almost half of the SSV1 genes are not essential for infectivity, and the requirement for a particular gene correlates well with its degree of conservation within the Fuselloviridae. The major capsid gene vp1 is essential for SSV1 infectivity. However, the universally conserved minor capsid gene vp3 could be deleted without a loss in infectivity and results in virions with abnormal morphology. IMPORTANCE Most of the putative genes in the spindle-shaped archaeal hyperthermophile fuselloviruses have no sequences that are clearly similar to characterized genes. In order to determine which of these SSV genes are important for function, we disrupted all of the putative genes in the prototypical fusellovirus, SSV1. Surprisingly, about half of the genes could be disrupted without destroying virus function. Even deletions of one of the known structural protein genes that is present in all known fuselloviruses, vp3, allows the production of infectious viruses. However, viruses lacking vp3 have abnormal shapes, indicating that the vp3 gene is important for virus structure. Identification of essential genes will allow focused research on minimal SSV genomes and further understanding of the structure of these unique, ubiquitous, and extremely stable archaeal viruses. PMID:28148789
Extreme Mutation Tolerance: Nearly Half of the Archaeal Fusellovirus Sulfolobus Spindle-Shaped Virus 1 Genes Are Not Required for Virus Function, Including the Minor Capsid Protein Gene vp3.

PubMed

Iverson, Eric A; Goodman, David A; Gorchels, Madeline E; Stedman, Kenneth M

2017-05-15

Viruses infecting the Archaea harbor a tremendous amount of genetic diversity. This is especially true for the spindle-shaped viruses of the family Fuselloviridae , where >90% of the viral genes do not have detectable homologs in public databases. This significantly limits our ability to elucidate the role of viral proteins in the infection cycle. To address this, we have developed genetic techniques to study the well-characterized fusellovirus Sulfolobus spindle-shaped virus 1 (SSV1), which infects Sulfolobus solfataricus in volcanic hot springs at 80°C and pH 3. Here, we present a new comparative genome analysis and a thorough genetic analysis of SSV1 using both specific and random mutagenesis and thereby generate mutations in all open reading frames. We demonstrate that almost half of the SSV1 genes are not essential for infectivity, and the requirement for a particular gene correlates well with its degree of conservation within the Fuselloviridae The major capsid gene vp1 is essential for SSV1 infectivity. However, the universally conserved minor capsid gene vp3 could be deleted without a loss in infectivity and results in virions with abnormal morphology. IMPORTANCE Most of the putative genes in the spindle-shaped archaeal hyperthermophile fuselloviruses have no sequences that are clearly similar to characterized genes. In order to determine which of these SSV genes are important for function, we disrupted all of the putative genes in the prototypical fusellovirus, SSV1. Surprisingly, about half of the genes could be disrupted without destroying virus function. Even deletions of one of the known structural protein genes that is present in all known fuselloviruses, vp3 , allows the production of infectious viruses. However, viruses lacking vp3 have abnormal shapes, indicating that the vp3 gene is important for virus structure. Identification of essential genes will allow focused research on minimal SSV genomes and further understanding of the structure of these unique, ubiquitous, and extremely stable archaeal viruses. Copyright © 2017 American Society for Microbiology.
Structure and transcriptional regulation of the major intrinsic protein gene family in grapevine.

PubMed

Wong, Darren Chern Jan; Zhang, Li; Merlin, Isabelle; Castellarin, Simone D; Gambetta, Gregory A

2018-04-11

The major intrinsic protein (MIP) family is a family of proteins, including aquaporins, which facilitate water and small molecule transport across plasma membranes. In plants, MIPs function in a huge variety of processes including water transport, growth, stress response, and fruit development. In this study, we characterize the structure and transcriptional regulation of the MIP family in grapevine, describing the putative genome duplication events leading to the family structure and characterizing the family's tissue and developmental specific expression patterns across numerous preexisting microarray and RNAseq datasets. Gene co-expression network (GCN) analyses were carried out across these datasets and the promoters of each family member were analyzed for cis-regulatory element structure in order to provide insight into their transcriptional regulation. A total of 29 Vitis vinifera MIP family members (excluding putative pseudogenes) were identified of which all but two were mapped onto Vitis vinifera chromosomes. In this study, segmental duplication events were identified for five plasma membrane intrinsic protein (PIP) and four tonoplast intrinsic protein (TIP) genes, contributing to the expansion of PIPs and TIPs in grapevine. Grapevine MIP family members have distinct tissue and developmental expression patterns and hierarchical clustering revealed two primary groups regardless of the datasets analyzed. Composite microarray and RNA-seq gene co-expression networks (GCNs) highlighted the relationships between MIP genes and functional categories involved in cell wall modification and transport, as well as with other MIPs revealing a strong co-regulation within the family itself. Some duplicated MIP family members have undergone sub-functionalization and exhibit distinct expression patterns and GCNs. Cis-regulatory element (CRE) analyses of the MIP promoters and their associated GCN members revealed enrichment for numerous CREs including AP2/ERFs and NACs. Combining phylogenetic analyses, gene expression profiling, gene co-expression network analyses, and cis-regulatory element enrichment, this study provides a comprehensive overview of the structure and transcriptional regulation of the grapevine MIP family. The study highlights the duplication and sub-functionalization of the family, its strong coordinated expression with genes involved in growth and transport, and the putative classes of TFs responsible for its regulation.
Distribution of mutations in the PEX gene in families with X-linked hypophosphataemic rickets (HYP).

PubMed

Rowe, P S; Oudet, C L; Francis, F; Sinding, C; Pannetier, S; Econs, M J; Strom, T M; Meitinger, T; Garabedian, M; David, A; Macher, M A; Questiaux, E; Popowska, E; Pronicka, E; Read, A P; Mokrzycki, A; Glorieux, F H; Drezner, M K; Hanauer, A; Lehrach, H; Goulding, J N; O'Riordan, J L

1997-04-01

Mutations in the PEX gene at Xp22.1 (phosphate-regulating gene with homologies to endopeptidases, on the X-chromosome), are responsible for X-linked hypophosphataemic rickets (HYP). Homology of PEX to the M13 family of Zn2+ metallopeptidases which include neprilysin (NEP) as prototype, has raised important questions regarding PEX function at the molecular level. The aim of this study was to analyse 99 HYP families for PEX gene mutations, and to correlate predicted changes in the protein structure with Zn2+ metallopeptidase gene function. Primers flanking 22 characterised exons were used to amplify DNA by PCR, and SSCP was then used to screen for mutations. Deletions, insertions, nonsense mutations, stop codons and splice mutations occurred in 83% of families screened for in all 22 exons, and 51% of a separate set of families screened in 17 PEX gene exons. Missense mutations in four regions of the gene were informative regarding function, with one mutation in the Zn2+-binding site predicted to alter substrate enzyme interaction and catalysis. Computer analysis of the remaining mutations predicted changes in secondary structure, N-glycosylation, protein phosphorylation and catalytic site molecular structure. The wide range of mutations that align with regions required for protease activity in NEP suggests that PEX also functions as a protease, and may act by processing factor(s) involved in bone mineral metabolism.
Analysis of Flavonoids and the Flavonoid Structural Genes in Brown Fiber of Upland Cotton

PubMed Central

Liu, Yongchang; Li, Yanjun; Zhang, Xinyu; Jones, Brian Joseph; Sun, Yuqiang; Sun, Jie

2013-01-01

Backgroud As a result of changing consumer preferences, cotton (Gossypium Hirsutum L.) from varieties with naturally colored fibers is becoming increasingly sought after in the textile industry. The molecular mechanisms leading to colored fiber development are still largely unknown, although it is expected that the color is derived from flavanoids. Experimental Design Firstly, four key genes of the flavonoid biosynthetic pathway in cotton (GhC4H, GhCHS, GhF3′H, and GhF3′5′H) were cloned and studied their expression profiles during the development of brown- and white cotton fibers by QRT-PCR. And then, the concentrations of four components of the flavonoid biosynthetic pathway, naringenin, quercetin, kaempferol and myricetin in brown- and white fibers were analyzed at different developmental stages by HPLC. Result The predicted proteins of the four flavonoid structural genes corresponding to these genes exhibit strong sequence similarity to their counterparts in various plant species. Transcript levels for all four genes were considerably higher in developing brown fibers than in white fibers from a near isogenic line (NIL). The contents of four flavonoids (naringenin, quercetin, kaempferol and myricetin) were significantly higher in brown than in white fibers and corresponding to the biosynthetic gene expression levels. Conclusions Flavonoid structural gene expression and flavonoid metabolism are important in the development of pigmentation in brown cotton fibers. PMID:23527031
Genetic and epigenetic alteration among three homoeologous genes of a class E MADS box gene in hexaploid wheat.

PubMed

Shitsukawa, Naoki; Tahira, Chikako; Kassai, Ken-Ichiro; Hirabayashi, Chizuru; Shimizu, Tomoaki; Takumi, Shigeo; Mochida, Keiichi; Kawaura, Kanako; Ogihara, Yasunari; Murai, Koji

2007-06-01

Bread wheat (Triticum aestivum) is a hexaploid species with A, B, and D ancestral genomes. Most bread wheat genes are present in the genome as triplicated homoeologous genes (homoeologs) derived from the ancestral species. Here, we report that both genetic and epigenetic alterations have occurred in the homoeologs of a wheat class E MADS box gene. Two class E genes are identified in wheat, wheat SEPALLATA (WSEP) and wheat LEAFY HULL STERILE1 (WLHS1), which are homologs of Os MADS45 and Os MADS1 in rice (Oryza sativa), respectively. The three wheat homoeologs of WSEP showed similar genomic structures and expression profiles. By contrast, the three homoeologs of WLHS1 showed genetic and epigenetic alterations. The A genome WLHS1 homoeolog (WLHS1-A) had a structural alteration that contained a large novel sequence in place of the K domain sequence. A yeast two-hybrid analysis and a transgenic experiment indicated that the WLHS1-A protein had no apparent function. The B and D genome homoeologs, WLHS1-B and WLHS1-D, respectively, had an intact MADS box gene structure, but WLHS1-B was predominantly silenced by cytosine methylation. Consequently, of the three WLHS1 homoeologs, only WLHS1-D functions in hexaploid wheat. This is a situation where three homoeologs are differentially regulated by genetic and epigenetic mechanisms.
The role of gene flow in shaping genetic structures of the subtropical conifer species Araucaria angustifolia.

PubMed

Stefenon, V M; Gailing, O; Finkeldey, R

2008-05-01

The morphological features of pollen and seed of Araucaria angustifolia have led to the proposal of limited gene dispersal for this species. We used nuclear microsatellite and AFLP markers to assess patterns of genetic variation in six natural populations at the intra- and inter-population level, and related our findings to gene dispersal in this species. Estimates of both fine-scale spatial genetic structure (SGS) and migration rate suggest relatively short-distance gene dispersal. However, gene dispersal differed among populations, and effects of more efficient dispersal within population were observed in at least one stand. In addition, even though some seed dispersal may be aggregated in this principally barochorous species, reasonable secondary seed dispersal, presumably facilitated by animals, and overlap of seed shadows within populations is suggested. Overall, no correlation was observed between levels of SGS and inbreeding, density or age structure, except that a higher level of SGS was revealed for the population with a higher number of juvenile individuals. A low estimate for the number of migrants per generation between two neighbouring populations implies limited gene flow. We expect that stepping-stone pollen flow may have contributed to low genetic differentiation among populations observed in a previous survey. Thus, strategies for maintenance of gene flow among remnant populations should be considered in order to avoid degrading effects of population fragmentation on the evolution of A. angustifolia.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

PubMed

Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

2015-03-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica , which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

PubMed Central

DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

2015-01-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
Physical structure and chromosomal localization of a gene encoding human p58[sup clk-1], a cell division control related protein kinase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eipers, P.G.

1992-01-01

The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less
Using secondary structure to identify ribosomal numts: cautionary examples from the human genome.

PubMed

Olson, Link E; Yoder, Anne D

2002-01-01

The identification of inadvertently sequenced mitochondrial pseudogenes (numts) is critical to any study employing mitochondrial DNA sequence data. Failure to discriminate numts correctly can confound phylogenetic reconstruction and studies of molecular evolution. This is especially problematic for ribosomal mtDNA genes. Unlike protein-coding loci, whose pseudogenes tend to accumulate diagnostic frameshift or premature stop mutations, functional ribosomal genes are not constrained to maintain a reading frame and can accumulate insertion-deletion events of varying length, particularly in nonpairing regions. Several authors have advocated using structural features of the transcribed rRNA molecule to differentiate functional mitochondrial rRNA genes from their nuclear paralogs. We explored this approach using the mitochondrial 12S rRNA gene and three known 12S numts from the human genome in the context of anthropoid phylogeny and the inferred secondary structure of primate 12S rRNA. Contrary to expectation, each of the three human numts exhibits striking concordance with secondary structure models, with little, if any, indication of their pseudogene status, and would likely escape detection based on structural criteria alone. Furthermore, we show that the unwitting inclusion of a particularly ancient (18-25 Myr old) and surprisingly cryptic human numt in a phylogenetic analysis would yield a well-supported but dramatically incorrect conclusion regarding anthropoid relationships. Though we endorse the use of secondary structure models for inferring positional homology wholeheartedly, we caution against reliance on structural criteria for the discrimination of rRNA numts, given the potential fallibility of this approach.
Structure and polymorphism of the mouse myelin/oligodendrocyte glycoprotein gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Daubas, P.; Pham-Dinh, D.; Dautigny, A.

1994-09-01

The authors have isolated and characterized genomic clones containing the mouse myelin/oligodendrocyte glycoprotein (MOG) gene. It spans a region of 12.5 kb and consists of eight exons. Its exon-intron structure differs from that of classical MHC-class I genes, with which it is linked in the mouse genome. Nucleotide sequencing of the 5{prime} flanking region revelas that it contains several putative protein-binding sites, some of them in common with other myelin gene promoters. One intragenic polymorphism has been identified: it consists of a GA repeat, defining at least three alleles in mouse inbred strains, and is easily detectable using the polymerasemore » chain reaction method.« less
Inhibitors of enzymes catalyzing modifications to histone lysine residues: structure, function and activity.

PubMed

Lillico, Ryan; Stesco, Nicholas; Khorshid Amhad, Tina; Cortes, Claudia; Namaka, Mike P; Lakowski, Ted M

2016-05-01

Gene expression is partly controlled by epigenetic mechanisms including histone-modifying enzymes. Some diseases are caused by changes in gene expression that can be mitigated by inhibiting histone-modifying enzymes. This review covers the enzyme inhibitors targeting histone lysine modifications. We summarize the enzymatic mechanisms of histone lysine acetylation, deacetylation, methylation and demethylation and discuss the biochemical roles of these modifications in gene expression and in disease. We discuss inhibitors of lysine acetylation, deacetylation, methylation and demethylation defining their structure-activity relationships and their potential mechanisms. We show that there are potentially indiscriminant off-target effects on gene expression even with the use of selective epigenetic enzyme inhibitors.
Molecular mechanisms of floral organ specification by MADS domain proteins.

PubMed

Yan, Wenhao; Chen, Dijun; Kaufmann, Kerstin

2016-02-01

Flower development is a model system to understand organ specification in plants. The identities of different types of floral organs are specified by homeotic MADS transcription factors that interact in a combinatorial fashion. Systematic identification of DNA-binding sites and target genes of these key regulators show that they have shared and unique sets of target genes. DNA binding by MADS proteins is not based on 'simple' recognition of a specific DNA sequence, but depends on DNA structure and combinatorial interactions. Homeotic MADS proteins regulate gene expression via alternative mechanisms, one of which may be to modulate chromatin structure and accessibility in their target gene promoters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Identification and characterization of new members of the SXT/R391 family of integrative and conjugative elements (ICEs) in Proteus mirabilis.

PubMed

Bie, Luyao; Wu, Hao; Wang, Xin-Hua; Wang, Mingyu; Xu, Hai

2017-08-01

Integrative and conjugative elements (ICEs) are self-transmissible chromosomal mobile elements that play significant roles in the dissemination of antimicrobial resistance genes. Identification of the structures and functions of ICEs, particularly those in pathogens, improves understanding of the dissemination of antimicrobial resistance. This study identified new members of the sulfamethoxazole-trimethoprim (SXT)/R391 family of ICEs that could confer multi-drug resistance in the opportunistic pathogen Proteus mirabilis, characterized their genetic structures, and explored their evolutionary connection with other members of this family of ICEs. Three new members of the SXT/R391 family of ICEs were detected in six of 77 P. mirabilis strains isolated in China: ICEPmiChn2 (one strain), ICEPmiChn3 (one strain) and ICEPmiChn4 (three strains). All three new ICEs harbour antimicrobial resistance genes from diverse origins, suggesting their capability in acquiring foreign genes and serving as important carriers for antimicrobial resistance genes. Structural analysis showed that ICEPmiChn3 is a particularly interesting and unique ICE that has lost core genes involved in conjugation, and could not transfer to other cells via conjugation. This finding confirmed the key roles of these missing genes in conjugation. Further phylogenetic analysis suggested that ICEs in geographically close strains are also connected evolutionarily, and ICEPmiChn3 lost its conjugation cassette from a former mobile ICE. The identification and characterization of the three new members of the SXT/R391 family of ICEs in this work leads to suggestions of core ICE genes essential for conjugation, and extends understanding on the structures of ICEs, evolutionary relationships between ICEs, and the antimicrobial resistance mechanisms of P. mirabilis. Copyright © 2017 Elsevier B.V. and International Society of Chemotherapy. All rights reserved.
Fine-Scale Analysis Reveals Cryptic Landscape Genetic Structure in Desert Tortoises

PubMed Central

Latch, Emily K.; Boarman, William I.; Walde, Andrew; Fleischer, Robert C.

2011-01-01

Characterizing the effects of landscape features on genetic variation is essential for understanding how landscapes shape patterns of gene flow and spatial genetic structure of populations. Most landscape genetics studies have focused on patterns of gene flow at a regional scale. However, the genetic structure of populations at a local scale may be influenced by a unique suite of landscape variables that have little bearing on connectivity patterns observed at broader spatial scales. We investigated fine-scale spatial patterns of genetic variation and gene flow in relation to features of the landscape in desert tortoise (Gopherus agassizii), using 859 tortoises genotyped at 16 microsatellite loci with associated data on geographic location, sex, elevation, slope, and soil type, and spatial relationship to putative barriers (power lines, roads). We used spatially explicit and non-explicit Bayesian clustering algorithms to partition the sample into discrete clusters, and characterize the relationships between genetic distance and ecological variables to identify factors with the greatest influence on gene flow at a local scale. Desert tortoises exhibit weak genetic structure at a local scale, and we identified two subpopulations across the study area. Although genetic differentiation between the subpopulations was low, our landscape genetic analysis identified both natural (slope) and anthropogenic (roads) landscape variables that have significantly influenced gene flow within this local population. We show that desert tortoise movements at a local scale are influenced by features of the landscape, and that these features are different than those that influence gene flow at larger scales. Our findings are important for desert tortoise conservation and management, particularly in light of recent translocation efforts in the region. More generally, our results indicate that recent landscape changes can affect gene flow at a local scale and that their effects can be detected almost immediately. PMID:22132143

Fine-scale analysis reveals cryptic landscape genetic structure in desert tortoises.

PubMed

Latch, Emily K; Boarman, William I; Walde, Andrew; Fleischer, Robert C

2011-01-01

Characterizing the effects of landscape features on genetic variation is essential for understanding how landscapes shape patterns of gene flow and spatial genetic structure of populations. Most landscape genetics studies have focused on patterns of gene flow at a regional scale. However, the genetic structure of populations at a local scale may be influenced by a unique suite of landscape variables that have little bearing on connectivity patterns observed at broader spatial scales. We investigated fine-scale spatial patterns of genetic variation and gene flow in relation to features of the landscape in desert tortoise (Gopherus agassizii), using 859 tortoises genotyped at 16 microsatellite loci with associated data on geographic location, sex, elevation, slope, and soil type, and spatial relationship to putative barriers (power lines, roads). We used spatially explicit and non-explicit Bayesian clustering algorithms to partition the sample into discrete clusters, and characterize the relationships between genetic distance and ecological variables to identify factors with the greatest influence on gene flow at a local scale. Desert tortoises exhibit weak genetic structure at a local scale, and we identified two subpopulations across the study area. Although genetic differentiation between the subpopulations was low, our landscape genetic analysis identified both natural (slope) and anthropogenic (roads) landscape variables that have significantly influenced gene flow within this local population. We show that desert tortoise movements at a local scale are influenced by features of the landscape, and that these features are different than those that influence gene flow at larger scales. Our findings are important for desert tortoise conservation and management, particularly in light of recent translocation efforts in the region. More generally, our results indicate that recent landscape changes can affect gene flow at a local scale and that their effects can be detected almost immediately.
First report of a deletion encompassing an entire exon in the homogentisate 1,2-dioxygenase gene causing alkaptonuria.

PubMed

Zouheir Habbal, Mohammad; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F

2014-01-01

Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5-16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband's phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin.
First Report of a Deletion Encompassing an Entire Exon in the Homogentisate 1,2-Dioxygenase Gene Causing Alkaptonuria

PubMed Central

Habbal, Mohammad Zouheir; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F.

2014-01-01

Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5–16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband’s phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin. PMID:25233259
Evidence of IgY subclass diversification in snakes: evolutionary implications.

PubMed

Wang, Tao; Sun, Yi; Shao, Wenwei; Cheng, Gang; Li, Lingxiao; Cao, Zubing; Yang, Zhi; Zou, Huiying; Zhang, Wei; Han, Binyue; Hu, Yang; Ren, Liming; Hu, Xiaoxiang; Guo, Ying; Fei, Jing; Hammarström, Lennart; Li, Ning; Zhao, Yaofeng

2012-10-01

Mammalian IgG and IgE are thought to have evolved from IgY of nonmammalian tetrapods; however, no diversification of IgY subclasses has been reported in reptiles or birds, which are phylogenetically close to mammals. To our knowledge, we report the first evidence of the presence of multiple IgY-encoding (υ) genes in snakes. Two υ genes were identified in the snake Elaphe taeniura, and three υ genes were identified in the Burmese python (Python molurus bivittatus). Although four of the υ genes displayed a conventional four-H chain C region exon structure, one of the υ genes in the Burmese python lacked the H chain C region 2 exon, thus exhibiting a structure similar to that of the mammalian γ genes. We developed mouse mAbs specific for the IgY1 and IgY2 of E. taeniura and showed that both were expressed in serum; each had two isoforms: one full-length and one truncated at the C terminus. The truncation was not caused by alternative splicing or transcriptional termination. We also identified the μ and δ genes, but no α gene, in both snakes. This study provides valuable clues for our understanding of Ig gene evolution in tetrapods.
Diversification of Root Hair Development Genes in Vascular Plants.

PubMed

Huang, Ling; Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui; Schiefelbein, John

2017-07-01

The molecular genetic program for root hair development has been studied intensively in Arabidopsis ( Arabidopsis thaliana ). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Diversification of Root Hair Development Genes in Vascular Plants1[OPEN

PubMed Central

Shi, Xinhui; Wang, Wenjia; Ryu, Kook Hui

2017-01-01

The molecular genetic program for root hair development has been studied intensively in Arabidopsis (Arabidopsis thaliana). To understand the extent to which this program might operate in other plants, we conducted a large-scale comparative analysis of root hair development genes from diverse vascular plants, including eudicots, monocots, and a lycophyte. Combining phylogenetics and transcriptomics, we discovered conservation of a core set of root hair genes across all vascular plants, which may derive from an ancient program for unidirectional cell growth coopted for root hair development during vascular plant evolution. Interestingly, we also discovered preferential diversification in the structure and expression of root hair development genes, relative to other root hair- and root-expressed genes, among these species. These differences enabled the definition of sets of genes and gene functions that were acquired or lost in specific lineages during vascular plant evolution. In particular, we found substantial divergence in the structure and expression of genes used for root hair patterning, suggesting that the Arabidopsis transcriptional regulatory mechanism is not shared by other species. To our knowledge, this study provides the first comprehensive view of gene expression in a single plant cell type across multiple species. PMID:28487476
Analyses of the NAC transcription factor gene family in Gossypium raimondii Ulbr.: chromosomal location, structure, phylogeny, and expression patterns.

PubMed

Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu

2013-07-01

NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.
Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus)

PubMed Central

Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

2016-01-01

The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269
Genome-Wide Identification and Transcriptome-Based Expression Profiling of the Sox Gene Family in the Nile Tilapia (Oreochromis niloticus).

PubMed

Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou

2016-02-23

The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
Genome-Wide Analyses of the NAC Transcription Factor Gene Family in Pepper (Capsicum annuum L.): Chromosome Location, Phylogeny, Structure, Expression Patterns, Cis-Elements in the Promoter, and Interaction Network

PubMed Central

Diao, Weiping; Snyder, John C.; Liu, Jinbing; Pan, Baogui; Guo, Guangjun; Ge, Wei; Dawood, Mohammad Hasan Salman Ali

2018-01-01

The NAM, ATAF1/2, and CUC2 (NAC) transcription factors form a large plant-specific gene family, which is involved in the regulation of tissue development in response to biotic and abiotic stress. To date, there have been no comprehensive studies investigating chromosomal location, gene structure, gene phylogeny, conserved motifs, or gene expression of NAC in pepper (Capsicum annuum L.). The recent release of the complete genome sequence of pepper allowed us to perform a genome-wide investigation of Capsicum annuum L. NAC (CaNAC) proteins. In the present study, a comprehensive analysis of the CaNAC gene family in pepper was performed, and a total of 104 CaNAC genes were identified. Genome mapping analysis revealed that CaNAC genes were enriched on four chromosomes (chromosomes 1, 2, 3, and 6). In addition, phylogenetic analysis of the NAC domains from pepper, potato, Arabidopsis, and rice showed that CaNAC genes could be clustered into three groups (I, II, and III). Group III, which contained 24 CaNAC genes, was exclusive to the Solanaceae plant family. Gene structure and protein motif analyses showed that these genes were relatively conserved within each subgroup. The number of introns in CaNAC genes varied from 0 to 8, with 83 (78.9%) of CaNAC genes containing two or less introns. Promoter analysis confirmed that CaNAC genes are involved in pepper growth, development, and biotic or abiotic stress responses. Further, the expression of 22 selected CaNAC genes in response to seven different biotic and abiotic stresses [salt, heat shock, drought, Phytophthora capsici, abscisic acid, salicylic acid (SA), and methyl jasmonate (MeJA)] was evaluated by quantitative RT-PCR to determine their stress-related expression patterns. Several putative stress-responsive CaNAC genes, including CaNAC72 and CaNAC27, which are orthologs of the known stress-responsive Arabidopsis gene ANAC055 and potato gene StNAC30, respectively, were highly regulated by treatment with different types of stress. Our results also showed that CaNAC36 plays an important role in the interaction network, interacting with 48 genes. Most of these genes are in the mitogen-activated protein kinase (MAPK) family. Taken together, our results provide a platform for further studies to identify the biological functions of CaNAC genes. PMID:29596349
Evolution of structural diversity of trichothecenes, a family of toxins produced by plant pathogenic and entomopathogenic fungi

PubMed Central

McCormick, Susan P.; Lee, Theresa; Vaughan, Martha M.; Alexander, Nancy J.; Busman, Mark

2018-01-01

Trichothecenes are a family of terpenoid toxins produced by multiple genera of fungi, including plant and insect pathogens. Some trichothecenes produced by the fungus Fusarium are among the mycotoxins of greatest concern to food and feed safety because of their toxicity and frequent occurrence in cereal crops, and trichothecene production contributes to pathogenesis of some Fusarium species on plants. Collectively, fungi produce over 150 trichothecene analogs: i.e., molecules that share the same core structure but differ in patterns of substituents attached to the core structure. Here, we carried out genomic, phylogenetic, gene-function, and analytical chemistry studies of strains from nine fungal genera to identify genetic variation responsible for trichothecene structural diversity and to gain insight into evolutionary processes that have contributed to the variation. The results indicate that structural diversity has resulted from gain, loss, and functional changes of trichothecene biosynthetic (TRI) genes. The results also indicate that the presence of some substituents has arisen independently in different fungi by gain of different genes with the same function. Variation in TRI gene duplication and number of TRI loci was also observed among the fungi examined, but there was no evidence that such genetic differences have contributed to trichothecene structural variation. We also inferred ancestral states of the TRI cluster and trichothecene biosynthetic pathway, and proposed scenarios for changes in trichothecene structures during divergence of TRI cluster homologs. Together, our findings provide insight into evolutionary processes responsible for structural diversification of toxins produced by pathogenic fungi. PMID:29649280
Evolution of structural diversity of trichothecenes, a family of toxins produced by plant pathogenic and entomopathogenic fungi.

PubMed

Proctor, Robert H; McCormick, Susan P; Kim, Hye-Seon; Cardoza, Rosa E; Stanley, April M; Lindo, Laura; Kelly, Amy; Brown, Daren W; Lee, Theresa; Vaughan, Martha M; Alexander, Nancy J; Busman, Mark; Gutiérrez, Santiago

2018-04-01

Trichothecenes are a family of terpenoid toxins produced by multiple genera of fungi, including plant and insect pathogens. Some trichothecenes produced by the fungus Fusarium are among the mycotoxins of greatest concern to food and feed safety because of their toxicity and frequent occurrence in cereal crops, and trichothecene production contributes to pathogenesis of some Fusarium species on plants. Collectively, fungi produce over 150 trichothecene analogs: i.e., molecules that share the same core structure but differ in patterns of substituents attached to the core structure. Here, we carried out genomic, phylogenetic, gene-function, and analytical chemistry studies of strains from nine fungal genera to identify genetic variation responsible for trichothecene structural diversity and to gain insight into evolutionary processes that have contributed to the variation. The results indicate that structural diversity has resulted from gain, loss, and functional changes of trichothecene biosynthetic (TRI) genes. The results also indicate that the presence of some substituents has arisen independently in different fungi by gain of different genes with the same function. Variation in TRI gene duplication and number of TRI loci was also observed among the fungi examined, but there was no evidence that such genetic differences have contributed to trichothecene structural variation. We also inferred ancestral states of the TRI cluster and trichothecene biosynthetic pathway, and proposed scenarios for changes in trichothecene structures during divergence of TRI cluster homologs. Together, our findings provide insight into evolutionary processes responsible for structural diversification of toxins produced by pathogenic fungi.
Structural studies of CNG repeats

PubMed Central

Kiliszek, Agnieszka; Rypniewski, Wojciech

2014-01-01

CNG repeats (where N denotes one of the four natural nucleotides) are abundant in the human genome. Their tendency to undergo expansion can lead to hereditary diseases known as TREDs (trinucleotide repeat expansion disorders). The toxic factor can be protein, if the abnormal gene is expressed, or the gene transcript, or both. The gene transcripts have attracted much attention in the biomedical community, but their molecular structures have only recently been investigated. Model RNA molecules comprising CNG repeats fold into long hairpins whose stems generally conform to an A-type helix, in which the non-canonical N-N pairs are flanked by C-G and G-C pairs. Each homobasic pair is accommodated in the helical context in a unique manner, with consequences for the local helical parameters, solvent structure, electrostatic potential and potential to interact with ligands. The detailed three-dimensional profiles of RNA CNG repeats can be used in screening of compound libraries for potential therapeutics and in structure-based drug design. Here is a brief survey of the CNG structures published to date. PMID:24939898
Structural and transcription analysis of two homologous genes for the P700 chlorophyll a-apoproteins in Chlamydomonas reinhardii: evidence for in vivo trans-splicing

PubMed Central

Kück, Ulrich; Choquet, Yves; Schneider, Michel; Dron, Michel; Bennoun, Pierre

1987-01-01

The two homologous genes for the P700 chlorophyll a-apoproteins (ps1A1 and ps1A2) are encoded by the plastom in the green alga Chlamydomonas reinhardii. The structure and organization of the two genes were determined by comparison with the homologous genes from maize using data from heterologous hybridizations as well as from DNA and RNA sequencing. While the ps1A2 (736 codons) gene shows a continuous gene organization, the ps1A1 (754 codons) gene possesses some unusual features. The discontinuous gene is split into three separate exons which are scattered around the circular chloroplast genome. Exon 1 (86 bp) is separated by ∼50 kb from exon 2 (198 bp), which is located ∼ 90 kb apart from exon 3 (1984 bp). All exons are flanked by intronic sequences of group II. Transcription analysis reveals that the ps1A2 gene hybridizes with a 2.8-kb transcript, while all exon regions of the ps1A1 gene are homologous to a mature mRNA of 2.7 kb. From our data we conclude that the three distantly separated exonic sequences of the ps1A1 gene constitute a functional gene which probably operates by a trans-splicing mechanism. ImagesFig. 3.Fig. 5.Fig. 6. PMID:16453785
Comparative genetic structure of two mangrove species in Caribbean and Pacific estuaries of Panama

PubMed Central

2012-01-01

Background Mangroves are ecologically important and highly threatened forest communities. Observational and genetic evidence has confirmed the long distance dispersal capacity of water-dispersed mangrove seeds, but less is known about the relative importance of pollen vs. seed gene flow in connecting populations. We analyzed 980 Avicennia germinans for 11 microsatellite loci and 940 Rhizophora mangle for six microsatellite loci and subsampled two non-coding cpDNA regions in order to understand population structure, and gene flow within and among four major estuaries on the Caribbean and Pacific coasts of Panama. Results Both species showed similar rates of outcrossing (t= 0.7 in A. germinans and 0.8 in R. mangle) and strong patterns of spatial genetic structure within estuaries, although A. germinans had greater genetic structure in nuclear and cpDNA markers (7 demes > 4 demes and Sp= 0.02 > 0.002), and much greater cpDNA diversity (Hd= 0.8 > 0.2) than R. mangle. The Central American Isthmus serves as an exceptionally strong barrier to gene flow, with high levels nuclear (FST= 0.3-0.5) and plastid (FST= 0.5-0.8) genetic differentiation observed within each species between coasts and no shared cpDNA haplotypes between species on each coast. Finally, evidence of low ratios of pollen to seed dispersal (r = −0.6 in A. germinans and 7.7 in R. mangle), coupled with the strong observed structure in nuclear and plastid DNA among most estuaries, suggests low levels of gene flow in these mangrove species. Conclusions We conclude that gene dispersal in mangroves is usually limited within estuaries and that coastal geomorphology and rare long distance dispersal events could also influence levels of structure. PMID:23078287
A two-generation analysis of pollen pool genetic structure in flowering dogwood Cornus florida (Cornaceae), in the Missouri Ozarks

Treesearch

Victoria L. Sork; Peter E. Smouse; Victoria J. Apsit; Rodney J. Dyer; Robert D. Westfall

2005-01-01

Anthropogenic landscape change can disrupt gene flow. As part of the Missouri Ozark Forest Ecosystem Project, this study examined whether silvicultural practices influence pollen-mediated gene movement in the insect-pollinated species, Cornus florida L., by comparing pollen pool structure (ΦST) among clear-cutting,...
Developing JSequitur to Study the Hierarchical Structure of Biological Sequences in a Grammatical Inference Framework of String Compression Algorithms.

PubMed

Galbadrakh, Bulgan; Lee, Kyung-Eun; Park, Hyun-Seok

2012-12-01

Grammatical inference methods are expected to find grammatical structures hidden in biological sequences. One hopes that studies of grammar serve as an appropriate tool for theory formation. Thus, we have developed JSequitur for automatically generating the grammatical structure of biological sequences in an inference framework of string compression algorithms. Our original motivation was to find any grammatical traits of several cancer genes that can be detected by string compression algorithms. Through this research, we could not find any meaningful unique traits of the cancer genes yet, but we could observe some interesting traits in regards to the relationship among gene length, similarity of sequences, the patterns of the generated grammar, and compression rate.
Protein modeling and molecular dynamics simulation of SlWRKY4 protein cloned from drought tolerant tomato (Solanum habrochaites) line EC520061.

PubMed

Karkute, Suhas G; Easwaran, Murugesh; Gujjar, Ranjit Singh; Piramanayagam, Shanmughavel; Singh, Major

2015-10-01

WRKY genes are members of one of the largest families of plant transcription factors and play an important role in response to biotic and abiotic stresses, and overall growth and development. Understanding the interaction of WRKY proteins with other proteins/ligands in plant cells is of utmost importance to develop plants having tolerance to biotic and abiotic stresses. The SlWRKY4 gene was cloned from a drought tolerant wild species of tomato (Solanum habrochaites) and the secondary structure and 3D modeling of this protein were predicted using Schrödinger Suite-Prime. Predicted structures were also subjected to plot against Ramachandran's conformation, and the modeled structure was minimized using Macromodel. Finally, the minimized structure was simulated in the water environment to check the protein stability. The behavior of the modeled structure was well-simulated and analyzed through RMSD and RMSF of the protein. The present work provides the modeled 3D structure of SlWRKY4 that will help in understanding the mechanism of gene regulation by further in silico interaction studies.
Structural and functional organization of ribosomal genes within the mammalian cell nucleolus.

PubMed

Derenzini, Massimo; Pasquinelli, Gianandrea; O'Donohue, Marie-Françoise; Ploton, Dominique; Thiry, Marc

2006-02-01

Data on the in situ structural-functional organization of ribosomal genes in the mammalian cell nucleolus are reviewed here. Major findings on chromatin structure in situ come from investigations carried out using the Feulgen-like osmium ammine reaction as a highly specific electron-opaque DNA tracer. Intranucleolar chromatin shows three different levels of organization: compact clumps, fibers ranging from 11 to 30 nm, and loose agglomerates of extended DNA filaments. Both clumps and fibers of chromatin exhibit a nucleosomal organization that is lacking in the loose agglomerates of extended DNA filaments. In fact, these filaments constantly show a thickness of 2-3 nm, the same as a DNA double-helix molecule. The loose agglomerates of DNA filaments are located in the fibrillar centers, the interphase counterpart of metaphase NORs, therefore being constituted by ribosomal DNA. The extended, non-nucleosomal configuration of this rDNA has been shown to be independent of transcriptional activity and characterizes ribosome genes that are either transcribed or transcriptionally silent. Data reviewed are consistent with a model of control for ribosome gene activity that is not mediated by changes in chromatin structure. The presence of rDNA in mammalian cells always structurally ready for transcription might facilitate a more rapid adjustment of the ribosome production in response to the metabolic needs of the cell.
Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

PubMed

Sul, Jae Hoon; Bilow, Michael; Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

2016-03-01

Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.

Sex-dependent association of common variants of microcephaly genes with brain structure.

PubMed

Rimol, Lars M; Agartz, Ingrid; Djurovic, Srdjan; Brown, Andrew A; Roddey, J Cooper; Kähler, Anna K; Mattingsdal, Morten; Athanasiu, Lavinia; Joyner, Alexander H; Schork, Nicholas J; Halgren, Eric; Sundet, Kjetil; Melle, Ingrid; Dale, Anders M; Andreassen, Ole A

2010-01-05

Loss-of-function mutations in the genes associated with primary microcephaly (MCPH) reduce human brain size by about two-thirds, without producing gross abnormalities in brain organization or physiology and leaving other organs largely unaffected [Woods CG, et al. (2005) Am J Hum Genet 76:717-728]. There is also evidence suggesting that MCPH genes have evolved rapidly in primates and humans and have been subjected to selection in recent human evolution [Vallender EJ, et al. (2008) Trends Neurosci 31:637-644]. Here, we show that common variants of MCPH genes account for some of the common variation in brain structure in humans, independently of disease status. We investigated the correlations of SNPs from four MCPH genes with brain morphometry phenotypes obtained with MRI. We found significant, sex-specific associations between common, nonexonic, SNPs of the genes CDK5RAP2, MCPH1, and ASPM, with brain volume or cortical surface area in an ethnically homogenous Norwegian discovery sample (n = 287), including patients with mental illness. The most strongly associated SNP findings were replicated in an independent North American sample (n = 656), which included patients with dementia. These results are consistent with the view that common variation in brain structure is associated with genetic variants located in nonexonic, presumably regulatory, regions.
Genetic connectivity for two bear species at wildlife crossing structures in Banff National Park.

PubMed

Sawaya, Michael A; Kalinowski, Steven T; Clevenger, Anthony P

2014-04-07

Roads can fragment and isolate wildlife populations, which will eventually decrease genetic diversity within populations. Wildlife crossing structures may counteract these impacts, but most crossings are relatively new, and there is little evidence that they facilitate gene flow. We conducted a three-year research project in Banff National Park, Alberta, to evaluate the effectiveness of wildlife crossings to provide genetic connectivity. Our main objective was to determine how the Trans-Canada Highway and crossing structures along it affect gene flow in grizzly (Ursus arctos) and black bears (Ursus americanus). We compared genetic data generated from wildlife crossings with data collected from greater bear populations. We detected a genetic discontinuity at the highway in grizzly bears but not in black bears. We assigned grizzly bears that used crossings to populations north and south of the highway, providing evidence of bidirectional gene flow and genetic admixture. Parentage tests showed that 47% of black bears and 27% of grizzly bears that used crossings successfully bred, including multiple males and females of both species. Differentiating between dispersal and gene flow is difficult, but we documented gene flow by showing migration, reproduction and genetic admixture. We conclude that wildlife crossings allow sufficient gene flow to prevent genetic isolation.
Spatial genetic structure in wild cardoon, the ancestor of cultivated globe artichoke: Limited gene flow, fragmentation and population history.

PubMed

Rau, D; Rodriguez, M; Rapposelli, E; Murgia, M L; Papa, R; Brown, A H D; Attene, G

2016-12-01

Nuclear and chloroplast markers and phenotypic characters were integrated to analyse the population genetic structure of wild cardoon, Cynara cardunculus var. sylvestris, the ancestor of cultivated globe artichoke, Cynara cardunculus var. scolymus on the island of Sardinia, Italy. The spatial scale ranged from a few metres to ∼200km. Wild cardoon appears to be genetically fragmented, with significant genetic divergence at various scales, indicating that gene flow is insufficient to counterbalance the effects of genetic drift or founder effects. Divergence between populations was higher for chloroplast (40%) than for nuclear markers (15%), suggesting that gene flow via seed was lower than via pollen. Two main genetic groups were detected; these correlated with differences in flowering time, capitula size, glossiness, and anthocyanin pigmentation. A complex population structure of wild cardoon emerged over small spatial scales, likely resulting from the interplay between gene dispersal, colonisation history and selective forces. Indeed, Sardinia appears to be a 'hybrid zone' of different gene pools. The island has unique diverse germplasm that has originated from hybridisation among different gene pools. The sampling of seeds from a few plants but from many sites is suggested as the best strategy to harvest the genetic diversity of wild cardoon. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Genome Structures and Transcriptomes Signify Niche Adaptation for the Multiple-Ion-Tolerant Extremophyte Schrenkiella parvula1[C][W][OPEN

PubMed Central

Oh, Dong-Ha; Hong, Hyewon; Lee, Sang Yeol; Yun, Dae-Jin; Bohnert, Hans J.; Dassanayake, Maheshi

2014-01-01

Schrenkiella parvula (formerly Thellungiella parvula), a close relative of Arabidopsis (Arabidopsis thaliana) and Brassica crop species, thrives on the shores of Lake Tuz, Turkey, where soils accumulate high concentrations of multiple-ion salts. Despite the stark differences in adaptations to extreme salt stresses, the genomes of S. parvula and Arabidopsis show extensive synteny. S. parvula completes its life cycle in the presence of Na+, K+, Mg2+, Li+, and borate at soil concentrations lethal to Arabidopsis. Genome structural variations, including tandem duplications and translocations of genes, interrupt the colinearity observed throughout the S. parvula and Arabidopsis genomes. Structural variations distinguish homologous gene pairs characterized by divergent promoter sequences and basal-level expression strengths. Comparative RNA sequencing reveals the enrichment of ion-transport functions among genes with higher expression in S. parvula, while pathogen defense-related genes show higher expression in Arabidopsis. Key stress-related ion transporter genes in S. parvula showed increased copy number, higher transcript dosage, and evidence for subfunctionalization. This extremophyte offers a framework to identify the requisite adjustments of genomic architecture and expression control for a set of genes found in most plants in a way to support distinct niche adaptation and lifestyles. PMID:24563282
Genetic connectivity for two bear species at wildlife crossing structures in Banff National Park

PubMed Central

Sawaya, Michael A.; Kalinowski, Steven T.; Clevenger, Anthony P.

2014-01-01

Roads can fragment and isolate wildlife populations, which will eventually decrease genetic diversity within populations. Wildlife crossing structures may counteract these impacts, but most crossings are relatively new, and there is little evidence that they facilitate gene flow. We conducted a three-year research project in Banff National Park, Alberta, to evaluate the effectiveness of wildlife crossings to provide genetic connectivity. Our main objective was to determine how the Trans-Canada Highway and crossing structures along it affect gene flow in grizzly (Ursus arctos) and black bears (Ursus americanus). We compared genetic data generated from wildlife crossings with data collected from greater bear populations. We detected a genetic discontinuity at the highway in grizzly bears but not in black bears. We assigned grizzly bears that used crossings to populations north and south of the highway, providing evidence of bidirectional gene flow and genetic admixture. Parentage tests showed that 47% of black bears and 27% of grizzly bears that used crossings successfully bred, including multiple males and females of both species. Differentiating between dispersal and gene flow is difficult, but we documented gene flow by showing migration, reproduction and genetic admixture. We conclude that wildlife crossings allow sufficient gene flow to prevent genetic isolation. PMID:24552834
Dendrimers as Carriers for siRNA Delivery and Gene Silencing: A Review

PubMed Central

Huang, Weizhe; He, Ziying

2013-01-01

RNA interference (RNAi) was first literaturally reported in 1998 and has become rapidly a promising tool for therapeutic applications in gene therapy. In a typical RNAi process, small interfering RNAs (siRNA) are used to specifically downregulate the expression of the targeted gene, known as the term “gene silencing.” One key point for successful gene silencing is to employ a safe and efficient siRNA delivery system. In this context, dendrimers are emerging as potential nonviral vectors to deliver siRNA for RNAi purpose. Dendrimers have attracted intense interest since their emanating research in the 1980s and are extensively studied as efficient DNA delivery vectors in gene transfer applications, due to their unique features based on the well-defined and multivalent structures. Knowing that DNA and RNA possess a similar structure in terms of nucleic acid framework and the electronegative nature, one can also use the excellent DNA delivery properties of dendrimers to develop effective siRNA delivery systems. In this review, the development of dendrimer-based siRNA delivery vectors is summarized, focusing on the vector features (siRNA delivery efficiency, cytotoxicity, etc.) of different types of dendrimers and the related investigations on structure-activity relationship to promote safe and efficient siRNA delivery system. PMID:24288498
Protein and gene structure of a blue laccase from Pleurotus ostreatus1.

PubMed Central

Giardina, P; Palmieri, G; Scaloni, A; Fontanella, B; Faraco, V; Cennamo, G; Sannia, G

1999-01-01

A new laccase isoenzyme (POXA1b, where POX is phenol oxidase), produced by Pleurotus ostreatus in cultures supplemented with copper sulphate, has been purified and fully characterized. The main characteristics of this protein (molecular mass in native and denaturing conditions, pI and catalytic properties) are almost identical to the previously studied laccase POXA1w. However, POXA1b contains four copper atoms per molecule instead of one copper, two zinc and one iron atom per molecule of POXA1w. Furthermore, POXA1b shows an unusually high stability at alkaline pH. The gene and cDNA coding for POXA1b have been cloned and sequenced. The gene coding sequence contains 1599 bp, interrupted by 15 introns. Comparison of the structure of the poxa1b gene with the two previously studied P. ostreatus laccase genes (pox1 and poxc) suggests that these genes belong to two different subfamilies. The amino acid sequence of POXA1b deduced from the cDNA sequence has been almost completely verified by means of matrix-assisted laser desorption ionization MS. It has been demonstrated that three out of six putative glycosylation sites are post-translationally modified and the structure of the bound glycosidic moieties has been determined, whereas two other putative glycosylation sites are unmodified. PMID:10417329
A transcriptome-based assessment of the astrocytic dystrophin-associated complex in the developing human brain.

PubMed

Simon, Matthew J; Murchison, Charles; Iliff, Jeffrey J

2018-02-01

Astrocytes play a critical role in regulating the interface between the cerebral vasculature and the central nervous system. Contributing to this is the astrocytic endfoot domain, a specialized structure that ensheathes the entirety of the vasculature and mediates signaling between endothelial cells, pericytes, and neurons. The astrocytic endfoot has been implicated as a critical element of the glymphatic pathway, and changes in protein expression profiles in this cellular domain are linked to Alzheimer's disease pathology. Despite this, basic physiological properties of this structure remain poorly understood including the developmental timing of its formation, and the protein components that localize there to mediate its functions. Here we use human transcriptome data from male and female subjects across several developmental stages and brain regions to characterize the gene expression profile of the dystrophin-associated complex (DAC), a known structural component of the astrocytic endfoot that supports perivascular localization of the astroglial water channel aquaporin-4. Transcriptomic profiling is also used to define genes exhibiting parallel expression profiles to DAC elements, generating a pool of candidate genes that encode gene products that may contribute to the physiological function of the perivascular astrocytic endfoot domain. We found that several genes encoding transporter proteins are transcriptionally associated with DAC genes. © 2017 Wiley Periodicals, Inc.
Contrasting roles for MyoD in organizing myogenic promoter structures during embryonic skeletal muscle development.

PubMed

Cho, Ok Hyun; Mallappa, Chandrashekara; Hernández-Hernández, J Manuel; Rivera-Pérez, Jaime A; Imbalzano, Anthony N

2015-01-01

Among the complexities of skeletal muscle differentiation is a temporal distinction in the onset of expression of different lineage-specific genes. The lineage-determining factor MyoD is bound to myogenic genes at the onset of differentiation whether gene activation is immediate or delayed. How temporal regulation of differentiation-specific genes is established remains unclear. Using embryonic tissue, we addressed the molecular differences in the organization of the myogenin and muscle creatine kinase (MCK) gene promoters by examining regulatory factor binding as a function of both time and spatial organization during somitogenesis. At the myogenin promoter, binding of the homeodomain factor Pbx1 coincided with H3 hyperacetylation and was followed by binding of co-activators that modulate chromatin structure. MyoD and myogenin binding occurred subsequently, demonstrating that Pbx1 facilitates chromatin remodeling and modification before myogenic regulatory factor binding. At the same time, the MCK promoter was bound by HDAC2 and MyoD, and activating histone marks were largely absent. The association of HDAC2 and MyoD was confirmed by co-immunoprecipitation, proximity ligation assay (PLA), and sequential ChIP. MyoD differentially promotes activated and repressed chromatin structures at myogenic genes early after the onset of skeletal muscle differentiation in the developing mouse embryo. © 2014 Wiley Periodicals, Inc.
The novel product of a five-exon stargazin-related gene abolishes CaV2.2 calcium channel expression

PubMed Central

Moss, Fraser J.; Viard, Patricia; Davies, Anthony; Bertaso, Federica; Page, Karen M.; Graham, Alex; Cantí, Carles; Plumpton, Mary; Plumpton, Christopher; Clare, Jeffrey J.; Dolphin, Annette C.

2002-01-01

We have cloned and characterized a new member of the voltage-dependent Ca2+ channel γ subunit family, with a novel gene structure and striking properties. Unlike the genes of other potential γ subunits identified by their homology to the stargazin gene, CACNG7 is a five-, and not four-exon gene whose mRNA encodes a protein we have designated γ7. Expression of human γ7 has been localized specifically to brain. N-type current through CaV2.2 channels was almost abolished when co-expressed transiently with γ7 in either Xenopus oocytes or COS-7 cells. Furthermore, immunocytochemistry and western blots show that γ7 has this effect by causing a large reduction in expression of CaV2.2 rather than by interfering with trafficking or biophysical properties of the channel. No effect of transiently expressed γ7 was observed on pre-existing endogenous N-type calcium channels in sympathetic neurones. Low homology to the stargazin-like γ subunits, different gene structure and the unique functional properties of γ7 imply that it represents a distinct subdivision of the family of proteins identified by their structural and sequence homology to stargazin. PMID:11927536
Muscle Research and Gene Ontology: New standards for improved data integration.

PubMed

Feltrin, Erika; Campanaro, Stefano; Diehl, Alexander D; Ehler, Elisabeth; Faulkner, Georgine; Fordham, Jennifer; Gardin, Chiara; Harris, Midori; Hill, David; Knoell, Ralph; Laveder, Paolo; Mittempergher, Lorenza; Nori, Alessandra; Reggiani, Carlo; Sorrentino, Vincenzo; Volpe, Pompeo; Zara, Ivano; Valle, Giorgio; Deegan, Jennifer

2009-01-29

The Gene Ontology Project provides structured controlled vocabularies for molecular biology that can be used for the functional annotation of genes and gene products. In a collaboration between the Gene Ontology (GO) Consortium and the muscle biology community, we have made large-scale additions to the GO biological process and cellular component ontologies. The main focus of this ontology development work concerns skeletal muscle, with specific consideration given to the processes of muscle contraction, plasticity, development, and regeneration, and to the sarcomere and membrane-delimited compartments. Our aims were to update the existing structure to reflect current knowledge, and to resolve, in an accommodating manner, the ambiguity in the language used by the community. The updated muscle terminologies have been incorporated into the GO. There are now 159 new terms covering critical research areas, and 57 existing terms have been improved and reorganized to follow their usage in muscle literature. The revised GO structure should improve the interpretation of data from high-throughput (e.g. microarray and proteomic) experiments in the area of muscle science and muscle disease. We actively encourage community feedback on, and gene product annotation with these new terms. Please visit the Muscle Community Annotation Wiki http://wiki.geneontology.org/index.php/Muscle_Biology.
Characterization of the c-type lysozyme gene family in Anopheles gambiae.

PubMed

Li, Bin; Calvo, Eric; Marinotti, Osvaldo; James, Anthony A; Paskewitz, Susan M

2005-11-07

Seven new c-type lysozyme genes were found using the Anopheles gambiae genome sequence, increasing to eight the total number of genes in this family identified in this species. The eight lysozymes in An. gambiae have considerable variation in gene structure and expression patterns. Lys c-6 has the most unusual primary amino acid structure as the predicted protein consists of five lysozyme-like domains. Transcript abundance of each c-type lysozyme was determined by semiquantitative RT-PCR. Lys c-1, c-6 and c-7 are expressed constitutively in all developmental stages from egg to adult. Lys c-2 and c-4 also are found in all stages, but with relatively much higher levels in adults. Conversely, Lys c-3 and c-8 transcripts are highest in larvae. Lys c-1, c-6 and c-7 transcripts are found in nearly all the adult tissue samples examined while Lys c-2 and Lys c-4 are more restricted in their expression. Lys c-1 and c-2 transcripts are clearly immune responsive and are increased significantly 6-12 h post challenge with bacteria. The functional adaptive changes that may have evolved during the expansion of this gene family are briefly discussed in terms of the expression patterns, gene and protein structures.
Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

PubMed

Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

2013-01-01

EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.
Evolutionary Relationship and Structural Characterization of the EPF/EPFL Gene Family

PubMed Central

Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

2013-01-01

EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes. PMID:23755192
De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences

PubMed Central

Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.

2013-01-01

How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629
Cassette structures associated with antibiotic resistance genes in Salmonella enterica isolated from processing plants, food animals, and retail meats

USDA-ARS?s Scientific Manuscript database

Slowing the spread of antibiotic resistance (AR) is one of the most urgent tasks currently facing the field of microbiology. Mobile genetic elements, like plasmids and integrons, allow AR genes to transfer horizontally, thus increasing the spread of AR genes. Determining which AR genes are found on ...
Molecular Characterization of Two Lactate Dehydrogenase Genes with a Novel Structural Organization on the Genome of Lactobacillus sp. Strain MONT4

PubMed Central

Weekes, Jennifer; Yüksel, Gülhan Ü.

2004-01-01

Two lactate dehydrogenase (ldh) genes from Lactobacillus sp. strain MONT4 were cloned by complementation in Escherichia coli DC1368 (ldh pfl) and were sequenced. The sequence analysis revealed a novel genomic organization of the ldh genes. Subcloning of the individual ldh genes and their Northern blot analyses indicated that the genes are monocistronic. PMID:15466577
Structure of the circumsporozoite protein gene in 18 strains of Plasmodium falciparum.

PubMed

Weber, J L; Hockmeyer, W T

1985-06-01

Using the cloned circumsporozoite (CS) protein gene of a Brazilian strain of Plasmodium falciparum as probe, we have analyzed the structure of the CS protein gene from 17 other Asian, African, Central and South American parasite strains by nucleic acid hybridization. Each strain appears to have one CS protein gene which hybridizes readily to the Brazilian strain probe. The 5' and 3' thirds of the genes are invariant in size in all 18 strains whereas the central third containing the 12 base pair tandem repeats varies in size over a range of about 100 base pairs. Several differences were found in the locations of Sau3A sites in the genes. The Sau3A sites are significant because each of the minority Asn-Val-Asp-Pro repeats in the cloned gene has a Sau3A site. DNA melting of hybrids revealed a high degree of homology between the sequences of the cloned gene and genes from an Asian strain and an African strain. A 14 base oligodeoxynucleotide with a sequence from the central repeat region hybridized to all strains tested. We conclude that the CS protein gene is highly conserved among strains of P. falciparum and that malaria vaccine development with the CS protein is unlikely to be complicated by strain variation.
Microarray identification of novel genes downstream of Six1, a critical factor in cranial placode, somite and kidney development

PubMed Central

Yan, Bo; Neilson, Karen M.; Ranganathan, Ramya; Maynard, Thomas; Streit, Andrea; Moody, Sally A.

2014-01-01

Background Six1 plays an important role in the development of several vertebrate organs, including cranial sensory placodes, somites and kidney. Although Six1 mutations cause one form of Branchio-Otic Syndrome (BOS), the responsible gene in many patients has not been identified; genes that act downstream of Six1 are potential BOS candidates. Results We sought to identify novel genes expressed during placode, somite and kidney development by comparing gene expression between control and Six1-expressing ectodermal explants. The expression patterns of 19 of the significantly up-regulated and 11 of the significantly down-regulated genes were assayed from cleavage to larval stages. 28/30 genes are expressed in the otocyst, a structure that is functionally disrupted in BOS, and 26/30 genes are expressed in the nephric mesoderm, a structure that is functionally disrupted in the related Branchio-Otic-Renal (BOR) syndrome. We also identified the chick homologues of 5 genes and show that they have conserved expression patterns. Conclusions Of the 30 genes selected for expression analyses, all are expressed at many of the developmental times and appropriate tissues to be regulated by Six1. Many have the potential to play a role in the disruption of hearing and kidney function seen in BOS/BOR patients. PMID:25403746
The Composition and Spatial Patterns of Bacterial Virulence Factors and Antibiotic Resistance Genes in 19 Wastewater Treatment Plants

PubMed Central

Zhang, Bing; Xia, Yu; Wen, Xianghua; Wang, Xiaohui; Yang, Yunfeng; Zhou, Jizhong; Zhang, Yu

2016-01-01

Bacterial pathogenicity and antibiotic resistance are of concern for environmental safety and public health. Accumulating evidence suggests that wastewater treatment plants (WWTPs) are as an important sink and source of pathogens and antibiotic resistance genes (ARGs). Virulence genes (encoding virulence factors) are good indicators for bacterial pathogenic potentials. To achieve a comprehensive understanding of bacterial pathogenic potentials and antibiotic resistance in WWTPs, bacterial virulence genes and ARGs in 19 WWTPs covering a majority of latitudinal zones of China were surveyed by using GeoChip 4.2. A total of 1610 genes covering 13 virulence factors and 1903 genes belonging to 11 ARG families were detected respectively. The bacterial virulence genes exhibited significant spatial distribution patterns of a latitudinal biodiversity gradient and a distance-decay relationship across China. Moreover, virulence genes tended to coexist with ARGs as shown by their strongly positive associations. In addition, key environmental factors shaping the overall virulence gene structure were identified. This study profiles the occurrence, composition and distribution of virulence genes and ARGs in current WWTPs in China, and uncovers spatial patterns and important environmental variables shaping their structure, which may provide the basis for further studies of bacterial virulence factors and antibiotic resistance in WWTPs. PMID:27907117

Adeno-associated virus inverted terminal repeats stimulate gene editing.

PubMed

Hirsch, M L

2015-02-01

Advancements in genome editing have relied on technologies to specifically damage DNA which, in turn, stimulates DNA repair including homologous recombination (HR). As off-target concerns complicate the therapeutic translation of site-specific DNA endonucleases, an alternative strategy to stimulate gene editing based on fragile DNA was investigated. To do this, an episomal gene-editing reporter was generated by a disruptive insertion of the adeno-associated virus (AAV) inverted terminal repeat (ITR) into the egfp gene. Compared with a non-structured DNA control sequence, the ITR induced DNA damage as evidenced by increased gamma-H2AX and Mre11 foci formation. As local DNA damage stimulates HR, ITR-mediated gene editing was investigated using DNA oligonucleotides as repair substrates. The AAV ITR stimulated gene editing >1000-fold in a replication-independent manner and was not biased by the polarity of the repair oligonucleotide. Analysis of additional human DNA sequences demonstrated stimulation of gene editing to varying degrees. In particular, inverted yet not direct, Alu repeats induced gene editing, suggesting a role for DNA structure in the repair event. Collectively, the results demonstrate that inverted DNA repeats stimulate gene editing via double-strand break repair in an episomal context and allude to efficient gene editing of the human chromosome using fragile DNA sequences.
Clustering Algorithms: Their Application to Gene Expression Data

PubMed Central

Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

2016-01-01

Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure. PMID:27932867
Genome structure drives patterns of gene family evolution in ciliates, a case study using Chilodonella uncinata (Protista, Ciliophora, Phyllopharyngea).

PubMed

Gao, Feng; Song, Weibo; Katz, Laura A

2014-08-01

In most lineages, diversity among gene family members results from gene duplication followed by sequence divergence. Because of the genome rearrangements during the development of somatic nuclei, gene family evolution in ciliates involves more complex processes. Previous work on the ciliate Chilodonella uncinata revealed that macronuclear β-tubulin gene family members are generated by alternative processing, in which germline regions are alternatively used in multiple macronuclear chromosomes. To further study genome evolution in this ciliate, we analyzed its transcriptome and found that (1) alternative processing is extensive among gene families; and (2) such gene families are likely to be C. uncinata specific. We characterized additional macronuclear and micronuclear copies of one candidate alternatively processed gene family-a protein kinase domain containing protein (PKc)-from two C. uncinata strains. Analysis of the PKc sequences reveals that (1) multiple PKc gene family members in the macronucleus share some identical regions flanked by divergent regions; and (2) the shared identical regions are processed from a single micronuclear chromosome. We discuss analogous processes in lineages across the eukaryotic tree of life to provide further insights on the impact of genome structure on gene family evolution in eukaryotes. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.
A network-based method for the identification of putative genes related to infertility.

PubMed

Wang, ShaoPeng; Huang, GuoHua; Hu, Qinghua; Zou, Quan

2016-11-01

Infertility has become one of the major health problems worldwide, with its incidence having risen markedly in recent decades. There is an urgent need to investigate the pathological mechanisms behind infertility and to design effective treatments. However, this is made difficult by the fact that various biological factors have been identified to be related to infertility, including genetic factors. A network-based method was established to identify new genes potentially related to infertility. A network constructed using human protein-protein interactions based on previously validated infertility-related genes enabled the identification of some novel candidate genes. These genes were then filtered by a permutation test and their functional and structural associations with infertility-related genes. Our method identified 23 novel genes, which have strong functional and structural associations with previously validated infertility-related genes. Substantial evidence indicates that the identified genes are strongly related to dysfunction of the four main biological processes of fertility: reproductive development and physiology, gametogenesis, meiosis and recombination, and hormone regulation. The newly discovered genes may provide new directions for investigating infertility. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016 Elsevier B.V. All rights reserved.
Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes

PubMed Central

López-Garriga, Juan; Cadilla, Carmen L.

2016-01-01

The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233
Insights into soybean transcriptome reconfiguration under hypoxic stress: Functional, regulatory, structural, and compositional characterization

PubMed Central

Rodrigues, Fabiana A.; Neumaier, Norman; Marcolino-Gomes, Juliana; Molinari, Hugo B. C.; Santiago, Thaís R.; Formighieri, Eduardo F.; Basso, Marcos F.; Farias, José R. B.; Emygdio, Beatriz M.; de Oliveira, Ana C. B.; Campos, Ângela D.; Borém, Aluízio; Harmon, Frank G.; Mertz-Henning, Liliane M.; Nepomuceno, Alexandre L.

2017-01-01

Soybean (Glycine max) is one of the major crops worldwide and flooding stress affects the production and expansion of cultivated areas. Oxygen is essential for mitochondrial aerobic respiration to supply the energy demand of plant cells. Because oxygen diffusion in water is 10,000 times lower than in air, partial (hypoxic) or total (anoxic) oxygen deficiency is important component of flooding. Even when oxygen is externally available, oxygen deficiency frequently occurs in bulky, dense or metabolically active tissues such as phloem, meristems, seeds, and fruits. In this study, we analyzed conserved and divergent root transcriptional responses between flood-tolerant Embrapa 45 and flood-sensitive BR 4 soybean cultivars under hypoxic stress conditions with RNA-seq. To understand how soybean genes evolve and respond to hypoxia, stable and differentially expressed genes were characterized structurally and compositionally comparing its mechanistic relationship. Between cultivars, Embrapa 45 showed less up- and more down-regulated genes, and stronger induction of phosphoglucomutase (Glyma05g34790), unknown protein related to N-terminal protein myristoylation (Glyma06g03430), protein suppressor of phyA-105 (Glyma06g37080), and fibrillin (Glyma10g32620). RNA-seq and qRT-PCR analysis of non-symbiotic hemoglobin (Glyma11g12980) indicated divergence in gene structure between cultivars. Transcriptional changes for genes in amino acids and derivative metabolic process suggest involvement of amino acids metabolism in tRNA modifications, translation accuracy/efficiency, and endoplasmic reticulum stress in both cultivars under hypoxia. Gene groups differed in promoter TATA box, ABREs (ABA-responsive elements), and CRT/DREs (C-repeat/dehydration-responsive elements) frequency. Gene groups also differed in structure, composition, and codon usage, indicating biological significances. Additional data suggests that cis-acting ABRE elements can mediate gene expression independent of ABA in soybean roots under hypoxia. PMID:29145496
Structural Basis of Cooperative Ligand Binding by the Glycine Riboswitch

DOE Office of Scientific and Technical Information (OSTI.GOV)

E Butler; J Wang; Y Xiong

2011-12-31

The glycine riboswitch regulates gene expression through the cooperative recognition of its amino acid ligand by a tandem pair of aptamers. A 3.6 {angstrom} crystal structure of the tandem riboswitch from the glycine permease operon of Fusobacterium nucleatum reveals the glycine binding sites and an extensive network of interactions, largely mediated by asymmetric A-minor contacts, that serve to communicate ligand binding status between the aptamers. These interactions provide a structural basis for how the glycine riboswitch cooperatively regulates gene expression.
The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda) bears a novel gene order and unusual control region features

PubMed Central

Kilpert, Fabian; Podsiadlowski, Lars

2006-01-01

Background Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules) are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc.) comprise about 21,000 species of crustaceans, living in many environments from deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living at rocky seashores of the european North Sea and Atlantic coastlines. Results The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview about mitochondrial gene order of all crustacean taxa yet sequenced is also presented. The largest non-coding part (the putative mitochondrial control region) of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp), and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which derive from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already present the last common ancestor of these isopods. Beyond that, the positions of three tRNA genes differ in the two isopod species. Strand bias in nucleotide frequency is reversed in both isopod species compared to other Malacostraca. This is probably due to a reversal of the replication origin, which is further supported by the fact that the hairpin structure typically found in the control region shows a reversed orientation in the isopod species, compared to other crustaceans. PMID:16987408
Deciphering the genomic structure, function and evolution of carotenogenesis related phytoene synthases in grasses

PubMed Central

2012-01-01

Background Carotenoids are isoprenoid pigments, essential for photosynthesis and photoprotection in plants. The enzyme phytoene synthase (PSY) plays an essential role in mediating condensation of two geranylgeranyl diphosphate molecules, the first committed step in carotenogenesis. PSY are nuclear enzymes encoded by a small gene family consisting of three paralogous genes (PSY1-3) that have been widely characterized in rice, maize and sorghum. Results In wheat, for which yellow pigment content is extremely important for flour colour, only PSY1 has been extensively studied because of its association with QTLs reported for yellow pigment whereas PSY2 has been partially characterized. Here, we report the isolation of bread wheat PSY3 genes from a Renan BAC library using Brachypodium as a model genome for the Triticeae to develop Conserved Orthologous Set markers prior to gene cloning and sequencing. Wheat PSY3 homoeologous genes were sequenced and annotated, unravelling their novel structure associated with intron-loss events and consequent exonic fusions. A wheat PSY3 promoter region was also investigated for the presence of cis-acting elements involved in the response to abscisic acid (ABA), since carotenoids also play an important role as precursors of signalling molecules devoted to plant development and biotic/abiotic stress responses. Expression of wheat PSYs in leaves and roots was investigated during ABA treatment to confirm the up-regulation of PSY3 during abiotic stress. Conclusions We investigated the structural and functional determinisms of PSY genes in wheat. More generally, among eudicots and monocots, the PSY gene family was found to be associated with differences in gene copy numbers, allowing us to propose an evolutionary model for the entire PSY gene family in Grasses. PMID:22672222
Evaluation of Reference Genes for RT qPCR Analyses of Structure-Specific and Hormone Regulated Gene Expression in Physcomitrella patens Gametophytes

PubMed Central

Le Bail, Aude; Scholz, Sebastian; Kost, Benedikt

2013-01-01

The use of the moss Physcomitrella patens as a model system to study plant development and physiology is rapidly expanding. The strategic position of P. patens within the green lineage between algae and vascular plants, the high efficiency with which transgenes are incorporated by homologous recombination, advantages associated with the haploid gametophyte representing the dominant phase of the P. patens life cycle, the simple structure of protonemata, leafy shoots and rhizoids that constitute the haploid gametophyte, as well as a readily accessible high-quality genome sequence make this moss a very attractive experimental system. The investigation of the genetic and hormonal control of P. patens development heavily depends on the analysis of gene expression patterns by real time quantitative PCR (RT qPCR). This technique requires well characterized sets of reference genes, which display minimal expression level variations under all analyzed conditions, for data normalization. Sets of suitable reference genes have been described for most widely used model systems including e.g. Arabidopsis thaliana, but not for P. patens. Here, we present a RT qPCR based comparison of transcript levels of 12 selected candidate reference genes in a range of gametophytic P. patens structures at different developmental stages, and in P. patens protonemata treated with hormones or hormone transport inhibitors. Analysis of these RT qPCR data using GeNorm and NormFinder software resulted in the identification of sets of P. patens reference genes suitable for gene expression analysis under all tested conditions, and suggested that the two best reference genes are sufficient for effective data normalization under each of these conditions. PMID:23951063
Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes

PubMed Central

Matus, José Tomás; Aquea, Felipe; Arce-Johnson, Patricio

2008-01-01

Background The MYB superfamily constitutes the most abundant group of transcription factors described in plants. Members control processes such as epidermal cell differentiation, stomatal aperture, flavonoid synthesis, cold and drought tolerance and pathogen resistance. No genome-wide characterization of this family has been conducted in a woody species such as grapevine. In addition, previous analysis of the recently released grape genome sequence suggested expansion events of several gene families involved in wine quality. Results We describe and classify 108 members of the grape R2R3 MYB gene subfamily in terms of their genomic gene structures and similarity to their putative Arabidopsis thaliana orthologues. Seven gene models were derived and analyzed in terms of gene expression and their DNA binding domain structures. Despite low overall sequence homology in the C-terminus of all proteins, even in those with similar functions across Arabidopsis and Vitis, highly conserved motif sequences and exon lengths were found. The grape epidermal cell fate clade is expanded when compared with the Arabidopsis and rice MYB subfamilies. Two anthocyanin MYBA related clusters were identified in chromosomes 2 and 14, one of which includes the previously described grape colour locus. Tannin related loci were also detected with eight candidate homologues in chromosomes 4, 9 and 11. Conclusion This genome wide transcription factor analysis in Vitis suggests that clade-specific grape R2R3 MYB genes are expanded while other MYB genes could be well conserved compared to Arabidopsis. MYB gene abundance, homology and orientation within particular loci also suggests that expanded MYB clades conferring quality attributes of grapes and wines, such as colour and astringency, could possess redundant, overlapping and cooperative functions. PMID:18647406
Xylanase II from an alkaliphilic thermophilic Bacillus with a distinctly different structure from other xylanases: evolutionary relationship to alkaliphilic xylanases.

PubMed

Kulkarni, N; Lakshmikumaran, M; Rao, M

1999-10-05

A 1.0 kilobase gene fragment from the genomic DNA of an alkaliphilic thermophilic Bacillus was found to code for a functional xylanase (XynII). The complete nucleotide sequence including the structural gene and the 5' and 3' flanking sequences of the xylanase gene have been determined. An open reading frame starting from ATG initiator codon comprising 402 nucleotides gave a preprotein of 133 amino acids of calculated molecular mass 14.090 kDa. The occurrence of three potential N-glycosylation sites in XynII gene is a unique feature for a gene of bacterial origin. The stop codon was followed by hairpin loop structures indicating the presence of transcription termination signals. The secondary structure analysis of XynII predicted that the polypeptide was primarily formed of beta-sheets. XynII appeared to be a member of family G/11 of xylanases based on its molecular weight and basic pI (8.0). However, sequence homology revealed similar identity with families 10 and 11 of xylanases. The conserved triad (Val-Val-Xaa, where Xaa is Asn or Asp) was identified only in the xylanases from alkaliphilic organisms. Our results implicate for the first time the concept of convergent evolution for XynII and provide a basis for research in evolutionary relationship among the xylanases from alkaliphilic and neutrophilic organisms. Copyright 1999 Academic Press.
Toxicity of algicidal extracts from Mangrovimonas yunxiaonensis strain LY01 on a HAB causing Alexandrium tamarense.

PubMed

Li, Yi; Zhu, Hong; Zhang, Huajun; Chen, Zhangran; Tian, Yun; Xu, Hong; Zheng, Tianling; Zheng, Wei

2014-08-15

Toxicity of algicidal extracts from Mangrovimonas yunxiaonensis strain LY01 on Alexandrium tamarense were measured through studying the algicidal procedure, nuclear damage and transcription of related genes. Medium components were optimized to improve algicidal activity, and characteristics of algicidal extracts were determined. Transmission electron microscope analysis revealed that the cell structure was broken. Cell membrane integrity destruction and nuclear structure degradation were monitored using confocal laser scanning microscope, and the rbcS, hsp and proliferating cell nuclear antigen (PCNA) gene expressions were studied. Results showed that 1.0% tryptone, 0.4% glucose and 0.8% MgCl2 were the optimal nutrient sources. The algicidal extracts were heat and pH stable, non-protein and less than 1kD. Cell membrane and nuclear structure integrity were lost, and the transcription of the rbcS and PCNA genes were significantly inhibited and there was up-regulation of hsp gene expression during the exposure procedure. The algicidal extracts destroyed the cell membrane and nuclear structure integrity, inhibited related gene expression and, eventually, lead to the inhibition of algal growth. All the results may elaborate firstly the cell death process and nuclear damage in A. tamarense which was induced by algicidal extracts, and the algicidal extracts could be potentially used as bacterial control of HABs in future. Copyright © 2014 Elsevier B.V. All rights reserved.
First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications

PubMed Central

Chen, Zhi-Teng; Du, Yu-Zhou

2017-01-01

The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae. PMID:28475163
First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

PubMed

Chen, Zhi-Teng; Du, Yu-Zhou

2017-05-05

The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer ( AGN ), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.
The Functional Genetics of Handedness and Language Lateralization: Insights from Gene Ontology, Pathway and Disease Association Analyses.

PubMed

Schmitz, Judith; Lor, Stephanie; Klose, Rena; Güntürkün, Onur; Ocklenburg, Sebastian

2017-01-01

Handedness and language lateralization are partially determined by genetic influences. It has been estimated that at least 40 (and potentially more) possibly interacting genes may influence the ontogenesis of hemispheric asymmetries. Recently, it has been suggested that analyzing the genetics of hemispheric asymmetries on the level of gene ontology sets, rather than at the level of individual genes, might be more informative for understanding the underlying functional cascades. Here, we performed gene ontology, pathway and disease association analyses on genes that have previously been associated with handedness and language lateralization. Significant gene ontology sets for handedness were anatomical structure development, pattern specification (especially asymmetry formation) and biological regulation. Pathway analysis highlighted the importance of the TGF-beta signaling pathway for handedness ontogenesis. Significant gene ontology sets for language lateralization were responses to different stimuli, nervous system development, transport, signaling, and biological regulation. Despite the fact that some authors assume that handedness and language lateralization share a common ontogenetic basis, gene ontology sets barely overlap between phenotypes. Compared to genes involved in handedness, which mostly contribute to structural development, genes involved in language lateralization rather contribute to activity-dependent cognitive processes. Disease association analysis revealed associations of genes involved in handedness with diseases affecting the whole body, while genes involved in language lateralization were specifically engaged in mental and neurological diseases. These findings further support the idea that handedness and language lateralization are ontogenetically independent, complex phenotypes.
The Functional Genetics of Handedness and Language Lateralization: Insights from Gene Ontology, Pathway and Disease Association Analyses

PubMed Central

Schmitz, Judith; Lor, Stephanie; Klose, Rena; Güntürkün, Onur; Ocklenburg, Sebastian

2017-01-01

Handedness and language lateralization are partially determined by genetic influences. It has been estimated that at least 40 (and potentially more) possibly interacting genes may influence the ontogenesis of hemispheric asymmetries. Recently, it has been suggested that analyzing the genetics of hemispheric asymmetries on the level of gene ontology sets, rather than at the level of individual genes, might be more informative for understanding the underlying functional cascades. Here, we performed gene ontology, pathway and disease association analyses on genes that have previously been associated with handedness and language lateralization. Significant gene ontology sets for handedness were anatomical structure development, pattern specification (especially asymmetry formation) and biological regulation. Pathway analysis highlighted the importance of the TGF-beta signaling pathway for handedness ontogenesis. Significant gene ontology sets for language lateralization were responses to different stimuli, nervous system development, transport, signaling, and biological regulation. Despite the fact that some authors assume that handedness and language lateralization share a common ontogenetic basis, gene ontology sets barely overlap between phenotypes. Compared to genes involved in handedness, which mostly contribute to structural development, genes involved in language lateralization rather contribute to activity-dependent cognitive processes. Disease association analysis revealed associations of genes involved in handedness with diseases affecting the whole body, while genes involved in language lateralization were specifically engaged in mental and neurological diseases. These findings further support the idea that handedness and language lateralization are ontogenetically independent, complex phenotypes. PMID:28729848
Identification of a feather β-keratin gene exclusively expressed in pennaceous barbule cells of contour feathers in chicken.

PubMed

Kowata, Kinue; Nakaoka, Minori; Nishio, Kaori; Fukao, Ayaka; Satoh, Akira; Ogoshi, Maho; Takahashi, Sumio; Tsudzuki, Masaoki; Takeuchi, Sakae

2014-05-25

Feathers are elaborate skin appendages shared by birds and theropod dinosaurs that have hierarchical branching of the rachis, barbs, and barbules. Feather filaments consist of β-keratins encoded by multiple genes, most of which are located in tandem arrays on chromosomes 2, 25, and 27 in chicken. The expansion of the genes is thought to have contributed to feather evolution; however, it is unclear how the individual genes are involved in feather formation. The aim of the present study was to identify feather keratin genes involved in the formation of barbules. Using a combination of microarray analysis, reverse-transcription polymerase chain reaction, and in situ hybridization, we found an uncharacterized keratin gene on chromosome 7 that was expressed specifically in barbule cells in regenerating chicken feathers. We have named the gene barbule specific keratin 1 (BlSK1). The BlSK1 gene structure was similar to the gene structure of previously characterized feather keratin genes, and consisted of a non-coding leader exon, an intron, and an exon with an open reading frame (ORF). The ORF was predicted to encode a 98 aa long protein, which shared 59% identity with feather keratin B. Orthologs of BlSK1 were found in the genomes of other avian species, including turkey, duck, zebra finch, and flycatcher, in regions that shared synteny with chromosome 7 of chicken. Interestingly, BlSK1 was expressed in feather follicles that generated pennaceous barbules but not in follicles that generated plumulaceous barbules. These results suggested that the composition of feather keratins probably varies depending on the structure of the feather filaments and, that individual feather keratin genes may be involved in building different portions and/or types of feathers in chicken. Copyright © 2014 Elsevier B.V. All rights reserved.
Structural and Biochemical Characterization of a Novel Aminopeptidase from Human Intestine

DOE PAGES

Tykvart, Jan; Bařinka, Cyril; Svoboda, Michal; ...

2015-03-09

N-acetylated α-linked acidic dipeptidase-like protein (NAALADase L), encoded by the NAALADL1 gene, is a close homolog of glutamate carboxypeptidase II, a metallopeptidase that has been intensively studied as a target for imaging and therapy of solid malignancies and neuropathologies. However, neither the physiological functions nor structural features of NAALADase L are known at present. In this paper, we report a thorough characterization of the protein product of the human NAALADL1 gene, including heterologous overexpression and purification, structural and biochemical characterization, and analysis of its expression profile. By solving the NAALADase L x-ray structure, we provide the first experimental evidence thatmore » it is a zinc-dependent metallopeptidase with a catalytic mechanism similar to that of glutamate carboxypeptidase II yet distinct substrate specificity. A proteome-based assay revealed that the NAALADL1 gene product possesses previously unrecognized aminopeptidase activity but no carboxy- or endopeptidase activity. These findings were corroborated by site-directed mutagenesis and identification of bestatin as a potent inhibitor of the enzyme. Analysis of NAALADL1 gene expression at both the mRNA and protein levels revealed the small intestine as the major site of protein expression and points toward extensive alternative splicing of the NAALADL1 gene transcript. Taken together, our data imply that the NAALADL1 gene product's primary physiological function is associated with the final stages of protein/peptide digestion and absorption in the human digestive system. Finally, based on these results, we suggest a new name for this enzyme: human ileal aminopeptidase (HILAP).« less
Aldosterone alters the chromatin structure of the murine endothelin-1 gene.

PubMed

Welch, Amanda K; Jeanette Lynch, I; Gumz, Michelle L; Cain, Brian D; Wingo, Charles S

2016-08-15

Aldosterone increases sodium reabsorption in the renal collecting duct and systemic blood pressure. Paradoxically, aldosterone also induces transcription of the endothelin-1 (Edn1) gene to increase protein (ET-1) levels, which inhibits sodium reabsorption. Here we investigated changes in the chromatin structure of the Edn1 gene of collecting duct cell lines in response to aldosterone treatment. The Edn1 gene has a CpG island that encompasses the transcription start site and four sites in the 5' regulatory region previously linked to transcriptional regulation. The chromatin structure of the Edn1 gene was investigated using a quantitative PCR-based DNaseI hypersensitivity assay in murine hepatocyte (AML12), renal cortical collecting duct (mpkCCDC14), outer medullary collecting duct1 (OMCD1), and inner medullary collecting duct-3 (IMCD-3) cell lines. The CpG island was uniformly accessible. One calcium-responsive NFAT element remained at low chromatin accessibility in all cell lines under all conditions tested. However, the second calcium responsive NFAT element located at -1563bp upstream became markedly more accessible in IMCD-3 cells exposed to aldosterone. Importantly, one established aldosterone hormone response element HRE at -671bp relative to the transcription start site was highly accessible, and another HRE (-551bp) became more accessible in aldosterone-treated IMCD-3 and OMCD1 cells. The evidence supports a model in which aldosterone activation of the mineralocorticoid receptor (MR) results in the MR-hormone complex binding at HRE at -671bp to open chromatin structure around other regulatory elements in the Edn1 gene. Published by Elsevier Inc.

Exploring codon context bias for synthetic gene design of a thermostable invertase in Escherichia coli.

PubMed

Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup

2015-01-01

Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Demographic History, Population Structure, and Local Adaptation in Alpine Populations of Cardamine impatiens and Cardamine resedifolia

PubMed Central

Ometto, Lino; Li, Mingai; Bresadola, Luisa; Barbaro, Enrico; Neteler, Markus; Varotto, Claudio

2015-01-01

Species evolution depends on numerous and distinct forces, including demography and natural selection. For example, local adaptation and population structure affect the evolutionary history of species living along environmental clines. This is particularly relevant in plants, which are often characterized by limited dispersal ability and the need to respond to abiotic and biotic stress factors specific to the local environment. Here we study the demographic history and the possible existence of local adaptation in two related species of Brassicaceae, Cardamine impatiens and Cardamine resedifolia, which occupy separate habitats along the elevation gradient. Previous genome-wide analyses revealed the occurrence of distinct selective pressures in the two species, with genes involved in cold response evolving particularly fast in C. resedifolia. In this study we surveyed patterns of molecular evolution and genetic variability in a set of 19 genes, including neutral and candidate genes involved in cold response, across 10 populations each of C. resedifolia and C. impatiens from the Italian Alps (Trentino). We inferred the population structure and demographic history of the two species, and tested the occurrence of signatures of local adaptation in these genes. The results indicate that, despite a slightly higher population differentiation in C. resedifolia than in C. impatiens, both species are only weakly structured and that populations sampled at high altitude experience less gene flow than low-altitude ones. None of the genes showed signatures of positive selection, suggesting that they do not seem to play relevant roles in the current evolutionary processes of adaptation to alpine environments of these species. PMID:25933225
Structural analysis of chromosomal rearrangements associated with the developmental mutations Ph, W19H, and Rw on mouse chromosome 5.

PubMed Central

Nagle, D L; Martin-DeLeon, P; Hough, R B; Bućan, M

1994-01-01

We are studying the chromosomal structure of three developmental mutations, dominant spotting (W), patch (Ph), and rump white (Rw) on mouse chromosome 5. These mutations are clustered in a region containing three genes encoding tyrosine kinase receptors (Kit, Pdgfra, and Flk1). Using probes for these genes and for a closely linked locus, D5Mn125, we established a high-resolution physical map covering approximately 2.8 Mb. The entire chromosomal segment mapped in this study is deleted in the W19H mutation. The map indicates the position of the Ph deletion, which encompasses not more than 400 kb around and including the Pdgfra gene. The map also places the distal breakpoint of the Rw inversion to a limited chromosomal segment between Kit and Pdgfra. In light of the structure of the Ph-W-Rw region, we interpret the previously published complementation analyses as indicating that the pigmentation defect in Rw/+ heterozygotes could be due to the disruption of Kit and/or Pdgfra regulatory sequences, whereas the gene(s) responsible for the recessive lethality of Rw/Rw embryos is not closely linked to the Ph and W loci and maps proximally to the W19H deletion. The structural analysis of chromosomal rearrangements associated with W19H, Ph, and Rw combined with the high-resolution physical mapping points the way toward the definition of these mutations in molecular terms and isolation of homologous genes on human chromosome 4. Images PMID:8041773
Structural characterization and chromosomal location of the mouse macrophage migration inhibitory factor gene and pseudogenes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bozza, M.; Gerard, C.; Kolakowski, L.F. Jr.

1995-06-10

Macrophage migration inhibitory factor, MIF, is a cytokine released by T-lymphocytes, macrophages, and the pituitary gland that serves to integrate peripheral and central inflammatory responses. Ubiquitous expression and developmental regulation suggest that MIF may have additional roles outside of the immune system. Here we report the structure and chromosomal location of the mouse Mif gene and the partial characterization of five Mif pseudogenes. The mouse Mif gene spans less than 0.7 kb of chromosomal DNA and is composed of three exons. A comparison between the mouse and the human genes shows a similar gene structure and common regulatory elements inmore » both promoter regions. The mouse Mif gene maps to the middle region of chromosome 10, between Bcr and S100b, which have been mapped to human chromosomes 22q11 and 21q22.3, respectively. The entire sequence of two pseudogenes demonstrates the absence of introns, the presence of the 5{prime} untranslated region of the cDNA, a 3{prime} poly(A) tail, and the lack of sequence similarity with untranscribed regions of the gene. The five pseudogenes are highly homologous to the cDNA, but contain a variable number of mutations that would produce mutated or truncated MIF-like proteins. Phylogenetic analyses of MIF genes and pseudogenes indicate several independent genetic events that can account for multiple genomic integrations. Three of the Mif pseudogenes were also mapped by interspecific backcross to chromosomes 1, 9, and 17. These results suggest that Mif pseudogenes originated by retrotransposition. 46 refs., 5 figs., 1 tab.« less
Combined sequence and sequence-structure-based methods for analyzing RAAS gene SNPs: a computational approach.

PubMed

Singh, Kh Dhanachandra; Karthikeyan, Muthusamy

2014-12-01

The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations on the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also serve as antihypertensive therapeutic targets and diagnostic markers. The methods for analyzing the SNPs from the huge dataset of SNPs, containing both functional and neutral SNPs is challenging by the experimental approach on every SNPs to determine their biological significance. To explore the functional significance of genetic mutation (SNPs), we adopted combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region and remaining in the non-coding region. In this study, we are reporting only those SNPs in coding region to be deleterious when three or more tools are predicted to be deleterious and which have high RMSD from the native structure. Based on these analyses, we have identified two SNPs of REN gene, eight SNPs of AGT gene, three SNPs of ACE gene, two SNPs of AT1R gene, three SNPs of CYP11B2 gene and three SNPs of CMA1 gene in the coding region were found to be deleterious. Further this type of study will be helpful in reducing the cost and time for identification of potential SNP and also helpful in selecting potential SNP for experimental study out of SNP pool.
Association mapping of starch chain length distribution and amylose content in pea (Pisum sativum L.) using carbohydrate metabolism candidate genes.

PubMed

Carpenter, Margaret A; Shaw, Martin; Cooper, Rebecca D; Frew, Tonya J; Butler, Ruth C; Murray, Sarah R; Moya, Leire; Coyne, Clarice J; Timmerman-Vaughan, Gail M

2017-08-01

Although starch consists of large macromolecules composed of glucose units linked by α-1,4-glycosidic linkages with α-1,6-glycosidic branchpoints, variation in starch structural and functional properties is found both within and between species. Interest in starch genetics is based on the importance of starch in food and industrial processes, with the potential of genetics to provide novel starches. The starch metabolic pathway is complex but has been characterized in diverse plant species, including pea. To understand how allelic variation in the pea starch metabolic pathway affects starch structure and percent amylose, partial sequences of 25 candidate genes were characterized for polymorphisms using a panel of 92 diverse pea lines. Variation in the percent amylose composition of extracted seed starch and (amylopectin) chain length distribution, one measure of starch structure, were characterized for these lines. Association mapping was undertaken to identify polymorphisms associated with the variation in starch chain length distribution and percent amylose, using a mixed linear model that incorporated population structure and kinship. Associations were found for polymorphisms in seven candidate genes plus Mendel's r locus (which conditions the round versus wrinkled seed phenotype). The genes with associated polymorphisms are involved in the substrate supply, chain elongation and branching stages of the pea carbohydrate and starch metabolic pathways. The association of polymorphisms in carbohydrate and starch metabolic genes with variation in amylopectin chain length distribution and percent amylose may help to guide manipulation of pea seed starch structural and functional properties through plant breeding.
An in vitro recombination-based reverse genetic system for rapid mutagenesis of structural genes of the Japanese encephalitis virus.

PubMed

Du, Ruikun; Wang, Manli; Hu, Zhihong; Wang, Hualin; Deng, Fei

2015-10-01

Japanese encephalitis virus (JEV) is one of the most common pathogens of severe viral encephalitis, which is a severe threat to human health. Despite instability of the JEV genome in bacteria, many strategies have been developed to establish molecular clone systems of JEV, providing convenient tools for studying the virus life cycle and virus-host interactions. In this study, we adapted an In-Fusion enzyme-based in vitro recombination method to construct a reverse genetic system of JEV, thereby providing a rapid approach to introduce mutations into the structural genes. A truncated genome without the structural genes was constructed as the backbone, and the complementary segment containing the structural genes was recombined in vitro, which was then transfected directly into virus-permissive cells. The progeny of the infectious virus was successfully detected in the supernatant of the transfected cells, and showed an identical phenotype to its parental virus. To provide a proof-of-principle, the 12 conserved cysteine residues in the envelope (E) protein of JEV were respectively mutated using this approach, and all mutations resulted in a complete failure to generate infectious virus. However, a leucine-tophenylanine mutation at amino acid 107 of the E protein did not interfere with the production of the infectious virus. These results suggested that all 12 cysteines in the E protein are essential for the JEV life cycle. In summary, a novel reverse genetic system of JEV was established for rapidly introducing mutations into structural genes, which will serve as a useful tool for functional studies.
A Subset of Autism-Associated Genes Regulate the Structural Stability of Neurons

PubMed Central

Lin, Yu-Chih; Frei, Jeannine A.; Kilander, Michaela B. C.; Shen, Wenjuan; Blatt, Gene J.

2016-01-01

Autism spectrum disorder (ASD) comprises a range of neurological conditions that affect individuals’ ability to communicate and interact with others. People with ASD often exhibit marked qualitative difficulties in social interaction, communication, and behavior. Alterations in neurite arborization and dendritic spine morphology, including size, shape, and number, are hallmarks of almost all neurological conditions, including ASD. As experimental evidence emerges in recent years, it becomes clear that although there is broad heterogeneity of identified autism risk genes, many of them converge into similar cellular pathways, including those regulating neurite outgrowth, synapse formation and spine stability, and synaptic plasticity. These mechanisms together regulate the structural stability of neurons and are vulnerable targets in ASD. In this review, we discuss the current understanding of those autism risk genes that affect the structural connectivity of neurons. We sub-categorize them into (1) cytoskeletal regulators, e.g., motors and small RhoGTPase regulators; (2) adhesion molecules, e.g., cadherins, NCAM, and neurexin superfamily; (3) cell surface receptors, e.g., glutamatergic receptors and receptor tyrosine kinases; (4) signaling molecules, e.g., protein kinases and phosphatases; and (5) synaptic proteins, e.g., vesicle and scaffolding proteins. Although the roles of some of these genes in maintaining neuronal structural stability are well studied, how mutations contribute to the autism phenotype is still largely unknown. Investigating whether and how the neuronal structure and function are affected when these genes are mutated will provide insights toward developing effective interventions aimed at improving the lives of people with autism and their families. PMID:27909399
Structural and Functional Characterization of a Caenorhabditis elegans Genetic Interaction Network within Pathways

PubMed Central

Boucher, Benjamin; Lee, Anna Y.; Hallett, Michael; Jenna, Sarah

2016-01-01

A genetic interaction (GI) is defined when the mutation of one gene modifies the phenotypic expression associated with the mutation of a second gene. Genome-wide efforts to map GIs in yeast revealed structural and functional properties of a GI network. This provided insights into the mechanisms underlying the robustness of yeast to genetic and environmental insults, and also into the link existing between genotype and phenotype. While a significant conservation of GIs and GI network structure has been reported between distant yeast species, such a conservation is not clear between unicellular and multicellular organisms. Structural and functional characterization of a GI network in these latter organisms is consequently of high interest. In this study, we present an in-depth characterization of ~1.5K GIs in the nematode Caenorhabditis elegans. We identify and characterize six distinct classes of GIs by examining a wide-range of structural and functional properties of genes and network, including co-expression, phenotypical manifestations, relationship with protein-protein interaction dense subnetworks (PDS) and pathways, molecular and biological functions, gene essentiality and pleiotropy. Our study shows that GI classes link genes within pathways and display distinctive properties, specifically towards PDS. It suggests a model in which pathways are composed of PDS-centric and PDS-independent GIs coordinating molecular machines through two specific classes of GIs involving pleiotropic and non-pleiotropic connectors. Our study provides the first in-depth characterization of a GI network within pathways of a multicellular organism. It also suggests a model to understand better how GIs control system robustness and evolution. PMID:26871911
K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries

PubMed Central

Jangid, Kamlesh; Kao, Ming-Hung; Lahamge, Aishwarya; Williams, Mark A.; Rathbun, Stephen L.; Whitman, William B.

2016-01-01

K-shuff is a new algorithm for comparing the similarity of gene sequence libraries, providing measures of the structural and compositional diversity as well as the significance of the differences between these measures. Inspired by Ripley’s K-function for spatial point pattern analysis, the Intra K-function or IKF measures the structural diversity, including both the richness and overall similarity of the sequences, within a library. The Cross K-function or CKF measures the compositional diversity between gene libraries, reflecting both the number of OTUs shared as well as the overall similarity in OTUs. A Monte Carlo testing procedure then enables statistical evaluation of both the structural and compositional diversity between gene libraries. For 16S rRNA gene libraries from complex bacterial communities such as those found in seawater, salt marsh sediments, and soils, K-shuff yields reproducible estimates of structural and compositional diversity with libraries greater than 50 sequences. Similarly, for pyrosequencing libraries generated from a glacial retreat chronosequence and Illumina® libraries generated from US homes, K-shuff required >300 and 100 sequences per sample, respectively. Power analyses demonstrated that K-shuff is sensitive to small differences in Sanger or Illumina® libraries. This extra sensitivity of K-shuff enabled examination of compositional differences at much deeper taxonomic levels, such as within abundant OTUs. This is especially useful when comparing communities that are compositionally very similar but functionally different. K-shuff will therefore prove beneficial for conventional microbiome analysis as well as specific hypothesis testing. PMID:27911946
Improving Interpretation of Cardiac Phenotypes and Enhancing Discovery With Expanded Knowledge in the Gene Ontology

PubMed Central

Roncaglia, Paola; Howe, Douglas G.; Laulederkind, Stanley J.F.; Khodiyar, Varsha K.; Berardini, Tanya Z.; Tweedie, Susan; Foulger, Rebecca E.; Osumi-Sutherland, David; Campbell, Nancy H.; Huntley, Rachael P.; Talmud, Philippa J.; Blake, Judith A.; Breckenridge, Ross; Riley, Paul R.; Lambiase, Pier D.; Elliott, Perry M.; Clapp, Lucie; Tinker, Andrew; Hill, David P.

2018-01-01

Background: A systems biology approach to cardiac physiology requires a comprehensive representation of how coordinated processes operate in the heart, as well as the ability to interpret relevant transcriptomic and proteomic experiments. The Gene Ontology (GO) Consortium provides structured, controlled vocabularies of biological terms that can be used to summarize and analyze functional knowledge for gene products. Methods and Results: In this study, we created a computational resource to facilitate genetic studies of cardiac physiology by integrating literature curation with attention to an improved and expanded ontological representation of heart processes in the Gene Ontology. As a result, the Gene Ontology now contains terms that comprehensively describe the roles of proteins in cardiac muscle cell action potential, electrical coupling, and the transmission of the electrical impulse from the sinoatrial node to the ventricles. Evaluating the effectiveness of this approach to inform data analysis demonstrated that Gene Ontology annotations, analyzed within an expanded ontological context of heart processes, can help to identify candidate genes associated with arrhythmic disease risk loci. Conclusions: We determined that a combination of curation and ontology development for heart-specific genes and processes supports the identification and downstream analysis of genes responsible for the spread of the cardiac action potential through the heart. Annotating these genes and processes in a structured format facilitates data analysis and supports effective retrieval of gene-centric information about cardiac defects. PMID:29440116
Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

PubMed

Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui

2016-01-01

WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.
Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

PubMed

Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

2014-10-01

MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.
Improving Interpretation of Cardiac Phenotypes and Enhancing Discovery With Expanded Knowledge in the Gene Ontology.

PubMed

Lovering, Ruth C; Roncaglia, Paola; Howe, Douglas G; Laulederkind, Stanley J F; Khodiyar, Varsha K; Berardini, Tanya Z; Tweedie, Susan; Foulger, Rebecca E; Osumi-Sutherland, David; Campbell, Nancy H; Huntley, Rachael P; Talmud, Philippa J; Blake, Judith A; Breckenridge, Ross; Riley, Paul R; Lambiase, Pier D; Elliott, Perry M; Clapp, Lucie; Tinker, Andrew; Hill, David P

2018-02-01

A systems biology approach to cardiac physiology requires a comprehensive representation of how coordinated processes operate in the heart, as well as the ability to interpret relevant transcriptomic and proteomic experiments. The Gene Ontology (GO) Consortium provides structured, controlled vocabularies of biological terms that can be used to summarize and analyze functional knowledge for gene products. In this study, we created a computational resource to facilitate genetic studies of cardiac physiology by integrating literature curation with attention to an improved and expanded ontological representation of heart processes in the Gene Ontology. As a result, the Gene Ontology now contains terms that comprehensively describe the roles of proteins in cardiac muscle cell action potential, electrical coupling, and the transmission of the electrical impulse from the sinoatrial node to the ventricles. Evaluating the effectiveness of this approach to inform data analysis demonstrated that Gene Ontology annotations, analyzed within an expanded ontological context of heart processes, can help to identify candidate genes associated with arrhythmic disease risk loci. We determined that a combination of curation and ontology development for heart-specific genes and processes supports the identification and downstream analysis of genes responsible for the spread of the cardiac action potential through the heart. Annotating these genes and processes in a structured format facilitates data analysis and supports effective retrieval of gene-centric information about cardiac defects. © 2018 The Authors.
Mobile genes in the human microbiome are structured from global to individual scales

PubMed Central

Brito, IL; Jupiter, SD; Jenkins, AP; Naisilisili, W; Tamminen, M; Smillie, CS; Wortman, JR; Birren, BW; Xavier, RJ; Blainey, PC; Singh, AK; Gevers, D; Alm, EJ

2016-01-01

Recent work has underscored the importance of the microbiome in human health, largely attributing differences in phenotype to differences in the species present across individuals1,2,3,4,5. But mobile genes can confer profoundly different phenotypes on different strains of the same species. Little is known about the function and distribution of mobile genes in the human microbiome, and in particular whether the gene pool is globally homogenous or constrained by human population structure. Here, we investigate this question by comparing the mobile genes found in the microbiomes of 81 metropolitan North Americans with that of 172 agrarian Fiji islanders using a combination of single-cell genomics and metagenomics. We find large differences in mobile gene content between the Fijian and North American microbiomes, with functional variation that mirrors known dietary differences such as the excess of plant-based starch degradation genes. Remarkably, differences are also observed between the mobile gene pools of proximal Fijian villages, even though microbiome composition across villages is similar. Finally, we observe high rates of recombination leading to individual-specific mobile elements, suggesting that the abundance of some genes may reflect environmental selection rather than dispersal limitation. Together, these data support the hypothesis that human activities and behaviors provide selective pressures that shape mobile gene pools, and that acquisition of mobile genes is important to colonizing specific human populations. PMID:27409808
Molecular identification of aiiA homologous gene from endophytic Enterobacter species and in silico analysis of putative tertiary structure of AHL-lactonase.

PubMed

Rajesh, P S; Rai, V Ravishankar

2014-01-03

The aiiA homologous gene known to encode AHL- lactonase enzyme which hydrolyze the N-acylhomoserine lactone (AHL) quorum sensing signaling molecules produced by Gram negative bacteria. In this study, the degradation of AHL molecules was determined by cell-free lysate of endophytic Enterobacter species. The percentage of quorum quenching was confirmed and quantified by HPLC method (p<0.0001). Amplification and sequence BLAST analysis showed the presence of aiiA homologous gene in endophytic Enterobacter asburiae VT65, Enterobacter aerogenes VT66 and Enterobacter ludwigii VT70 strains. Sequence alignment analysis revealed the presence of two zinc binding sites, "HXHXDH" motif as well as tyrosine residue at the position 194. Based on known template available at Swiss-Model, putative tertiary structure of AHL-lactonase was constructed. The result showed that novel endophytic strains of Enterobacter genera encode the novel aiiA homologous gene and its structural importance for future study. Copyright © 2013 Elsevier Inc. All rights reserved.
Recent progress in elucidating the structure, function and evolution of disease resistance genes in plants.

PubMed

Liu, Jinling; Liu, Xionglun; Dai, Liangying; Wang, Guoliang

2007-09-01

Plants employ multifaceted mechanisms to fight with numerous pathogens in nature. Resistance (R) genes are the most effective weapons against pathogen invasion since they can specifically recognize the corresponding pathogen effectors or associated protein(s) to activate plant immune responses at the site of infection. Up to date, over 70 R genes have been isolated from various plant species. Most R proteins contain conserved motifs such as nucleotide-binding site (NBS), leucine-rich repeat (LRR), Toll-interleukin-1 receptor domain (TIR, homologous to cytoplasmic domains of the Drosophila Toll protein and the mammalian interleukin-1 receptor), coiled-coil (CC) or leucine zipper (LZ) structure and protein kinase domain (PK). Recent results indicate that these domains play significant roles in R protein interactions with effector proteins from pathogens and in activating signal transduction pathways involved in innate immunity. This review highlights an overview of the recent progress in elucidating the structure, function and evolution of the isolated R genes in different plant-pathogen interaction systems.
Identifying the genes of unconventional high temperature superconductors.

PubMed

Hu, Jiangping

We elucidate a recently emergent framework in unifying the two families of high temperature (high [Formula: see text]) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredients (gene) of high [Formula: see text] superconductors is a quasi two dimensional electronic environment in which the d -orbitals of cations that participate in strong in-plane couplings to the p -orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high [Formula: see text] superconductors are so rare. An explicit prediction is made to realize high [Formula: see text] superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.
Structural Heterogeneity and Functional Domains of Murine Immunoglobulin G Fc Receptors

NASA Astrophysics Data System (ADS)

Ravetch, Jeffrey V.; Luster, Andrew D.; Weinshank, Richard; Kochan, Jarema; Pavlovec, Amalia; Portnoy, Daniel A.; Hulmes, Jeffrey; Pan, Yu-Ching E.; Unkeless, Jay C.

1986-11-01

Binding of antibodies to effector cells by way of receptors to their constant regions (Fc receptors) is central to the pathway that leads to clearance of antigens by the immune system. The structure and function of this important class of receptors on immune cells is addressed through the molecular characterization of Fc receptors (FcR) specific for the murine immunoglobulin G isotype. Structural diversity is encoded by two genes that by alternative splicing result in expression of molecules with highly conserved extracellular domains and different transmembrane and intracytoplasmic domains. The proteins encoded by these genes are members of the immunoglobulin supergene family, most homologous to the major histocompatibility complex molecule Eβ. Functional reconstitution of ligand binding by transfection of individual FcR genes demonstrates that the requirements for ligand binding are encoded in a single gene. These studies demonstrate the molecular basis for the functional heterogeneity of FcR's, accounting for the possible transduction of different signals in response to a single ligand.
The centrality of RNA for engineering gene expression

PubMed Central

Chappell, James; Takahashi, Melissa K; Meyer, Sarai; Loughrey, David; Watters, Kyle E; Lucks, Julius

2013-01-01

Synthetic biology holds promise as both a framework for rationally engineering biological systems and a way to revolutionize how we fundamentally understand them. Essential to realizing this promise is the development of strategies and tools to reliably and predictably control and characterize sophisticated patterns of gene expression. Here we review the role that RNA can play towards this goal and make a case for why this versatile, designable, and increasingly characterizable molecule is one of the most powerful substrates for engineering gene expression at our disposal. We discuss current natural and synthetic RNA regulators of gene expression acting at key points of control – transcription, mRNA degradation, and translation. We also consider RNA structural probing and computational RNA structure predication tools as a way to study RNA structure and ultimately function. Finally, we discuss how next-generation sequencing methods are being applied to the study of RNA and to the characterization of RNA's many properties throughout the cell. PMID:24124015

Genome Structure of the Legume, Lotus japonicus

PubMed Central

Sato, Shusei; Nakamura, Yasukazu; Kaneko, Takakazu; Asamizu, Erika; Kato, Tomohiko; Nakao, Mitsuteru; Sasamoto, Shigemi; Watanabe, Akiko; Ono, Akiko; Kawashima, Kumiko; Fujishiro, Tsunakazu; Katoh, Midori; Kohara, Mitsuyo; Kishida, Yoshie; Minami, Chiharu; Nakayama, Shinobu; Nakazaki, Naomi; Shimizu, Yoshimi; Shinpo, Sayaka; Takahashi, Chika; Wada, Tsuyuko; Yamada, Manabu; Ohmido, Nobuko; Hayashi, Makoto; Fukui, Kiichi; Baba, Tomoya; Nakamichi, Tomoko; Mori, Hirotada; Tabata, Satoshi

2008-01-01

The legume Lotus japonicus has been widely used as a model system to investigate the genetic background of legume-specific phenomena such as symbiotic nitrogen fixation. Here, we report structural features of the L. japonicus genome. The 315.1-Mb sequences determined in this and previous studies correspond to 67% of the genome (472 Mb), and are likely to cover 91.3% of the gene space. Linkage mapping anchored 130-Mb sequences onto the six linkage groups. A total of 10 951 complete and 19 848 partial structures of protein-encoding genes were assigned to the genome. Comparative analysis of these genes revealed the expansion of several functional domains and gene families that are characteristic of L. japonicus. Synteny analysis detected traces of whole-genome duplication and the presence of synteny blocks with other plant genomes to various degrees. This study provides the first opportunity to look into the complex and unique genetic system of legumes. PMID:18511435
The chloroplast tRNALys(UUU) gene from mustard (Sinapis alba) contains a class II intron potentially coding for a maturase-related polypeptide.

PubMed

Neuhaus, H; Link, G

1987-01-01

The trnK gene endocing the tRNALys(UUU) has been located on mustard (Sinapis alba) chloroplast DNA, 263 bp upstream of the psbA gene on the same strand. The nucleotide sequence of the trnK gene and its flanking regions as well as the putative transcription start and termination sites are shown. The 5' end of the transcript lies 121 bp upstream of the 5' tRNA coding region and is preceded by procaryotic-type "-10" and "-35" sequence elements, while the 3' end maps 2.77 kb downstream to a DNA region with possible stemloop secondary structure. The anticodon loop of the tRNALys is interrupted by a 2,574 bp intron containing a long open reading frame, which codes for 524 amino acids. Based on conserved stem and loop structures, this intron has characteristic features of a class II intron. A region near the carboxyl terminus of the derived polypeptide appears structurally related to maturases.
Two-component signal transduction systems of Xanthomonas spp.: a lesson from genomics.

PubMed

Qian, Wei; Han, Zhong-Ji; He, Chaozu

2008-02-01

The two-component signal transduction systems (TCSTSs), consisting of a histidine kinase sensor (HK) and a response regulator (RR), are the dominant molecular mechanisms by which prokaryotes sense and respond to environmental stimuli. Genomes of Xanthomonas generally contain a large repertoire of TCSTS genes (approximately 92 to 121 for each genome), which encode diverse structural groups of HKs and RRs. Among them, although a core set of 70 TCSTS genes (about two-thirds in total) which accumulates point mutations with a slow rate are shared by these genomes, the other genes, especially hybrid HKs, experienced extensive genetic recombination, including genomic rearrangement, gene duplication, addition or deletion, and fusion or fission. The recombinations potentially promote the efficiency and complexity of TCSTSs in regulating gene expression. In addition, our analysis suggests that a co-evolutionary model, rather than a selfish operon model, is the major mechanism for the maintenance and microevolution of TCSTS genes in the genomes of Xanthomonas. Genomic annotation, secondary protein structure prediction, and comparative genomic analyses of TCSTS genes reviewed here provide insights into our understanding of signal networks in these important phytopathogenic bacteria.
Normalization of Reverse Transcription Quantitative PCR Data During Ageing in Distinct Cerebral Structures.

PubMed

Bruckert, G; Vivien, D; Docagne, F; Roussel, B D

2016-04-01

Reverse transcription quantitative-polymerase chain reaction (RT-qPCR) has become a routine method in many laboratories. Normalization of data from experimental conditions is critical for data processing and is usually achieved by the use of a single reference gene. Nevertheless, as pointed by the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guidelines, several reference genes should be used for reliable normalization. Ageing is a physiological process that results in a decline of many expressed genes. Reliable normalization of RT-qPCR data becomes crucial when studying ageing. Here, we propose a RT-qPCR study from four mouse brain regions (cortex, hippocampus, striatum and cerebellum) at different ages (from 8 weeks to 22 months) in which we studied the expression of nine commonly used reference genes. With the use of two different algorithms, we found that all brain structures need at least two genes for a good normalization step. We propose specific pairs of gene for efficient data normalization in the four brain regions studied. These results underline the importance of reliable reference genes for specific brain regions in ageing.
Genomic Structure of an Economically Important Cyanobacterium, Arthrospira (Spirulina) platensis NIES-39

PubMed Central

Fujisawa, Takatomo; Narikawa, Rei; Okamoto, Shinobu; Ehira, Shigeki; Yoshimura, Hidehisa; Suzuki, Iwane; Masuda, Tatsuru; Mochimaru, Mari; Takaichi, Shinichi; Awai, Koichiro; Sekine, Mitsuo; Horikawa, Hiroshi; Yashiro, Isao; Omata, Seiha; Takarada, Hiromi; Katano, Yoko; Kosugi, Hiroki; Tanikawa, Satoshi; Ohmori, Kazuko; Sato, Naoki; Ikeuchi, Masahiko; Fujita, Nobuyuki; Ohmori, Masayuki

2010-01-01

A filamentous non-N2-fixing cyanobacterium, Arthrospira (Spirulina) platensis, is an important organism for industrial applications and as a food supply. Almost the complete genome of A. platensis NIES-39 was determined in this study. The genome structure of A. platensis is estimated to be a single, circular chromosome of 6.8 Mb, based on optical mapping. Annotation of this 6.7 Mb sequence yielded 6630 protein-coding genes as well as two sets of rRNA genes and 40 tRNA genes. Of the protein-coding genes, 78% are similar to those of other organisms; the remaining 22% are currently unknown. A total 612 kb of the genome comprise group II introns, insertion sequences and some repetitive elements. Group I introns are located in a protein-coding region. Abundant restriction-modification systems were determined. Unique features in the gene composition were noted, particularly in a large number of genes for adenylate cyclase and haemolysin-like Ca2+-binding proteins and in chemotaxis proteins. Filament-specific genes were highlighted by comparative genomic analysis. PMID:20203057
A web application for automatic prediction of gene translation elongation efficiency.

PubMed

Sokolov, Vladimir S; Zuraev, Bulat S; Lashin, Sergei A; Matushkin, Yury G

2015-03-01

Expression efficiency is one of the major characteristics describing genes in various modern investigations. Expression efficiency of genes is regulated at various stages: transcription, translation, posttranslational protein modification and others. In this study, a special EloE (Elongation Efficiency) web application is described. The EloE sorts the organism's genes in a descend order on their theoretical rate of the elongation stage of translation based on the analysis of their nucleotide sequences. Obtained theoretical data have a significant correlation with available experimental data of gene expression in various organisms. In addition, the program identifies preferential codons in organism's genes and defines distribution of potential secondary structures energy in 5´ and 3´ regions of mRNA. The EloE can be useful in preliminary estimation of translation elongation efficiency for genes for which experimental data are not available yet. Some results can be used, for instance, in other programs modeling artificial genetic structures in genetically engineered experiments. The EloE web application is available at http://www-bionet.sscc.ru:7780/EloE.
Genetic addiction: selfish gene's strategy for symbiosis in the genome.

PubMed

Mochizuki, Atsushi; Yahara, Koji; Kobayashi, Ichizo; Iwasa, Yoh

2006-02-01

The evolution and maintenance of the phenomenon of postsegregational host killing or genetic addiction are paradoxical. In this phenomenon, a gene complex, once established in a genome, programs death of a host cell that has eliminated it. The intact form of the gene complex would survive in other members of the host population. It is controversial as to why these genetic elements are maintained, due to the lethal effects of host killing, or perhaps some other properties are beneficial to the host. We analyzed their population dynamics by analytical methods and computer simulations. Genetic addiction turned out to be advantageous to the gene complex in the presence of a competitor genetic element. The advantage is, however, limited in a population without spatial structure, such as that in a well-mixed liquid culture. In contrast, in a structured habitat, such as the surface of a solid medium, the addiction gene complex can increase in frequency, irrespective of its initial density. Our demonstration that genomes can evolve through acquisition of addiction genes has implications for the general question of how a genome can evolve as a community of potentially selfish genes.
Uncovering the Salt Response of Soybean by Unraveling Its Wild and Cultivated Functional Genomes Using Tag Sequencing

PubMed Central

Ali, Zulfiqar; Zhang, Da Yong; Xu, Zhao Long; Xu, Ling; Yi, Jin Xin; He, Xiao Lan; Huang, Yi Hong; Liu, Xiao Qing; Khan, Asif Ali; Trethowan, Richard M.; Ma, Hong Xiang

2012-01-01

Soil salinity has very adverse effects on growth and yield of crop plants. Several salt tolerant wild accessions and cultivars are reported in soybean. Functional genomes of salt tolerant Glycine soja and a salt sensitive genotype of Glycine max were investigated to understand the mechanism of salt tolerance in soybean. For this purpose, four libraries were constructed for Tag sequencing on Illumina platform. We identify around 490 salt responsive genes which included a number of transcription factors, signaling proteins, translation factors and structural genes like transporters, multidrug resistance proteins, antiporters, chaperons, aquaporins etc. The gene expression levels and ratio of up/down-regulated genes was greater in tolerant plants. Translation related genes remained stable or showed slightly higher expression in tolerant plants under salinity stress. Further analyses of sequenced data and the annotations for gene ontology and pathways indicated that soybean adapts to salt stress through ABA biosynthesis and regulation of translation and signal transduction of structural genes. Manipulation of these pathways may mitigate the effect of salt stress thus enhancing salt tolerance. PMID:23209559
Genomic Comparison of the P-ATPase Gene Family in Four Cotton Species and Their Expression Patterns in Gossypium hirsutum.

PubMed

Chen, Wen; Si, Guo-Yang; Zhao, Gang; Abdullah, Muhammad; Guo, Ning; Li, Da-Hui; Sun, Xu; Cai, Yong-Ping; Lin, Yi; Gao, Jun-Shan

2018-05-05

Plant P-type H⁺-ATPase (P-ATPase) is a membrane protein existing in the plasma membrane that plays an important role in the transmembrane transport of plant cells. To understand the variety and quantity of P-ATPase proteins in different cotton species, we combined four databases from two diploid cotton species ( Gossypium raimondii and G. arboreum ) and two tetraploid cotton species ( G. hirsutum and G. barbadense ) to screen the P-ATPase gene family and resolved the evolutionary relationships between the former cotton species. We identified 53, 51, 99 and 98 P-ATPase genes from G. arboretum, G. raimondii , G. barbadense and G. hirsutum , respectively. The structural and phylogenetic analyses revealed that the gene structure was consistent between P-ATPase genes, with a close evolutionary relationship. The expression analysis of P-ATPase genes showed that many P-ATPase genes were highly expressed in various tissues and at different fiber developmental stages in G. hirsutum , suggesting that they have potential functions during growth and fiber development in cotton.
Metagenomes reveal microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor.

PubMed

Ma, Jinxing; Wang, Zhiwei; Li, Huan; Park, Hee-Deung; Wu, Zhichao

2016-06-01

Metagenomic sequencing was used to investigate the microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor (MBR). The results showed that the microbial community in the MBR was highly diverse. Notably, function analysis of the dominant genera indicated that common genes from different phylotypes were identified for important functional potentials with the observation of variation of abundances of genes in a certain taxon (e.g., Dechloromonas). Despite maintaining similar metabolic functional potentials with a parallel full-scale conventional activated sludge (CAS) system due to treating the identical wastewater, the MBR had more abundant nitrification-related bacteria and coding genes of ammonia monooxygenase, which could well explain its excellent ammonia removal in the low-temperature period. Furthermore, according to quantification of the genes involved in exopolysaccharide and extracellular polymeric substance (EPS) protein metabolism, the MBR did not show a much different potential in producing EPS compared to the CAS system, and bacteria from the membrane biofilm had lower abundances of genes associated with EPS biosynthesis and transport compared to the activated sludge in the MBR.
Differential gene expression patterns in the autogamous plant Hordeum euclaston (Poaceae).

PubMed

Georg-Kraemer, J E; Ferreira, C A S; Cavalli, S S

2011-02-22

Sib-seedlings of 95 strains of the strictly autogamous grass Hordeum euclaston were analyzed by horizontal polyacrylamide gel electrophoresis for four isoenzyme systems at a specific ontogenetic stage. We found differences in the activity of some genes among individuals of this species. Hence, an ontogenetic analysis was carried out to investigate 12 strains at five ontogenetic stages, to determine the patterns of expression of these genes during development. The differences in the presence versus absence of certain isoenzyme bands may be due to differential regulatory activation in response to environmental differences, as all plants showed the same structural genes, although these genes were active in different tissues and/or times of development. These results indicate the importance of differential gene activation in the metabolic phenotype variability of this strictly autogamous, highly homozygous species. The same structural alleles for isoenzymes showed the active form of the enzymes (phenotypic expression) to be present in different tissues and/or stages of development. Differential isoenzyme gene activation was shown to be directly responsible for the enzymatic variability (metabolic phenotype) presented by the plants, which seem to possess almost no heterozygosis.
Structure of the human myelin/oligodendrocyte glycoprotein gene and multiple alternative spliced isoforms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.

1995-09-20

Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localization on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains 3` region, a previously unknownmore » alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further possible involvement if the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.« less
Dynamic network reconstruction from gene expression data applied to immune response during bacterial infection.

PubMed

Guthke, Reinhard; Möller, Ulrich; Hoffmann, Martin; Thies, Frank; Töpfer, Susanne

2005-04-15

The immune response to bacterial infection represents a complex network of dynamic gene and protein interactions. We present an optimized reverse engineering strategy aimed at a reconstruction of this kind of interaction networks. The proposed approach is based on both microarray data and available biological knowledge. The main kinetics of the immune response were identified by fuzzy clustering of gene expression profiles (time series). The number of clusters was optimized using various evaluation criteria. For each cluster a representative gene with a high fuzzy-membership was chosen in accordance with available physiological knowledge. Then hypothetical network structures were identified by seeking systems of ordinary differential equations, whose simulated kinetics could fit the gene expression profiles of the cluster-representative genes. For the construction of hypothetical network structures singular value decomposition (SVD) based methods and a newly introduced heuristic Network Generation Method here were compared. It turned out that the proposed novel method could find sparser networks and gave better fits to the experimental data. Reinhard.Guthke@hki-jena.de.
Using Signature Genes as Tools To Assess Environmental Viral Ecology and Diversity

PubMed Central

Adriaenssens, Evelien M.

2014-01-01

Viruses (including bacteriophages) are the most abundant biological entities on the planet. As such, they are thought to have a major impact on all aspects of microbial community structure and function. Despite this critical role in ecosystem processes, the study of virus/phage diversity has lagged far behind parallel studies of the bacterial and eukaryotic kingdoms, largely due to the absence of any universal phylogenetic marker. Here we review the development and use of signature genes to investigate viral diversity, as a viable strategy for data sets of specific virus groups. Genes that have been used include those encoding structural proteins, such as portal protein, major capsid protein, and tail sheath protein, auxiliary metabolism genes, such as psbA, psbB, and phoH, and several polymerase genes. These marker genes have been used in combination with PCR-based fingerprinting and/or sequencing strategies to investigate spatial, temporal, and seasonal variations and diversity in a wide range of habitats. PMID:24837394
Evolution of substrate specificity in a retained enzyme driven by gene loss

PubMed Central

Juárez-Vázquez, Ana Lilia; Edirisinghe, Janaka N; Verduzco-Castro, Ernesto A; Michalska, Karolina; Wu, Chenggang; Noda-García, Lianet; Babnigg, Gyorgy; Endres, Michael; Medina-Ruíz, Sofía; Santoyo-Flores, Julián; Carrillo-Tripp, Mauricio; Ton-That, Hung; Joachimiak, Andrzej; Henry, Christopher S; Barona-Gómez, Francisco

2017-01-01

The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. We apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to a monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. Our results show how gene loss can drive the evolution of substrate specificity from retained enzymes. DOI: http://dx.doi.org/10.7554/eLife.22679.001 PMID:28362260
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss

DOE PAGES

Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.; ...

2017-03-31

The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Two pheromone precursor genes are transcriptionally expressed in the homothallic ascomycete Sordaria macrospora.

PubMed

Pöggeler, S

2000-06-01

In order to analyze the involvement of pheromones in cell recognition and mating in a homothallic fungus, two putative pheromone precursor genes, named ppg1 and ppg2, were isolated from a genomic library of Sordaria macrospora. The ppg1 gene is predicted to encode a precursor pheromone that is processed by a Kex2-like protease to yield a pheromone that is structurally similar to the alpha-factor of the yeast Saccharomyces cerevisiae. The ppg2 gene encodes a 24-amino-acid polypeptide that contains a putative farnesylated and carboxy methylated C-terminal cysteine residue. The sequences of the predicted pheromones display strong structural similarity to those encoded by putative pheromones of heterothallic filamentous ascomycetes. Both genes are expressed during the life cycle of S. macrospora. This is the first description of pheromone precursor genes encoded by a homothallic fungus. Southern-hybridization experiments indicated that ppg1 and ppg2 homologues are also present in other homothallic ascomycetes.
Evolution of Substrate Specificity in A Retained Enzyme Driven by Gene Loss

DOE Office of Scientific and Technical Information (OSTI.GOV)

Juarez-Vazquez, Ana L.; Edirisinghe, Janaka N.; Verduzco-Castro, Ernesto A.

The connection between gene loss and the functional adaptation of retained proteins is still poorly understood. Here, we apply phylogenomics and metabolic modeling to detect bacterial species that are evolving by gene loss, with the finding that Actinomycetaceae genomes from human cavities are undergoing sizable reductions, including loss of L-histidine and L-tryptophan biosynthesis. We also observe that the dual-substrate phosphoribosyl isomerase A or priA gene, at which these pathways converge, appears to coevolve with the occurrence of trp and his genes. Characterization of a dozen PriA homologs shows that these enzymes adapt from bifunctionality in the largest genomes, to amore » monofunctional, yet not necessarily specialized, inefficient form in genomes undergoing reduction. These functional changes are accomplished via mutations, which result from relaxation of purifying selection, in residues structurally mapped after sequence and X-ray structural analyses. These results show how gene loss can drive the evolution of substrate specificity from retained enzymes.« less
Complete mitochondrial genome of the giant African snail, Achatina fulica (Mollusca: Achatinidae): a novel location of putative control regions (CR) in the mitogenome within Pulmonate species.

PubMed

He, Zhang-Ping; Dai, Xia-Bin; Zhang, Shuai; Zhi, Ting-Ting; Lun, Zhao-Rong; Wu, Zhong-Dao; Yang, Ting-Bao

2016-01-01

The whole sequence (15,057 bp) of the mitochondrial DNA (mtDNA) of the terrestrial snail Achatina fulica (order Stylommatophora) was determined. The mitogenome, as the typical metazoan mtDNA, contains 13 protein-coding genes (PCG), 2 ribosomal RNA genes (rRNA) and 22 transfer RNA genes (tRNA). The tRNA genes include two trnS without standard secondary structure. Interestingly, among the known mitogenomes of Pulmonata species, we firstly characterized an unassigned lengthy sequence (551 bp) between the cox1 and the trnV which may be the CR for the sake of its AT bases usage bias (65.70%) and potential hairpin structure.
Polycomb group protein bodybuilding: working out the routines.

PubMed

Sievers, Cem; Paro, Renato

2013-09-30

Polycomb group (PcG) proteins regulate gene expression by modifying chemical and structural properties of chromatin. Isono et al. (2013) now report in Developmental Cell a polymerization-dependent mechanism used by PcG proteins to form higher-order chromatin structures, referred to as Polycomb bodies, and demonstrate its necessity for gene silencing. Copyright © 2013 Elsevier Inc. All rights reserved.

Hybridization assay of insect antifreezing protein gene by novel multilayered porous silicon nucleic acid biosensor.

PubMed

Lv, Xiaoyi; Chen, Liangliang; Zhang, Hongyan; Mo, Jiaqing; Zhong, Furu; Lv, Changwu; Ma, Ji; Jia, Zhenhong

2013-01-15

A fabrication of a novel simple porous silicon polybasic photonic crystal with symmetrical structure has been reported as a nucleic acid biosensor for detecting antifreeze protein gene in insects (Microdera puntipennis dzhungarica), which would be helpful in the development of some new transgenic plants with tolerance of freezing stress. Compared to various porous silicon-based photonic configurations, porous silicon polytype layered structure is quite easy to prepare and shows more stability; moreover, polybasic photonic crystals with symmetrical structure exhibit interesting optical properties with a sharp resonance in the reflectance spectrum, giving a higher Q factor which causes higher sensitivity for sensing performance. In this experiment, DNA oligonucleotides were immobilized into the porous silicon pores using a standard crosslink chemistry method. The porous silicon polybasic symmetrical structure sensor possesses high specificity in performing controlled experiments with non-complementary DNA. The detection limit was found to be 21.3nM for DNA oligonucleotides. The fabricated multilayered porous silicon-based DNA biosensor has potential commercial applications in clinical chemistry for determination of an antifreeze protein gene or other genes. Copyright © 2012 Elsevier B.V. All rights reserved.
Population genetic structure and gene flow of Adélie penguins (Pygoscelis adeliae) breeding throughout the western Antarctic Peninsula

USGS Publications Warehouse

Gorman, Kristen B.; Talbot, Sandra L.; Sonsthagen, Sarah A.; Sage, George K.; Gravley, Megan C.; Fraser, William R.; Williams, Tony D.

2017-01-01

Adélie penguins (Pygoscelis adeliae) are responding to ocean–climate variability throughout the marine ecosystem of the western Antarctic Peninsula (WAP) where some breeding colonies have declined by 80%. Nuclear and mitochondrial DNA (mtDNA) markers were used to understand historical population genetic structure and gene flow given relatively recent and continuing reductions in sea ice habitats and changes in numbers of breeding adults at colonies throughout the WAP. Genetic diversity, spatial genetic structure, genetic signatures of fluctuations in population demography and gene flow were assessed in four regional Adélie penguin colonies. The analyses indicated little genetic structure overall based on bi-parentally inherited microsatellite markers (FST =-0.006–0.004). No significant variance was observed in overall haplotype frequency (mtDNA ΦST =0.017; P=0.112). Some comparisons with Charcot Island were significant, suggestive of female-biased philopatry. Estimates of gene flow based on a two-population coalescent model were asymmetrical from the species’ regional core to its northern range. Breeding Adélie penguins of the WAP are a panmictic population and hold adequate genetic diversity and dispersal capacity to be resilient to environmental change.
[Radiation biology of structurally different Drosophila genes. Report 2. The vestigial gene: molecular characteristics of chromosome mutations].

PubMed

Afanas'eva, K P; Aleksandrova, M V; Aleksandrov, I D; Korablinova, S V

2012-01-01

The results of the PCR-assay of mutation lesions at each of 16 fragments overlapping the entire vestigial (vg) gene of Drosophila melanogaster in 52 gamma-ray-, neutron- and neutron + gamma-ray-induced vg mutants having the inversion or translocation breakpoint within the vg microregion are presented. 4 from 52 mutants studied were found to have large deletions of about 200 kb covering the entire vg gene and adjacent to sca and l(2)C gene-markers as well. 23 mutants from 48 (47.9%) were found to have a wild-type gene structure showing that the exchange breakpoints are located outside of the vg gene. 25 others display the intragenic lesions of different complexity detected by PCR as the absence of(i) either one fragment or (ii) two or more (6-7) adjacent fragments and (iii) simultaneously several (i) or (i) and (ii) types separated by normal gene regions. It is important that 6 from 25 mutants have the breakpoint inside the vg gene and display the (i) or (ii) type of lesions at the gene regions containing the putative break whereas 5 others from 25 with the above lesions have the exchange breakpoint outside the vg gene. Therefore, the breakpoints underlying either inversions or translocations induced by low- and high-LET radiation are likely to be located within and outside the gene under study. Thereby, the formation of exchanges is accompanied by DNA deletions of various sizes at the exchange breakpoints. The molecular model of formation of such exchange-deletion rearrangements is elaborated and presented. Also, conception of the predominately clustered action of both low- and high-LET radiation on the germ cell genome is suggested as the summing-up of the presented results. The ability of ionizing radiation to induce the clusters of genetic alterations in the form of hidden DNA damages as well as gene/chromosome mutations is determined by the track structure and hierarchical organization of the genome. To detect the quality and frequency patterns of all components of the cluster, joint molecular, genetic and cytological techniques need to be used.
Hormone Replacement Therapy, Iron, and Breast Cancer

DTIC Science & Technology

2004-11-01

accumulates due to the mutation of the HFE gene (hemochromatosis EeJ, iron elevated in the mouse body mimics the post-menopausal condition. In the present...model. Since iron slowly accumulates due to the mutation of the HFE gene (hemochromatosis Fe), iron elevated in the mouse body mimics the post...menopausal condition. Development of iron overloaded transgenic mice: The murine HFE gene is structurally similar to the human gene . Four different HFE gene
The Infection of Cucumber (Cucumis sativus L.) Roots by Meloidogyne incognita Alters the Expression of Actin-Depolymerizing Factor (ADF) Genes, Particularly in Association with Giant Cell Formation

PubMed Central

Liu, Bin; Liu, Xingwang; Liu, Ying; Xue, Shudan; Cai, Yanling; Yang, Sen; Dong, Mingming; Zhang, Yaqi; Liu, Huiling; Zhao, Binyu; Qi, Changhong; Zhu, Ning; Ren, Huazhong

2016-01-01

Cucumber (Cucumis sativus L.) is threatened by substantial yield losses due to the south root-knot nematode (Meloidogyne incognita). However, understanding of the molecular mechanisms underlying the process of nematode infection is still limited. In this study, we found that M. incognita infection affected the structure of cells in cucumber roots and treatment of the cytoskeleton inhibitor (cytochalasin D) reduced root-knot nematode (RKN) parasitism. It is known that Actin-Depolymerizing Factor (ADF) affects cell structure, as well as the organization of the cytoskeleton. To address the hypothesis that nematode-induced abnormal cell structures and cytoskeletal rearrangements might be mediated by the ADF genes, we identified and characterized eight cucumber ADF (CsADF) genes. Phylogenetic analysis showed that the cucumber ADF gene family is grouped into four ancient subclasses. Expression analysis revealed that CsADF1, CsADF2-1, CsADF2-2, CsADF2-3 (Subclass I), and CsADF6 (Subclass III) have higher transcript levels than CsADF7-1, CsADF7-2 (Subclass II genes), and CsADF5 (Subclass IV) in roots. Members of subclass I genes (CsADF1, CsADF2-1, CsADF2-2, and CsADF2-3), with the exception of CsADF2-1, exhibited a induction of expression in roots 14 days after their inoculation (DAI) with nematodes. However, the expression of subclass II genes (CsADF7-1 and CsADF7-2) showed no significant change after inoculation. The transcript levels of CsADF6 (Subclass III) showed a specific induction at 21 DAI, while CsADF5 (Subclass IV) was weakly expressed in roots, but was strongly up-regulated as early as 7 DAI. In addition, treatment of roots with cytochalasin D caused an approximately 2-fold down-regulation of the CsADF genes in the treated plants. These results suggest that CsADF gene mediated actin dynamics are associated with structural changes in roots as a consequence of M. incognita infection. PMID:27695469
Dramatic Increases of Soil Microbial Functional Gene Diversity at the Treeline Ecotone of Changbai Mountain.

PubMed

Shen, Congcong; Shi, Yu; Ni, Yingying; Deng, Ye; Van Nostrand, Joy D; He, Zhili; Zhou, Jizhong; Chu, Haiyan

2016-01-01

The elevational and latitudinal diversity patterns of microbial taxa have attracted great attention in the past decade. Recently, the distribution of functional attributes has been in the spotlight. Here, we report a study profiling soil microbial communities along an elevation gradient (500-2200 m) on Changbai Mountain. Using a comprehensive functional gene microarray (GeoChip 5.0), we found that microbial functional gene richness exhibited a dramatic increase at the treeline ecotone, but the bacterial taxonomic and phylogenetic diversity based on 16S rRNA gene sequencing did not exhibit such a similar trend. However, the β-diversity (compositional dissimilarity among sites) pattern for both bacterial taxa and functional genes was similar, showing significant elevational distance-decay patterns which presented increased dissimilarity with elevation. The bacterial taxonomic diversity/structure was strongly influenced by soil pH, while the functional gene diversity/structure was significantly correlated with soil dissolved organic carbon (DOC). This finding highlights that soil DOC may be a good predictor in determining the elevational distribution of microbial functional genes. The finding of significant shifts in functional gene diversity at the treeline ecotone could also provide valuable information for predicting the responses of microbial functions to climate change.
Cloning and characterization of WRKY gene homologs in Chieh-qua (Benincasa hispida Cogn. var. Chieh-qua How) and their expression in response to fusaric acid treatment.

PubMed

Mao, Yizhou; Jiang, Biao; Peng, Qingwu; Liu, Wenrui; Lin, Yue; Xie, Dasen; He, Xiaoming; Li, Shaoshan

2017-05-01

The WRKY transcription factors play an important role in plant resistance for biotic and abiotic stresses. In the present study, we cloned 10 WRKY gene homologs (CqWRKY) in Chieh-qua (Benincasa hispida Cogn. var. Chieh-qua) using the rapid-amplification of cDNA ends (RACE) or homology-based cloning methods. We characterized the structure of these CqWRKY genes. Phylogenetic analysis of these sequences with cucumber homologs suggested possible structural conservation of these genes among cucurbit crops. We examined the expression levels of these genes in response to fusaric acid (FA) treatment between resistant and susceptible Chieh-qua lines with quantitative real-time PCR. All genes could be upregulated upon FA treatment, but four CqWRKY genes exhibited differential expression between resistant and susceptible lines before and after FA application. CqWRKY31 seemed to be a positive regulator while CqWRKY1, CqWRKY23 and CqWRKY53 were negative regulators of fusaric resistance. This is the first report of characterization of WRKY family genes in Chieh-qua. The results may also be useful in breeding Chieh-qua for Fusarium wilt resistance.
Structural and functional annotation of the porcine immunome

PubMed Central

2013-01-01

Background The domestic pig is known as an excellent model for human immunology and the two species share many pathogens. Susceptibility to infectious disease is one of the major constraints on swine performance, yet the structure and function of genes comprising the pig immunome are not well-characterized. The completion of the pig genome provides the opportunity to annotate the pig immunome, and compare and contrast pig and human immune systems. Results The Immune Response Annotation Group (IRAG) used computational curation and manual annotation of the swine genome assembly 10.2 (Sscrofa10.2) to refine the currently available automated annotation of 1,369 immunity-related genes through sequence-based comparison to genes in other species. Within these genes, we annotated 3,472 transcripts. Annotation provided evidence for gene expansions in several immune response families, and identified artiodactyl-specific expansions in the cathelicidin and type 1 Interferon families. We found gene duplications for 18 genes, including 13 immune response genes and five non-immune response genes discovered in the annotation process. Manual annotation provided evidence for many new alternative splice variants and 8 gene duplications. Over 1,100 transcripts without porcine sequence evidence were detected using cross-species annotation. We used a functional approach to discover and accurately annotate porcine immune response genes. A co-expression clustering analysis of transcriptomic data from selected experimental infections or immune stimulations of blood, macrophages or lymph nodes identified a large cluster of genes that exhibited a correlated positive response upon infection across multiple pathogens or immune stimuli. Interestingly, this gene cluster (cluster 4) is enriched for known general human immune response genes, yet contains many un-annotated porcine genes. A phylogenetic analysis of the encoded proteins of cluster 4 genes showed that 15% exhibited an accelerated evolution as compared to 4.1% across the entire genome. Conclusions This extensive annotation dramatically extends the genome-based knowledge of the molecular genetics and structure of a major portion of the porcine immunome. Our complementary functional approach using co-expression during immune response has provided new putative immune response annotation for over 500 porcine genes. Our phylogenetic analysis of this core immunome cluster confirms rapid evolutionary change in this set of genes, and that, as in other species, such genes are important components of the pig’s adaptation to pathogen challenge over evolutionary time. These comprehensive and integrated analyses increase the value of the porcine genome sequence and provide important tools for global analyses and data-mining of the porcine immune response. PMID:23676093
[Quantitative changes of main components of erythrocyte membranes which define architectonics of cells under pttg gene knockout].

PubMed

Kaniuka, O P; Filiak, Ie Z; Kulachkovs'kyĭ, O R; Osyp, Iu L; Sybirna, N O

2014-01-01

A pttg gene knockout affects the functional state of erythron in mice which could be associated with structural changes in the structure of erythrocyte membranes. The pttg gene knockout causes a significant modification of fatty acids composition of erythrocyte membrane lipids by reducing the content of palmitic acid and increasing of polyunsaturated fatty acids amount by 18%. Analyzing the erythrocyte surface architectonics of mice under pttg gene knockout, it was found that on the background of reduction of the functionally complete biconcave discs population one could observe an increase of the number of transformed cells at different degeneration stages. Researches have shown that in mice with a pttg gene knockout compared with a control group of animals cytoskeletal protein--beta-spectrin was reduced by 17.03%. However, there is a reduction of membrane protein band 3 by 33.04%, simultaneously the content of anion transport protein band 4.5 increases by 35.2% and protein band 4.2 by 32.1%. The lectin blot analysis has helped to reveal changes in the structure of the carbohydrate determinants of erythrocyte membrane glycoproteins under conditions of directed pttg gene inactivation, accompanied by changes in the type of communication, which joins the terminal residue in carbohydrate determinant of glycoproteins. Thus, a significant redistribution of protein and fatty acids contents in erythrocyte membranes that manifested in the increase of the deformed shape of red blood cells is observed underpttg gene knockout.
β-Globin locus control region HS2 and HS3 interact structurally and functionally

PubMed Central

Jackson, David A.; McDowell, Jennifer C.; Dean, Ann

2003-01-01

The overall structure of the DNase I hypersensitive sites (HSs) that comprise the β-globin locus control region (LCR) is highly conserved among mammals, implying that the HSs have conserved functions. However, it is not well understood how the LCR HSs, either individually or collectively, activate transcription. We analyzed the interactions of HS2, HS3 and HS4 with the human ε- and β-globin genes in chromatinized episomes in fetal/embryonic K562 cells. Only HS2 activates transcription of the ε-globin gene, while all three HSs activate the β-globin gene. HS3 stimulates the β-globin gene constitutively, but HS2 and HS4 transactivation requires expression of the transcription factor EKLF, which is not present in K562 cells but is required for β-globin expression in vivo. To begin addressing how the individual HSs may interact with one another in a complex, we linked the β-globin gene to both the HS2 and HS3. HS2 and HS3 together resulted in synergistic stimulation of β-globin transcription. Unexpectedly, mutated, inactive forms of HS2 impeded the activation of the β-globin gene by HS3. Thus, there appear to be distinct interactions among the HSs and between the HSs and the globin genes. These preferential, non-exclusive interactions may underlie an important structural and functional cooperativity among the regulatory sequences of the β-globin locus in vivo. PMID:12582237
The top skin-associated genes: a comparative analysis of human and mouse skin transcriptomes.

PubMed

Gerber, Peter Arne; Buhren, Bettina Alexandra; Schrumpf, Holger; Homey, Bernhard; Zlotnik, Albert; Hevezi, Peter

2014-06-01

The mouse represents a key model system for the study of the physiology and biochemistry of skin. Comparison of skin between mouse and human is critical for interpretation and application of data from mouse experiments to human disease. Here, we review the current knowledge on structure and immunology of mouse and human skin. Moreover, we present a systematic comparison of human and mouse skin transcriptomes. To this end, we have recently used a genome-wide database of human gene expression to identify genes highly expressed in skin, with no, or limited expression elsewhere - human skin-associated genes (hSAGs). Analysis of our set of hSAGs allowed us to generate a comprehensive molecular characterization of healthy human skin. Here, we used a similar database to generate a list of mouse skin-associated genes (mSAGs). A comparative analysis between the top human (n=666) and mouse (n=873) skin-associated genes (SAGs) revealed a total of only 30.2% identity between the two lists. The majority of shared genes encode proteins that participate in structural and barrier functions. Analysis of the top functional annotation terms revealed an overlap for morphogenesis, cell adhesion, structure, and signal transduction. The results of this analysis, discussed in the context of published data, illustrate the diversity between the molecular make up of skin of both species and grants a probable explanation, why results generated in murine in vivo models often fail to translate into the human.
Structure of a gene encoding a murine thymus leukemia antigen, and organization of Tla genes in the BALB/c mouse

PubMed Central

1985-01-01

We have determined the DNA sequence of a gene encoding a thymus leukemia (TL) antigen in the BALB/c mouse, and have more definitively mapped the cloned BALB/c Tla-region class I gene clusters. Analysis of the sequence shows that the Tla gene is less closely related to the H-2 genes than H-2 genes are to one another or to a Qa-2,3-region genes. The Tla gene, 17.3A, contains an apparent gene conversion. Comparison of the BALB/c Tla genes with those from C57BL shows that BALB/c has more Tla-region class I genes, and that one of the genes absent in C57BL is gene 17.3A. PMID:3894562
Isolation and characterization of major histocompatibility complex class IIB genes from the nurse shark.

PubMed

Bartl, S; Weissman, I L

1994-01-04

The major histocompatibility complex (MHC) contains a set of linked genes which encode cell surface proteins involved in the binding of small peptide antigens for their subsequent recognition by T lymphocytes. MHC proteins share structural features and the presence and location of polymorphic residues which play a role in the binding of antigens. In order to compare the structure of these molecules and gain insights into their evolution, we have isolated two MHC class IIB genes from the nurse shark, Ginglymostoma cirratum. Two clones, most probably alleles, encode proteins which differ by 13 amino acids located in the putative antigen-binding cleft. The protein structure and the location of polymorphic residues are similar to their mammalian counterparts. Although these genes appear to encode a typical MHC protein, no T-cell-mediated responses have been demonstrated in cartilaginous fish. The nurse shark represents the most phylogenetically primitive organism in which both class IIA [Kasahara, M., Vazquez, M., Sato, K., McKinney, E.C. & Flajnik, M.F. (1992) Proc. Natl. Acad. Sci USA 89, 6688-6692] and class IIB genes, presumably encoding the alpha/beta heterodimer, have been isolated.
Analysis of Craniocardiac Malformations in Xenopus using Optical Coherence Tomography

PubMed Central

Deniz, Engin; Jonas, Stephan; Hooper, Michael; N. Griffin, John; Choma, Michael A.; Khokha, Mustafa K.

2017-01-01

Birth defects affect 3% of children in the United States. Among the birth defects, congenital heart disease and craniofacial malformations are major causes of mortality and morbidity. Unfortunately, the genetic mechanisms underlying craniocardiac malformations remain largely uncharacterized. To address this, human genomic studies are identifying sequence variations in patients, resulting in numerous candidate genes. However, the molecular mechanisms of pathogenesis for most candidate genes are unknown. Therefore, there is a need for functional analyses in rapid and efficient animal models of human disease. Here, we coupled the frog Xenopus tropicalis with Optical Coherence Tomography (OCT) to create a fast and efficient system for testing craniocardiac candidate genes. OCT can image cross-sections of microscopic structures in vivo at resolutions approaching histology. Here, we identify optimal OCT imaging planes to visualize and quantitate Xenopus heart and facial structures establishing normative data. Next we evaluate known human congenital heart diseases: cardiomyopathy and heterotaxy. Finally, we examine craniofacial defects by a known human teratogen, cyclopamine. We recapitulate human phenotypes readily and quantify the functional and structural defects. Using this approach, we can quickly test human craniocardiac candidate genes for phenocopy as a critical first step towards understanding disease mechanisms of the candidate genes. PMID:28195132
Impact of nitrogen concentration on validamycin A production and related gene transcription in fermentation of Streptomyces hygroscopicus 5008.

PubMed

Wei, Zhen-Hua; Bai, Linquan; Deng, Zixin; Zhong, Jian-Jiang

2012-09-01

Validamycin A (VAL-A) is an important and widely used agricultural antibiotic. In this study, statistical screening designs were applied to identify significant medium variables for VAL-A production and to find their optimal levels. The optimized medium caused 70% enhancement of VAL-A production. The difference between optimized medium and original medium suggested that low nitrogen source level might attribute to the enhancement of VAL-A production. The addition of different nitrogen sources to the optimized medium inhibited VAL-A production, which confirmed the importance of nitrogen concentration for VAL-A production. Furthermore, differences in structural gene transcription and enzyme activity between the two media were assayed. The results showed that lower nitrogen level in the optimized medium could regulate VAL-A production in gene transcriptional level. Our previous study indicated that the transcription of VAL-A structural genes could be enhanced at elevated temperature. In this work, the increased fermentation temperature from 37 to 42 °C with the optimized medium enhanced VAL-A production by 39%, which testified to the importance of structural gene transcription in VAL-A production. The information is useful for further VAL-A production enhancement.
Interfacing cellular networks of S. cerevisiae and E. coli: Connecting dynamic and genetic information

PubMed Central

2013-01-01

Background In recent years, various types of cellular networks have penetrated biology and are nowadays used omnipresently for studying eukaryote and prokaryote organisms. Still, the relation and the biological overlap among phenomenological and inferential gene networks, e.g., between the protein interaction network and the gene regulatory network inferred from large-scale transcriptomic data, is largely unexplored. Results We provide in this study an in-depth analysis of the structural, functional and chromosomal relationship between a protein-protein network, a transcriptional regulatory network and an inferred gene regulatory network, for S. cerevisiae and E. coli. Further, we study global and local aspects of these networks and their biological information overlap by comparing, e.g., the functional co-occurrence of Gene Ontology terms by exploiting the available interaction structure among the genes. Conclusions Although the individual networks represent different levels of cellular interactions with global structural and functional dissimilarities, we observe crucial functions of their network interfaces for the assembly of protein complexes, proteolysis, transcription, translation, metabolic and regulatory interactions. Overall, our results shed light on the integrability of these networks and their interfacing biological processes. PMID:23663484
The genomic structure of the human Charcot-Leyden crystal protein gene is analogous to those of the galectin genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.

The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of {beta}-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire {beta}-galactoside bindingmore » site encoded by exon III. We have isolated CLC {beta}-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the {beta}-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.« less
One-Pot Parallel Synthesis of Lipid Library via Thiolactone Ring Opening and Screening for Gene Delivery.

PubMed

Molla, Mijanur R; Böser, Alexander; Rana, Akshita; Schwarz, Karina; Levkin, Pavel A

2018-04-18

Efficient delivery of nucleic acids into cells is of great interest in the field of cell biology and gene therapy. Despite a lot of research, transfection efficiency and structural diversity of gene-delivery vectors are still limited. A better understanding of the structure-function relationship of gene delivery vectors is also essential for the design of novel and intelligent delivery vectors, efficient in "difficult-to-transfect" cells and in vivo clinical applications. Most of the existing strategies for the synthesis of gene-delivery vectors require multiple steps and lengthy procedures. Here, we demonstrate a facile, three-component one-pot synthesis of a combinatorial library of 288 structurally diverse lipid-like molecules termed "lipidoids" via a thiolactone ring opening reaction. This strategy introduces the possibility to synthesize lipidoids with hydrophobic tails containing both unsaturated bonds and reducible disulfide groups. The whole synthesis and purification are convenient, extremely fast, and can be accomplished within a few hours. Screening of the produced lipidoids using HEK293T cells without addition of helper lipids resulted in identification of highly stable liposomes demonstrating ∼95% transfection efficiency with low toxicity.
The gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis contains a group I intron.

PubMed Central

De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y

1992-01-01

The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081
Protein classification using probabilistic chain graphs and the Gene Ontology structure.

PubMed

Carroll, Steven; Pavlovic, Vladimir

2006-08-01

Probabilistic graphical models have been developed in the past for the task of protein classification. In many cases, classifications obtained from the Gene Ontology have been used to validate these models. In this work we directly incorporate the structure of the Gene Ontology into the graphical representation for protein classification. We present a method in which each protein is represented by a replicate of the Gene Ontology structure, effectively modeling each protein in its own 'annotation space'. Proteins are also connected to one another according to different measures of functional similarity, after which belief propagation is run to make predictions at all ontology terms. The proposed method was evaluated on a set of 4879 proteins from the Saccharomyces Genome Database whose interactions were also recorded in the GRID project. Results indicate that direct utilization of the Gene Ontology improves predictive ability, outperforming traditional models that do not take advantage of dependencies among functional terms. Average increase in accuracy (precision) of positive and negative term predictions of 27.8% (2.0%) over three different similarity measures and three subontologies was observed. C/C++/Perl implementation is available from authors upon request.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.