Sample records for conserved gene clusters

  1. Finding approximate gene clusters with Gecko 3.

    PubMed

    Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian

    2016-11-16

    Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. A cross-species bi-clustering approach to identifying conserved co-regulated genes.

    PubMed

    Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

    2016-06-15

    A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regulated genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach copes poorly with the noise in the expression data introduced by the complicated procedures used to quantify gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, making it difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages in which the clustered genes are co-regulated. An efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on synthetic data and compared to the two-step method and several recent joint clustering methods. We then applied this approach to two real world datasets of gene expression during the pre-implantation embryonic development of the human and mouse. Co-regulated genes consistent between the human and mouse were identified, offering insights into conserved functions, as well as similarities and differences in genome activation timing between the human and mouse embryos. The R package containing the implementation of the proposed method in C++ is available at: https://github.com/JavonSun/mvbc.git and also at the R platform https://www.r-project.org/. Contact: jinbo@engr.uconn.edu. © The Author 2016. Published by Oxford University Press.
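    The joint sparse rank-one factorization named in record 2 can be illustrated with a small sketch. This is not the authors' algorithm or the mvbc package API: it is a simplified alternating scheme with soft-thresholding, and the function names, the penalty `lam`, and the toy matrices are all assumptions made for illustration. Each species contributes a genes-by-stages matrix; one gene vector `u` is shared across species, while each species gets its own stage vector.

```python
import math


def soft_threshold(x, t):
    """Shrink x toward zero by t; values within [-t, t] become exactly 0."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def joint_sparse_rank_one(matrices, lam=0.5, iters=50):
    """Approximate every X_s by u * v_s^T with a single sparse u (hypothetical helper).

    matrices: list of genes-by-stages matrices (lists of rows), same gene order.
    Returns (u, vs): shared gene weights and one stage profile per species.
    """
    n_genes = len(matrices[0])
    u = [1.0] * n_genes
    vs = [[0.0] * len(X[0]) for X in matrices]
    for _ in range(iters):
        # Stage profiles: project each species' matrix onto the current u.
        raw = [[sum(X[i][j] * u[i] for i in range(n_genes))
                for j in range(len(X[0]))] for X in matrices]
        # Normalize all profiles jointly so the scale lives in u only.
        norm = math.sqrt(sum(val * val for v in raw for val in v))
        if norm == 0.0:
            break
        vs = [[val / norm for val in v] for v in raw]
        # Gene weights: summed agreement with every species' profile,
        # soft-thresholded so weakly supported genes drop to exactly zero.
        u = [soft_threshold(sum(X[i][j] * v[j]
                                for X, v in zip(matrices, vs)
                                for j in range(len(v))), lam / 2.0)
             for i in range(n_genes)]
    return u, vs


# Toy data (made up): genes 0 and 1 share a strong pattern in both species,
# genes 2 and 3 carry only small noise.
human = [[2.0, 2.0, 0.0], [2.0, 2.0, 0.0], [0.1, 0.0, 0.0], [0.0, 0.1, 0.0]]
mouse = [[0.0, 3.0], [0.0, 3.0], [0.0, 0.05], [0.05, 0.0]]

u, vs = joint_sparse_rank_one([human, mouse])
selected = [i for i, w in enumerate(u) if w != 0.0]
print(selected)
```

    Genes with a nonzero weight in `u` form the cross-species cluster, and the larger entries of each stage vector mark the stages where that cluster is active in the corresponding species.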

  3. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, functional regions in noncoding DNA can be identified by comparing DNA sequences among evolutionarily distantly related genomes. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either Aα or Aβ clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  4. Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

    2004-08-06

    The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function--such as binding-site clustering--makes better use of comparative sequence data than commonly used methods that examine only sequence identity.

  5. A conserved gene cluster as a putative functional unit in insect innate immunity.

    PubMed

    Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Andó, István

    2010-11-05

    The Nimrod gene superfamily is an important component of the innate immune response. The majority of its member genes are located in close proximity within the Drosophila melanogaster genome and they lie in a larger conserved cluster ("Nimrod cluster"), made up of non-related groups (families, superfamilies) of genes. This cluster has been a part of the Arthropod genomes for about 300-350 million years. The available data suggest that the Nimrod cluster is a functional module of the insect innate immune response. Copyright © 2010 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  6. Ancient genomic architecture for mammalian olfactory receptor clusters

    PubMed Central

    Aloni, Ronny; Olender, Tsviya; Lancet, Doron

    2006-01-01

    Background Mammalian olfactory receptor (OR) genes reside in numerous genomic clusters of up to several dozen genes. Whole-genome sequence alignment nets of five mammals allow their comprehensive comparison, aimed at reconstructing the ancestral olfactory subgenome. Results We developed a new and general tool for genome-wide definition of genomic gene clusters conserved in multiple species. Syntenic orthologs, defined as gene pairs showing conservation of both genomic location and coding sequence, were subjected to a graph theory algorithm for discovering CLICs (clusters in conservation). When applied to ORs in five mammals, including the marsupial opossum, more than 90% of the OR genes were found within a framework of 48 multi-species CLICs, invoking a general conservation of gene order and composition. A detailed analysis of individual CLICs revealed multiple differences among species, interpretable through species-specific genomic rearrangements and reflecting complex mammalian evolutionary dynamics. One significant instance involves CLIC #1, which lacks a human member, implying the human-specific deletion of an OR cluster, whose mouse counterpart has been tentatively associated with isovaleric acid odorant detection. Conclusion The identified multi-species CLICs demonstrate that most of the mammalian OR clusters have a common ancestry, preceding the split between marsupials and placental mammals. However, only two of these CLICs were capable of incorporating chicken OR genes, parsimoniously implying that all other CLICs emerged subsequent to the avian-mammalian divergence. PMID:17010214
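    Record 6 describes feeding syntenic ortholog pairs to a graph algorithm to obtain multi-species clusters, without specifying the algorithm. One simple stand-in, shown here purely as an assumption, is to treat each ortholog pair as an edge and report the connected components. The function name and gene identifiers below are hypothetical and are not taken from the CLIC tool.

```python
from collections import defaultdict


def conserved_clusters(ortholog_pairs):
    """Group genes linked by syntenic-ortholog pairs into connected components.

    ortholog_pairs: iterable of (gene_a, gene_b) tuples, one per ortholog pair.
    Returns a list of sorted gene lists, ordered by their first member.
    """
    parent = {}

    def find(gene):
        # Union-find lookup with path halving.
        parent.setdefault(gene, gene)
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]
            gene = parent[gene]
        return gene

    for a, b in ortholog_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b  # merge the two components

    groups = defaultdict(set)
    for gene in list(parent):
        groups[find(gene)].add(gene)
    return sorted((sorted(members) for members in groups.values()),
                  key=lambda members: members[0])


# Made-up ortholog pairs across three species.
pairs = [
    ("human:OR_a", "mouse:OR_a"),
    ("mouse:OR_a", "opossum:OR_a"),
    ("human:OR_b", "mouse:OR_b"),
]
clusters = conserved_clusters(pairs)
print(clusters)
```

    A component that lacks a member from one species, like the second group here with no opossum gene, is the kind of pattern the abstract interprets as a lineage-specific gain or loss.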

  7. Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

    2004-08-06

    Background The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. Results We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Conclusions Measuring conservation of sequence features closely linked to function - such as binding-site clustering - makes better use of comparative sequence data than commonly used methods that examine only sequence identity.

  8. Integrative analyses of conserved WNT clusters and their co-operative behaviour in human breast cancer

    PubMed Central

    Qurrat-ul-Ain; Seemab, Umair; Nawaz, Sulaman; Rashid, Sajid

    2011-01-01

    In humans, WNT gene clusters are highly conserved at the species level and associated with carcinogenesis. Among them, the WNT-10A and WNT-6 genes clustered in chromosome 2q35 are homologous to WNT-10B and WNT-1 located in chromosome 12q13, respectively. In an attempt to study co-regulation, the coordinated expression of these genes was monitored in human breast cancer tissues. As compared to normal tissue, both WNT-10A and WNT-10B genes exhibited lower expression while WNT-6 and WNT-1 showed increased expression in breast cancer tissues. The co-expression pattern was elaborated by detailed phylogenetic and syntenic analyses. Moreover, the intergenic and intragenic regions of these gene clusters were analyzed to study their transcriptional regulation. In this context, adequate conserved binding sites for the SOX and TCF families of transcription factors were observed. We propose that SOX9 and TCF4 may compete for binding at the promoters of WNT family genes, thus regulating the disease phenotype. PMID:22355234

  9. Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in xanthomonas.

    PubMed

    Lu, Hong; Patil, Prabhu; Van Sluys, Marie-Anne; White, Frank F; Ryan, Robert P; Dow, J Maxwell; Rabinowicz, Pablo; Salzberg, Steven L; Leach, Jan E; Sonti, Ramesh; Brendel, Volker; Bogdanove, Adam J

    2008-01-01

    Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors) cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. 
Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale exchange of clusters, but subtle changes in a small number of genes or in non-coding sequences, and/or differences outside the clusters, potentially among regulatory targets or secretory substrates.

  10. Comparative genomics of ParaHox clusters of teleost fishes: gene cluster breakup and the retention of gene sets following whole genome duplications

    PubMed Central

    Siegel, Nicol; Hoegg, Simone; Salzburger, Walter; Braasch, Ingo; Meyer, Axel

    2007-01-01

    Background The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that, despite their additional genome duplication, extant teleost fishes do not differ from mammals in the number of ParaHox genes, but these genes are distributed over twice as many paralogous regions in fish genomes. Results We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seem to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motifs of the genes belonging to the ParaHox paralogons. Conclusion There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. 
The high degree of evolutionary conservation of this gene cluster's architecture in particular – but possibly clusters of genes more generally – might be linked to the presence of promoter, enhancer or inhibitor motifs that serve to regulate more than just one gene. Therefore, deletions, inversions or relocations of individual genes could destroy the regulation of the clustered genes in this region. The existence of such a regulation network might explain the evolutionary conservation of gene order and orientation over the course of hundreds of millions of years of vertebrate evolution. Another possible explanation for the highly conserved gene order might be the existence of a regulator not located immediately next to its corresponding gene but further away since a relocation or inversion would possibly interrupt this interaction. Different ParaHox clusters were found to have experienced differential gene loss in teleosts. Yet the complete set of these homeobox genes was maintained, albeit distributed over almost twice the number of chromosomes. Selection due to dosage effects and/or stoichiometric disturbance might act more strongly to maintain a modal number of homeobox genes (and possibly transcription factors more generally) per genome, yet permit the accumulation of other (non regulatory) genes associated with these homeobox gene clusters. PMID:17822543

  11. The drug target genes show higher evolutionary conservation than non-target genes.

    PubMed

    Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

    2016-01-26

    Although evidence indicates that drug target genes share some common evolutionary features, few studies have analyzed the evolutionary features of drug targets as a whole. We therefore investigated the evolutionary characteristics of drug target genes. We compared the evolutionary conservation of human drug target genes and non-target genes by combining evolutionary features with network topological properties in the human protein-protein interaction network. The evolutionary rate, conservation score and percentage of orthologous genes in 21 species were included in our study. In addition, four topological features, namely the average shortest path length, betweenness centrality, clustering coefficient and degree, were compared. We obtained four results. Compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure, including higher degrees, betweenness centrality and clustering coefficients, and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.
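    Three of the four topological features listed in record 11 (degree, clustering coefficient and average shortest path length) are easy to compute on a small graph; betweenness centrality is omitted here for brevity. The sketch below uses an invented four-protein interaction network and hypothetical function names. It illustrates the standard definitions only and makes no claim about the software the authors used.

```python
from collections import deque
from itertools import combinations


def degree(adj, node):
    """Number of direct interaction partners of a node."""
    return len(adj[node])


def clustering_coefficient(adj, node):
    """Fraction of neighbor pairs of `node` that also interact with each other."""
    neighbors = adj[node]
    k = len(neighbors)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(sorted(neighbors), 2) if b in adj[a])
    return 2.0 * links / (k * (k - 1))


def average_shortest_path_length(adj, source):
    """Mean breadth-first-search distance from `source` to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        current = queue.popleft()
        for nxt in adj[current]:
            if nxt not in dist:
                dist[nxt] = dist[current] + 1
                queue.append(nxt)
    others = [d for node, d in dist.items() if node != source]
    return sum(others) / len(others) if others else 0.0


# Toy undirected network (made up): a triangle A-B-C with D attached to C.
network = {
    "A": {"B", "C"},
    "B": {"A", "C"},
    "C": {"A", "B", "D"},
    "D": {"C"},
}

for protein in sorted(network):
    print(protein,
          degree(network, protein),
          round(clustering_coefficient(network, protein), 3),
          round(average_shortest_path_length(network, protein), 3))
```

    In this toy network, node C has the highest degree and the shortest average path to the other nodes, which is the profile the abstract reports for drug target genes.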

  12. Many nonuniversal archaeal ribosomal proteins are found in conserved gene clusters

    PubMed Central

    Wang, Jiachen; Dasgupta, Indrani; Fox, George E.

    2009-01-01

    The genomic associations of the archaeal ribosomal proteins (r-proteins) were examined in detail. The archaeal versions of the universal r-protein genes are typically in clusters similar or identical to those found in bacteria. Of the 35 nonuniversal archaeal r-protein genes examined, the gene encoding L18e was found to be associated with the conserved L13 cluster, whereas the genes for S4e, L32e and L19e were found in the archaeal version of the spc operon. Eleven nonuniversal protein genes were not associated with any common genomic context. Of the remaining 19 protein genes, 17 were convincingly assigned to one of 10 previously unrecognized gene clusters. Examination of the gene content of these clusters revealed multiple associations with genes involved in the initiation of protein synthesis, transcription or other cellular processes. The lack of such associations in the universal clusters suggests that initially the ribosome evolved largely independently of other processes. More recently it likely has evolved in concert with other cellular systems. It was also verified that a second copy of the gene encoding L7ae found in some bacteria is actually a homolog of the gene encoding L30e and should be annotated as such. PMID:19478915

  13. Evolution of coding and non-coding genes in HOX clusters of a marsupial.

    PubMed

    Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

    2012-06-18

    The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.

  14. Evolution of coding and non-coding genes in HOX clusters of a marsupial

    PubMed Central

    2012-01-01

    Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672

  15. Molecular evolution of the HoxA cluster in the three major gnathostome lineages

    PubMed Central

    Chiu, Chi-hua; Amemiya, Chris; Dewar, Ken; Kim, Chang-Bae; Ruddle, Frank H.; Wagner, Günter P.

    2002-01-01

    The duplication of Hox clusters and their maintenance in a lineage has a prominent but little understood role in chordate evolution. Here we examined how Hox cluster duplication may influence changes in cluster architecture and patterns of noncoding sequence evolution. We sequenced the entire duplicated HoxAa and HoxAb clusters of zebrafish (Danio rerio) and extended the 5′ (posterior) part of the HoxM (HoxA-like) cluster of horn shark (Heterodontus francisci) containing the hoxa11 and hoxa13 orthologs as well as intergenic and flanking noncoding sequences. The duplicated HoxA clusters in zebrafish each house considerably fewer genes and are dramatically shorter than the single HoxA clusters of human and horn shark. We compared the intergenic sequences of the HoxA clusters of human, horn shark, zebrafish (Aa, Ab), and striped bass and found extensive conservation of noncoding sequence motifs, i.e., phylogenetic footprints, between the human and horn shark, representing two of the three gnathostome lineages. These are putative cis-regulatory elements that may play a role in the regulation of the ancestral HoxA cluster. In contrast, homologous regions of the duplicated HoxAa and HoxAb clusters of zebrafish and the HoxA cluster of striped bass revealed a striking loss of conservation of these putative cis-regulatory sequences in the 3′ (anterior) segment of the cluster, where zebrafish only retains single representatives of group 1, 3, 4, and 5 (HoxAa) and group 2 (HoxAb) genes and in the 5′ part of the clusters, where zebrafish retains two copies of the group 13, 11, and 9 genes, i.e., AbdB-like genes. In analyzing patterns of cis-sequence evolution in the 5′ part of the clusters, we explicitly looked for evidence of complementary loss of conserved noncoding sequences, as predicted by the duplication-degeneration-complementation model in which genetic redundancy after gene duplication is resolved because of the fixation of complementary degenerative mutations. 
Our data did not yield evidence supporting this prediction. We conclude that changes in the pattern of cis-sequence conservation after Hox cluster duplication are more consistent with being the outcome of adaptive modification rather than passive mechanisms that erode redundancy created by the duplication event. These results support the view that genome duplications may provide a mechanism whereby master control genes undergo radical modifications conducive to major alterations in body plan. Such genomic revolutions may contribute significantly to the evolutionary process. PMID:11943847

  16. Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex

    PubMed Central

    Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo

    2005-01-01

    Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430

  17. Functional Organization of hsp70 Cluster in Camel (Camelus dromedarius) and Other Mammals

    PubMed Central

    Garbuz, David G.; Astakhova, Lubov N.; Zatsepina, Olga G.; Arkhipova, Irina R.; Nudler, Eugene; Evgen'ev, Michael B.

    2011-01-01

    Heat shock protein 70 (Hsp70) is a molecular chaperone providing tolerance to heat and other challenges at the cellular and organismal levels. We sequenced a genomic cluster containing three hsp70 family genes linked with major histocompatibility complex (MHC) class III region from an extremely heat tolerant animal, camel (Camelus dromedarius). Two hsp70 family genes comprising the cluster contain heat shock elements (HSEs), while the third gene lacks HSEs and should not be induced by heat shock. Comparison of the camel hsp70 cluster with the corresponding regions from several mammalian species revealed similar organization of genes forming the cluster. Specifically, the two heat inducible hsp70 genes are arranged in tandem, while the third constitutively expressed hsp70 family member is present in inverted orientation. Comparison of regulatory regions of hsp70 genes from camel and other mammals demonstrates that transcription factor matches with highest significance are located in the highly conserved 250-bp upstream region and correspond to HSEs followed by NF-Y and Sp1 binding sites. The high degree of sequence conservation leaves little room for putative camel-specific regulatory elements. Surprisingly, RT-PCR and 5′/3′-RACE analysis demonstrated that all three hsp70 genes are expressed in camel's muscle and blood cells not only after heat shock, but under normal physiological conditions as well, and may account for tolerance of camel cells to extreme environmental conditions. A high degree of evolutionary conservation observed for the hsp70 cluster always linked with MHC locus in mammals suggests an important role of such organization for coordinated functioning of these vital genes. PMID:22096537

  18. Identifying conserved gene clusters in the presence of homology families.

    PubMed

    He, Xin; Goldwasser, Michael H

    2005-01-01

    The study of conserved gene clusters is important for understanding the forces behind genome organization and evolution, as well as the function of individual genes or gene groups. In this paper, we present a new model and algorithm for identifying conserved gene clusters from pairwise genome comparison. This generalizes a recent model called "gene teams." A gene team is a set of genes that appear homologously in two or more species, possibly in a different order yet with the distance of adjacent genes in the team for each chromosome always no more than a certain threshold. We remove the constraint in the original model that each gene must have a unique occurrence in each chromosome and thus allow the analysis on complex prokaryotic or eukaryotic genomes with extensive paralogs. Our algorithm analyzes a pair of chromosomes in O(mn) time and uses O(m+n) space, where m and n are the number of genes in the respective chromosomes. We demonstrate the utility of our methods by studying two bacterial genomes, E. coli K-12 and B. subtilis. Many of the teams identified by our algorithm correlate with documented E. coli operons, while several others match predicted operons, previously suggested by computational techniques. Our implementation and data are publicly available at euler.slu.edu/~goldwasser/homologyteams/.
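
    The gene-team definition in this abstract can be illustrated with a minimal sketch. It covers only the original model, in which every gene occurs exactly once per chromosome, and it uses naive repeated splitting rather than the O(mn) algorithm the paper describes. The function names, the gene-to-position dictionaries and the `min_size` parameter are assumptions made for illustration.

```python
def split_by_gaps(genes, pos, delta):
    """Split genes into runs whose neighbouring positions differ by <= delta."""
    ordered = sorted(genes, key=pos.__getitem__)
    runs, current = [], [ordered[0]]
    for gene in ordered[1:]:
        if pos[gene] - pos[current[-1]] > delta:
            runs.append(current)      # gap too large: close the current run
            current = [gene]
        else:
            current.append(gene)
    runs.append(current)
    return runs


def gene_teams(pos_a, pos_b, delta, min_size=2):
    """Return maximal gene teams shared by two chromosomes.

    pos_a, pos_b: dict mapping gene name -> position (hypothetical input format).
    Assumes each gene occurs once per chromosome (the original gene-team model).
    """
    teams = []
    pending = [set(pos_a) & set(pos_b)]   # start from all shared genes
    while pending:
        candidate = pending.pop()
        if not candidate:
            continue
        for pos in (pos_a, pos_b):
            runs = split_by_gaps(candidate, pos, delta)
            if len(runs) > 1:             # not one run here: refine and retry
                pending.extend(set(run) for run in runs)
                break
        else:
            # A single run on both chromosomes: this set is a team.
            if len(candidate) >= min_size:
                teams.append(frozenset(candidate))
    return teams


# Toy positions, invented for illustration only.
chrom_a = {"a": 1, "b": 2, "c": 4, "d": 9, "e": 10}
chrom_b = {"b": 1, "a": 3, "e": 5, "d": 6, "c": 20}
print(gene_teams(chrom_a, chrom_b, delta=2))
```

    With a threshold of 2, the toy input yields the teams {a, b} and {d, e}: gene c is close to a and b on the first chromosome but far from them on the second, so it is excluded.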

  19. Breakup of a homeobox cluster after genome duplication in teleosts

    PubMed Central

    Mulley, John F.; Chiu, Chi-hua; Holland, Peter W. H.

    2006-01-01

    Several families of homeobox genes are arranged in genomic clusters in metazoan genomes, including the Hox, ParaHox, NK, Rhox, and Iroquois gene clusters. The selective pressures responsible for maintenance of these gene clusters are poorly understood. The ParaHox gene cluster is evolutionarily conserved between amphioxus and human but is fragmented in teleost fishes. We show that two basal ray-finned fish, Polypterus and Amia, each possess an intact ParaHox cluster; this implies that the selective pressure maintaining clustering was lost after whole-genome duplication in teleosts. Cluster breakup is because of gene loss, not transposition or inversion, and the total number of ParaHox genes is the same in teleosts, human, mouse, and frog. We propose that this homeobox gene cluster is held together in chordates by the existence of interdigitated control regions that could be separated after locus duplication in the teleost fish. PMID:16801555

  20. Drug repositioning for orphan genetic diseases through Conserved Anticoexpressed Gene Clusters (CAGCs)

    PubMed Central

    2013-01-01

    Background: The development of new therapies for orphan genetic diseases represents an extremely important medical and social challenge. Drug repositioning, i.e. finding new indications for approved drugs, could be one of the most cost- and time-effective strategies to cope with this problem, at least in a subset of cases. Therefore, many computational approaches based on the analysis of high throughput gene expression data have so far been proposed to reposition available drugs. However, most of these methods require gene expression profiles directly relevant to the pathologic conditions under study, such as those obtained from patient cells and/or from suitable experimental models. In this work we have developed a new approach for drug repositioning, based on identifying known drug targets showing conserved anti-correlated expression profiles with human disease genes, which is completely independent from the availability of ‘ad hoc’ gene expression data-sets. Results: By analyzing available data, we provide evidence that the genes displaying conserved anti-correlation with drug targets are antagonistically modulated in their expression by treatment with the relevant drugs. We then identified clusters of genes associated to similar phenotypes and showing conserved anticorrelation with drug targets. On this basis, we generated a list of potential candidate drug-disease associations. Importantly, we show that some of the proposed associations are already supported by independent experimental evidence. Conclusions: Our results support the hypothesis that the identification of gene clusters showing conserved anticorrelation with drug targets can be an effective method for drug repositioning and provide a wide list of new potential drug-disease associations for experimental validation. PMID:24088245

  1. Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA

    NASA Technical Reports Server (NTRS)

    Siefert, J. L.; Martin, K. A.; Abdi, F.; Widger, W. R.; Fox, G. E.

    1997-01-01

    Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome.

  2. Sequence Similarity of Clostridium difficile Strains by Analysis of Conserved Genes and Genome Content Is Reflected by Their Ribotype Affiliation

    PubMed Central

    Kurka, Hedwig; Ehrenreich, Armin; Ludwig, Wolfgang; Monot, Marc; Rupnik, Maja; Barbut, Frederic; Indra, Alexander; Dupuy, Bruno; Liebl, Wolfgang

    2014-01-01

    PCR-ribotyping is a broadly used method for the classification of isolates of Clostridium difficile, an emerging intestinal pathogen, causing infections with increased disease severity and incidence in several European and North American countries. We have now carried out clustering analysis with selected genes of numerous C. difficile strains as well as gene content comparisons of their genomes in order to broaden our view of the relatedness of strains assigned to different ribotypes. We analyzed the genomic content of 48 C. difficile strains representing 21 different ribotypes. The calculation of distance matrix-based dendrograms using the neighbor joining method for 14 conserved genes (standard phylogenetic marker genes) from the genomes of the C. difficile strains demonstrated that the genes from strains with the same ribotype generally clustered together. Further, certain ribotypes always clustered together and formed ribotype groups, i.e. ribotypes 078, 033 and 126, as well as ribotypes 002 and 017, indicating their relatedness. Comparisons of the gene contents of the genomes of ribotypes that clustered according to the conserved gene analysis revealed that the number of common genes of the ribotypes belonging to each of these three ribotype groups were very similar for the 078/033/126 group (at most 69 specific genes between the different strains with the same ribotype) but less similar for the 002/017 group (86 genes difference). It appears that the ribotype is indicative not only of a specific pattern of the amplified 16S–23S rRNA intergenic spacer but also reflects specific differences in the nucleotide sequences of the conserved genes studied here. It can be anticipated that the sequence deviations of more genes of C. difficile strains are correlated with their PCR-ribotype. In conclusion, the results of this study corroborate and extend the concept of clonal C. difficile lineages, which correlate with ribotypes affiliation. PMID:24482682

  3. Identification and Functional Analysis of the Nocardithiocin Gene Cluster in Nocardia pseudobrasiliensis

    PubMed Central

    Sakai, Kanae; Komaki, Hisayuki; Gonoi, Tohru

    2015-01-01

    Nocardithiocin is a thiopeptide compound isolated from the opportunistic pathogen Nocardia pseudobrasiliensis. It shows a strong activity against acid-fast bacteria and is also active against rifampicin-resistant Mycobacterium tuberculosis. Here, we report the identification of the nocardithiocin gene cluster in N. pseudobrasiliensis IFM 0761 based on conserved thiopeptide biosynthesis gene sequence and the whole genome sequence. The predicted gene cluster was confirmed by gene disruption and complementation. As expected, strains containing the disrupted gene did not produce nocardithiocin while gene complementation restored nocardithiocin production in these strains. The predicted cluster was further analyzed using RNA-seq which showed that the nocardithiocin gene cluster contains 12 genes within a 15.2-kb region. This finding will promote the improvement of nocardithiocin productivity and its derivatives production. PMID:26588225

  4. Complete Genome Sequence and Comparative Analysis of the Fish Pathogen Lactococcus garvieae

    PubMed Central

    Oshima, Kenshiro; Yoshizaki, Mariko; Kawanishi, Michiko; Nakaya, Kohei; Suzuki, Takehito; Miyauchi, Eiji; Ishii, Yasuo; Tanabe, Soichi; Murakami, Masaru; Hattori, Masahira

    2011-01-01

    Lactococcus garvieae causes fatal haemorrhagic septicaemia in fish such as yellowtail. The comparative analysis of genomes of a virulent strain Lg2 and a non-virulent strain ATCC 49156 of L. garvieae revealed that the two strains shared a high degree of sequence identity, but Lg2 had a 16.5-kb capsule gene cluster that is absent in ATCC 49156. The capsule gene cluster was composed of 15 genes, of which eight genes are highly conserved with those in exopolysaccharide biosynthesis gene cluster often found in Lactococcus lactis strains. Sequence analysis of the capsule gene cluster in the less virulent strain L. garvieae Lg2-S, Lg2-derived strain, showed that two conserved genes were disrupted by a single base pair deletion, respectively. These results strongly suggest that the capsule is crucial for virulence of Lg2. The capsule gene cluster of Lg2 may be a genomic island from several features such as the presence of insertion sequences flanked on both ends, different GC content from the chromosomal average, integration into the locus syntenic to other lactococcal genome sequences, and distribution in human gut microbiomes. The analysis also predicted other potential virulence factors such as haemolysin. The present study provides new insights into understanding of the virulence mechanisms of L. garvieae in fish. PMID:21829716

  5. Uncovering the functional constraints underlying the genomic organization of the odorant-binding protein genes.

    PubMed

    Librado, Pablo; Rozas, Julio

    2013-01-01

    Animal olfactory systems have a critical role for the survival and reproduction of individuals. In insects, the odorant-binding proteins (OBPs) are encoded by a moderately sized gene family, and mediate the first steps of the olfactory processing. Most OBPs are organized in clusters of a few paralogs, which are conserved over time. Currently, the biological mechanism explaining the close physical proximity among OBPs is not yet established. Here, we conducted a comprehensive study aiming to gain insights into the mechanisms underlying the OBP genomic organization. We found that the OBP clusters are embedded within large conserved arrangements. These organizations also include other non-OBP genes, which often encode proteins integral to plasma membrane. Moreover, the conservation degree of such large clusters is related to the following: 1) the promoter architecture of the confined genes, 2) a characteristic transcriptional environment, and 3) the chromatin conformation of the chromosomal region. Our results suggest that chromatin domains may restrict the location of OBP genes to regions having the appropriate transcriptional environment, leading to the OBP cluster structure. However, the appropriate transcriptional environment for OBP and the other neighbor genes is not dominated by reduced levels of expression noise. Indeed, the stochastic fluctuations in the OBP transcript abundance may have a critical role in the combinatorial nature of the olfactory coding process.

  6. Conservation of gene linkage in dispersed vertebrate NK homeobox clusters.

    PubMed

    Wotton, Karl R; Weierud, Frida K; Juárez-Morales, José L; Alvares, Lúcia E; Dietrich, Susanne; Lewis, Katharine E

    2009-10-01

    Nk homeobox genes are important regulators of many different developmental processes including muscle, heart, central nervous system and sensory organ development. They are thought to have arisen as part of the ANTP megacluster, which also gave rise to Hox and ParaHox genes, and at least some NK genes remain tightly linked in all animals examined so far. The protostome-deuterostome ancestor probably contained a cluster of nine Nk genes: (Msx)-(Nk4/tinman)-(Nk3/bagpipe)-(Lbx/ladybird)-(Tlx/c15)-(Nk7)-(Nk6/hgtx)-(Nk1/slouch)-(Nk5/Hmx). Of these genes, only NKX2.6-NKX3.1, LBX1-TLX1 and LBX2-TLX2 remain tightly linked in humans. However, it is currently unclear whether this is unique to the human genome as we do not know which of these Nk genes are clustered in other vertebrates. This makes it difficult to assess whether the remaining linkages are due to selective pressures or because chance rearrangements have "missed" certain genes. In this paper, we identify all of the paralogs of these ancestrally clustered NK genes in several distinct vertebrates. We demonstrate that tight linkages of Lbx1-Tlx1, Lbx2-Tlx2 and Nkx3.1-Nkx2.6 have been widely maintained in both the ray-finned and lobe-finned fish lineages. Moreover, the recently duplicated Hmx2-Hmx3 genes are also tightly linked. Finally, we show that Lbx1-Tlx1 and Hmx2-Hmx3 are flanked by highly conserved noncoding elements, suggesting that shared regulatory regions may have resulted in evolutionary pressure to maintain these linkages. Consistent with this, these pairs of genes have overlapping expression domains. In contrast, Lbx2-Tlx2 and Nkx3.1-Nkx2.6, which do not seem to be coexpressed, are also not associated with conserved noncoding sequences, suggesting that an alternative mechanism may be responsible for the continued clustering of these genes.

  7. DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes.

    PubMed

    Wang, Dapeng

    2016-01-15

    The conservation of gene organization in the genome with lineage-specificity is an invaluable resource for deciphering the potential functionality of genes under diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for this kind of gene cluster, which tends to reside in preferred locations and represents the distinctive genomic characteristics of a single species or a given lineage. Although gene families have been investigated widely, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/), which stores the pre-calculated lineage-based gene pairs in 134 currently available animal and plant genomes and inspects them under the same analytical framework, bringing out a set of innovative features. First, the taxonomy or lineage has been classified into four levels: Kingdom, Phylum, Class and Order. It adopts an all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in a certain species and reckons those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison of the gene pairs between two compared species and creates a table including all the conserved gene pairs and an image showing the degree of conservation of gene pairs at the chromosomal level. Third, it supplies a gene order browser to extend gene pairs to gene clusters, allowing users to view the evolutionary dynamics of the gene context in an intuitive manner.
This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and between monocots and eudicots, accounting for the significant contribution of gene pairs to speciation and diversification in specific lineages. Copyright © 2015 Elsevier Inc. All rights reserved.
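
    The two selection steps described in this abstract (lineage-conserved pairs, then lineage-specific pairs) can be sketched as follows. This is not the DLGP implementation: the input format, the function names and the `min_fraction` cut-off are assumptions, since the abstract does not state what counts as a "significant proportion". The sketch also reads "not conserved in any other lineages" as "not lineage-conserved in any other lineage", which is one possible interpretation.

```python
from collections import Counter


def lineage_conserved_pairs(pairs_by_species, lineages, min_fraction=0.8):
    """Gene pairs present in at least min_fraction of the species of each lineage.

    pairs_by_species: dict species -> set of gene pairs (hypothetical format;
                      each pair is a sorted tuple of ortholog-group identifiers).
    lineages:         dict lineage name -> list of species names.
    """
    conserved = {}
    for lineage, species in lineages.items():
        # Count in how many species of this lineage each pair is found.
        counts = Counter(
            pair for sp in species for pair in pairs_by_species.get(sp, set())
        )
        conserved[lineage] = {
            pair for pair, n in counts.items() if n / len(species) >= min_fraction
        }
    return conserved


def lineage_specific_pairs(conserved):
    """Keep lineage-conserved pairs that are not conserved in any other lineage."""
    specific = {}
    for lineage, pairs in conserved.items():
        elsewhere = set().union(
            *(other for name, other in conserved.items() if name != lineage)
        )
        specific[lineage] = pairs - elsewhere
    return specific


# Toy data, invented for illustration only.
pairs_by_species = {
    "human": {("A", "B"), ("C", "D")},
    "chimp": {("A", "B"), ("C", "D")},
    "fly": {("C", "D"), ("E", "F")},
    "mosquito": {("C", "D")},
}
lineages = {"Primates": ["human", "chimp"], "Diptera": ["fly", "mosquito"]}

conserved = lineage_conserved_pairs(pairs_by_species, lineages, min_fraction=1.0)
print(lineage_specific_pairs(conserved))
```

    In the toy data the pair (C, D) is conserved in both lineages and is therefore dropped from the lineage-specific sets, while (A, B) remains specific to Primates.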

  8. The ergot alkaloid gene cluster in Claviceps purpurea: extension of the cluster sequence and intra species evolution.

    PubMed

    Haarmann, Thomas; Machado, Caroline; Lübbe, Yvonne; Correia, Telmo; Schardl, Christopher L; Panaccione, Daniel G; Tudzynski, Paul

    2005-06-01

    The genomic region of Claviceps purpurea strain P1 containing the ergot alkaloid gene cluster [Tudzynski, P., Hölter, K., Correia, T., Arntz, C., Grammel, N., Keller, U., 1999. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol. Gen. Genet. 261, 133-141] was explored by chromosome walking, and additional genes probably involved in the ergot alkaloid biosynthesis have been identified. The putative cluster sequence (extending over 68.5kb) contains 4 different nonribosomal peptide synthetase (NRPS) genes and several putative oxidases. Northern analysis showed that most of the genes were co-regulated (repressed by high phosphate), and identified probable flanking genes by lack of co-regulation. Comparison of the cluster sequences of strain P1, an ergotamine producer, with that of strain ECC93, an ergocristine producer, showed high conservation of most of the cluster genes, but significant variation in the NRPS modules, strongly suggesting that evolution of these chemical races of C. purpurea is determined by evolution of NRPS module specificity.

  9. Long-range comparison of human and mouse Sprr loci to identify conserved noncoding sequences involved in coordinate regulation

    PubMed Central

    Martin, Natalia; Patel, Satyakam; Segre, Julia A.

    2004-01-01

    Mammalian epidermis provides a permeability barrier between an organism and its environment. Under homeostatic conditions, epidermal cells produce structural proteins, which are cross-linked in an orderly fashion to form a cornified envelope (CE). However, under genetic or environmental stress, specific genes are induced to rapidly build a temporary barrier. Small proline-rich (SPRR) proteins are the primary constituents of the CE. Under stress the entire family of 14 Sprr genes is upregulated. The Sprr genes are clustered within the larger epidermal differentiation complex on mouse chromosome 3, human chromosome 1q21. The clustering of the Sprr genes and their upregulation under stress suggest that these genes may be coordinately regulated. To identify enhancer elements that regulate this stress response activation of the Sprr locus, we utilized bioinformatic tools and classical biochemical dissection. Long-range comparative sequence analysis identified conserved noncoding sequences (CNSs). Clusters of epidermal-specific DNaseI-hypersensitive sites (HSs) mapped to specific CNSs. Increased prevalence of these HSs in barrier-deficient epidermis provides in vivo evidence of the regulation of the Sprr locus by these conserved sequences. Individual components of these HSs were cloned, and one was shown to have strong enhancer activity specific to conditions when the Sprr genes are coordinately upregulated. PMID:15574822

  10. DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants

    PubMed Central

    Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B.; Tóth, Gábor; Ortutay, Csaba P.; Patthy, László

    2005-01-01

    DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21 061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically. PMID:15608291
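
    The "up to 3000 bp upstream of the first exon" rule in this abstract can be sketched as below. This is an illustration rather than DoOP's actual pipeline: the function name, the 0-based half-open coordinate convention and the strand encoding are assumptions.

```python
# Translation table for complementing DNA bases (case-preserving).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def upstream_region(chrom_seq, exon_start, exon_end, strand, max_len=3000):
    """Return up to max_len bases upstream of a first exon, 5'->3' on the gene's strand.

    chrom_seq:            forward-strand chromosome sequence (str).
    exon_start, exon_end: 0-based, half-open forward-strand coordinates (assumed).
    strand:               "+" or "-".
    """
    if strand == "+":
        begin = max(0, exon_start - max_len)      # clip at chromosome start
        return chrom_seq[begin:exon_start]
    if strand == "-":
        end = min(len(chrom_seq), exon_end + max_len)  # clip at chromosome end
        # Upstream of a minus-strand gene lies to the right on the forward
        # strand; reverse-complement it to read it in the gene's orientation.
        return chrom_seq[exon_end:end].translate(_COMPLEMENT)[::-1]
    raise ValueError("strand must be '+' or '-'")


# Toy sequence, invented for illustration only.
seq = "AAACCCGGGTTT"
print(upstream_region(seq, 6, 9, "+", max_len=4))
print(upstream_region(seq, 3, 6, "-", max_len=4))
```

    The "up to" wording is reflected in the clipping: a first exon closer than `max_len` to a chromosome end simply yields a shorter segment.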

  11. DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants.

    PubMed

    Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B; Tóth, Gábor; Ortutay, Csaba P; Patthy, László

    2005-01-01

    DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21,061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically.

  12. Globin gene structure in a reptile supports the transpositional model for amniote α- and β-globin gene evolution.

    PubMed

    Patel, Vidushi S; Ezaz, Tariq; Deakin, Janine E; Graves, Jennifer A Marshall

    2010-12-01

    The haemoglobin protein, required for oxygen transportation in the body, is encoded by α- and β-globin genes that are arranged in clusters. The transpositional model for the evolution of distinct α-globin and β-globin clusters in amniotes is much simpler than the previously proposed whole genome duplication model. According to this model, all jawed vertebrates share one ancient region containing α- and β-globin genes and several flanking genes in the order MPG-C16orf35-(α-β)-GBY-LUC7L that has been conserved for more than 410 million years, whereas amniotes evolved a distinct β-globin cluster by insertion of a transposed β-globin gene from this ancient region into a cluster of olfactory receptors flanked by CCKBR and RRM1. It could not be determined whether this organisation is conserved in all amniotes because of the paucity of information from non-avian reptiles. To fill in this gap, we examined globin gene organisation in a squamate reptile, the Australian bearded dragon lizard, Pogona vitticeps (Agamidae). We report here that the α-globin cluster (HBK, HBA) is flanked by C16orf35 and GBY and is located on a pair of microchromosomes, whereas the β-globin cluster is flanked by RRM1 on the 3' end and is located on the long arm of chromosome 3. However, the CCKBR gene that flanks the β-globin cluster on the 5' end in other amniotes is located on the short arm of chromosome 5 in P. vitticeps, indicating that a chromosomal break between the β-globin cluster and CCKBR occurred at least in the agamid lineage. Our data from a reptile species provide further evidence to support the transpositional model for the evolution of β-globin gene cluster in amniotes.

  13. Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cameron, R A; Rowen, L; Nesbitt, R

    2005-10-11

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is 5'-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5-3'.) The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  14. Unusual Gene Order and Organization of the Sea Urchin HoxCluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew

    2005-05-10

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is 5'-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5-3'.) The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  15. Gene essentiality, conservation index and co-evolution of genes in cyanobacteria.

    PubMed

    Tiruveedula, Gopi Siva Sai; Wangikar, Pramod P

    2017-01-01

    Cyanobacteria, a group of photosynthetic prokaryotes, dominate the earth with ~10^15 g wet biomass. Despite diversity in habitats and an ancient origin, the cyanobacterial phylum has retained a significant core genome. Cyanobacteria are being explored for direct conversion of solar energy and carbon dioxide into biofuels. For this, efficient cyanobacterial strains will need to be designed via metabolic engineering. This will require identification of target knockouts to channelize the flow of carbon toward the product of interest while minimizing deletions of essential genes. We propose "Gene Conservation Index" (GCI) as a quick measure to predict gene essentiality in cyanobacteria. GCI is based on the phylogenetic profile of a gene constructed with a reduced dataset of cyanobacterial genomes. GCI is the percentage of organism clusters in which the query gene is present in the reduced dataset. Of the 750 genes deemed to be essential in the experimental study on S. elongatus PCC 7942, we found 494 to be conserved across the phylum, which largely comprise the essential metabolic pathways. On the contrary, the conserved but non-essential genes broadly comprise genes required under stress conditions. Exceptions to this rule include genes such as the glycogen synthesis and degradation enzymes, deoxyribose-phosphate aldolase (DERA), glucose-6-phosphate 1-dehydrogenase (zwf) and fructose-1,6-bisphosphatase class1, which are conserved but non-essential. While the essential genes are to be avoided during gene knockout studies as potentially lethal deletions, the non-essential but conserved set of genes could be interesting targets for metabolic engineering. Further, we identify clusters of co-evolving genes (CCG), which provide insights that may be useful in annotation. Principal component analysis (PCA) plots of the CCGs are demonstrated as data visualization tools that are complementary to the conventional heatmaps.
Our dataset consists of phylogenetic profiles for 23,643 non-redundant cyanobacterial genes. We believe that the data and the analysis presented here will be a great resource to the scientific community interested in cyanobacteria.
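
    The GCI definition in this abstract (the percentage of organism clusters in which the query gene is present) amounts to a one-line computation over a phylogenetic profile. The sketch below assumes the profile is already available as a mapping from organism cluster to presence/absence. How the clusters are built and how presence is called are not specified in the abstract, so the cluster labels and the function name are hypothetical.

```python
def gene_conservation_index(profile):
    """Percentage of organism clusters in which the gene is present.

    profile: dict mapping organism-cluster label -> bool (gene present or not).
             The labels and the presence calls are assumed to be given.
    """
    if not profile:
        raise ValueError("phylogenetic profile must not be empty")
    present = sum(1 for is_present in profile.values() if is_present)
    return 100.0 * present / len(profile)


# Toy profile over four hypothetical organism clusters.
profile = {"cluster_1": True, "cluster_2": True, "cluster_3": False, "cluster_4": True}
print(gene_conservation_index(profile))  # 75.0
```

    A gene present in every cluster scores 100, which corresponds to the "conserved across the phylum" category discussed in the abstract.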

  16. Horizontal transfer of a large and highly toxic secondary metabolic gene cluster between fungi.

    PubMed

    Slot, Jason C; Rokas, Antonis

    2011-01-25

    Genes involved in intermediary and secondary metabolism in fungi are frequently physically linked or clustered. For example, in Aspergillus nidulans the entire pathway for the production of sterigmatocystin (ST), a highly toxic secondary metabolite and a precursor to the aflatoxins (AF), is located in a ∼54 kb, 23 gene cluster. We discovered that a complete ST gene cluster in Podospora anserina was horizontally transferred from Aspergillus. Phylogenetic analysis shows that most Podospora cluster genes are adjacent to or nested within Aspergillus cluster genes, although the two genera belong to different taxonomic classes. Furthermore, the Podospora cluster is highly conserved in content, sequence, and microsynteny with the Aspergillus ST/AF clusters and its intergenic regions contain 14 putative binding sites for AflR, the transcription factor required for activation of the ST/AF biosynthetic genes. Examination of ∼52,000 Podospora expressed sequence tags identified transcripts for 14 genes in the cluster, with several expressed at multiple life cycle stages. The presence of putative AflR-binding sites and the expression evidence for several cluster genes, coupled with the recent independent discovery of ST production in Podospora [1], suggest that this HGT event probably resulted in a functional cluster. Given the abundance of metabolic gene clusters in fungi, our finding that one of the largest known metabolic gene clusters moved intact between species suggests that such transfers might have significantly contributed to fungal metabolic diversity. Copyright © 2011 Elsevier Ltd. All rights reserved.

  17. Chromosomal mapping of H3 histone and 5S rRNA genes in eight species of Astyanax (Pisces, Characiformes) with different diploid numbers: syntenic conservation of repetitive genes.

    PubMed

    Piscor, Diovani; Parise-Maltempi, Patricia Pasquali

    2016-03-01

    The genus Astyanax is widely distributed from the southern United States to northern Patagonia, Argentina. While cytogenetic studies have been performed for this genus, little is known about the histone gene families. The aim of this study was to examine the chromosomal relationships among the different species of Astyanax. The chromosomal locations of the 5S rRNA and H3 histone genes were determined in A. abramis, A. asuncionensis, A. altiparanae, A. bockmanni, A. eigenmanniorum, A. mexicanus (all 2n = 50), A. fasciatus (2n = 46), and A. schubarti (2n = 36). All eight species exhibited H3 histone clusters on two chromosome pairs. In six species (A. abramis, A. asuncionensis, A. altiparanae, A. bockmanni, A. eigenmanniorum, and A. fasciatus), syntenic clusters of H3 histone and 5S rDNA were observed on metacentric (m) or submetacentric (sm) chromosomes. In seven species, clusters of 5S rDNA sequences were located on one or two chromosome pairs. In A. mexicanus, 5S rDNA clusters were located on four chromosome pairs. This study demonstrates that H3 histone clusters are conserved on two chromosome pairs in the genus Astyanax, and specific chromosomal features may contribute to the genomic organization of the H3 histone and 5S rRNA genes.

  18. Conserved noncoding sequences (CNSs) in higher plants.

    PubMed

    Freeling, Michael; Subramaniam, Shabarinath

    2009-04-01

    Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and to occupy a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.

  19. The Lineage-Specific Evolution of Aquaporin Gene Clusters Facilitated Tetrapod Terrestrial Adaptation

    PubMed Central

    Finn, Roderick Nigel; Chauvigné, François; Hlidberg, Jón Baldur; Cutler, Christopher P.; Cerdà, Joan

    2014-01-01

    A major physiological barrier for aquatic organisms adapting to terrestrial life is desiccation in the aerial environment. This barrier was nevertheless overcome by the Devonian ancestors of extant Tetrapoda, but the origin of specific molecular mechanisms that solved this water problem remains largely unknown. Here we show that an ancient aquaporin gene cluster evolved specifically in the sarcopterygian lineage, and subsequently diverged into paralogous forms of AQP2, -5, or -6 to mediate water conservation in extant Tetrapoda. To determine the origin of these apomorphic genomic traits, we combined aquaporin sequencing from jawless and jawed vertebrates with broad taxon assembly of >2,000 transcripts amongst 131 deuterostome genomes and developed a model based upon Bayesian inference that traces their convergent roots to stem subfamilies in basal Metazoa and Prokaryota. This approach uncovered an unexpected diversity of aquaporins in every lineage investigated, and revealed that the vertebrate superfamily consists of 17 classes of aquaporins (Aqp0 - Aqp16). The oldest orthologs associated with water conservation in modern Tetrapoda are traced to a cluster of three aqp2-like genes in Actinistia that likely arose >500 Ma through duplication of an aqp0-like gene present in a jawless ancestor. In sea lamprey, we show that aqp0 first arose in a protocluster comprised of a novel aqp14 paralog and a fused aqp01 gene. To corroborate these findings, we conducted phylogenetic analyses of five syntenic nuclear receptor subfamilies, which, together with observations of extensive genome rearrangements, support the coincident loss of ancestral aqp2-like orthologs in Actinopterygii. We thus conclude that the divergence of sarcopterygian-specific aquaporin gene clusters was permissive for the evolution of water conservation mechanisms that facilitated tetrapod terrestrial adaptation. PMID:25426855

  20. G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    USDA-ARS's Scientific Manuscript database

    In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...

  1. Homologues of a single resistance-gene cluster in potato confer resistance to distinct pathogens: a virus and a nematode.

    PubMed

    van der Vossen, E A; van der Voort, J N; Kanyuka, K; Bendahmane, A; Sandbrink, H; Baulcombe, D C; Bakker, J; Stiekema, W J; Klein-Lankhorst, R M

    2000-09-01

    The isolation of the nematode-resistance gene Gpa2 in potato is described, and it is demonstrated that highly homologous resistance genes of a single resistance-gene cluster can confer resistance to distinct pathogen species. Molecular analysis of the Gpa2 locus resulted in the identification of an R-gene cluster of four highly homologous genes in a region of approximately 115 kb. At least two of these genes are active: one corresponds to the previously isolated Rx1 gene that confers resistance to potato virus X, while the other corresponds to the Gpa2 gene that confers resistance to the potato cyst nematode Globodera pallida. The proteins encoded by the Gpa2 and the Rx1 genes share an overall homology of over 88% (amino-acid identity) and belong to the leucine-zipper, nucleotide-binding site, leucine-rich repeat (LZ-NBS-LRR)-containing class of plant resistance genes. From the sequence conservation between Gpa2 and Rx1 it is clear that there is a direct evolutionary relationship between the two proteins. Sequence diversity is concentrated in the LRR region and in the C-terminus. The putative effector domains are more conserved suggesting that, at least in this case, nematode and virus resistance cascades could share common components. These findings underline the potential of protein breeding for engineering new resistance specificities against plant pathogens in vitro.

  2. Discovery and characterization of miRNA genes in atlantic salmon (Salmo salar) by use of a deep sequencing approach

    PubMed Central

    2013-01-01

    Background MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the posttranscriptional level. They play important roles in multiple biological processes by regulating genes that control developmental timing, growth, stem cell division and apoptosis by binding to the mRNA of target genes. Despite the position Atlantic salmon (Salmo salar) has as an economically important domesticated animal, there has been little research on miRNAs in this species. Knowledge about miRNAs and their target genes may be used to control health and to improve performance of economically important traits. However, before their biological function can be unravelled they must be identified and annotated. The aims of this study were to identify and characterize miRNA genes in Atlantic salmon by deep sequencing analysis of small RNA libraries from nine different tissues. Results A total of 180 distinct mature miRNAs belonging to 106 families of evolutionarily conserved miRNAs, and 13 distinct novel mature miRNAs were discovered and characterized. The mature miRNAs corresponded to 521 putative precursor sequences located at unique genome locations. About 40% of these precursors were part of gene clusters, and the majority of the Salmo salar gene clusters discovered were conserved across species. Comparison of expression levels in samples from different tissues applying DESeq indicated that there were tissue-specific expression differences in three conserved and one novel miRNA. Ssa-miR 736 was detected in heart tissue only, while two other clustered miRNAs (ssa-miR 212 and 132) seem to be expressed at a higher level in brain tissue. These observations correlate well with their expected functions as regulators of signal pathways in cardiac and neuronal cells, respectively. Ssa-miR 8163 is one of the novel miRNAs discovered and its function remains unknown. However, differential expression analysis using DESeq suggests that this miRNA is enriched in liver tissue and the precursor was mapped to intron 7 of the transferrin gene. Conclusions The identification and annotation of evolutionarily conserved and novel Salmo salar miRNAs as well as the characterization of miRNA gene clusters provide biological knowledge that will greatly facilitate further functional studies on miRNAs in this species. PMID:23865519

  3. Analysis of developmental gene conservation in the Actinomycetales using DNA/DNA microarray comparisons.

    PubMed

    Kirby, Ralph; Herron, Paul; Hoskisson, Paul

    2011-02-01

    Based on available genome sequences, Actinomycetales show significant gene synteny across a wide range of species and genera. In addition, many genera show varying degrees of complex morphological development. Using the presence of gene synteny as a basis, it is clear that an analysis of gene conservation across the Streptomyces and various other Actinomycetales will provide information on both the importance of genes and gene clusters and the evolution of morphogenesis in these bacteria. Genome sequencing, although becoming cheaper, is still relatively expensive for comparing large numbers of strains. Thus, a heterologous DNA/DNA microarray hybridization dataset based on a Streptomyces coelicolor microarray allows a cheaper and greater depth of analysis of gene conservation. This study, using both bioinformatical and microarray approaches, was able to classify genes previously identified as involved in morphogenesis in Streptomyces into various subgroups in terms of conservation across species and genera. This will allow the targeting of genes for further study based on their importance at the species level and at higher evolutionary levels.

  4. Welcome to pandoraviruses at the ‘Fourth TRUC’ club

    PubMed Central

    Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier

    2015-01-01

    Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9–2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the ‘Fourth TRUC’ club, encompassing distinct life forms compared with cellular organisms. PMID:26042093

  5. Welcome to pandoraviruses at the 'Fourth TRUC' club.

    PubMed

    Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier

    2015-01-01

    Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9-2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the 'Fourth TRUC' club, encompassing distinct life forms compared with cellular organisms.

  6. RNase 1 genes from the Family Sciuridae define a novel rodent ribonuclease cluster

    PubMed Central

    Siegel, Steven J.; Percopo, Caroline M.; Dyer, Kimberly D.; Zhao, Wei; Roth, V. Louise; Mercer, John M.; Rosenberg, Helene F.

    2009-01-01

    The RNase A ribonucleases are a complex group of functionally diverse secretory proteins with conserved enzymatic activity. We have identified novel RNase 1 genes from four species of squirrel (order Rodentia, family Sciuridae). Squirrel RNase 1 genes encode typical RNase A ribonucleases, each with eight cysteines, a conserved CKXXNTF signature motif, and a canonical His12-Lys41-His119 catalytic triad. Two alleles encode Callosciurus prevostii RNase 1, which include a Ser18↔Pro, analogous to the sequence polymorphisms found among the RNase 1 duplications in the genome of Rattus exulans. Interestingly, although the squirrel RNase 1 genes are closely related to one another (77 to 95% amino acid sequence identity), the cluster as a whole is distinct and divergent from the clusters including RNase 1 genes from other rodent species. We examined the specific sites at which Sciuridae RNase 1s diverge from Muridae / Cricetidae RNase 1s, and determined that the divergent sites are located on the external surface, with complete sparing of the catalytic crevice. The full significance of these findings awaits a more complete understanding of the biological role of mammalian RNase 1s. PMID:19771477

  7. A homeotic gene cluster patterns the anteroposterior body axis of C. elegans.

    PubMed

    Wang, B B; Müller-Immergluck, M M; Austin, J; Robinson, N T; Chisholm, A; Kenyon, C

    1993-07-16

    In insects and vertebrates, clusters of Antennapedia class homeobox (HOM-C) genes specify anteroposterior body pattern. The nematode C. elegans also contains a small cluster of HOM-C genes, one of which has been shown to specify positional identity. Here we show that two additional C. elegans HOM-C genes also specify positional identity and that together these three HOM-C genes function along the anteroposterior axis in the same order as their homologs in other organisms. Thus, HOM-C-based pattern formation has been conserved in nematodes despite the many differences in morphology and embryology that distinguish them from other phyla. Each C. elegans HOM-C gene is responsible for a distinct body region; however, where their domains overlap, two HOM-C genes can act together to specify the fates of individual cells.

  8. Comparative genomic sequence analysis of strawberry and other rosids reveals significant microsynteny

    PubMed Central

    2010-01-01

    Background Fragaria belongs to the Rosaceae, an economically important family that includes a number of important fruit producing genera such as Malus and Prunus. Using genomic sequences from 50 Fragaria fosmids, we have examined the microsynteny between Fragaria and other plant models. Results In more than half of the strawberry fosmids, we found syntenic regions that are conserved in Populus, Vitis, Medicago and/or Arabidopsis with Populus containing the greatest number of syntenic regions with Fragaria. The longest syntenic region was between LG VIII of the poplar genome and the strawberry fosmid 72E18, where seven out of twelve predicted genes were collinear. We also observed an unexpectedly high level of conserved synteny between Fragaria (rosid I) and Vitis (basal rosid). One of the strawberry fosmids, 34E24, contained a cluster of R gene analogs (RGAs) with NBS and LRR domains. We detected clusters of RGAs with high sequence similarity to those in 34E24 in all the genomes compared. In the phylogenetic tree we have generated, all the NBS-LRR genes grouped together with Arabidopsis CNL-A type NBS-LRR genes. The Fragaria RGA grouped together with those of Vitis and Populus in the phylogenetic tree. Conclusions Our analysis shows considerable microsynteny between Fragaria and other plant genomes such as Populus, Medicago, Vitis, and Arabidopsis to a lesser degree. We also detected a cluster of NBS-LRR type genes that are conserved in all the genomes compared. PMID:20565715

  9. DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

    PubMed

    El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

    2007-01-01

    We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.

  10. Conserved Responses in a War of Small Molecules between a Plant-Pathogenic Bacterium and Fungi.

    PubMed

    Spraker, Joseph E; Wiemann, Philipp; Baccile, Joshua A; Venkatesh, Nandhitha; Schumacher, Julia; Schroeder, Frank C; Sanchez, Laura M; Keller, Nancy P

    2018-05-22

    Small-molecule signaling is one major mode of communication within the polymicrobial consortium of soil and rhizosphere. While microbial secondary metabolite (SM) production and responses of individual species have been studied extensively, little is known about potentially conserved roles of SM signals in multilayered symbiotic or antagonistic relationships. Here, we characterize the SM-mediated interaction between the plant-pathogenic bacterium Ralstonia solanacearum and the two plant-pathogenic fungi Fusarium fujikuroi and Botrytis cinerea. We show that cellular differentiation and SM biosynthesis in F. fujikuroi are induced by the bacterially produced lipopeptide ralsolamycin (synonym ralstonin A). In particular, fungal bikaverin production is induced and preferentially accumulates in fungal survival spores (chlamydospores) only when exposed to supernatants of ralsolamycin-producing strains of R. solanacearum. Although inactivation of bikaverin biosynthesis moderately increases chlamydospore invasion by R. solanacearum, we show that other metabolites such as beauvericin are also induced by ralsolamycin and contribute to suppression of R. solanacearum growth in vitro. Based on our findings that bikaverin antagonizes R. solanacearum and that ralsolamycin induces bikaverin biosynthesis in F. fujikuroi, we asked whether other bikaverin-producing fungi show similar responses to ralsolamycin. Examining a strain of B. cinerea that horizontally acquired the bikaverin gene cluster from Fusarium, we found that ralsolamycin induced bikaverin biosynthesis in this fungus. Our results suggest that conservation of microbial SM responses across distantly related fungi may arise from horizontal transfer of protective gene clusters that are activated by conserved regulatory cues, e.g., a bacterial lipopeptide, providing consistent fitness advantages in dynamic polymicrobial networks. IMPORTANCE Bacteria and fungi are ubiquitous neighbors in many environments, including the rhizosphere. Many of these organisms are notorious as economically devastating plant pathogens, but little is known about how they communicate chemically with each other. Here, we uncover a conserved antagonistic communication between the widespread bacterial wilt pathogen Ralstonia solanacearum and plant-pathogenic fungi from disparate genera, Fusarium and Botrytis. Exposure of Fusarium fujikuroi to the bacterial lipopeptide ralsolamycin resulted in production of the antibacterial metabolite bikaverin specifically in fungal tissues invaded by Ralstonia. Remarkably, ralsolamycin induction of bikaverin was conserved in a Botrytis cinerea isolate carrying a horizontally transferred bikaverin gene cluster. These results indicate that horizontally transferred gene clusters may carry regulatory prompts that contribute to conserved fitness functions in polymicrobial environments. Copyright © 2018 Spraker et al.

  11. WordCluster: detecting clusters of DNA words and genomic elements

    PubMed Central

    2011-01-01

    Background Many k-mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well-established examples are the genes, TFBSs, CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results We introduce here an algorithm to detect clusters of DNA words (k-mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method into a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation varies drastically between inside and outside of the clusters. As another example, we used WordCluster to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions WordCluster seems to predict biologically meaningful clusters of DNA words (k-mers) and genomic entities. The implementation of the method into a web server is available at http://bioinfo2.ugr.es/wordCluster/wordCluster.php including additional features like the detection of co-localization with gene regions or the annotation enrichment tool for functional analysis of overlapped genes. PMID:21261981
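
    The idea of grouping copies of a genomic element by the distance between consecutive copies can be sketched in a few lines of Python. This shows only a simple distance-based chaining step and is not the WordCluster implementation: the function name, the `max_gap` and `min_copies` parameters, and the toy coordinates are hypothetical, and the statistical significance that the abstract says is assigned to each cluster is left out entirely.

```python
from typing import List, Tuple


def chain_clusters(positions: List[int], max_gap: int,
                   min_copies: int = 2) -> List[Tuple[int, int, int]]:
    """Chain consecutive copies whose spacing is at most max_gap.

    Returns (start, end, n_copies) for every chain with at least
    min_copies members. max_gap is a user-supplied assumption here.
    """
    clusters: List[Tuple[int, int, int]] = []
    if not positions:
        return clusters
    ordered = sorted(positions)
    start = prev = ordered[0]
    count = 1
    for pos in ordered[1:]:
        if pos - prev <= max_gap:
            count += 1                      # extend the current chain
        else:
            if count >= min_copies:
                clusters.append((start, prev, count))
            start = pos                     # open a new chain
            count = 1
        prev = pos
    if count >= min_copies:                 # flush the last chain
        clusters.append((start, prev, count))
    return clusters


# Hypothetical start coordinates of one DNA word on a chromosome.
hits = [100, 110, 125, 900, 905, 2000]
print(chain_clusters(hits, max_gap=20))  # [(100, 125, 3), (900, 905, 2)]
```

    The isolated copy at position 2000 forms no cluster because it has no neighbor within the gap limit. A faithful reimplementation would replace the fixed `max_gap` with a data-driven threshold and attach a p-value to each chain.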

  12. Microarray-based comparative genomic profiling of reference strains and selected Canadian field isolates of Actinobacillus pleuropneumoniae

    PubMed Central

    Gouré, Julien; Findlay, Wendy A; Deslandes, Vincent; Bouevitch, Anne; Foote, Simon J; MacInnes, Janet I; Coulton, James W; Nash, John HE; Jacques, Mario

    2009-01-01

    Background Actinobacillus pleuropneumoniae, the causative agent of porcine pleuropneumonia, is a highly contagious respiratory pathogen that causes severe losses to the swine industry worldwide. Current commercially-available vaccines are of limited value because they do not induce cross-serovar immunity and do not prevent development of the carrier state. Microarray-based comparative genomic hybridizations (M-CGH) were used to estimate whole genomic diversity of representative Actinobacillus pleuropneumoniae strains. Our goal was to identify conserved genes, especially those predicted to encode outer membrane proteins and lipoproteins because of their potential for the development of more effective vaccines. Results Using hierarchical clustering, our M-CGH results showed that the majority of the genes in the genome of the serovar 5 A. pleuropneumoniae L20 strain were conserved in the reference strains of all 15 serovars and in representative field isolates. Fifty-eight conserved genes predicted to encode for outer membrane proteins or lipoproteins were identified. As well, there were several clusters of diverged or absent genes including those associated with capsule biosynthesis, toxin production as well as genes typically associated with mobile elements. Conclusion Although A. pleuropneumoniae strains are essentially clonal, M-CGH analysis of the reference strains of the fifteen serovars and representative field isolates revealed several classes of genes that were divergent or absent. Not surprisingly, these included genes associated with capsule biosynthesis as the capsule is associated with sero-specificity. Several of the conserved genes were identified as candidates for vaccine development, and we conclude that M-CGH is a valuable tool for reverse vaccinology. PMID:19239696

  13. Comparative interrogation of the developing xylem transcriptomes of two wood-forming species: Populus trichocarpa and Eucalyptus grandis.

    PubMed

    Hefer, Charles A; Mizrachi, Eshchar; Myburg, Alexander A; Douglas, Carl J; Mansfield, Shawn D

    2015-06-01

    Wood formation is a complex developmental process governed by genetic and environmental stimuli. Populus and Eucalyptus are fast-growing, high-yielding tree genera that represent ecologically and economically important species suitable for generating significant lignocellulosic biomass. Comparative analysis of the developing xylem and leaf transcriptomes of Populus trichocarpa and Eucalyptus grandis together with phylogenetic analyses identified clusters of homologous genes preferentially expressed during xylem formation in both species. A conserved set of 336 single gene pairs showed highly similar xylem preferential expression patterns, as well as evidence of high functional constraint. Individual members of multi-gene orthologous clusters known to be involved in secondary cell wall biosynthesis also showed conserved xylem expression profiles. However, species-specific expression as well as opposite (xylem versus leaf) expression patterns observed for a subset of genes suggest subtle differences in the transcriptional regulation important for xylem development in each species. Using sequence similarity and gene expression status, we identified functional homologs likely to be involved in xylem developmental and biosynthetic processes in Populus and Eucalyptus. Our study suggests that, while genes involved in secondary cell wall biosynthesis show high levels of gene expression conservation, differential regulation of some xylem development genes may give rise to unique xylem properties. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  14. Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

    PubMed Central

    Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

    1993-01-01

    Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. PMID:8497043

  15. The Complete Mitochondrial Genome of Gossypium hirsutum and Evolutionary Analysis of Higher Plant Mitochondrial Genomes

    PubMed Central

    Su, Aiguo; Geng, Jianing; Grover, Corrinne E.; Hu, Songnian; Hua, Jinping

    2013-01-01

    Background: Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant worldwide. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. Methodology/Principal Findings: We sequenced the Gossypium hirsutum mt genome using 454 technology combined with screening of a Fosmid library and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The complete G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the locations of cp-like migrated fragments and found several closely linked with mitochondrial genes. Conclusion: The G. hirsutum mt genome possesses most of the common characteristics of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species. PMID:23940520

  16. The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

    PubMed

    Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

    2013-01-01

    Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant worldwide. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. We sequenced the Gossypium hirsutum mt genome using 454 technology combined with screening of a Fosmid library and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The complete G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the locations of cp-like migrated fragments and found several closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characteristics of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species.

  17. A novel bioinformatics pipeline to discover genes related to arbuscular mycorrhizal symbiosis based on their evolutionary conservation pattern among higher plants.

    PubMed

    Favre, Patrick; Bapaume, Laure; Bossolini, Eligio; Delorenzi, Mauro; Falquet, Laurent; Reinhardt, Didier

    2014-12-03

    Genes involved in arbuscular mycorrhizal (AM) symbiosis have been identified primarily by mutant screens, followed by identification of the mutated genes (forward genetics). In addition, a number of AM-related genes has been identified by their AM-related expression patterns, and their function has subsequently been elucidated by knock-down or knock-out approaches (reverse genetics). However, genes that are members of functionally redundant gene families, or genes that have a vital function and therefore result in lethal mutant phenotypes, are difficult to identify. If such genes are constitutively expressed and therefore escape differential expression analyses, they remain elusive. The goal of this study was to systematically search for AM-related genes with a bioinformatics strategy that is insensitive to these problems. The central element of our approach is based on the fact that many AM-related genes are conserved only among AM-competent species. Our approach involves genome-wide comparisons at the proteome level of AM-competent host species with non-mycorrhizal species. Using a clustering method we first established orthologous/paralogous relationships and subsequently identified protein clusters that contain members only of the AM-competent species. Proteins of these clusters were then analyzed in an extended set of 16 plant species and ranked based on their relatedness among AM-competent monocot and dicot species, relative to non-mycorrhizal species. In addition, we combined the information on the protein-coding sequence with gene expression data and with promoter analysis. As a result we present a list of yet uncharacterized proteins that show a strongly AM-related pattern of sequence conservation, indicating that the respective genes may have been under selection for a function in AM. 
    Among the top candidates are three genes that encode a small family of similar receptor-like kinases that are related to the S-locus receptor kinases involved in sporophytic self-incompatibility. We present a new systematic strategy of gene discovery based on conservation of the protein-coding sequence that complements classical forward and reverse genetics. This strategy can be applied to diverse other biological phenomena if species with established genome sequences fall into distinguished groups that differ in a defined functional trait of interest.
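
The filtering step described in this abstract (keeping only protein clusters whose members all come from AM-competent species) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the species labels, cluster contents, the function name `am_restricted_clusters`, and the `min_species` threshold are all hypothetical, and the upstream orthology clustering is assumed to have been done already.

```python
# Hypothetical species labels; the study's actual species set is not reproduced here.
AM_COMPETENT = {"species_A", "species_B", "species_C"}


def am_restricted_clusters(clusters, am_species, min_species=2):
    """Return clusters whose members all come from AM-competent species.

    clusters: dict mapping cluster id -> list of (species, protein_id) pairs.
    am_species: set of species names treated as AM-competent.
    min_species: assumed minimum number of distinct AM-competent species
        that must be represented in a cluster for it to be kept.
    """
    selected = {}
    for cluster_id, members in clusters.items():
        species_present = {species for species, _ in members}
        # Keep the cluster only if every member species is AM-competent
        # and enough distinct species are represented.
        if species_present <= am_species and len(species_present) >= min_species:
            selected[cluster_id] = sorted(species_present)
    return selected


# Toy input: c2 contains a non-mycorrhizal species, c3 has only one species.
clusters = {
    "c1": [("species_A", "p1"), ("species_B", "p2")],
    "c2": [("species_A", "p3"), ("species_X", "p4")],
    "c3": [("species_C", "p5")],
}
result = am_restricted_clusters(clusters, AM_COMPETENT)
print(result)
```

In a real analysis the retained clusters would then be ranked and combined with expression and promoter data, as the abstract describes; that ranking is not modelled here.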

  18. A remarkably stable TipE gene cluster: evolution of insect Para sodium channel auxiliary subunits

    PubMed Central

    2011-01-01

    Background: First identified in fruit flies with temperature-sensitive paralysis phenotypes, the Drosophila melanogaster TipE locus encodes four voltage-gated sodium (NaV) channel auxiliary subunits. This cluster of TipE-like genes on chromosome 3L, and a fifth family member on chromosome 3R, are important for the optimal expression and functionality of the Para NaV channel but appear quite distinct from auxiliary subunits in vertebrates. Here, we exploited available arthropod genomic resources to trace the origin of TipE-like genes by mapping their evolutionary histories and examining their genomic architectures. Results: We identified a remarkably conserved synteny block of TipE-like orthologues with well-maintained local gene arrangements from 21 insect species. Homologues in the water flea, Daphnia pulex, suggest an ancestral pancrustacean repertoire of four TipE-like genes; a subsequent gene duplication may have generated functional redundancy allowing gene losses in the silk moth and mosquitoes. Intronic nesting of the insect TipE gene cluster probably occurred following the divergence from crustaceans, but in the flour beetle and silk moth genomes the clusters apparently escaped from nesting. Across Pancrustacea, TipE gene family members have experienced intronic nesting, escape from nesting, retrotransposition, translocation, and gene loss events while generally maintaining their local gene neighbourhoods. D. melanogaster TipE-like genes exhibit coordinated spatial and temporal regulation of expression distinct from their host gene but well-correlated with their regulatory target, the Para NaV channel, suggesting that functional constraints may preserve the TipE gene cluster. We identified homology between TipE-like NaV channel regulators and vertebrate Slo-beta auxiliary subunits of big-conductance calcium-activated potassium (BKCa) channels, which suggests that ion channel regulatory partners have evolved distinct lineage-specific characteristics. Conclusions: TipE-like genes form a remarkably conserved genomic cluster across all examined insect genomes. This study reveals likely structural and functional constraints on the genomic evolution of insect TipE gene family members maintained in synteny over hundreds of millions of years of evolution. The likely common origin of these NaV channel regulators with BKCa auxiliary subunits highlights the evolutionary plasticity of ion channel regulatory mechanisms. PMID:22098672

  19. Genetic characterization of the hemagglutinin genes of wild-type measles virus circulating in China, 1993-2009.

    PubMed

    Xu, Songtao; Zhang, Yan; Zhu, Zhen; Liu, Chunyu; Mao, Naiying; Ji, Yixin; Wang, Huiling; Jiang, Xiaohong; Li, Chongshan; Tang, Wei; Feng, Daxing; Wang, Changyin; Zheng, Lei; Lei, Yue; Ling, Hua; Zhao, Chunfang; Ma, Yan; He, Jilan; Wang, Yan; Li, Ping; Guan, Ronghui; Zhou, Shujie; Zhou, Jianhui; Wang, Shuang; Zhang, Hong; Zheng, Huanying; Liu, Leng; Ma, Hemuti; Guan, Jing; Lu, Peishan; Feng, Yan; Zhang, Yanjun; Zhou, Shunde; Xiong, Ying; Ba, Zhuoma; Chen, Hui; Yang, Xiuhui; Bo, Fang; Ma, Yujie; Liang, Yong; Lei, Yake; Gu, Suyi; Liu, Wei; Chen, Meng; Featherstone, David; Jee, Youngmee; Bellini, William J; Rota, Paul A; Xu, Wenbo

    2013-01-01

    China experienced several large measles outbreaks in the past two decades, and a series of enhanced control measures were implemented to achieve the goal of measles elimination. Molecular epidemiologic surveillance of wild-type measles viruses (MeV) provides valuable information about the viral transmission patterns. Since 1993, virologic surveillance has confirmed that viruses of a single endemic genotype, H1, have been predominantly circulating in China. A component of molecular surveillance is to monitor the genetic characteristics of the hemagglutinin (H) gene of MeV, the major target for virus neutralizing antibodies. Analysis of the sequences of the complete H gene from 56 representative wild-type MeV strains circulating in China during 1993-2009 showed that the H gene sequences were clustered into 2 groups, cluster 1 and cluster 2. Cluster 1 strains were the most frequently detected and had a widespread distribution in China after 2000. The predicted amino acid sequences of the H protein were relatively conserved at most of the functionally significant amino acid positions. However, most of the genotype H1 cluster 1 viruses had an amino acid substitution (Ser240Asn), which removed a predicted N-linked glycosylation site. In addition, the substitution of Pro397Leu in the hemagglutinin noose epitope (HNE) was identified in 23 of 56 strains. The evolutionary rate of the H gene of the genotype H1 viruses was estimated to be approximately 0.76 × 10^-3 substitutions per site per year, and the ratio of dN to dS (dN/dS) was <1, indicating the absence of selective pressure. Although H genes of the genotype H1 strains were conserved and not subjected to selective pressure, several amino acid substitutions were observed in functionally important positions. Therefore, the antigenic and genetic properties of H genes of wild-type MeVs should be monitored as part of routine molecular surveillance for measles in China.

  20. Drivers of genetic diversity in secondary metabolic gene clusters within a fungal species

    PubMed Central

    Lind, Abigail L.; Wisecaver, Jennifer H.; Lameiras, Catarina; Wiemann, Philipp; Palmer, Jonathan M.; Keller, Nancy P.; Rodrigues, Fernando; Goldman, Gustavo H.

    2017-01-01

    Filamentous fungi produce a diverse array of secondary metabolites (SMs) critical for defense, virulence, and communication. The metabolic pathways that produce SMs are found in contiguous gene clusters in fungal genomes, an atypical arrangement for metabolic pathways in other eukaryotes. Comparative studies of filamentous fungal species have shown that SM gene clusters are often either highly divergent or uniquely present in one or a handful of species, hampering efforts to determine the genetic basis and evolutionary drivers of SM gene cluster divergence. Here, we examined SM variation in 66 cosmopolitan strains of a single species, the opportunistic human pathogen Aspergillus fumigatus. Investigation of genome-wide within-species variation revealed 5 general types of variation in SM gene clusters: nonfunctional gene polymorphisms; gene gain and loss polymorphisms; whole cluster gain and loss polymorphisms; allelic polymorphisms, in which different alleles corresponded to distinct, nonhomologous clusters; and location polymorphisms, in which a cluster was found to differ in its genomic location across strains. These polymorphisms affect the function of representative A. fumigatus SM gene clusters, such as those involved in the production of gliotoxin, fumigaclavine, and helvolic acid as well as the function of clusters with undefined products. In addition to enabling the identification of polymorphisms, the detection of which requires extensive genome-wide synteny conservation (e.g., mobile gene clusters and nonhomologous cluster alleles), our approach also implicated multiple underlying genetic drivers, including point mutations, recombination, and genomic deletion and insertion events as well as horizontal gene transfer from distant fungi. Finally, most of the variants that we uncover within A. fumigatus have been previously hypothesized to contribute to SM gene cluster diversity across entire fungal classes and phyla. 
    We suggest that the drivers of genetic diversity operating within a fungal species shown here are sufficient to explain SM cluster macroevolutionary patterns. PMID:29149178

  1. Missing link in the evolution of Hox clusters.

    PubMed

    Ogishima, Soichi; Tanaka, Hiroshi

    2007-01-31

    The Hox cluster has key roles in regulating the patterning of the antero-posterior axis in a metazoan embryo. It consists of the anterior, central and posterior genes; the central genes have been identified only in bilaterians, not in cnidarians, and are responsible for achieving morphological complexity in bilaterian development. However, their evolutionary history has not been revealed; that is, there has been a "missing link". Here we show the evolutionary history of Hox clusters of 18 bilaterians and 2 cnidarians by using a new method, "motif-based reconstruction", examining the gain/loss processes of evolutionarily conserved sequences, "motifs", outside the homeodomain. We successfully identified the missing link in the evolution of Hox clusters between the cnidarian-bilaterian ancestor and the bilaterians as the ancestor of the central genes, which we call the proto-central gene. Searching for a gene corresponding to the proto-central gene, we found that one of the acoela Hox genes has the same motif repertoire as that of the proto-central gene. This interesting finding suggests that the acoela Hox cluster corresponds with the missing link in the evolution of the Hox cluster between the cnidarian-bilaterian ancestor and the bilaterians. Our findings suggest that motif gains/diversifications led to the explosive diversity of the bilaterian body plan.

  2. Heterochromatin influences the secondary metabolite profile in the plant pathogen Fusarium graminearum

    PubMed Central

    Reyes-Dominguez, Yazmid; Boedi, Stefan; Sulyok, Michael; Wiesenberger, Gerlinde; Stoppacher, Norbert; Krska, Rudolf; Strauss, Joseph

    2012-01-01

    Chromatin modifications and heterochromatic marks have been shown to be involved in the regulation of secondary metabolism gene clusters in the fungal model system Aspergillus nidulans. We examine here the role of HEP1, the heterochromatin protein homolog of Fusarium graminearum, for the production of secondary metabolites. Deletion of Hep1 in a PH-1 background strongly influences expression of genes required for the production of aurofusarin and the main trichothecene metabolite DON. In the Hep1 deletion strains AUR genes are highly up-regulated and aurofusarin production is greatly enhanced, suggesting a repressive role for heterochromatin on gene expression of this cluster. Unexpectedly, gene expression and metabolites are lower for the trichothecene cluster, suggesting a positive function of Hep1 for DON biosynthesis. However, analysis of histone modifications in chromatin of AUR and DON gene promoters reveals that in both gene clusters the H3K9me3 heterochromatic mark is strongly reduced in the Hep1 deletion strain. This, and the finding that a DON-cluster flanking gene is up-regulated, suggests that the DON biosynthetic cluster is repressed by HEP1 directly and indirectly. Results from this study point to a conserved mode of secondary metabolite (SM) biosynthesis regulation in fungi by chromatin modifications and the formation of facultative heterochromatin. PMID:22100541

  3. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
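
A divergence time derived from synonymous substitutions is commonly computed with the molecular-clock relation T = Ks / (2r), where Ks is the number of synonymous substitutions per synonymous site between the two sequences and r is the substitution rate per site per year. The sketch below only illustrates that arithmetic. The abstract does not report Ks or r, so the values used here are hypothetical and were chosen simply so that the result matches the stated order of magnitude (about 1 million years).

```python
def divergence_time_years(ks, rate_per_site_per_year):
    """Estimate divergence time with the standard clock relation T = Ks / (2r).

    ks: synonymous substitutions per synonymous site between two haplotypes.
    rate_per_site_per_year: assumed synonymous substitution rate r.
    The factor 2 accounts for substitutions accumulating on both lineages.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return ks / (2.0 * rate_per_site_per_year)


# Hypothetical inputs, not taken from the paper.
example_ks = 0.009
example_rate = 4.5e-9

estimate = divergence_time_years(example_ks, example_rate)
print(f"Estimated divergence time: {estimate:.3g} years")
```

The estimate scales linearly with Ks and inversely with the assumed rate, so the choice of r dominates the uncertainty of any such figure.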

  4. Linkage of the Nit1C gene cluster to bacterial cyanide assimilation as a nitrogen source.

    PubMed

    Jones, Lauren B; Ghosh, Pallab; Lee, Jung-Hyun; Chou, Chia-Ni; Kunz, Daniel A

    2018-05-21

    A genetic linkage between a conserved gene cluster (Nit1C) and the ability of bacteria to utilize cyanide as the sole nitrogen source was demonstrated for nine different bacterial species. These included three strains whose cyanide nutritional ability has formerly been documented (Pseudomonas fluorescens Pf11764, Pseudomonas putida BCN3 and Klebsiella pneumoniae BCN33), and six not previously known to have this ability [Burkholderia (Paraburkholderia) xenovorans LB400, Paraburkholderia phymatum STM815, Paraburkholderia phytofirmans PsJN, Cupriavidus (Ralstonia) eutropha H16, Gluconoacetobacter diazotrophicus PA1 5 and Methylobacterium extorquens AM1]. For all bacteria, growth on or exposure to cyanide led to the induction of the canonical nitrilase (NitC) linked to the gene cluster, and in the case of Pf11764 in particular, transcript levels of cluster genes (nitBCDEFGH) were raised, and a nitC knock-out mutant failed to grow. Further studies demonstrated that the highly conserved nitB gene product was also significantly elevated. Collectively, these findings provide strong evidence for a genetic linkage between Nit1C and bacterial growth on cyanide, supporting use of the term cyanotrophy in describing what may represent a new nutritional paradigm in microbiology. A broader search of Nit1C genes in presently available genomes revealed its presence in 270 different bacteria, all contained within the domain Bacteria, including Gram-positive Firmicutes and Actinobacteria, and Gram-negative Proteobacteria and Cyanobacteria. Absence of the cluster in the Archaea is congruent with events that may have led to the inception of Nit1C occurring coincidentally with the first appearance of cyanogenic species on Earth, dating back 400-500 million years.

  5. Comparative genomic analysis of isoproturon-mineralizing sphingomonads reveals the isoproturon catabolic mechanism.

    PubMed

    Yan, Xin; Gu, Tao; Yi, Zhongquan; Huang, Junwei; Liu, Xiaowei; Zhang, Ji; Xu, Xihui; Xin, Zhihong; Hong, Qing; He, Jian; Spain, Jim C; Li, Shunpeng; Jiang, Jiandong

    2016-12-01

    The worldwide use of the phenylurea herbicide, isoproturon (IPU), has resulted in considerable concern about its environmental fate. Although many microbial metabolites of IPU are known and IPU-mineralizing bacteria have been isolated, the molecular mechanism of IPU catabolism has not been elucidated yet. In this study, complete genes that encode the conserved IPU catabolic pathway were revealed, based on comparative analysis of the genomes of three IPU-mineralizing sphingomonads and subsequent experimental validation. The complete genes included a novel hydrolase gene ddhA, which is responsible for the cleavage of the urea side chain of the IPU demethylated products; a distinct aniline dioxygenase gene cluster adoQTA1A2BR, which has a broad substrate range; and an inducible catechol meta-cleavage pathway gene cluster adoXEGKLIJC. Furthermore, the initial mono-N-demethylation genes pdmAB were further confirmed to be involved in the successive N-demethylation of the IPU mono-N-demethylated product. These IPU-catabolic genes were organized into four transcription units and distributed on three plasmids. They were flanked by multiple mobile genetic elements and highly conserved among IPU-mineralizing sphingomonads. The elucidation of the molecular mechanism of IPU catabolism will enhance our understanding of the microbial mineralization of IPU and provide insights into the evolutionary scenario of the conserved IPU-catabolic pathway. © 2016 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.

  6. VRprofile: gene-cluster-detection-based profiling of virulence and antibiotic resistance traits encoded within genome sequences of pathogenic bacteria.

    PubMed

    Li, Jun; Tai, Cui; Deng, Zixin; Zhong, Weihong; He, Yongqun; Ou, Hong-Yu

    2017-01-10

    VRprofile is a Web server that facilitates rapid investigation of virulence and antibiotic resistance genes, as well as the genetic contexts related to the transfer of these traits, in newly sequenced pathogenic bacterial genomes. The backend database, MobilomeDB, was first built on sets of known gene cluster loci of bacterial type III/IV/VI/VII secretion systems and mobile genetic elements, including integrative and conjugative elements, prophages, class I integrons, IS elements and pathogenicity/antibiotic resistance islands. VRprofile is thus able to co-localize the homologs of these conserved gene clusters using HMMer or BLASTp searches. With the integration of the homologous gene cluster search module with a sequence composition module, VRprofile has exhibited better performance for island-like region predictions than other widely used methods. In addition, VRprofile provides an integrated Web interface for aligning and visualizing identified gene clusters with MobilomeDB-archived gene clusters, or with a varied set of bacterial genomes. VRprofile might help meet the increasing demand for re-annotation of bacterial variable regions, and aid in the real-time definition of disease-relevant gene clusters in pathogenic bacteria of interest. VRprofile is freely available at http://bioinfo-mml.sjtu.edu.cn/VRprofile. © The Author 2017. Published by Oxford University Press.

  7. Comparative Genomic Analysis of N2-Fixing and Non-N2-Fixing Paenibacillus spp.: Organization, Evolution and Expression of the Nitrogen Fixation Genes

    PubMed Central

    Xie, Jian-Bo; Du, Zhenglin; Bai, Lanqing; Tian, Changfu; Zhang, Yunzhi; Xie, Jiu-Yan; Wang, Tianshu; Liu, Xiaomeng; Chen, Xi; Cheng, Qi; Chen, Sanfeng; Li, Jilun

    2014-01-01

    We provide here a comparative genome analysis of 31 strains within the genus Paenibacillus including 11 new genomic sequences of N2-fixing strains. The heterogeneity of the 31 genomes (15 N2-fixing and 16 non-N2-fixing Paenibacillus strains) was reflected in the large size of the shell genome, which makes up approximately 65.2% of the genes in the pan-genome. Large numbers of transposable elements might be related to the heterogeneity. We discovered that a minimal and compact nif cluster comprising nine genes nifB, nifH, nifD, nifK, nifE, nifN, nifX, hesA and nifV encoding Mo-nitrogenase is conserved in the 15 N2-fixing strains. The nif cluster is under control of a σ70-dependent promoter and possesses a GlnR/TnrA-binding site in the promoter. The Suf system encoding [Fe–S] cluster is highly conserved in N2-fixing and non-N2-fixing strains. Furthermore, we demonstrate that the nif cluster enabled Escherichia coli JM109 to fix nitrogen. Phylogeny of the concatenated NifHDK sequences indicates that Paenibacillus and Frankia are sister groups. Phylogeny of the concatenated 275 single-copy core genes suggests that the ancestral Paenibacillus did not fix nitrogen. The N2-fixing Paenibacillus strains were generated by acquiring the nif cluster via horizontal gene transfer (HGT) from a source related to Frankia. During the history of evolution, the nif cluster was lost, producing some non-N2-fixing strains, and vnf encoding V-nitrogenase or anf encoding Fe-nitrogenase was acquired, causing further diversification of some strains. In addition, some N2-fixing strains have additional nif and nif-like genes which may result from gene duplications. The evolution of nitrogen fixation in Paenibacillus involves a mix of gain, loss, HGT and duplication of nif/anf/vnf genes. This study not only reveals the organization and distribution of nitrogen fixation genes in Paenibacillus, but also provides insight into the complex evolutionary history of nitrogen fixation. PMID:24651173
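
The shell-genome percentage quoted above comes from partitioning gene families by how many strains carry them. The following is a minimal sketch of one common way to do that, under assumptions that the abstract does not confirm: "core" means present in every strain, "unique" means present in exactly one strain, and "shell" means everything in between. The toy presence/absence data and the function name `partition_pan_genome` are hypothetical.

```python
def partition_pan_genome(presence):
    """Split gene families into core, shell and unique sets.

    presence: dict mapping gene family name -> set of strains carrying it.
    Assumed definitions: core = all strains, unique = one strain,
    shell = more than one strain but not all.
    """
    all_strains = set().union(*presence.values())
    n_strains = len(all_strains)
    parts = {"core": [], "shell": [], "unique": []}
    for gene, carriers in presence.items():
        if len(carriers) == n_strains:
            parts["core"].append(gene)
        elif len(carriers) == 1:
            parts["unique"].append(gene)
        else:
            parts["shell"].append(gene)
    return parts


# Toy presence/absence table for three hypothetical strains.
presence = {
    "geneA": {"s1", "s2", "s3"},  # core
    "geneB": {"s1", "s2"},        # shell
    "geneC": {"s3"},              # unique
    "geneD": {"s2", "s3"},        # shell
}
parts = partition_pan_genome(presence)
shell_fraction = len(parts["shell"]) / len(presence)
print(parts, shell_fraction)
```

Published pan-genome tools often use softer frequency thresholds for each category, so percentages computed this way are only comparable when the same definitions are used.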

  8. The first report of a Pelecaniformes defensin cluster: Characterization of β-defensin genes in the crested ibis based on BAC libraries

    PubMed Central

    Lan, Hong; Chen, Hui; Chen, Li-Cheng; Wang, Bei-Bing; Sun, Li; Ma, Mei-Ying; Fang, Sheng-Guo; Wan, Qiu-Hong

    2014-01-01

    Defensins play a key role in the innate immunity of various organisms. Detailed genomic studies of the defensin cluster have only been reported in a limited number of birds. Herein, we present the first characterization of defensins in a Pelecaniformes species, the crested ibis (Nipponia nippon), which is one of the most endangered birds in the world. We constructed bacterial artificial chromosome libraries, including a 4D-PCR library and a reverse-4D library, which provide at least 40 equivalents of this rare bird's genome. A cluster including 14 β-defensin loci within 129 kb was assigned to chromosome 3 by FISH, and one gene duplication of AvBD1 was found. The ibis defensin genes are characterized by multiform gene organization ranging from two to four exons through extensive exon fusion. Splicing signal variations and alternative splice variants were also found. Comparative analysis of four bird species identified one common and multiple species-specific duplications, which might be associated with high GC content. Evolutionary analysis revealed birth-and-death mode and purifying selection for avian defensin evolution, resulting in different defensin gene numbers among bird species and functional conservation within orthologous genes, respectively. Additionally, we propose various directions for further research on genetic conservation in the crested ibis. PMID:25372018

  9. Gene cluster conservation provides insight into cercosporin biosynthesis and extends production to the genus Colletotrichum.

    PubMed

    de Jonge, Ronnie; Ebert, Malaika K; Huitt-Roehl, Callie R; Pal, Paramita; Suttle, Jeffrey C; Spanner, Rebecca E; Neubauer, Jonathan D; Jurick, Wayne M; Stott, Karina A; Secor, Gary A; Thomma, Bart P H J; Van de Peer, Yves; Townsend, Craig A; Bolton, Melvin D

    2018-06-12

    Species in the genus Cercospora cause economically devastating diseases in sugar beet, maize, rice, soy bean, and other major food crops. Here, we sequenced the genome of the sugar beet pathogen Cercospora beticola and found it encodes 63 putative secondary metabolite gene clusters, including the cercosporin toxin biosynthesis (CTB) cluster. We show that the CTB gene cluster has experienced multiple duplications and horizontal transfers across a spectrum of plant pathogenic fungi, including the wide-host range Colletotrichum genus as well as the rice pathogen Magnaporthe oryzae. Although cercosporin biosynthesis has been thought to rely on an eight-gene CTB cluster, our phylogenomic analysis revealed gene collinearity adjacent to the established cluster in all CTB cluster-harboring species. We demonstrate that the CTB cluster is larger than previously recognized and includes cercosporin facilitator protein, previously shown to be involved with cercosporin autoresistance, and four additional genes required for cercosporin biosynthesis, including the final pathway enzymes that install the unusual cercosporin methylenedioxy bridge. Lastly, we demonstrate production of cercosporin by Colletotrichum fioriniae, the first known cercosporin producer within this agriculturally important genus. Thus, our results provide insight into the intricate evolution and biology of a toxin critical to agriculture and broaden the production of cercosporin to another fungal genus containing many plant pathogens of important crops worldwide. Copyright © 2018 the Author(s). Published by PNAS.

  10. Identification of Loci and Functional Characterization of Trichothecene Biosynthesis Genes in Filamentous Fungi of the Genus Trichoderma

    PubMed Central

    Cardoza, R. E.; Malmierca, M. G.; Hermosa, M. R.; Alexander, N. J.; McCormick, S. P.; Proctor, R. H.; Tijerino, A. M.; Rumbero, A.; Monte, E.; Gutiérrez, S.

    2011-01-01

    Trichothecenes are mycotoxins produced by Trichoderma, Fusarium, and at least four other genera in the fungal order Hypocreales. Fusarium has a trichothecene biosynthetic gene (TRI) cluster that encodes transport and regulatory proteins as well as most enzymes required for the formation of the mycotoxins. However, little is known about trichothecene biosynthesis in the other genera. Here, we identify and characterize TRI gene orthologues (tri) in Trichoderma arundinaceum and Trichoderma brevicompactum. Our results indicate that both Trichoderma species have a tri cluster that consists of orthologues of seven genes present in the Fusarium TRI cluster. Organization of genes in the cluster is the same in the two Trichoderma species but differs from the organization in Fusarium. Sequence and functional analysis revealed that the gene (tri5) responsible for the first committed step in trichothecene biosynthesis is located outside the cluster in both Trichoderma species rather than inside the cluster as it is in Fusarium. Heterologous expression analysis revealed that two T. arundinaceum cluster genes (tri4 and tri11) differ in function from their Fusarium orthologues. The Tatri4-encoded enzyme catalyzes only three of the four oxygenation reactions catalyzed by the orthologous enzyme in Fusarium. The Tatri11-encoded enzyme catalyzes a completely different reaction (trichothecene C-4 hydroxylation) than the Fusarium orthologue (trichothecene C-15 hydroxylation). The results of this study indicate that although some characteristics of the tri/TRI cluster have been conserved during evolution of Trichoderma and Fusarium, the cluster has undergone marked changes, including gene loss and/or gain, gene rearrangement, and divergence of gene function. PMID:21642405

  11. Comparative genomics reveals phylogenetic distribution patterns of secondary metabolites in Amycolatopsis species.

    PubMed

    Adamek, Martina; Alanjary, Mohammad; Sales-Ortells, Helena; Goodfellow, Michael; Bull, Alan T; Winkler, Anika; Wibberg, Daniel; Kalinowski, Jörn; Ziemert, Nadine

    2018-06-01

    Genome mining tools have enabled us to predict biosynthetic gene clusters that might encode compounds with valuable functions for industrial and medical applications. With the continuously increasing number of genomes sequenced, we are confronted with an overwhelming number of predicted clusters. In order to guide the effective prioritization of biosynthetic gene clusters towards finding the most promising compounds, knowledge about diversity, phylogenetic relationships and distribution patterns of biosynthetic gene clusters is necessary. Here, we provide a comprehensive analysis of the model actinobacterial genus Amycolatopsis and its potential for the production of secondary metabolites. A phylogenetic characterization, together with a pan-genome analysis, showed that within this highly diverse genus, four major lineages could be distinguished which differed in their potential to produce secondary metabolites. Furthermore, we were able to distinguish gene cluster families whose distribution correlated with phylogeny, indicating that vertical gene transfer plays a major role in the evolution of secondary metabolite gene clusters. Still, the vast majority of the diverse biosynthetic gene clusters were derived from clusters unique to the genus, and also unique in comparison to a database of known compounds. Our study on the locations of biosynthetic gene clusters in the genomes of Amycolatopsis strains showed that clusters acquired by horizontal gene transfer tend to be incorporated into non-conserved regions of the genome, thereby allowing us to distinguish core and hypervariable regions in Amycolatopsis genomes. Using a comparative genomics approach, it was possible to determine the potential of the genus Amycolatopsis to produce a huge diversity of secondary metabolites. Furthermore, the analysis demonstrates that horizontal and vertical gene transfer play an important role in the acquisition and maintenance of valuable secondary metabolites. Our results cast light on the interconnections between secondary metabolite gene clusters and provide a way to prioritize biosynthetic pathways in the search and discovery of novel compounds.

  12. Genome-wide DNA methylation analysis reveals estrogen-mediated epigenetic repression of metallothionein-1 gene cluster in breast cancer.

    PubMed

    Jadhav, Rohit R; Ye, Zhenqing; Huang, Rui-Lan; Liu, Joseph; Hsu, Pei-Yin; Huang, Yi-Wen; Rangel, Leticia B; Lai, Hung-Cheng; Roa, Juan Carlos; Kirma, Nameer B; Huang, Tim Hui-Ming; Jin, Victor X

    2015-01-01

    Recent genome-wide analysis has shown that DNA methylation spans long stretches of chromosome regions consisting of clusters of contiguous CpG islands or gene families. Hypermethylation of various gene clusters has been reported in many types of cancer. In this study, we conducted methyl-binding domain capture (MBDCap) sequencing (MBD-seq) analysis on a breast cancer cohort consisting of 77 patients and 10 normal controls, as well as a panel of 38 breast cancer cell lines. Bioinformatics analysis determined seven gene clusters with a significant difference in overall survival (OS) and further revealed a distinct feature that the conservation of a large gene cluster (approximately 70 kb) metallothionein-1 (MT1) among 45 species is much lower than the average of all RefSeq genes. Furthermore, we found that DNA methylation is an important epigenetic regulator contributing to gene repression of the MT1 gene cluster in both ERα positive (ERα+) and ERα negative (ERα-) breast tumors. In silico analysis revealed much lower gene expression of this cluster in The Cancer Genome Atlas (TCGA) cohort for ERα+ tumors. To further investigate the role of estrogen, we conducted 17β-estradiol (E2) and demethylating agent 5-aza-2'-deoxycytidine (DAC) treatment in various breast cancer cell types. Cell proliferation and invasion assays suggested MT1F and MT1M may play an anti-oncogenic role in breast cancer. Our data suggest that DNA methylation in large contiguous gene clusters can be a potential prognostic marker of breast cancer. Further investigation of these clusters revealed that estrogen mediates epigenetic repression of the MT1 cluster in ERα+ breast cancer cell lines. In all, our studies identify thousands of breast tumor hypermethylated regions for the first time, in particular, discovering seven large contiguous hypermethylated gene clusters.

  13. Identification of microRNA Genes in Three Opisthorchiids

    PubMed Central

    Ovchinnikov, Vladimir Y.; Afonnikov, Dmitry A.; Vasiliev, Gennady V.; Kashina, Elena V.; Sripa, Banchob; Mordvinov, Viacheslav A.; Katokhin, Alexey V.

    2015-01-01

    Background Opisthorchis felineus, O. viverrini, and Clonorchis sinensis (family Opisthorchiidae) are parasitic flatworms that pose a serious threat to humans in some countries and cause opisthorchiasis/clonorchiasis. Chronic disease may lead to a risk of carcinogenesis in the biliary ducts. MicroRNAs (miRNAs) are small noncoding RNAs that control gene expression at the post-transcriptional level and are implicated in the regulation of various cellular processes during the parasite-host interplay. However, to date, the miRNAs of opisthorchiid flukes, in particular those essential for maintaining their complex biology and parasitic mode of existence, have not been satisfactorily described. Methodology/Principal Findings Using a SOLiD deep sequencing-bioinformatic approach, we identified 43 novel and 18 conserved miRNAs for O. felineus (miracidia, metacercariae and adult worms), 20 novel and 16 conserved miRNAs for O. viverrini (adult worms), and 33 novel and 18 conserved miRNAs for C. sinensis (adult worms). The analysis of the data revealed differences in the expression level of conserved miRNAs among the three species and among the three developmental stages of O. felineus. Analysis of miRNA genes revealed two gene clusters, one cluster-like region and one intronic miRNA in the genome. The presence and structure of the two gene clusters were validated using a PCR-based approach in the three flukes. Conclusions This study represents a comprehensive description of miRNAs in three members of the family Opisthorchiidae, significantly expands our knowledge of miRNAs in multicellular parasites and provides a basis for understanding the structural and functional evolution of miRNAs in these metazoan parasites. The results of this study also provide novel resources for a deeper understanding of the complex parasite biology and for further research on the pathogenesis and molecular events of disease induced by the liver flukes. The present data may also facilitate the development of novel approaches for the prevention and treatment of opisthorchiasis/clonorchiasis. PMID:25898350

  14. Two divergent Symbiodinium genomes reveal conservation of a gene cluster for sunscreen biosynthesis and recently lost genes.

    PubMed

    Shoguchi, Eiichi; Beedessee, Girish; Tada, Ipputa; Hisata, Kanako; Kawashima, Takeshi; Takeuchi, Takeshi; Arakaki, Nana; Fujie, Manabu; Koyanagi, Ryo; Roy, Michael C; Kawachi, Masanobu; Hidaka, Michio; Satoh, Noriyuki; Shinzato, Chuya

    2018-06-14

    The marine dinoflagellate, Symbiodinium, is a well-known photosynthetic partner for coral and other diverse, non-photosynthetic hosts in subtropical and tropical shallows, where it comprises an essential component of marine ecosystems. Using molecular phylogenetics, the genus Symbiodinium has been classified into nine major clades, A-I, and one of the reported differences among phenotypes is their capacity to synthesize mycosporine-like amino acids (MAAs), which absorb UV radiation. However, the genetic basis for this difference in synthetic capacity is unknown. To understand genetics underlying Symbiodinium diversity, we report two draft genomes, one from clade A, presumed to have been the earliest branching clade, and the other from clade C, in the terminal branch. The nuclear genome of Symbiodinium clade A (SymA) has more gene families than that of clade C, with larger numbers of organelle-related genes, including mitochondrial transcription terminal factor (mTERF) and Rubisco. While clade C (SymC) has fewer gene families, it displays specific expansions of repeat domain-containing genes, such as leucine-rich repeats (LRRs) and retrovirus-related dUTPases. Interestingly, the SymA genome encodes a gene cluster for MAA biosynthesis, potentially transferred from an endosymbiotic red alga (probably of bacterial origin), while SymC has completely lost these genes. Our analysis demonstrates that SymC appears to have evolved by losing gene families, such as the MAA biosynthesis gene cluster. In contrast to the conservation of genes related to photosynthetic ability, the terminal clade has suffered more gene family losses than other clades, suggesting a possible adaptation to symbiosis. Overall, this study implies that Symbiodinium ecology drives acquisition and loss of gene families.

  15. A novel pathway for the biosynthesis of heme in Archaea: genome-based bioinformatic predictions and experimental evidence.

    PubMed

    Storbeck, Sonja; Rolfes, Sarah; Raux-Deery, Evelyne; Warren, Martin J; Jahn, Dieter; Layer, Gunhild

    2010-12-13

    Heme is an essential prosthetic group for many proteins involved in fundamental biological processes in all three domains of life. In Eukaryota and Bacteria heme is formed via a conserved and well-studied biosynthetic pathway. Surprisingly, in Archaea heme biosynthesis proceeds via an alternative route which is poorly understood. In order to formulate a working hypothesis for this novel pathway, we searched 59 completely sequenced archaeal genomes for the presence of gene clusters consisting of established heme biosynthetic genes and colocalized conserved candidate genes. Within the majority of archaeal genomes it was possible to identify such heme biosynthesis gene clusters. From this analysis we have been able to identify several novel heme biosynthesis genes that are restricted to archaea. Intriguingly, several of the encoded proteins display similarity to enzymes involved in heme d1 biosynthesis. To initiate an experimental verification of our proposals, two Methanosarcina barkeri proteins predicted to catalyze the initial steps of archaeal heme biosynthesis were recombinantly produced, purified, and their predicted enzymatic functions verified.

  16. A Novel Pathway for the Biosynthesis of Heme in Archaea: Genome-Based Bioinformatic Predictions and Experimental Evidence

    PubMed Central

    Storbeck, Sonja; Rolfes, Sarah; Raux-Deery, Evelyne; Warren, Martin J.; Jahn, Dieter; Layer, Gunhild

    2010-01-01

    Heme is an essential prosthetic group for many proteins involved in fundamental biological processes in all three domains of life. In Eukaryota and Bacteria heme is formed via a conserved and well-studied biosynthetic pathway. Surprisingly, in Archaea heme biosynthesis proceeds via an alternative route which is poorly understood. In order to formulate a working hypothesis for this novel pathway, we searched 59 completely sequenced archaeal genomes for the presence of gene clusters consisting of established heme biosynthetic genes and colocalized conserved candidate genes. Within the majority of archaeal genomes it was possible to identify such heme biosynthesis gene clusters. From this analysis we have been able to identify several novel heme biosynthesis genes that are restricted to archaea. Intriguingly, several of the encoded proteins display similarity to enzymes involved in heme d1 biosynthesis. To initiate an experimental verification of our proposals, two Methanosarcina barkeri proteins predicted to catalyze the initial steps of archaeal heme biosynthesis were recombinantly produced, purified, and their predicted enzymatic functions verified. PMID:21197080

  17. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease, and little is known about the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions and in duplicated non-coding intergenic sequences including low complexity regions, and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes, pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
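
    The divergence-time estimate in the record above rests on synonymous substitutions. A minimal sketch of the usual molecular-clock arithmetic, T = Ks / (2r), is given below. The abstract does not state the Ks value or the rate that the authors used, so the function name and the numbers in the usage line are purely illustrative assumptions.

```python
def divergence_time_years(ks: float, rate_per_site_per_year: float) -> float:
    """Molecular-clock estimate T = Ks / (2 * r).

    ks   -- synonymous substitutions per synonymous site between two sequences
    rate -- assumed synonymous substitution rate per site per year
    The factor 2 accounts for changes accumulating on both lineages.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return ks / (2.0 * rate_per_site_per_year)


# Hypothetical inputs (NOT taken from the paper), chosen only to show the scale:
# Ks = 0.009 and r = 4.5e-9 give a divergence time on the order of one million years.
example_years = divergence_time_years(0.009, 4.5e-9)
```

    Any real estimate depends strongly on the assumed rate, which varies between lineages.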

  18. Similarity-based gene detection: using COGs to find evolutionarily-conserved ORFs.

    PubMed

    Powell, Bradford C; Hutchison, Clyde A

    2006-01-19

    Experimental verification of gene products has not kept pace with the rapid growth of microbial sequence information. However, existing annotations of gene locations contain sufficient information to screen for probable errors. Furthermore, comparisons among genomes become more informative as more genomes are examined. We studied all open reading frames (ORFs) of at least 30 codons from the genomes of 27 sequenced bacterial strains. We grouped the potential peptide sequences encoded from the ORFs by forming Clusters of Orthologous Groups (COGs). We used this grouping in order to find homologous relationships that would not be distinguishable from noise when using simple BLAST searches. Although COG analysis was initially developed to group annotated genes, we applied it to the task of grouping anonymous DNA sequences that may encode proteins. "Mixed COGs" of ORFs (clusters in which some sequences correspond to annotated genes and some do not) are attractive targets when seeking errors of gene prediction. Examination of mixed COGs reveals some situations in which genes appear to have been missed in current annotations and a smaller number of regions that appear to have been annotated as gene loci erroneously. This technique can also be used to detect potential pseudogenes or sequencing errors. Our method uses an adjustable parameter for degree of conservation among the studied genomes (stringency). We detail results for one level of stringency at which we found 83 potential genes which had not previously been identified, 60 potential pseudogenes, and 7 sequences with existing gene annotations that are probably incorrect. Systematic study of sequence conservation offers a way to improve existing annotations by identifying potentially homologous regions where the annotation of the presence or absence of a gene is inconsistent among genomes.
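
    The two steps described in the record above (collecting ORFs of at least 30 codons, then flagging "mixed" clusters) can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it assumes ORFs run from ATG to the next in-frame stop on the forward strand only, and it takes the clustering itself as given input. The names find_orfs and classify_clusters are hypothetical.

```python
from typing import Dict, List, Set, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 30) -> List[Tuple[int, int]]:
    """Return (start, end) spans of forward-strand ATG..stop ORFs.

    The length threshold counts codons from ATG up to, but not including,
    the stop codon. The reverse strand is ignored to keep the sketch short.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return orfs


def classify_clusters(clusters: Dict[str, Set[str]],
                      annotated: Set[str]) -> Dict[str, str]:
    """Label each cluster 'annotated', 'unannotated', or 'mixed'.

    A mixed cluster contains both annotated genes and anonymous ORFs,
    which makes it a candidate for a missed or erroneous annotation.
    """
    labels = {}
    for cluster_id, members in clusters.items():
        n_annotated = sum(1 for m in members if m in annotated)
        if n_annotated == 0:
            labels[cluster_id] = "unannotated"
        elif n_annotated == len(members):
            labels[cluster_id] = "annotated"
        else:
            labels[cluster_id] = "mixed"
    return labels
```

    In such a scheme, the anonymous members of a mixed cluster would be the ones to inspect as possible missed genes, and annotated members of otherwise poorly supported clusters as possible over-predictions.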

  19. Similarity-based gene detection: using COGs to find evolutionarily-conserved ORFs

    PubMed Central

    Powell, Bradford C; Hutchison, Clyde A

    2006-01-01

    Background Experimental verification of gene products has not kept pace with the rapid growth of microbial sequence information. However, existing annotations of gene locations contain sufficient information to screen for probable errors. Furthermore, comparisons among genomes become more informative as more genomes are examined. We studied all open reading frames (ORFs) of at least 30 codons from the genomes of 27 sequenced bacterial strains. We grouped the potential peptide sequences encoded from the ORFs by forming Clusters of Orthologous Groups (COGs). We used this grouping in order to find homologous relationships that would not be distinguishable from noise when using simple BLAST searches. Although COG analysis was initially developed to group annotated genes, we applied it to the task of grouping anonymous DNA sequences that may encode proteins. Results "Mixed COGs" of ORFs (clusters in which some sequences correspond to annotated genes and some do not) are attractive targets when seeking errors of gene prediction. Examination of mixed COGs reveals some situations in which genes appear to have been missed in current annotations and a smaller number of regions that appear to have been annotated as gene loci erroneously. This technique can also be used to detect potential pseudogenes or sequencing errors. Our method uses an adjustable parameter for degree of conservation among the studied genomes (stringency). We detail results for one level of stringency at which we found 83 potential genes which had not previously been identified, 60 potential pseudogenes, and 7 sequences with existing gene annotations that are probably incorrect. Conclusion Systematic study of sequence conservation offers a way to improve existing annotations by identifying potentially homologous regions where the annotation of the presence or absence of a gene is inconsistent among genomes. PMID:16423288

  20. Genetic Characterization of the Hemagglutinin Genes of Wild-Type Measles Virus Circulating in China, 1993–2009

    PubMed Central

    Zhu, Zhen; Liu, Chunyu; Mao, Naiying; Ji, Yixin; Wang, Huiling; Jiang, Xiaohong; Li, Chongshan; Tang, Wei; Feng, Daxing; Wang, Changyin; Zheng, Lei; Lei, Yue; Ling, Hua; Zhao, Chunfang; Ma, Yan; He, Jilan; Wang, Yan; Li, Ping; Guan, Ronghui; Zhou, Shujie; Zhou, Jianhui; Wang, Shuang; Zhang, Hong; Zheng, Huanying; Liu, Leng; Ma, Hemuti; Guan, Jing; Lu, Peishan; Feng, Yan; Zhang, Yanjun; Zhou, Shunde; Xiong, Ying; Ba, Zhuoma; Chen, Hui; Yang, Xiuhui; Bo, Fang; Ma, Yujie; Liang, Yong; Lei, Yake; Gu, Suyi; Liu, Wei; Chen, Meng; Featherstone, David; Jee, Youngmee; Bellini, William J.; Rota, Paul A.; Xu, Wenbo

    2013-01-01

    Background China experienced several large measles outbreaks in the past two decades, and a series of enhanced control measures were implemented to achieve the goal of measles elimination. Molecular epidemiologic surveillance of wild-type measles viruses (MeV) provides valuable information about the viral transmission patterns. Since 1993, virologic surveillance has confirmed that viruses of a single endemic genotype, H1, have been predominantly circulating in China. A component of molecular surveillance is to monitor the genetic characteristics of the hemagglutinin (H) gene of MeV, the major target for virus neutralizing antibodies. Principal Findings Analysis of the sequences of the complete H gene from 56 representative wild-type MeV strains circulating in China during 1993–2009 showed that the H gene sequences were clustered into 2 groups, cluster 1 and cluster 2. Cluster 1 strains were the most frequently detected and had a widespread distribution in China after 2000. The predicted amino acid sequences of the H protein were relatively conserved at most of the functionally significant amino acid positions. However, most of the genotype H1 cluster 1 viruses had an amino acid substitution (Ser240Asn), which removed a predicted N-linked glycosylation site. In addition, the substitution of Pro397Leu in the hemagglutinin noose epitope (HNE) was identified in 23 of 56 strains. The evolutionary rate of the H gene of the genotype H1 viruses was estimated to be approximately 0.76 × 10^-3 substitutions per site per year, and the ratio of dN to dS (dN/dS) was <1, indicating the absence of selective pressure. Conclusions Although H genes of the genotype H1 strains were conserved and not subjected to selective pressure, several amino acid substitutions were observed in functionally important positions. Therefore the antigenic and genetic properties of H genes of wild-type MeVs should be monitored as part of routine molecular surveillance for measles in China. PMID:24073194

  1. Atlas of nonribosomal peptide and polyketide biosynthetic pathways reveals common occurrence of nonmodular enzymes.

    PubMed

    Wang, Hao; Fewer, David P; Holm, Liisa; Rouhiainen, Leo; Sivonen, Kaarina

    2014-06-24

    Nonribosomal peptides and polyketides are a diverse group of natural products with complex chemical structures and enormous pharmaceutical potential. They are synthesized on modular nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) enzyme complexes by a conserved thiotemplate mechanism. Here, we report the widespread occurrence of NRPS and PKS genetic machinery across the three domains of life with the discovery of 3,339 gene clusters from 991 organisms, by examining a total of 2,699 genomes. These gene clusters display extraordinarily diverse organizations, and a total of 1,147 hybrid NRPS/PKS clusters were found. Surprisingly, 10% of bacterial gene clusters lacked modular organization, and instead catalytic domains were mostly encoded as separate proteins. The finding of common occurrence of nonmodular NRPS differs substantially from the current classification. Sequence analysis indicates that the evolution of NRPS machineries was driven by a combination of common descent and horizontal gene transfer. We identified related siderophore NRPS gene clusters that encoded modular and nonmodular NRPS enzymes organized in a gradient. A higher frequency of the NRPS and PKS gene clusters was detected from bacteria compared with archaea or eukarya. They commonly occurred in the phyla of Proteobacteria, Actinobacteria, Firmicutes, and Cyanobacteria in bacteria and the phylum of Ascomycota in fungi. The majority of these NRPS and PKS gene clusters have unknown end products highlighting the power of genome mining in identifying novel genetic machinery for the biosynthesis of secondary metabolites.

  2. Identification of the Viridicatumtoxin and Griseofulvin Gene Clusters from Penicillium aethiopicum

    PubMed Central

    Chooi, Yit-Heng; Cacho, Ralph; Tang, Yi

    2010-01-01

    SUMMARY Penicillium aethiopicum produces two structurally interesting and biologically active polyketides: the tetracycline-like viridicatumtoxin 1 and the classic antifungal agent griseofulvin 2. Here, we report the concurrent discovery of the two corresponding biosynthetic gene clusters (vrt and gsf) by 454 shotgun sequencing. Gene deletions confirmed two nonreducing PKSs (NRPKS), vrtA and gsfA, are required for the biosynthesis of 1 and 2, respectively. Both PKSs share similar domain architectures and lack a C-terminal thioesterase domain. We identified gsfI as the chlorinase involved in the biosynthesis of 2, as deletion of gsfI resulted in the accumulation of dechlorogriseofulvin 3. Comparative analysis with the P. chrysogenum genome revealed that both clusters are embedded within conserved syntenic regions of P. aethiopicum chromosomes. Discovery of the vrt and gsf clusters provided the basis for genetic and biochemical studies of the pathways. PMID:20534346

  3. Identification and characterization of large DNA deletions affecting oil quality traits in soybean seeds through transcriptome sequencing analysis.

    PubMed

    Goettel, Wolfgang; Ramirez, Martha; Upchurch, Robert G; An, Yong-Qiang Charles

    2016-08-01

    Identification and characterization of a 254-kb genomic deletion on a duplicated chromosome segment that resulted in a low level of palmitic acid in soybean seeds using transcriptome sequencing. A large number of soybean genotypes varying in seed oil composition and content have been identified. Understanding the molecular mechanisms underlying these variations is important for breeders to effectively utilize them as a genetic resource. Through design and application of a bioinformatics approach, we identified nine co-regulated gene clusters by comparing seed transcriptomes of nine soybean genotypes varying in oil composition and content. We demonstrated that four gene clusters in the genotypes M23, Jack and N0304-303-3 coincided with large-scale genome rearrangements. The co-regulated gene clusters in M23 and Jack mapped to a previously described 164-kb deletion and a copy number amplification of the Rhg1 locus, respectively. The coordinately down-regulated gene clusters in N0304-303-3 were caused by a 254-kb deletion containing 19 genes including a fatty acyl-ACP thioesterase B gene (FATB1a). This deletion was associated with reduced palmitic acid content in seeds and was the molecular cause of a previously reported nonfunctional FATB1a allele, fapnc. The M23 and N0304-303-3 deletions were located in duplicated genome segments retained from the Glycine-specific whole genome duplication that occurred 13 million years ago. The homoeologous genes in these duplicated regions shared a strong similarity in both their encoded protein sequences and transcript accumulation levels, suggesting that they may have conserved and important functions in seeds. The functional conservation of homoeologous genes may result in genetic redundancy and gene dosage effects for their associated seed traits, explaining why the large deletion did not cause lethal effects or completely eliminate palmitic acid in N0304-303-3.

  4. Evolution of Daily Gene Co-expression Patterns from Algae to Plants

    PubMed Central

    de los Reyes, Pedro; Romero-Campero, Francisco J.; Ruiz, M. Teresa; Romero, José M.; Valverde, Federico

    2017-01-01

    Daily rhythms play a key role in transcriptome regulation in plants and microalgae, orchestrating responses that, among other processes, anticipate light transitions that are essential for their metabolism and development. The recent accumulation of genome-wide transcriptomic data generated under alternating light:dark periods from plants and microalgae has made possible integrative and comparative analysis that could help shed light on the evolution of daily rhythms in the green lineage. In this work, RNA-seq and microarray data generated over 24 h periods in different light regimes from the eudicot Arabidopsis thaliana and the microalgae Chlamydomonas reinhardtii and Ostreococcus tauri have been integrated and analyzed using gene co-expression networks. This analysis revealed a reduction in the size of the daily rhythmic transcriptome from around 90% in Ostreococcus, being heavily influenced by light transitions, to around 40% in Arabidopsis, where a certain independence from light transitions can be observed. A novel Multiple Bidirectional Best Hit (MBBH) algorithm was applied to associate single genes with a family of potential orthologues from evolutionarily distant species. Gene duplication, amplification and divergence of rhythmic expression profiles seem to have played a central role in the evolution of gene families in the green lineage such as Pseudo Response Regulators (PRRs), CONSTANS-Likes (COLs), and DNA-binding with One Finger (DOFs). Gene clustering and functional enrichment have been used to identify groups of genes with similar rhythmic gene expression patterns. The comparison of gene clusters between species based on potential orthologous relationships has unveiled a low to moderate level of conservation of daily rhythmic expression patterns. However, a strikingly high conservation was found for the gene clusters exhibiting their highest and/or lowest expression value during the light transitions. PMID:28751903
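
    The abstract above does not describe how the MBBH algorithm works, so it cannot be reproduced here. As background only, the sketch below shows the classic one-to-one reciprocal (bidirectional) best hit criterion that such methods generalize. The input format (dictionaries of pairwise similarity scores) and the function names are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Scores = Dict[Tuple[str, str], float]  # (query, target) -> similarity score


def best_hits(scores: Scores) -> Dict[str, str]:
    """Map each query to its single highest-scoring target."""
    best: Dict[str, Tuple[str, float]] = {}
    for (query, target), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return {query: target for query, (target, _) in best.items()}


def reciprocal_best_hits(a_to_b: Scores, b_to_a: Scores) -> List[Tuple[str, str]]:
    """Return pairs (a, b) where a's best hit is b and b's best hit is a."""
    best_ab = best_hits(a_to_b)
    best_ba = best_hits(b_to_a)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Toy scores for two hypothetical species A and B.
a_to_b = {("a1", "b1"): 90.0, ("a1", "b2"): 50.0, ("a2", "b2"): 70.0}
b_to_a = {("b1", "a1"): 88.0, ("b2", "a1"): 60.0, ("b2", "a2"): 40.0}
pairs = reciprocal_best_hits(a_to_b, b_to_a)
```

    The strict criterion keeps only one partner per gene, which is why a method that associates a gene with a whole family of potential orthologues needs a relaxed, multiple-hit variant.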

  5. Organization of nif gene cluster in Frankia sp. EuIK1 strain, a symbiont of Elaeagnus umbellata.

    PubMed

    Oh, Chang Jae; Kim, Ho Bang; Kim, Jitae; Kim, Won Jin; Lee, Hyoungseok; An, Chung Sun

    2012-01-01

    The nucleotide sequence of a 20.5-kb genomic region harboring nif genes was determined and analyzed. The fragment was obtained from Frankia sp. EuIK1 strain, an indigenous symbiont of Elaeagnus umbellata. A total of 20 ORFs including 12 nif genes were identified and subjected to comparative analysis with the genome sequences of 3 Frankia strains representing diverse host plant specificities. The nucleotide and deduced amino acid sequences showed the highest levels of identity with orthologous genes from an Elaeagnus-infecting strain. The gene organization patterns around the nif gene clusters were well conserved among all 4 Frankia strains. However, characteristic features appeared in the location of the nifV gene for each Frankia strain, depending on the type of host plant. Sequence analysis was performed to determine the transcription units and suggested that there could be an independent operon starting from the nifW gene in the EuIK1 strain. Considering the organization patterns and their total extensions on the genome, we propose that the nif gene clusters remained stable despite genetic variations occurring in the Frankia genomes.

  6. A comprehensive analysis of Helicobacter pylori plasticity zones reveals that they are integrating conjugative elements with intermediate integration specificity.

    PubMed

    Fischer, Wolfgang; Breithaupt, Ute; Kern, Beate; Smith, Stella I; Spicher, Carolin; Haas, Rainer

    2014-04-27

    The human gastric pathogen Helicobacter pylori is a paradigm for chronic bacterial infections. Its persistence in the stomach mucosa is facilitated by several mechanisms of immune evasion and immune modulation, but also by an unusual genetic variability which might account for the capability to adapt to changing environmental conditions during long-term colonization. This variability is reflected by the fact that almost each infected individual is colonized by a genetically unique strain. Strain-specific genes are dispersed throughout the genome, but clusters of genes organized as genomic islands may also collectively be present or absent. We have comparatively analysed such clusters, which are commonly termed plasticity zones, in a high number of H. pylori strains of varying geographical origin. We show that these regions contain fixed gene sets, rather than being true regions of genome plasticity, but two different types and several subtypes with partly diverging gene content can be distinguished. Their genetic diversity is incongruent with variations in the rest of the genome, suggesting that they are subject to horizontal gene transfer within H. pylori populations. We identified 40 distinct integration sites in 45 genome sequences, with a conserved heptanucleotide motif that seems to be the minimal requirement for integration. The significant number of possible integration sites, together with the requirement for a short conserved integration motif and the high level of gene conservation, indicates that these elements are best described as integrating conjugative elements (ICEs) with an intermediate integration site specificity.

  7. CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.

    PubMed

    Nikulova, Anna A; Favorov, Alexander V; Sutormin, Roman A; Makeev, Vsevolod J; Mironov, Andrey A

    2012-07-01

    Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model may be consequently used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.
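
As a rough illustration of the building block mentioned above, positional weight matrices, the sketch below scores every window of a sequence against a log-odds matrix. It is a generic PWM scan under stated assumptions (uniform background, a pseudocount of 1, a made-up TATA-like count matrix), not the CORECLUST model, which additionally learns rules about the relative placement of sites.

```python
import math


def log_odds_pwm(counts, background=0.25, pseudocount=1.0):
    """Turn per-position base counts into log2-odds weights.

    `counts` is a list of dicts, one per motif position (hypothetical data).
    """
    pwm = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        pwm.append({
            base: math.log2(((column.get(base, 0) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def scan(sequence, pwm):
    """Return (start, score) for every window of the motif width."""
    width = len(pwm)
    return [
        (start, sum(pwm[offset][sequence[start + offset]] for offset in range(width)))
        for start in range(len(sequence) - width + 1)
    ]


# Made-up counts for a 4-bp TATA-like motif and a short test sequence.
counts = [{"T": 10}, {"A": 10}, {"T": 10}, {"A": 10}]
hits = scan("GGTATAGG", log_odds_pwm(counts))
best_start, best_score = max(hits, key=lambda hit: hit[1])
```

Windows scoring above a chosen threshold would be treated as candidate binding sites; the threshold and the background model are choices the abstract does not specify.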

  8. Analysis of lamprey clustered Fox genes: insight into Fox gene evolution and expression in vertebrates.

    PubMed

    Wotton, Karl R; Shimeld, Sebastian M

    2011-12-01

    In the human genome, members of the FoxC, FoxF, FoxL1, and FoxQ1 gene families are found in two paralogous clusters. One cluster contains the genes FOXQ1, FOXF2, FOXC1 and the second consists of FOXF1, FOXC2, and FOXL1. In jawed vertebrates these genes are known to be expressed in different pharyngeal tissues and all, except FoxQ1, are involved in patterning the early embryonic mesoderm. We have previously traced the evolution of this cluster in the bony vertebrates, and the gene content is identical in the dogfish, a member of the most basally branching lineage of the jawed vertebrates. Here we extend these analyses to jawless vertebrates. Using genomic searches and molecular approaches we have identified homologues of these genes from lampreys. We identify two FoxC genes, two FoxF genes, two FoxQ1 genes and a single FoxL1 gene. We examine the embryonic expression of one predominantly mesodermally expressed gene family, FoxC, and the endodermally expressed member of the cluster, FoxQ1. We identified FoxQ1 transcripts in the pharyngeal endoderm, while the two FoxC genes are differentially expressed in the pharyngeal mesenchyme and ectoderm. Furthermore we identify conserved expression of lamprey FoxC genes in the paraxial and intermediate mesoderms. We interpret our results through a chordate-wide comparison of expression patterns and discuss gene content in the context of theories on the evolution of the vertebrate genome. 2011 Elsevier B.V. All rights reserved.

  9. Two Virus-Induced MicroRNAs Known Only from Teleost Fishes Are Orthologues of MicroRNAs Involved in Cell Cycle Control in Humans

    PubMed Central

    Schyth, Brian Dall; Bela-ong, Dennis Berbulla; Jalali, Seyed Amir Hossein; Kristensen, Lasse Bøgelund Juel; Einer-Jensen, Katja; Pedersen, Finn Skou; Lorenzen, Niels

    2015-01-01

    MicroRNAs (miRNAs) are ~22 base pair-long non-coding RNAs which regulate gene expression in the cytoplasm of eukaryotic cells by binding to specific target regions in mRNAs to mediate transcriptional blocking or mRNA cleavage. Through their fundamental roles in cellular pathways, gene regulation mediated by miRNAs has been shown to be involved in almost all biological phenomena, including development, metabolism, cell cycle, tumor formation, and host-pathogen interactions. To address the latter in a primitive vertebrate host, we here used an array platform to analyze the miRNA response in rainbow trout (Oncorhynchus mykiss) following inoculation with the virulent fish rhabdovirus Viral hemorrhagic septicaemia virus. Two clustered miRNAs, miR-462 and miR-731 (herein referred to as miR-462 cluster), described only in teleost fishes, were found to be strongly upregulated, indicating their involvement in fish-virus interactions. We searched for homologues of the two teleost miRNAs in other vertebrate species and investigated whether findings related to ours have been reported for these homologues. Gene synteny analysis along with gene sequence conservation suggested that the teleost fish miR-462 and miR-731 had evolved from the ancestral miR-191 and miR-425 (herein called miR-191 cluster), respectively. Whereas the miR-462 cluster locus is found between two protein-coding genes (intergenic) in teleost fish genomes, the miR-191 cluster locus is found within an intron of a protein-coding gene (intragenic) in the human genome. Interferon (IFN)-inducible and immune-related promoter elements found upstream of the teleost miR-462 cluster locus suggested roles in immune responses to viral pathogens in fish, while in humans, the miR-191 cluster is functionally associated with cell cycle regulation. Stimulation of fish cell cultures with the IFN inducer poly I:C accordingly upregulated the expression of miR-462 and miR-731, while no stimulatory effect on miR-191 and miR-425 expression was observed in human cell lines. Despite high sequence conservation, evolution has thus resulted in different regulation and presumably also different functional roles of these orthologous miRNA clusters in different vertebrate lineages. PMID:26207374

  10. The Methanosarcina barkeri genome: comparative analysis with Methanosarcina acetivorans and Methanosarcina mazei reveals extensive rearrangement within methanosarcinal genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Maeder, Dennis L.; Anderson, Iain; Brettin, Thomas S.

    2006-05-19

    We report here a comparative analysis of the genome sequence of Methanosarcina barkeri with those of Methanosarcina acetivorans and Methanosarcina mazei. All three genomes share a conserved double origin of replication and many gene clusters. M. barkeri is distinguished by having an organization that is well conserved with respect to the other Methanosarcinae in the region proximal to the origin of replication with interspecies gene similarities as high as 95%. However it is disordered and marked by increased transposase frequency and decreased gene synteny and gene density in the proximal semi-genome. Of the 3680 open reading frames in M. barkeri, 678 had paralogs with better than 80% similarity to both M. acetivorans and M. mazei while 128 nonhypothetical orfs were unique (non-paralogous) amongst these species including a complete formate dehydrogenase operon, two genes required for N-acetylmuramic acid synthesis, a 14 gene gas vesicle cluster and a bacterial P450-specific ferredoxin reductase cluster not previously observed or characterized in this genus. A cryptic 36 kbp plasmid sequence was detected in M. barkeri that contains an orc1 gene flanked by a presumptive origin of replication consisting of 38 tandem repeats of a 143 nt motif. Three-way comparison of these genomes reveals differing mechanisms for the accrual of changes. Elongation of the large M. acetivorans is the result of multiple gene-scale insertions and duplications uniformly distributed in that genome, while M. barkeri is characterized by localized inversions associated with the loss of gene content. In contrast, the relatively short M. mazei most closely approximates the ancestral organizational state.

  11. Gene transfer agent (GTA) genes reveal diverse and dynamic Roseobacter and Rhodobacter populations in the Chesapeake Bay.

    PubMed

    Zhao, Yanlin; Wang, Kui; Budinoff, Charles; Buchan, Alison; Lang, Andrew; Jiao, Nianzhi; Chen, Feng

    2009-03-01

    Within the bacterial class Alphaproteobacteria, the order Rhodobacterales contains the Roseobacter and Rhodobacter clades. Roseobacters are abundant and play important biogeochemical roles in marine environments. Roseobacter and Rhodobacter genomes contain a conserved gene transfer agent (GTA) gene cluster, and GTA-mediated gene transfer has been observed in these groups of bacteria. In this study, we investigated the genetic diversity of these two groups in Chesapeake Bay surface waters using a specific PCR primer set targeting the conserved Rhodobacterales GTA major capsid protein gene (g5). The g5 gene was successfully amplified from 26 Rhodobacterales isolates and the bay microbial communities using this primer set. Four g5 clone libraries were constructed from microbial assemblages representing different regions and seasons of the bay and yielded diverse sequences. In total, 12 distinct g5 clusters could be identified among 158 Chesapeake Bay clones, 11 fall within the Roseobacter clade, and one falls in the Rhodobacter clade. The vast majority of the clusters (10 out of 12) lack cultivated representatives. The composition of g5 sequences varied dramatically along the bay during the wintertime, and a distinct Roseobacter population composition between winter and summer was observed. The congruence between g5 and 16S rRNA gene phylogenies indicates that g5 may serve as a useful genetic marker to investigate diversity and abundance of Roseobacter and Rhodobacter in natural environments. The presence of the g5 gene in the natural populations of Roseobacter and Rhodobacter implies that genetic exchange through GTA transduction could be an important mechanism for maintaining the metabolic flexibility of these groups of bacteria.

  12. The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

    PubMed

    Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

    2011-02-01

    The gonadal soma-derived factor (GSDF) belongs to the transforming growth factor-β superfamily and is conserved in teleostean fish species. Gsdf is specifically expressed in the gonads, and gene expression is restricted to the granulosa and Sertoli cells in trout and medaka. The gsdf gene expression is correlated to early testis differentiation in medaka and was shown to stimulate primordial germ cell and spermatogonia proliferation in trout. In the present study, we show that the gsdf gene localizes to a syntenic chromosomal fragment conserved among vertebrates although no gsdf-related gene is detected on the corresponding genomic region in tetrapods. We demonstrate using quantitative RT-PCR that most of the genes localized in the synteny are specifically expressed in medaka gonads. Gsdf is the only gene of the synteny with a much higher expression in the testis compared to the ovary. In contrast, gene expression pattern analysis of the gsdf surrounding genes (nup54, aff1, klhl8, sdad1, and ptpn13) indicates that these genes are preferentially expressed in the female gonads. The tissue distribution of these genes is highly similar in medaka and zebrafish, two teleostean species that have diverged more than 110 million years ago. The cellular localization of these genes was determined in medaka gonads using the whole-mount in situ hybridization technique. We confirm that gsdf gene expression is restricted to Sertoli and granulosa cells in contact with the premeiotic and meiotic cells. The nup54 gene is expressed in spermatocytes and previtellogenic oocytes. Transcripts corresponding to the ovary-specific genes (aff1, klhl8, and sdad1) are detected only in previtellogenic oocytes. No expression was detected in the gonocytes in 10 dpf embryos. In conclusion, we show that the gsdf gene localizes to a syntenic chromosomal fragment harboring evolutionarily conserved genes in vertebrates. These genes are preferentially expressed in previtellogenic oocytes, and thus, they display a different cellular localization compared to that of the gsdf gene, indicating that the latter gene is not co-regulated. Interestingly, our study identifies new clustered genes that are specifically expressed in previtellogenic oocytes (nup54, aff1, klhl8, sdad1). Copyright © 2010 Elsevier B.V. All rights reserved.

  13. RubisCO Gene Clusters Found in a Metagenome Microarray from Acid Mine Drainage

    PubMed Central

    Guo, Xue; Yin, Huaqun; Cong, Jing; Dai, Zhimin; Liang, Yili

    2013-01-01

    The enzyme responsible for carbon dioxide fixation in the Calvin cycle, ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO), is always detected as a phylogenetic marker to analyze the distribution and activity of autotrophic bacteria. However, such an approach provides no indication as to the significance of genomic content and organization. Horizontal transfers of RubisCO genes occurring in eubacteria and plastids may seriously affect the credibility of this approach. Here, we presented a new method to analyze the diversity and genomic content of RubisCO genes in acid mine drainage (AMD). A metagenome microarray containing 7,776 large-insertion fosmids was constructed to quickly screen genome fragments containing RubisCO form I large-subunit genes (cbbL). Forty-six cbbL-containing fosmids were detected, and six fosmids were fully sequenced. To evaluate the reliability of the metagenome microarray and understand the microbial community in AMD, the diversities of cbbL and the 16S rRNA gene were analyzed. Fosmid sequences revealed that the form I RubisCO gene cluster could be subdivided into form IA and IB RubisCO gene clusters in AMD, because of significant divergences in molecular phylogenetics and conservative genomic organization. Interestingly, the form I RubisCO gene cluster coexisted with the form II RubisCO gene cluster in one fosmid genomic fragment. Phylogenetic analyses revealed that horizontal transfers of RubisCO genes may occur widely in AMD, which makes the evolutionary history of RubisCO difficult to reconcile with organismal phylogeny. PMID:23335778

  14. Extensive concerted evolution of rice paralogs and the road to regaining independence.

    PubMed

    Wang, Xiyin; Tang, Haibao; Bowers, John E; Feltus, Frank A; Paterson, Andrew H

    2007-11-01

    Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.

  15. Theria-Specific Homeodomain and cis-Regulatory Element Evolution of the Dlx3–4 Bigene Cluster in 12 Different Mammalian Species

    PubMed Central

    SUMIYAMA, KENTA; MIYAKE, TSUTOMU; GRIMWOOD, JANE; STUART, ANDREW; DICKSON, MARK; SCHMUTZ, JEREMY; RUDDLE, FRANK H.; MYERS, RICHARD M.; AMEMIYA, CHRIS T.

    2013-01-01

    The mammalian Dlx3 and Dlx4 genes are configured as a bigene cluster, and their respective expression patterns are controlled temporally and spatially by cis-elements that largely reside within the intergenic region of the cluster. Previous work revealed that there are conspicuously conserved elements within the intergenic region of the Dlx3–4 bigene clusters of mouse and human. In this paper we have extended these analyses to include 12 additional mammalian taxa (including a marsupial and a monotreme) in order to better define the nature and molecular evolutionary trends of the coding and non-coding functional elements among morphologically divergent mammals. Dlx3–4 regions were fully sequenced from 12 divergent taxa of interest. We identified three theria-specific amino acid replacements in homeodomain of Dlx4 gene that functions in placenta. Sequence analyses of constrained nucleotide sites in the intergenic non-coding region showed that many of the intergenic conserved elements are highly conserved and have evolved slowly within the mammals. In contrast, a branchial arch/craniofacial enhancer I37-2 exhibited accelerated evolution at the branch between the monotreme and therian common ancestor despite being highly conserved among therian species. Functional analysis of I37-2 in transgenic mice has shown that the equivalent region of the platypus fails to drive transcriptional activity in branchial arches. These observations, taken together with our molecular evolutionary data, suggest that theria-specific episodic changes in the I37-2 element may have contributed to craniofacial innovation at the base of the mammalian lineage. PMID:22951979

  16. Comparative genomics of chemosensory protein genes (CSPs) in twenty-two mosquito species (Diptera: Culicidae): Identification, characterization, and evolution

    PubMed Central

    Fu, Wen-Bo; Li, Bo; He, Zheng-Bo

    2018-01-01

    Chemosensory proteins (CSP) are soluble carrier proteins that may function in odorant reception in insects. CSPs have not been thoroughly studied at whole-genome level, despite the availability of insect genomes. Here, we identified/reidentified 283 CSP genes in the genomes of 22 mosquitoes. All 283 CSP genes possess a highly conserved OS-D domain. We comprehensively analyzed these CSP genes and determined their conserved domains, structure, genomic distribution, phylogeny, and evolutionary patterns. We found an average of seven CSP genes in each of 19 Anopheles genomes, 27 CSP genes in Cx. quinquefasciatus, 43 in Ae. aegypti, and 83 in Ae. albopictus. The Anopheles CSP genes had a simple genomic organization with a relatively consistent gene distribution, while most of the Culicinae CSP genes were distributed in clusters on the scaffolds. Our phylogenetic analysis clustered the CSPs into two major groups: CSP1-8 and CSE1-3. The CSP1-8 groups were all monophyletic with good bootstrap support. The CSE1-3 groups were an expansion of the CSP family of genes specific to the three Culicinae species. The Ka/Ks ratios indicated that the CSP genes had been subject to purifying selection with relatively slow evolution. Our results provide a comprehensive framework for the study of the CSP gene family in these 22 mosquito species, laying a foundation for future work on CSP function in the detection of chemical cues in the surrounding environment. PMID:29304168
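
The reasoning behind the Ka/Ks statement above follows the usual convention: a ratio of nonsynonymous to synonymous substitution rates well below 1 is read as purifying selection, a ratio near 1 as neutral evolution, and a ratio above 1 as positive selection. A minimal sketch of that rule is shown here; the function name, the tolerance band around 1, and the example values are illustrative assumptions, not figures from the study.

```python
def selection_regime(ka, ks, tolerance=0.05):
    """Classify a gene pair by its Ka/Ks ratio.

    `tolerance` is an arbitrary band around 1 treated as neutral (assumption).
    Returns (ratio, label).
    """
    if ks <= 0:
        raise ValueError("Ks must be positive to form a ratio")
    ratio = ka / ks
    if ratio < 1 - tolerance:
        label = "purifying"
    elif ratio > 1 + tolerance:
        label = "positive"
    else:
        label = "neutral"
    return ratio, label


# Made-up example values for three hypothetical gene pairs.
examples = {
    "pair_low": selection_regime(0.02, 0.20),
    "pair_near_one": selection_regime(0.10, 0.10),
    "pair_high": selection_regime(0.30, 0.10),
}
```

Estimating Ka and Ks themselves requires a codon-level alignment and a substitution model, which this sketch takes as given.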

  17. Comparative genomics of chemosensory protein genes (CSPs) in twenty-two mosquito species (Diptera: Culicidae): Identification, characterization, and evolution.

    PubMed

    Mei, Ting; Fu, Wen-Bo; Li, Bo; He, Zheng-Bo; Chen, Bin

    2018-01-01

    Chemosensory proteins (CSP) are soluble carrier proteins that may function in odorant reception in insects. CSPs have not been thoroughly studied at whole-genome level, despite the availability of insect genomes. Here, we identified/reidentified 283 CSP genes in the genomes of 22 mosquitoes. All 283 CSP genes possess a highly conserved OS-D domain. We comprehensively analyzed these CSP genes and determined their conserved domains, structure, genomic distribution, phylogeny, and evolutionary patterns. We found an average of seven CSP genes in each of 19 Anopheles genomes, 27 CSP genes in Cx. quinquefasciatus, 43 in Ae. aegypti, and 83 in Ae. albopictus. The Anopheles CSP genes had a simple genomic organization with a relatively consistent gene distribution, while most of the Culicinae CSP genes were distributed in clusters on the scaffolds. Our phylogenetic analysis clustered the CSPs into two major groups: CSP1-8 and CSE1-3. The CSP1-8 groups were all monophyletic with good bootstrap support. The CSE1-3 groups were an expansion of the CSP family of genes specific to the three Culicinae species. The Ka/Ks ratios indicated that the CSP genes had been subject to purifying selection with relatively slow evolution. Our results provide a comprehensive framework for the study of the CSP gene family in these 22 mosquito species, laying a foundation for future work on CSP function in the detection of chemical cues in the surrounding environment.

  18. Comparative Sequence and X-Inactivation Analyses of a Domain of Escape in Human Xp11.2 and the Conserved Segment in Mouse

    PubMed Central

    Tsuchiya, Karen D.; Greally, John M.; Yi, Yajun; Noel, Kevin P.; Truong, Jean-Pierre; Disteche, Christine M.

    2004-01-01

    We have performed X-inactivation and sequence analyses on 350 kb of sequence from human Xp11.2, a region shown previously to contain a cluster of genes that escape X inactivation, and we compared this region with the region of conserved synteny in mouse. We identified several new transcripts from this region in human and in mouse, which defined the full extent of the domain escaping X inactivation in both species. In human, escape from X inactivation involves an uninterrupted 235-kb domain of multiple genes. Despite highly conserved gene content and order between the two species, Smcx is the only mouse gene from the conserved segment that escapes inactivation. As repetitive sequences are believed to facilitate spreading of X inactivation along the chromosome, we compared the repetitive sequence composition of this region between the two species. We found that long terminal repeats (LTRs) were decreased in the human domain of escape, but not in the majority of the conserved mouse region adjacent to Smcx in which genes were subject to X inactivation, suggesting that these repeats might be excluded from escape domains to prevent spreading of silencing. Our findings indicate that genomic context, as well as gene-specific regulatory elements, interact to determine expression of a gene from the inactive X-chromosome. PMID:15197169

  19. Identification of an intact ParaHox cluster with temporal colinearity but altered spatial colinearity in the hemichordate Ptychodera flava

    PubMed Central

    2013-01-01

    Background: ParaHox and Hox genes are thought to have evolved from a common ancestral ProtoHox cluster or from tandem duplication prior to the divergence of cnidarians and bilaterians. Similar to Hox clusters, chordate ParaHox genes including Gsx, Xlox, and Cdx, are clustered and their expression exhibits temporal and spatial colinearity. In non-chordate animals, however, studies on the genomic organization of ParaHox genes are limited to only a few animal taxa. Hemichordates, such as the Enteropneust acorn worms, have been used to gain insights into the origins of chordate characters. In this study, we investigated the genomic organization and expression of ParaHox genes in the indirect developing hemichordate acorn worm Ptychodera flava. Results: We found that P. flava contains an intact ParaHox cluster with a similar arrangement to that of chordates. The temporal expression order of the P. flava ParaHox genes is the same as that of the chordate ParaHox genes. During embryogenesis, the spatial expression pattern of PfCdx in the posterior endoderm represents a conserved feature similar to the expression of its orthologs in other animals. On the other hand, PfXlox and PfGsx show a novel expression pattern in the blastopore. Nevertheless, during metamorphosis, PfXlox and PfCdx are expressed in the endoderm in a spatially staggered pattern similar to the situation in chordates. Conclusions: Our study shows that P. flava ParaHox genes, despite forming an intact cluster, exhibit temporal colinearity but lose spatial colinearity during embryogenesis. During metamorphosis, partial spatial colinearity is retained in the transforming larva. These results strongly suggest that intact ParaHox gene clustering was retained in the deuterostome ancestor and is correlated with temporal colinearity. PMID:23802544

  20. Identifying resistance gene analogs associated with resistances to different pathogens in common bean.

    PubMed

    López, Camilo E; Acosta, Iván F; Jara, Carlos; Pedraza, Fabio; Gaitán-Solís, Eliana; Gallego, Gerardo; Beebe, Steve; Tohme, Joe

    2003-01-01

    A polymerase chain reaction approach using degenerate primers that targeted the conserved domains of cloned plant disease resistance genes (R genes) was used to isolate a set of 15 resistance gene analogs (RGAs) from common bean (Phaseolus vulgaris). Eight different classes of RGAs were obtained from nucleotide binding site (NBS)-based primers and seven from not previously described Toll/Interleukin-1 receptor-like (TIR)-based primers. Putative amino acid sequences of RGAs were significantly similar to R genes and contained additional conserved motifs. The NBS-type RGAs were classified in two subgroups according to the expected final residue in the kinase-2 motif. Eleven RGAs were mapped at 19 loci on eight linkage groups of the common bean genetic map constructed at Centro Internacional de Agricultura Tropical. Genetic linkage was shown for eight RGAs with partial resistance to anthracnose, angular leaf spot (ALS) and Bean golden yellow mosaic virus (BGYMV). RGA1 and RGA2 were associated with resistance loci to anthracnose and BGYMV and were part of two clusters of R genes previously described. A new major cluster was detected by RGA7 and explained up to 63.9% of resistance to ALS and has a putative contribution to anthracnose resistance. These results show the usefulness of RGAs as candidate genes to detect and eventually isolate numerous R genes in common bean.

  1. Conservation and Sex-Specific Splicing of the transformer Gene in the Calliphorids Cochliomyia hominivorax, Cochliomyia macellaria and Lucilia sericata

    PubMed Central

    Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.

    2013-01-01

    Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170

  2. Functional genome analysis of Bifidobacterium breve UCC2003 reveals type IVb tight adherence (Tad) pili as an essential and conserved host-colonization factor

    PubMed Central

    O'Connell Motherway, Mary; Zomer, Aldert; Leahy, Sinead C.; Reunanen, Justus; Bottacini, Francesca; Claesson, Marcus J.; O'Brien, Frances; Flynn, Kiera; Casey, Patrick G.; Moreno Munoz, Jose Antonio; Kearney, Breda; Houston, Aileen M.; O'Mahony, Caitlin; Higgins, Des G.; Shanahan, Fergus; Palva, Airi; de Vos, Willem M.; Fitzgerald, Gerald F.; Ventura, Marco; O'Toole, Paul W.; van Sinderen, Douwe

    2011-01-01

    Development of the human gut microbiota commences at birth, with bifidobacteria being among the first colonizers of the sterile newborn gastrointestinal tract. To date, the genetic basis of Bifidobacterium colonization and persistence remains poorly understood. Transcriptome analysis of the Bifidobacterium breve UCC2003 2.42-Mb genome in a murine colonization model revealed differential expression of a type IVb tight adherence (Tad) pilus-encoding gene cluster designated “tad2003.” Mutational analysis demonstrated that the tad2003 gene cluster is essential for efficient in vivo murine gut colonization, and immunogold transmission electron microscopy confirmed the presence of Tad pili at the poles of B. breve UCC2003 cells. Conservation of the Tad pilus-encoding locus among other B. breve strains and among sequenced Bifidobacterium genomes supports the notion of a ubiquitous pili-mediated host colonization and persistence mechanism for bifidobacteria. PMID:21690406

  3. Functional genome analysis of Bifidobacterium breve UCC2003 reveals type IVb tight adherence (Tad) pili as an essential and conserved host-colonization factor.

    PubMed

    O'Connell Motherway, Mary; Zomer, Aldert; Leahy, Sinead C; Reunanen, Justus; Bottacini, Francesca; Claesson, Marcus J; O'Brien, Frances; Flynn, Kiera; Casey, Patrick G; Munoz, Jose Antonio Moreno; Kearney, Breda; Houston, Aileen M; O'Mahony, Caitlin; Higgins, Des G; Shanahan, Fergus; Palva, Airi; de Vos, Willem M; Fitzgerald, Gerald F; Ventura, Marco; O'Toole, Paul W; van Sinderen, Douwe

    2011-07-05

    Development of the human gut microbiota commences at birth, with bifidobacteria being among the first colonizers of the sterile newborn gastrointestinal tract. To date, the genetic basis of Bifidobacterium colonization and persistence remains poorly understood. Transcriptome analysis of the Bifidobacterium breve UCC2003 2.42-Mb genome in a murine colonization model revealed differential expression of a type IVb tight adherence (Tad) pilus-encoding gene cluster designated "tad(2003)." Mutational analysis demonstrated that the tad(2003) gene cluster is essential for efficient in vivo murine gut colonization, and immunogold transmission electron microscopy confirmed the presence of Tad pili at the poles of B. breve UCC2003 cells. Conservation of the Tad pilus-encoding locus among other B. breve strains and among sequenced Bifidobacterium genomes supports the notion of a ubiquitous pili-mediated host colonization and persistence mechanism for bifidobacteria.

  4. [Amphioxus ortholog of ECSIT, an evolutionarily conserved adaptor in the Toll and BMP signaling pathways].

    PubMed

    Lin, Y H; Zhang, W; Li, J W; Zhang, H W; Chen, D Y

    2017-01-01

    In vertebrates, evolutionarily conserved signaling intermediate in the Toll pathway (ECSIT) interacts with the TNF-receptor associated factor 6 (TRAF6) to regulate the processing of MEKK1, activate NF-κB, and also control BMP target genes. However, the role of ECSIT in invertebrates remains largely unexplored. We performed comparative investigations of the expression, gene structure, and phylogeny of ECSIT, Toll-like receptor (TLR), and Smad4 in the cephalochordate Branchiostoma belcheri. Phylogenetic analysis indicated that, in amphioxus, ECSIT, TLR, and Smad4 form independent clusters at the base of Chordate clusters. Interestingly, overall gene structures were comparable to those in vertebrate orthologs. Transcripts of AmphiECSIT were detectable at the mid-neural stage, and continued to be expressed in the epithelium of the pharyngeal region at later stages. In adult animals, strong expression was observed in the nerve cord, endostyle, epithelial cells of the gut and wheel organ, genital membrane of the testis, and coelom and lymphoid cavities, which is highly similar to AmphiTLR and AmphiSmad4 expression patterns during development and in adult organisms. Our data suggest that ECSIT is evolutionarily conserved. Its amphioxus ortholog functions during embryonic development and as part of the innate immune system and may be involved in TLR/BMP signaling.

  5. PlantTribes: a gene and gene family resource for comparative genomics in plants

    PubMed Central

    Wall, P. Kerr; Leebens-Mack, Jim; Müller, Kai F.; Field, Dawn; Altman, Naomi S.; dePamphilis, Claude W.

    2008-01-01

    The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575–1584)] to classify all of these species’ protein-coding genes into putative gene families, called tribes, using three clustering stringencies (low, medium and high). For all tribes, we have generated protein and DNA alignments and maximum-likelihood phylogenetic trees. A parallel database of microarray experimental results is linked to the genes, which lets researchers identify groups of related genes and their expression patterns. Unified nomenclatures were developed, and tribes can be related to traditional gene families and conserved domain identifiers. SuperTribes, constructed through a second iteration of MCL clustering, connect distant, but potentially related gene clusters. The global classification of nearly 200 000 plant proteins was used as a scaffold for sorting ∼4 million additional cDNA sequences from over 200 plant species. All data and analyses are accessible through a flexible interface allowing users to explore the classification, to place query sequences within the classification, and to download results for further study. PMID:18073194
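
    The abstract above names MCL as the graph-based clustering step. As a rough illustration of the idea behind Markov clustering (alternating "expansion" and "inflation" of a column-stochastic matrix), here is a toy sketch in pure Python. It is not the mcl program and not the PlantTribes pipeline: the function names, the inflation value of 2.0, the thresholds and the tiny similarity matrix are all assumptions chosen for illustration.

```python
def normalize_columns(m):
    """Rescale every column so that it sums to 1 (column-stochastic)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def toy_mcl(similarity, inflation=2.0, iterations=50, tol=1e-9):
    """Toy Markov clustering on a symmetric similarity matrix (list of lists).

    Hypothetical helper for illustration only; parameter defaults are arbitrary.
    """
    n = len(similarity)
    # Add self-loops, then convert to a column-stochastic matrix.
    m = normalize_columns(
        [[similarity[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    )
    for _ in range(iterations):
        # Expansion: square the matrix (flow spreads along paths of length 2).
        expanded = [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
                    for i in range(n)]
        # Inflation: element-wise power, then re-normalize (strong flow wins).
        inflated = normalize_columns(
            [[value ** inflation for value in row] for row in expanded]
        )
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break
    # Each row that still carries flow lists the members of one cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


# Hypothetical similarity graph: proteins 0-2 are mutually similar, as are 3-4.
example = [
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 1, 0],
]
tribes = toy_mcl(example)
```

    In this sketch a larger inflation value tends to split the graph into more, smaller groups, which is one way to think about running the clustering at several stringencies.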

  6. Genome-Wide Analysis of NBS-LRR Genes in Sorghum Genome Revealed Several Events Contributing to NBS-LRR Gene Evolution in Grass Species

    PubMed Central

    Yang, Xiping; Wang, Jianping

    2016-01-01

    The nucleotide-binding site (NBS)–leucine-rich repeat (LRR) gene family is crucially important for offering resistance to pathogens. To explore evolutionary conservation and variability of NBS-LRR genes across grass species, we identified 88, 107, 24, and 44 full-length NBS-LRR genes in sorghum, rice, maize, and Brachypodium, respectively. A comprehensive analysis was performed on classification, genome organization, evolution, expression, and regulation of these NBS-LRR genes using sorghum as a representative of grass species. In general, the full-length NBS-LRR genes are highly clustered and duplicated in the sorghum genome, mainly due to local duplications. NBS-LRR genes have basal expression levels and have a high potential to be targeted by miRNA. The number of NBS-LRR genes in the four grass species is positively correlated with the gene clustering rate. The results provided a valuable genomic resource and insights for functional and evolutionary studies of NBS-LRR genes in grass species. PMID:26792976

  7. Genome-Wide Transcriptional Profiling of Clostridium perfringens SM101 during Sporulation Extends the Core of Putative Sporulation Genes and Genes Determining Spore Properties and Germination Characteristics.

    PubMed

    Xiao, Yinghua; van Hijum, Sacha A F T; Abee, Tjakko; Wells-Bennik, Marjon H J

    2015-01-01

    The formation of bacterial spores is a highly regulated process and the ultimate properties of the spores are determined during sporulation and subsequent maturation. A wide variety of genes that are expressed during sporulation determine spore properties such as resistance to heat and other adverse environmental conditions, dormancy and germination responses. In this study we characterized the sporulation phases of C. perfringens enterotoxic strain SM101 based on morphological characteristics, biomass accumulation (OD600), the total viable counts of cells plus spores, the viable count of heat resistant spores alone, the pH of the supernatant, enterotoxin production and dipicolinic acid accumulation. Subsequently, whole-genome expression profiling during key phases of the sporulation process was performed using DNA microarrays, and genes were clustered based on their time-course expression profiles during sporulation. The majority of previously characterized C. perfringens germination genes showed upregulated expression profiles in time during sporulation and belonged to two main clusters of genes. These clusters with up-regulated genes contained a large number of C. perfringens genes which are homologs of Bacillus genes with roles in sporulation and germination; this study therefore suggests that those homologs are functional in C. perfringens. A comprehensive homology search revealed that approximately half of the upregulated genes in the two clusters are conserved within a broad range of sporeforming Firmicutes. Another 30% of upregulated genes in the two clusters were found only in Clostridium species, while the remaining 20% appeared to be specific for C. perfringens. These newly identified genes may add to the repertoire of genes with roles in sporulation and determining spore properties including germination behavior. Their exact roles remain to be elucidated in future studies.

  8. Genome-Wide Transcriptional Profiling of Clostridium perfringens SM101 during Sporulation Extends the Core of Putative Sporulation Genes and Genes Determining Spore Properties and Germination Characteristics

    PubMed Central

    Xiao, Yinghua; van Hijum, Sacha A. F. T.; Abee, Tjakko; Wells-Bennik, Marjon H. J.

    2015-01-01

    The formation of bacterial spores is a highly regulated process and the ultimate properties of the spores are determined during sporulation and subsequent maturation. A wide variety of genes that are expressed during sporulation determine spore properties such as resistance to heat and other adverse environmental conditions, dormancy and germination responses. In this study we characterized the sporulation phases of C. perfringens enterotoxic strain SM101 based on morphological characteristics, biomass accumulation (OD600), the total viable counts of cells plus spores, the viable count of heat resistant spores alone, the pH of the supernatant, enterotoxin production and dipicolinic acid accumulation. Subsequently, whole-genome expression profiling during key phases of the sporulation process was performed using DNA microarrays, and genes were clustered based on their time-course expression profiles during sporulation. The majority of previously characterized C. perfringens germination genes showed upregulated expression profiles in time during sporulation and belonged to two main clusters of genes. These clusters with up-regulated genes contained a large number of C. perfringens genes which are homologs of Bacillus genes with roles in sporulation and germination; this study therefore suggests that those homologs are functional in C. perfringens. A comprehensive homology search revealed that approximately half of the upregulated genes in the two clusters are conserved within a broad range of sporeforming Firmicutes. Another 30% of upregulated genes in the two clusters were found only in Clostridium species, while the remaining 20% appeared to be specific for C. perfringens. These newly identified genes may add to the repertoire of genes with roles in sporulation and determining spore properties including germination behavior. Their exact roles remain to be elucidated in future studies. PMID:25978838

  9. The Putative C2H2 Transcription Factor MtfA Is a Novel Regulator of Secondary Metabolism and Morphogenesis in Aspergillus nidulans

    PubMed Central

    Ramamoorthy, Vellaisamy; Dhingra, Sourabh; Kincaid, Alexander; Shantappa, Sourabha; Feng, Xuehuan; Calvo, Ana M.

    2013-01-01

    Secondary metabolism in the model fungus Aspergillus nidulans is controlled by the conserved global regulator VeA, which also governs morphological differentiation. Among the secondary metabolites regulated by VeA is the mycotoxin sterigmatocystin (ST). The presence of VeA is necessary for the biosynthesis of this carcinogenic compound. We identified a revertant mutant able to synthesize ST intermediates in the absence of VeA. The point mutation occurred at the coding region of a gene encoding a novel putative C2H2 zinc finger domain transcription factor that we denominated mtfA. The A. nidulans mtfA gene product localizes at nuclei independently of the illumination regime. Deletion of the mtfA gene restores mycotoxin biosynthesis in the absence of veA, but drastically reduced mycotoxin production when mtfA gene expression was altered, by deletion or overexpression, in A. nidulans strains with a veA wild-type allele. Our study revealed that mtfA regulates ST production by affecting the expression of the specific ST gene cluster activator aflR. Importantly, mtfA is also a regulator of other secondary metabolism gene clusters, such as genes responsible for the synthesis of terrequinone and penicillin. As in the case of ST, deletion or overexpression of mtfA was also detrimental for the expression of terrequinone genes. Deletion of mtfA also decreased the expression of the genes in the penicillin gene cluster, reducing penicillin production. However, in this case, over-expression of mtfA enhanced the transcription of penicillin genes, increasing penicillin production more than 5 fold with respect to the control. Importantly, in addition to its effect on secondary metabolism, mtfA also affects asexual and sexual development in A. nidulans. Deletion of mtfA results in a reduction of conidiation and sexual stage. We found mtfA putative orthologs conserved in other fungal species. PMID:24066102

  10. Rapid diversification of FoxP2 in teleosts through gene duplication in the teleost-specific whole genome duplication event.

    PubMed

    Song, Xiaowei; Wang, Yajun; Tang, Yezhong

    2013-01-01

    As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication.

  11. Rapid Diversification of FoxP2 in Teleosts through Gene Duplication in the Teleost-Specific Whole Genome Duplication Event

    PubMed Central

    Song, Xiaowei; Wang, Yajun; Tang, Yezhong

    2013-01-01

    As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication. PMID:24349554

  12. Interstitial telomeric sequences in human chromosomes cluster with common fragile sites, mutagen sensitive sites, viral integration sites, cancer breakpoints, proto-oncogenes and breakpoints involved in primate evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Adekunle, S.S.A.; Wyandt, H.; Mark, H.F.L.

    1994-09-01

    Recently we mapped the telomeric repeat sequences to 111 interstitial sites in the human genome and to sites of gaps and breaks induced by aphidicolin and sister chromatid exchange sites detected by BrdU. Many of these sites correspond to conserved fragile sites in man, gorilla and chimpanzee, to sites of conserved sister chromatid exchange in the mammalian X chromosome, to mutagenic sensitive sites, mapped locations of proto-oncogenes, breakpoints implicated in primate evolution and to breakpoints indicated as the sole anomaly in neoplasia. This observation prompted us to investigate if the interstitial telomeric sites cluster with these sites. An extensive literature search was carried out to find all the available published sites mentioned above. For comparison, we also carried out a statistical analysis of the clustering of the sites of the telomeric repeats with the gene locations where only nucleotide mutations have been observed as the only chromosomal abnormality. Our results indicate that the telomeric repeats cluster most with fragile sites, mutagenic sensitive sites and breakpoints implicated in primate evolution and least with cancer breakpoints, mapped locations of proto-oncogenes and other genes with nucleotide mutations.

  13. Extensive Concerted Evolution of Rice Paralogs and the Road to Regaining Independence

    PubMed Central

    Wang, Xiyin; Tang, Haibao; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.

    2007-01-01

    Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the ∼0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, ∼8% of japonica paralogs produced 5–7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while ∼70-MY-old “paleologs” resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice–sorghum divergence ∼41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity—that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5–7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization. PMID:18039882

  14. In Silico Analysis of Gene Expression Network Components Underlying Pigmentation Phenotypes in the Python Identified Evolutionarily Conserved Clusters of Transcription Factor Binding Sites

    PubMed Central

    2016-01-01

    Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1) that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus. Our results provide insight into pigment phenotypes in pythons. PMID:27698666
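
    The abstract above reports clusters of transcription factor binding sites falling within ~200-nucleotide intervals of 1500-nucleotide upstream regions, but does not say how the clusters were called. The sketch below shows one simple way such windows could be found from a list of predicted site positions. The greedy grouping rule, the function name, the minimum of three sites per cluster and the example coordinates are all assumptions for illustration, and the conservation filter mentioned in the abstract is not modeled.

```python
def site_clusters(positions, window=200, min_sites=3):
    """Group sorted site positions into runs that fit inside one window.

    Hypothetical helper: returns (start, end, n_sites) for every greedy,
    non-overlapping run of at least `min_sites` sites spanning <= `window` nt.
    """
    pos = sorted(positions)
    clusters = []
    i = 0
    while i < len(pos):
        j = i
        # Extend the run while the next site is still within the window.
        while j + 1 < len(pos) and pos[j + 1] - pos[i] <= window:
            j += 1
        n_sites = j - i + 1
        if n_sites >= min_sites:
            clusters.append((pos[i], pos[j], n_sites))
            i = j + 1  # continue after this cluster
        else:
            i += 1     # too sparse; try the next site as a new start
    return clusters


# Invented site coordinates (nt from the start of a 1500-nt upstream region).
sites = [10, 50, 120, 600, 900, 950, 1000, 1400]
found = site_clusters(sites)
```

    With these invented coordinates, the sites at 10-120 and at 900-1000 each form a cluster, while the isolated sites at 600 and 1400 are ignored.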

  15. In Silico Analysis of Gene Expression Network Components Underlying Pigmentation Phenotypes in the Python Identified Evolutionarily Conserved Clusters of Transcription Factor Binding Sites.

    PubMed

    Irizarry, Kristopher J L; Bryden, Randall L

    2016-01-01

    Color variation provides the opportunity to investigate the genetic basis of evolution and selection. Reptiles are less studied than mammals. Comparative genomics approaches allow for knowledge gained in one species to be leveraged for use in another species. We describe a comparative vertebrate analysis of conserved regulatory modules in pythons aimed at assessing bioinformatics evidence that transcription factors important in mammalian pigmentation phenotypes may also be important in python pigmentation phenotypes. We identified 23 python orthologs of mammalian genes associated with variation in coat color phenotypes for which we assessed the extent of pairwise protein sequence identity between pythons and mouse, dog, horse, cow, chicken, anole lizard, and garter snake. We next identified a set of melanocyte/pigment associated transcription factors (CREB, FOXD3, LEF-1, MITF, POU3F2, and USF-1) that exhibit relatively conserved sequence similarity within their DNA binding regions across species based on orthologous alignments across multiple species. Finally, we identified 27 evolutionarily conserved clusters of transcription factor binding sites within ~200-nucleotide intervals of the 1500-nucleotide upstream regions of AIM1, DCT, MC1R, MITF, MLANA, OA1, PMEL, RAB27A, and TYR from Python bivittatus. Our results provide insight into pigment phenotypes in pythons.

  16. IL26 gene inactivation in Equidae.

    PubMed

    Shakhsi-Niaei, M; Drögemüller, M; Jagannathan, V; Gerber, V; Leeb, T

    2013-12-01

    Interleukin-26 (IL26) is a member of the IL10 cytokine family. The IL26 gene is located between two other well-known cytokine genes of this family encoding interferon-gamma (IFNG) and IL22 in an evolutionarily conserved gene cluster. In contrast to humans and most other mammals, mice lack a functional Il26 gene. We analyzed the genome sequences of other vertebrates for the presence or absence of functional IL26 orthologs and found that the IL26 gene has also become inactivated in several equid species. We detected a one-base pair frameshift deletion in exon 2 of the IL26 gene in the domestic horse (Equus caballus), Przewalski horse (Equus przewalskii) and donkey (Equus asinus). The remnant IL26 gene in the horse is still transcribed and gives rise to at least five alternative transcripts. None of these transcripts share a conserved open reading frame with the human IL26 gene. A comparative analysis across diverse vertebrates revealed that the IL26 gene has also independently been inactivated in a few other mammals, including the African elephant and the European hedgehog. The IL26 gene thus appears to be highly variable, and the conserved open reading frame has been lost several times during mammalian evolution. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.

  17. CRISPR Diversity and Microevolution in Clostridium difficile

    PubMed Central

    Andersen, Joakim M.; Shoup, Madelyn; Robinson, Cathy; Britton, Robert; Olsen, Katharina E.P.; Barrangou, Rodolphe

    2016-01-01

    Virulent strains of Clostridium difficile have become a global health problem associated with morbidity and mortality. Traditional typing methods do not provide ideal resolution to track outbreak strains, ascertain genetic diversity between isolates, or monitor the phylogeny of this species on a global basis. Here, we investigate the occurrence and diversity of clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (cas) in C. difficile to assess the potential of CRISPR-based phylogeny and high-resolution genotyping. A single Type-IB CRISPR-Cas system was identified in 217 analyzed genomes with cas gene clusters present at conserved chromosomal locations, suggesting vertical evolution of the system, assessing a total of 1,865 CRISPR arrays. The CRISPR arrays, markedly enriched (8.5 arrays/genome) compared with other species, occur both at conserved and variable locations across strains, and thus provide a basis for typing based on locus occurrence and spacer polymorphism. Clustering of strains by array composition correlated with sequence type (ST) analysis. Spacer content and polymorphism within conserved CRISPR arrays revealed phylogenetic relationship across clades and within ST. Spacer polymorphisms of conserved arrays were instrumental for differentiating closely related strains, e.g., ST1/RT027/B1 strains and pathogenicity locus encoding ST3/RT001 strains. CRISPR spacers showed sequence similarity to phage sequences, which is consistent with the native role of CRISPR-Cas as adaptive immune systems in bacteria. Overall, CRISPR-Cas sequences constitute a valuable basis for genotyping of C. difficile isolates, provide insights into the micro-evolutionary events that occur between closely related strains, and reflect the evolutionary trajectory of these genomes. PMID:27576538

  18. Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

    PubMed

    Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

    2016-04-01

    Brucella spp. are facultative intracellular pathogens that cause a contagious zoonotic disease, which can result in outcomes such as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. To decipher the survival mechanism of Brucella spp. in vivo, 42 complete Brucella genomes from NCBI were analyzed to identify the composition and function of the pan-genome and core genome. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were under positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conserved, and that energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.
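
    The abstract above splits the gene clusters into core, strain-specific and dispensable groups without stating the exact criteria. A common convention, assumed here and not confirmed by the source, is that a cluster is core if it occurs in every genome, strain-specific if it occurs in exactly one, and dispensable otherwise. The sketch below applies that assumed rule to a made-up presence table; the function name, cluster labels and genome labels are hypothetical.

```python
from collections import Counter


def classify_clusters(cluster_to_genomes, n_genomes):
    """Count core / strain-specific / dispensable clusters.

    Assumed definitions (not taken from the abstract):
      core            - present in all `n_genomes` genomes
      strain_specific - present in exactly one genome
      dispensable     - present in some but not all genomes
    """
    counts = Counter()
    for genomes in cluster_to_genomes.values():
        k = len(set(genomes))  # number of distinct genomes carrying the cluster
        if k == n_genomes:
            counts["core"] += 1
        elif k == 1:
            counts["strain_specific"] += 1
        else:
            counts["dispensable"] += 1
    return counts


# Made-up presence table for three genomes (A, B, C).
toy_table = {
    "cluster_1": ["A", "B", "C"],
    "cluster_2": ["A", "B", "C"],
    "cluster_3": ["A", "B"],
    "cluster_4": ["C"],
}
toy_counts = classify_clusters(toy_table, n_genomes=3)

# The three categories reported in the abstract add up to the stated total.
reported_total = 1710 + 1182 + 2477
```

    The reported category sizes are internally consistent: 1710 + 1182 + 2477 = 5369, the total number of clusters given in the abstract.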

  19. Molecular Keys to the Janthinobacterium and Duganella spp. Interaction with the Plant Pathogen Fusarium graminearum

    PubMed Central

    Haack, Frederike S.; Poehlein, Anja; Kröger, Cathrin; Voigt, Christian A.; Piepenbring, Meike; Bode, Helge B.; Daniel, Rolf; Schäfer, Wilhelm; Streit, Wolfgang R.

    2016-01-01

    Janthinobacterium and Duganella are well-known for their antifungal effects. Surprisingly, almost nothing is known on molecular aspects involved in the close bacterium-fungus interaction. To better understand this interaction, we established the genomes of 11 Janthinobacterium and Duganella isolates in combination with phylogenetic and functional analyses of all publicly available genomes. Thereby, we identified a core and pan genome of 1058 and 23,628 genes. All strains encoded secondary metabolite gene clusters and chitinases, both possibly involved in fungal growth suppression. All but one strain carried a single gene cluster involved in the biosynthesis of alpha-hydroxyketone-like autoinducer molecules, designated JAI-1. Genome-wide RNA-seq studies employing the background of two isolates and the corresponding JAI-1 deficient strains identified a set of 45 QS-regulated genes in both isolates. Most regulated genes are characterized by a conserved sequence motif within the promoter region. Among the most strongly regulated genes were secondary metabolite and type VI secretion system gene clusters. Most intriguing, co-incubation studies of J. sp. HH102 or its corresponding JAI-1 synthase deletion mutant with the plant pathogen Fusarium graminearum provided first evidence of a QS-dependent interaction with this pathogen. PMID:27833590

  20. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster

    PubMed Central

    2012-01-01

    Background: The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results: We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions: These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster. PMID:22577841

  1. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster.

    PubMed

    Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René

    2012-05-11

    The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-one (DIMBOA) are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  2. Comparative genomic analysis of four representative plant growth-promoting rhizobacteria in Pseudomonas.

    PubMed

    Shen, Xuemei; Hu, Hongbo; Peng, Huasong; Wang, Wei; Zhang, Xuehong

    2013-04-22

    Some Pseudomonas strains function as predominant plant growth-promoting rhizobacteria (PGPR). Within this group, Pseudomonas chlororaphis and Pseudomonas fluorescens are non-pathogenic biocontrol agents, and some Pseudomonas aeruginosa and Pseudomonas stutzeri strains are PGPR. P. chlororaphis GP72 is a plant growth-promoting rhizobacterium with a fully sequenced genome. We conducted a genomic analysis comparing GP72 with three other pseudomonad PGPR: P. fluorescens Pf-5, P. aeruginosa M18, and the nitrogen-fixing strain P. stutzeri A1501. Our aim was to identify the similarities and differences among these strains using a comparative genomic approach to clarify the mechanisms of plant growth-promoting activity. The genome sizes of GP72, Pf-5, M18, and A1501 ranged from 4.6 to 7.1 M, and the number of protein-coding genes varied among the four species. Clusters of Orthologous Groups (COGs) analysis assigned functions to predicted proteins. The COGs distributions were similar among the four species. However, the percentage of genes encoding transposases and their inactivated derivatives (COG L) was 1.33% of the total genes with COGs classifications in A1501, 0.21% in GP72, 0.02% in Pf-5, and 0.11% in M18. A phylogenetic analysis indicated that GP72 and Pf-5 were the most closely related strains, consistent with the genome alignment results. Comparisons of predicted coding sequences (CDSs) between GP72 and Pf-5 revealed 3544 conserved genes. There were fewer conserved genes when GP72 CDSs were compared with those of A1501 and M18. Comparisons among the four Pseudomonas species revealed 603 conserved genes in GP72, illustrating common plant growth-promoting traits shared among these PGPR. Conserved genes were related to catabolism, transport of plant-derived compounds, stress resistance, and rhizosphere colonization. Some strain-specific CDSs were related to different kinds of biocontrol activities or plant growth promotion. 
The GP72 genome contained the cus operon (related to heavy metal resistance) and a gene cluster involved in type IV pilus biosynthesis, which confers adhesion ability. Comparative genomic analysis of four representative PGPR revealed some conserved regions, indicating common characteristics (metabolism of plant-derived compounds, heavy metal resistance, and rhizosphere colonization) among these pseudomonad PGPR. Genomic regions specific to each strain provide clues to its lifestyle, ecological adaptation, and physiological role in the rhizosphere.

  3. Transcriptomic analysis of neuregulin-1 regulated genes following ischemic stroke by computational identification of promoter binding sites: A role for the ETS-1 transcription factor.

    PubMed

    Surles-Zeigler, Monique C; Li, Yonggang; Distel, Timothy J; Omotayo, Hakeem; Ge, Shaokui; Ford, Byron D

    2018-01-01

    Ischemic stroke is a major cause of mortality in the United States. We previously showed that neuregulin-1 (NRG1) was neuroprotective in rat models of ischemic stroke. We used gene expression profiling to understand the early cellular and molecular mechanisms of NRG1's effects after the induction of ischemia. Ischemic stroke was induced by middle cerebral artery occlusion (MCAO). Rats were allocated to 3 groups: (1) control, (2) MCAO and (3) MCAO + NRG1. Cortical brain tissues were collected three hours following MCAO and NRG1 treatment and subjected to microarray analysis. Data and statistical analyses were performed using R/Bioconductor platform alongside Genesis, Ingenuity Pathway Analysis and Enrichr software packages. There were 2693 genes differentially regulated following ischemia and NRG1 treatment. These genes were organized by expression patterns into clusters using a K-means clustering algorithm. We further analyzed genes in clusters where ischemia altered gene expression, which was reversed by NRG1 (clusters 4 and 10). NRG1, IRS1, OPA3, and POU6F1 were central linking (node) genes in cluster 4. Conserved Transcription Factor Binding Site Finder (CONFAC) identified ETS-1 as a potential transcriptional regulator of NRG1 suppressed genes following ischemia. A transcription factor activity array showed that ETS-1 activity was increased 2-fold, 3 hours following ischemia and this activity was attenuated by NRG1. These findings reveal key early transcriptional mechanisms associated with neuroprotection by NRG1 in the ischemic penumbra.

  4. Nucleotide sequence of a cluster of early and late genes in a conserved segment of the vaccinia virus genome.

    PubMed Central

    Plucienniczak, A; Schroeder, E; Zettlmeissl, G; Streeck, R E

    1985-01-01

    The nucleotide sequence of a 7.6 kb vaccinia DNA segment from a genomic region conserved among different orthopox viruses has been determined. This segment contains a tight cluster of 12 partly overlapping open reading frames, most of which can be correlated with previously identified early and late proteins and mRNAs. Regulatory signals used by vaccinia virus have been studied. Presumptive promoter regions are rich in A and T and carry the consensus sequences TATA and AATAA spaced at 20-24 base pairs. Tandem repeats of a CTATTC consensus sequence are proposed to be involved in the termination of early transcription. PMID:2987815

  5. Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

    PubMed

    Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

    1998-08-01

    The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.

  6. A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean

    PubMed Central

    David, Perrine; Chen, Nicolas W.G.; Pedrosa-Harand, Andrea; Thareau, Vincent; Sévignac, Mireille; Cannon, Steven B.; Debouck, Daniel; Langin, Thierry; Geffroy, Valérie

    2009-01-01

    The B4 resistance (R) gene cluster is one of the largest clusters known in common bean (Phaseolus vulgaris [Pv]). It is located in a peculiar genomic environment in the subtelomeric region of the short arm of chromosome 4, adjacent to two heterochromatic blocks (knobs). We sequenced 650 kb spanning this locus and annotated 97 genes, 26 of which correspond to Coiled-Coil-Nucleotide-Binding-Site-Leucine-Rich-Repeat (CNL). Conserved microsynteny was observed between the Pv B4 locus and corresponding regions of Medicago truncatula and Lotus japonicus in chromosomes Mt6 and Lj2, respectively. The notable exception was the CNL sequences, which were completely absent in these regions. The origin of the Pv B4-CNL sequences was investigated through phylogenetic analysis, which reveals that, in the Pv genome, paralogous CNL genes are shared among nonhomologous chromosomes (4 and 11). Together, our results suggest that Pv B4-CNL was derived from CNL sequences from another cluster, the Co-2 cluster, through an ectopic recombination event. Integration of the soybean (Glycine max) genome data enables us to date more precisely this event and also to infer that a single CNL moved from the Co-2 to the B4 cluster. Moreover, we identified a new 528-bp satellite repeat, referred to as khipu, specific to the Phaseolus genus, present both between B4-CNL sequences and in the two knobs identified at the B4 R gene cluster. The khipu repeat is present on most chromosomal termini, indicating the existence of frequent ectopic recombination events in Pv subtelomeric regions. Our results highlight the importance of ectopic recombination in R gene evolution. PMID:19776165

  7. Molecular characterization and expression of microbial inulinase genes.

    PubMed

    Liu, Guang-Lei; Chi, Zhe; Chi, Zhen-Ming

    2013-05-01

    Many genes encoding exo- and endo-inulinases from bacteria, yeasts and filamentous fungi have been cloned and characterized. All the inulinases have several conserved motifs, such as WMND(E)PNGL, RDP, EC(V)P, SVEVF, Q and FS(T), which play an important role in inulinase catalysis and substrate binding. However, the exo-inulinases produced by yeasts lack the conserved motif SVEVF, and the yeasts do not produce any endo-inulinase. Exo- and endo-inulinases found in different microorganisms cluster separately at distant positions from each other. Most of the cloned inulinase genes have been expressed in Yarrowia lipolytica, Saccharomyces cerevisiae, Pichia pastoris, Kluyveromyces lactis or Escherichia coli. The recombinant inulinases produced and the engineered hosts using the cloned inulinase genes have many potential applications. Expression of most of the inulinase genes is repressed by glucose and fructose and induced by inulin and sucrose. However, the detailed mechanisms of the repression and induction are still unknown.

  8. dndDB: a database focused on phosphorothioation of the DNA backbone.

    PubMed

    Ou, Hong-Yu; He, Xinyi; Shao, Yucheng; Tai, Cui; Rajakumar, Kumar; Deng, Zixin

    2009-01-01

    The Dnd DNA degradation phenotype was first observed during electrophoresis of genomic DNA from Streptomyces lividans more than 20 years ago. It was subsequently shown to be governed by the five-gene dnd cluster. Similar gene clusters have now been found to be widespread among many other distantly related bacteria. Recently the dnd cluster was shown to mediate the incorporation of sulphur into the DNA backbone via a sequence-selective, stereo-specific phosphorothioate modification in Escherichia coli B7A. Intriguingly, to date all identified dnd clusters lie within mobile genetic elements, the vast majority in laterally transferred genomic islands. We organized available data from experimental and bioinformatics analyses about the DNA phosphorothioation phenomenon and associated documentation as a dndDB database. It contains the following detailed information: (i) Dnd phenotype; (ii) dnd gene clusters; (iii) genomic islands harbouring dnd genes; (iv) Dnd proteins and conserved domains. As of 25 December 2008, dndDB contained data corresponding to 24 bacterial species exhibiting the Dnd phenotype reported in the scientific literature. In addition, via in silico analysis, dndDB identified 26 syntenic dnd clusters from 25 species of Eubacteria and Archaea, 25 dnd-bearing genomic islands and one dnd plasmid containing 114 dnd genes. A further 397 other genes coding for proteins with varying levels of similarity to Dnd proteins were also included in dndDB. A broad range of similarity search, sequence alignment and phylogenetic tools are readily accessible to allow for individualized directions of research focused on dnd genes. dndDB can facilitate efficient investigation of a wide range of aspects relating to dnd DNA modification and other island-encoded functions in host organisms. dndDB version 1.0 is freely available at http://mml.sjtu.edu.cn/dndDB/.

  9. Evolutionary conservation of sequence and secondary structures in CRISPR repeats

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kunin, Victor; Sorek, Rotem; Hugenholtz, Philip

    Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in ~40% of bacterial and all archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CAS), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been proposed that the CRISPR/CAS system samples, maintains a record of, and inactivates invasive DNA that the cell has encountered, and therefore constitutes a prokaryotic analog of an immune system. Here we analyze CRISPR repeats identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. All individual repeats in any given cluster were inferred to form characteristic RNA secondary structure, ranging from non-existent to pronounced. Stable secondary structures included G:U base pairs and exhibited multiple compensatory base changes in the stem region, indicating evolutionary conservation and functional importance. We also show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification including specific relationships between CRISPR and CAS subtypes.

  10. Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation

    PubMed Central

    Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; Taylor, Ronald C.; Weisenhorn, Pamela; Olson, Robert D.; Stevens, Rick L.; Rocha, Miguel; Rocha, Isabel; Best, Aaron A.; DeJongh, Matthew; Tintle, Nathan L.; Parrello, Bruce; Overbeek, Ross; Henry, Christopher S.

    2016-01-01

    Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. 
Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain. PMID:27933038

  11. The Anaerobe-Specific Orange Protein Complex of Desulfovibrio vulgaris Hildenborough Is Encoded by Two Divergent Operons Coregulated by σ54 and a Cognate Transcriptional Regulator

    PubMed Central

    Fiévet, Anouchka; My, Laetitia; Cascales, Eric; Ansaldi, Mireille; Pauleta, Sofia R.; Moura, Isabel; Dermoun, Zorah; Bernard, Christophe S.; Dolla, Alain; Aubert, Corinne

    2011-01-01

    Analysis of sequenced bacterial genomes revealed that the genomes encode more than 30% hypothetical and conserved hypothetical proteins of unknown function. Among proteins of unknown function that are conserved in anaerobes, some might be determinants of the anaerobic way of life. This study focuses on two divergent clusters specifically found in anaerobic microorganisms and mainly composed of genes encoding conserved hypothetical proteins. We show that the two gene clusters DVU2103-DVU2104-DVU2105 (orp2) and DVU2107-DVU2108-DVU2109 (orp1) form two divergent operons transcribed by the σ54-RNA polymerase. We further demonstrate that the σ54-dependent transcriptional regulator DVU2106, located between orp1 and orp2, collaborates with σ54-RNA polymerase to orchestrate the simultaneous expression of the divergent orp operons. DVU2106, whose structural gene is transcribed by the σ70-RNA polymerase, negatively retrocontrols its own expression. By using an endogenous pulldown strategy, we identify a physiological complex composed of DVU2103, DVU2104, DVU2105, DVU2108, and DVU2109. Interestingly, inactivation of DVU2106, which is required for orp operon transcription, induces morphological defects that are likely linked to the absence of the ORP complex. A putative role of the ORP proteins in positioning the septum during cell division is discussed. PMID:21531797

  12. Ancient Expansion of the Hox Cluster in Lepidoptera Generated Four Homeobox Genes Implicated in Extra-Embryonic Tissue Formation

    PubMed Central

    Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.

    2014-01-01

    Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of the large clade Ditrysia; Shx genes are not found in a caddisfly or in a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth, which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822

  13. Widespread occurrence of secondary lipid biosynthesis potential in microbial lineages.

    PubMed

    Shulse, Christine N; Allen, Eric E

    2011-01-01

    Bacterial production of long-chain omega-3 polyunsaturated fatty acids (PUFAs), such as eicosapentaenoic acid (EPA, 20:5n-3) and docosahexaenoic acid (DHA, 22:6n-3), is constrained to a narrow subset of marine γ-proteobacteria. The genes responsible for de novo bacterial PUFA biosynthesis, designated pfaEABCD, encode large, multi-domain protein complexes akin to type I iterative fatty acid and polyketide synthases, herein referred to as "Pfa synthases". In addition to the archetypal Pfa synthase gene products from marine bacteria, we have identified homologous type I FAS/PKS gene clusters in diverse microbial lineages spanning 45 genera representing 10 phyla, presumed to be involved in long-chain fatty acid biosynthesis. In total, 20 distinct types of gene clusters were identified. Collectively, we propose the designation of "secondary lipids" to describe these biosynthetic pathways and products, a proposition consistent with the "secondary metabolite" vernacular. Phylogenomic analysis reveals a high degree of functional conservation within distinct biosynthetic pathways. Incongruence between secondary lipid synthase functional clades and taxonomic group membership combined with the lack of orthologous gene clusters in closely related strains suggests horizontal gene transfer has contributed to the dissemination of specialized lipid biosynthetic activities across disparate microbial lineages.

  14. Chassis organism from Corynebacterium glutamicum--a top-down approach to identify and delete irrelevant gene clusters.

    PubMed

    Unthan, Simon; Baumgart, Meike; Radek, Andreas; Herbst, Marius; Siebert, Daniel; Brühl, Natalie; Bartsch, Anna; Bott, Michael; Wiechert, Wolfgang; Marin, Kay; Hans, Stephan; Krämer, Reinhard; Seibold, Gerd; Frunzke, Julia; Kalinowski, Jörn; Rückert, Christian; Wendisch, Volker F; Noack, Stephan

    2015-02-01

    For synthetic biology applications, a robust structural basis is required, which can be constructed either from scratch or in a top-down approach starting from any existing organism. In this study, we initiated the top-down construction of a chassis organism from Corynebacterium glutamicum ATCC 13032, aiming for the relevant gene set to maintain its fast growth on defined medium. We evaluated each native gene for its essentiality considering expression levels, phylogenetic conservation, and knockout data. Based on this classification, we determined 41 gene clusters ranging from 3.7 to 49.7 kbp as target sites for deletion. 36 deletions were successful and 10 genome-reduced strains showed impaired growth rates, indicating that genes were hit, which are relevant to maintain biological fitness at wild-type level. In contrast, 26 deleted clusters were found to include exclusively irrelevant genes for growth on defined medium. A combinatory deletion of all irrelevant gene clusters would, in a prophage-free strain, decrease the size of the native genome by about 722 kbp (22%) to 2561 kbp. Finally, five combinatory deletions of irrelevant gene clusters were investigated. The study introduces the novel concept of relevant genes and demonstrates general strategies to construct a chassis suitable for biotechnological application. © 2014 The Authors. Biotechnology Journal published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim. This is an open access article under the terms of the Creative Commons Attribution-Non-Commercial-NoDerivs Licence, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.

  15. Identification of the Main Regulator Responsible for Synthesis of the Typical Yellow Pigment Produced by Trichoderma reesei

    PubMed Central

    Derntl, Christian; Rassinger, Alice; Srebotnik, Ewald; Mach, Robert L.

    2016-01-01

    ABSTRACT The industrially used ascomycete Trichoderma reesei secretes a typical yellow pigment during cultivation, while other Trichoderma species do not. A comparative genomic analysis suggested that a putative secondary metabolism cluster, containing two polyketide-synthase encoding genes, is responsible for the yellow pigment synthesis. This cluster is conserved in a set of rather distantly related fungi, including Acremonium chrysogenum and Penicillium chrysogenum. In an attempt to silence the cluster in T. reesei, two genes of the cluster encoding transcription factors were individually deleted. For a complete genetic proof-of-function, the genes were reinserted into the genomes of the respective deletion strains. The deletion of the first transcription factor (termed yellow pigment regulator 1 [Ypr1]) resulted in the full abolishment of the yellow pigment formation and the expression of most genes of this cluster. A comparative high-pressure liquid chromatography (HPLC) analysis of supernatants of the ypr1 deletion and its parent strain suggested the presence of several yellow compounds in T. reesei that are all derived from the same cluster. A subsequent gas chromatography/mass spectrometry analysis strongly indicated the presence of sorbicillin in the major HPLC peak. The presence of the second transcription factor, termed yellow pigment regulator 2 (Ypr2), reduces the yellow pigment formation and the expression of most cluster genes, including the gene encoding the activator Ypr1. IMPORTANCE Trichoderma reesei is used for industry-scale production of carbohydrate-active enzymes. During growth, it secretes a typical yellow pigment. This is not favorable for industrial enzyme production because it makes the downstream process more complicated and thus increases operating costs. In this study, we demonstrate which regulators influence the synthesis of the yellow pigment. 
Based on these data, we also provide indication as to which genes are under the control of these regulators and are finally responsible for the biosynthesis of the yellow pigment. These genes are organized in a cluster that is also found in other industrially relevant fungi, such as the two antibiotic producers Penicillium chrysogenum and Acremonium chrysogenum. The targeted manipulation of a secondary metabolism cluster is an important option for any biotechnologically applied microorganism. PMID:27520818

  16. Complete genome sequence of Nitrosospira multiformis, an ammonia-oxidizing bacterium from the soil environment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Norton, Jeanette M.; Klotz, Martin G; Stein, Lisa Y

    2008-01-01

    The complete genome of the ammonia-oxidizing bacterium, Nitrosospira multiformis (ATCC 25196T), consists of a circular chromosome and three small plasmids totaling 3,234,309 bp and encoding 2827 putative proteins. Of these, 2026 proteins have predicted functions and 801 are without conserved functional domains, yet 747 of these have similarity to other predicted proteins in databases. Gene homologs from Nitrosomonas europaea and N. eutropha were the best match for 42% of the predicted genes in N. multiformis. The genome contains three nearly identical copies of amo and hao gene clusters as large repeats. Distinguishing features compared to N. europaea include: the presence of gene clusters encoding urease and hydrogenase, a RuBisCO-encoding operon of distinctive structure and phylogeny, and a relatively small complement of genes related to Fe acquisition. Systems for synthesis of a pyoverdine-like siderophore and for acyl-homoserine lactone were unique to N. multiformis among the sequenced AOB genomes. Gene clusters encoding proteins associated with outer membrane and cell envelope functions including transporters, porins, exopolysaccharide synthesis, capsule formation and protein sorting/export were abundant. Numerous sensory transduction and response regulator gene systems directed towards sensing of the extracellular environment are described. Gene clusters for glycogen, polyphosphate and cyanophycin storage and utilization were identified providing mechanisms for meeting energy requirements under substrate-limited conditions. The genome of N. multiformis encodes the core pathways for chemolithoautotrophy along with adaptations for surface growth and survival in soil environments.

  17. Identification and characterization of a NBS–LRR class resistance gene analog in Pistacia atlantica subsp. Kurdica

    PubMed Central

    Bahramnejad, Bahman

    2014-01-01

    P. atlantica subsp. Kurdica, with the local name of Baneh, is a wild medicinal plant which grows in Kurdistan, Iran. The identification of resistance gene analogs holds great promise for the development of resistant cultivars. A PCR approach with degenerate primers designed according to conserved NBS-LRR (nucleotide binding site-leucine rich repeat) regions of known disease-resistance (R) genes was used to amplify and clone homologous sequences from P. atlantica subsp. Kurdica. A DNA fragment of the expected 500-bp size was amplified. The amplicon was sequenced, and comparison of its predicted amino acid sequence with those of known R-genes revealed significant sequence similarity. Alignment of the deduced amino acid sequence of the P. atlantica subsp. Kurdica resistance gene analog (RGA) showed strong identity, ranging from 68% to 77%, to the non-toll interleukin receptor (non-TIR) R-gene subfamily from other plants. A P-loop motif (GMMGGEGKTT), a conserved hydrophobic motif (GLPLAL), a kinase-2a motif (LLVLDDV, replaced by IAVFDDI in PAKRGA1), and a kinase-3a motif (FGPGSRIII) were present in all RGAs. A phylogenetic tree based on the deduced amino acid sequences of PAKRGA1 and RGAs from different species indicated that they separated into two clusters, with PAKRGA1 falling in cluster II. The isolated NBS analogs can eventually be used as guidelines to isolate numerous R-genes in pistachio. PMID:27843981

  18. Evolutionary biology: microsporidia sex--a missing link to fungi.

    PubMed

    Dyer, Paul S

    2008-11-11

    The evolutionary origins of the microsporidia, a group of intracellular eukaryotic pathogens, have been unclear. Genome analysis of a sex locus and other gene clusters has now revealed conserved synteny with zygomycete fungi, indicating that microsporidia are true fungi descended from a zygomycete ancestor.

  19. Carbon-dependent control of electron transfer and central carbon pathway genes for methane biosynthesis in the Archaean, Methanosarcina acetivorans strain C2A

    PubMed Central

    2010-01-01

    Background: The archaeon Methanosarcina acetivorans strain C2A forms methane, a potent greenhouse gas, from a variety of one-carbon substrates and acetate. Whereas the biochemical pathways leading to methane formation are well understood, little is known about the expression of many of the genes that encode proteins needed for carbon flow, electron transfer and/or energy conservation. Quantitative transcript analysis was performed on twenty gene clusters encompassing over one hundred genes in M. acetivorans that encode enzymes/proteins with known or potential roles in substrate conversion to methane. Results: The expression of many seemingly "redundant" genes/gene clusters establishes substrate-dependent control of approximately seventy genes for methane production by the pathways for methanol and acetate utilization. These include genes for soluble-type and membrane-type heterodisulfide reductases (hdr), hydrogenases including genes for a vht-type F420 non-reducing hydrogenase, molybdenum-type (fmd) as well as tungsten-type (fwd) formylmethanofuran dehydrogenases, genes for rnf and mrp-type electron transfer complexes, for acetate uptake, plus multiple genes for aha- and atp-type ATP synthesis complexes. Analysis of promoters for seven gene clusters reveals UTR leaders of 51-137 nucleotides in length, raising the possibility of both transcriptional and translational levels of control. Conclusions: The above findings establish the differential and coordinated expression of two major gene families in M. acetivorans in response to carbon/energy supply. Furthermore, the quantitative mRNA measurements demonstrate the dynamic range for modulating transcript abundance. Since many of these gene clusters in M. acetivorans are also present in other Methanosarcina species, including M. mazei and M. barkeri, these findings provide a basis for predicting related control in these environmentally significant methanogens. PMID:20178638

  20. Insight into Energy Conservation via Alternative Carbon Monoxide Metabolism in Carboxydothermus pertinax Revealed by Comparative Genome Analysis.

    PubMed

    Fukuyama, Yuto; Omae, Kimiho; Yoneda, Yasuko; Yoshida, Takashi; Sako, Yoshihiko

    2018-05-04

    Carboxydothermus species are some of the most studied thermophilic carboxydotrophs. Their varied carboxydotrophic growth properties suggest distinct strategies for energy conservation via CO metabolism. In this study, we used comparative genome analysis of the genus Carboxydothermus to show variations in the CO dehydrogenase/energy-converting hydrogenase gene cluster, which is responsible for CO metabolism with H2 production (hydrogenogenic CO metabolism). Indeed, ability or inability to produce H2 with CO oxidation is explained by the presence or absence of this gene cluster in C. hydrogenoformans, C. islandicus, and C. ferrireducens. Interestingly, despite its hydrogenogenic CO metabolism, C. pertinax lacks the Ni-CO dehydrogenase catalytic subunit (CooS-I) and its transcriptional regulator encoding genes in this gene cluster, probably due to inversion. Transcriptional analysis in C. pertinax showed that the Ni-CO dehydrogenase gene (cooS-II) and distantly encoded energy-converting hydrogenase related genes were remarkably upregulated under 100% CO. In addition, when thiosulfate was available as a terminal electron acceptor under 100% CO, C. pertinax maximum cell density and maximum specific growth rate were 3.1-fold and 1.5-fold higher, respectively, than when thiosulfate was absent. The amount of H2 produced was only 63% of the consumed CO, less than expected according to hydrogenogenic CO oxidation: CO + H2O → CO2 + H2. Accordingly, C. pertinax would couple CO oxidation by Ni-CO dehydrogenase-II with simultaneous reduction of not only H2O but thiosulfate when grown under 100% CO. IMPORTANCE: Anaerobic hydrogenogenic carboxydotrophs are thought to fill a vital niche by scavenging potentially toxic CO and producing H2 as an available energy source for thermophilic microbes. This hydrogenogenic carboxydotrophy relies on a Ni-CO dehydrogenase/energy-converting hydrogenase gene cluster. This feature is thought to be common to these organisms. However, the hydrogenogenic carboxydotroph Carboxydothermus pertinax lacks the gene for the Ni-CO dehydrogenase catalytic subunit encoded in the gene cluster. Here, we performed a comparative genome analysis of the genus Carboxydothermus, transcriptional analysis, and cultivation study under 100% CO to prove their hydrogenogenic CO metabolism. Results revealed that C. pertinax could couple Ni-CO dehydrogenase-II alternatively to the distal energy-converting hydrogenase. Furthermore, C. pertinax represents an example of the functioning of Ni-CO dehydrogenase which does not always correspond with its genomic context owing to the versatility of CO metabolism and the low redox potential of CO. Copyright © 2018 American Society for Microbiology.

  1. Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.

    PubMed

    Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa

    2017-06-15

    Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Ergot cluster-encoded catalase is required for synthesis of chanoclavine-I in Aspergillus fumigatus.

    PubMed

    Goetz, Kerry E; Coyle, Christine M; Cheng, Johnathan Z; O'Connor, Sarah E; Panaccione, Daniel G

    2011-06-01

    Genes required for ergot alkaloid biosynthesis are clustered in the genomes of several fungi. Several conserved ergot cluster genes have been hypothesized, and in some cases demonstrated, to encode early steps of the pathway shared among fungi that ultimately make different ergot alkaloid end products. The deduced amino acid sequence of one of these conserved genes (easC) indicates a catalase as the product, but a role for a catalase in the ergot alkaloid pathway has not been established. We disrupted easC of Aspergillus fumigatus by homologous recombination with a truncated copy of that gene. The resulting mutant (ΔeasC) failed to produce the ergot alkaloids typically observed in A. fumigatus, including chanoclavine-I, festuclavine, and fumigaclavines B, A, and C. The ΔeasC mutant instead accumulated N-methyl-4-dimethylallyltryptophan (N-Me-DMAT), an intermediate recently shown to accumulate in Claviceps purpurea strains mutated at ccsA (called easE in A. fumigatus) (Lorenz et al. Appl Environ Microbiol 76:1822-1830, 2010). A ΔeasE disruption mutant of A. fumigatus also failed to accumulate chanoclavine-I and downstream ergot alkaloids and, instead, accumulated N-Me-DMAT. Feeding chanoclavine-I to the ΔeasC mutant restored ergot alkaloid production. Complementation of either ΔeasC or ΔeasE mutants with the respective wild-type allele also restored ergot alkaloid production. The easC gene was expressed in Escherichia coli, and the protein product displayed in vitro catalase activity with H(2)O(2) but did not act, in isolation, on N-Me-DMAT as substrate. The data indicate that the products of both easC (catalase) and easE (FAD-dependent oxidoreductase) are required for conversion of N-Me-DMAT to chanoclavine-I.

  3. CRISPR Diversity and Microevolution in Clostridium difficile.

    PubMed

    Andersen, Joakim M; Shoup, Madelyn; Robinson, Cathy; Britton, Robert; Olsen, Katharina E P; Barrangou, Rodolphe

    2016-09-19

    Virulent strains of Clostridium difficile have become a global health problem associated with morbidity and mortality. Traditional typing methods do not provide ideal resolution to track outbreak strains, ascertain genetic diversity between isolates, or monitor the phylogeny of this species on a global basis. Here, we investigate the occurrence and diversity of clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (cas) in C. difficile to assess the potential of CRISPR-based phylogeny and high-resolution genotyping. A single Type-IB CRISPR-Cas system was identified in 217 analyzed genomes with cas gene clusters present at conserved chromosomal locations, suggesting vertical evolution of the system, assessing a total of 1,865 CRISPR arrays. The CRISPR arrays, markedly enriched (8.5 arrays/genome) compared with other species, occur both at conserved and variable locations across strains, and thus provide a basis for typing based on locus occurrence and spacer polymorphism. Clustering of strains by array composition correlated with sequence type (ST) analysis. Spacer content and polymorphism within conserved CRISPR arrays revealed phylogenetic relationship across clades and within ST. Spacer polymorphisms of conserved arrays were instrumental for differentiating closely related strains, e.g., ST1/RT027/B1 strains and pathogenicity locus encoding ST3/RT001 strains. CRISPR spacers showed sequence similarity to phage sequences, which is consistent with the native role of CRISPR-Cas as adaptive immune systems in bacteria. Overall, CRISPR-Cas sequences constitute a valuable basis for genotyping of C. difficile isolates, provide insights into the micro-evolutionary events that occur between closely related strains, and reflect the evolutionary trajectory of these genomes. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  4. The complete mitochondrial genome of the central chimpanzee, Pan troglodytes troglodytes.

    PubMed

    Liu, Bang; Hu, Xiao-di; Gao, Li-Zhi

    2016-07-01

    This study is the first to report the complete mitochondrial genome sequence of the central chimpanzee, Pan troglodytes troglodytes. The genome was 16 556 bp in length and had a base composition of A (31.05%), G (12.95%), C (30.84%), and T (25.16%), indicating that the percentage of A + T (56.21%) is higher than that of G + C (43.79%). Similar to other primates, it possessed a typically conserved structure, including 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region (D-loop). Most of these genes were located on the H-strand, except for the ND6 gene and 8 tRNA genes. The phylogenetic analysis showed that the P. t. troglodytes mitochondrial genome formed a cluster with the other three Pan troglodytes genomes and that the genus Pan is closely related to the genus Homo. This mitochondrial genome sequence would supply useful genetic resources to help the conservation management of primate germplasm and uncover hominoid evolution.
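
    The A + T and G + C figures quoted in this abstract follow directly from the four reported base percentages. As a minimal sketch (the function name is illustrative and not from the paper), the arithmetic can be checked like this:

```python
# Base composition in percent, as reported in the abstract above.
composition = {"A": 31.05, "G": 12.95, "C": 30.84, "T": 25.16}


def at_gc_content(comp):
    """Return the (A+T, G+C) percentages, rounded to two decimals."""
    at = round(comp["A"] + comp["T"], 2)
    gc = round(comp["G"] + comp["C"], 2)
    return at, gc


at, gc = at_gc_content(composition)
print(f"A+T = {at}%, G+C = {gc}%")
```

    The two sums add up to 100%, so the four reported percentages are internally consistent.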

  5. Type III Pilus of Corynebacteria: Pilus Length Is Determined by the Level of Its Major Pilin Subunit

    PubMed Central

    Swierczynski, Arlene; Ton-That, Hung

    2006-01-01

    Multiple pilus gene clusters have been identified in several gram-positive bacterial genomes sequenced to date, including the Actinomycetales, clostridia, streptococci, and corynebacteria. The genome of Corynebacterium diphtheriae contains three pilus gene clusters, two of which have been previously characterized. Here, we report the characterization of the third pilus encoded by the spaHIG cluster. By using electron microscopy and biochemical analysis, we demonstrate that SpaH forms the pilus shaft, while SpaI decorates the structure and SpaG is largely located at the pilus tip. The assembly of the SpaHIG pilus requires a specific sortase located within the spaHIG pilus gene cluster. Deletion of genes specific for the synthesis and polymerization of the other two pilus types does not affect the SpaHIG pilus. Moreover, SpaH but not SpaI or SpaG is essential for the formation of the filament. When expressed under the control of an inducible promoter, the amount of the SpaH pilin regulates pilus length; no pili are assembled from an SpaH precursor that has an alanine in place of the conserved lysine of the SpaH pilin motif. Thus, the spaHIG pilus gene cluster encodes a pilus structure that is independently assembled and antigenically distinct from other pili of C. diphtheriae. We incorporate these findings in a model of sortase-mediated pilus assembly that may be applicable to many gram-positive pathogens. PMID:16923899

  6. Type III pilus of corynebacteria: Pilus length is determined by the level of its major pilin subunit.

    PubMed

    Swierczynski, Arlene; Ton-That, Hung

    2006-09-01

    Multiple pilus gene clusters have been identified in several gram-positive bacterial genomes sequenced to date, including the Actinomycetales, clostridia, streptococci, and corynebacteria. The genome of Corynebacterium diphtheriae contains three pilus gene clusters, two of which have been previously characterized. Here, we report the characterization of the third pilus encoded by the spaHIG cluster. By using electron microscopy and biochemical analysis, we demonstrate that SpaH forms the pilus shaft, while SpaI decorates the structure and SpaG is largely located at the pilus tip. The assembly of the SpaHIG pilus requires a specific sortase located within the spaHIG pilus gene cluster. Deletion of genes specific for the synthesis and polymerization of the other two pilus types does not affect the SpaHIG pilus. Moreover, SpaH but not SpaI or SpaG is essential for the formation of the filament. When expressed under the control of an inducible promoter, the amount of the SpaH pilin regulates pilus length; no pili are assembled from an SpaH precursor that has an alanine in place of the conserved lysine of the SpaH pilin motif. Thus, the spaHIG pilus gene cluster encodes a pilus structure that is independently assembled and antigenically distinct from other pili of C. diphtheriae. We incorporate these findings in a model of sortase-mediated pilus assembly that may be applicable to many gram-positive pathogens.

  7. The WRKY Transcription Factor Genes in Lotus japonicus.

    PubMed

    Song, Hui; Wang, Pengfei; Nan, Zhibiao; Wang, Xingjun

    2014-01-01

    WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (LjWRKY) genes can be classified into three groups (I-III). Investigations of gene copy number and gene clusters indicate that only one gene duplication event occurred on chromosome 4 and no clustered genes were detected on chromosomes 3 or 6. Researchers previously believed that group II and III WRKY domains were derived from the C-terminal WRKY domain of group I. Our results suggest that some WRKY genes in group II originated from the N-terminal domain of group I WRKY genes. Additional evidence to support this hypothesis was obtained by Medicago truncatula WRKY (MtWRKY) protein motif analysis. We found that LjWRKY and MtWRKY group III genes are under purifying selection, suggesting that WRKY genes will become increasingly structured and functionally conserved.

  8. Molecular analysis of SCARECROW genes expressed in white lupin cluster roots

    PubMed Central

    Sbabou, Laila; Bucciarelli, Bruna; Miller, Susan; Liu, Junqi; Berhada, Fatiha; Filali-Maltouf, Abdelkarim; Allan, Deborah; Vance, Carroll

    2010-01-01

    The Scarecrow (SCR) transcription factor plays a crucial role in root cell radial patterning and is required for maintenance of the quiescent centre and differentiation of the endodermis. In response to phosphorus (P) deficiency, white lupin (Lupinus albus L.) root surface area increases some 50-fold to 70-fold due to the development of cluster (proteoid) roots. Previously it was reported that SCR-like expressed sequence tags (ESTs) were expressed during early cluster root development. Here the cloning of two white lupin SCR genes, LaSCR1 and LaSCR2, is reported. The predicted amino acid sequences of both LaSCR gene products are highly similar to AtSCR and contain C-terminal conserved GRAS family domains. LaSCR1 and LaSCR2 transcript accumulation localized to the endodermis of both normal and cluster roots as shown by in situ hybridization and gene promoter::reporter staining. Transcript analysis as evaluated by quantitative real-time-PCR (qRT-PCR) and RNA gel hybridization indicated that the two LaSCR genes are expressed predominantly in roots. Expression of LaSCR genes was not directly responsive to the P status of the plant but was a function of cluster root development. Suppression of LaSCR1 in transformed roots of lupin and Medicago via RNAi (RNA interference) delivered through Agrobacterium rhizogenes resulted in decreased root numbers, reflecting the potential role of LaSCR1 in maintaining root growth in these species. The results suggest that the functional orthologues of AtSCR have been characterized. PMID:20167612

  9. Inferred vs Realized Patterns of Gene Flow: An Analysis of Population Structure in the Andros Island Rock Iguana

    PubMed Central

    Colosimo, Giuliano; Knapp, Charles R.; Wallace, Lisa E.; Welch, Mark E.

    2014-01-01

    Ecological data, the primary source of information on patterns and rates of migration, can be integrated with genetic data to more accurately describe the realized connectivity between geographically isolated demes. In this paper we implement this approach and discuss its implications for managing populations of the endangered Andros Island Rock Iguana, Cyclura cychlura cychlura. This iguana is endemic to Andros, a highly fragmented landmass of large islands and smaller cays. Field observations suggest that geographically isolated demes were panmictic due to high, inferred rates of gene flow. We expand on these observations using 16 polymorphic microsatellites to investigate the genetic structure and rates of gene flow from 188 Andros Iguanas collected across 23 island sites. Bayesian clustering of specimens assigned individuals to three distinct genotypic clusters. An analysis of molecular variance (AMOVA) indicates that allele frequency differences are responsible for a significant portion of the genetic variance across the three defined clusters (Fst = 0.117, p < 0.01). These clusters are associated with larger islands and satellite cays isolated by broad water channels with strong currents. These findings imply that broad water channels present greater obstacles to gene flow than was inferred from field observation alone. Additionally, rates of gene flow were indirectly estimated using BAYESASS 3.0. The proportion of individuals originating from within each identified cluster varied from 94.5 to 98.7%, providing further support for local isolation. Our assessment reveals a major disparity between inferred and realized gene flow. We discuss our results in a conservation perspective for species inhabiting highly fragmented landscapes. PMID:25229344

  10. Conserved syntenic clusters of protein coding genes are missing in birds.

    PubMed

    Lovell, Peter V; Wirthlin, Morgan; Wilhelm, Larry; Minx, Patrick; Lazar, Nathan H; Carbone, Lucia; Warren, Wesley C; Mello, Claudio V

    2014-01-01

    Birds are one of the most highly successful and diverse groups of vertebrates, having evolved a number of distinct characteristics, including feathers and wings, a sturdy lightweight skeleton and unique respiratory and urinary/excretion systems. However, the genetic basis of these traits is poorly understood. Using comparative genomics based on extensive searches of 60 avian genomes, we have found that birds lack approximately 274 protein coding genes that are present in the genomes of most vertebrate lineages and are for the most part organized in conserved syntenic clusters in non-avian sauropsids and in humans. These genes are located in regions associated with chromosomal rearrangements, and are largely present in crocodiles, suggesting that their loss occurred subsequent to the split of dinosaurs/birds from crocodilians. Many of these genes are associated with lethality in rodents, human genetic disorders, or biological functions targeting various tissues. Functional enrichment analysis combined with orthogroup analysis and paralog searches revealed enrichments that were shared by non-avian species, present only in birds, or shared between all species. Together these results provide a clearer definition of the genetic background of extant birds, extend the findings of previous studies on missing avian genes, and provide clues about molecular events that shaped avian evolution. They also have implications for fields that largely benefit from avian studies, including development, immune system, oncogenesis, and brain function and cognition. With regards to the missing genes, birds can be considered ‘natural knockouts’ that may become invaluable model organisms for several human diseases.

  11. Conserved nonsense-prone CpG sites in apoptosis-regulatory genes: conditional stop signs on the road to cell death.

    PubMed

    Zhao, Yongzhong; Epstein, Richard J

    2013-01-01

    Methylation-prone CpG dinucleotides are strongly conserved in the germline, yet are also predisposed to somatic mutation. Here we quantify the relationship between germline codon mutability and somatic carcinogenesis by comparing usage of the nonsense-prone CGA (→TGA) codons in gene groups that differ in apoptotic function; to this end, suppressor genes were subclassified as either apoptotic (gatekeepers) or repair (caretakers). Mutations affecting CGA codons in sporadic tumors proved to be highly asymmetric. Moreover, nonsense mutations were 3-fold more likely to affect gatekeepers than caretakers. In addition, intragenic CGA clustering nonrandomly affected functionally critical regions of gatekeepers. We conclude that human gatekeeper suppressor genes are enriched for nonsense-prone codons, and submit that this germline vulnerability to tumors could reflect in utero selection for a methylation-dependent capability to short-circuit environmental insults that otherwise trigger apoptosis and fetal loss.
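
    The nonsense-prone sites discussed in this abstract are in-frame CGA codons, which a single C→T change converts into the stop codon TGA. The sketch below only illustrates how such codons can be located in a coding sequence. The helper names and the toy sequence are made up for the example and do not reproduce the authors' analysis.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_frame_codons(cds):
    """Split a coding sequence into complete codons, reading in frame 0."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return [cds[i:i + 3] for i in range(0, usable, 3)]


def nonsense_prone_sites(cds):
    """Return 0-based codon indices of in-frame CGA codons."""
    return [i for i, codon in enumerate(in_frame_codons(cds)) if codon == "CGA"]


# Made-up toy sequence, not taken from any gene in the study.
toy_cds = "ATGCGAGCTCGATAA"
sites = nonsense_prone_sites(toy_cds)

# Apply the C->T transition at the first base of each CGA codon.
mutated = ["T" + codon[1:] for codon in in_frame_codons(toy_cds) if codon == "CGA"]
print(sites, mutated)
```

    Counting these sites per gene, and comparing the counts between gene groups, is the kind of codon-usage comparison the abstract describes. The statistics used in the study are not given here.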

  12. Pre-Bilaterian Origins of the Hox Cluster and the Hox Code: Evidence from the Sea Anemone, Nematostella vectensis

    PubMed Central

    Ryan, Joseph F.; Mazza, Maureen E.; Pang, Kevin; Matus, David Q.; Baxevanis, Andreas D.; Martindale, Mark Q.; Finnerty, John R.

    2007-01-01

    Background: Hox genes were critical to many morphological innovations of bilaterian animals. However, early Hox evolution remains obscure. Phylogenetic, developmental, and genomic analyses on the cnidarian sea anemone Nematostella vectensis challenge recent claims that the Hox code is a bilaterian invention and that no “true” Hox genes exist in the phylum Cnidaria. Methodology/Principal Findings: Phylogenetic analyses of 18 Hox-related genes from Nematostella identify putative Hox1, Hox2, and Hox9+ genes. Statistical comparisons among competing hypotheses bolster these findings, including an explicit consideration of the gene losses implied by alternate topologies. In situ hybridization studies of 20 Hox-related genes reveal that multiple Hox genes are expressed in distinct regions along the primary body axis, supporting the existence of a pre-bilaterian Hox code. Additionally, several Hox genes are expressed in nested domains along the secondary body axis, suggesting a role in “dorsoventral” patterning. Conclusions/Significance: A cluster of anterior and posterior Hox genes, as well as ParaHox cluster of genes evolved prior to the cnidarian-bilaterian split. There is evidence to suggest that these clusters were formed from a series of tandem gene duplication events and played a role in patterning both the primary and secondary body axes in a bilaterally symmetrical common ancestor. Cnidarians and bilaterians shared a common ancestor some 570 to 700 million years ago, and as such, are derived from a common body plan. Our work reveals several conserved genetic components that are found in both of these diverse lineages. This finding is consistent with the hypothesis that a set of developmental rules established in the common ancestor of cnidarians and bilaterians is still at work today. PMID:17252055

  13. Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters.

    PubMed

    Dallery, Jean-Félix; Lapalu, Nicolas; Zampounis, Antonios; Pigné, Sandrine; Luyten, Isabelle; Amselem, Joëlle; Wittenberg, Alexander H J; Zhou, Shiguo; de Queiroz, Marisa V; Robin, Guillaume P; Auger, Annie; Hainaut, Matthieu; Henrissat, Bernard; Kim, Ki-Tae; Lee, Yong-Hwan; Lespinet, Olivier; Schwartz, David C; Thon, Michael R; O'Connell, Richard J

    2017-08-29

    The ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly-conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.

  14. Distribution and evolution of cotton fiber development genes in the fibreless Gossypium raimondii genome

    USDA-ARS's Scientific Manuscript database

    Cotton fibers represent the largest single cell in the plant kingdom, and they have been used as a model to study cell function, differentiation, maturation, and cell death. The cotton fiber transcriptome can be clustered into two genomic regions: conserved and recombination hotspots. Genetic link...

  15. Genomic landscape of fiber genes in fibered and non-fibered cottons

    USDA-ARS's Scientific Manuscript database

    Cotton fiber is the largest single cell in the plant kingdom. It is the best model to study cell function, differentiation, maturation, and cell death. Cotton fiber transcriptome can be clustered into two types of regions: conservative areas and recombination hotspots. This study was to investig...

  16. Deciphering the Cryptic Genome: Genome-wide Analyses of the Rice Pathogen Fusarium fujikuroi Reveal Complex Regulation of Secondary Metabolism and Novel Metabolites

    PubMed Central

    Studt, Lena; Niehaus, Eva-Maria; Espino, Jose J.; Huß, Kathleen; Michielse, Caroline B.; Albermann, Sabine; Wagner, Dominik; Bergner, Sonja V.; Connolly, Lanelle R.; Fischer, Andreas; Reuter, Gunter; Kleigrewe, Karin; Bald, Till; Wingfield, Brenda D.; Ophir, Ron; Freeman, Stanley; Hippler, Michael; Smith, Kristina M.; Brown, Daren W.; Proctor, Robert H.; Münsterkötter, Martin; Freitag, Michael; Humpf, Hans-Ulrich; Güldener, Ulrich; Tudzynski, Bettina

    2013-01-01

    The fungus Fusarium fujikuroi causes “bakanae” disease of rice due to its ability to produce gibberellins (GAs), but it is also known for producing harmful mycotoxins. However, the genetic capacity for the whole arsenal of natural compounds and their role in the fungus' interaction with rice remained unknown. Here, we present a high-quality genome sequence of F. fujikuroi that was assembled into 12 scaffolds corresponding to the 12 chromosomes described for the fungus. We used the genome sequence along with ChIP-seq, transcriptome, proteome, and HPLC-FTMS-based metabolome analyses to identify the potential secondary metabolite biosynthetic gene clusters and to examine their regulation in response to nitrogen availability and plant signals. The results indicate that expression of most but not all gene clusters correlate with proteome and ChIP-seq data. Comparison of the F. fujikuroi genome to those of six other fusaria revealed that only a small number of gene clusters are conserved among these species, thus providing new insights into the divergence of secondary metabolism in the genus Fusarium. Noteworthy, GA biosynthetic genes are present in some related species, but GA biosynthesis is limited to F. fujikuroi, suggesting that this provides a selective advantage during infection of the preferred host plant rice. Among the genome sequences analyzed, one cluster that includes a polyketide synthase gene (PKS19) and another that includes a non-ribosomal peptide synthetase gene (NRPS31) are unique to F. fujikuroi. The metabolites derived from these clusters were identified by HPLC-FTMS-based analyses of engineered F. fujikuroi strains overexpressing cluster genes. In planta expression studies suggest a specific role for the PKS19-derived product during rice infection. Thus, our results indicate that combined comparative genomics and genome-wide experimental analyses identified novel genes and secondary metabolites that contribute to the evolutionary success of F. fujikuroi as a rice pathogen. PMID:23825955

  17. Genome-wide identification and characterization of microRNA genes and their targets in flax (Linum usitatissimum): Characterization of flax miRNA genes.

    PubMed

    Barvkar, Vitthal T; Pardeshi, Varsha C; Kale, Sandip M; Qiu, Shuqing; Rollins, Meaghen; Datla, Raju; Gupta, Vidya S; Kadoo, Narendra Y

    2013-04-01

    MicroRNAs (miRNAs) are small (20-24 nucleotides long) endogenous regulatory RNAs that play important roles in plant growth and development. They regulate gene expression at the post-transcriptional level by translational repression or target degradation and gene silencing. In this study, we identified 116 conserved miRNAs belonging to 23 families from the flax (Linum usitatissimum L.) genome using a computational approach. The precursor miRNAs varied in length, while most of the mature miRNAs were 21 nucleotides long, were intergenic, and showed conserved signatures of RNA polymerase II transcripts in their upstream regions. Promoter region analysis of the flax miRNA genes indicated a prevalence of MYB transcription factor binding sites. Four miRNA gene clusters containing members of three phylogenetic groups were identified. Further, 142 target genes were predicted for these miRNAs, and most of these represent transcriptional regulators. The miRNA-encoding genes were expressed in diverse tissues, as determined by digital expression analysis as well as real-time PCR. The expression of fourteen miRNAs and nine target genes was independently validated using quantitative reverse transcription PCR (qRT-PCR). This study suggests that a large number of conserved plant miRNAs are also found in flax and that these may play important roles in the growth and development of flax.

  18. Hierarchical Partitioning of Metazoan Protein Conservation Profiles Provides New Functional Insights

    PubMed Central

    Witztum, Jonathan; Persi, Erez; Horn, David; Pasmanik-Chor, Metsada; Chor, Benny

    2014-01-01

    The availability of many complete, annotated proteomes enables the systematic study of the relationships between protein conservation and functionality. We explore this question based solely on the presence or absence of protein homologues (a.k.a. conservation profiles). We study 18 metazoans, from two distinct points of view: the human's and the fly's. Using the GOrilla gene ontology (GO) analysis tool, we explore functional enrichment of the “universal proteins”, those with homologues in all 17 other species, and of the “non-universal proteins”. A large number of GO terms are strongly enriched in both human and fly universal proteins. Most of these functions are known to be essential. A smaller number of GO terms, exhibiting markedly different properties, are enriched in both human and fly non-universal proteins. We further explore the non-universal proteins, whose conservation profiles are consistent with the “tree of life” (TOL consistent), as well as the TOL inconsistent proteins. Finally, we applied Quantum Clustering to the conservation profiles of the TOL consistent proteins. Each cluster is strongly associated with one or a small number of specific monophyletic clades in the tree of life. The proteins in many of these clusters exhibit strong functional enrichment associated with the “life style” of the related clades. Most previous approaches for studying function and conservation are “bottom up”, studying protein families one by one, and separately assessing the conservation of each. By way of contrast, our approach is “top down”. We globally partition the set of all proteins hierarchically, as described above, and then identify protein families enriched within different subdivisions. While supporting previous findings, our approach also provides a tool for discovering novel relations between protein conservation profiles, functionality, and evolutionary history as represented by the tree of life. PMID:24594619
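
    The split into “universal” and “non-universal” proteins described above can be illustrated with a minimal sketch. It assumes each conservation profile is a list of presence (1) or absence (0) flags, one per other species; the protein names, the three-species toy data, and the function name split_universal are hypothetical and are not taken from the study, which used 17 other species.

```python
from typing import Dict, List, Tuple

# Hypothetical toy profiles: protein -> homologue present (1) / absent (0)
# in each of the other species (3 species here; the study compared 17).
profiles: Dict[str, List[int]] = {
    "protA": [1, 1, 1],
    "protB": [1, 0, 1],
    "protC": [0, 0, 0],
}


def split_universal(profiles: Dict[str, List[int]]) -> Tuple[List[str], List[str]]:
    """Partition proteins into universal (a homologue in every other
    species) and non-universal (missing in at least one species)."""
    universal = sorted(name for name, flags in profiles.items() if all(flags))
    non_universal = sorted(name for name in profiles if name not in universal)
    return universal, non_universal


universal, non_universal = split_universal(profiles)
print("universal:", universal)
print("non-universal:", non_universal)
```

    Downstream steps such as GO enrichment or clustering of the profiles are not sketched here.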

  19. Transcription of two adjacent carbohydrate utilization gene clusters in Bifidobacterium breve UCC2003 is controlled by LacI- and repressor open reading frame kinase (ROK)-type regulators.

    PubMed

    O'Connell, Kerry Joan; Motherway, Mary O'Connell; Liedtke, Andrea; Fitzgerald, Gerald F; Paul Ross, R; Stanton, Catherine; Zomer, Aldert; van Sinderen, Douwe

    2014-06-01

    Members of the genus Bifidobacterium are commonly found in the gastrointestinal tracts of mammals, including humans, where their growth is presumed to be dependent on various diet- and/or host-derived carbohydrates. To understand transcriptional control of bifidobacterial carbohydrate metabolism, we investigated two genetic carbohydrate utilization clusters dedicated to the metabolism of raffinose-type sugars and melezitose. Transcriptomic and gene inactivation approaches revealed that the raffinose utilization system is positively regulated by an activator protein, designated RafR. The gene cluster associated with melezitose metabolism was shown to be subject to direct negative control by a LacI-type transcriptional regulator, designated MelR1, in addition to apparent indirect negative control by means of a second LacI-type regulator, MelR2. In silico analysis, DNA-protein interaction, and primer extension studies revealed the MelR1 and MelR2 operator sequences, each of which is positioned just upstream of or overlapping the correspondingly regulated promoter sequences. Similar analyses identified the RafR binding operator sequence located upstream of the rafB promoter. This study indicates that transcriptional control of gene clusters involved in carbohydrate metabolism in bifidobacteria is subject to conserved regulatory systems, representing either positive or negative control.

  20. Genetic homogeneity of Clostridium botulinum type A1 strains with unique toxin gene clusters.

    PubMed

    Raphael, Brian H; Luquez, Carolina; McCroskey, Loretta M; Joseph, Lavin A; Jacobson, Mark J; Johnson, Eric A; Maslanka, Susan E; Andreadis, Joanne D

    2008-07-01

    A group of five clonally related Clostridium botulinum type A strains isolated from different sources over a period of nearly 40 years harbored several conserved genetic properties. These strains contained a variant bont/A1 with five nucleotide polymorphisms compared to the gene in C. botulinum strain ATCC 3502. The strains also had a common toxin gene cluster composition (ha-/orfX+) similar to that associated with bont/A in type A strains containing an unexpressed bont/B [termed A(B) strains]. However, bont/B was not identified in the strains examined. Comparative genomic hybridization demonstrated identical genomic content among the strains relative to C. botulinum strain ATCC 3502. In addition, microarray data demonstrated the absence of several genes flanking the toxin gene cluster among the ha-/orfX+ A1 strains, suggesting the presence of genomic rearrangements with respect to this region compared to the C. botulinum ATCC 3502 strain. All five strains were shown to have identical flaA variable region nucleotide sequences. The pulsed-field gel electrophoresis patterns of the strains were indistinguishable when digested with SmaI, and a shift in the size of at least one band was observed in a single strain when digested with XhoI. These results demonstrate surprising genomic homogeneity among a cluster of unique C. botulinum type A strains of diverse origin.

  1. Phylogeography, phylogeny and hybridization in trichechid sirenians: Implications for manatee conservation

    USGS Publications Warehouse

    Vianna, J.A.; Bonde, R.K.; Caballero, S.; Giraldo, J.P.; Lima, R.P.; Clark, A.; Marmontel, M.; Morales-Vela, B.; De Souza, M. J.; Parr, L.; Rodriguez-Lopez, M.A.; Mignucci-Giannoni, A. A.; Powell, J.A.; Santos, F.R.

    2006-01-01

    The three living species of manatees, West Indian (Trichechus manatus), Amazonian (Trichechus inunguis) and West African (Trichechus senegalensis), are distributed across the shallow tropical and subtropical waters of America and the western coast of Africa. We have sequenced the mitochondrial DNA control region in 330 Trichechus to compare their phylogeographic patterns. In T. manatus we observed a marked population structure with the identification of three haplotype clusters showing a distinct spatial distribution. A geographic barrier represented by the continuity of the Lesser Antilles to Trinidad Island, near the mouth of the Orinoco River in Venezuela, appears to have restricted the gene flow historically in T. manatus. However, for T. inunguis we observed a single expanding population cluster, with a high diversity of very closely related haplotypes. A marked geographic population structure is likely present in T. senegalensis with at least two distinct clusters. Phylogenetic analyses with the mtDNA cytochrome b gene suggest a clade of the marine Trichechus species, with T. inunguis as the most basal trichechid. This is in agreement with previous morphological analyses. Mitochondrial DNA, autosomal microsatellites and cytogenetic analyses revealed the presence of hybrids between the T. manatus and T. inunguis species at the mouth of the Amazon River in Brazil, extending to the Guyanas and probably as far as the mouth of the Orinoco River. Future conservation strategies should consider the distinct population structure of manatee species, as well as the historical barriers to gene flow and the likely occurrence of interspecific hybridization. © 2006 Blackwell Publishing Ltd.

  2. The Evolution of SINEs and LINEs in the genus Chironomus (Diptera).

    PubMed

    Papusheva, Ekaterina; Gruhl, Mary C; Berezikov, Eugene; Groudieva, Tatiana; Scherbik, Svetlana V; Martin, Jon; Blinov, Alexander; Bergtrom, Gerald

    2004-03-01

    Genomic DNA amplification from 51 species of the family Chironomidae shows that most contain relatives of NLRCth1 LINE and CTRT1 SINE retrotransposons first found in Chironomus thummi. More than 300 cloned PCR products were sequenced. The amplified region of the reverse transcriptase gene in the LINEs is intact and highly conserved, suggesting active elements. The SINEs are less conserved, consistent with minimal/no selection after transposition. A mitochondrial gene phylogeny resolves the Chironomus genus into six lineages (Guryev et al. 2001). LINE and SINE phylogenies resolve five of these lineages, indicating their monophyletic origin and vertical inheritance. However, both the LINE and the SINE tree topologies differ from the species phylogeny, resolving the elements into "clusters I-IV" and "cluster V" families. The data suggest a descent of all LINE and SINE subfamilies from two major families. Based on the species phylogeny, a few LINEs and a larger number of SINEs are cladistically misplaced. Most misbranch with LINEs or SINEs from species with the same families of elements. From sequence comparisons, cladistically misplaced LINEs and several misplaced SINEs arose by convergent base substitutions. More diverged SINEs result from early transposition and some are derived from multiple source SINEs in the same species. SINEs from two species (C. dorsalis, C. pallidivittatus), expected to belong to the clusters I-IV family, branch instead with cluster V family SINEs; apparently both families predate separation of cluster V from clusters I-IV species. Correlation of the distribution of active SINEs and LINEs, as well as similar 3' sequence motifs in CTRT1 and NLRCth1, suggests coevolving retrotransposon pairs in which CTRT1 transposition depends on enzymes active during NLRCth1 LINE mobility.

  3. Identification of conserved pathways of DNA-damage response and radiation protection by genome-wide RNAi.

    PubMed

    van Haaften, Gijs; Romeijn, Ron; Pothof, Joris; Koole, Wouter; Mullenders, Leon H F; Pastink, Albert; Plasterk, Ronald H A; Tijsterman, Marcel

    2006-07-11

    Ionizing radiation is extremely harmful for human cells, and DNA double-strand breaks (DSBs) are considered to be the main cytotoxic lesions induced. Improper processing of DSBs contributes to tumorigenesis, and mutations in DSB response genes underlie several inherited disorders characterized by cancer predisposition. Here, we performed a comprehensive screen for genes that protect animal cells against ionizing radiation. A total of 45 C. elegans genes were identified in a genome-wide RNA interference screen for increased sensitivity to ionizing radiation in germ cells. These genes include orthologs of well-known human cancer predisposition genes as well as novel genes, including human disease genes not previously linked to defective DNA-damage responses. Knockdown of eleven genes also impaired radiation-induced cell-cycle arrest, and seven genes were essential for apoptosis upon exposure to irradiation. The gene set was further clustered on the basis of increased sensitivity to DNA-damaging cancer drugs cisplatin and camptothecin. Almost all genes are conserved across animal phylogeny, and their relevance for humans was directly demonstrated by showing that their knockdown in human cells results in radiation sensitivity, indicating that this set of genes is important for future cancer profiling and drug development.

  4. Are Pericentric Inversions Reorganizing Wedge Shell Genomes?

    PubMed Central

    García-Souto, Daniel; Pérez-García, Concepción

    2017-01-01

    Wedge shells belonging to the Donacidae family are the dominant bivalves in exposed beaches in almost all areas of the world. Typically, two or more sympatric species of wedge shells differentially occupy intertidal, sublittoral, and offshore coastal waters in any given locality. A molecular cytogenetic analysis of two sympatric and closely related wedge shell species, Donax trunculus and Donax vittatus, was performed. Results showed that the karyotypes of these two species were both strikingly different and closely alike; whilst metacentric and submetacentric chromosome pairs were the main components of the karyotype of D. trunculus, 10–11 of the 19 chromosome pairs were telocentric in D. vittatus, most likely as a result of different pericentric inversions. GC-rich heterochromatic bands were present in both species. Furthermore, they showed coincidental 45S ribosomal RNA (rRNA), 5S rRNA and H3 histone gene clusters at conserved chromosomal locations, although D. trunculus had an additional 45S rDNA cluster. Intraspecific pericentric inversions were also detected in both D. trunculus and D. vittatus. The close genetic similarity of these two species together with the high degree of conservation of the 45S rRNA, 5S rRNA and H3 histone gene clusters, and GC-rich heterochromatic bands indicate that pericentric inversions contribute to the karyotype divergence in wedge shells. PMID:29215567

  5. The cytosolic Fe-S cluster assembly component MET18 is required for the full enzymatic activity of ROS1 in active DNA demethylation.

    PubMed

    Wang, Xiaokang; Li, Qi; Yuan, Wei; Cao, Zhendong; Qi, Bei; Kumar, Suresh; Li, Yan; Qian, Weiqiang

    2016-05-19

    DNA methylation patterns in plants are dynamically regulated by DNA methylation and active DNA demethylation in response to both environmental changes and development of plant. Beginning with the removal of methylated cytosine by ROS1/DME family of 5-methylcytosine DNA glycosylases, active DNA demethylation in plants occurs through base excision repair. So far, many components involved in active DNA demethylation remain undiscovered. Through a forward genetic screening of Arabidopsis mutants showing DNA hypermethylation at the EPF2 promoter region, we identified the conserved iron-sulfur cluster assembly protein MET18. MET18 dysfunction caused DNA hypermethylation at more than 1000 loci as well as the silencing of reporter genes and some endogenous genes. MET18 can directly interact with ROS1 in vitro and in vivo. ROS1 activity was reduced in the met18 mutant plants and point mutation in the conserved Fe-S cluster binding motif of ROS1 disrupted its biological function. Interestingly, a large number of DNA hypomethylated loci, especially in the CHH context, were identified from the met18 mutants and most of the hypo-DMRs were from TE regions. Our results suggest that MET18 can regulate both active DNA demethylation and DNA methylation pathways in Arabidopsis.

  6. The cytosolic Fe-S cluster assembly component MET18 is required for the full enzymatic activity of ROS1 in active DNA demethylation

    PubMed Central

    Wang, Xiaokang; Li, Qi; Yuan, Wei; Cao, Zhendong; Qi, Bei; Kumar, Suresh; Li, Yan; Qian, Weiqiang

    2016-01-01

    DNA methylation patterns in plants are dynamically regulated by DNA methylation and active DNA demethylation in response to both environmental changes and development of plant. Beginning with the removal of methylated cytosine by ROS1/DME family of 5-methylcytosine DNA glycosylases, active DNA demethylation in plants occurs through base excision repair. So far, many components involved in active DNA demethylation remain undiscovered. Through a forward genetic screening of Arabidopsis mutants showing DNA hypermethylation at the EPF2 promoter region, we identified the conserved iron-sulfur cluster assembly protein MET18. MET18 dysfunction caused DNA hypermethylation at more than 1000 loci as well as the silencing of reporter genes and some endogenous genes. MET18 can directly interact with ROS1 in vitro and in vivo. ROS1 activity was reduced in the met18 mutant plants and point mutation in the conserved Fe-S cluster binding motif of ROS1 disrupted its biological function. Interestingly, a large number of DNA hypomethylated loci, especially in the CHH context, were identified from the met18 mutants and most of the hypo-DMRs were from TE regions. Our results suggest that MET18 can regulate both active DNA demethylation and DNA methylation pathways in Arabidopsis. PMID:27193999

  7. The ABC transporter Tba of Amycolatopsis balhimycina is required for efficient export of the glycopeptide antibiotic balhimycin.

    PubMed

    Menges, R; Muth, G; Wohlleben, W; Stegmann, E

    2007-11-01

    All known gene clusters for glycopeptide antibiotic biosynthesis contain a conserved gene supposed to encode an ABC-transporter. In the balhimycin-producer Amycolatopsis balhimycina this gene (tba) is localised between the prephenate dehydrogenase gene pdh and the peptide synthetase gene bpsA. Inactivation of tba in A. balhimycina by gene replacement did not interfere with growth and did not affect balhimycin resistance. However, in the supernatant of the tba mutant RM43 less balhimycin was accumulated compared to the wild type; and the intra-cellular balhimycin concentration was ten times higher in the tba mutant RM43 than in the wild type. These data suggest that the ABC transporter encoded in the balhimycin biosynthesis gene cluster is not involved in resistance but is required for the efficient export of the antibiotic. To elucidate the activity of Tba it was heterologously expressed in Escherichia coli with an N-terminal His-tag and purified by nickel chromatography. A photometric assay revealed that His(6)-Tba solubilised in dodecylmaltoside possesses ATPase activity, characteristic for ABC-transporters.

  8. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    PubMed

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
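
    The counts reported in this abstract can be cross-checked with a few lines of arithmetic. The sketch below only re-derives the non-redundant total and the EST coverage percentage from the stated figures; the variable names are illustrative and no clustering is performed.

```python
# Figures as stated in the abstract above.
contigs = 8503
singletons = 11954
reported_non_redundant = 20457

genes_hit = 1093        # predicted genes matched by at least one EST
genes_predicted = 1889  # predicted protein-encoding genes examined

# Contigs plus singletons should equal the non-redundant sequence count.
non_redundant = contigs + singletons

# Coverage as a percentage, rounded to one decimal place.
coverage_pct = round(100 * genes_hit / genes_predicted, 1)

print(non_redundant == reported_non_redundant)
print(coverage_pct)
```

    Both values agree with the abstract: 8503 + 11954 = 20457, and 1093 of 1889 is 57.9%.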

  9. Tunable regulation of CREB DNA binding activity couples genotoxic stress response and metabolism

    PubMed Central

    Kim, Sang Hwa; Trinh, Anthony T.; Larsen, Michele Campaigne; Mastrocola, Adam S.; Jefcoate, Colin R.; Bushel, Pierre R.; Tibbetts, Randal S.

    2016-01-01

    cAMP response element binding protein (CREB) is a key regulator of glucose metabolism and synaptic plasticity that is canonically regulated through recruitment of transcriptional coactivators. Here we show that phosphorylation of CREB on a conserved cluster of Ser residues (the ATM/CK cluster) by the DNA damage-activated protein kinase ataxia-telangiectasia-mutated (ATM), casein kinase 1 (CK1), and casein kinase 2 (CK2) positively and negatively regulates CREB-mediated transcription in a signal-dependent manner. In response to genotoxic stress, phosphorylation of the ATM/CK cluster inhibited CREB-mediated gene expression, DNA binding activity and chromatin occupancy proportional to the number of modified Ser residues. Paradoxically, substoichiometric, ATM-independent phosphorylation of the ATM/CK cluster potentiated bursts in CREB-mediated transcription by promoting recruitment of the CREB coactivator, cAMP-regulated transcriptional coactivators (CRTC2). Livers from mice expressing a non-phosphorylatable CREB allele failed to attenuate gluconeogenic genes in response to DNA damage or fully activate the same genes in response to glucagon. We propose that phosphorylation-dependent regulation of DNA binding activity evolved as a tunable mechanism to control CREB transcriptional output and promote metabolic homeostasis in response to rapidly changing environmental conditions. PMID:27431323

  10. Identification of a Conserved Non-Protein-Coding Genomic Element that Plays an Essential Role in Alphabaculovirus Pathogenesis

    PubMed Central

    Kikhno, Irina

    2014-01-01

    Highly homologous sequences 154–157 bp in length grouped under the name of “conserved non-protein-coding element” (CNE) were revealed in all of the sequenced genomes of baculoviruses belonging to the genus Alphabaculovirus. A CNE alignment led to the detection of a set of highly conserved nucleotide clusters that occupy strictly conserved positions in the CNE sequence. The significant length of the CNE and conservation of both its length and cluster architecture were identified as a combination of characteristics that make this CNE different from known viral non-coding functional sequences. The essential role of the CNE in the Alphabaculovirus life cycle was demonstrated through the use of a CNE-knockout Autographa californica multiple nucleopolyhedrovirus (AcMNPV) bacmid. It was shown that the essential function of the CNE was not mediated by the presumed expression activities of the protein- and non-protein-coding genes that overlap the AcMNPV CNE. On the basis of the presented data, the AcMNPV CNE was categorized as a complex-structured, polyfunctional genomic element involved in an essential DNA transaction that is associated with an undefined function of the baculovirus genome. PMID:24740153

  11. Discovery of a widely distributed toxin biosynthetic gene cluster

    PubMed Central

    Lee, Shaun W.; Mitchell, Douglas A.; Markley, Andrew L.; Hensler, Mary E.; Gonzalez, David; Wohlrab, Aaron; Dorrestein, Pieter C.; Nizet, Victor; Dixon, Jack E.

    2008-01-01

    Bacteriocins represent a large family of ribosomally produced peptide antibiotics. Here we describe the discovery of a widely conserved biosynthetic gene cluster for the synthesis of thiazole and oxazole heterocycles on ribosomally produced peptides. These clusters encode a toxin precursor and all necessary proteins for toxin maturation and export. Using the toxin precursor peptide and heterocycle-forming synthetase proteins from the human pathogen Streptococcus pyogenes, we demonstrate the in vitro reconstitution of streptolysin S activity. We provide evidence that the synthetase enzymes, as predicted from our bioinformatics analysis, introduce heterocycles onto precursor peptides, thereby providing molecular insight into the chemical structure of streptolysin S. Furthermore, our studies reveal that the synthetase exhibits relaxed substrate specificity and modifies toxin precursors from both related and distant species. Given our findings, it is likely that the discovery of similar peptidic toxins will rapidly expand to existing and emerging genomes. PMID:18375757

  12. The Mouse Solitary Odorant Receptor Gene Promoters as Models for the Study of Odorant Receptor Gene Choice.

    PubMed

    Degl'Innocenti, Andrea; Parrilla, Marta; Harr, Bettina; Teschke, Meike

    2016-01-01

    In vertebrates, several anatomical regions located within the nasal cavity mediate olfaction. Among these, the main olfactory epithelium detects most conventional odorants. Olfactory sensory neurons, provided with cilia exposed to the air, detect volatile chemicals via an extremely large family of seven-transmembrane chemoreceptors named odorant receptors. Their genes are expressed in a monogenic and monoallelic fashion: a single allele of a single odorant receptor gene is transcribed in a given mature neuron, through a still uncharacterized molecular mechanism known as odorant receptor gene choice. Odorant receptor genes are typically arranged in genomic clusters, but a few are isolated (we call them solitary) from the others within a region broader than 1 Mb upstream and downstream with respect to their transcript's coordinates. The study of clustered genes is problematic, because of redundancy and ambiguities in their regulatory elements: we propose to use the solitary genes as simplified models to understand odorant receptor gene choice. Here we define number and identity of the solitary genes in the mouse genome (C57BL/6J), and assess the conservation of the solitary status in some mammalian orthologs. Furthermore, we locate their putative promoters, predict their homeodomain binding sites (commonly present in the promoters of odorant receptor genes) and compare candidate promoter sequences with those of wild-caught mice. We also provide expression data from histological sections. In the mouse genome there are eight intact solitary genes: Olfr19 (M12), Olfr49, Olfr266, Olfr267, Olfr370, Olfr371, Olfr466, Olfr1402; five are conserved as solitary in rat. These genes are all expressed in the main olfactory epithelium of three-day-old mice. The C57BL/6J candidate promoter of Olfr370 has considerably varied compared to its wild-type counterpart. Within the putative promoter for Olfr266 a homeodomain binding site is predicted. As a whole, our findings favor Olfr266 as a model gene to investigate odorant receptor gene choice.
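
    The notion of a “solitary” gene used in this abstract can be sketched as a simple interval test. The sketch assumes a gene is solitary when no other gene of the family overlaps a window extending a fixed flank on either side of its coordinates; the 1 Mb default, the Gene record, the toy coordinates, and the function name solitary_genes are all hypothetical and are not the authors' pipeline.

```python
from typing import List, NamedTuple


class Gene(NamedTuple):
    name: str
    chrom: str
    start: int  # transcript start coordinate (bp)
    end: int    # transcript end coordinate (bp)


def solitary_genes(genes: List[Gene], flank: int = 1_000_000) -> List[str]:
    """Return names of genes with no other gene on the same chromosome
    overlapping the window [start - flank, end + flank]."""
    result = []
    for g in genes:
        lo, hi = g.start - flank, g.end + flank
        has_neighbor = any(
            other is not g
            and other.chrom == g.chrom
            and other.start <= hi
            and other.end >= lo
            for other in genes
        )
        if not has_neighbor:
            result.append(g.name)
    return result


# Hypothetical toy coordinates, not real mouse annotations.
toy = [
    Gene("geneA", "chr1", 1_000_000, 1_001_000),
    Gene("geneB", "chr1", 1_500_000, 1_501_000),  # within 1 Mb of geneA
    Gene("geneC", "chr1", 5_000_000, 5_001_000),  # isolated on chr1
    Gene("geneD", "chr2", 1_000_000, 1_001_000),  # only gene on chr2
]
solitary = solitary_genes(toy)
print(solitary)
```

    The pairwise scan is quadratic in the number of genes, which is adequate for a family of this size; sorting by coordinate would be the natural refinement for larger inputs.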

  13. COGNAT: a web server for comparative analysis of genomic neighborhoods.

    PubMed

    Klimchuk, Olesya I; Konovalov, Kirill A; Perekhvatov, Vadim V; Skulachev, Konstantin V; Dibrova, Daria V; Mulkidjanian, Armen Y

    2017-11-22

    In prokaryotic genomes, functionally coupled genes can be organized in conserved gene clusters enabling their coordinated regulation. Such clusters could contain one or several operons, which are groups of co-transcribed genes. Those genes that evolved from a common ancestral gene by speciation (i.e. orthologs) are expected to have similar genomic neighborhoods in different organisms, whereas those copies of the gene that are responsible for dissimilar functions (i.e. paralogs) could be found in dissimilar genomic contexts. Comparative analysis of genomic neighborhoods facilitates the prediction of co-regulated genes and helps to discern different functions in large protein families. We intended, building on the attribution of gene sequences to the clusters of orthologous groups of proteins (COGs), to provide a method for visualization and comparative analysis of genomic neighborhoods of evolutionary related genes, as well as a respective web server. Here we introduce the COmparative Gene Neighborhoods Analysis Tool (COGNAT), a web server for comparative analysis of genomic neighborhoods. The tool is based on the COG database, as well as the Pfam protein families database. As an example, we show the utility of COGNAT in identifying a new type of membrane protein complex that is formed by paralog(s) of one of the membrane subunits of the NADH:quinone oxidoreductase of type 1 (COG1009) and a cytoplasmic protein of unknown function (COG3002). This article was reviewed by Drs. Igor Zhulin, Uri Gophna and Igor Rogozin.

  14. [Construction of screening system for mutation of negative regulatory genes in Streptomyces].

    PubMed

    Zhu, Yu; Feng, Chi; Tan, Huarong; Tian, Yuqing

    2013-10-04

    We aimed to create a novel reporter system for screening mutations of negative regulatory genes, especially those repressing the expression of cryptic antibiotic clusters. We used a marker-free gene disruption strategy, which combines the "REDIRECT (Rapid Efficient Directed Recombination Time Saving)" technology with in vivo site-specific recombination by Streptomyces phage phiBT1 integrase, to construct a scbR2/inoA double mutant strain of S. coelicolor M145. This strain was used as the host of the reporter system. For the construction of the reporter plasmid, the ScbR2-repressed promoter of cpkO from the CPK (cryptic polyketide) cluster was used to drive the expression of a promoterless conserved gene, inoA, of S. coelicolor. The reporter plasmid was then introduced into the host strain described above to test the suitability of inoA as a reporter gene in this system. The scbR2/inoA double mutant strain gave rise to a bald phenotype on MM medium in the absence of inositol, and produced a yellow-pigmented secondary metabolite because disruption of scbR2 releases the repression of cpkO, a pathway-specific activator gene situated in the CPK cluster. After the reporter plasmid was introduced into this test strain, the resulting strain recovered the wild-type phenotype, indicating that the promoter of cpkO can drive the expression of inoA in the scbR2 mutant and consequently restore the biosynthesis of inositol. Our results indicated that inoA can be used as a novel reporter gene for Streptomyces, especially for detecting the activation of "silent" promoters. This reporter system might be useful for screening mutations of negative regulatory genes for cryptic secondary metabolic gene clusters.

  15. Cloning and Characterization of the Tetrocarcin A Gene Cluster from Micromonospora chalcea NRRL 11289 Reveals a Highly Conserved Strategy for Tetronate Biosynthesis in Spirotetronate Antibiotics

    PubMed Central

    Fang, Jie; Zhang, Yiping; Huang, Lijuan; Jia, Xinying; Zhang, Qi; Zhang, Xu; Tang, Gongli; Liu, Wen

    2008-01-01

    Tetrocarcin A (TCA), produced by Micromonospora chalcea NRRL 11289, is a spirotetronate antibiotic with potent antitumor activity and versatile modes of action. In this study, the biosynthetic gene cluster of TCA was cloned and localized to a 108-kb contiguous DNA region. In silico sequence analysis revealed 36 putative genes that constitute this cluster (including 11 for unusual sugar biosynthesis, 13 for aglycone formation, and 4 for glycosylations) and allowed us to propose the biosynthetic pathway of TCA. The formation of d-tetronitrose, l-amicetose, and l-digitoxose may begin with d-glucose-1-phosphate, share early enzymatic steps, and branch into different pathways by competitive actions of specific enzymes. Tetronolide biosynthesis involves the incorporation of a 3-C unit with a polyketide intermediate to form the characteristic spirotetronate moiety and trans-decalin system. Further substitution of tetronolide with five deoxysugars (one being a deoxynitrosugar) was likely due to the activities of four glycosyltransferases. In vitro characterization of the first enzymatic step by utilization of 1,3-biphosphoglycerate as the substrate and in vivo cross-complementation of the bifunctional fused gene tcaD3 (with the functions of chlD3 and chlD4) to ΔchlD3 and ΔchlD4 in chlorothricin biosynthesis supported the highly conserved tetronate biosynthetic strategy in the spirotetronate family. Deletion of a large DNA fragment encoding polyketide synthases resulted in a non-TCA-producing strain, providing a clear background for the identification of novel analogs. These findings provide insights into spirotetronate biosynthesis and demonstrate that combinatorial-biosynthesis methods can be applied to the TCA biosynthetic machinery to generate structural diversity. PMID:18586939

  16. Conservation of a vitellogenin gene cluster in oviparous vertebrates and identification of its traces in the platypus genome.

    PubMed

    Babin, Patrick J

    2008-04-30

    Vitellogenin (Vtg) derivatives are the main egg-yolk proteins in most oviparous animal species, and are, therefore, key players in reproduction and embryo development. Conserved synteny and phylogeny were used to identify a Vtg gene cluster (VGC) that had been evolutionarily conserved in most oviparous vertebrates, encompassing the three linked Vtgs on chicken (Gallus gallus) chromosome 8. Tandem arranged homologs to chicken VtgII and VtgIII were retrieved in similar locations in Xenopus (Xenopus tropicalis) and homologous transcribed inverted genes were found in medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), pufferfish (Takifugu rubripes), and Tetraodon (Tetraodon nigroviridis), while zebrafish (Danio rerio) Vtg3 may represent a residual trace of VGC in this genome. Vtgs were not conserved in the paralogous chromosomal segment attributed to a whole-genome duplication event in the ancestor of teleosts, while tandem duplicated forms have survived the recent African clawed frog (Xenopus laevis) tetraploidization. Orthologs to chicken VtgI were found in similar locations in teleost fish, as well as in the platypus (Ornithorhynchus anatinus). Additional Vtg fragments found suggested that VGC had been conserved in this egg-laying mammal. A low ratio of nonsynonymous-to-synonymous substitution values and the paucity of pseudogene features suggest functional platypus Vtg products. Genomic identification of Vtgs, Apob, and Mtp in this genome, together with maximum likelihood and Bayesian inference phylogenetic analyses, support the existence of these three large lipid transfer protein superfamily members at the base of the mammalian lineage. In conclusion, the establishment of a VGC in the vertebrate lineage predates the divergence of ray-finned fish and tetrapods and the shift in reproductive and developmental strategy observed between prototherians and therians may be associated with its loss, as shown by its absence from the genomic resources currently available from therians.

  17. Genetic organization of plasmid pXF51 from the plant pathogen Xylella fastidiosa.

    PubMed

    Marques, M V; da Silva, A M; Gomes, S L

    2001-05-01

    The sequence of plasmid pXF51 from the plant pathogen Xylella fastidiosa, the causal agent of citrus variegated chlorosis, has been analyzed. This plasmid codes for 65 open reading frames (ORFs), organized into four main regions, containing genes related to replication, mobilization, and conjugative transfer. Twenty-five ORFs have no counterparts in the public sequence databases, and 7 are similar to conserved hypothetical proteins from other bacteria. A pXF51 incompatibility group has not been determined, as we could not find a typical replication origin. One cluster of conjugation-related genes (trb) seems to be incomplete in pXF51, and a copy of this sequence is found in the chromosome, suggesting it was generated by a duplication event. A second cluster (tra) contains all genes necessary for conjugation transfer to occur, showing a conserved organization with other conjugative plasmids. An identifiable origin of transfer similar to oriT from IncP plasmids is found adjacent to genes encoding two mobilization proteins. None of the ORFs with putative assigned function could be predicted as having a role in pathogenesis, except for a virulence-associated protein D homolog. These results indicate that even though pXF51 appears not to have a direct role in Xylella pathogenesis, it is a conjugative plasmid that could be important for lateral gene transfer in this bacterium. This property may be of great importance for future development of transformation techniques in X. fastidiosa.

  18. Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants.

    PubMed

    Rawal, H C; Singh, N K; Sharma, T R

    2013-01-01

    Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future.

  19. Conservation, Divergence, and Genome-Wide Distribution of PAL and POX A Gene Families in Plants

    PubMed Central

    Rawal, H. C.; Singh, N. K.; Sharma, T. R.

    2013-01-01

    Genome-wide identification and phylogenetic and syntenic comparison were performed for the genes responsible for phenylalanine ammonia lyase (PAL) and peroxidase A (POX A) enzymes in nine plant species representing very diverse groups like legumes (Glycine max and Medicago truncatula), fruits (Vitis vinifera), cereals (Sorghum bicolor, Zea mays, and Oryza sativa), trees (Populus trichocarpa), and model dicot (Arabidopsis thaliana) and monocot (Brachypodium distachyon) species. A total of 87 and 1045 genes in PAL and POX A gene families, respectively, have been identified in these species. The phylogenetic and syntenic comparison along with motif distributions shows a high degree of conservation of PAL genes, suggesting that these genes may predate monocot/eudicot divergence. The POX A family genes, present in clusters at the subtelomeric regions of chromosomes, might be evolving and expanding with higher rate than the PAL gene family. Our analysis showed that during the expansion of POX A gene family, many groups and subgroups have evolved, resulting in a high level of functional divergence among monocots and dicots. These results will act as a first step toward the understanding of monocot/eudicot evolution and functional characterization of these gene families in the future. PMID:23671845

  20. Comparison of potential diatom 'barcode' genes (the 18S rRNA gene and ITS, COI, rbcL) and their effectiveness in discriminating and determining species taxonomy in the Bacillariophyta.

    PubMed

    Guo, Liliang; Sui, Zhenghong; Zhang, Shu; Ren, Yuanyuan; Liu, Yuan

    2015-04-01

    Diatoms form an enormous group of photoautotrophic micro-eukaryotes and play a crucial role in marine ecology. In this study, we evaluated typical genes to determine whether they were effective at different levels of diatom clustering analysis to assess the potential of these regions for barcoding taxa. Our test genes included nuclear rRNA genes (the nuclear small-subunit rRNA gene and the 5.8S rRNA gene+ITS-2), a mitochondrial gene (cytochrome c-oxidase subunit 1, COI), a chloroplast gene [ribulose-1,5-biphosphate carboxylase/oxygenase large subunit (rbcL)] and the universal plastid amplicon (UPA). Calculated genetic divergence was highest for the internal transcribed spacer (ITS; 5.8S+ITS-2) (p-distance of 1.569, 85.84% parsimony-informative sites) and COI (6.084, 82.14%), followed by the 18S rRNA gene (0.139, 57.69%), rbcL (0.120, 42.01%) and UPA (0.050, 14.97%), which indicated that ITS and COI were highly divergent compared with the other tested genes, and that their nucleotide compositions were variable within the whole group of diatoms. Bayesian inference (BI) analysis showed that the phylogenetic trees generated from each gene clustered diatoms at different phylogenetic levels. The 18S rRNA gene was better than the other genes in clustering higher diatom taxa, and both the 18S rRNA gene and rbcL performed well in clustering some lower taxa. The COI region was able to barcode species of some genera within the Bacillariophyceae. ITS was a potential marker for DNA based-taxonomy and DNA barcoding of Thalassiosirales, while species of Cyclotella, Skeletonema and Stephanodiscus gathered in separate clades, and were paraphyletic with those of Thalassiosira. Finally, UPA was too conserved to serve as a diatom barcode. © 2015 IUMS.

  1. Are Hox genes ancestrally involved in axial patterning? Evidence from the hydrozoan Clytia hemisphaerica (Cnidaria).

    PubMed

    Chiori, Roxane; Jager, Muriel; Denker, Elsa; Wincker, Patrick; Da Silva, Corinne; Le Guyader, Hervé; Manuel, Michaël; Quéinnec, Eric

    2009-01-01

    The early evolution and diversification of Hox-related genes in eumetazoans has been the subject of conflicting hypotheses concerning the evolutionary conservation of their role in axial patterning and the pre-bilaterian origin of the Hox and ParaHox clusters. The diversification of Hox/ParaHox genes clearly predates the origin of bilaterians. However, the existence of a "Hox code" predating the cnidarian-bilaterian ancestor and supporting the deep homology of axes is more controversial. This assumption was mainly based on the interpretation of Hox expression data from the sea anemone, but growing evidence from other cnidarian taxa puts into question this hypothesis. Hox, ParaHox and Hox-related genes have been investigated here by phylogenetic analysis and in situ hybridisation in Clytia hemisphaerica, an hydrozoan species with medusa and polyp stages alternating in the life cycle. Our phylogenetic analyses do not support an origin of ParaHox and Hox genes by duplication of an ancestral ProtoHox cluster, and reveal a diversification of the cnidarian HOX9-14 genes into three groups called A, B, C. Among the 7 examined genes, only those belonging to the HOX9-14 and the CDX groups exhibit a restricted expression along the oral-aboral axis during development and in the planula larva, while the others are expressed in very specialised areas at the medusa stage. Cross species comparison reveals a strong variability of gene expression along the oral-aboral axis and during the life cycle among cnidarian lineages. The most parsimonious interpretation is that the Hox code, collinearity and conservative role along the antero-posterior axis are bilaterian innovations.

  2. Are Hox Genes Ancestrally Involved in Axial Patterning? Evidence from the Hydrozoan Clytia hemisphaerica (Cnidaria)

    PubMed Central

    Chiori, Roxane; Jager, Muriel; Denker, Elsa; Wincker, Patrick; Da Silva, Corinne; Le Guyader, Hervé; Manuel, Michaël; Quéinnec, Eric

    2009-01-01

    Background: The early evolution and diversification of Hox-related genes in eumetazoans has been the subject of conflicting hypotheses concerning the evolutionary conservation of their role in axial patterning and the pre-bilaterian origin of the Hox and ParaHox clusters. The diversification of Hox/ParaHox genes clearly predates the origin of bilaterians. However, the existence of a “Hox code” predating the cnidarian-bilaterian ancestor and supporting the deep homology of axes is more controversial. This assumption was mainly based on the interpretation of Hox expression data from the sea anemone, but growing evidence from other cnidarian taxa puts into question this hypothesis. Methodology/Principal Findings: Hox, ParaHox and Hox-related genes have been investigated here by phylogenetic analysis and in situ hybridisation in Clytia hemisphaerica, an hydrozoan species with medusa and polyp stages alternating in the life cycle. Our phylogenetic analyses do not support an origin of ParaHox and Hox genes by duplication of an ancestral ProtoHox cluster, and reveal a diversification of the cnidarian HOX9-14 genes into three groups called A, B, C. Among the 7 examined genes, only those belonging to the HOX9-14 and the CDX groups exhibit a restricted expression along the oral-aboral axis during development and in the planula larva, while the others are expressed in very specialised areas at the medusa stage. Conclusions/Significance: Cross species comparison reveals a strong variability of gene expression along the oral-aboral axis and during the life cycle among cnidarian lineages. The most parsimonious interpretation is that the Hox code, collinearity and conservative role along the antero-posterior axis are bilaterian innovations. PMID:19156208

  3. Genome-Wide Analysis of Secondary Metabolite Gene Clusters in Ophiostoma ulmi and Ophiostoma novo-ulmi Reveals a Fujikurin-Like Gene Cluster with a Putative Role in Infection.

    PubMed

    Sbaraini, Nicolau; Andreis, Fábio C; Thompson, Claudia E; Guedes, Rafael L M; Junges, Ângela; Campos, Thais; Staats, Charley C; Vainstein, Marilene H; Ribeiro de Vasconcelos, Ana T; Schrank, Augusto

    2017-01-01

    The emergence of new microbial pathogens can result in destructive outbreaks, since their hosts have limited resistance and pathogens may be excessively aggressive. Described as the major ecological incident of the twentieth century, Dutch elm disease, caused by ascomycete fungi from the Ophiostoma genus, has caused a significant decline in elm tree populations (Ulmus sp.) in North America and Europe. Genome sequencing of the two main causative agents of Dutch elm disease (Ophiostoma ulmi and Ophiostoma novo-ulmi), along with closely related species with different lifestyles, allows for unique comparisons to be made to identify how pathogens and virulence determinants have emerged. Among several established virulence determinants, secondary metabolites (SMs) have been suggested to play significant roles during phytopathogen infection. Interestingly, the secondary metabolism of Dutch elm pathogens remains almost unexplored, and little is known about how SM biosynthetic genes are organized in these species. To better understand the metabolic potential of O. ulmi and O. novo-ulmi, we performed a deep survey and description of SM biosynthetic gene clusters (BGCs) in these species and assessed their conservation among eight species from the Ophiostomataceae family. Among 19 identified BGCs, a fujikurin-like gene cluster (OpPKS8) was unique to Dutch elm pathogens. Phylogenetic analysis revealed that orthologs for this gene cluster are widespread among phytopathogens and plant-associated fungi, suggesting that OpPKS8 may have been horizontally acquired by the Ophiostoma genus. Moreover, the detailed identification of several BGCs paves the way for future in-depth research and supports the potential impact of secondary metabolism on Ophiostoma genus' lifestyle.

  4. Heterogeneous conservation of Dlx paralog co-expression in jawed vertebrates.

    PubMed

    Debiais-Thibaud, Mélanie; Metcalfe, Cushla J; Pollack, Jacob; Germon, Isabelle; Ekker, Marc; Depew, Michael; Laurenti, Patrick; Borday-Birraux, Véronique; Casane, Didier

    2013-01-01

    The Dlx gene family encodes transcription factors involved in the development of a wide variety of morphological innovations that first evolved at the origins of vertebrates or of the jawed vertebrates. This gene family expanded with the two rounds of genome duplications that occurred before jawed vertebrates diversified. It includes at least three bigene pairs sharing conserved regulatory sequences in tetrapods and teleost fish, but has been only partially characterized in chondrichthyans, the third major group of jawed vertebrates. Here we take advantage of developmental and molecular tools applied to the shark Scyliorhinus canicula to fill in the gap and provide an overview of the evolution of the Dlx family in the jawed vertebrates. These results are analyzed in the theoretical framework of the DDC (Duplication-Degeneration-Complementation) model. The genomic organisation of the catshark Dlx genes is similar to that previously described for tetrapods. Conserved non-coding elements identified in bony fish were also identified in catshark Dlx clusters and showed regulatory activity in transgenic zebrafish. Gene expression patterns in the catshark showed that there are some expression sites with high conservation of the expressed paralog(s) and other expression sites with events of paralog sub-functionalization during jawed vertebrate diversification, resulting in a wide variety of evolutionary scenarios within this gene family. Dlx gene expression patterns in the catshark show that there has been little neo-functionalization in Dlx genes over gnathostome evolution. In most cases, one tandem duplication and two rounds of vertebrate genome duplication have led to at least six Dlx coding sequences with redundant expression patterns followed by some instances of paralog sub-functionalization. Regulatory constraints such as shared enhancers, and functional constraints including gene pleiotropy, may have contributed to the evolutionary inertia leading to high redundancy between gene expression patterns.

  5. Phylum-wide comparative genomics unravel the diversity of secondary metabolism in Cyanobacteria

    DOE PAGES

    Calteau, Alexandra; Fewer, David P.; Latifi, Amel; ...

    2014-11-18

    Cyanobacteria are an ancient lineage of photosynthetic bacteria from which hundreds of natural products have been described, including many notorious toxins but also potent natural products of interest to the pharmaceutical and biotechnological industries. Many of these compounds are the products of non-ribosomal peptide synthetase (NRPS) or polyketide synthase (PKS) pathways. However, current understanding of the diversification of these pathways is largely based on the chemical structure of the bioactive compounds, while the evolutionary forces driving their remarkable chemical diversity are poorly understood. We carried out a phylum-wide investigation of genetic diversification of the cyanobacterial NRPS and PKS pathways for the production of bioactive compounds. 452 NRPS and PKS gene clusters were identified from 89 cyanobacterial genomes, revealing a clear burst in late-branching lineages. Our genomic analysis further grouped the clusters into 286 highly diversified cluster families (CF) of pathways. Some CFs appeared vertically inherited, while others presented a more complex evolutionary history. Only a few horizontal gene transfers were evidenced amongst strongly conserved CFs in the phylum, while several others have undergone drastic gene shuffling events, which could result in the observed diversification of the pathways. In addition to toxin production, several NRPS and PKS gene clusters are devoted to important cellular processes of these bacteria such as nitrogen fixation and iron uptake. The majority of the biosynthetic clusters identified here have unknown end products, highlighting the power of genome mining for the discovery of new natural products.

  6. Phylum-wide comparative genomics unravel the diversity of secondary metabolism in Cyanobacteria

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Calteau, Alexandra; Fewer, David P.; Latifi, Amel

    Cyanobacteria are an ancient lineage of photosynthetic bacteria from which hundreds of natural products have been described, including many notorious toxins but also potent natural products of interest to the pharmaceutical and biotechnological industries. Many of these compounds are the products of non-ribosomal peptide synthetase (NRPS) or polyketide synthase (PKS) pathways. However, current understanding of the diversification of these pathways is largely based on the chemical structure of the bioactive compounds, while the evolutionary forces driving their remarkable chemical diversity are poorly understood. We carried out a phylum-wide investigation of genetic diversification of the cyanobacterial NRPS and PKS pathways for the production of bioactive compounds. 452 NRPS and PKS gene clusters were identified from 89 cyanobacterial genomes, revealing a clear burst in late-branching lineages. Our genomic analysis further grouped the clusters into 286 highly diversified cluster families (CF) of pathways. Some CFs appeared vertically inherited, while others presented a more complex evolutionary history. Only a few horizontal gene transfers were evidenced amongst strongly conserved CFs in the phylum, while several others have undergone drastic gene shuffling events, which could result in the observed diversification of the pathways. In addition to toxin production, several NRPS and PKS gene clusters are devoted to important cellular processes of these bacteria such as nitrogen fixation and iron uptake. The majority of the biosynthetic clusters identified here have unknown end products, highlighting the power of genome mining for the discovery of new natural products.

  7. Leukocyte common antigen-related phosphatase (LRP) gene structure: Conservation of the genomic organization of transmembrane protein tyrosine phosphatases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wong, E.C.C.; Mullersman, J.E.; Thomas, M.L.

    1993-07-01

    The leukocyte common antigen-related protein tyrosine phosphatase (LRP) is a widely expressed transmembrane glycoprotein thought to be involved in cell growth and differentiation. Similar to most other transmembrane protein tyrosine phosphatases, LRP contains two tandem cytoplasmic phosphatase domains. To understand further the regulation and evolution of LRP, the authors have isolated and characterized mouse λ genomic clones. Thirteen genomic clones could be divided into two non-overlapping clusters. The first cluster contained the transcription initiation site and the exon encoding most of the 5′ untranslated region. The second cluster contained the remaining exons encoding the protein and the 3′ untranslated region. The gene consists of 22 exons spanning over 75 kb. The distance between exon 1 and exon 2 is at least 25 kb. Characterization of the 5′ ends of LRP mRNA by S1 nuclease protection identifies putative initiation start sites within a G/C-rich region. The upstream region does not contain a TATA box. Comparison of the LRP gene structure to the mammalian protein tyrosine phosphatase gene, CD45, shows striking similarities in size and genomic organization. 29 refs., 5 figs., 1 tab.

  8. Rapid Detection of Positive Selection in Genes and Genomes Through Variation Clusters

    PubMed Central

    Wagner, Andreas

    2007-01-01

    Positive selection in genes and genomes can point to the evolutionary basis for differences among species and among races within a species. The detection of positive selection can also help identify functionally important protein regions and thus guide protein engineering. Many existing tests for positive selection are excessively conservative, vulnerable to artifacts caused by demographic population history, or computationally very intensive. I here propose a simple and rapid test that is complementary to existing tests and that overcomes some of these problems. It relies on the null hypothesis that neutrally evolving DNA regions should show a Poisson distribution of nucleotide substitutions. The test detects significant deviations from this expectation in the form of variation clusters, highly localized groups of amino acid changes in a coding region. In applying this test to several thousand human–chimpanzee gene orthologs, I show that such variation clusters are not generally caused by relaxed selection. They occur in well-defined domains of a protein's tertiary structure and show a large excess of amino acid replacement over silent substitutions. I also identify multiple new human–chimpanzee orthologs subject to positive selection, among them genes that are involved in reproductive functions, immune defense, and the nervous system. PMID:17603100
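
    The null hypothesis in the record above (substitutions in a neutrally evolving region follow a Poisson distribution) can be illustrated with a short sketch. This is not the author's published test: the fixed sliding window, the uniform-rate assumption, and the function names poisson_tail and densest_window_pvalue are all hypothetical choices made for illustration, and no multiple-testing correction is applied.

```python
import math

def poisson_tail(k, lam):
    """Return P(X >= k) for X ~ Poisson(lam)."""
    if k <= 0:
        return 1.0
    cdf = sum(math.exp(-lam) * lam**i / math.factorial(i) for i in range(k))
    return max(0.0, 1.0 - cdf)

def densest_window_pvalue(positions, seq_len, window):
    """Find the window holding the most substitutions and score it.

    Assumes (hypothetically) that substitutions fall uniformly along the
    sequence, so a window of `window` sites expects
    len(positions) * window / seq_len substitutions.
    Returns (count, window_start, uncorrected Poisson tail probability).
    """
    lam = len(positions) * window / seq_len
    pos = sorted(positions)
    best_count, best_start = 0, 0
    for i, start in enumerate(pos):
        # Count substitutions inside [start, start + window).
        count = sum(1 for p in pos[i:] if p < start + window)
        if count > best_count:
            best_count, best_start = count, start
    return best_count, best_start, poisson_tail(best_count, lam)

# Illustrative positions only: four changes packed into ten sites of 1000.
count, start, p = densest_window_pvalue([10, 12, 14, 15, 300, 700], 1000, 10)
```

    A small tail probability for the densest window flags a candidate variation cluster; a real analysis would still need to correct for the number of windows scanned.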

  9. The WRKY Transcription Factor Genes in Lotus japonicus

    PubMed Central

    Wang, Pengfei; Wang, Xingjun

    2014-01-01

    WRKY transcription factor genes play critical roles in plant growth and development, as well as stress responses. WRKY genes have been examined in various higher plants, but they have not been characterized in Lotus japonicus. The recent release of the L. japonicus whole genome sequence provides an opportunity for a genome wide analysis of WRKY genes in this species. In this study, we identified 61 WRKY genes in the L. japonicus genome. Based on the WRKY protein structure, L. japonicus WRKY (LjWRKY) genes can be classified into three groups (I–III). Investigations of gene copy number and gene clusters indicate that only one gene duplication event occurred on chromosome 4 and no clustered genes were detected on chromosomes 3 or 6. Researchers previously believed that group II and III WRKY domains were derived from the C-terminal WRKY domain of group I. Our results suggest that some WRKY genes in group II originated from the N-terminal domain of group I WRKY genes. Additional evidence to support this hypothesis was obtained by Medicago truncatula WRKY (MtWRKY) protein motif analysis. We found that LjWRKY and MtWRKY group III genes are under purifying selection, suggesting that WRKY genes will become increasingly structured and functionally conserved. PMID:24745006

  10. Variation in conserved non-coding sequences on chromosome 5q and susceptibility to asthma and atopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Donfack, Joseph; Schneider, Daniel H.; Tan, Zheng

    2005-09-10

    Background: Evolutionarily conserved sequences likely have biological function. Methods: To determine whether variation in conserved sequences in non-coding DNA contributes to risk for human disease, we studied six conserved non-coding elements in the Th2 cytokine cluster on human chromosome 5q31 in a large Hutterite pedigree and in samples of outbred European American and African American asthma cases and controls. Results: Among six conserved non-coding elements (>100 bp, >70 percent identity; human-mouse comparison), we identified one single nucleotide polymorphism (SNP) in each of two conserved elements and six SNPs in the flanking regions of three conserved elements. We genotyped our samples for four of these SNPs and an additional three SNPs each in the IL13 and IL4 genes. While there was only modest evidence for association with single SNPs in the Hutterite and European American samples (P<0.05), there were highly significant associations in European Americans between asthma and haplotypes comprised of SNPs in the IL4 gene (P<0.001), including a SNP in a conserved non-coding element. Furthermore, variation in the IL13 gene was strongly associated with total IgE (P = 0.00022) and allergic sensitization to mold allergens (P = 0.00076) in the Hutterites, and more modestly associated with sensitization to molds in the European Americans and African Americans (P<0.01). Conclusion: These results indicate that there is overall little variation in the conserved non-coding elements on 5q31, but variation in IL4 and IL13, including possibly one SNP in a conserved element, influence asthma and atopic phenotypes in diverse populations.

  11. Transcription of Two Adjacent Carbohydrate Utilization Gene Clusters in Bifidobacterium breve UCC2003 Is Controlled by LacI- and Repressor Open Reading Frame Kinase (ROK)-Type Regulators

    PubMed Central

    O'Connell, Kerry Joan; O'Connell Motherway, Mary; Liedtke, Andrea; Fitzgerald, Gerald F.; Ross, R. Paul; Stanton, Catherine; Zomer, Aldert

    2014-01-01

    Members of the genus Bifidobacterium are commonly found in the gastrointestinal tracts of mammals, including humans, where their growth is presumed to be dependent on various diet- and/or host-derived carbohydrates. To understand transcriptional control of bifidobacterial carbohydrate metabolism, we investigated two genetic carbohydrate utilization clusters dedicated to the metabolism of raffinose-type sugars and melezitose. Transcriptomic and gene inactivation approaches revealed that the raffinose utilization system is positively regulated by an activator protein, designated RafR. The gene cluster associated with melezitose metabolism was shown to be subject to direct negative control by a LacI-type transcriptional regulator, designated MelR1, in addition to apparent indirect negative control by means of a second LacI-type regulator, MelR2. In silico analysis, DNA-protein interaction, and primer extension studies revealed the MelR1 and MelR2 operator sequences, each of which is positioned just upstream of or overlapping the correspondingly regulated promoter sequences. Similar analyses identified the RafR binding operator sequence located upstream of the rafB promoter. This study indicates that transcriptional control of gene clusters involved in carbohydrate metabolism in bifidobacteria is subject to conserved regulatory systems, representing either positive or negative control. PMID:24705323

  12. A Gibbs sampler for motif detection in phylogenetically close sequences

    NASA Astrophysics Data System (ADS)

    Siddharthan, Rahul; van Nimwegen, Erik; Siggia, Eric

    2004-03-01

    Genes are regulated by transcription factors that bind to DNA upstream of genes and recognize short conserved "motifs" in a random intergenic "background". Motif-finders such as the Gibbs sampler compare the probability of these short sequences being represented by "weight matrices" to the probability of their arising from the background "null model", and explore this space (analogous to a free-energy landscape). But closely related species may show conservation not because of functional sites but simply because they have not had sufficient time to diverge, so conventional methods will fail. We introduce a new Gibbs sampler algorithm that accounts for common ancestry when searching for motifs, while requiring minimal "prior" assumptions on the number and types of motifs, assessing the significance of detected motifs by "tracking" clusters that stay together. We apply this scheme to motif detection in sporulation-cycle genes in the yeast S. cerevisiae, using recent sequences of other closely-related Saccharomyces species.
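
    The comparison at the core of conventional motif-finders, as described in the record above, is between the probability of a short sequence under a weight matrix and its probability under a background model. A minimal sketch of that log-odds score follows. It covers only the scoring step, not the phylogeny-aware sampler the authors introduce; the function name log_odds_score, the three-position matrix, and the uniform background are hypothetical values chosen for illustration.

```python
import math

def log_odds_score(site, weight_matrix, background):
    """Log-likelihood ratio of `site` under a weight matrix vs. a background model."""
    if len(site) != len(weight_matrix):
        raise ValueError("site length must match the number of matrix columns")
    return sum(
        math.log(column[base] / background[base])
        for base, column in zip(site, weight_matrix)
    )

# Hypothetical uniform background and 3-position motif (illustrative values only).
background = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
weight_matrix = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
]

motif_like = log_odds_score("AGC", weight_matrix, background)       # positive score
background_like = log_odds_score("TTT", weight_matrix, background)  # negative score
```

    A positive score means the site is better explained by the motif than by the background. A Gibbs sampler would resample site positions in proportion to such likelihood ratios while re-estimating the matrix.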

  13. Genome Sequencing of Ralstonia solanacearum CQPS-1, a Phylotype I Strain Collected from a Highland Area with Continuous Cropping of Tobacco

    PubMed Central

    Liu, Ying; Tang, Yuanman; Qin, Xiyun; Yang, Liang; Jiang, Gaofei; Li, Shili; Ding, Wei

    2017-01-01

    Ralstonia solanacearum, an agent of bacterial wilt, is a highly variable species with a broad host range and wide geographic distribution. As a species complex, it has extensive genetic diversity and its living environment is polymorphic like the lowland and the highland area, so more genomes are needed for studying population evolution and environment adaptation. In this paper, we reported the genome sequencing of R. solanacearum strain CQPS-1 isolated from wilted tobacco in Pengshui, Chongqing, China, a highland area with severely acidified soil and continuous cropping of tobacco more than 20 years. The comparative genomic analysis among different R. solanacearum strains was also performed. The completed genome size of CQPS-1 was 5.89 Mb and contained the chromosome (3.83 Mb) and the megaplasmid (2.06 Mb). A total of 5229 coding sequences were predicted (the chromosome and megaplasmid encoded 3573 and 1656 genes, respectively). A comparative analysis with eight strains from four phylotypes showed that there was some variation among the species, e.g., a large set of specific genes in CQPS-1. Type III secretion system gene cluster (hrp gene cluster) was conserved in CQPS-1 compared with the reference strain GMI1000. In addition, most genes coding core type III effectors were also conserved with GMI1000, but significant gene variation was found in the gene ripAA: the identity compared with strain GMI1000 was 75% and the hrpII box promoter in the upstream had significantly mutated. This study provided a potential resource for further understanding of the relationship between variation of pathogenicity factors and adaptation to the host environment. PMID:28620361

  14. Human growth is associated with distinct patterns of gene expression in evolutionarily conserved networks

    PubMed Central

    2013-01-01

    Background: A co-ordinated tissue-independent gene expression profile associated with growth is present in rodent models and this is hypothesised to extend to all mammals. Growth in humans has similarities to other mammals but the return to active long bone growth in the pubertal growth spurt is a distinctly human growth event. The aim of this study was to describe gene expression and biological pathways associated with stages of growth in children and to assess tissue-independent expression patterns in relation to human growth. Results: We conducted gene expression analysis on a library of datasets from normal children with age annotation, collated from the NCBI Gene Expression Omnibus (GEO) and EBI Arrayexpress databases. A primary data set was generated using cells of lymphoid origin from normal children; the expression of 688 genes (ANOVA false discovery rate modified p-value, q < 0.1) was associated with age, and subsets of these genes formed clusters that correlated with the phases of growth – infancy, childhood, puberty and final height. Network analysis on these clusters identified evolutionarily conserved growth pathways (NOTCH, VEGF, TGFB, WNT and glucocorticoid receptor – Hyper-geometric test, q < 0.05). The greatest degree of network ‘connectivity’ and hence functional significance was present in infancy (Wilcoxon test, p < 0.05), which then decreased through to adulthood. These observations were confirmed in a separate validation data set from lymphoid tissue. Similar biological pathways were observed to be associated with development-related gene expression in other tissues (conjunctival epithelia, temporal lobe brain tissue and bone marrow) suggesting the existence of a tissue-independent genetic program for human growth and maturation. Conclusions: Similar evolutionarily conserved pathways have been associated with gene expression and child growth in multiple tissues. These expression profiles associate with the developmental phases of growth including the return to active long bone growth in puberty, a distinctly human event. These observations also have direct medical relevance to pathological changes that induce disease in children. Taking into account development-dependent gene expression profiles for normal children will be key to the appropriate selection of genes and pathways as potential biomarkers of disease or as drug targets. PMID:23941278
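
    The "false discovery rate modified p-value, q < 0.1" criterion in this abstract can be illustrated with a q-value calculation. The abstract does not name the adjustment procedure, so the sketch below assumes the Benjamini-Hochberg method; the p-values and the function name are hypothetical and are not data from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted q-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so q-values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        q_values[idx] = running_min
    return q_values

# Hypothetical per-gene ANOVA p-values (illustrative only).
p_values = [0.001, 0.008, 0.039, 0.041, 0.27, 0.6]
q_values = benjamini_hochberg(p_values)
# Genes passing the q < 0.1 cut-off quoted in the abstract.
selected = [i for i, q in enumerate(q_values) if q < 0.1]
```

    With these toy inputs the first four genes pass the cut-off. Under this procedure, a q < 0.1 threshold means that roughly one in ten of the selected genes is expected to be a false positive.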

  15. A comprehensive analysis of replicative lifespan in 4,698 single-gene deletion strains uncovers conserved mechanisms of aging

    PubMed Central

    McCormick, Mark A.; Delaney, Joe R.; Tsuchiya, Mitsuhiro; Tsuchiyama, Scott; Shemorry, Anna; Sim, Sylvia; Chou, Annie Chia-Zong; Ahmed, Umema; Carr, Daniel; Murakami, Christopher J.; Schleit, Jennifer; Sutphin, George L.; Wasko, Brian M.; Bennett, Christopher F.; Wang, Adrienne M.; Olsen, Brady; Beyer, Richard P.; Bammler, Theodor K.; Prunkard, Donna; Johnson, Simon C.; Pennypacker, Juniper K.; An, Elroy; Anies, Arieanna; Castanza, Anthony S.; Choi, Eunice; Dang, Nick; Enerio, Shiena; Fletcher, Marissa; Fox, Lindsay; Goswami, Sarani; Higgins, Sean A.; Holmberg, Molly A.; Hu, Di; Hui, Jessica; Jelic, Monika; Jeong, Ki-Soo; Johnston, Elijah; Kerr, Emily O.; Kim, Jin; Kim, Diana; Kirkland, Katie; Klum, Shannon; Kotireddy, Soumya; Liao, Eric; Lim, Michael; Lin, Michael S.; Lo, Winston C.; Lockshon, Dan; Miller, Hillary A.; Moller, Richard M.; Muller, Brian; Oakes, Jonathan; Pak, Diana N.; Peng, Zhao Jun; Pham, Kim M.; Pollard, Tom G.; Pradeep, Prarthana; Pruett, Dillon; Rai, Dilreet; Robison, Brett; Rodriguez, Ariana A.; Ros, Bopharoth; Sage, Michael; Singh, Manpreet K.; Smith, Erica D.; Snead, Katie; Solanky, Amrita; Spector, Benjamin L.; Steffen, Kristan K.; Tchao, Bie Nga; Ting, Marc K.; Wende, Helen Vander; Wang, Dennis; Welton, K. Linnea; Westman, Eric A.; Brem, Rachel B.; Liu, Xin-guang; Suh, Yousin; Zhou, Zhongjun; Kaeberlein, Matt; Kennedy, Brian K.

    2015-01-01

    SUMMARY Many genes that affect replicative lifespan (RLS) in the budding yeast Saccharomyces cerevisiae also affect aging in other organisms such as C. elegans and M. musculus. We performed a systematic analysis of yeast RLS in a set of 4,698 viable single-gene deletion strains. Multiple functional gene clusters were identified, and full genome-to-genome comparison demonstrated a significant conservation in longevity pathways between yeast and C. elegans. Among the mechanisms of aging identified, deletion of tRNA exporter LOS1 robustly extended lifespan. Dietary restriction (DR) and inhibition of mechanistic Target of Rapamycin (mTOR) exclude Los1 from the nucleus in a Rad53-dependent manner. Moreover, lifespan extension from deletion of LOS1 is non-additive with DR or mTOR inhibition, and results in Gcn4 transcription factor activation. Thus, the DNA damage response and mTOR converge on Los1-mediated nuclear tRNA export to regulate Gcn4 activity and aging. PMID:26456335

  16. A functionally conserved Polycomb response element from mouse HoxD complex responds to heterochromatin factors

    NASA Astrophysics Data System (ADS)

    Vasanthi, Dasari; Nagabhushan, A.; Matharu, Navneet Kaur; Mishra, Rakesh K.

    2013-10-01

    Anterior-posterior body axis in all bilaterians is determined by the Hox gene clusters that are activated in a spatio-temporal order. This expression pattern of Hox genes is established and maintained by regulatory mechanisms that involve higher order chromatin structure and Polycomb group (PcG) and trithorax group (trxG) proteins. We identified earlier a Polycomb response element (PRE) in the mouse HoxD complex that is functionally conserved in flies. We analyzed the molecular and genetic interactions of mouse PRE using Drosophila melanogaster and vertebrate cell culture as the model systems. We demonstrate that the repressive activity of this PRE depends on PcG/trxG genes as well as the heterochromatin components. Our findings indicate that a wide range of factors interact with the HoxD PRE that can contribute to establishing the expression pattern of homeotic genes in the complex early during development and maintain that pattern at subsequent stages.

  17. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huang, Tingting; Chang, Chin -Yuan; Lohman, Jeremy R.

    Comparative analysis of the enediyne biosynthetic gene clusters revealed sets of conserved genes serving as outstanding candidates for the enediyne core. Here we report the crystal structures of SgcJ and its homologue NCS-Orf16, together with gene inactivation and site-directed mutagenesis studies, to gain insight into enediyne core biosynthesis. Gene inactivation in vivo establishes that SgcJ is required for C-1027 production in Streptomyces globisporus. SgcJ and NCS-Orf16 share a common structure with the nuclear transport factor 2-like superfamily of proteins, featuring a putative substrate binding or catalytic active site. Site-directed mutagenesis of the conserved residues lining this site allowed us to propose that SgcJ and its homologues may play a catalytic role in transforming the linear polyene intermediate, along with other enediyne polyketide synthase-associated enzymes, into an enzyme-sequestered enediyne core intermediate. In conclusion, these findings will help formulate hypotheses and design experiments to ascertain the function of SgcJ and its homologues in nine-membered enediyne core biosynthesis.

  18. SXT/R391 integrative and conjugative elements in Proteus species reveal abundant genetic diversity and multidrug resistance

    PubMed Central

    Li, Xinyue; Du, Yu; Du, Pengcheng; Dai, Hang; Fang, Yujie; Li, Zhenpeng; Lv, Na; Zhu, Baoli; Kan, Biao; Wang, Duochun

    2016-01-01

    SXT/R391 integrative and conjugative elements (ICEs) are self-transmissible mobile genetic elements that are found in most members of Enterobacteriaceae. Here, we determined fifteen SXT/R391 ICEs carried by Proteus isolates from food (4.2%) and diarrhoea patients (17.3%). BLASTn searches against GenBank showed that the fifteen SXT/R391 ICEs were closely related to those from different Enterobacteriaceae species, including Proteus mirabilis. Using core gene phylogenetic analysis, the fifteen SXT/R391 ICEs were grouped into six distinct clusters, including a dominant cluster and three clusters that have not been previously reported in Proteus isolates. The SXT/R391 ICEs shared a common structure with a set of conserved genes, five hotspots and two variable regions, which contained more foreign genes, including drug-resistance genes. Notably, a class A β-lactamase gene was identified in nine SXT/R391 ICEs. Collectively, the ICE-carrying isolates carried resistance genes for 20 tested drugs. Six isolates were resistant to chloramphenicol, kanamycin, streptomycin, trimethoprim-sulfamethoxazole, sulfisoxazole and tetracycline, which are drug resistances commonly encoded by ICEs. Our results demonstrate abundant genetic diversity and multidrug resistance of the SXT/R391 ICEs carried by Proteus isolates, which may have significance for public health. It is therefore necessary to continuously monitor the antimicrobial resistance and related mobile elements among Proteus isolates. PMID:27892525

  19. Conservation of NLR-triggered immunity across plant lineages.

    PubMed

    Maekawa, Takaki; Kracher, Barbara; Vernaldi, Saskia; Ver Loren van Themaat, Emiel; Schulze-Lefert, Paul

    2012-12-04

    The nucleotide-binding domain and leucine-rich repeat (NLR) family of plant receptors detects pathogen-derived molecules, designated effectors, inside host cells and mediates innate immune responses to pathogenic invaders. Genetic evidence revealed species-specific coevolution of many NLRs with effectors from host-adapted pathogens, suggesting that the specificity of these NLRs is restricted to the host or closely related plant species. However, we report that an NLR immune receptor (MLA1) from monocotyledonous barley is fully functional in partially immunocompromised dicotyledonous Arabidopsis thaliana against the barley powdery mildew fungus, Blumeria graminis f. sp. hordei. This implies ~200 million years of evolutionary conservation of the underlying immune mechanism. A time-course RNA-seq analysis in transgenic Arabidopsis lines detected sustained expression of a large MLA1-dependent gene cluster. This cluster is greatly enriched in genes known to respond to the fungal cell wall-derived microbe-associated molecular pattern chitin. The MLA1-dependent sustained transcript accumulation could define a conserved function of the nuclear pool of MLA1 detected in barley and Arabidopsis. We also found that MLA1-triggered immunity was fully retained in mutant plants that are simultaneously depleted of ethylene, jasmonic acid, and salicylic acid signaling. This points to the existence of an evolutionarily conserved and phytohormone-independent MLA1-mediated resistance mechanism. This also suggests a conserved mechanism for internalization of B. graminis f. sp. hordei effectors into host cells of flowering plants. Furthermore, the deduced connectivity of the NLR to multiple branches of immune signaling pathways likely confers increased robustness against pathogen effector-mediated interception of host immune signaling and could have contributed to the evolutionary preservation of the immune mechanism.

  20. Whole-genome sequencing of Aspergillus tubingensis G131 and overview of its secondary metabolism potential.

    PubMed

    Choque, Elodie; Klopp, Christophe; Valiere, Sophie; Raynal, José; Mathieu, Florence

    2018-03-15

    Black Aspergilli represent one of the most important fungal resources of primary and secondary metabolites for biotechnological industry. Having several black Aspergilli sequenced genomes should allow targeting the production of certain metabolites with bioactive properties. In this study, we report the draft genome of a black Aspergilli, A. tubingensis G131, isolated from a French Mediterranean vineyard. This 35 Mb genome includes 10,994 predicted genes. A genomic-based discovery identifies 80 secondary metabolites biosynthetic gene clusters. Genomic sequences of these clusters were blasted on 3 chosen black Aspergilli genomes: A. tubingensis CBS 134.48, A. niger CBS 513.88 and A. kawachii IFO 4308. This comparison highlights different levels of clusters conservation between the four strains. It also allows identifying seven unique clusters in A. tubingensis G131. Moreover, the putative secondary metabolites clusters for asperazine and naphtho-gamma-pyrones production were proposed based on this genomic analysis. Key biosynthetic genes required for the production of 2 mycotoxins, ochratoxin A and fumonisin, are absent from this draft genome. Even if intergenic sequences of these mycotoxins biosynthetic pathways are present, this could not lead to the production of those mycotoxins by A. tubingensis G131. Functional and bioinformatics analyses of A. tubingensis G131 genome highlight its potential for metabolites production in particular for TAN-1612, asperazine and naphtho-gamma-pyrones presenting antioxidant, anticancer or antibiotic properties.

  1. The "fossilized" mitochondrial genome of Liriodendron tulipifera: ancestral gene content and order, ancestral editing sites, and extraordinarily low mutation rate.

    PubMed

    Richardson, Aaron O; Rice, Danny W; Young, Gregory J; Alverson, Andrew J; Palmer, Jeffrey D

    2013-04-15

    The mitochondrial genomes of flowering plants vary greatly in size, gene content, gene order, mutation rate and level of RNA editing. However, the narrow phylogenetic breadth of available genomic data has limited our ability to reconstruct these traits in the ancestral flowering plant and, therefore, to infer subsequent patterns of evolution across angiosperms. We sequenced the mitochondrial genome of Liriodendron tulipifera, the first from outside the monocots or eudicots. This 553,721 bp mitochondrial genome has evolved remarkably slowly in virtually all respects, with an extraordinarily low genome-wide silent substitution rate, retention of genes frequently lost in other angiosperm lineages, and conservation of ancestral gene clusters. The mitochondrial protein genes in Liriodendron are the most heavily edited of any angiosperm characterized to date. Most of these sites are also edited in various other lineages, which allowed us to polarize losses of editing sites in other parts of the angiosperm phylogeny. Finally, we added comprehensive gene sequence data for two other magnoliids, Magnolia stellata and the more distantly related Calycanthus floridus, to measure rates of sequence evolution in Liriodendron with greater accuracy. The Magnolia genome has evolved at an even lower rate, revealing a roughly 5,000-fold range of synonymous-site divergence among angiosperms whose mitochondrial gene space has been comprehensively sequenced. Using Liriodendron as a guide, we estimate that the ancestral flowering plant mitochondrial genome contained 41 protein genes, 14 tRNA genes of mitochondrial origin, as many as 7 tRNA genes of chloroplast origin, >700 sites of RNA editing, and some 14 colinear gene clusters. Many of these gene clusters, genes and RNA editing sites have been variously lost in different lineages over the course of the ensuing ~200 million years of angiosperm evolution.

  2. Genes encoding major light-harvesting polypeptides are clustered on the genome of the cyanobacterium Fremyella diplosiphon.

    PubMed Central

    Conley, P B; Lemaux, P G; Lomax, T L; Grossman, A R

    1986-01-01

    The polypeptide composition of the phycobilisome, the major light-harvesting complex of prokaryotic cyanobacteria and certain eukaryotic algae, can be modulated by different light qualities in cyanobacteria exhibiting chromatic adaptation. We have identified genomic fragments encoding a cluster of phycobilisome polypeptides (phycobiliproteins) from the chromatically adapting cyanobacterium Fremyella diplosiphon using previously characterized DNA fragments of phycobiliprotein genes from the eukaryotic alga Cyanophora paradoxa and from F. diplosiphon. Characterization of two lambda-EMBL3 clones containing overlapping genomic fragments indicates that three sets of phycobiliprotein genes--the alpha- and beta-allophycocyanin genes plus two sets of alpha- and beta-phycocyanin genes--are clustered within 13 kilobases on the cyanobacterial genome and transcribed off the same strand. The gene order (alpha-allophycocyanin followed by beta-allophycocyanin and beta-phycocyanin followed by alpha-phycocyanin) appears to be a conserved arrangement found previously in a eukaryotic alga and another cyanobacterium. We have reported that one set of phycocyanin genes is transcribed as two abundant red light-induced mRNAs (1600 and 3800 bases). We now present data showing that the allophycocyanin genes and a second set of phycocyanin genes are transcribed into major mRNAs of 1400 and 1600 bases, respectively. These transcripts are present in RNA isolated from cultures grown in red and green light, although lower levels of the 1600-base phycocyanin transcript are present in cells grown in green light. Furthermore, a larger transcript of 1750 bases hybridizes to the allophycocyanin genes and may be a precursor to the 1400-base species. PMID:3086870

  3. Effects of multiple founder populations on spatial genetic structure of reintroduced American martens.

    PubMed

    Williams, Bronwyn W; Scribner, Kim T

    2010-01-01

    Reintroductions and translocations are increasingly used to repatriate or increase probabilities of persistence for animal and plant species. Genetic and demographic characteristics of founding individuals and suitability of habitat at release sites are commonly believed to affect the success of these conservation programs. Genetic divergence among multiple source populations of American martens (Martes americana) and well documented introduction histories permitted analyses of post-introduction dispersion from release sites and development of genetic clusters in the Upper Peninsula (UP) of Michigan <50 years following release. Location and size of spatial genetic clusters and measures of individual-based autocorrelation were inferred using 11 microsatellite loci. We identified three genetic clusters in geographic proximity to original release locations. Estimated distances of effective gene flow based on spatial autocorrelation varied greatly among genetic clusters (30-90 km). Spatial contiguity of genetic clusters has been largely maintained with evidence for admixture primarily in localized regions, suggesting recent contact or locally retarded rates of gene flow. Data provide guidance for future studies of the effects of permeabilities of different land-cover and land-use features to dispersal and of other biotic and environmental factors that may contribute to the colonization process and development of spatial genetic associations.

  4. Diversity amongst trigeminal neurons revealed by high throughput single cell sequencing

    PubMed Central

    Nguyen, Minh Q.; Wu, Youmei; Bonilla, Lauren S.; von Buchholtz, Lars J.

    2017-01-01

    The trigeminal ganglion contains somatosensory neurons that detect a range of thermal, mechanical and chemical cues and innervate unique sensory compartments in the head and neck including the eyes, nose, mouth, meninges and vibrissae. We used single-cell sequencing and in situ hybridization to examine the cellular diversity of the trigeminal ganglion in mice, defining thirteen clusters of neurons. We show that clusters are well conserved in dorsal root ganglia suggesting they represent distinct functional classes of somatosensory neurons and not specialization associated with their sensory targets. Notably, functionally important genes (e.g. the mechanosensory channel Piezo2 and the capsaicin gated ion channel Trpv1) segregate into multiple clusters and often are expressed in subsets of cells within a cluster. Therefore, the 13 genetically-defined classes are likely to be physiologically heterogeneous rather than highly parallel (i.e., redundant) lines of sensory input. Our analysis harnesses the power of single-cell sequencing to provide a unique platform for in silico expression profiling that complements other approaches linking gene-expression with function and exposes unexpected diversity in the somatosensory system. PMID:28957441

  5. Identification and Analysis of a Novel Gene Cluster Involves in Fe2+ Oxidation in Acidithiobacillus ferrooxidans ATCC 23270, a Typical Biomining Acidophile.

    PubMed

    Ai, Chenbing; Liang, Yuting; Miao, Bo; Chen, Miao; Zeng, Weimin; Qiu, Guanzhou

    2018-07-01

    Iron-oxidizing Acidithiobacillus spp. are applied worldwide in biomining industry to extract metals from sulfide minerals. They derive energy for survival through Fe2+ oxidation and generate Fe3+ for the dissolution of sulfide minerals. However, molecular mechanisms of their iron oxidation still remain elusive. A novel two-cytochrome-encoding gene cluster (named tce gene cluster) encoding a high-molecular-weight cytochrome c (AFE_1428) and a c4-type cytochrome c552 (AFE_1429) in A. ferrooxidans ATCC 23270 was first identified in this study. Bioinformatic analysis together with transcriptional study showed that AFE_1428 and AFE_1429 were the corresponding paralog of Cyc2 (AFE_3153) and Cyc1 (AFE_3152) which were encoded by the extensively studied rus operon and had been proven involving in ferrous iron oxidation. Both AFE_1428 and AFE_1429 contained signal peptide and the classic heme-binding motif(s) as their corresponding paralog. The modeled structure of AFE_1429 showed high resemblance to Cyc1. AFE_1428 and AFE_1429 were preferentially transcribed as their corresponding paralogs in the presence of ferrous iron as sole energy source as compared with sulfur. The tce gene cluster is highly conserved in the genomes of four phylogenetic-related A. ferrooxidans strains that were originally isolated from different sites separated with huge geographical distance, which further implies the importance of this gene cluster. Collectively, AFE_1428 and AFE_1429 involve in Fe2+ oxidation like their corresponding paralog by integrating with the metalloproteins encoded by rus operon. This study provides novel insights into the Fe2+ oxidation mechanism in Fe2+-oxidizing A. ferrooxidans ssp.

  6. [Genome-wide identification and analysis of WRKY transcription factors in Medicago truncatula].

    PubMed

    Song, Hui; Nan, Zhibiao

    2014-02-01

    The WRKY gene family plays important roles in plants through transcriptional regulation during various physiological processes such as development, metabolism and responses to biotic and abiotic stresses. WRKY genes have been identified in various plants. However, only a few WRKY genes in Medicago truncatula have been identified through systematic analysis and comparison. In this study, we identified 93 WRKY genes through analyses of the M. truncatula genome. These genes include 19 type-I genes, 49 type-II genes, 13 type-III genes and 12 non-regular type genes. All of these genes were characterized through analyses of gene duplication, chromosomal locations, structural diversity, conserved protein motifs and phylogenetic relations. The results showed that 11 gene duplication events occurred in the WRKY gene family, involving 24 genes. The WRKY genes, which include 6 gene clusters, are unevenly distributed across chromosomes 1 to 6, and the WRKY group III genes are under purifying selection pressure.

  7. Discovery and characterization of a prevalent human gut bacterial enzyme sufficient for the inactivation of a family of plant toxins

    PubMed Central

    Koppel, Nitzan; Bisanz, Jordan E; Pandelia, Maria-Eirini

    2018-01-01

    Although the human gut microbiome plays a prominent role in xenobiotic transformation, most of the genes and enzymes responsible for this metabolism are unknown. Recently, we linked the two-gene ‘cardiac glycoside reductase’ (cgr) operon encoded by the gut Actinobacterium Eggerthella lenta to inactivation of the cardiac medication and plant natural product digoxin. Here, we compared the genomes of 25 E. lenta strains and close relatives, revealing an expanded 8-gene cgr-associated gene cluster present in all digoxin metabolizers and absent in non-metabolizers. Using heterologous expression and in vitro biochemical characterization, we discovered that a single flavin- and [4Fe-4S] cluster-dependent reductase, Cgr2, is sufficient for digoxin inactivation. Unexpectedly, Cgr2 displayed strict specificity for digoxin and other cardenolides. Quantification of cgr2 in gut microbiomes revealed that this gene is widespread and conserved in the human population. Together, these results demonstrate that human-associated gut bacteria maintain specialized enzymes that protect against ingested plant toxins. PMID:29761785

  8. Identification of proteins in Streptococcus pneumoniae by reverse vaccinology and genetic diversity of these proteins in clinical isolates.

    PubMed

    Argondizzo, Ana Paula Corrêa; da Mota, Fabio Faria; Pestana, Cristiane Pinheiro; Reis, Joice Neves; de Miranda, Antonio Basílio; Galler, Ricardo; Medeiros, Marco Alberto

    2015-02-01

    Streptococcus pneumoniae is a major cause of morbidity and mortality worldwide. Virulence-associated proteins common and conserved among all capsular types now represent the best strategy to combat pneumococcal infections. Our aim was to identify conserved targets in pneumococci that showed positive prediction for lipoprotein and extracellular subcellular location using bioinformatics programs and to verify the distribution and the degree of conservation of these targets in pneumococci. These targets can be considered potential vaccine candidates to be evaluated in the future. A set of 13 targets was analyzed, and their presence was confirmed in all pneumococci tested. These 13 genes were highly conserved, showing >96% amino acid and nucleotide identity, but they were also present and showed high identity in the closely related species Streptococcus mitis, Streptococcus oralis, and Streptococcus pseudopneumoniae. S. oralis clusters away from S. pneumoniae, while S. pseudopneumoniae and S. mitis cluster closer. The divergence between the selected targets was too small to be observed consistently in phylogenetic groups between the analyzed genomes of S. pneumoniae. The proteins analyzed fulfill two of the initial criteria of a vaccine candidate: targets are present in a variety of different pneumococci strains including different serotypes and are conserved among the samples evaluated.

  9. The genome sequence of Bifidobacterium longum subsp. infantis reveals adaptations for milk utilization within the infant microbiome

    PubMed Central

    Sela, D. A.; Chapman, J.; Adeuya, A.; Kim, J. H.; Chen, F.; Whitehead, T. R.; Lapidus, A.; Rokhsar, D. S.; Lebrilla, C. B.; German, J. B.; Price, N. P.; Richardson, P. M.; Mills, D. A.

    2008-01-01

    Following birth, the breast-fed infant gastrointestinal tract is rapidly colonized by a microbial consortium often dominated by bifidobacteria. Accordingly, the complete genome sequence of Bifidobacterium longum subsp. infantis ATCC15697 reflects a competitive nutrient-utilization strategy targeting milk-borne molecules which lack a nutritive value to the neonate. Several chromosomal loci reflect potential adaptation to the infant host including a 43 kbp cluster encoding catabolic genes, extracellular solute binding proteins and permeases predicted to be active on milk oligosaccharides. An examination of in vivo metabolism has detected the hallmarks of milk oligosaccharide utilization via the central fermentative pathway using metabolomic and proteomic approaches. Finally, conservation of gene clusters in multiple isolates corroborates the genomic mechanism underlying milk utilization for this infant-associated phylotype. PMID:19033196

  10. Physical mapping of repetitive DNA suggests 2n reduction in Amazon turtles Podocnemis (Testudines: Podocnemididae)

    PubMed Central

    Cavalcante, Manoella Gemaque; Bastos, Carlos Eduardo Matos Carvalho; Nagamachi, Cleusa Yoshiko; Pieczarka, Julio Cesar; Vicari, Marcelo Ricardo; Noronha, Renata Coelho Rodrigues

    2018-01-01

    Cytogenetic studies show that there is great karyotypic diversity in order Testudines (2n = 26–68), and that this may be mainly attributed to the presence/absence of microchromosomes. Members of the Podocnemididae family have the smallest diploid numbers of this order (2n = 26–28), which may be a derived condition of the group. Diverse studies suggest that repetitive-DNA-rich sites generally act as hotspots for double-strand breaks and chromosomal reorganization. In this context, we used fluorescent in situ hybridization (FISH) to map telomeric sequences (TTAGGG)n, 45S rDNA, and the genes encoding histones H1 and H3 in two species of genus Podocnemis. We also observed conservation of the 45S rDNA and H1 histone sequences (probable case of conserved synteny), but multiple conserved and non-conserved clusters of H3 genes, which colocalized with the interstitial telomeric sequences in the Podocnemis genome. Our results suggest that fusions have occurred between macro and microchromosomes or between microchromosomes, leading to the observed reduction in diploid number in the family Podocnemididae. PMID:29813087

  11. Physical mapping of repetitive DNA suggests 2n reduction in Amazon turtles Podocnemis (Testudines: Podocnemididae).

    PubMed

    Cavalcante, Manoella Gemaque; Bastos, Carlos Eduardo Matos Carvalho; Nagamachi, Cleusa Yoshiko; Pieczarka, Julio Cesar; Vicari, Marcelo Ricardo; Noronha, Renata Coelho Rodrigues

    2018-01-01

    Cytogenetic studies show that there is great karyotypic diversity in order Testudines (2n = 26-68), and that this may be mainly attributed to the presence/absence of microchromosomes. Members of the Podocnemididae family have the smallest diploid numbers of this order (2n = 26-28), which may be a derived condition of the group. Diverse studies suggest that repetitive-DNA-rich sites generally act as hotspots for double-strand breaks and chromosomal reorganization. In this context, we used fluorescent in situ hybridization (FISH) to map telomeric sequences (TTAGGG)n, 45S rDNA, and the genes encoding histones H1 and H3 in two species of genus Podocnemis. We also observed conservation of the 45S rDNA and H1 histone sequences (probable case of conserved synteny), but multiple conserved and non-conserved clusters of H3 genes, which colocalized with the interstitial telomeric sequences in the Podocnemis genome. Our results suggest that fusions have occurred between macro and microchromosomes or between microchromosomes, leading to the observed reduction in diploid number in the family Podocnemididae.

  12. A curated catalog of canine and equine keratin genes

    PubMed Central

    Pujar, Shashikant; McGarvey, Kelly M.; Welle, Monika; Galichet, Arnaud; Müller, Eliane J.; Pruitt, Kim D.; Leeb, Tosso

    2017-01-01

    Keratins represent a large protein family with essential structural and functional roles in epithelial cells of skin, hair follicles, and other organs. During evolution the genes encoding keratins have undergone multiple rounds of duplication and humans have two clusters with a total of 55 functional keratin genes in their genomes. Due to the high similarity between different keratin paralogs and species-specific differences in gene content, the currently available keratin gene annotation in species with draft genome assemblies such as dog and horse is still imperfect. We compared the National Center for Biotechnology Information (NCBI) (dog annotation release 103, horse annotation release 101) and Ensembl (release 87) gene predictions for the canine and equine keratin gene clusters to RNA-seq data that were generated from adult skin of five dogs and two horses and from adult hair follicle tissue of one dog. Taking into consideration the knowledge on the conserved exon/intron structure of keratin genes, we annotated 61 putatively functional keratin genes in both the dog and horse, respectively. Subsequently, curators in the RefSeq group at NCBI reviewed their annotation of keratin genes in the dog and horse genomes (Annotation Release 104 and Annotation Release 102, respectively) and updated annotation and gene nomenclature of several keratin genes. The updates are now available in the NCBI Gene database (https://www.ncbi.nlm.nih.gov/gene). PMID:28846680

  13. Underlying mathematics in diversification of human olfactory receptors in different loci.

    PubMed

    Hassan, Sk Sarif; Choudhury, Pabitra Pal; Goswami, Arunava

    2013-12-01

    By conservative estimate, approximately 51-105 olfactory receptor (OR) loci are present in the human genome, occurring in clusters. These clusters are apparently unevenly spread as mosaics over 21 pairs of human chromosomes. The OR gene families, which are thought to have expanded to provide recognition capability for a huge number of pure and complex odorants, form the largest known multigene family in the human genome. Recent studies have shown that 388 full-length OR genes and 414 OR pseudogenes are present in these OR genomic clusters. In this paper, the authors report a classification method for all human ORs based on quantitative sequence information such as the presence of poly-strings of nucleotide bases and long-range correlation. An L-System generated sequence has been taken as input into a star-model of specific subfamily members, and the resultant sequence has been mapped to a specific OR based on the classification scheme using fractal parameters such as the Hurst exponent and fractal dimensions.

  14. Gene network analysis identifies rumen epithelial cell proliferation, differentiation and metabolic pathways perturbed by diet and correlated with methane production

    PubMed Central

    Xiang, Ruidong; McNally, Jody; Rowe, Suzanne; Jonker, Arjan; Pinares-Patino, Cesar S.; Oddy, V. Hutton; Vercoe, Phil E.; McEwan, John C.; Dalrymple, Brian P.

    2016-01-01

    Ruminants obtain nutrients from microbial fermentation of plant material, primarily in their rumen, a multilayered forestomach. How the different layers of the rumen wall respond to diet and influence microbial fermentation, and how these processes are regulated, is not well understood. Gene expression correlation networks were constructed from full thickness rumen wall transcriptomes of 24 sheep fed two different amounts and qualities of a forage and measured for methane production. The network contained two major negatively correlated gene sub-networks predominantly representing the epithelial and muscle layers of the rumen wall. Within the epithelium sub-network, gene clusters representing lipid/oxo-acid metabolism, general metabolism and proliferating and differentiating cells were identified. The expression of cell cycle and metabolic genes was positively correlated with dry matter intake, ruminal short chain fatty acid concentrations and methane production. A weak correlation between lipid/oxo-acid metabolism genes and methane yield was observed. Feed consumption level explained the majority of gene expression variation, particularly for the cell cycle genes. Many known stratified epithelium transcription factors had significantly enriched targets in the epithelial gene clusters. The expression patterns of the transcription factors and their targets in proliferating and differentiating skin are mirrored in the rumen, suggesting conservation of regulatory systems. PMID:27966600

  15. Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns.

    PubMed

    Gruel, Jérémy; LeBorgne, Michel; LeMeur, Nolwenn; Théret, Nathalie

    2011-09-12

    Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks.
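
    The counting step described in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract defines SSM by length and similarity, whereas the code below assumes exact-match k-mers only, and the function names, the default k = 6 and the toy sequences are all hypothetical.

```python
from itertools import combinations


def kmers(seq, k):
    """Return the set of all length-k substrings of a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_motif_count(seq_a, seq_b, k=6):
    """Count distinct k-mers found in both sequences (exact matches only).

    Assumption: exact matching stands in for the similarity criterion
    mentioned in the abstract, which is not specified there.
    """
    return len(kmers(seq_a, k) & kmers(seq_b, k))


def pairwise_shared_counts(promoters, k=6):
    """Map each unordered gene pair to its shared k-mer count."""
    return {
        (a, b): shared_motif_count(promoters[a], promoters[b], k)
        for a, b in combinations(sorted(promoters), 2)
    }


if __name__ == "__main__":
    # Hypothetical conserved promoter fragments, for illustration only.
    toy = {"geneA": "ACGTACGT", "geneB": "TTACGTAA", "geneC": "GGGGCCCC"}
    for pair, count in pairwise_shared_counts(toy, k=4).items():
        print(pair, count)
```

    Pairs with an unusually high count relative to the background distribution would then be the candidates for a common regulatory pattern; how "exceptional" is decided is left open here because the abstract does not state it.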

  16. Simple Shared Motifs (SSM) in conserved region of promoters: a new approach to identify co-regulation patterns

    PubMed Central

    2011-01-01

    Background Regulation of gene expression plays a pivotal role in cellular functions. However, understanding the dynamics of transcription remains a challenging task. A host of computational approaches have been developed to identify regulatory motifs, mainly based on the recognition of DNA sequences for transcription factor binding sites. Recent integration of additional data from genomic analyses or phylogenetic footprinting has significantly improved these methods. Results Here, we propose a different approach based on the compilation of Simple Shared Motifs (SSM), groups of sequences defined by their length and similarity and present in conserved sequences of gene promoters. We developed an original algorithm to search and count SSM in pairs of genes. An exceptional number of SSM is considered as a common regulatory pattern. The SSM approach is applied to a sample set of genes and validated using functional gene-set enrichment analyses. We demonstrate that the SSM approach selects genes that are over-represented in specific biological categories (Ontology and Pathways) and are enriched in co-expressed genes. Finally we show that genes co-expressed in the same tissue or involved in the same biological pathway have increased SSM values. Conclusions Using unbiased clustering of genes, Simple Shared Motifs analysis constitutes an original contribution to provide a clearer definition of expression networks. PMID:21910886

  17. Comprehensive identification and clustering of CLV3/ESR-related (CLE) genes in plants finds groups with potentially shared function.

    PubMed

    Goad, David M; Zhu, Chuanmei; Kellogg, Elizabeth A

    2017-10-01

    CLV3/ESR (CLE) proteins are important signaling peptides in plants. The short CLE peptide (12-13 amino acids) is cleaved from a larger pre-propeptide and functions as an extracellular ligand. The CLE family is large and has resisted attempts at classification because the CLE domain is too short for reliable phylogenetic analysis and the pre-propeptide is too variable. We used a model-based search for CLE domains from 57 plant genomes and used the entire pre-propeptide for comprehensive clustering analysis. In total, 1628 CLE genes were identified in land plants, with none recognizable from green algae. These CLEs form 12 groups within which CLE domains are largely conserved and pre-propeptides can be aligned. Most clusters contain sequences from monocots, eudicots and Amborella trichopoda, with sequences from Picea abies, Selaginella moellendorffii and Physcomitrella patens scattered in some clusters. We easily identified previously known clusters involved in vascular differentiation and nodulation. In addition, we found a number of discrete groups whose function remains poorly characterized. Available data indicate that CLE proteins within a cluster are likely to share function, whereas those from different clusters play at least partially different roles. Our analysis provides a foundation for future evolutionary and functional studies. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  18. Biochemical and Genetic Characterization of the vanC-2 Vancomycin Resistance Gene Cluster of Enterococcus casseliflavus ATCC 25788

    PubMed Central

    Dutta, Ireena; Reynolds, Peter E.

    2002-01-01

    The vanC-2 cluster of Enterococcus casseliflavus ATCC 25788 consisted of five genes (vanC-2, vanXYC-2, vanTC-2, vanRC-2, and vanSC-2) and shared the same organization as the vanC cluster of E. gallinarum BM4174. The proteins encoded by these genes displayed a high degree of amino acid identity to the proteins encoded within the vanC gene cluster. The putative d,d-dipeptidase-d,d-carboxypeptidase, VanXYC-2, exhibited 81% amino acid identity to VanXYC, and VanTC-2 displayed 65% amino acid identity to the serine racemase, VanT. VanRC-2 and VanSC-2 displayed high degrees of identity to VanRC and VanSC, respectively, and contained the conserved residues identified as important to their function as a response regulator and histidine kinase, respectively. Resistance to vancomycin was expressed inducibly in E. casseliflavus ATCC 25788 and required an extended period of induction. Analysis of peptidoglycan precursors revealed that UDP-N-acetylmuramyl-l-Ala-δ-d-Glu-l-Lys-d-Ala-d-Ser could not be detected until several hours after the addition of vancomycin, and its appearance coincided with the resumption of growth. The introduction of additional copies of the vanTC-2 gene, encoding a putative serine racemase, and the presence of supplementary d-serine in the growth medium both significantly reduced the period before growth resumed after addition of vancomycin. This suggested that the availability of d-serine plays an important role in the induction process. PMID:12234834

  19. Conserved enzymes mediate the early reactions of carotenoid biosynthesis in nonphotosynthetic and photosynthetic prokaryotes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Armstrong, G.A.; Hearst, J.E.; Alberti, M.

    1990-12-01

    Carotenoids comprise one of the most widespread classes of pigments found in nature. The first reactions of C40 carotenoid biosynthesis proceed through common intermediates in all organisms, suggesting the evolutionary conservation of early enzymes from this pathway. The authors report here the nucleotide sequence of three genes from the carotenoid biosynthesis gene cluster of Erwinia herbicola, a nonphotosynthetic epiphytic bacterium, which encode homologs of the CrtB, CrtE, and CrtI proteins of Rhodobacter capsulatus, a purple nonsulfur photosynthetic bacterium. CrtB (prephytoene pyrophosphate synthase), CrtE (phytoene synthase), and CrtI (phytoene dehydrogenase) are required for the first three reactions specific to the carotenoid branch of general isoprenoid metabolism. All three dehydrogenases possess a hydrophobic N-terminal domain containing a putative ADP-binding βαβ fold characteristic of enzymes known to bind FAD or NAD(P) cofactors. These data indicate the structural conservation of early carotenoid biosynthesis enzymes in evolutionarily diverse organisms.

  20. Draft Genome Sequencing and Comparative Analysis of Aspergillus sojae NBRC4239

    PubMed Central

    Sato, Atsushi; Oshima, Kenshiro; Noguchi, Hideki; Ogawa, Masahiro; Takahashi, Tadashi; Oguma, Tetsuya; Koyama, Yasuji; Itoh, Takehiko; Hattori, Masahira; Hanya, Yoshiki

    2011-01-01

    We conducted genome sequencing of the filamentous fungus Aspergillus sojae NBRC4239 isolated from the koji used to prepare Japanese soy sauce. We used the 454 pyrosequencing technology and investigated the genome with respect to enzymes and secondary metabolites in comparison with other Aspergilli sequenced. Assembly of 454 reads generated a non-redundant sequence of 39.5 Mb possessing 13 033 putative genes and 65 scaffolds composed of 557 contigs. Of the 2847 open reading frames with Pfam domain scores of >150 found in A. sojae NBRC4239, 81.7% had a high degree of similarity with the genes of A. oryzae. Comparative analysis identified serine carboxypeptidase and aspartic protease genes unique to A. sojae NBRC4239. While A. oryzae possessed three copies of the α-amylase gene, A. sojae NBRC4239 possessed only a single copy. Comparison of 56 gene clusters for secondary metabolites between A. sojae NBRC4239 and A. oryzae revealed that 24 clusters were conserved, whereas 32 clusters differed between them; the differences included a deletion of 18 508 bp containing mfs1, mao1, dmaT, and pks-nrps for cyclopiazonic acid (CPA) biosynthesis, explaining the lack of CPA production in A. sojae. The A. sojae NBRC4239 genome data will be useful to characterize functional features of the koji moulds used in Japanese industries. PMID:21659486

  1. 3. VIEW NORTHEAST, SOUTH FRONT OF SOIL CONSERVATION SERVICE CLUSTER ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    3. VIEW NORTHEAST, SOUTH FRONT OF SOIL CONSERVATION SERVICE CLUSTER (BUILDING 25) - U.S. Plant Introduction Station, Soil Conservation Service Cluster, 11601 Old Pond Road, Glenn Dale, Prince George's County, MD

  2. Clock genes and their genomic distributions in three species of salmonid fishes: Associations with genes regulating sexual maturation and cell cycling

    PubMed Central

    2010-01-01

    Background Clock family genes encode transcription factors that regulate clock-controlled genes and thus regulate many physiological mechanisms/processes in a circadian fashion. Clock1 duplicates and copies of Clock3 and NPAS2-like genes were partially characterized (genomic sequencing) and mapped using family-based indels/SNPs in rainbow trout (RT)(Oncorhynchus mykiss), Arctic charr (AC)(Salvelinus alpinus), and Atlantic salmon (AS)(Salmo salar) mapping panels. Results Clock1 duplicates mapped to linkage groups RT-8/-24, AC-16/-13 and AS-2/-18. Clock3/NPAS2-like genes mapped to RT-9/-20, AC-20/-43, and AS-5. Most of these linkage group regions containing the Clock gene duplicates were derived from the most recent 4R whole genome duplication event specific to the salmonids. These linkage groups contain quantitative trait loci (QTL) for life history and growth traits (i.e., reproduction and cell cycling). Comparative synteny analyses with other model teleost species reveal a high degree of conservation for genes in these chromosomal regions suggesting that functionally related or co-regulated genes are clustered in syntenic blocks. For example, anti-müllerian hormone (amh), regulating sexual maturation, and ornithine decarboxylase antizymes (oaz1 and oaz2), regulating cell cycling, are contained within these syntenic blocks. Conclusions Synteny analyses indicate that regions homologous to major life-history QTL regions in salmonids contain many candidate genes that are likely to influence reproduction and cell cycling. The order of these genes is highly conserved across the vertebrate species examined, and as such, these genes may make up a functional cluster of genes that are likely co-regulated. CLOCK, as a transcription factor, is found within this block and therefore has the potential to cis-regulate the processes influenced by these genes. 
Additionally, clock-controlled genes (CCGs) are located in other life-history QTL regions within salmonids suggesting that at least in part, trans-regulation of these QTL regions may also occur via Clock expression. PMID:20670436

  3. The Mouse Solitary Odorant Receptor Gene Promoters as Models for the Study of Odorant Receptor Gene Choice

    PubMed Central

    Degl'Innocenti, Andrea

    2016-01-01

    Background In vertebrates, several anatomical regions located within the nasal cavity mediate olfaction. Among these, the main olfactory epithelium detects most conventional odorants. Olfactory sensory neurons, provided with cilia exposed to the air, detect volatile chemicals via an extremely large family of seven-transmembrane chemoreceptors named odorant receptors. Their genes are expressed in a monogenic and monoallelic fashion: a single allele of a single odorant receptor gene is transcribed in a given mature neuron, through a still uncharacterized molecular mechanism known as odorant receptor gene choice. Aim Odorant receptor genes are typically arranged in genomic clusters, but a few are isolated (we call them solitary) from the others within a region broader than 1 Mb upstream and downstream with respect to their transcript's coordinates. The study of clustered genes is problematic, because of redundancy and ambiguities in their regulatory elements: we propose to use the solitary genes as simplified models to understand odorant receptor gene choice. Procedures Here we define number and identity of the solitary genes in the mouse genome (C57BL/6J), and assess the conservation of the solitary status in some mammalian orthologs. Furthermore, we locate their putative promoters, predict their homeodomain binding sites (commonly present in the promoters of odorant receptor genes) and compare candidate promoter sequences with those of wild-caught mice. We also provide expression data from histological sections. Results In the mouse genome there are eight intact solitary genes: Olfr19 (M12), Olfr49, Olfr266, Olfr267, Olfr370, Olfr371, Olfr466, Olfr1402; five are conserved as solitary in rat. These genes are all expressed in the main olfactory epithelium of three-day-old mice. The C57BL/6J candidate promoter of Olfr370 has considerably varied compared to its wild-type counterpart. Within the putative promoter for Olfr266 a homeodomain binding site is predicted. 
As a whole, our findings favor Olfr266 as a model gene to investigate odorant receptor gene choice. PMID:26794459

  4. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs).

    PubMed

    Natale, D A; Shankavaram, U T; Galperin, M Y; Wolf, Y I; Aravind, L; Koonin, E V

    2000-01-01

    Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi. A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix. 
Special-purpose databases organized on the basis of phylogenetic analysis and carefully curated with respect to known and predicted protein functions provide for a significant improvement in genome annotation. A differential genome display approach helps in a systematic investigation of common and distinct features of gene repertoires and in some cases reveals unexpected connections that may be indicative of functional similarities between phylogenetically distant organisms and of lateral gene exchange.
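
    The distance-matrix idea mentioned in this abstract (comparing genomes by their co-occurrence in COGs) can be sketched as follows. The abstract does not say which distance measure was used, so the Jaccard distance here is an assumption, and the genome names and COG identifiers in the example are hypothetical placeholders.

```python
def jaccard_distance(cogs_a, cogs_b):
    """Jaccard distance between two sets of COG identifiers.

    0.0 means identical COG repertoires, 1.0 means no COG in common.
    Assumption: Jaccard is used only as a simple stand-in measure.
    """
    union = cogs_a | cogs_b
    if not union:
        return 0.0
    return 1.0 - len(cogs_a & cogs_b) / len(union)


def distance_matrix(genome_cogs):
    """Return {(genome_a, genome_b): distance} for all ordered pairs."""
    names = sorted(genome_cogs)
    return {
        (a, b): jaccard_distance(genome_cogs[a], genome_cogs[b])
        for a in names
        for b in names
    }


if __name__ == "__main__":
    # Hypothetical presence/absence data, for illustration only.
    toy = {
        "genome1": {"COG_A", "COG_B", "COG_C"},
        "genome2": {"COG_B", "COG_C", "COG_D"},
        "genome3": {"COG_X", "COG_Y"},
    }
    for pair, dist in distance_matrix(toy).items():
        print(pair, round(dist, 3))
```

    A matrix of this kind could be passed to any standard clustering or tree-building routine; genomes with similar repertoires of conserved genes end up close together.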

  5. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs)

    PubMed Central

    Natale, Darren A; Shankavaram, Uma T; Galperin, Michael Y; Wolf, Yuri I; Aravind, L; Koonin, Eugene V

    2000-01-01

    Background: Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi. Results: A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix. 
Conclusions: Special-purpose databases organized on the basis of phylogenetic analysis and carefully curated with respect to known and predicted protein functions provide for a significant improvement in genome annotation. A differential genome display approach helps in a systematic investigation of common and distinct features of gene repertoires and in some cases reveals unexpected connections that may be indicative of functional similarities between phylogenetically distant organisms and of lateral gene exchange. PMID:11178258

  6. Genetic structure in the southernmost populations of black-and-gold howler monkeys (Alouatta caraya) and its conservation implications

    PubMed Central

    Miño, Carolina Isabel; Fernández, Gabriela; Caputo, Mariela; Corach, Daniel

    2017-01-01

    Black-and-gold howler monkeys (Alouatta caraya) are arboreal primates that inhabit Neotropical forests; they are highly susceptible to the yellow fever virus, are considered early 'sentinels' of outbreaks, and are thus of major epidemiological importance. Currently, anthropogenic habitat loss and modification threaten their survival. Habitat modification can prevent, reduce or change dispersal behavior, which, in turn, may influence patterns of gene flow. We explored past and contemporary levels of genetic diversity, elucidated genetic structure and identified its possible drivers, in ten populations (n = 138) located in the southernmost distribution range of the species in South America, in Argentina and Paraguay. Overall, genetic variability was moderate (ten microsatellites: 3.16 ± 0.18 alleles per locus, allelic richness of 2.93 ± 0.81, 0.443 ± 0.025 unbiased expected heterozygosity; 22 haplotypes of 491-bp mitochondrial Control Region, haplotypic diversity of 0.930 ± 0.11, and nucleotide diversity of 0.01 ± 0.007). Significant evidence of inbreeding was found in a population that was later decimated by yellow fever. Population-based gene flow measures (FST = 0.13; θST = 0.18), hierarchical analysis of molecular variance and Bayesian clustering methods revealed significant genetic structure, grouping individuals into four clusters. Shared haplotypes and lack of mitochondrial differentiation (non-significant θST) among some populations seem to support the hypothesis of historical dispersal via riparian forests. Current resistance analyses revealed a significant role of landscape features in modeling contemporary gene flow: continuous forest and riparian forests could promote genetic exchange, whereas disturbed forests or crop/grassland fields may restrict it. Estimates of effective population size allow anticipating that the studied populations will lose 75% of heterozygosity in less than 50 generations. 
Our findings suggest that anthropogenic modifications on native forests, increasingly ongoing in Northeastern Argentina, Southern Paraguay and Southeastern Brazil, might prevent the dispersal of howlers, leading to population isolation. To ensure long-term viability and maintain genetic connectivity of A. caraya remnant populations, we recommend preserving and restoring habitat continuity. To conserve the species genetic pool, as well, the four genetic clusters identified here should be considered separate Management Units and given high conservation priority. In light of our findings and considering complementary non-genetic information, we suggest upgrading the international conservation status of A. caraya to “Vulnerable”. PMID:28968440
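
    The projection of heterozygosity loss in this abstract can be related to effective population size with the standard drift expectation H_t / H_0 = (1 - 1/(2Ne))^t. The sketch below applies that textbook formula; the abstract does not report the Ne estimates or the exact method used, so the Ne values tried here are hypothetical.

```python
import math


def heterozygosity_retained(ne, generations):
    """Expected fraction of heterozygosity left after drift.

    Uses the standard expectation H_t / H_0 = (1 - 1/(2*Ne)) ** t.
    """
    return (1.0 - 1.0 / (2.0 * ne)) ** generations


def generations_to_lose(fraction_lost, ne):
    """Generations until the given fraction of heterozygosity is lost."""
    return math.log(1.0 - fraction_lost) / math.log(1.0 - 1.0 / (2.0 * ne))


if __name__ == "__main__":
    # Hypothetical effective population sizes, for illustration only.
    for ne in (10, 18, 50):
        t = generations_to_lose(0.75, ne)
        print(f"Ne = {ne}: 75% of heterozygosity lost after about {t:.1f} generations")
```

    Under this model, a 75% loss within 50 generations corresponds to an effective population size of roughly 18 or fewer, which gives a sense of how small the populations implied by the abstract's projection would be.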

  7. Genome-wide identification and expression analysis of YTH domain-containing RNA-binding protein family in cucumber (Cucumis sativus).

    PubMed

    Zhou, Yong; Hu, Lifang; Jiang, Lunwei; Liu, Shiqiang

    2018-06-01

    YTH domain-containing RNA-binding proteins are involved in post-transcriptional regulation and play important roles in the growth and development as well as abiotic stress responses of plants. However, YTH genes have not been previously studied in cucumber (Cucumis sativus). In this study, a total of five YTH genes (CsYTH1-CsYTH5) were identified in cucumber, which could be mapped on three out of the seven cucumber chromosomes. All CsYTH proteins had highly conserved C-terminal YTH domains, and two of them (CsYTH1 and CsYTH4) harbored extra CCCH and P/Q/N-rich domains. The phylogenesis, conserved motifs and exon-intron structure of YTH genes from cucumber, Arabidopsis and rice were also analyzed. The phylogenetically closely clustered YTHs shared similar gene structures and conserved motifs. An analysis of the cis-acting regulatory elements in the upstream region of these genes resulted in the identification of many cis-elements related to stress, hormone and development. Expression analysis based on the transcriptome data showed that some CsYTHs had development- or tissue-specific expression. In addition, their expression levels were altered under various stresses such as salt, drought, cold, and abscisic acid (ABA) treatments. These findings lay the foundation for the functional analysis of CsYTHs in the future.

  8. 1. VIEW EAST, WEST FRONT OF SOIL CONSERVATION SERVICE CLUSTER ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    1. VIEW EAST, WEST FRONT OF SOIL CONSERVATION SERVICE CLUSTER (BUILDINGS 24, 25, 26) - U.S. Plant Introduction Station, Soil Conservation Service Cluster, 11601 Old Pond Road, Glenn Dale, Prince George's County, MD

  9. Heterologous production of an energy-conserving carbon monoxide dehydrogenase complex in the hyperthermophile Pyrococcus furiosus

    DOE PAGES

    Schut, Gerrit J.; Lipscomb, Gina L.; Nguyen, Diep M. N.; ...

    2016-01-29

    Carbon monoxide (CO) is an important intermediate in anaerobic carbon fixation pathways in acetogenesis and methanogenesis. In addition, some anaerobes can utilize CO as an energy source. In the hyperthermophilic archaeon Thermococcus onnurineus, which grows optimally at 80°C, CO oxidation and energy conservation are accomplished by a respiratory complex encoded by a 16-gene cluster containing a CO dehydrogenase, a membrane-bound [NiFe]-hydrogenase and a Na+/H+ antiporter module. This complex oxidizes CO, evolves CO2 and H2, and generates a Na+ motive force that is used to conserve energy by a Na+-dependent ATP synthase. Herein we used a bacterial artificial chromosome to insert the 13.2 kb gene cluster encoding the CO-oxidizing respiratory complex of T. onnurineus into the genome of the heterotrophic archaeon, Pyrococcus furiosus, which grows optimally at 100°C. P. furiosus is normally unable to utilize CO; however, the recombinant strain readily oxidized CO and generated H2 at 80°C. Moreover, CO also served as an energy source and allowed the P. furiosus strain to grow with a limiting concentration of sugar or with peptides as the carbon source. Moreover, CO oxidation by P. furiosus was also coupled to the re-utilization, presumably for biosynthesis, of acetate generated by fermentation. The functional transfer of CO utilization between Thermococcus and Pyrococcus species demonstrated herein is representative of the horizontal gene transfer of an environmentally relevant metabolic capability. The transfer of CO utilizing, hydrogen-producing genetic modules also has applications for biohydrogen production and a CO-based industrial platform for various thermophilic organisms.

  10. PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

    PubMed Central

    Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J

    2008-01-01

    Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. 
A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client side software setup or installation required. Source code is freely available to researchers interested in setting up a local version of PSAT for analysis of genomes not available through the public server. Access to the public web server and instructions for obtaining source code can be found at . PMID:18366802

  11. Sequencing and functional annotation of the whole genome of the filamentous fungus Aspergillus westerdijkiae.

    PubMed

    Han, Xiaolong; Chakrabortti, Alolika; Zhu, Jindong; Liang, Zhao-Xun; Li, Jinming

    2016-08-15

    Aspergillus westerdijkiae produces ochratoxin A (OTA) in Aspergillus section Circumdati. It is responsible for the contamination of agricultural crops, fruits, and food commodities, as its secondary metabolite OTA poses a potential threat to animals and humans. As a member of the filamentous fungi family, its capacity for enzymatic catalysis and secondary metabolite production is valuable in industrial production and medicine. To understand the genetic factors underlying its pathogenicity, enzymatic degradation, and secondary metabolism, we analysed the whole genome of A. westerdijkiae and compared it with eight other sequenced Aspergillus species. We sequenced the complete genome of A. westerdijkiae and assembled approximately 36 Mb of its genomic DNA, in which we identified 10,861 putative protein-coding genes. We constructed a phylogenetic tree of A. westerdijkiae and eight other sequenced Aspergillus species and found that the sister group of A. westerdijkiae was the A. oryzae - A. flavus clade. By searching the associated databases, we identified 716 cytochrome P450 enzymes, 633 carbohydrate-active enzymes, and 377 proteases. By combining comparative analysis with Kyoto Encyclopaedia of Genes and Genomes (KEGG), Conserved Domains Database (CDD), and Pfam annotations, we predicted 228 potential carbohydrate-active enzymes related to plant polysaccharide degradation (PPD). We found a large number of secondary biosynthetic gene clusters, which suggested that A. westerdijkiae had a remarkable capacity to produce secondary metabolites. Furthermore, we obtained two more reliable and integrated gene sequences containing the reported portions of OTA biosynthesis and identified their respective secondary metabolite clusters. We also systematically annotated these two hybrid t1pks-nrps gene clusters involved in OTA biosynthesis. 
These two clusters were separate in the genome, and one of them encoded a couple of GH3 and AA3 enzyme genes involved in sucrose and glucose metabolism. The genomic information obtained in this study is valuable for understanding the life cycle and pathogenicity of A. westerdijkiae. We identified numerous enzyme genes that are potentially involved in host invasion and pathogenicity, and we provided a preliminary prediction for each putative secondary metabolite (SM) gene cluster. In particular, for the OTA-related SM gene clusters, we delivered their components with domain and pathway annotations. This study sets the stage for experimental verification of the biosynthetic and regulatory mechanisms of OTA and for the discovery of new secondary metabolites.

  12. Molecular Regulation of Antibiotic Biosynthesis in Streptomyces

    PubMed Central

    Liu, Gang; Chandra, Govind; Niu, Guoqing

    2013-01-01

    SUMMARY Streptomycetes are the most abundant source of antibiotics. Typically, each species produces several antibiotics, with the profile being species specific. Streptomyces coelicolor, the model species, produces at least five different antibiotics. We review the regulation of antibiotic biosynthesis in S. coelicolor and other, nonmodel streptomycetes in the light of recent studies. The biosynthesis of each antibiotic is specified by a large gene cluster, usually including regulatory genes (cluster-situated regulators [CSRs]). These are the main point of connection with a plethora of generally conserved regulatory systems that monitor the organism's physiology, developmental state, population density, and environment to determine the onset and level of production of each antibiotic. Some CSRs may also be sensitive to the levels of different kinds of ligands, including products of the pathway itself, products of other antibiotic pathways in the same organism, and specialized regulatory small molecules such as gamma-butyrolactones. These interactions can result in self-reinforcing feed-forward circuitry and complex cross talk between pathways. The physiological signals and regulatory mechanisms may be of practical importance for the activation of the many cryptic secondary metabolic gene cluster pathways revealed by recent sequencing of numerous Streptomyces genomes. PMID:23471619

  13. The Expression and Function of the Achaete-Scute Genes in Tribolium castaneum Reveals Conservation and Variation in Neural Pattern Formation and Cell Fate Specification

    NASA Technical Reports Server (NTRS)

    Wheeler, Scott R.; Carrico, Michelle L.; Wilson, Beth A.; Brown, Susan J.; Skeath, James B.

    2003-01-01

    SUMMARY The study of achaete-scute (ac/sc) genes has recently become a paradigm to understand the evolution and development of the arthropod nervous system. We describe the identification and characterization of the ac/sc genes in the coleopteran insect species Tribolium castaneum. We have identified two Tribolium ac/sc genes - achaete-scute homolog (Tc-ASH) a proneural gene and asense (Tc-ase) a neural precursor gene that reside in a gene complex. Focusing on the embryonic central nervous system we find that Tc-ASH is expressed in all neural precursors and the proneural clusters from which they segregate. Through RNAi and misexpression studies we show that Tc-ASH is necessary for neural precursor formation in Tribolium and sufficient for neural precursor formation in Drosophila. Comparison of the function of the Drosophila and Tribolium proneural ac/sc genes suggests that in the Drosophila lineage these genes have maintained their ancestral function in neural precursor formation and have acquired a new role in the fate specification of individual neural precursors. Furthermore, we find that Tc-ase is expressed in all neural precursors suggesting an important and conserved role for asense genes in insect nervous system development. Our analysis of the Tribolium ac/sc genes indicates significant plasticity in gene number, expression and function, and implicates these modifications in the evolution of arthropod neural development.

  14. The expression and function of the achaete-scute genes in Tribolium castaneum reveals conservation and variation in neural pattern formation and cell fate specification

    NASA Technical Reports Server (NTRS)

    Wheeler, Scott R.; Carrico, Michelle L.; Wilson, Beth A.; Brown, Susan J.; Skeath, James B.

    2003-01-01

    The study of achaete-scute (ac/sc) genes has recently become a paradigm to understand the evolution and development of the arthropod nervous system. We describe the identification and characterization of the ac/sc genes in the coleopteran insect species Tribolium castaneum. We have identified two Tribolium ac/sc genes - achaete-scute homolog (Tc-ASH) a proneural gene and asense (Tc-ase) a neural precursor gene that reside in a gene complex. Focusing on the embryonic central nervous system we find that Tc-ASH is expressed in all neural precursors and the proneural clusters from which they segregate. Through RNAi and misexpression studies we show that Tc-ASH is necessary for neural precursor formation in Tribolium and sufficient for neural precursor formation in Drosophila. Comparison of the function of the Drosophila and Tribolium proneural ac/sc genes suggests that in the Drosophila lineage these genes have maintained their ancestral function in neural precursor formation and have acquired a new role in the fate specification of individual neural precursors. Furthermore, we find that Tc-ase is expressed in all neural precursors suggesting an important and conserved role for asense genes in insect nervous system development. Our analysis of the Tribolium ac/sc genes indicates significant plasticity in gene number, expression and function, and implicates these modifications in the evolution of arthropod neural development.

  15. Census of solo LuxR genes in prokaryotic genomes

    PubMed Central

    Hudaiberdiev, Sanjarbek; Choudhary, Kumari S.; Vera Alvarez, Roberto; Gelencsér, Zsolt; Ligeti, Balázs; Lamba, Doriano; Pongor, Sándor

    2015-01-01

    luxR genes encode transcriptional regulators that control acyl homoserine lactone-based quorum sensing (AHL QS) in Gram negative bacteria. On the bacterial chromosome, luxR genes are usually found next or near to a luxI gene encoding the AHL signal synthase. Recently, a number of luxR genes were described that have no luxI genes in their vicinity on the chromosome. These so-called solo luxR genes may either respond to internal AHL signals produced by a non-adjacent luxI in the chromosome, or can respond to exogenous signals. Here we present a survey of solo luxR genes found in complete and draft bacterial genomes in the NCBI databases using HMMs. We found that 2698 of the 3550 luxR genes found are solos, which is an unexpectedly high number even if some of the hits may be false positives. We also found that solo LuxR sequences form distinct clusters that are different from the clusters of LuxR sequences that are part of the known luxR-luxI topological arrangements. We also found a number of cases that we termed twin luxR topologies, in which two adjacent luxR genes were in tandem or divergent orientation. Many of the luxR solo clusters were devoid of the sequence motifs characteristic of AHL binding LuxR proteins so there is room to speculate that the solos may be involved in sensing hitherto unknown signals. It was noted that only some of the LuxR clades are rich in conserved cysteine residues. Molecular modeling suggests that some of the cysteines may be involved in disulfide formation, which makes us speculate that some LuxR proteins, including some of the solos may be involved in redox regulation. PMID:25815274

  16. Census of solo LuxR genes in prokaryotic genomes.

    PubMed

    Hudaiberdiev, Sanjarbek; Choudhary, Kumari S; Vera Alvarez, Roberto; Gelencsér, Zsolt; Ligeti, Balázs; Lamba, Doriano; Pongor, Sándor

    2015-01-01

    luxR genes encode transcriptional regulators that control acyl homoserine lactone-based quorum sensing (AHL QS) in Gram negative bacteria. On the bacterial chromosome, luxR genes are usually found next or near to a luxI gene encoding the AHL signal synthase. Recently, a number of luxR genes were described that have no luxI genes in their vicinity on the chromosome. These so-called solo luxR genes may either respond to internal AHL signals produced by a non-adjacent luxI in the chromosome, or can respond to exogenous signals. Here we present a survey of solo luxR genes found in complete and draft bacterial genomes in the NCBI databases using HMMs. We found that 2698 of the 3550 luxR genes found are solos, which is an unexpectedly high number even if some of the hits may be false positives. We also found that solo LuxR sequences form distinct clusters that are different from the clusters of LuxR sequences that are part of the known luxR-luxI topological arrangements. We also found a number of cases that we termed twin luxR topologies, in which two adjacent luxR genes were in tandem or divergent orientation. Many of the luxR solo clusters were devoid of the sequence motifs characteristic of AHL binding LuxR proteins so there is room to speculate that the solos may be involved in sensing hitherto unknown signals. It was noted that only some of the LuxR clades are rich in conserved cysteine residues. Molecular modeling suggests that some of the cysteines may be involved in disulfide formation, which makes us speculate that some LuxR proteins, including some of the solos may be involved in redox regulation.

  17. Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.

    PubMed

    Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J

    2008-06-18

    Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. 
This study shows that SCC is an alternative to the Pearson correlation coefficient and the SD-weighted correlation coefficient, and is particularly useful for clustering replicated microarray data. This computational approach should be generally useful for proteomic data or other high-throughput analysis methodology.

  18. Lineage-specific evolution of cnidarian Wnt ligands.

    PubMed

    Hensel, Katrin; Lotan, Tamar; Sanders, Steve M; Cartwright, Paulyn; Frank, Uri

    2014-09-01

    We have studied the evolution of Wnt genes in cnidarians and the expression pattern of all Wnt ligands in the hydrozoan Hydractinia echinata. Current views favor a scenario in which 12 Wnt sub-families were jointly inherited by cnidarians and bilaterians from their last common ancestor. Our phylogenetic analyses clustered all medusozoan genes in distinct, well-supported clades, but many orthologous relationships between medusozoan Wnts and anthozoan and bilaterian Wnt genes were poorly supported. Only seven anthozoan genes, Wnt2, Wnt4, Wnt5, Wnt6, Wnt10, Wnt11, and Wnt16, were recovered with strong support with bilaterian genes and of those, only the Wnt2, Wnt5, Wnt11, and Wnt16 clades also included medusozoan genes. Although medusozoan Wnt8 genes clustered with anthozoan and bilaterian genes, this was not well supported. In situ hybridization studies revealed poor conservation of expression patterns of putative Wnt orthologs within Cnidaria. In polyps, only Wnt1, Wnt3, and Wnt7 were expressed at the same position in the studied cnidarian models Hydra, Hydractinia, and Nematostella. Different expression patterns are consistent with divergent functions. Our data do not fully support previous assertions regarding Wnt gene homology, and suggest a more complex history of Wnt family genes than previously suggested. This includes high rates of sequence divergence and lineage-specific duplications of Wnt genes within medusozoans, followed by functional divergence over evolutionary time scales. © 2014 Wiley Periodicals, Inc.

  19. ApiEST-DB: analyzing clustered EST data of the apicomplexan parasites.

    PubMed

    Li, Li; Crabtree, Jonathan; Fischer, Steve; Pinney, Deborah; Stoeckert, Christian J; Sibley, L David; Roos, David S

    2004-01-01

    ApiEST-DB (http://www.cbil.upenn.edu/paradbs-servlet/) provides integrated access to publicly available EST data from protozoan parasites in the phylum Apicomplexa. The database currently incorporates a total of nearly 100,000 ESTs from several parasite species of clinical and/or veterinary interest, including Eimeria tenella, Neospora caninum, Plasmodium falciparum, Sarcocystis neurona and Toxoplasma gondii. To facilitate analysis of these data, EST sequences were clustered and assembled to form consensus sequences for each organism, and these assemblies were then subjected to automated annotation via similarity searches against protein and domain databases. The underlying relational database infrastructure, Genomics Unified Schema (GUS), enables complex biologically based queries, facilitating validation of gene models, identification of alternative splicing, detection of single nucleotide polymorphisms, identification of stage-specific genes and recognition of phylogenetically conserved and phylogenetically restricted sequences.

  20. Promoter Engineering Reveals the Importance of Heptameric Direct Repeats for DNA Binding by Streptomyces Antibiotic Regulatory Protein-Large ATP-Binding Regulator of the LuxR Family (SARP-LAL) Regulators in Streptomyces natalensis.

    PubMed

    Barreales, Eva G; Vicente, Cláudia M; de Pedro, Antonio; Santos-Aberturas, Javier; Aparicio, Jesús F

    2018-05-15

    The biosynthesis of small-size polyene macrolides is ultimately controlled by a couple of transcriptional regulators that act in a hierarchical way. A Streptomyces antibiotic regulatory protein-large ATP-binding regulator of the LuxR family (SARP-LAL) regulator binds the promoter of a PAS-LuxR regulator-encoding gene and activates its transcription, and in turn, the gene product of the latter activates transcription from various promoters of the polyene gene cluster directly. The primary operator of PimR, the archetype of SARP-LAL regulators, contains three heptameric direct repeats separated by four-nucleotide spacers, but the regulator can also bind a secondary operator with only two direct repeats separated by a 3-nucleotide spacer, both located in the promoter region of its unique target gene, pimM. A similar arrangement of operators has been identified for PimR counterparts encoded by gene clusters for different antifungal secondary metabolites, including not only polyene macrolides but peptidyl nucleosides, phoslactomycins, or cycloheximide. Here, we used promoter engineering and quantitative transcriptional analyses to determine the contributions of the different heptameric repeats to transcriptional activation and final polyene production. Optimized promoters have thus been developed. Deletion studies and electrophoretic mobility assays were used for the definition of DNA-binding boxes formed by 22-nucleotide sequences comprising two conserved heptameric direct repeats separated by four-nucleotide less conserved spacers. The cooperative binding of PimR SARP appears to be the mechanism involved in the binding of regulator monomers to operators, and at least two protein monomers are required for efficient binding. 
IMPORTANCE Here, we have shown that a modulation of the production of the antifungal pimaricin in Streptomyces natalensis can be accomplished via promoter engineering of the PAS-LuxR transcriptional activator pimM. The expression of this gene is controlled by the Streptomyces antibiotic regulatory protein-large ATP-binding regulator of the LuxR family (SARP-LAL) regulator PimR, which binds a series of heptameric direct repeats in its promoter region. The structure and importance of such repeats in protein binding, transcriptional activation, and polyene production have been investigated. These findings should provide important clues to understand the regulatory machinery that modulates antibiotic biosynthesis in Streptomyces and open new possibilities for the manipulation of metabolite production. The presence of PimR orthologues encoded by gene clusters for different secondary metabolites and the conservation of their operators suggest that the improvements observed in the activation of pimaricin biosynthesis by Streptomyces natalensis could be extrapolated to the production of different compounds by other species. Copyright © 2018 Barreales et al.

  1. Widespread signatures of local mRNA folding structure selection in four Dengue virus serotypes

    PubMed Central

    2015-01-01

    Background It is known that mRNA folding can affect and regulate various gene expression steps both in living organisms and in viruses. Previous studies have recognized functional RNA structures in the genome of the Dengue virus. However, these studies usually focused either on the viral untranslated regions or on very specific and limited regions at the beginning of the coding sequences, in a limited number of strains, and without considering evolutionary selection. Results Here we performed the first large scale comprehensive genomics analysis of selection for local mRNA folding strength in the Dengue virus coding sequences, based on a total of 1,670 genomes and 4 serotypes. Our analysis identified clusters of positions along the coding regions that may undergo a conserved evolutionary selection for strong or weak local folding maintained across different viral variants. Specifically, 53-66 clusters for strong folding and 49-73 clusters for weak folding (depending on serotype), each an aggregate of positions with significantly conserved folding energy signals (related to partially overlapping local genomic regions), were recognized. In addition, up to 7% of these positions were found to be conserved in more than 90% of the viral genomes. Although some of the identified positions undergo frequent synonymous / non-synonymous substitutions, the selection for folding strength therein is preserved, and thus cannot be trivially explained based on sequence conservation alone. Conclusions The fact that many of the positions with significant folding related signals are conserved among different Dengue variants suggests that a better understanding of the mRNA structures in the corresponding regions may promote the development of prospective anti-Dengue vaccination strategies. The comparative genomics approach described here can be employed in the future for detecting functional regions in other pathogens with very high mutation rates. PMID:26449467

  2. Conservation of small RNA pathways in platypus

    PubMed Central

    Murchison, Elizabeth P.; Kheradpour, Pouya; Sachidanandam, Ravi; Smith, Carly; Hodges, Emily; Xuan, Zhenyu; Kellis, Manolis; Grützner, Frank; Stark, Alexander; Hannon, Gregory J.

    2008-01-01

    Small RNA pathways play evolutionarily conserved roles in gene regulation and defense from parasitic nucleic acids. The character and expression patterns of small RNAs show conservation throughout animal lineages, but specific animal clades also show variations on these recurring themes, including species-specific small RNAs. The monotremes, with only platypus and four species of echidna as extant members, represent the basal branch of the mammalian lineage. Here, we examine the small RNA pathways of monotremes by deep sequencing of six platypus and echidna tissues. We find that highly conserved microRNA species display their signature tissue-specific expression patterns. In addition, we find a large rapidly evolving cluster of microRNAs on platypus chromosome X1, which is unique to monotremes. Platypus and echidna testes contain a robust Piwi-interacting (piRNA) system, which appears to be participating in ongoing transposon defense. PMID:18463306

  3. Conservation of small RNA pathways in platypus.

    PubMed

    Murchison, Elizabeth P; Kheradpour, Pouya; Sachidanandam, Ravi; Smith, Carly; Hodges, Emily; Xuan, Zhenyu; Kellis, Manolis; Grützner, Frank; Stark, Alexander; Hannon, Gregory J

    2008-06-01

    Small RNA pathways play evolutionarily conserved roles in gene regulation and defense from parasitic nucleic acids. The character and expression patterns of small RNAs show conservation throughout animal lineages, but specific animal clades also show variations on these recurring themes, including species-specific small RNAs. The monotremes, with only platypus and four species of echidna as extant members, represent the basal branch of the mammalian lineage. Here, we examine the small RNA pathways of monotremes by deep sequencing of six platypus and echidna tissues. We find that highly conserved microRNA species display their signature tissue-specific expression patterns. In addition, we find a large rapidly evolving cluster of microRNAs on platypus chromosome X1, which is unique to monotremes. Platypus and echidna testes contain a robust Piwi-interacting (piRNA) system, which appears to be participating in ongoing transposon defense.

  4. Crystal structure of SgcJ, an NTF2-like superfamily protein involved in biosynthesis of the nine-membered enediyne antitumor antibiotic C-1027

    DOE PAGES

    Huang, Tingting; Chang, Chin-Yuan; Lohman, Jeremy R.; ...

    2016-10-01

    Comparative analysis of the enediyne biosynthetic gene clusters revealed sets of conserved genes serving as outstanding candidates for the enediyne core. Here we report the crystal structures of SgcJ and its homologue NCS-Orf16, together with gene inactivation and site-directed mutagenesis studies, to gain insight into enediyne core biosynthesis. Gene inactivation in vivo establishes that SgcJ is required for C-1027 production in Streptomyces globisporus. SgcJ and NCS-Orf16 share a common structure with the nuclear transport factor 2-like superfamily of proteins, featuring a putative substrate binding or catalytic active site. Site-directed mutagenesis of the conserved residues lining this site allowed us to propose that SgcJ and its homologues may play a catalytic role in transforming the linear polyene intermediate, along with other enediyne polyketide synthase-associated enzymes, into an enzyme-sequestered enediyne core intermediate. In conclusion, these findings will help formulate hypotheses and design experiments to ascertain the function of SgcJ and its homologues in nine-membered enediyne core biosynthesis.

  5. Database resources of the National Center for Biotechnology Information

    PubMed Central

    Wheeler, David L.; Barrett, Tanya; Benson, Dennis A.; Bryant, Stephen H.; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M.; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Geer, Lewis Y.; Helmberg, Wolfgang; Kapustin, Yuri; Kenton, David L.; Khovayko, Oleg; Lipman, David J.; Madden, Thomas L.; Maglott, Donna R.; Ostell, James; Pruitt, Kim D.; Schuler, Gregory D.; Schriml, Lynn M.; Sequeira, Edwin; Sherry, Stephen T.; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Suzek, Tugba O.; Tatusov, Roman; Tatusova, Tatiana A.; Wagner, Lukas; Yaschenko, Eugene

    2006-01-01

    In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Retroviral Genotyping Tools, HIV-1, Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus, Entrez Probe, GENSAT, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at: . PMID:16381840

  6. Database resources of the National Center for Biotechnology Information.

    PubMed

    Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; Dicuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Krasnov, Sergey; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Karsch-Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian

    2012-01-01

    In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

  7. A Comprehensive Analysis of Replicative Lifespan in 4,698 Single-Gene Deletion Strains Uncovers Conserved Mechanisms of Aging.

    PubMed

    McCormick, Mark A; Delaney, Joe R; Tsuchiya, Mitsuhiro; Tsuchiyama, Scott; Shemorry, Anna; Sim, Sylvia; Chou, Annie Chia-Zong; Ahmed, Umema; Carr, Daniel; Murakami, Christopher J; Schleit, Jennifer; Sutphin, George L; Wasko, Brian M; Bennett, Christopher F; Wang, Adrienne M; Olsen, Brady; Beyer, Richard P; Bammler, Theodor K; Prunkard, Donna; Johnson, Simon C; Pennypacker, Juniper K; An, Elroy; Anies, Arieanna; Castanza, Anthony S; Choi, Eunice; Dang, Nick; Enerio, Shiena; Fletcher, Marissa; Fox, Lindsay; Goswami, Sarani; Higgins, Sean A; Holmberg, Molly A; Hu, Di; Hui, Jessica; Jelic, Monika; Jeong, Ki-Soo; Johnston, Elijah; Kerr, Emily O; Kim, Jin; Kim, Diana; Kirkland, Katie; Klum, Shannon; Kotireddy, Soumya; Liao, Eric; Lim, Michael; Lin, Michael S; Lo, Winston C; Lockshon, Dan; Miller, Hillary A; Moller, Richard M; Muller, Brian; Oakes, Jonathan; Pak, Diana N; Peng, Zhao Jun; Pham, Kim M; Pollard, Tom G; Pradeep, Prarthana; Pruett, Dillon; Rai, Dilreet; Robison, Brett; Rodriguez, Ariana A; Ros, Bopharoth; Sage, Michael; Singh, Manpreet K; Smith, Erica D; Snead, Katie; Solanky, Amrita; Spector, Benjamin L; Steffen, Kristan K; Tchao, Bie Nga; Ting, Marc K; Vander Wende, Helen; Wang, Dennis; Welton, K Linnea; Westman, Eric A; Brem, Rachel B; Liu, Xin-Guang; Suh, Yousin; Zhou, Zhongjun; Kaeberlein, Matt; Kennedy, Brian K

    2015-11-03

    Many genes that affect replicative lifespan (RLS) in the budding yeast Saccharomyces cerevisiae also affect aging in other organisms such as C. elegans and M. musculus. We performed a systematic analysis of yeast RLS in a set of 4,698 viable single-gene deletion strains. Multiple functional gene clusters were identified, and full genome-to-genome comparison demonstrated a significant conservation in longevity pathways between yeast and C. elegans. Among the mechanisms of aging identified, deletion of tRNA exporter LOS1 robustly extended lifespan. Dietary restriction (DR) and inhibition of mechanistic Target of Rapamycin (mTOR) exclude Los1 from the nucleus in a Rad53-dependent manner. Moreover, lifespan extension from deletion of LOS1 is nonadditive with DR or mTOR inhibition, and results in Gcn4 transcription factor activation. Thus, the DNA damage response and mTOR converge on Los1-mediated nuclear tRNA export to regulate Gcn4 activity and aging. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. Database resources of the National Center for Biotechnology Information

    PubMed Central

    Acland, Abigail; Agarwala, Richa; Barrett, Tanya; Beck, Jeff; Benson, Dennis A.; Bollin, Colleen; Bolton, Evan; Bryant, Stephen H.; Canese, Kathi; Church, Deanna M.; Clark, Karen; DiCuccio, Michael; Dondoshansky, Ilya; Federhen, Scott; Feolo, Michael; Geer, Lewis Y.; Gorelenkov, Viatcheslav; Hoeppner, Marilu; Johnson, Mark; Kelly, Christopher; Khotomlianski, Viatcheslav; Kimchi, Avi; Kimelman, Michael; Kitts, Paul; Krasnov, Sergey; Kuznetsov, Anatoliy; Landsman, David; Lipman, David J.; Lu, Zhiyong; Madden, Thomas L.; Madej, Tom; Maglott, Donna R.; Marchler-Bauer, Aron; Karsch-Mizrachi, Ilene; Murphy, Terence; Ostell, James; O'Sullivan, Christopher; Panchenko, Anna; Phan, Lon; Preuss, Don; Pruitt, Kim D.; Rubinstein, Wendy; Sayers, Eric W.; Schneider, Valerie; Schuler, Gregory D.; Sequeira, Edwin; Sherry, Stephen T.; Shumway, Martin; Sirotkin, Karl; Siyan, Karanjit; Slotta, Douglas; Soboleva, Alexandra; Soussov, Vladimir; Starchenko, Grigory; Tatusova, Tatiana A.; Trawick, Bart W.; Vakatov, Denis; Wang, Yanli; Ward, Minghong; Wilbur, W. John; Yaschenko, Eugene; Zbicz, Kerry

    2014-01-01

    In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, PubReader, Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Primer-BLAST, COBALT, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, the Genetic Testing Registry, Genome and related tools, the Map Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, ClinVar, MedGen, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Probe, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All these resources can be accessed through the NCBI home page. PMID:24259429

  9. Characteristics and phylogenetic analysis of the complete mitochondrial genome of Cheilodactylus quadricornis (Perciformes, Cheilodactylidae).

    PubMed

    Wang, Aishuai; Sun, Yuena; Wu, Changwen

    2016-11-01

    The complete mitochondrial genome of Cheilodactylus quadricornis was determined for the first time in the present study. The mitochondrial genome of C. quadricornis is 16 521 nucleotides long and comprises 13 protein-coding genes, 2 ribosomal RNA genes, 22 tRNA genes and 2 main non-coding regions (the control region and the origin of light-strand replication). The overall base composition was T, 26.3%; C, 29.6%; A, 27.8% and G, 16.3%. The gene arrangement, base composition and tRNA structures of the complete mitochondrial genome of C. quadricornis are similar to those of other teleosts. Only two central conserved sequence blocks (CSB-2 and CSB-3) were identified in the control region. In addition, the conserved motif 5'-GCCGG-3' was identified in the origin of light-strand replication of C. quadricornis. The complete mitochondrial genome of C. quadricornis was used to construct a phylogenetic tree, which shows that C. quadricornis and C. variegatus clustered in one clade and formed a sister relationship. These mitogenome sequence data should play an important role in population genetics and phylogenetic analysis of the Cheilodactylidae.
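
    The base-composition figures in this abstract can be checked with a few lines of Python. This is a minimal sketch, not the authors' pipeline: the helper names (base_composition, at_content, gc_skew) are hypothetical, and the only data used are the four percentages quoted above.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T in a DNA sequence."""
    # Count only unambiguous bases; N and other IUPAC codes are ignored.
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {b: 100.0 * counts[b] / total for b in "ACGT"}


def at_content(comp):
    """A+T content in percent, given a composition dictionary."""
    return comp["A"] + comp["T"]


def gc_skew(comp):
    """GC skew, (G - C) / (G + C), given a composition dictionary."""
    gc = comp["G"] + comp["C"]
    if gc == 0:
        raise ValueError("G + C is zero; skew is undefined")
    return (comp["G"] - comp["C"]) / gc


# Percentages as quoted in the abstract (the sequence itself is not given here).
reported = {"T": 26.3, "C": 29.6, "A": 27.8, "G": 16.3}

if __name__ == "__main__":
    print("sum of percentages:", round(sum(reported.values()), 1))
    print("A+T content (%):", round(at_content(reported), 1))
    print("GC skew:", round(gc_skew(reported), 3))
```

    With the quoted values the percentages sum to 100, A+T content is about 54.1%, and the GC skew is negative (more C than G on the reported strand).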

  10. Genomic analysis of carboxyl/cholinesterase genes in the silkworm Bombyx mori

    PubMed Central

    2010-01-01

    Background: Carboxyl/cholinesterases (CCEs) have pivotal roles in dietary detoxification, pheromone or hormone degradation and neurodevelopment. The recent completion of genome projects in various insect species has led to the identification of multiple CCEs with unknown functions. Here, we analyzed the phylogeny, expression and genomic distribution of 69 putative CCEs in the silkworm, Bombyx mori (Lepidoptera: Bombycidae). Results: A phylogenetic tree of CCEs in B. mori and other lepidopteran species was constructed. The expression pattern of each B. mori CCE was also investigated by a search of an expressed sequence tag (EST) database, and the relationship between phylogeny and expression was analyzed. A large number of B. mori CCEs were identified from a midgut EST library. CCEs expressed in the midgut formed a cluster in the phylogenetic tree that included not only B. mori genes but also those of other lepidopteran species. The silkworm, and possibly also other lepidopteran species, has a large number of CCEs, and this might be a consequence of the large cluster of midgut CCEs. Investigation of intron-exon organization in B. mori CCEs revealed that their positions and splicing site phases were strongly conserved. Several B. mori CCEs, including juvenile hormone esterase, not only showed clustering in the phylogenetic tree but were also closely located on silkworm chromosomes. We investigated the phylogeny and microsynteny of neuroligins in detail, among many CCEs. Interestingly, we found that the evolution of this gene appeared not to be conserved between B. mori and other insect orders. Conclusions: We analyzed 69 putative CCEs from B. mori. Comparison of these CCEs with other lepidopteran CCEs indicated that they had conserved expression and function in this insect order. The analyses showed that CCEs were unevenly distributed across the genome of B. mori and suggested that neuroligins may have an evolutionary history distinct from that in other insect orders. It is possible that such an uneven genomic distribution and a unique neuroligin evolution are shared with other lepidopteran insects. Our genomic analysis has provided novel information on the CCEs of the silkworm, which will be of value to understanding the biology, physiology and evolution of insect CCEs. PMID:20546589

  11. Genetic diversity analysis of cultivated Korarima [Aframomum corrorima (Braun) P.C.M. Jansen] populations from southwestern Ethiopia using inter simple sequence repeats (ISSR) marker.

    PubMed

    Chombe, Dagmawit; Bekele, Endashaw

    2018-12-01

    Korarima (Aframomum corrorima) is a perennial, aromatic herb native to and widely distributed in southwestern Ethiopia. It is known for its fine flavor as a spice in various Ethiopian traditional dishes. Few molecular studies have been performed on this species so far. In the present paper, the ISSR technique was employed to study the genetic diversity in populations of cultivated A. corrorima. Seven ISSR primers produced a total of 86 clearly scorable DNA bands. High levels of genetic diversity were detected in cultivated A. corrorima (percentage of polymorphic bands = 97.67%, gene diversity = 0.35, Shannon's information index = 0.52). Analysis of molecular variance (AMOVA) showed that 27.47% of the variation is attributed to variation among populations and 72.53% to variation within populations. The Fst value (0.28) showed significant (p < 0.0001) genetic differentiation among populations. This was supported by the high coefficient of gene differentiation (Gst = 0.32) and low estimated gene flow (Nm = 1.08). A neighbor-joining dendrogram showed that the thirteen cultivated populations were separated into three clusters, which was in good accordance with the results provided by the two-dimensional and three-dimensional coordinate analyses. However, the clusters did not reveal a clear pattern of populations clustering according to their geographic origin. This could be due to human-mediated transfer of genetic material among different localities. The genetic diversity in populations of A. corrorima from the southwestern part of Ethiopia was relatively high. This finding should be taken into account when conservation actions, management policies for the species and site identification for in situ and ex situ conservation strategies are developed. The Mizan Teferi II population displayed the highest genetic diversity; this population should be considered as the key site in designing conservation strategies for this crop. In addition, the Jimma I and Jimma II populations, which had the lowest genetic diversity, should also be considered because of the putative risk of extinction that their low genetic diversity poses.
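
    The summary statistics quoted in this abstract can be related to each other with a short calculation. The sketch below assumes the commonly used approximation Nm = 0.5 (1 - Gst) / Gst for estimating gene flow from Gst; the abstract does not say which formula or software produced its value, so this is an illustration rather than a reproduction. The count of 84 polymorphic bands is inferred from 97.67% of 86 and is not stated in the abstract; the function names are hypothetical.

```python
def gene_flow_from_gst(gst):
    """Estimate gene flow (Nm) from Gst using Nm = 0.5 * (1 - Gst) / Gst.

    Assumption: this is the widely used approximation; the abstract does
    not state how its Nm value was obtained.
    """
    if not 0.0 < gst <= 1.0:
        raise ValueError("Gst must be in the interval (0, 1]")
    return 0.5 * (1.0 - gst) / gst


def percent_polymorphic(polymorphic_bands, total_bands):
    """Percentage of scored bands that are polymorphic."""
    if total_bands <= 0:
        raise ValueError("total_bands must be positive")
    return 100.0 * polymorphic_bands / total_bands


if __name__ == "__main__":
    # Gst as quoted (rounded to two decimals) in the abstract.
    print("Nm from Gst = 0.32:", round(gene_flow_from_gst(0.32), 2))
    # 84 polymorphic bands is an inferred count, not a reported one.
    print("polymorphic bands (%):", round(percent_polymorphic(84, 86), 2))
```

    Using the rounded Gst of 0.32 gives an Nm near 1.06, slightly below the reported 1.08; the gap is consistent with Gst having been rounded before publication, although the abstract does not confirm this.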

  12. A global analysis of adaptive evolution of operons in cyanobacteria.

    PubMed

    Memon, Danish; Singh, Abhay K; Pakrasi, Himadri B; Wangikar, Pramod P

    2013-02-01

    Operons are an important feature of prokaryotic genomes. Evolution of operons is hypothesized to be adaptive and has contributed significantly towards coordinated optimization of functions. Two conflicting theories, based on (i) in situ formation to achieve co-regulation and (ii) horizontal gene transfer of functionally linked gene clusters, are generally considered to explain why and how operons have evolved. Furthermore, effects of operon evolution on genomic traits such as intergenic spacing, operon size and co-regulation are relatively less explored. Based on the conservation level in a set of diverse prokaryotes, we categorize the operonic gene pair associations and in turn the operons as ancient and recently formed. This allowed us to perform a detailed analysis of operonic structure in cyanobacteria, a morphologically and physiologically diverse group of photoautotrophs. Clustering based on operon conservation showed significant similarity with the 16S rRNA-based phylogeny, which groups the cyanobacterial strains into three clades. Clade C, dominated by strains that are believed to have undergone genome reduction, shows a larger fraction of operonic genes that are tightly packed in larger sized operons. Ancient operons are in general larger, more tightly packed, better optimized for co-regulation and part of key cellular processes. A sub-clade within Clade B, which includes Synechocystis sp. PCC 6803, shows a reverse trend in intergenic spacing. Our results suggest that while in situ formation and vertical descent may be a dominant mechanism of operon evolution in cyanobacteria, optimization of intergenic spacing and co-regulation are part of an ongoing process in the life-cycle of operons.

  13. The iron-binding CyaY and IscX proteins assist the ISC-catalyzed Fe-S biogenesis in Escherichia coli.

    PubMed

    Roche, Béatrice; Huguenot, Allison; Barras, Frédéric; Py, Béatrice

    2015-02-01

    In eukaryotes, frataxin (FXN) deficiency causes severe phenotypes, including loss of iron-sulfur (Fe-S) cluster protein activity and accumulation of mitochondrial iron, and leads to the neurodegenerative disease Friedreich's ataxia. In contrast, in prokaryotes, deficiency in the FXN homolog, CyaY, was reported not to cause any significant phenotype, calling into question both its importance and its actual contribution to Fe-S cluster biogenesis. Because FXN is conserved between eukaryotes and prokaryotes, this surprising discrepancy prompted us to reinvestigate the role of CyaY in Escherichia coli. We report that CyaY (i) potentiates E. coli fitness, (ii) belongs to the ISC pathway catalyzing the maturation of Fe-S cluster-containing proteins and (iii) requires iron-rich conditions for its contribution to be significant. A genetic interaction was discovered between cyaY and iscX, the last gene of the isc operon. Deletion of both genes showed an additive effect on Fe-S cluster protein maturation, which led, among other effects, to increased resistance to aminoglycosides and increased sensitivity to lambda phage infection. Together, these in vivo results establish the importance of CyaY as a member of the ISC-mediated Fe-S cluster biogenesis pathway in E. coli, as FXN is in eukaryotes, and validate IscX as a new bona fide Fe-S cluster biogenesis factor. © 2014 John Wiley & Sons Ltd.

  14. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes

    PubMed Central

    Matus, José Tomás; Aquea, Felipe; Arce-Johnson, Patricio

    2008-01-01

    Background: The MYB superfamily constitutes the most abundant group of transcription factors described in plants. Members control processes such as epidermal cell differentiation, stomatal aperture, flavonoid synthesis, cold and drought tolerance and pathogen resistance. No genome-wide characterization of this family has been conducted in a woody species such as grapevine. In addition, previous analysis of the recently released grape genome sequence suggested expansion events of several gene families involved in wine quality. Results: We describe and classify 108 members of the grape R2R3 MYB gene subfamily in terms of their genomic gene structures and similarity to their putative Arabidopsis thaliana orthologues. Seven gene models were derived and analyzed in terms of gene expression and their DNA binding domain structures. Despite low overall sequence homology in the C-terminus of all proteins, even in those with similar functions across Arabidopsis and Vitis, highly conserved motif sequences and exon lengths were found. The grape epidermal cell fate clade is expanded when compared with the Arabidopsis and rice MYB subfamilies. Two anthocyanin MYBA related clusters were identified in chromosomes 2 and 14, one of which includes the previously described grape colour locus. Tannin related loci were also detected with eight candidate homologues in chromosomes 4, 9 and 11. Conclusion: This genome-wide transcription factor analysis in Vitis suggests that clade-specific grape R2R3 MYB genes are expanded while other MYB genes could be well conserved compared to Arabidopsis. MYB gene abundance, homology and orientation within particular loci also suggest that expanded MYB clades conferring quality attributes of grapes and wines, such as colour and astringency, could possess redundant, overlapping and cooperative functions. PMID:18647406

  15. 4. VIEW NORTHWEST, NORTH FRONT OF SOIL CONSERVATION SERVICE CLUSTER ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    4. VIEW NORTHWEST, NORTH FRONT OF SOIL CONSERVATION SERVICE CLUSTER (BUILDINGS 24, 25, 26); NORTH FRONT OF QUARANTINE HEADHOUSE (BUILDING 27) - U.S. Plant Introduction Station, Soil Conservation Service Cluster, 11601 Old Pond Road, Glenn Dale, Prince George's County, MD

  16. Gene Coexpression Network Alignment and Conservation of Gene Modules between Two Grass Species: Maize and Rice

    PubMed Central

    Ficklin, Stephen P.; Feltus, F. Alex

    2011-01-01

    One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species. PMID:21606319

  17. Gene coexpression network alignment and conservation of gene modules between two grass species: maize and rice.

    PubMed

    Ficklin, Stephen P; Feltus, F Alex

    2011-07-01

    One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species.

  18. Identification of evolutionarily conserved DNA damage response genes that alter sensitivity to cisplatin

    PubMed Central

    Gaponova, Anna V.; Deneka, Alexander Y.; Beck, Tim N.; Liu, Hanqing; Andrianov, Gregory; Nikonova, Anna S.; Nicolas, Emmanuelle; Einarson, Margret B.; Golemis, Erica A.; Serebriiskii, Ilya G.

    2017-01-01

    Ovarian, head and neck, and other cancers are commonly treated with cisplatin and other DNA damaging cytotoxic agents. Altered DNA damage response (DDR) contributes to resistance of these tumors to chemotherapies, some targeted therapies, and radiation. DDR involves multiple protein complexes and signaling pathways, some of which are evolutionarily ancient and involve protein orthologs conserved from yeast to humans. To identify new regulators of cisplatin-resistance in human tumors, we integrated high throughput and curated datasets describing yeast genes that regulate sensitivity to cisplatin and/or ionizing radiation. Next, we clustered highly validated genes based on chemogenomic profiling, and then mapped orthologs of these genes in expanded genomic networks for multiple metazoans, including humans. This approach identified an enriched candidate set of genes involved in the regulation of resistance to radiation and/or cisplatin in humans. Direct functional assessment of selected candidate genes using RNA interference confirmed their activity in influencing cisplatin resistance, degree of γH2AX focus formation and ATR phosphorylation, in ovarian and head and neck cancer cell lines, suggesting impaired DDR signaling as the driving mechanism. This work enlarges the set of genes that may contribute to chemotherapy resistance and provides a new contextual resource for interpreting next generation sequencing (NGS) genomic profiling of tumors. PMID:27863405

  19. A Caenorhabditis elegans protein with a PRDM9-like SET domain localizes to chromatin-associated foci and promotes spermatocyte gene expression, sperm production and fertility.

    PubMed

    Engert, Christoph G; Droste, Rita; van Oudenaarden, Alexander; Horvitz, H Robert

    2018-04-01

    To better understand the tissue-specific regulation of chromatin state in cell-fate determination and animal development, we defined the tissue-specific expression of all 36 C. elegans presumptive lysine methyltransferase (KMT) genes using single-molecule fluorescence in situ hybridization (smFISH). Most KMTs were expressed in only one or two tissues. The germline was the tissue with the broadest KMT expression. We found that the germline-expressed C. elegans protein SET-17, which has a SET domain similar to that of the PRDM9 and PRDM7 SET-domain proteins, promotes fertility by regulating gene expression in primary spermatocytes. SET-17 drives the transcription of spermatocyte-specific genes from four genomic clusters to promote spermatid development. SET-17 is concentrated in stable chromatin-associated nuclear foci at actively transcribed msp (major sperm protein) gene clusters, which we term msp locus bodies. Our results reveal the function of a PRDM9/7-family SET-domain protein in spermatocyte transcription. We propose that the spatial intranuclear organization of chromatin factors might be a conserved mechanism in tissue-specific control of transcription.

  20. Database resources of the National Center for Biotechnology Information

    PubMed Central

    2015-01-01

    The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank® nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (Bookshelf, PubMed Central (PMC) and PubReader); medical genetics (ClinVar, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen); genes and genomics (BioProject, BioSample, dbSNP, dbVar, Epigenomics, Gene, Gene Expression Omnibus (GEO), Genome, HomoloGene, the Map Viewer, Nucleotide, PopSet, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser, Trace Archive and UniGene); and proteins and chemicals (Biosystems, COBALT, the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB), Protein Clusters, Protein and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for many of these databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov. PMID:25398906

  1. Database resources of the National Center for Biotechnology Information

    PubMed Central

    2016-01-01

    The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank® nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (PubMed Central (PMC), Bookshelf and PubReader), health (ClinVar, dbGaP, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen), genomes (BioProject, Assembly, Genome, BioSample, dbSNP, dbVar, Epigenomics, the Map Viewer, Nucleotide, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser and the Trace Archive), genes (Gene, Gene Expression Omnibus (GEO), HomoloGene, PopSet and UniGene), proteins (Protein, the Conserved Domain Database (CDD), COBALT, Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB) and Protein Clusters) and chemicals (Biosystems and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for most of these databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. PMID:26615191

  2. Genetic structure and demographic history should inform conservation: Chinese cobras currently treated as homogenous show population divergence.

    PubMed

    Lin, Long-Hui; Qu, Yan-Fu; Li, Hong; Zhou, Kai-Ya; Ji, Xiang

    2012-01-01

    An understanding of population structure and genetic diversity is crucial for wildlife conservation and for determining the integrity of wildlife populations. The vulnerable Chinese cobra (Naja atra) has a distribution from the mouth of the Yangtze River down to northern Vietnam and Laos, within which several large mountain ranges and water bodies may influence population structure. We combined 12 microsatellite loci and 1117 bp of the mitochondrial cytochrome b gene to explore genetic structure and demographic history in this species, using 269 individuals from various localities in Mainland China and Vietnam. High levels of genetic variation were identified for both mtDNA and microsatellites. mtDNA data revealed two main (Vietnam + southern China + southwestern China; eastern + southeastern China) and one minor (comprising only two individuals from the westernmost site) clades. Microsatellite data divided the eastern + southeastern China clade further into two genetic clusters, which include individuals from the eastern and southeastern regions, respectively. The Luoxiao and Nanling Mountains may be important barriers affecting the diversification of lineages. In the haplotype network of cytochrome b, many haplotypes were represented within a "star" cluster, and this and other tests suggest recent expansion. However, microsatellite analyses did not yield strong evidence for a recent bottleneck for any population or genetic cluster. The three main clusters identified here should be considered as independent management units for conservation purposes. The release of Chinese cobras into the wild should cease unless their origin can be determined, and this will avoid problems arising from unnatural homogenization.

  3. The group B streptococcal sialic acid O-acetyltransferase is encoded by neuD, a conserved component of bacterial sialic acid biosynthetic gene clusters.

    PubMed

    Lewis, Amanda L; Hensler, Mary E; Varki, Ajit; Nizet, Victor

    2006-04-21

    Nearly two dozen microbial pathogens have surface polysaccharides or lipo-oligosaccharides that contain sialic acid (Sia), and several Sia-dependent virulence mechanisms are known to enhance bacterial survival or result in host tissue injury. Some pathogens are also known to O-acetylate their Sias, although the role of this modification in pathogenesis remains unclear. We report that neuD, a gene located within the Group B Streptococcus (GBS) Sia biosynthetic gene cluster, encodes a Sia O-acetyltransferase that is itself required for capsular polysaccharide (CPS) sialylation. Homology modeling and site-directed mutagenesis identified Lys-123 as a critical residue for Sia O-acetyltransferase activity. Moreover, a single nucleotide polymorphism in neuD can determine whether GBS displays a "high" or "low" Sia O-acetylation phenotype. Complementation analysis revealed that Escherichia coli K1 NeuD also functions as a Sia O-acetyltransferase in GBS. In fact, NeuD homologs are commonly found within Sia biosynthetic gene clusters. A bioinformatic approach identified 18 bacterial species with a Sia biosynthetic gene cluster that included neuD. Included in this list are the sialylated human pathogens Legionella pneumophila, Vibrio parahaemolyticus, Pseudomonas aeruginosa, and Campylobacter jejuni, as well as an additional 12 bacterial species never before analyzed for Sia expression. Phylogenetic analysis shows that NeuD homologs of sialylated pathogens share a common evolutionary lineage distinct from the poly-Sia O-acetyltransferase of E. coli K1. These studies define a molecular genetic approach for the selective elimination of GBS Sia O-acetylation without concurrent loss of sialylation, a key to further studies addressing the role(s) of this modification in bacterial virulence.

  4. Biochemical and genetic characterization of the vanC-2 vancomycin resistance gene cluster of Enterococcus casseliflavus ATCC 25788.

    PubMed

    Dutta, Ireena; Reynolds, Peter E

    2002-10-01

    The vanC-2 cluster of Enterococcus casseliflavus ATCC 25788 consisted of five genes (vanC-2, vanXY(C-2), vanT(C-2), vanR(C-2), and vanS(C-2)) and shared the same organization as the vanC cluster of E. gallinarum BM4174. The proteins encoded by these genes displayed a high degree of amino acid identity to the proteins encoded within the vanC gene cluster. The putative D,D-dipeptidase-D,D-carboxypeptidase, VanXY(C-2), exhibited 81% amino acid identity to VanXY(C), and VanT(C-2) displayed 65% amino acid identity to the serine racemase, VanT. VanR(C-2) and VanS(C-2) displayed high degrees of identity to VanR(C) and VanS(C), respectively, and contained the conserved residues identified as important to their function as a response regulator and histidine kinase, respectively. Resistance to vancomycin was expressed inducibly in E. casseliflavus ATCC 25788 and required an extended period of induction. Analysis of peptidoglycan precursors revealed that UDP-N-acetylmuramyl-L-Ala-delta-D-Glu-L-Lys-D-Ala-D-Ser could not be detected until several hours after the addition of vancomycin, and its appearance coincided with the resumption of growth. The introduction of additional copies of the vanT(C-2) gene, encoding a putative serine racemase, and the presence of supplementary D-serine in the growth medium both significantly reduced the period before growth resumed after addition of vancomycin. This suggested that the availability of D-serine plays an important role in the induction process.

  5. Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice.

    PubMed

    Smita, Shuchi; Katiyar, Amit; Chinnusamy, Viswanathan; Pandey, Dev M; Bansal, Kailash C

    2015-01-01

    MYB transcription factor (TF) is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by "top-down" and "guide-gene" approaches. More than 50% of OsMYBs were strongly correlated under 50 experimental conditions with 51 hub genes via "top-down" approach. Further, clusters were identified using Markov Clustering (MCL). To maximize the clustering performance, parameter evaluation of the MCL inflation score (I) was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by "guide-gene" approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. 
In particular, the enrichment of MYB-binding regions involved in drought inducibility implies a regulatory role in the drought response of rice. Thus, the co-regulatory network analysis facilitated the identification of complex OsMYB regulatory networks and of candidate target regulon genes of selected guide MYB genes. The results contribute to candidate gene screening and provide experimentally testable hypotheses for potential regulatory MYB TFs and their targets under stress conditions.
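
    The abstract names Markov Clustering (MCL) with a tunable inflation parameter but gives no implementation details. The following is a minimal sketch of the general technique, not the authors' pipeline: the function names, the correlation threshold of 0.8, and the default inflation of 2.0 are illustrative assumptions.

```python
import numpy as np


def coexpression_adjacency(expr, threshold=0.8):
    """Build an unweighted co-expression graph from a genes x conditions matrix.

    Two genes are linked when |Pearson r| >= threshold (threshold is an
    illustrative assumption, not a value taken from the abstract).
    """
    corr = np.corrcoef(np.asarray(expr, dtype=float))
    adj = (np.abs(corr) >= threshold).astype(float)
    np.fill_diagonal(adj, 0.0)
    return adj


def mcl(adjacency, inflation=2.0, max_iter=100, tol=1e-6):
    """Minimal Markov Clustering: alternate expansion and inflation."""
    m = np.asarray(adjacency, dtype=float).copy()
    np.fill_diagonal(m, 1.0)               # add self-loops
    m /= m.sum(axis=0, keepdims=True)      # make columns sum to 1
    for _ in range(max_iter):
        prev = m
        m = m @ m                          # expansion: two-step random walk
        m = np.power(m, inflation)         # inflation: sharpen strong flows
        m /= m.sum(axis=0, keepdims=True)
        if np.abs(m - prev).max() < tol:
            break
    # Rows that retain mass are attractors; their non-zero columns are members.
    clusters = set()
    for row in m:
        members = frozenset(np.flatnonzero(row > 1e-6).tolist())
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)
```

    A larger inflation value tends to split the graph into more, smaller clusters, which is why the parameter is usually tuned against an external criterion such as the GO-enrichment F-score mentioned above.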

  6. Short and long-term genome stability analysis of prokaryotic genomes.

    PubMed

    Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France

    2013-05-08

    Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier, and often enables the characterization of pathogens. There is therefore strong interest in understanding the variability of this trait and its possible correlations with life-style. Two kinds of events affect genome organization: on the one hand, translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but alter gene neighborhoods by breaking syntenic regions. A complete picture of genome organization evolution therefore requires accounting for both kinds of events. We developed an approach in which we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In the first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of the same species allowed us to identify genomes whose gene order diverges from that of their conspecifics. The inter-species analysis indicates that pathogens are more often unstable than non-pathogens. In the second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics. 
By studying species with multiple sequenced genomes available, we were able to explore genome organization stability at different time-scales and to find significant differences between pathogen and non-pathogen species. The output of our framework also allows the identification of conserved gene clusters and/or partial occurrences thereof, making it possible to explore how gene clusters assembled during evolution.
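
    The distinction between backbone rearrangements and insertions/deletions can be illustrated with a simple adjacency-based measure. This is a sketch under stated assumptions, not the estimator used in the paper: genes are assumed unique within each genome (no paralogs), chromosomes are treated as circular by default, and all function names are hypothetical.

```python
def backbone(order, shared):
    """Keep only genes shared by both genomes, preserving their order."""
    return [g for g in order if g in shared]


def adjacencies(order, circular=True):
    """Return the set of unordered neighbouring gene pairs."""
    pairs = set()
    n = len(order)
    if n < 2:
        return pairs
    last = n if circular else n - 1
    for i in range(last):
        a, b = order[i], order[(i + 1) % n]
        if a != b:
            pairs.add(frozenset((a, b)))
    return pairs


def backbone_adjacency_conservation(genome_a, genome_b, circular=True):
    """Fraction of backbone adjacencies of genome_a also present in genome_b.

    Insertions and deletions of non-shared genes do not change this value,
    because they are removed before adjacencies are computed; only
    rearrangements of shared genes lower it.
    """
    shared = set(genome_a) & set(genome_b)
    adj_a = adjacencies(backbone(genome_a, shared), circular)
    adj_b = adjacencies(backbone(genome_b, shared), circular)
    if not adj_a:
        return 0.0
    return len(adj_a & adj_b) / len(adj_a)
```

    For example, comparing the gene order A, B, C, D, E with A, B, X, C, E, D ignores the inserted gene X and counts three of five circular backbone adjacencies as conserved.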

  7. Cluster of Genes That Encode Positive and Negative Elements Influencing Filament Length in a Heterocyst-Forming Cyanobacterium

    PubMed Central

    Merino-Puerto, Victoria; Herrero, Antonia

    2013-01-01

    The filamentous, heterocyst-forming cyanobacteria perform oxygenic photosynthesis in vegetative cells and nitrogen fixation in heterocysts, and their filaments can be hundreds of cells long. In the model heterocyst-forming cyanobacterium Anabaena sp. strain PCC 7120, the genes in the fraC-fraD-fraE operon are required for filament integrity mainly under conditions of nitrogen deprivation. The fraC operon transcript partially overlaps gene all2395, which lies in the opposite DNA strand and ends 1 bp beyond fraE. Gene all2395 produces transcripts of 1.35 kb (major transcript) and 2.2 kb (minor transcript) that overlap fraE and whose expression is dependent on the N-control transcription factor NtcA. Insertion of a gene cassette containing transcriptional terminators between fraE and all2395 prevented production of the antisense RNAs and resulted in an increased length of the cyanobacterial filaments. Deletion of all2395 resulted in a larger increase of filament length and in impaired growth, mainly under N2-fixing conditions and specifically on solid medium. We denote all2395 the fraF gene, which encodes a protein restricting filament length. A FraF-green fluorescent protein (GFP) fusion protein accumulated significantly in heterocysts. Similar to some heterocyst differentiation-related proteins such as HglK, HetL, and PatL, FraF is a pentapeptide repeat protein. We conclude that the fraC-fraD-fraE←fraF gene cluster (where the arrow indicates a change in orientation), in which cis antisense RNAs are produced, regulates morphology by encoding proteins that influence positively (FraC, FraD, FraE) or negatively (FraF) the length of the filament mainly under conditions of nitrogen deprivation. This gene cluster is often conserved in heterocyst-forming cyanobacteria. PMID:23813733

  8. De novo characterization of microRNAs in oriental fruit moth Grapholita molesta and selection of reference genes for normalization of microRNA expression

    PubMed Central

    Zhang, Jing; Zhang, Qingwen; Liu, Xiaoxia; Li, Zhen

    2017-01-01

    MicroRNAs (miRNAs) are a group of endogenous non-coding small RNAs that have critical regulatory functions in almost all known biological processes at the post-transcriptional level in a variety of organisms. The oriental fruit moth Grapholita molesta is one of the most serious pests in orchards worldwide and threatens the production of Rosaceae fruits. In this study, a de novo small RNA library constructed from mixed stages of G. molesta was sequenced on the Illumina sequencing platform, and a total of 536 mature miRNAs consisting of 291 conserved and 245 novel miRNAs were identified. Most of the conserved and novel miRNAs were detected with moderate abundance. The miRNAs in the same cluster normally showed correlated expression profiles. A comparative analysis of the 79 conserved miRNA families within 31 arthropod species indicated that these miRNA families were more conserved among insects and within orders of closer phylogenetic relationships. The KEGG pathway analysis and network prediction of target genes indicated that the complex composed of miRNAs, clock genes and developmental regulation genes may play vital roles in regulating the developmental circadian rhythm of G. molesta. Furthermore, based on the sRNA library of G. molesta, suitable reference genes were selected and validated for the study of miRNA transcriptional profiles in G. molesta under two biotic and six abiotic experimental conditions. This study systematically documented the miRNA profile in G. molesta, which could lay a foundation for further understanding of the regulatory roles of miRNAs in the development and metabolism of this pest and might also suggest clues for the development of genetic-based techniques for agricultural pest control. PMID:28158242

  9. Pithovirus sibericum, a new bona fide member of the "Fourth TRUC" club.

    PubMed

    Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Pontarotti, Pierre; Raoult, Didier

    2015-01-01

    Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, include giant viruses of Acanthamoeba that were discovered over the last 12 years and are bona fide microbes. Phylogenies based on a few genes conserved amongst these megaviruses and shared by microbes classified as Eukarya, Bacteria, and Archaea, allowed for delineation of a fourth monophylogenetic group or "TRUC" (Things Resisting Uncompleted Classification) composed of the Megavirales representatives. A new Megavirales member named Pithovirus sibericum was isolated from a >30,000-year-old dated Siberian permafrost sample. This virion is as large as recently described pandoraviruses but has a genome that is approximately three to four times shorter. Our objective was to update the classification of P. sibericum as a new member of the "Fourth TRUC" club. Phylogenetic trees were constructed based on four conserved ancient genes and a phyletic analysis was concurrently conducted based on the presence/absence patterns of a set of informational genes from members of Megavirales, Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on the four conserved genes revealed that P. sibericum is part of the fourth TRUC composed of Megavirales members, and is closely related to the families Marseilleviridae and Ascoviridae/Iridoviridae. Additionally, hierarchical clustering delineated four branches, and showed that P. sibericum is part of this fourth TRUC. Overall, phylogenetic and phyletic analyses using informational genes clearly indicate that P. sibericum is a new bona fide member of the "Fourth TRUC" club composed of representatives of Megavirales, alongside Bacteria, Archaea, and Eukarya.

  10. Comparative Analysis of Wolbachia Genomes Reveals Streamlining and Divergence of Minimalist Two-Component Systems

    PubMed Central

    Christensen, Steen; Serbus, Laura Renee

    2015-01-01

    Two-component regulatory systems are commonly used by bacteria to coordinate intracellular responses with environmental cues. These systems are composed of functional protein pairs consisting of a sensor histidine kinase and cognate response regulator. In contrast to the well-studied Caulobacter crescentus system, which carries dozens of these pairs, the streamlined bacterial endosymbiont Wolbachia pipientis encodes only two pairs: CckA/CtrA and PleC/PleD. Here, we used bioinformatic tools to compare characterized two-component system relays from C. crescentus, the related Anaplasmataceae species Anaplasma phagocytophilum and Ehrlichia chaffeensis, and 12 sequenced Wolbachia strains. We found the core protein pairs and a subset of interacting partners to be highly conserved within Wolbachia and these other Anaplasmataceae. Genes involved in two-component signaling were positioned differently within the various Wolbachia genomes, whereas the local context of each gene was conserved. Unlike Anaplasma and Ehrlichia, Wolbachia two-component genes were more consistently found clustered with metabolic genes. The domain architecture and key functional residues standard for two-component system proteins were well-conserved in Wolbachia, although residues that specify cognate pairing diverged substantially from other Anaplasmataceae. These findings indicate that Wolbachia two-component signaling pairs share considerable functional overlap with other α-proteobacterial systems, whereas their divergence suggests the potential for regulatory differences and cross-talk. PMID:25809075

  11. The DUF59 Family Gene AE7 Acts in the Cytosolic Iron-Sulfur Cluster Assembly Pathway to Maintain Nuclear Genome Integrity in Arabidopsis

    PubMed Central

    Luo, Dexian; Bernard, Delphine G.; Balk, Janneke; Hai, Huang; Cui, Xiaofeng

    2012-01-01

    Eukaryotic organisms have evolved a set of strategies to safeguard genome integrity, but the underlying mechanisms remain poorly understood. Here, we report that ASYMMETRIC LEAVES1/2 ENHANCER7 (AE7), an Arabidopsis thaliana gene encoding a protein in the evolutionarily conserved Domain of Unknown Function 59 family, participates in the cytosolic iron-sulfur (Fe-S) cluster assembly (CIA) pathway to maintain genome integrity. The severe ae7-2 allele is embryo lethal, whereas plants with the weak ae7 (ae7-1) allele are viable but exhibit highly accumulated DNA damage that activates the DNA damage response to arrest the cell cycle. AE7 is part of a protein complex with CIA1, NAR1, and MET18, which are highly conserved in eukaryotes and are involved in the biogenesis of cytosolic and nuclear Fe-S proteins. ae7-1 plants have lower activities of the cytosolic [4Fe-4S] enzyme aconitase and the nuclear [4Fe-4S] enzyme DNA glycosylase ROS1. Additionally, mutations in the gene encoding the mitochondrial ATP binding cassette transporter ATM3/ABCB25, which is required for the activity of cytosolic Fe-S enzymes in Arabidopsis, also result in defective genome integrity similar to that of ae7-1. These results indicate that AE7 is a central member of the CIA pathway, linking plant mitochondria to nuclear genome integrity through assembly of Fe-S proteins. PMID:23104832

  12. Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage.

    PubMed

    Brok-Volchanskaya, Vera S; Kadyrov, Farid A; Sivogrivov, Dmitry E; Kolosov, Peter M; Sokolov, Andrey S; Shlyapnikov, Michael G; Kryukov, Valentine M; Granovsky, Igor E

    2008-04-01

    Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. The bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function that is homologous to homing endonucleases of the GIY-YIG family. We demonstrate that the SegB protein is a site-specific endonuclease, which produces mostly 3' 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains an 11-bp conserved sequence, which corresponds to a conserved motif of the tRNA TpsiC stem-loop, whereas the remainder of the recognition site is rather degenerate. The T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack the segB gene and several tRNA genes. In co-infections of phages T4 and T2L, the segB gene is inherited with nearly 100% efficiency. The preferred inheritance depends absolutely on the integrity of the segB gene and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages.

  13. A conserved long noncoding RNA affects sleep behavior in Drosophila.

    PubMed

    Soshnev, Alexey A; Ishimoto, Hiroshi; McAllister, Bryant F; Li, Xingguo; Wehling, Misty D; Kitamoto, Toshihiro; Geyer, Pamela K

    2011-10-01

    Metazoan genomes encode an abundant collection of mRNA-like, long noncoding (lnc)RNAs. Although lncRNAs greatly expand the transcriptional repertoire, we have a limited understanding of how these RNAs contribute to developmental regulation. Here, we investigate the function of the Drosophila lncRNA called yellow-achaete intergenic RNA (yar). Comparative sequence analyses show that the yar gene is conserved in Drosophila species representing 40-60 million years of evolution, with one of the conserved sequence motifs encompassing the yar promoter. Further, the timing of yar expression in Drosophila virilis parallels that in D. melanogaster, suggesting that transcriptional regulation of yar is conserved. The function of yar was defined by generating null alleles. Flies lacking yar RNAs are viable and show no overt morphological defects, consistent with maintained transcriptional regulation of the adjacent yellow (y) and achaete (ac) genes. The location of yar within a neural gene cluster led to the investigation of effects of yar in behavioral assays. These studies demonstrated that loss of yar alters sleep regulation in the context of a normal circadian rhythm. Nighttime sleep was reduced and fragmented, with yar mutants displaying diminished sleep rebound following sleep deprivation. Importantly, these defects were rescued by a yar transgene. These data provide the first example of a lncRNA gene involved in Drosophila sleep regulation. We find that yar is a cytoplasmic lncRNA, suggesting that yar may regulate sleep by affecting stabilization or translational regulation of mRNAs. Such functions of lncRNAs may extend to vertebrates, as lncRNAs are abundant in neural tissues.

  14. The Aquaporin Channel Repertoire of the Tardigrade Milnesium tardigradum

    PubMed Central

    Grohme, Markus A.; Mali, Brahim; Wełnicz, Weronika; Michel, Stephanie; Schill, Ralph O.; Frohme, Marcus

    2013-01-01

    Limno-terrestrial tardigrades are small invertebrates that are subjected to periodic drought of their micro-environment. They have evolved to cope with these unfavorable conditions by anhydrobiosis, an ametabolic state of low cellular water. During drying and rehydration, tardigrades go through drastic changes in cellular water content. Through transcriptome sequencing of the limno-terrestrial tardigrade Milnesium tardigradum and a combination of cloning and targeted sequence assembly, we identified transcripts encoding eleven putative aquaporins. Analysis of these sequences suggested 2 classical aquaporins, 8 aquaglyceroporins and a single potentially intracellular unorthodox aquaporin. Using quantitative real-time PCR we analyzed aquaporin transcript expression in the anhydrobiotic context. We have identified additional unorthodox aquaporins in various insect genomes and have identified a novel common conserved structural feature in these proteins. Analysis of the genomic organization of insect aquaporin genes revealed several conserved gene clusters. PMID:23761966

  15. Global Profiling of Rice and Poplar Transcriptomes Highlights Key Conserved Circadian-Controlled Pathways and cis-Regulatory Modules

    PubMed Central

    Filichkin, Sergei A.; Breton, Ghislain; Priest, Henry D.; Dharmawardhana, Palitha; Jaiswal, Pankaj; Fox, Samuel E.; Michael, Todd P.; Chory, Joanne; Kay, Steve A.; Mockler, Todd C.

    2011-01-01

    Background Circadian clocks provide an adaptive advantage through anticipation of daily and seasonal environmental changes. In plants, the central clock oscillator is regulated by several interlocking feedback loops. It was shown that a substantial proportion of the Arabidopsis genome cycles with phases of peak expression covering the entire day. Synchronized transcriptome cycling is driven through an extensive network of diurnal and clock-regulated transcription factors and their target cis-regulatory elements. Study of the cycling transcriptome in other plant species could thus help elucidate the similarities and differences and identify hubs of regulation common to monocot and dicot plants. Methodology/Principal Findings Using a combination of oligonucleotide microarrays and data mining pipelines, we examined daily rhythms in gene expression in one monocotyledonous and one dicotyledonous plant, rice and poplar, respectively. Cycling transcriptomes were interrogated under different diurnal (driven) and circadian (free running) light and temperature conditions. Collectively, photocycles and thermocycles regulated about 60% of the expressed nuclear genes in rice and poplar. Depending on the condition tested, up to one third of oscillating Arabidopsis-poplar-rice orthologs were phased within three hours of each other suggesting a high degree of conservation in terms of rhythmic gene expression. We identified clusters of rhythmically co-expressed genes and searched their promoter sequences to identify phase-specific cis-elements, including elements that were conserved in the promoters of Arabidopsis, poplar, and rice. Conclusions/Significance Our results show that the cycling patterns of many circadian clock genes are highly conserved across poplar, rice, and Arabidopsis. The expression of many orthologous genes in key metabolic and regulatory pathways is diurnal and/or circadian regulated and phased to similar times of day. 
Our results confirm previous findings in Arabidopsis of three major classes of cis-regulatory modules within the plant circadian network: the morning (ME, GBOX), evening (EE, GATA), and midnight (PBX/TBX/SBX) modules. Identification of identical overrepresented motifs in the promoters of cycling genes from different species suggests that the core diurnal/circadian cis-regulatory network is deeply conserved between mono- and dicotyledonous species. PMID:21694767
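
    The statement that orthologs were "phased within three hours of each other" implies a comparison on a circular 24-hour scale, where a peak at hour 23 and a peak at hour 1 are two hours apart. The following sketch shows one way to compute that; the function names and the dictionary-based input format are assumptions for illustration, not the study's pipeline.

```python
def phase_difference(phase_a, phase_b, period=24.0):
    """Shortest distance between two peak phases on a circular clock."""
    d = abs(phase_a - phase_b) % period
    return min(d, period - d)


def fraction_in_phase(phases_a, phases_b, window=3.0, period=24.0):
    """Fraction of shared orthologs whose peak phases differ by <= window hours.

    phases_a and phases_b map an ortholog identifier to its peak phase (hours)
    in each species; only identifiers present in both are compared.
    """
    shared = phases_a.keys() & phases_b.keys()
    if not shared:
        return 0.0
    hits = sum(
        phase_difference(phases_a[g], phases_b[g], period) <= window
        for g in shared
    )
    return hits / len(shared)
```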

  16. Genome-wide identification of chitinase and chitin deacetylase gene families in the oriental fruit fly, Bactrocera dorsalis (Hendel).

    PubMed

    Liu, Shi-Huo; Li, Hong-Fei; Yang, Yang; Yang, Rui-Lin; Yang, Wen-Jia; Jiang, Hong-Bo; Dou, Wei; Smagghe, Guy; Wang, Jin-Jun

    2018-05-01

    Chitinases (Chts) and chitin deacetylases (CDAs) are important enzymes required for chitin metabolism in insects. In this study, 12 Cht-related genes (including seven Cht genes and five imaginal disc growth factor genes) and 6 CDA genes (encoding seven proteins) were identified in Bactrocera dorsalis using genome-wide searching and transcript profiling. Based on the conserved sequences and phylogenetic relationships, the 12 Cht-related proteins were clustered into eight groups (groups I-V and VII-IX). Further domain architecture analysis showed that all contained at least one chitinase catalytic domain; however, only four (BdCht5, BdCht7, BdCht8 and BdCht10) possessed chitin-binding domains. The subsequent phylogenetic analysis revealed that the seven CDAs were clustered into five groups (groups I-V), and all had one chitin deacetylase catalytic domain. However, only six exhibited chitin-binding domains. Finally, the development- and tissue-specific expression profiling showed that transcript levels of the 12 Cht-related genes and 6 CDA genes varied considerably among eggs, larvae, pupae and adults, as well as among different tissues of larvae and adults. Our findings illustrate the structural differences and expression patterns of Cht and CDA genes in B. dorsalis, and provide important information for the development of new pest control strategies based on these vital enzymes. Copyright © 2018. Published by Elsevier Inc.

  17. The RNA polymerase III-dependent family of genes in hemiascomycetes: comparative RNomics, decoding strategies, transcription and evolutionary implications

    PubMed Central

    Marck, Christian; Kachouri-Lafond, Rym; Lafontaine, Ingrid; Westhof, Eric; Dujon, Bernard; Grosjean, Henri

    2006-01-01

    We present the first comprehensive analysis of RNA polymerase III (Pol III) transcribed genes in ten yeast genomes. This set includes all tRNA genes (tDNA) and genes coding for SNR6 (U6), SNR52, SCR1 and RPR1 RNA in the nine hemiascomycetes Saccharomyces cerevisiae, Saccharomyces castellii, Candida glabrata, Kluyveromyces waltii, Kluyveromyces lactis, Eremothecium gossypii, Debaryomyces hansenii, Candida albicans, Yarrowia lipolytica and the archiascomycete Schizosaccharomyces pombe. We systematically analysed sequence specificities of tRNA genes, polymorphism, variability of introns, gene redundancy and gene clustering. Analysis of decoding strategies showed that yeasts close to S. cerevisiae use bacterial decoding rules to read the Leu CUN and Arg CGN codons, in contrast to all other known Eukaryotes. In D. hansenii and C. albicans, we identified a novel tDNA-Leu (AAG), reading the Leu CUU/CUC/CUA codons with an unusual G at position 32. A systematic ‘p-distance tree’ using the 60 variable positions of the tRNA molecule revealed that most tDNAs cluster into amino acid-specific sub-trees, suggesting that, within hemiascomycetes, orthologous tDNAs are more closely related than paralogs. We finally determined the bipartite A- and B-box sequences recognized by TFIIIC. These minimal sequences are nearly conserved throughout hemiascomycetes and were satisfactorily retrieved at appropriate locations in other Pol III genes. PMID:16600899
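
    The p-distance underlying a "p-distance tree" is the proportion of aligned positions at which two sequences differ. A minimal sketch follows, assuming pre-aligned sequences of equal length and assuming that positions with a gap in either sequence are skipped; the function name and the gap convention are illustrative choices, and the abstract does not say how the 60 variable positions were selected or how gaps were treated.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of differing sites between two aligned sequences.

    Positions where either sequence has a gap are excluded from the
    comparison (an assumption made for this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return differing / compared
```

    Computing this value for every pair of tDNA sequences yields a distance matrix that a tree-building method such as neighbor-joining can take as input.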

  18. Methionine sulphoxide reductases protect iron-sulphur clusters from oxidative inactivation in yeast

    PubMed Central

    Sideri, Theodora C.; Willetts, Sylvia A.; Avery, Simon V.

    2008-01-01

    Methionine residues and iron-sulphur (FeS) clusters are primary targets of reactive oxygen species in the proteins of microorganisms. Here we show that methionine redox-modifications help to preserve essential FeS cluster activities in yeast. Mutants defective for the highly conserved methionine sulphoxide reductases (MSRs; which re-reduce oxidized methionines) are sensitive to many pro-oxidants, but here exhibited an unexpected copper resistance. This phenotype was mimicked by methionine sulphoxide supplementation. Microarray analyses highlighted several Cu and Fe homeostasis genes that were upregulated in the mxrΔ double mutant, which lacks both of the yeast MSRs. Of the upregulated genes, the Cu-binding Fe-transporter Fet3p proved to be required for the Cu-resistance phenotype. FET3 is known to be regulated by the Aft1 transcription factor, which responds to low mitochondrial FeS-cluster status. Here, constitutive Aft1p expression in the wild type reproduced the Cu-resistance phenotype, and FeS cluster functions were found to be defective in the mxrΔ mutant. Genetic perturbation of FeS activity also mimicked FET3-dependent Cu resistance. 55Fe-labeling studies showed that FeS clusters are turned over more rapidly in the mxrΔ mutant than the wild type, consistent with elevated oxidative targeting of the clusters in MSR-deficient cells. The potential underlying molecular mechanisms of this targeting are discussed. Moreover, the results indicate an important new role for cellular MSR enzymes, in helping to protect the essential function of FeS clusters in aerobic settings. PMID:19202110

  19. Phylogenetic Network Analysis Revealed the Occurrence of Horizontal Gene Transfer of 16S rRNA in the Genus Enterobacter

    PubMed Central

    Sato, Mitsuharu; Miyazaki, Kentaro

    2017-01-01

    Horizontal gene transfer (HGT) is a ubiquitous genetic event in bacterial evolution, but it seldom occurs for genes involved in highly complex supramolecules (or biosystems), which consist of many gene products. The ribosome is one such supramolecule, but several bacteria harbor dissimilar and/or chimeric 16S rRNAs in their genomes, suggesting the occurrence of HGT of this gene. However, we know little about whether the genes actually experience HGT and, if so, the frequency of such a transfer. This is primarily because the methods currently employed for phylogenetic analysis (e.g., neighbor-joining, maximum likelihood, and maximum parsimony) of 16S rRNA genes assume point mutation-driven tree-shape evolution as an evolutionary model, which is intrinsically inappropriate to decipher the evolutionary history for genes driven by recombination. To address this issue, we applied a phylogenetic network analysis, which has been used previously for detection of genetic recombination in homologous alleles, to the 16S rRNA gene. We focused on the genus Enterobacter, whose phylogenetic relationships inferred by multi-locus sequence alignment analysis and 16S rRNA sequences are incompatible. All 10 complete genomic sequences were retrieved from the NCBI database, in which 71 16S rRNA genes were included. Neighbor-joining analysis demonstrated that the genes residing in the same genomes clustered, indicating the occurrence of intragenomic recombination. However, as suggested by the low bootstrap values, evolutionary relationships between the clusters were uncertain. We then applied phylogenetic network analysis to representative sequences from each cluster. We found three ancestral 16S rRNA groups; the others were likely created through recursive recombination between the ancestors and chimeric descendants. Despite the large sequence changes caused by the recombination events, the RNA secondary structures were conserved. 
Successive intergenomic and intragenomic recombination thus shaped the evolution of 16S rRNA genes in the genus Enterobacter. PMID:29180992

  20. Molecular characterization of the equine testis-specific protein 1 (TPX1) and acidic epididymal glycoprotein 2 (AEG2) genes encoding members of the cysteine-rich secretory protein (CRISP) family.

    PubMed

    Giese, Alexander; Jude, Rony; Kuiper, Heidi; Raudsepp, Terje; Piumi, Francois; Schambony, Alexandra; Guérin, Gérard; Chowdhary, Bhanu P; Distl, Ottmar; Töpfer-Petersen, Edda; Leeb, Tosso

    2002-10-16

    The cysteine-rich secretory protein (CRISP) family consists of three members called acidic epididymal glycoprotein 1 (AEG1), AEG2, and testis-specific protein 1 (TPX1), which share 16 conserved cysteine residues at their C-termini. The CRISP proteins are primarily expressed in different sections of the male genital tract and are thought to mediate cell-cell interactions of male germ cells with other cells during sperm maturation or during fertilization. Therefore, their genes are of interest as candidate genes for inherited male fertility dysfunctions and as putative quantitative trait loci for male fertility traits. In this report, the cloning and DNA sequence of 137 kb of horse genomic DNA from equine chromosome 20q22 containing the closely linked equine TPX1 and AEG2 genes are described. The equine TPX1 gene consists of ten exons spanning 18 kb while the AEG2 gene consists of eight exons that are spread over 24 kb. The expression of these two genes was investigated in several tissues by reverse transcription polymerase chain reaction analysis and Western blotting. Comparative genome analysis between horse, human, and mouse indicates that all three CRISP genes are clustered on one chromosomal location, which shows conserved synteny between these species.

  1. Unveiling the biotransformation mechanism of indole in a Cupriavidus sp. strain.

    PubMed

    Qu, Yuanyuan; Ma, Qiao; Liu, Ziyan; Wang, Weiwei; Tang, Hongzhi; Zhou, Jiti; Xu, Ping

    2017-12-01

    Indole, an important signaling molecule as well as a typical N-heterocyclic aromatic pollutant, is widespread in nature. However, the biotransformation mechanisms of indole are still poorly studied. Here, we sought to unlock the genetic determinants of indole biotransformation in Cupriavidus sp. strain SHE based on genomics, proteomics and functional studies. A total of 177 proteins were notably altered (118 up- and 59 downregulated) in cells grown in indole mineral salt medium compared with cells grown in sodium citrate medium. RT-qPCR and gene knockout assays demonstrated that an indole oxygenase gene cluster was responsible for the indole upstream metabolism. A functional indole oxygenase, termed IndA, was identified in the cluster, and its catalytic efficiency was higher than those of previously reported indole oxidation enzymes. Furthermore, the indole downstream metabolism was found to proceed via the atypical CoA-thioester pathway rather than the conventional gentisate and salicylate pathways. This unusual pathway was catalyzed by a conserved 2-aminobenzoyl-CoA gene cluster, among which the 2-aminobenzoyl-CoA ligase initiated anthranilate transformation. This study unveils the genetic determinants of indole biotransformation and will provide new insights into indole biodegradation in natural environments and support further functional studies. © 2017 John Wiley & Sons Ltd.

  2. Genomic organization and evolution of the Atlantic salmon hemoglobin repertoire

    PubMed Central

    2010-01-01

    Background: The genomes of salmonids are considered pseudo-tetraploid undergoing reversion to a stable diploid state. Given the genome duplication and extensive biological data available for salmonids, they are excellent model organisms for studying comparative genomics, evolutionary processes, fates of duplicated genes and the genetic and physiological processes associated with complex behavioral phenotypes. The evolution of the tetrapod hemoglobin genes is well studied; however, little is known about the genomic organization and evolution of teleost hemoglobin genes, particularly those of salmonids. The Atlantic salmon serves as a representative salmonid species for genomics studies. Given the well documented role of hemoglobin in adaptation to varied environmental conditions as well as its use as a model protein for evolutionary analyses, an understanding of the genomic structure and organization of the Atlantic salmon α and β hemoglobin genes is of great interest. Results: We identified four bacterial artificial chromosomes (BACs) comprising two hemoglobin gene clusters spanning the entire α and β hemoglobin gene repertoire of the Atlantic salmon genome. Their chromosomal locations were established using fluorescence in situ hybridization (FISH) analysis and linkage mapping, demonstrating that the two clusters are located on separate chromosomes. The BACs were sequenced and assembled into scaffolds, which were annotated for putatively functional and pseudogenized hemoglobin-like genes. This revealed that the tail-to-tail organization and alternating pattern of the α and β hemoglobin genes are well conserved in both clusters, as well as that the Atlantic salmon genome houses substantially more hemoglobin genes, including non-Bohr β globin genes, than the genomes of other teleosts that have been sequenced. Conclusions: We suggest that the most parsimonious evolutionary path leading to the present organization of the Atlantic salmon hemoglobin genes involves the loss of a single hemoglobin gene cluster after the whole genome duplication (WGD) at the base of the teleost radiation but prior to the salmonid-specific WGD, which then produced the duplicated copies seen today. We also propose that the relatively high number of hemoglobin genes as well as the presence of non-Bohr β hemoglobin genes may be due to the dynamic life history of salmon and the diverse environmental conditions that the species encounters. Data deposition: BACs S0155C07 and S0079J05 (fps135): GenBank GQ898924; BACs S0055H05 and S0014B03 (fps1046): GenBank GQ898925. PMID:20923558

  3. Archaeal Shikimate Kinase, a New Member of the GHMP-Kinase Family

    PubMed Central

    Daugherty, Matthew; Vonstein, Veronika; Overbeek, Ross; Osterman, Andrei

    2001-01-01

    Shikimate kinase (EC 2.7.1.71) is a committed enzyme in the seven-step biosynthesis of chorismate, a major precursor of aromatic amino acids and many other aromatic compounds. Genes for all enzymes of the chorismate pathway except shikimate kinase are found in archaeal genomes by sequence homology to their bacterial counterparts. In this study, a conserved archaeal gene (gi|1500322 in Methanococcus jannaschii) was identified as the best candidate for the missing shikimate kinase gene by the analysis of chromosomal clustering of chorismate biosynthetic genes. The encoded hypothetical protein, with no sequence similarity to bacterial and eukaryotic shikimate kinases, is distantly related to homoserine kinases (EC 2.7.1.39) of the GHMP-kinase superfamily. The latter functionality in M. jannaschii is assigned to another gene (gi|1591748), in agreement with sequence similarity and chromosomal clustering analysis. Both archaeal proteins, overexpressed in Escherichia coli and purified to homogeneity, displayed activity of the predicted type, with steady-state kinetic parameters similar to those of the corresponding bacterial kinases: Km,shikimate = 414 ± 33 μM, Km,ATP = 48 ± 4 μM, and kcat = 57 ± 2 s−1 for the predicted shikimate kinase and Km,homoserine = 188 ± 37 μM, Km,ATP = 101 ± 7 μM, and kcat = 28 ± 1 s−1 for the homoserine kinase. No overlapping activity could be detected between shikimate kinase and homoserine kinase, both revealing a >1,000-fold preference for their own specific substrates. The case of archaeal shikimate kinase illustrates the efficacy of techniques based on reconstruction of metabolism from genomic data and analysis of gene clustering on chromosomes in finding missing genes. PMID:11114929
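
    The steady-state parameters quoted above can be compared through the catalytic efficiency kcat/Km. The following is a minimal sketch of that arithmetic only; the dictionary layout and the function name are illustrative assumptions, and the Km values used are those for the specific substrate of each enzyme (shikimate or homoserine), not for ATP.

```python
# Kinetic constants as quoted in the abstract above.
# Km is given in micromolar (for shikimate or homoserine), kcat in 1/s.
# The data structure and names below are illustrative, not from the paper.
KINASES = {
    "shikimate kinase": {"km_um": 414.0, "kcat_per_s": 57.0},
    "homoserine kinase": {"km_um": 188.0, "kcat_per_s": 28.0},
}


def catalytic_efficiency(kcat_per_s, km_um):
    """Return kcat/Km in M^-1 s^-1, converting Km from micromolar to molar."""
    return kcat_per_s / (km_um * 1e-6)


efficiencies = {
    name: catalytic_efficiency(v["kcat_per_s"], v["km_um"])
    for name, v in KINASES.items()
}

for name, value in efficiencies.items():
    print(f"{name}: kcat/Km = {value:.2e} M^-1 s^-1")
```

    Both values come out on the order of 10^5 M^-1 s^-1, which is simply a restatement of the reported numbers and says nothing beyond them about substrate preference.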

  4. vanC Cluster of Vancomycin-Resistant Enterococcus gallinarum BM4174

    PubMed Central

    Arias, Cesar A.; Courvalin, Patrice; Reynolds, Peter E.

    2000-01-01

    Glycopeptide-resistant enterococci of the VanC type synthesize UDP-muramyl-pentapeptide[d-Ser] for cell wall assembly and prevent synthesis of peptidoglycan precursors ending in d-Ala. The vanC cluster of Enterococcus gallinarum BM4174 consists of five genes: vanC-1, vanXYC, vanT, vanRC, and vanSC. Three genes are sufficient for resistance: vanC-1 encodes a ligase that synthesizes the dipeptide d-Ala-d-Ser for addition to UDP-MurNAc-tripeptide, vanXYC encodes a d,d-dipeptidase–carboxypeptidase that hydrolyzes d-Ala-d-Ala and removes d-Ala from UDP-MurNAc-pentapeptide[d-Ala], and vanT encodes a membrane-bound serine racemase that provides d-Ser for the synthetic pathway. The three genes are clustered: the start codons of vanXYC and vanT overlap the termination codons of vanC-1 and vanXYC, respectively. Two genes which encode proteins with homology to the VanS-VanR two-component regulatory system were present downstream from the resistance genes. The predicted amino acid sequence of VanRC exhibited 50% identity to VanR and 33% identity to VanRB. VanSC had 40% identity to VanS over a region of 308 amino acids and 24% identity to VanSB over a region of 285 amino acids. All residues with important functions in response regulators and histidine kinases were conserved in VanRC and VanSC, respectively. Induction experiments based on the determination of d,d-carboxypeptidase activity in cytoplasmic extracts confirmed that the genes were expressed constitutively. Using a promoter-probing vector, regions upstream from the resistance and regulatory genes were identified that have promoter activity. PMID:10817725

  5. A liver enhancer in the fibrinogen gene cluster.

    PubMed

    Fort, Alexandre; Fish, Richard J; Attanasio, Catia; Dosch, Roland; Visel, Axel; Neerman-Arbez, Marguerite

    2011-01-06

    The plasma concentration of fibrinogen varies in the healthy human population between 1.5 and 3.5 g/L. Understanding the basis of this variability has clinical importance because elevated fibrinogen levels are associated with increased cardiovascular disease risk. To identify novel regulatory elements involved in the control of fibrinogen expression, we used sequence conservation and in silico-predicted regulatory potential to select 14 conserved noncoding sequences (CNCs) within the conserved block of synteny containing the fibrinogen locus. The regulatory potential of each CNC was tested in vitro using a luciferase reporter gene assay in fibrinogen-expressing hepatoma cell lines (HuH7 and HepG2). Four potential enhancers were tested for their ability to direct enhanced green fluorescent protein expression in zebrafish embryos. CNC12, a sequence equidistant from the human fibrinogen alpha and beta chain genes, activates strong liver enhanced green fluorescent protein expression in injected embryos and their transgenic progeny. A transgenic assay in embryonic day 14.5 mouse embryos confirmed the ability of CNC12 to activate transcription in the liver. While additional experiments are necessary to prove the role of CNC12 in the regulation of fibrinogen, our study reveals a novel regulatory element in the fibrinogen locus that is active in the liver and may contribute to variable fibrinogen expression in humans.

  6. Conservation of a microRNA cluster in parasitic nematodes and profiling of miRNAs in excretory-secretory products and microvesicles of Haemonchus contortus

    PubMed Central

    Gu, Henry Y.; Marks, Neil D.; Winter, Alan D.; Weir, William; Tzelos, Thomas; McNeilly, Tom N.; Britton, Collette

    2017-01-01

    microRNAs are small non-coding RNAs that are important regulators of gene expression in a range of animals, including nematodes. We have analysed a cluster of four miRNAs from the pathogenic nematode species Haemonchus contortus that are closely linked in the genome. We find that the cluster is conserved only in clade V parasitic nematodes and in some ascarids, but not in other clade III species nor in clade V free-living nematodes. Members of the cluster are present in parasite excretory-secretory products and can be detected in the abomasum and draining lymph nodes of infected sheep, indicating their release in vitro and in vivo. As observed for other parasitic nematodes, H. contortus adult worms release extracellular vesicles (EV). Small RNA libraries were prepared from vesicle-enriched and vesicle-depleted supernatants from both adult worms and L4 stage larvae. Comparison of the miRNA species in the different fractions indicated that specific miRNAs are packaged within vesicles, while others are more abundant in vesicle-depleted supernatant. Hierarchical clustering analysis indicated that the gut is the likely source of vesicle-associated miRNAs in the L4 stage, but not in the adult worm. These findings add to the growing body of work demonstrating that miRNAs released from parasitic helminths may play an important role in host-parasite interactions. PMID:29145392

  7. Genetic diversity and accession structure in European Cynara cardunculus collections

    PubMed Central

    Fernández, Juan A.; Sonnante, Gabriella; Egea-Gilabert, Catalina

    2017-01-01

    Understanding the distribution of genetic variations and accession structures is an important factor for managing genetic resources, but also for using proper germplasm in association map analyses and breeding programs. The globe artichoke is the fourth most important horticultural crop in Europe. Here, we report the results of a molecular analysis of a collection including globe artichoke and leafy cardoon germplasm present in the Italian, French and Spanish gene banks. The aims of this study were to: (i) assess the diversity present in European collections; (ii) determine the population structure; (iii) measure the genetic distance between accessions; (iv) cluster the accessions; (v) properly distinguish accessions present in the different national collections carrying the same name; and (vi) understand the diversity distribution in relation to the gene bank and the geographic origin of the germplasm. A total of 556 individuals grouped into 174 accessions of distinct typologies were analyzed by different types of molecular markers, i.e. dominant (ISSR and AFLP) and co-dominant (SSR). The data of the two crops (globe artichoke and leafy cardoon) were analyzed jointly and separately to compute, among other aims, the gene diversity, heterozygosity (He, Ho), fixation indexes, AMOVA, genetic distance and structure. The findings underline the huge diversity present in the analyzed material, and the existence of alleles that are able to discriminate among accessions. The accessions were clustered not only on the basis of their typology, but also on the basis of the gene bank they come from. Probably, the environmental conditions of the different field gene banks affected germplasm conservation. These outcomes will be useful in plant breeding to select accessions and to fingerprint varieties. Moreover, the results highlight the particular attention that should be paid to the method used to conserve the Cynara cardunculus germplasm and suggest a preference for using accessions from different gene banks to run an association map. PMID:28570688

  8. A Sinorhizobium meliloti RpoH-Regulated Gene Is Involved in Iron-Sulfur Protein Metabolism and Effective Plant Symbiosis under Intrinsic Iron Limitation.

    PubMed

    Sasaki, Shohei; Minamisawa, Kiwamu; Mitsui, Hisayuki

    2016-09-01

    In Sinorhizobium meliloti, RpoH-type sigma factors have a global impact on gene expression during heat shock and play an essential role in symbiosis with leguminous plants. Using mutational analysis of a set of genes showing highly RpoH-dependent expression during heat shock, we identified a gene indispensable for effective symbiosis. This gene, designated sufT, was located downstream of the sufBCDS homologs that specify the iron-sulfur (Fe/S) cluster assembly pathway. The identified transcription start site was preceded by an RpoH-dependent promoter consensus sequence. SufT was related to a conserved protein family of unknown molecular function, of which some members are involved in Fe/S cluster metabolism in diverse organisms. A sufT mutation decreased bacterial growth in both rich and minimal media, tolerance to stresses such as iron starvation, and activities of some Fe/S cluster-dependent enzymes. These results support the involvement of SufT in SUF (sulfur mobilization) system-mediated Fe/S protein metabolism. Furthermore, we isolated spontaneous pseudorevertants of the sufT mutant with partially recovered growth; each of them had a mutation in rirA. This gene encodes a global iron regulator whose loss increases the intracellular iron content. Deletion of rirA in the original sufT mutant improved growth and restored Fe/S enzyme activities and effective symbiosis. These results suggest that enhanced iron availability compensates for the lack of SufT in the maintenance of Fe/S proteins. Although RpoH-type sigma factors of the RNA polymerase are present in diverse proteobacteria, their role as global regulators of protein homeostasis has been studied mainly in the enteric gammaproteobacterium Escherichia coli. In the soil alphaproteobacterium Sinorhizobium meliloti, the rpoH mutations have a strong impact on symbiosis with leguminous plants. We found that sufT is a unique member of the S. meliloti RpoH regulon; sufT contributes to Fe/S protein metabolism and effective symbiosis under intrinsic iron limitation exerted by RirA, a global iron regulator. Our study provides insights into the RpoH regulon function in diverse proteobacteria adapted to particular ecological niches and into the mechanism of conserved Fe/S protein biogenesis. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  9. ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation

    PubMed Central

    Kristensen, David M.; Wolf, Yuri I.; Koonin, Eugene V.

    2017-01-01

    The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genomes that meet the objective criteria of a high degree of local gene order (synteny) and a small number of synonymous substitutions in the protein-coding genes. As such, each ATGC is suited for analysis of microevolutionary variations within a cohesive group of organisms (e.g. species), whereas the entire collection of ATGCs is useful for macroevolutionary studies. The ATGC database includes many forms of pre-computed data, in particular ATGC-COGs (Clusters of Orthologous Genes), multiple sequence alignments, a set of ‘index’ orthologs representing the most well-conserved members of each ATGC-COG, the phylogenetic tree of the organisms within each ATGC, etc. Although the ATGC database contains several million proteins from thousands of genomes organized into hundreds of clusters (roughly a 4-fold increase since the last version of the ATGC database), it is now built with completely automated methods and will be regularly updated following new releases of the NCBI RefSeq database. The ATGC database is hosted jointly at the University of Iowa at dmk-brain.ecn.uiowa.edu/ATGC/ and the NCBI at ftp.ncbi.nlm.nih.gov/pub/kristensen/ATGC/atgc_home.html. PMID:28053163

  10. The first myriapod genome sequence reveals conservative arthropod gene content and genome organisation in the centipede Strigamia maritima.

    PubMed

    Chipman, Ariel D; Ferrier, David E K; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S T; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C; Alonso, Claudio R; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C J; Blankenburg, Kerstin P; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K; Du Pasquier, Louis; Duncan, Elizabeth J; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D; Extavour, Cassandra G; Francisco, Liezl; Gabaldón, Toni; Gillis, William J; Goodwin-Horn, Elizabeth A; Green, Jack E; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J P; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H L; Hunn, Julia P; Hunnekuhl, Vera S; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N; Jiggins, Francis M; Jones, Tamsin E; Kaiser, Tobias S; Kalra, Divya; Kenny, Nathan J; Korchina, Viktoriya; Kovar, Christie L; Kraus, F Bernhard; Lapraz, François; Lee, Sandra L; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C; Robertson, Helen E; Robertson, Hugh M; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E; Schurko, Andrew M; Siggens, Kenneth W; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M; Willis, Judith H; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M; Worley, Kim C; Gibbs, Richard A; Akam, Michael; Richards, Stephen

    2014-11-01

    Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific life history.

  11. The First Myriapod Genome Sequence Reveals Conservative Arthropod Gene Content and Genome Organisation in the Centipede Strigamia maritima

    PubMed Central

    Chipman, Ariel D.; Ferrier, David E. K.; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S. T.; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C.; Alonso, Claudio R.; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C. J.; Blankenburg, Kerstin P.; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K.; Du Pasquier, Louis; Duncan, Elizabeth J.; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D.; Extavour, Cassandra G.; Francisco, Liezl; Gabaldón, Toni; Gillis, William J.; Goodwin-Horn, Elizabeth A.; Green, Jack E.; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J. P.; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H. L.; Hunn, Julia P.; Hunnekuhl, Vera S.; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N.; Jiggins, Francis M.; Jones, Tamsin E.; Kaiser, Tobias S.; Kalra, Divya; Kenny, Nathan J.; Korchina, Viktoriya; Kovar, Christie L.; Kraus, F. Bernhard; Lapraz, François; Lee, Sandra L.; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N.; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J.; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H.; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C.; Robertson, Helen E.; Robertson, Hugh M.; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E.; Schurko, Andrew M.; Siggens, Kenneth W.; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J.; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M.; Willis, Judith H.; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M.; Worley, Kim C.; Gibbs, Richard A.; Akam, Michael; Richards, Stephen

    2014-01-01

    Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific life history. PMID:25423365

  12. Gene context analysis in the Integrated Microbial Genomes (IMG) data management system.

    PubMed

    Mavromatis, Konstantinos; Chu, Ken; Ivanova, Natalia; Hooper, Sean D; Markowitz, Victor M; Kyrpides, Nikos C

    2009-11-24

    Computational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes often share a similar gene context, and it relies on the identification of such events across a phylogenetically diverse collection of genomes. We have used the data management system of the Integrated Microbial Genomes (IMG) as the framework to implement and explore the power of gene context analysis methods because it provides one of the largest available genome integrations. Visualization and search tools to facilitate gene context analysis have been developed and applied across all publicly available archaeal and bacterial genomes in IMG. These computations are now maintained as part of IMG's regular genome content update cycle. IMG is available at: http://img.jgi.doe.gov.
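
    One of the context signals named above, conservation of chromosomal gene neighbourhoods, can be illustrated with a toy sketch. This is not IMG's implementation: the genome names, gene family labels, and the `max_gap` and `min_genomes` parameters are hypothetical, and real pipelines would work from orthology assignments and genomic coordinates rather than short hand-written lists.

```python
# Hypothetical toy input: for each genome, gene families in chromosomal order.
GENOMES = {
    "genome_A": ["famA", "famB", "famC", "famD"],
    "genome_B": ["famX", "famA", "famB", "famD"],
    "genome_C": ["famB", "famA", "famY", "famC"],
}


def neighbor_pairs(order, max_gap=1):
    """Return unordered pairs of families lying within max_gap positions of each other."""
    pairs = set()
    for i, gene in enumerate(order):
        for other in order[i + 1 : i + 1 + max_gap]:
            if gene != other:
                pairs.add(frozenset((gene, other)))
    return pairs


def conserved_pairs(genomes, min_genomes=2, max_gap=1):
    """Count in how many genomes each neighbouring pair occurs; keep the recurrent ones."""
    counts = {}
    for order in genomes.values():
        for pair in neighbor_pairs(order, max_gap):
            counts[pair] = counts.get(pair, 0) + 1
    return {pair: n for pair, n in counts.items() if n >= min_genomes}


result = conserved_pairs(GENOMES)
for pair, n in result.items():
    print(sorted(pair), "adjacent in", n, "genomes")
```

    Using unordered pairs means a pair still counts when the gene order is inverted in one genome, as with famB and famA in the third toy genome.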

  13. Characterization and Evolution of Cell Division and Cell Wall Synthesis Genes in the Bacterial Phyla Verrucomicrobia, Lentisphaerae, Chlamydiae, and Planctomycetes and Phylogenetic Comparison with rRNA Genes

    PubMed Central

    Pilhofer, Martin; Rappl, Kristina; Eckl, Christina; Bauer, Andreas Peter; Ludwig, Wolfgang; Schleifer, Karl-Heinz; Petroni, Giulio

    2008-01-01

    In the past, studies on the relationships of the bacterial phyla Planctomycetes, Chlamydiae, Lentisphaerae, and Verrucomicrobia using different phylogenetic markers have been controversial. Investigations based on 16S rRNA sequence analyses suggested a relationship of the four phyla, showing the branching order Planctomycetes, Chlamydiae, Verrucomicrobia/Lentisphaerae. Phylogenetic analyses of 23S rRNA genes in this study also support a monophyletic grouping and their branching order; this grouping is significant for understanding cell division, since the major bacterial cell division protein FtsZ is absent from members of two of the phyla, Chlamydiae and Planctomycetes. In Verrucomicrobia, knowledge about cell division is mainly restricted to the recent report of ftsZ in the closely related genera Prosthecobacter and Verrucomicrobium. In this study, genes of the conserved division and cell wall (dcw) cluster (ddl, ftsQ, ftsA, and ftsZ) were characterized in all verrucomicrobial subdivisions (1 to 4) with cultivable representatives. Sequence analyses and transcriptional analyses in Verrucomicrobia and genome data analyses in Lentisphaerae suggested that cell division is based on FtsZ in all verrucomicrobial subdivisions and possibly also in the sister phylum Lentisphaerae. Comprehensive sequence analyses of available genome data for representatives of Verrucomicrobia, Lentisphaerae, Chlamydiae, and Planctomycetes strongly indicate that their last common ancestor possessed a conserved, ancestral type of dcw gene cluster and an FtsZ-based cell division mechanism. This implies that Planctomycetes and Chlamydiae may have shifted independently to a non-FtsZ-based cell division mechanism after their separate branchings from their last common ancestor with Verrucomicrobia. PMID:18310338

  14. Functional and bioinformatics analysis of an exopolysaccharide-related gene (epsN) from Lactobacillus kefiranofaciens ZW3.

    PubMed

    Wang, Jingrui; Tang, Wei; Zheng, Yongna; Xing, Zhuqing; Wang, Yanping

    2016-09-01

    A novel lactic acid bacterium, Lactobacillus kefiranofaciens ZW3, produces high yields of exopolysaccharide (EPS). The epsN gene, located in the eps gene cluster of this strain, is associated with EPS biosynthesis. Bioinformatics analysis of this gene was performed. The conserved domain analysis showed that the EpsN protein contained MATE-Wzx-like domains. The epsN gene was then amplified to construct the recombinant expression vector pMG36e-epsN. The results showed that the EPS yields of the recombinants were significantly improved. Based on the measured yields of EPS and intracellular polysaccharide, the epsN gene product was considered likely to act as a Wzx flippase in EPS biosynthesis. This is the first demonstration of the effect of EpsN on L. kefiranofaciens EPS biosynthesis, and it provides further evidence of its functional role.

  15. Database resources of the National Center for Biotechnology Information

    PubMed Central

    Sayers, Eric W.; Barrett, Tanya; Benson, Dennis A.; Bolton, Evan; Bryant, Stephen H.; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M.; DiCuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M.; Geer, Lewis Y.; Helmberg, Wolfgang; Kapustin, Yuri; Krasnov, Sergey; Landsman, David; Lipman, David J.; Lu, Zhiyong; Madden, Thomas L.; Madej, Tom; Maglott, Donna R.; Marchler-Bauer, Aron; Miller, Vadim; Karsch-Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D.; Schuler, Gregory D.; Sequeira, Edwin; Sherry, Stephen T.; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A.; Wagner, Lukas; Wang, Yanli; Wilbur, W. John; Yaschenko, Eugene; Ye, Jian

    2012-01-01

    In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. PMID:22140104

  16. Database resources of the National Center for Biotechnology Information

    PubMed Central

    2013-01-01

    In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, the Genetic Testing Registry, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Probe, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page. PMID:23193264

  17. Database resources of the National Center for Biotechnology Information.

    PubMed

    Wheeler, David L; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Geer, Lewis Y; Kapustin, Yuri; Khovayko, Oleg; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Ostell, James; Miller, Vadim; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Steven T; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusov, Roman L; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene

    2007-01-01

    In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace and Assembly Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Viral Genotyping Tools, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

  18. Database resources of the National Center for Biotechnology Information.

    PubMed

    Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Feolo, Michael; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene; Ye, Jian

    2009-01-01

    In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

  19. Database resources of the National Center for Biotechnology Information

    PubMed Central

    Wheeler, David L.; Barrett, Tanya; Benson, Dennis A.; Bryant, Stephen H.; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M.; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Feolo, Michael; Geer, Lewis Y.; Helmberg, Wolfgang; Kapustin, Yuri; Khovayko, Oleg; Landsman, David; Lipman, David J.; Madden, Thomas L.; Maglott, Donna R.; Miller, Vadim; Ostell, James; Pruitt, Kim D.; Schuler, Gregory D.; Shumway, Martin; Sequeira, Edwin; Sherry, Steven T.; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusov, Roman L.; Tatusova, Tatiana A.; Wagner, Lukas; Yaschenko, Eugene

    2008-01-01

    In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data available through NCBI's web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace, Assembly, and Short Read Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Database of Genotype and Phenotype, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting the web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. PMID:18045790
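
    The abstracts above name the Entrez Programming Utilities as a retrieval resource. As a minimal sketch, assuming the publicly documented E-utilities `efetch` endpoint and its `db`, `id`, `rettype` and `retmode` parameters, the following builds a request URL for the plain-text abstract of a record by PMID. The function name `efetch_abstract_url` is hypothetical and is used here only for illustration; the code only constructs the URL and performs no network request.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities service (assumed from NCBI's public documentation).
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def efetch_abstract_url(pmid: str) -> str:
    """Build an efetch URL requesting the plain-text abstract for one PubMed ID.

    Hypothetical helper for illustration; it does not contact NCBI.
    """
    params = {
        "db": "pubmed",         # Entrez database to query
        "id": pmid,             # PubMed identifier of the record
        "rettype": "abstract",  # return the abstract view
        "retmode": "text",      # as plain text
    }
    return f"{EUTILS_BASE}/efetch.fcgi?{urlencode(params)}"


# PMID taken from the record above.
print(efetch_abstract_url("18045790"))
```

    Passing the resulting URL to any HTTP client would be expected to return the abstract text, subject to NCBI's usage guidelines.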

  20. The iron uptake repressor Fep1 in the fission yeast binds Fe-S cluster through conserved cysteines.

    PubMed

    Kim, Hyo-Jin; Lee, Kang-Lok; Kim, Kyoung-Dong; Roe, Jung-Hye

    2016-09-09

    Iron homeostasis is tightly regulated since iron is an essential but toxic element in the cell. The GATA-type transcription factor Fep1 and its orthologs contribute to iron homeostasis in many fungi by repressing genes for iron uptake when intracellular iron is high. Even though the function and interaction partners of Fep1 have been elucidated extensively in Schizosaccharomyces pombe, the mechanism behind iron-sensing by Fep1 remains elusive. It has been reported that Fep1 interacts with the Fe-S-containing monothiol glutaredoxin Grx4 and the Grx4-Fra2 complex. In this study, we demonstrate that Fep1 also binds iron, in the form of an Fe-S cluster. Spectroscopic and biochemical analyses of as-isolated and reconstituted Fep1 suggest that the dimeric Fep1 binds Fe-S clusters. The mutation study revealed that the cluster-binding depended on the conserved cysteines located between the two zinc fingers in the DNA binding domain. EPR analyses revealed [Fe-S]-specific peaks indicative of mixed presence of [2Fe-2S], [3Fe-4S], or [4Fe-4S]. The finding that Fep1 is an Fe-S protein fits nicely with the model that the Fe-S-trafficking Grx4 senses intracellular iron environment and modulates the activity of Fep1. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Isolation of NotI sites from chromosome 22q11

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ten Hoeve, J.; Groffen, J.; Heisterkamp, N.

    1993-12-01

    Chromosome 22q11 contains a large number of interesting loci, including genes associated with cancer and developmental defects. The region is also the site of the lambda immunoglobulin variable and constant regions and the BCR, [gamma]-glutamyl transpeptidase, and GGT-like activity multigene families. Because of the complexities associated with mapping highly related gene families, the authors have examined the utility of mapping large areas of DNA using a defined approach. A total of 21 complete NotI sites from band q11 were cloned and ordered into six noncontiguous clusters of sites using a combination of somatic cell hybrid panels, NotI jumping and linking libraries, and fluorescence in situ hybridization. The largest cluster spanned an estimated 2 Mb of NotI fragments, the smallest 115 kb. Approximately 3.5 Mb of band q11 could be examined for rearrangements in NotI restriction enzyme fragments. A number of conserved sequences, two genes, and a minimum of two families of related sequences were identified adjacent to NotI sites. 51 refs., 5 figs., 4 tabs.

  2. Chloroplast DNA sequence of the green alga Oedogonium cardiacum (Chlorophyceae): Unique genome architecture, derived characters shared with the Chaetophorales and novel genes acquired through horizontal transfer

    PubMed Central

    Brouard, Jean-Simon; Otis, Christian; Lemieux, Claude; Turmel, Monique

    2008-01-01

    Background: To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results: Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart. Characters shared by these chlorophyceans but missing in members of the CS clade include the retention of psaM, rpl32 and trnL(caa), the loss of petA, the disruption of three ancestral clusters and the presence of five derived gene clusters. Conclusion: The Oedogonium chloroplast genome disclosed additional characters that bolster the evidence for a close alliance between the Oedogoniales and Chaetophorales. Our unprecedented finding of int and dpoB in this cpDNA provides a clear example that novel genes were acquired by the chloroplast genome through horizontal transfers, possibly from a mitochondrial genome donor. PMID:18558012

  3. Cloned Erwinia chrysanthemi out genes enable Escherichia coli to selectively secrete a diverse family of heterologous proteins to its milieu.

    PubMed

    He, S Y; Lindeberg, M; Chatterjee, A K; Collmer, A

    1991-02-01

    The out genes of the enterobacterial plant pathogen Erwinia chrysanthemi are responsible for the efficient extracellular secretion of multiple plant cell wall-degrading enzymes, including four isozymes of pectate lyase, exo-poly-alpha-D-galacturonosidase, pectin methylesterase, and cellulase. Out- mutants of Er. chrysanthemi are unable to export any of these proteins beyond the periplasm and are severely reduced in virulence. We have cloned out genes from Er. chrysanthemi in the stable, low-copy-number cosmid pCPP19 by complementing several transposon-induced mutations. The cloned out genes were clustered in a 12-kilobase chromosomal DNA region, complemented all existing out mutations in Er. chrysanthemi EC16, and enabled Escherichia coli strains to efficiently secrete the extracellular pectic enzymes produced from cloned Er. chrysanthemi genes, while retaining the periplasmic marker protein beta-lactamase. DNA sequencing of a 2.4-kilobase EcoRI fragment within the out cluster revealed four genes arranged colinearly and sharing substantial similarity with the Klebsiella pneumoniae genes pulH, pulI, pulJ, and pulK, which are necessary for pullulanase secretion. However, K. pneumoniae cells harboring the cloned Er. chrysanthemi pelE gene were unable to secrete the Erwinia pectate lyase. Furthermore, the Er. chrysanthemi Out system was unable to secrete an extracellular pectate lyase encoded by a gene from a closely related plant pathogen, Erwinia carotovora ssp. carotovora. The results suggest that these enterobacteria secrete polysaccharidases by a conserved mechanism whose protein-recognition capacities have diverged.

  4. Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?

    PubMed

    Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F

    2006-06-01

    Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs.

  5. Burkholderia mallei tssM encodes a putative deubiquitinase that is secreted and expressed inside infected RAW 264.7 murine macrophages.

    PubMed

    Shanks, John; Burtnick, Mary N; Brett, Paul J; Waag, David M; Spurgers, Kevin B; Ribot, Wilson J; Schell, Mark A; Panchal, Rekha G; Gherardini, Frank C; Wilkinson, Keith D; Deshazer, David

    2009-04-01

    Burkholderia mallei, a category B biothreat agent, is a facultative intracellular pathogen that causes the zoonotic disease glanders. The B. mallei VirAG two-component regulatory system activates the transcription of approximately 60 genes, including a large virulence gene cluster encoding a type VI secretion system (T6SS). The B. mallei tssM gene encodes a putative ubiquitin-specific protease that is physically linked to, and transcriptionally coregulated with, the T6SS gene cluster. Mass spectrometry and immunoblot analysis demonstrated that TssM was secreted in a virAG-dependent manner in vitro. Surprisingly, the T6SS was found to be dispensable for the secretion of TssM. The C-terminal half of TssM, which contains Cys and His box motifs conserved in eukaryotic deubiquitinases, was purified and biochemically characterized. Recombinant TssM hydrolyzed multiple ubiquitinated substrates and the cysteine at position 102 was critical for enzymatic activity. The tssM gene was expressed within 1 h after uptake of B. mallei into RAW 264.7 murine macrophages, suggesting that the TssM deubiquitinase is produced in this intracellular niche. Although the physiological substrate(s) is currently unknown, the TssM deubiquitinase may provide B. mallei a selective advantage in the intracellular environment during infection.

  6. Identification and comparative analysis of the epidermal differentiation complex in snakes

    PubMed Central

    Holthaus, Karin Brigit; Mlitz, Veronika; Strasser, Bettina; Tschachler, Erwin; Alibardi, Lorenzo; Eckhart, Leopold

    2017-01-01

    The epidermis of snakes efficiently protects against dehydration and mechanical stress. However, only few proteins of the epidermal barrier to the environment have so far been identified in snakes. Here, we determined the organization of the Epidermal Differentiation Complex (EDC), a cluster of genes encoding protein constituents of cornified epidermal structures, in snakes and compared it to the EDCs of other squamates and non-squamate reptiles. The EDC of snakes displays shared synteny with that of the green anole lizard, including the presence of a cluster of corneous beta-protein (CBP)/beta-keratin genes. We found that a unique CBP comprising 4 putative beta-sheets and multiple cysteine-rich EDC proteins are conserved in all snakes and other squamates investigated. Comparative genomics of squamates suggests that the evolution of snakes was associated with a gene duplication generating two isoforms of the S100 fused-type protein, scaffoldin, the origin of distinct snake-specific EDC genes, and the loss of other genes that were present in the EDC of the last common ancestor of snakes and lizards. Taken together, our results provide new insights into the evolution of the skin in squamates and a basis for the characterization of the molecular composition of the epidermis in snakes. PMID:28345630

  7. Genetic and physical mapping of homologues of the virus resistance gene Rx1 and the cyst nematode resistance gene Gpa2 in potato.

    PubMed

    Bakker, E; Butterbach, P; Rouppe van der Voort, J; van der Vossen, E; van Vliet, J; Bakker, J; Goverse, A

    2003-05-01

    Nine resistance gene homologues (RGHs) were identified in two diploid potato clones (SH and RH), with a specific primer pair based on conserved motifs in the LRR domain of the potato cyst nematode resistance gene Gpa2 and the potato virus X resistance gene Rx1. A modified AFLP method was used to facilitate the genetic mapping of the RGHs in the four haplotypes under investigation. All nine RGHs appeared to be located in the Gpa2/Rx1 cluster on chromosome XII. Construction of a physical map using bacterial artificial chromosome (BAC) clones for both the Solanum tuberosum ssp. tuberosum and the S. tuberosum ssp. andigena haplotype of SH showed that the RGHs are located within a stretch of less than 200 kb. Sequence analysis of the RGHs revealed that they are highly similar (93 to 95%) to Gpa2 and Rx1. The sequence identities among all RGHs range from 85 to 100%. Two pairs of RGHs are identical, or nearly so (100 and 99.9%), with each member located in a different genotype. Southern-blot analysis on genomic DNA revealed no evidence for additional homologues outside the Gpa2/Rx1 cluster on chromosome XII.

  8. Database resources of the National Center for Biotechnology

    PubMed Central

    Wheeler, David L.; Church, Deanna M.; Federhen, Scott; Lash, Alex E.; Madden, Thomas L.; Pontius, Joan U.; Schuler, Gregory D.; Schriml, Lynn M.; Sequeira, Edwin; Tatusova, Tatiana A.; Wagner, Lukas

    2003-01-01

    In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, PubMed, PubMed Central (PMC), LocusLink, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR (e-PCR), Open Reading Frame (ORF) Finder, Reference Sequence (RefSeq), UniGene, HomoloGene, ProtEST, Database of Single Nucleotide Polymorphisms (dbSNP), Human/Mouse Homology Map, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes and related tools, the Map Viewer, Model Maker (MM), Evidence Viewer (EV), Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov. PMID:12519941

  9. A High-Resolution Gene Map of the Chloroplast Genome of the Red Alga Porphyra purpurea.

    PubMed Central

    Reith, M; Munholland, J

    1993-01-01

    Extensive DNA sequencing of the chloroplast genome of the red alga Porphyra purpurea has resulted in the detection of more than 125 genes. Fifty-eight (approximately 46%) of these genes are not found on the chloroplast genomes of land plants. These include genes encoding 17 photosynthetic proteins, three tRNAs, and nine ribosomal proteins. In addition, nine genes encoding proteins related to biosynthetic functions, six genes encoding proteins involved in gene expression, and at least five genes encoding miscellaneous proteins are among those not known to be located on land plant chloroplast genomes. The increased coding capacity of the P. purpurea chloroplast genome, along with other characteristics such as the absence of introns and the conservation of ancestral operons, demonstrate the primitive nature of the P. purpurea chloroplast genome. In addition, evidence for a monophyletic origin of chloroplasts is suggested by the identification of two groups of genes that are clustered in chloroplast genomes but not in cyanobacteria. PMID:12271072

  10. NsrR from Streptomyces coelicolor Is a Nitric Oxide-sensing [4Fe-4S] Cluster Protein with a Specialized Regulatory Function

    PubMed Central

    Crack, Jason C.; Munnoch, John; Dodd, Erin L.; Knowles, Felicity; Al Bassam, Mahmoud M.; Kamali, Saeed; Holland, Ashley A.; Cramer, Stephen P.; Hamilton, Chris J.; Johnson, Michael K.; Thomson, Andrew J.; Hutchings, Matthew I.; Le Brun, Nick E.

    2015-01-01

    The Rrf2 family transcription factor NsrR controls expression of genes in a wide range of bacteria in response to nitric oxide (NO). The precise form of the NO-sensing module of NsrR is the subject of controversy because NsrR proteins containing either [2Fe-2S] or [4Fe-4S] clusters have been observed previously. Optical, Mössbauer, resonance Raman spectroscopies and native mass spectrometry demonstrate that Streptomyces coelicolor NsrR (ScNsrR), previously reported to contain a [2Fe-2S] cluster, can be isolated containing a [4Fe-4S] cluster. ChIP-seq experiments indicated that the ScNsrR regulon is small, consisting of only hmpA1, hmpA2, and nsrR itself. The hmpA genes encode NO-detoxifying flavohemoglobins, indicating that ScNsrR has a specialized regulatory function focused on NO detoxification and is not a global regulator like some NsrR orthologues. EMSAs and DNase I footprinting showed that the [4Fe-4S] form of ScNsrR binds specifically and tightly to an 11-bp inverted repeat sequence in the promoter regions of the identified target genes and that DNA binding is abolished following reaction with NO. Resonance Raman data were consistent with cluster coordination by three Cys residues and one oxygen-containing residue, and analysis of ScNsrR variants suggested that highly conserved Glu-85 may be the fourth ligand. Finally, we demonstrate that some low molecular weight thiols, but importantly not physiologically relevant thiols, such as cysteine and an analogue of mycothiol, bind weakly to the [4Fe-4S] cluster, and exposure of this bound form to O2 results in cluster conversion to the [2Fe-2S] form, which does not bind to DNA. These data help to account for the observation of [2Fe-2S] forms of NsrR. PMID:25771538

  11. Evolution of two Rh blood group-related genes of the amphioxus species Branchiostoma floridae.

    PubMed

    Kitano, Takashi; Satou, Masahiro; Saitou, Naruya

    2010-04-01

    We determined cDNAs of two genes that belong to the Rhesus (Rh) blood group gene family in an amphioxus species (Branchiostoma floridae) and designated them Rh-related-1 (RhR-1) and Rh-related-2 (RhR-2). RhR-1 and RhR-2 consisted of 10 and 11 exons, respectively. 3' UTR sequences of RhR-1 were shorter (220-272 bp) than those of RhR-2 (1,505-1,650 bp). CDS lengths were 1,344 and 1,476 bp for RhR-1 and RhR-2, respectively, and the average nucleotide difference between their CDS regions was 0.33. The corresponding regions of Rh genes from exons 2 to 7 were relatively conserved among the chordate species examined in this study. Length difference numbers were in multiples of three, which implies that codon frames were conserved among them, and the same exon/intron boundary phases were observed in those regions. This region was used for the phylogenetic analyses. RhR-1 and RhR-2 formed a cluster on the phylogenetic tree of the Rh gene family. Gene duplication time of RhR-1 and RhR-2 was estimated to be ca. 500 million years ago. It is likely that the four Rh family genes in vertebrates emerged by gene duplications in the common ancestor of vertebrates, and functional differentiation has occurred after the first gene duplication.
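
    The abstract's argument that length differences in multiples of three imply a conserved codon frame can be checked with simple arithmetic. The sketch below applies that test to the two full CDS lengths reported above (1,344 and 1,476 bp). This is an illustration only: the abstract makes the statement about the exon 2 to 7 regions compared across chordate species, and the function name `preserves_codon_frame` is hypothetical.

```python
# CDS lengths in base pairs, as reported in the abstract above.
CDS_LENGTHS = {"RhR-1": 1344, "RhR-2": 1476}


def preserves_codon_frame(length_a: int, length_b: int) -> bool:
    """Return True when two lengths differ by a whole number of codons (3 bp each)."""
    return (length_a - length_b) % 3 == 0


diff = CDS_LENGTHS["RhR-2"] - CDS_LENGTHS["RhR-1"]
# Print the difference in bp, the difference in codons, and the frame check.
print(diff, diff // 3, preserves_codon_frame(CDS_LENGTHS["RhR-1"], CDS_LENGTHS["RhR-2"]))
# prints: 132 44 True
```

    A difference of 132 bp corresponds to exactly 44 codons, so an insertion or deletion of that size would leave the downstream reading frame unchanged.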

  12. Comparative Genomic and Transcriptomic Analysis of Wangiella dermatitidis, A Major Cause of Phaeohyphomycosis and a Model Black Yeast Human Pathogen

    PubMed Central

    Chen, Zehua; Martinez, Diego A.; Gujja, Sharvari; Sykes, Sean M.; Zeng, Qiandong; Szaniszlo, Paul J.; Wang, Zheng; Cuomo, Christina A.

    2014-01-01

    Black or dark brown (phaeoid) fungi cause cutaneous, subcutaneous, and systemic infections in humans. Black fungi thrive in stressful conditions such as intense light, high radiation, and very low pH. Wangiella (Exophiala) dermatitidis is arguably the most studied phaeoid fungal pathogen of humans. Here, we report our comparative analysis of the genome of W. dermatitidis and the transcriptional response to low pH stress. This revealed that W. dermatitidis has lost the ability to synthesize alpha-glucan, a cell wall compound many pathogenic fungi use to evade the host immune system. In contrast, W. dermatitidis contains a similar profile of chitin synthase genes as related fungi and strongly induces genes involved in cell wall synthesis in response to pH stress. The large portfolio of transporters may provide W. dermatitidis with an enhanced ability to remove harmful products as well as to survive on diverse nutrient sources. The genome encodes three independent pathways for producing melanin, an ability linked to pathogenesis; these are active during pH stress, potentially to produce a barrier to accumulated oxidative damage that might occur under stress conditions. In addition, a full set of fungal light-sensing genes is present, including as part of a carotenoid biosynthesis gene cluster. Finally, we identify a two-gene cluster involved in nucleotide sugar metabolism conserved with a subset of fungi and characterize a horizontal transfer event of this cluster between fungi and algal viruses. This work reveals how W. dermatitidis has adapted to stress and survives in diverse environments, including during human infections. PMID:24496724

  13. Single-cell transcriptome analysis of fish immune cells provides insight into the evolution of vertebrate immune cell types.

    PubMed

    Carmona, Santiago J; Teichmann, Sarah A; Ferreira, Lauren; Macaulay, Iain C; Stubbington, Michael J T; Cvejic, Ana; Gfeller, David

    2017-03-01

    The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans-membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell-specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans-membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. © 2017 Carmona et al.; Published by Cold Spring Harbor Laboratory Press.

  14. Single-cell transcriptome analysis of fish immune cells provides insight into the evolution of vertebrate immune cell types

    PubMed Central

    Ferreira, Lauren; Macaulay, Iain C.; Stubbington, Michael J.T.

    2017-01-01

    The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans-membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell–specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans-membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. PMID:28087841

  15. Identification of evolutionarily conserved Momordica charantia microRNAs using computational approach and its utility in phylogeny analysis.

    PubMed

    Thirugnanasambantham, Krishnaraj; Saravanan, Subramanian; Karikalan, Kulandaivelu; Bharanidharan, Rajaraman; Lalitha, Perumal; Ilango, S; HairulIslam, Villianur Ibrahim

    2015-10-01

    Momordica charantia (bitter gourd, bitter melon) is a monoecious Cucurbitaceae with anti-oxidant, anti-microbial, anti-viral and anti-diabetic potential. Molecular studies on this economically valuable plant are essential to understanding its phylogeny and evolution. MicroRNAs (miRNAs) are conserved, small, non-coding RNAs that regulate gene expression by binding the 3' UTR of target mRNAs and have evolved at different rates in different plant species. In this study we used a homology-based computational approach and identified 27 mature miRNAs for the first time in this biomedically important plant. The phylogenetic tree built from binary presence/absence data for the identified miRNAs was found to be uncertain and biased. Most of the identified miRNAs were highly conserved among plant species, and sequence-based phylogenetic analysis of the miRNAs resolved these difficulties. Predicted gene targets of the identified miRNAs revealed their importance in the regulation of plant developmental processes. The reported miRNAs showed sequence conservation in their mature forms, and detailed phylogenetic analysis of pre-miRNA sequences revealed genus-specific segregation of clusters. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. Implications of the circumpolar genetic structure of polar bears for their conservation in a rapidly warming Arctic.

    PubMed

    Peacock, Elizabeth; Sonsthagen, Sarah A; Obbard, Martyn E; Boltunov, Andrei; Regehr, Eric V; Ovsyanikov, Nikita; Aars, Jon; Atkinson, Stephen N; Sage, George K; Hope, Andrew G; Zeyl, Eve; Bachmann, Lutz; Ehrich, Dorothee; Scribner, Kim T; Amstrup, Steven C; Belikov, Stanislav; Born, Erik W; Derocher, Andrew E; Stirling, Ian; Taylor, Mitchell K; Wiig, Øystein; Paetkau, David; Talbot, Sandra L

    2015-01-01

    We provide an expansive analysis of polar bear (Ursus maritimus) circumpolar genetic variation during the last two decades of decline in their sea-ice habitat. We sought to evaluate whether their genetic diversity and structure have changed over this period of habitat decline, how their current genetic patterns compare with past patterns, and how genetic demography changed with ancient fluctuations in climate. Characterizing their circumpolar genetic structure using microsatellite data, we defined four clusters that largely correspond to current ecological and oceanographic factors: Eastern Polar Basin, Western Polar Basin, Canadian Archipelago and Southern Canada. We document evidence for recent (ca. last 1-3 generations) directional gene flow from Southern Canada and the Eastern Polar Basin towards the Canadian Archipelago, an area hypothesized to be a future refugium for polar bears as climate-induced habitat decline continues. Our data provide empirical evidence in support of this hypothesis. The direction of current gene flow differs from earlier patterns of gene flow in the Holocene. From analyses of mitochondrial DNA, the Canadian Archipelago cluster and the Barents Sea subpopulation within the Eastern Polar Basin cluster did not show signals of population expansion, suggesting these areas may have served also as past interglacial refugia. Mismatch analyses of mitochondrial DNA data from polar and the paraphyletic brown bear (U. arctos) uncovered offset signals in timing of population expansion between the two species, that are attributed to differential demographic responses to past climate cycling. Mitogenomic structure of polar bears was shallow and developed recently, in contrast to the multiple clades of brown bears. We found no genetic signatures of recent hybridization between the species in our large, circumpolar sample, suggesting that recently observed hybrids represent localized events. 
Documenting changes in subpopulation connectivity will allow polar nations to proactively adjust conservation actions to continuing decline in sea-ice habitat.

  17. Implications of the Circumpolar Genetic Structure of Polar Bears for Their Conservation in a Rapidly Warming Arctic

    PubMed Central

    Peacock, Elizabeth; Sonsthagen, Sarah A.; Obbard, Martyn E.; Boltunov, Andrei; Regehr, Eric V.; Ovsyanikov, Nikita; Aars, Jon; Atkinson, Stephen N.; Sage, George K.; Hope, Andrew G.; Zeyl, Eve; Bachmann, Lutz; Ehrich, Dorothee; Scribner, Kim T.; Amstrup, Steven C.; Belikov, Stanislav; Born, Erik W.; Derocher, Andrew E.; Stirling, Ian; Taylor, Mitchell K.; Wiig, Øystein; Paetkau, David; Talbot, Sandra L.

    2015-01-01

    We provide an expansive analysis of polar bear (Ursus maritimus) circumpolar genetic variation during the last two decades of decline in their sea-ice habitat. We sought to evaluate whether their genetic diversity and structure have changed over this period of habitat decline, how their current genetic patterns compare with past patterns, and how genetic demography changed with ancient fluctuations in climate. Characterizing their circumpolar genetic structure using microsatellite data, we defined four clusters that largely correspond to current ecological and oceanographic factors: Eastern Polar Basin, Western Polar Basin, Canadian Archipelago and Southern Canada. We document evidence for recent (ca. last 1–3 generations) directional gene flow from Southern Canada and the Eastern Polar Basin towards the Canadian Archipelago, an area hypothesized to be a future refugium for polar bears as climate-induced habitat decline continues. Our data provide empirical evidence in support of this hypothesis. The direction of current gene flow differs from earlier patterns of gene flow in the Holocene. From analyses of mitochondrial DNA, the Canadian Archipelago cluster and the Barents Sea subpopulation within the Eastern Polar Basin cluster did not show signals of population expansion, suggesting these areas may have served also as past interglacial refugia. Mismatch analyses of mitochondrial DNA data from polar and the paraphyletic brown bear (U. arctos) uncovered offset signals in timing of population expansion between the two species, that are attributed to differential demographic responses to past climate cycling. Mitogenomic structure of polar bears was shallow and developed recently, in contrast to the multiple clades of brown bears. We found no genetic signatures of recent hybridization between the species in our large, circumpolar sample, suggesting that recently observed hybrids represent localized events. 
Documenting changes in subpopulation connectivity will allow polar nations to proactively adjust conservation actions to continuing decline in sea-ice habitat. PMID:25562525

  18. Implications of the circumpolar genetic structure of polar bears for their conservation in a rapidly warming Arctic

    USGS Publications Warehouse

    Peacock, Elizabeth; Sonsthagen, Sarah A.; Obbard, Martyn E.; Boltunov, Andrei N.; Regehr, Eric V.; Ovsyanikov, Nikita; Aars, Jon; Atkinson, Stephen N.; Sage, George K.; Hope, Andrew G.; Zeyl, Eve; Bachmann, Lutz; Ehrich, Dorothee; Scribner, Kim T.; Amstrup, Steven C.; Belikov, Stanislav; Born, Erik W.; Derocher, Andrew E.; Stirling, Ian; Taylor, Mitchell K.; Wiig, Øystein; Paetkau, David; Talbot, Sandra L.

    2015-01-01

    We provide an expansive analysis of polar bear (Ursus maritimus) circumpolar genetic variation during the last two decades of decline in their sea-ice habitat. We sought to evaluate whether their genetic diversity and structure have changed over this period of habitat decline, how their current genetic patterns compare with past patterns, and how genetic demography changed with ancient fluctuations in climate. Characterizing their circumpolar genetic structure using microsatellite data, we defined four clusters that largely correspond to current ecological and oceanographic factors: Eastern Polar Basin, Western Polar Basin, Canadian Archipelago and Southern Canada. We document evidence for recent (ca. last 1–3 generations) directional gene flow from Southern Canada and the Eastern Polar Basin towards the Canadian Archipelago, an area hypothesized to be a future refugium for polar bears as climate-induced habitat decline continues. Our data provide empirical evidence in support of this hypothesis. The direction of current gene flow differs from earlier patterns of gene flow in the Holocene. From analyses of mitochondrial DNA, the Canadian Archipelago cluster and the Barents Sea subpopulation within the Eastern Polar Basin cluster did not show signals of population expansion, suggesting these areas may have served also as past interglacial refugia. Mismatch analyses of mitochondrial DNA data from polar and the paraphyletic brown bear (U. arctos) uncovered offset signals in timing of population expansion between the two species, that are attributed to differential demographic responses to past climate cycling. Mitogenomic structure of polar bears was shallow and developed recently, in contrast to the multiple clades of brown bears. We found no genetic signatures of recent hybridization between the species in our large, circumpolar sample, suggesting that recently observed hybrids represent localized events. 
Documenting changes in subpopulation connectivity will allow polar nations to proactively adjust conservation actions to continuing decline in sea-ice habitat.

  19. Adaptive evolution of newly emerged micro-RNA genes in Drosophila.

    PubMed

    Lu, Jian; Fu, Yonggui; Kumar, Supriya; Shen, Yang; Zeng, Kai; Xu, Anlong; Carthew, Richard; Wu, Chung-I

    2008-05-01

    How often micro-RNA (miRNA) genes emerged and how fast they evolved soon after their emergence are some of the central questions in the evolution of miRNAs. Because most known miRNA genes are ancient and highly conserved, these questions can be best answered by identifying newly emerged miRNA genes. Among the 78 miRNA genes in Drosophila reported before 2007, only 5 are confirmed to be newly emerged in the genus (although many more can be found in the newly reported data set; e.g., Ruby et al. 2007; Stark et al. 2007; Lu et al. 2008). These new miRNA genes have undergone numerous changes, even in the normally invariant mature sequences. Four of them (the miR-310/311/312/313 cluster, denoted miR-310s) were duplicated from other conserved miRNA genes. The fifth one (miR-303) appears to be a very young gene, originating de novo from a non-miRNA sequence recently. We sequenced these 5 miRNA genes and their neighboring regions from a worldwide collection of Drosophila melanogaster lines. The levels of divergence and polymorphism in these miRNA genes, vis-à-vis those of the neighboring DNA sequences, suggest that these 5 genes are evolving adaptively. Furthermore, the polymorphism pattern of miR-310s in D. melanogaster is indicative of hitchhiking under positive selection. Thus, a large number of adaptive changes over a long period of time may be essential for the evolution of newly emerged miRNA genes.

  20. Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage

    PubMed Central

    Brok-Volchanskaya, Vera S.; Kadyrov, Farid A.; Sivogrivov, Dmitry E.; Kolosov, Peter M.; Sokolov, Andrey S.; Shlyapnikov, Michael G.; Kryukov, Valentine M.; Granovsky, Igor E.

    2008-01-01

    Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. The bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that the SegB protein is a site-specific endonuclease, which produces mostly 3′ 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains an 11-bp conserved sequence, which corresponds to a conserved motif of the tRNA TψC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack the segB gene and several tRNA genes. In co-infections of phages T4 and T2L, the segB gene is inherited with nearly 100% efficiency. The preferred inheritance depends absolutely on the integrity of the segB gene and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages. PMID:18281701

  1. Host Cell Contact-Induced Transcription of the Type IV Fimbria Gene Cluster of Actinobacillus pleuropneumoniae

    PubMed Central

    Boekema, Bouke K. H. L.; Van Putten, Jos P. M.; Stockhofe-Zurwieden, Norbert; Smith, Hilde E.

    2004-01-01

    Type IV pili (Tfp) of gram-negative species share many characteristics, including a common architecture and conserved biogenesis pathway. Much less is known about the regulation of Tfp expression in response to changing environmental conditions. We investigated the diversity of Tfp regulatory systems by searching for the molecular basis of the reported variable expression of the Tfp gene cluster of the pathogen Actinobacillus pleuropneumoniae. Despite the presence of an intact Tfp gene cluster consisting of four genes, apfABCD, no Tfp were formed under standard growth conditions. Sequence analysis of the predicted major subunit protein ApfA showed an atypical alanine residue at position −1 from the prepilin peptidase cleavage site in 42 strains. This alanine deviates from the consensus glycine at this position in Tfp from other species. Yet, cloning of the apfABCD genes under a constitutive promoter in A. pleuropneumoniae resulted in pilin and Tfp assembly. Tfp promoter-luxAB reporter gene fusions demonstrated that the Tfp promoter was intact but tightly regulated. Promoter activity varied with bacterial growth phase and was detected only when bacteria were grown in chemically defined medium. Infection experiments with cultured epithelial cells demonstrated that Tfp promoter activity was upregulated upon adherence of the pathogen to primary cultures of lung epithelial cells. Nonadherent bacteria in the culture supernatant exhibited virtually no promoter activity. A similar upregulation of Tfp promoter activity was observed in vivo during experimental infection of pigs. The host cell contact-induced and in vivo-upregulated Tfp promoter activity in A. pleuropneumoniae adds a new dimension to the diversity of Tfp regulation. PMID:14742510

  2. Capturing neutral and adaptive genetic diversity for conservation in a highly structured tree species.

    PubMed

    Rodríguez-Quilón, Isabel; Santos-Del-Blanco, Luis; Serra-Varela, María Jesús; Koskela, Jarkko; González-Martínez, Santiago C; Alía, Ricardo

    2016-10-01

    Preserving intraspecific genetic diversity is essential for long-term forest sustainability in a climate change scenario. Despite that, genetic information is largely neglected in conservation planning, and how conservation units should be defined is still heatedly debated. Here, we use maritime pine (Pinus pinaster Ait.), an outcrossing long-lived tree with a highly fragmented distribution in the Mediterranean biodiversity hotspot, to prove the importance of accounting for genetic variation, of both neutral molecular markers and quantitative traits, to define useful conservation units. Six gene pools associated to distinct evolutionary histories were identified within the species using 12 microsatellites and 266 single nucleotide polymorphisms (SNPs). In addition, height and survival standing variation, their genetic control, and plasticity were assessed in a multisite clonal common garden experiment (16 544 trees). We found high levels of quantitative genetic differentiation within previously defined neutral gene pools. Subsequent cluster analysis and post hoc trait distribution comparisons allowed us to define 10 genetically homogeneous population groups with high evolutionary potential. They constitute the minimum number of units to be represented in a maritime pine dynamic conservation program. Our results uphold that the identification of conservation units below the species level should account for key neutral and adaptive components of genetic diversity, especially in species with strong population structure and complex evolutionary histories. The environmental zonation approach currently used by the pan-European genetic conservation strategy for forest trees would be largely improved by gradually integrating molecular and quantitative trait information, as data become available. © 2016 by the Ecological Society of America.

  3. Molecular Typing and Virulence Gene Profiles of Enterotoxin Gene Cluster (egc)-Positive Staphylococcus aureus Isolates Obtained from Various Food and Clinical Specimens.

    PubMed

    Song, Minghui; Shi, Chunlei; Xu, Xuebing; Shi, Xianming

    2016-11-01

    The enterotoxin gene cluster (egc) has been proposed to contribute to Staphylococcus aureus colonization, which highlights the need to evaluate genetic diversity and virulence gene profiles of the egc-positive population. Here, a total of 43 egc-positive isolates (16.2%) were identified from 266 S. aureus isolates that were obtained from various food and clinical specimens in Shanghai. Seven different egc profiles were found based on the polymerase chain reaction (PCR) result for egc genes. Then, these 43 egc-positive isolates were further typed by multilocus sequence typing, pulsed-field gel electrophoresis (PFGE), multiple-locus variable-number tandem-repeat analysis (MLVA), and accessory gene regulatory (agr) typing. The 43 egc-positive isolates displayed 17 sequence types, 28 PFGE patterns, 29 MLVA types, and 4 agr types, respectively. Among them, the dominant clonal lineage was CC5-agr II (48.84%). Thirty toxin and 20 adhesion-associated genes were detected by PCR in egc-positive isolates. Notably, invasive toxin genes showed a high prevalence, such as 76.7% for Panton-Valentine leukocidin encoding genes, 27.9% for sec, and 23.3% for tsst-1. Most of the examined adhesion-associated genes were found to be conserved (76.7-100%), whereas the fnbB gene was only found in 8 (18.6%) isolates. In addition, 33 toxin gene profiles and 13 adhesion gene profiles were identified. Our results imply that isolates belonging to the same clonal lineage harbored similar adhesion gene profiles but diverse toxin gene profiles. Overall, the high prevalence of invasive virulence genes increases the potential risk of egc-positive isolates in S. aureus infection.

  4. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.

  5. Hypoxia-activated genes from early placenta are elevated in Preeclampsia, but not in Intra-Uterine Growth Retardation

    PubMed Central

    Vaiman, Daniel; Mondon, Françoise; Garcès-Duran, Alexandra; Mignot, Thérèse-Marie; Robert, Brigitte; Rebourcet, Régis; Jammes, Hélène; Chelbi, Sonia T; Quetin, Frédérique; Marceau, Geoffrey; Sapin, Vincent; Piumi, François; Danan, Jean-Louis; Rigourd, Virginie; Carbonne, Bruno; Ferré, Françoise

    2005-01-01

    Background: As a first step to explore the possible relationships existing between the effects of low oxygen pressure in the first trimester placenta and placental pathologies developing from mid-gestation, two subtracted libraries totaling 2304 cDNA clones were constructed. To this end, two reciprocal suppressive/subtractive hybridization procedures (SSH) were applied to early (11 weeks) human placental villi after incubation either in normoxic or in hypoxic conditions. The clones from both libraries (1440 hypoxia-specific and 864 normoxia-specific) were spotted on nylon macroarrays. Complex cDNA probes prepared from placental villi (either from early pregnancy, after hypoxic or normoxic culture conditions, or near term for controls or pathological placentas) were hybridized to the membranes. Results: Three hundred and fifty-nine clones presenting a hybridization signal above the background were sequenced and shown to correspond to 276 different genes. Nine of these genes are mitochondrial, while 267 are nuclear. Specific expression profiles characteristic of preeclampsia (PE) could be identified, as well as profiles specific to intra-uterine growth retardation (IUGR). Focusing on the chromosomal distribution of the fraction of genes that responded in at least one hybridization experiment, we could observe a highly significant chromosomal clustering of 54 genes into 8 chromosomal regions, four of which contain imprinted genes. Comparative mapping data indicate that these imprinted clusters are maintained in synteny in mice, and apparently in cattle and pigs, suggesting that the maintenance of such syntenies is required for achieving a normal placental physiology in eutherian mammals. Conclusion: We could demonstrate that genes induced in PE were also genes highly expressed under hypoxic conditions (P = 5 × 10⁻⁵), which was not the case for isolated IUGR. Highly expressed placental genes may be in syntenies conserved interspecifically, suggesting that the maintenance of such clusters is required for achieving a normal placental physiology in eutherian mammals. PMID:16129025

  6. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation.

    PubMed

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe

    2018-05-01

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. 
Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.

  7. The cytochrome P450 2AA gene cluster in zebrafish (Danio rerio): Expression of CYP2AA1 and CYP2AA2 and response to phenobarbital-type inducers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kubota, Akira; Bainy, Afonso C.D.; Departamento de Bioquímica, CCB, Universidade Federal de Santa Catarina, Florianopolis, SC 88040-900

    2013-10-01

    The cytochrome P450 (CYP) 2 gene family is the largest and most diverse CYP gene family in vertebrates. In zebrafish, we have identified 10 genes in a new subfamily, CYP2AA, which does not show orthology to any human or other mammalian CYP genes. Here we report evolutionary and structural relationships of the 10 CYP2AA genes and expression of the first two genes, CYP2AA1 and CYP2AA2. Parsimony reconstruction of the tandem duplication pattern for the CYP2AA cluster suggests that CYP2AA1, CYP2AA2 and CYP2AA3 likely arose in the earlier duplication events and thus are most diverged in function from the other CYP2AAs. On the other hand, CYP2AA8 and CYP2AA9 are genes that arose in the latest duplication event, implying functional similarity between these two CYPs. A molecular model of CYP2AA1 showing the sequence conservation across the CYP2AA cluster reveals that the regions with the highest variability within the cluster map onto CYP2AA1 near the substrate access channels, suggesting differing substrate specificities. Zebrafish CYP2AA1 transcript was expressed predominantly in the intestine, while CYP2AA2 was most highly expressed in the kidney, suggesting differing roles in physiology. In the liver, CYP2AA2 expression, but not that of CYP2AA1, was increased by 1,4-bis [2-(3,5-dichloropyridyloxy)] benzene (TCPOBOP) and, to a lesser extent, by phenobarbital (PB). In contrast, pregnenolone 16α-carbonitrile (PCN) increased CYP2AA1 expression, but not CYP2AA2 expression, in the liver. The results identify a CYP2 subfamily in zebrafish that includes genes apparently induced by PB-type chemicals and PXR agonists, the first concrete in vivo evidence for a PB-type response in fish. Highlights: • A tandemly duplicated cluster of ten CYP2AA genes was described in zebrafish. • Parsimony and duplication analyses suggest pathways to CYP2AA diversity. • Homology models reveal amino acid positions possibly related to functional diversity. • The CYP2AA locus does not share synteny with any CYP2 subfamily in mammals. • Induction of CYP2AA1 and CYP2AA2 indicates a phenobarbital-type response in fish.

  8. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.

    PubMed

    Makarova, Kira S; Sorokin, Alexander V; Novichkov, Pavel S; Wolf, Yuri I; Koonin, Eugene V

    2007-11-27

    An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genomes) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. 
It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems. The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.

  9. Genetic Biodiversity of Italian Olives (Olea europaea) Germplasm Analyzed by SSR Markers

    PubMed Central

    Vendramin, Giuseppe Giovanni; Chiappetta, Adriana

    2014-01-01

    The olive is an important fruit species cultivated for oil and table olives in Italy and the Mediterranean basin. The conservation of cultivated plants in ex situ collections is essential for the optimal management and use of their genetic resources. The largest ex situ olive germplasm collection, consisting of approximately 500 Italian olive varieties and corresponding to 85% of the total Italian olive germplasm, is maintained at the Consiglio per la Ricerca e sperimentazione per l'Agricoltura, Centro di Ricerca per l'Olivicoltura e l'Industria Olearia (CRA-OLI), in Italy. In this work, eleven preselected nuclear microsatellite markers were used to assess genetic diversity, population structure, and gene flows with the aim of assembling a core collection. The dendrogram obtained utilizing the unweighted pair group method highlights the presence of homonymy and synonymy in the olive tree datasets analyzed in this study. With this combination of 11 nSSR loci, 439 unique genotype profiles were obtained, representing 89.8% of the varieties analyzed. The remaining 10.2% comprises different variety pairs in which both accessions are genetically indistinguishable. Clustering analysis performed using BAPS software detected seven groups in Italian olive germplasm, and gene flows were determined among the identified clusters. We proposed an Italian core collection of 23 olive varieties capturing all detected alleles at microsatellites. The information collected in this study regarding the CRA-OLI ex situ collection can be used for breeding programs, for germplasm conservation, and for optimizing a strategy for the management of olive gene pools. PMID:24723801

  10. Analysis of infant isolates of Bifidobacterium breve by comparative genome hybridization indicates the existence of new subspecies with marked infant specificity.

    PubMed

    Boesten, Rolf; Schuren, Frank; Wind, Richèle D; Knol, Jan; de Vos, Willem M

    2011-09-01

    A total of 20 Bifidobacterium strains were isolated from fecal samples of 4 breast- and bottle-fed infants and all were characterized as Bifidobacterium breve based on 16S rRNA gene sequence and metabolic analysis. These isolates were further characterized and compared to the type strains of B. breve and 7 other Bifidobacterium spp. by comparative genome hybridization. For this purpose, we constructed and used a DNA-based microarray containing over 2000 randomly cloned DNA fragments from B. breve type strain LMG13208. This molecular analysis revealed a high degree of genomic variation between the isolated strains and allowed the vast majority to be grouped into 4 clusters. One cluster contained a single isolate that was virtually indistinguishable from the B. breve type strain. The 3 other clusters included 19 B. breve strains that differed considerably from all type strains. Remarkably, each of the 4 clusters included strains that were isolated from a single infant, indicating that a niche adaptation may contribute to variation within the B. breve species. Based on genomic hybridization data, the new B. breve isolates were estimated to contain approximately 60-90% of the genes of the B. breve type strain, attesting to the existence of various subspecies within the species B. breve. Further bioinformatic analysis identified several hundred diagnostic clones specific to the genomic clustering of the B. breve isolates. Molecular analysis of representatives of these revealed that annotated genes from the conserved B. breve core encoded mainly housekeeping functions, while the strain-specific genes were predicted to code for functions related to life style, such as carbohydrate metabolism and transport. This is compatible with genetic adaptation of the strains to their niche, a combination of infants and diet. Copyright © 2011 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  11. Clustering of Pan- and Core-genome of Lactobacillus provides Novel Evolutionary Insights for Differentiation.

    PubMed

    Inglin, Raffael C; Meile, Leo; Stevens, Marc J A

    2018-04-24

    Bacterial taxonomy aims to classify bacteria based on true evolutionary events and relies on a polyphasic approach that includes phenotypic, genotypic and chemotaxonomic analyses. Until now, complete genomes have been largely ignored in taxonomy. The genus Lactobacillus consists of 173 species and many genomes are available to study taxonomy and evolutionary events. We analyzed and clustered 98 completely sequenced genomes of the genus Lactobacillus and 234 draft genomes of 5 different Lactobacillus species, i.e. L. reuteri, L. delbrueckii, L. plantarum, L. rhamnosus and L. helveticus. The core-genome of the genus Lactobacillus contains 266 genes and the pan-genome 20,800 genes. Clustering of the Lactobacillus pan- and core-genome resulted in two highly similar trees. This shows that evolutionary history is traceable in the core-genome and that clustering of the core-genome is sufficient to explore relationships. Clustering of core- and pan-genomes at species' level resulted in similar trees as well. Detailed analyses of the core-genomes showed that the functional class "genetic information processing" is conserved in the core-genome but that "signaling and cellular processes" is not. The latter class encodes functions that are involved in environmental interactions. Evolution of lactobacilli seems therefore directed by the environment. The type species L. delbrueckii was analyzed in detail and its pan-genome based tree contained two major clades whose members contained different genes yet identical functions. In addition, evidence for horizontal gene transfer between strains of L. delbrueckii, L. plantarum, and L. rhamnosus, and between species of the genus Lactobacillus is presented. Our data provide evidence for evolution of some lactobacilli according to a parapatric-like model for species differentiation. Core-genome trees are useful to detect evolutionary relationships in lactobacilli and might be useful in taxonomic analyses. Lactobacillus' evolution is directed by the environment and HGT.
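
    The core- and pan-genome counts quoted in this abstract can be read as a set intersection and a set union over per-genome gene-family sets. The sketch below illustrates only that definition; the function name, strain names and gene identifiers are hypothetical toy values, and the authors' actual clustering pipeline is not described in the abstract.

```python
def core_and_pan_genome(genomes):
    """Return (core, pan) for a dict mapping genome name -> iterable of gene families.

    core = families present in every genome (intersection)
    pan  = families present in at least one genome (union)
    """
    gene_sets = [set(families) for families in genomes.values()]
    if not gene_sets:
        return set(), set()
    core = set.intersection(*gene_sets)
    pan = set.union(*gene_sets)
    return core, pan


# Hypothetical toy input; identifiers are illustrative, not taken from the study.
toy_genomes = {
    "strain_A": {"rpoB", "gyrA", "lacZ"},
    "strain_B": {"rpoB", "gyrA", "bglA"},
    "strain_C": {"rpoB", "gyrA", "lacZ", "pts1"},
}
core, pan = core_and_pan_genome(toy_genomes)
print(len(core), len(pan))
```

    In practice the gene families would first have to be defined by an orthology clustering step, which this sketch takes as given.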

  12. In vivo functional analysis of L-rhamnose metabolic pathway in Aspergillus niger: a tool to identify the potential inducer of RhaR.

    PubMed

    Khosravi, Claire; Kun, Roland Sándor; Visser, Jaap; Aguilar-Pontes, María Victoria; de Vries, Ronald P; Battaglia, Evy

    2017-11-06

    The genes of the non-phosphorylative L-rhamnose catabolic pathway have been identified for several yeast species. In Scheffersomyces stipitis, all L-rhamnose pathway genes are organized in a cluster, which is conserved in Aspergillus niger, except for the lra-4 ortholog (lraD). The A. niger cluster also contains the gene encoding the L-rhamnose responsive transcription factor (RhaR) that has been shown to control the expression of genes involved in L-rhamnose release and catabolism. In this paper, we confirmed the function of the first three putative L-rhamnose utilisation genes from A. niger through gene deletion. We explored the identity of the inducer of the pathway regulator (RhaR) through expression analysis of the deletion mutants grown in transfer experiments to L-rhamnose and L-rhamnonate. Reduced expression of L-rhamnose-induced genes on L-rhamnose in lraA and lraB deletion strains, but not on L-rhamnonate (the product of LraB), demonstrates that the inducer of the pathway is L-rhamnonate or a compound downstream of it. Reduced expression of these genes in the lraC deletion strain on L-rhamnonate shows that it is in fact a downstream product of L-rhamnonate. This work showed that the inducer of RhaR is beyond L-rhamnonate dehydratase (LraC) and is likely to be 2-keto-3-L-deoxyrhamnonate.

  13. Expressed Sequence Tag Analysis of the Human Pathogen Paracoccidioides brasiliensis Yeast Phase: Identification of Putative Homologues of Candida albicans Virulence and Pathogenicity Genes

    PubMed Central

    Goldman, Gustavo H.; dos Reis Marques, Everaldo; Custódio Duarte Ribeiro, Diógenes; Ângelo de Souza Bernardes, Luciano; Quiapin, Andréa Carla; Vitorelli, Patrícia Marostica; Savoldi, Marcela; Semighini, Camile P.; de Oliveira, Regina C.; Nunes, Luiz R.; Travassos, Luiz R.; Puccia, Rosana; Batista, Wagner L.; Ferreira, Leslie Ecker; Moreira, Júlio C.; Bogossian, Ana Paula; Tekaia, Fredj; Nobrega, Marina Pasetto; Nobrega, Francisco G.; Goldman, Maria Helena S.

    2003-01-01

    Paracoccidioides brasiliensis, a thermodimorphic fungus, is the causative agent of the prevalent systemic mycosis in Latin America, paracoccidioidomycosis. We present here a survey of expressed genes in the yeast pathogenic phase of P. brasiliensis. We obtained 13,490 expressed sequence tags from both 5′ and 3′ ends. Clustering analysis yielded the partial sequences of 4,692 expressed genes that were functionally classified by similarity to known genes. We have identified several Candida albicans virulence and pathogenicity homologues in P. brasiliensis. Furthermore, we have analyzed the expression of some of these genes during the dimorphic yeast-mycelium-yeast transition by real-time quantitative reverse transcription-PCR. Clustering analysis of the mycelium-yeast transition revealed three groups: (i) RBT, hydrophobin, and isocitrate lyase; (ii) malate dehydrogenase, contigs Pb1067 and Pb1145, GPI, and alternative oxidase; and (iii) ubiquitin, delta-9-desaturase, HSP70, HSP82, and HSP104. The first two groups displayed high mRNA expression in the mycelial phase, whereas the third group showed higher mRNA expression in the yeast phase. Our results suggest the possible conservation of pathogenicity and virulence mechanisms among fungi, expand considerably gene identification in P. brasiliensis, and provide a broader basis for further progress in understanding its biological peculiarities. PMID:12582121

  14. Draft Genome Sequence of Eggplant (Solanum melongena L.): the Representative Solanum Species Indigenous to the Old World

    PubMed Central

    Hirakawa, Hideki; Shirasawa, Kenta; Miyatake, Koji; Nunome, Tsukasa; Negoro, Satomi; Ohyama, Akio; Yamaguchi, Hirotaka; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi; Fukuoka, Hiroyuki

    2014-01-01

    Unlike other important Solanaceae crops such as tomato, potato, chili pepper, and tobacco, all of which originated in South America and are cultivated worldwide, eggplant (Solanum melongena L.) is indigenous to the Old World and in this respect it is phylogenetically unique. To broaden our knowledge of the genomic nature of solanaceous plants further, we dissected the eggplant genome and built a draft genome dataset with 33,873 scaffolds termed SME_r2.5.1 that covers 833.1 Mb, ca. 74% of the eggplant genome. Approximately 90% of the gene space was estimated to be covered by SME_r2.5.1 and 85,446 genes were predicted in the genome. Clustering analysis of the predicted genes of eggplant along with the genes of three other solanaceous plants as well as Arabidopsis thaliana revealed that, of the 35,000 clusters generated, 4,018 were exclusively composed of eggplant genes that would perhaps confer eggplant-specific traits. Between eggplant and tomato, 16,573 pairs of genes were deduced to be orthologous, and 9,489 eggplant scaffolds could be mapped onto the tomato genome. Furthermore, 56 conserved synteny blocks were identified between the two species. The detailed comparative analysis of the eggplant and tomato genomes will facilitate our understanding of the genomic architecture of solanaceous plants, which will contribute to cultivation and further utilization of these crops. PMID:25233906

  15. Microdissection and molecular manipulation of single chromosomes in woody fruit trees with small chromosomes using pomelo (Citrus grandis) as a model. II. Cloning of resistance gene analogs from single chromosomes.

    PubMed

    Huang, D; Wu, W; Lu, L

    2004-05-01

    Amplification of resistance gene analogs (RGAs) is both a useful method for acquiring DNA markers closely linked to disease resistance (R) genes and a potential approach for the rapid cloning of R genes in plants. However, the screening of target sequences from among the numerous amplified RGAs can be very laborious. The amplification of RGAs from specific chromosomes could greatly reduce the number of RGAs to be screened and, consequently, speed up the identification of target RGAs. We have developed two methods for amplifying RGAs from single chromosomes. Method 1 uses products of Sau3A linker adaptor-mediated PCR (LAM-PCR) from a single chromosome as the templates for RGA amplification, while Method 2 directly uses a single chromosomal DNA molecule as the template. Using a pair of degenerate primers designed on the basis of the conserved nucleotide-binding-site motifs in many R genes, RGAs were successfully amplified from single chromosomes of pomelo using both these methods. Sequencing and cluster analysis of RGA clones obtained from single chromosomes revealed the number, type and organization of R-gene clusters on the chromosomes. We suggest that Method 1 is suitable for analyzing chromosomes that are unidentifiable under a microscope, while Method 2 is more appropriate when chromosomes can be clearly identified.

  16. Population connectivity and larval dispersal of the exploited mangrove crab Ucides cordatus along the Brazilian coast.

    PubMed

    Britto, Fábio B; Schmidt, Anders J; Carvalho, Adriana M F; Vasconcelos, Carolina C M P; Farias, Antonia M; Bentzen, Paul; Diniz, Fábio M

    2018-01-01

    The mangrove crab Ucides cordatus is considered a key species for the ecological balance of mangrove forests and a major source of employment and income for traditional crab collectors in Brazil. Several studies evidenced weak genetic variation among populations due to an efficient larval transport. However, gene flow patterns of the species are poorly understood, with no information about migration rates. The influence of the two main Brazilian currents on larval dispersion is also not clear. In order to provide baseline information for conservation, planning and management of this important fishery resource, the present study aimed to estimate and evaluate spatial distribution of genetic diversity, migration rates and gene flow directivity among populations of U. cordatus in Brazil. Nine microsatellites were used to resolve population structure of 319 crabs collected from six sites located along the Brazilian coast. The degree of geographical differentiation included estimates of genetic diversity, population structure and gene flow models, with spatial analysis of shared alleles (SAShA), isolation by distance tests, AMOVA, discriminant analysis of principal components (DAPC) and Bayesian clustering. We estimated the amount of ongoing gene flow between clusters using the coalescent-based method implemented in Migrate-N. Loci were highly polymorphic (average of 12.4 alleles per locus) evidencing high genetic variability. There was significant differentiation among localities, despite the low value of FST (= 0.019; P < 0.001). FST and Jost's D indexes were also estimated in pairwise comparisons and showed significant differences between most of the surveyed site pairs (P < 0.05). Structure evidenced a single genetic group among samples; however, SAShA pointed to a non-panmictic condition (P = 0.011). AMOVA detected four statistically significant clusters with a low level of differentiation (FCT = 0.037; P = 0.023). The gene flow model that best described the population connectivity was the island model, with ∼24 crabs being exchanged among localities per generation. The high migration rates found among localities seem to be the main force acting to sustain the distribution of the genetic diversity of U. cordatus. Despite the high gene flow and the weak population structure among samples, the significant genetic differences found suggest that gene flow alone does not bypass the effects of genetic drift, natural selection and/or human exploitation. These findings are vital for the establishment of a database to be used in the development of conservation programs.
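
    For readers unfamiliar with the FST statistic reported above, the following is a minimal sketch of the textbook heterozygosity-based definition, FST = (HT - HS) / HT, for a single locus. It assumes equally weighted populations and known allele frequencies; the function names and the toy frequencies are hypothetical, and this is not the AMOVA or Migrate-N procedure used in the study.

```python
def expected_heterozygosity(freqs):
    """Expected heterozygosity at one locus: 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def fst_single_locus(populations):
    """Heterozygosity-based FST for one locus.

    populations: list of allele-frequency lists, one per population,
    all listing the same alleles in the same order (an assumption of this sketch).
    """
    n = len(populations)
    # HS: mean within-population expected heterozygosity
    hs = sum(expected_heterozygosity(p) for p in populations) / n
    # HT: expected heterozygosity of the pooled (mean) allele frequencies
    mean_freqs = [sum(column) / n for column in zip(*populations)]
    ht = expected_heterozygosity(mean_freqs)
    return 0.0 if ht == 0 else (ht - hs) / ht


# Hypothetical two-allele example with mildly diverged populations.
print(fst_single_locus([[0.6, 0.4], [0.4, 0.6]]))
```

    Values near zero, such as the 0.019 reported in the abstract, indicate that most of the genetic variation lies within rather than between localities.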

  17. Population connectivity and larval dispersal of the exploited mangrove crab Ucides cordatus along the Brazilian coast

    PubMed Central

    Schmidt, Anders J.; Carvalho, Adriana M.F.; Vasconcelos, Carolina C.M.P.; Farias, Antonia M.; Bentzen, Paul

    2018-01-01

    Background: The mangrove crab Ucides cordatus is considered a key species for the ecological balance of mangrove forests and a major source of employment and income for traditional crab collectors in Brazil. Several studies evidenced weak genetic variation among populations due to an efficient larval transport. However, gene flow patterns of the species are poorly understood, with no information about migration rates. The influence of the two main Brazilian currents on larval dispersion is also not clear. In order to provide baseline information for conservation, planning and management of this important fishery resource, the present study aimed to estimate and evaluate spatial distribution of genetic diversity, migration rates and gene flow directivity among populations of U. cordatus in Brazil. Methods: Nine microsatellites were used to resolve population structure of 319 crabs collected from six sites located along the Brazilian coast. The degree of geographical differentiation included estimates of genetic diversity, population structure and gene flow models, with spatial analysis of shared alleles (SAShA), isolation by distance tests, AMOVA, discriminant analysis of principal components (DAPC) and Bayesian clustering. We estimated the amount of ongoing gene flow between clusters using the coalescent-based method implemented in Migrate-N. Results: Loci were highly polymorphic (average of 12.4 alleles per locus) evidencing high genetic variability. There was significant differentiation among localities, despite the low value of FST (= 0.019; P < 0.001). FST and Jost’s D indexes were also estimated in pairwise comparisons and showed significant differences between most of the surveyed site pairs (P < 0.05). Structure evidenced a single genetic group among samples; however, SAShA pointed to a non-panmictic condition (P = 0.011). AMOVA detected four statistically significant clusters with a low level of differentiation (FCT = 0.037; P = 0.023). The gene flow model that best described the population connectivity was the island model, with ∼24 crabs being exchanged among localities per generation. Discussion: The high migration rates found among localities seem to be the main force acting to sustain the distribution of the genetic diversity of U. cordatus. Despite the high gene flow and the weak population structure among samples, the significant genetic differences found suggest that gene flow alone does not bypass the effects of genetic drift, natural selection and/or human exploitation. These findings are vital for the establishment of a database to be used in the development of conservation programs. PMID:29736340

  18. Genome Comparison of Human and Non-Human Malaria Parasites Reveals Species Subset-Specific Genes Potentially Linked to Human Disease

    PubMed Central

    Frech, Christian; Chen, Nansheng

    2011-01-01

    Genes underlying important phenotypic differences between Plasmodium species, the causative agents of malaria, are frequently found in only a subset of species and cluster at dynamically evolving subtelomeric regions of chromosomes. We hypothesized that chromosome-internal regions of Plasmodium genomes harbour additional species subset-specific genes that underlie differences in human pathogenicity, human-to-human transmissibility, and human virulence. We combined sequence similarity searches with synteny block analyses to identify species subset-specific genes in chromosome-internal regions of six published Plasmodium genomes, including Plasmodium falciparum, Plasmodium vivax, Plasmodium knowlesi, Plasmodium yoelii, Plasmodium berghei, and Plasmodium chabaudi. To improve comparative analysis, we first revised incorrectly annotated gene models using homology-based gene finders and examined putative subset-specific genes within syntenic contexts. Confirmed subset-specific genes were then analyzed for their role in biological pathways and examined for molecular functions using publicly available databases. We identified 16 genes that are well conserved in the three primate parasites but not found in rodent parasites, including three key enzymes of the thiamine (vitamin B1) biosynthesis pathway. Thirteen genes were found to be present in both human parasites but absent in the monkey parasite P. knowlesi, including genes specifically upregulated in sporozoites or gametocytes that could be linked to parasite transmission success between humans. Furthermore, we propose 15 chromosome-internal P. falciparum-specific genes as new candidate genes underlying increased human virulence and detected a currently uncharacterized cluster of P. vivax-specific genes on chromosome 6 likely involved in erythrocyte invasion. 
In conclusion, Plasmodium species harbour many chromosome-internal differences in the form of protein-coding genes, some of which are potentially linked to human disease and thus promising leads for future laboratory research. PMID:22215999
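
    The notion of a "species subset-specific" gene used in this abstract amounts to a presence/absence filter: a gene family is kept when it is found in every species of a chosen ingroup and in none of a chosen outgroup. The sketch below shows only that filtering step, under the assumption that an ortholog presence table already exists; the function name and the gene-family labels are hypothetical, and the similarity searches, gene-model revision and synteny checks described by the authors are not reproduced.

```python
def subset_specific(presence, ingroup, outgroup):
    """Return gene families present in all ingroup species and absent from all outgroup species.

    presence: dict mapping gene-family id -> set of species with an ortholog.
    """
    ingroup, outgroup = set(ingroup), set(outgroup)
    return sorted(
        family
        for family, species in presence.items()
        if ingroup <= set(species) and not (outgroup & set(species))
    )


primate_parasites = {"P. falciparum", "P. vivax", "P. knowlesi"}
rodent_parasites = {"P. yoelii", "P. berghei", "P. chabaudi"}

# Hypothetical presence table; family ids are placeholders, not real annotations.
toy_presence = {
    "fam_001": primate_parasites | rodent_parasites,   # conserved everywhere
    "fam_002": set(primate_parasites),                  # primate-parasite specific
    "fam_003": {"P. falciparum", "P. vivax"},           # missing in P. knowlesi
    "fam_004": {"P. falciparum", "P. vivax", "P. knowlesi", "P. berghei"},
}
print(subset_specific(toy_presence, primate_parasites, rodent_parasites))
```

    Changing the ingroup to the two human parasites and the outgroup to P. knowlesi would give the analogous filter for the second comparison mentioned in the abstract.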

  19. A class of circadian long non-coding RNAs mark enhancers modulating long-range circadian gene regulation

    PubMed Central

    Fan, Zenghua; Zhao, Meng; Joshi, Parth D.; Li, Ping; Zhang, Yan; Guo, Weimin; Xu, Yichi; Wang, Haifang; Zhao, Zhihu

    2017-01-01

    Circadian rhythm exerts its influence on animal physiology and behavior by regulating gene expression at various levels. Here we systematically explored circadian long non-coding RNAs (lncRNAs) in mouse liver and examined their circadian regulation. We found that a significant proportion of circadian lncRNAs are expressed at enhancer regions, mostly bound by two key circadian transcription factors, BMAL1 and REV-ERBα. These circadian lncRNAs showed similar circadian phases with their nearby genes. The extent of their nuclear localization is higher than protein coding genes but less than enhancer RNAs. The association between enhancer and circadian lncRNAs is also observed in tissues other than liver. Comparative analysis between mouse and rat circadian liver transcriptomes showed that circadian transcription at lncRNA loci tends to be conserved despite low sequence conservation of lncRNAs. One such circadian lncRNA termed lnc-Crot led us to identify a super-enhancer region interacting with a cluster of genes involved in circadian regulation of metabolism through long-range interactions. Further experiments showed that the lnc-Crot locus has enhancer function independent of lnc-Crot's transcription. Our results suggest that the enhancer-associated circadian lncRNAs mark the genomic loci modulating long-range circadian gene regulation and shed new light on the evolutionary origin of lncRNAs. PMID:28335007

  20. The promise and peril of CRISPR gene drives: Genetic variation and inbreeding may impede the propagation of gene drives based on the CRISPR genome editing technology.

    PubMed

    Zentner, Gabriel E; Wade, Michael J

    2017-10-01

    Gene drives are selfish genetic elements that use a variety of mechanisms to ensure they are transmitted to subsequent generations at greater than expected frequencies. Synthetic gene drives based on the clustered regularly interspaced short palindromic repeats (CRISPR) genome editing system have been proposed as a way to alter the genetic characteristics of natural populations of organisms relevant to the goals of public health, conservation, and agriculture. Here, we review the principles and potential applications of CRISPR drives, as well as means proposed to prevent their uncontrolled spread. We also focus on recent work suggesting that factors such as natural genetic variation and inbreeding may represent substantial impediments to the propagation of CRISPR drives. © 2017 WILEY Periodicals, Inc.
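
    To make "transmitted at greater than expected frequencies" concrete, here is a minimal deterministic sketch of a homing-type drive. It assumes random mating, no fitness cost, no resistance alleles and no inbreeding, so it deliberately ignores the impediments the review discusses. Heterozygotes are assumed to pass on the drive allele with probability (1 + c) / 2, which gives the recursion p' = p + c * p * (1 - p); the function name and the parameter c ("conversion") are illustrative and not taken from the paper.

```python
def drive_frequency(p0, conversion, generations):
    """Trajectory of drive-allele frequency under an idealized homing model.

    p0:          starting frequency of the drive allele (0..1)
    conversion:  fraction c of heterozygote germlines converted to drive (0..1);
                 c = 0 is ordinary Mendelian inheritance
    generations: number of generations to simulate
    """
    freqs = [p0]
    p = p0
    for _ in range(generations):
        # p^2 (homozygotes) + 2p(1-p) * (1+c)/2 (heterozygotes) = p + c*p*(1-p)
        p = p + conversion * p * (1.0 - p)
        freqs.append(p)
    return freqs


mendelian = drive_frequency(0.1, 0.0, 10)   # frequency stays constant
driven = drive_frequency(0.1, 0.9, 10)      # frequency rises toward fixation
print(mendelian[-1], driven[-1])
```

    Adding resistance alleles or a selfing rate to this recursion is the kind of extension that would be needed to explore the effects of genetic variation and inbreeding highlighted in the abstract.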

  1. Genomic characterization and expression analysis of four apolipoprotein A-IV paralogs in Senegalese sole (Solea senegalensis Kaup).

    PubMed

    Roman-Padilla, J; Rodríguez-Rua, A; Claros, M G; Hachero-Cruzado, I; Manchado, M

    2016-01-01

    The apolipoprotein A-IV (ApoA-IV) plays a key role in lipid transport and feed intake regulation. In this work, four cDNA sequences encoding ApoA-IV paralogs were identified. Sequence analysis revealed conserved structural features including the common 33-codon block and nine repeated motifs. Gene structure analysis identified four exons and three introns except for apoA-IVAa1 (with only 3 exons). Synteny analysis showed that the four paralogs were structured into two clusters (cluster A containing apoA-IVAa1 and apoA-IVAa2 and cluster B with apoA-IVBa3 and apoA-IVBa4) linked to an apolipoprotein E. Phylogenetic analysis clearly separated the paralogs according to their cluster organization and revealed four subclades highly conserved in Acanthopterygii. Whole-mount analyses (WISH) in early larvae (0 and 1 day post-hatch (dph)) showed that the four paralogs were mainly expressed in the yolk syncytial layer surrounding the oil globules. Later, at 3 and 5 dph, the four paralogs were mainly expressed in liver and intestine although with differences in their relative abundance and temporal expression patterns. Diet supply triggered the intensity of WISH signals in the intestine of the four paralogs. Quantification of mRNA abundance by qPCR using whole larvae only detected the induction by diet at 5 dph. Moreover, transcript levels increased progressively with age except for apoA-IVAa2, which appeared as a low-expressed isoform. Expression analysis in juvenile tissues confirmed that the four paralogs were mainly expressed in liver and intestine and secondarily in other tissues. The role of these ApoA-IV genes in lipid transport and the possible role of apoA-IVAa2 as a regulatory form are discussed. Copyright © 2015 Elsevier Inc. All rights reserved.

  2. An archaeal genomic signature

    NASA Technical Reports Server (NTRS)

    Graham, D. E.; Overbeek, R.; Olsen, G. J.; Woese, C. R.

    2000-01-01

    Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein-encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal "design fabric." Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).

  3. Phosphorylation of the Nicotiana benthamiana WRKY8 Transcription Factor by MAPK Functions in the Defense Response

    PubMed Central

    Ishihama, Nobuaki; Yamada, Reiko; Yoshioka, Miki; Katou, Shinpei; Yoshioka, Hirofumi

    2011-01-01

    Mitogen-activated protein kinase (MAPK) cascades have pivotal roles in plant innate immunity. However, downstream signaling of plant defense-related MAPKs is not well understood. Here, we provide evidence that the Nicotiana benthamiana WRKY8 transcription factor is a physiological substrate of SIPK, NTF4, and WIPK. Clustered Pro-directed Ser residues (SP cluster), which are conserved in group I WRKY proteins, in the N-terminal region of WRKY8 were phosphorylated by these MAPKs in vitro. Antiphosphopeptide antibodies indicated that Ser residues in the SP cluster of WRKY8 are phosphorylated by SIPK, NTF4, and WIPK in vivo. The interaction of WRKY8 with MAPKs depended on its D domain, which is a MAPK-interacting motif, and this interaction was required for effective phosphorylation of WRKY8 in plants. Phosphorylation of WRKY8 increased its DNA binding activity to the cognate W-box sequence. The phospho-mimicking mutant of WRKY8 showed higher transactivation activity, and its ectopic expression induced defense-related genes, such as 3-hydroxy-3-methylglutaryl CoA reductase 2 and NADP-malic enzyme. By contrast, silencing of WRKY8 decreased the expression of defense-related genes and increased disease susceptibility to the pathogens Phytophthora infestans and Colletotrichum orbiculare. Thus, MAPK-mediated phosphorylation of WRKY8 has an important role in the defense response through activation of downstream genes. PMID:21386030

  4. Cloning, expression and biochemical characterization of one Epsilon-class (GST-3) and ten Delta-class (GST-1) glutathione S-transferases from Drosophila melanogaster, and identification of additional nine members of the Epsilon class.

    PubMed Central

    Sawicki, Rafał; Singh, Sharda P; Mondal, Ashis K; Benes, Helen; Zimniak, Piotr

    2003-01-01

    From the fruitfly, Drosophila melanogaster, ten members of the cluster of Delta-class glutathione S-transferases (GSTs; formerly denoted as Class I GSTs) and one member of the Epsilon-class cluster (formerly GST-3) have been cloned, expressed in Escherichia coli, and their catalytic properties have been determined. In addition, nine more members of the Epsilon cluster have been identified through bioinformatic analysis but not further characterized. Of the 11 expressed enzymes, seven accepted the lipid peroxidation product 4-hydroxynonenal as substrate, and nine were active in glutathione conjugation of 1-chloro-2,4-dinitrobenzene. Since the enzymically active proteins included the gene products of DmGSTD3 and DmGSTD7 which were previously deemed to be pseudogenes, we investigated them further and determined that both genes are transcribed in Drosophila. Thus our present results indicate that DmGSTD3 and DmGSTD7 are probably functional genes. The existence and multiplicity of insect GSTs capable of conjugating 4-hydroxynonenal, in some cases with catalytic efficiencies approaching those of mammalian GSTs highly specialized for this function, indicates that metabolism of products of lipid peroxidation is a highly conserved biochemical pathway with probable detoxification as well as regulatory functions. PMID:12443531

  5. Genetic Diversity and Population Structure of Mesoamerican Jaguars (Panthera onca): Implications for Conservation and Management

    PubMed Central

    Wultsch, Claudia; Caragiulo, Anthony; Dias-Freedman, Isabela; Quigley, Howard; Rabinowitz, Salisa; Amato, George

    2016-01-01

    Mesoamerican jaguars (Panthera onca) have been extirpated from over 77% of their historic range, inhabiting fragmented landscapes at potentially reduced population sizes. Maintaining and restoring genetic diversity and connectivity across human-altered landscapes has become a major conservation priority; nonetheless large-scale genetic monitoring of natural populations is rare. This is the first regional conservation genetic study of jaguars to primarily use fecal samples collected in the wild across five Mesoamerican countries: Belize, Costa Rica, Guatemala, Honduras, and Mexico. We genotyped 445 jaguar fecal samples and examined patterns of genetic diversity and connectivity among 115 individual jaguars using data from 12 microsatellite loci. Overall, moderate levels of genetic variation were detected (NA = 4.50 ± 1.05, AR = 3.43 ± 0.22, HE = 0.59 ± 0.04), with Mexico having the lowest genetic diversity, followed by Honduras, Guatemala, Belize, and Costa Rica. Population-based gene flow measures (FST = 0.09 to 0.15, Dest = 0.09 to 0.21), principal component analysis, and Bayesian clustering applied in a hierarchical framework revealed significant genetic structure in Mesoamerican jaguars, roughly grouping individuals into four genetic clusters with varying levels of admixture. Gene flow was highest among Selva Maya jaguars (northern Guatemala and central Belize), whereas genetic differentiation among all other sampling sites was moderate. Genetic subdivision was most pronounced between Selva Maya and Honduran jaguars, suggesting limited jaguar movement between these close geographic regions and ultimately refuting the hypothesis of contemporary panmixia. To maintain a critical linkage for jaguars dispersing through the Mesoamerican landscape and ensure long-term viability of this near threatened species, we recommend continued management and maintenance of jaguar corridors. 
The baseline genetic data provided by this study underscores the importance of understanding levels of genetic diversity and connectivity to making informed management and conservation decisions with the goal to maintain functional connectivity across the region. PMID:27783617

  6. Genetic Diversity and Population Structure of Mesoamerican Jaguars (Panthera onca): Implications for Conservation and Management.

    PubMed

    Wultsch, Claudia; Caragiulo, Anthony; Dias-Freedman, Isabela; Quigley, Howard; Rabinowitz, Salisa; Amato, George

    2016-01-01

    Mesoamerican jaguars (Panthera onca) have been extirpated from over 77% of their historic range, inhabiting fragmented landscapes at potentially reduced population sizes. Maintaining and restoring genetic diversity and connectivity across human-altered landscapes has become a major conservation priority; nonetheless large-scale genetic monitoring of natural populations is rare. This is the first regional conservation genetic study of jaguars to primarily use fecal samples collected in the wild across five Mesoamerican countries: Belize, Costa Rica, Guatemala, Honduras, and Mexico. We genotyped 445 jaguar fecal samples and examined patterns of genetic diversity and connectivity among 115 individual jaguars using data from 12 microsatellite loci. Overall, moderate levels of genetic variation were detected (NA = 4.50 ± 1.05, AR = 3.43 ± 0.22, HE = 0.59 ± 0.04), with Mexico having the lowest genetic diversity, followed by Honduras, Guatemala, Belize, and Costa Rica. Population-based gene flow measures (FST = 0.09 to 0.15, Dest = 0.09 to 0.21), principal component analysis, and Bayesian clustering applied in a hierarchical framework revealed significant genetic structure in Mesoamerican jaguars, roughly grouping individuals into four genetic clusters with varying levels of admixture. Gene flow was highest among Selva Maya jaguars (northern Guatemala and central Belize), whereas genetic differentiation among all other sampling sites was moderate. Genetic subdivision was most pronounced between Selva Maya and Honduran jaguars, suggesting limited jaguar movement between these close geographic regions and ultimately refuting the hypothesis of contemporary panmixia. To maintain a critical linkage for jaguars dispersing through the Mesoamerican landscape and ensure long-term viability of this near threatened species, we recommend continued management and maintenance of jaguar corridors. 
The baseline genetic data provided by this study underscores the importance of understanding levels of genetic diversity and connectivity to making informed management and conservation decisions with the goal to maintain functional connectivity across the region.

  7. Transcription of the extended hyp-operon in Nostoc sp. strain PCC 7120

    PubMed Central

    Agervald, Åsa; Stensjö, Karin; Holmqvist, Marie; Lindblad, Peter

    2008-01-01

    Background: The maturation of hydrogenases into active enzymes is a complex process; for example, a correctly assembled active site requires the involvement of at least seven proteins, encoded by hypABCDEF and a hydrogenase specific protease, encoded either by hupW or hoxW. The N2-fixing cyanobacterium Nostoc sp. strain PCC 7120 may contain both an uptake and a bidirectional hydrogenase. The present study addresses the presence and expression of hyp-genes in Nostoc sp. strain PCC 7120. Results: RT-PCRs demonstrated that the six hyp-genes together with one ORF may be transcribed as a single operon. Transcriptional start points (TSPs) were identified 280 bp upstream from hypF and 445 bp upstream of hypC, respectively, demonstrating the existence of several transcripts. In addition, five upstream ORFs located in between hupSL, encoding the small and large subunits of the uptake hydrogenase, and the hyp-operon, and two downstream ORFs from the hyp-genes were shown to be part of the same transcript unit. A third TSP was identified 45 bp upstream of asr0689, the first of five ORFs in this operon. The ORFs are annotated as encoding unknown proteins, with the exception of alr0692 which is identified as a NifU-like protein. Orthologues of the four ORFs asr0689-alr0692, with a highly conserved genomic arrangement positioned between hupSL and the hyp genes, are found in several other N2-fixing cyanobacteria, but are absent in non N2-fixing cyanobacteria with only the bidirectional hydrogenase. Short conserved sequences were found in six intergenic regions of the extended hyp-operon, appearing between 11 and 79 times in the genome. Conclusion: This study demonstrated that five ORFs upstream of the hyp-gene cluster are co-transcribed with the hyp-genes, and identified three TSPs in the extended hyp-gene cluster in Nostoc sp. strain PCC 7120. This may indicate a function related to the assembly of a functional uptake hydrogenase, hypothetically in the assembly of the small subunit of the enzyme. PMID:18442387

  8. Genetic diversity and genetic structure of an endemic Mexican Dusky Rattlesnake (Crotalus triseriatus) in a highly modified agricultural landscape: implications for conservation.

    PubMed

    Sunny, Armando; Monroy-Vilchis, Octavio; Zarco-González, Martha M; Mendoza-Martínez, Germán David; Martínez-Gómez, Daniel

    2015-12-01

    It is necessary to determine genetic diversity of fragmented populations in highly modified landscapes to understand how populations respond to land-use change. This information will help guide future conservation and management strategies. We conducted a population genetic study on an endemic Mexican Dusky Rattlesnake (Crotalus triseriatus) in a highly modified landscape near the Toluca metropolitan area, in order to provide crucial information for the conservation of this species. There were medium levels of genetic diversity, with a few alleles and genotypes. We identified three genetically differentiated clusters, likely as a result of different habitat cover type. We also found evidence of an ancestral genetic bottleneck and medium values of effective population size. Inbreeding coefficients were low and there was a moderate gene flow. Our results can be used as a basis for future research and C. triseriatus conservation efforts, particularly considering that the Trans-Mexican Volcanic Belt is heavily impacted by destructive land-use practices.

  9. Phylogenetic analysis of Helicobacter pylori cagA gene of Turkish isolates and the association with gastric pathology.

    PubMed

    Salih, Barik A; Bolek, Bora Kazim; Yildiz, Mehmet Taha; Arikan, Soykan

    2013-11-18

    The cagA gene is one of the important virulence factors of Helicobacter pylori. The diversity of the cagA 5' conserved region is thought to reflect the phylogenetic relationships between different H. pylori isolates and their association with peptic ulceration. Significant geographical differences among isolates have been reported. The aim of this study is to compare Turkish H. pylori isolates with isolates from different geographical locations and to correlate the association with peptic ulceration. A total of 52 isolates, of which 19 were Turkish and 33 were from other geographic locations, were studied. Gastric antral biopsies collected from 19 Turkish patients (gastritis = 12, ulcer = 7) were used to amplify the cagA 5' region by PCR followed by DNA sequencing. The phylogenetic tree displayed 3 groups: A) a mix of 2 sub-groups "Asian" and "African/Anatolian/Asian/European", B) "Anatolian/European" and C) "American-Indian". Turkish H. pylori isolates clustered in the mixed sub-group A were mostly from gastritis patients while those clustered in group B were from peptic ulcer patients. A phylogenetic tree constructed for our Turkish isolates detected distinctive features among those from gastritis and ulcer patients. We have found that 2/3 of the gastritis isolates were clustered alone while 1/3 were clustered together with the ulcer isolates. Several amino acids were found to be shared between the latter groups but not with the first group of gastritis isolates. This study provided an additional insight into the profile of our cagA gene which implies a relationship in geographic locations of the isolates.

  10. Natural selection of the major histocompatibility complex (Mhc) in Hawaiian honeycreepers (Drepanidinae)

    USGS Publications Warehouse

    Jarvi, S.I.; Tarr, C.L.; Mcintosh, C.E.; Atkinson, C.T.; Fleischer, R.C.

    2004-01-01

    The native Hawaiian honeycreepers represent a classic example of adaptive radiation and speciation, but currently face one of the highest extinction rates in the world. Although multiple factors have likely influenced the fate of Hawaiian birds, the relatively recent introduction of avian malaria is thought to be a major factor limiting honeycreeper distribution and abundance. We have initiated genetic analyses of class II β chain Mhc genes in four species of honeycreepers using methods that eliminate the possibility of sequencing mosaic variants formed by cloning heteroduplexed polymerase chain reaction products. Phylogenetic analyses group the honeycreeper Mhc sequences into two distinct clusters. Variation within one cluster is high, with dN > dS and levels of diversity similar to other studies of Mhc (B system) genes in birds. The second cluster is nearly invariant and includes sequences from honeycreepers (Fringillidae), a sparrow (Emberizidae) and a blackbird (Emberizidae). This highly conserved cluster appears reminiscent of the independently segregating Rfp-Y system of genes defined in chickens. The notion that balancing selection operates at the Mhc in the honeycreepers is supported by trans-species polymorphism and strikingly high dN/dS ratios at codons putatively involved in peptide interaction. Mitochondrial DNA control region sequences were invariant in the i'iwi, but were highly variable in the 'amakihi. By contrast, levels of variability of class II β chain Mhc sequence codons that are hypothesized to be directly involved in peptide interactions appear comparable between i'iwi and 'amakihi. In the i'iwi, natural selection may have maintained variation within the Mhc, even in the face of what appears to be a genetic bottleneck.

  11. Clustering cancer gene expression data by projective clustering ensemble

    PubMed Central

    Yu, Xianxue; Yu, Guoxian

    2017-01-01

    Gene expression data analysis has paramount implications for gene treatments, cancer diagnosis and other domains. Clustering is an important and promising tool to analyze gene expression data. Gene expression data is often characterized by a large number of genes but limited samples, thus various projective clustering techniques and ensemble techniques have been suggested to combat these challenges. However, it is rather challenging to combine these two kinds of techniques to avoid the curse of dimensionality problem and to boost the performance of gene expression data clustering. In this paper, we employ a projective clustering ensemble (PCE) to integrate the advantages of projective clustering and ensemble clustering, and to avoid the dilemma of combining multiple projective clusterings. Our experimental results on publicly available cancer gene expression data show PCE can improve the quality of clustering gene expression data by at least 4.5% (on average) compared with other related techniques, including dimensionality reduction based single clustering and ensemble approaches. The empirical study demonstrates that, to further boost the performance of clustering cancer gene expression data, it is necessary and promising to combine projective clustering with ensemble clustering. PCE can serve as an effective alternative technique for clustering gene expression data. PMID:28234920
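
    The ensemble idea in the record above can be illustrated with a much simpler stand-in. The sketch below is not PCE (which also handles the feature subspaces of projective clusterings); it is a plain co-association consensus, assuming each base clustering is given as a list of labels over the same samples. The function names and the 0.5 threshold are hypothetical choices made for illustration.

```python
def coassociation(labelings):
    """Fraction of base clusterings in which each pair of samples shares a cluster."""
    n = len(labelings[0])
    runs = len(labelings)
    matrix = [[0.0] * n for _ in range(n)]
    for labels in labelings:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    matrix[i][j] += 1.0
    return [[value / runs for value in row] for row in matrix]


def consensus_clusters(labelings, threshold=0.5):
    """Consensus labels: connected components of the graph that links two
    samples when they co-cluster in more than `threshold` of the runs."""
    co = coassociation(labelings)
    n = len(co)
    assigned = [-1] * n
    current = 0
    for seed in range(n):
        if assigned[seed] != -1:
            continue
        assigned[seed] = current
        stack = [seed]
        while stack:
            i = stack.pop()
            for j in range(n):
                if assigned[j] == -1 and co[i][j] > threshold:
                    assigned[j] = current
                    stack.append(j)
        current += 1
    return assigned


# Three base clusterings of five samples that disagree on samples 2 and 4.
runs = [[0, 0, 1, 1, 1], [1, 1, 0, 0, 1], [0, 0, 0, 1, 1]]
print(consensus_clusters(runs))
```

    Label values in the base clusterings need not match across runs, because only co-membership of sample pairs is counted.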

  12. A distinct and divergent lineage of genomic island-associated Type IV Secretion Systems in Legionella.

    PubMed

    Wee, Bryan A; Woolfit, Megan; Beatson, Scott A; Petty, Nicola K

    2013-01-01

    Legionella encodes multiple classes of Type IV Secretion Systems (T4SSs), including the Dot/Icm protein secretion system that is essential for intracellular multiplication in amoebal and human hosts. Other T4SSs not essential for virulence are thought to facilitate the acquisition of niche-specific adaptation genes including the numerous effector genes that are a hallmark of this genus. Previously, we identified two novel gene clusters in the draft genome of Legionella pneumophila strain 130b that encode homologues of a subtype of T4SS, the genomic island-associated T4SS (GI-T4SS), usually associated with integrative and conjugative elements (ICE). In this study, we performed genomic analyses of 14 homologous GI-T4SS clusters found in eight publicly available Legionella genomes and show that this cluster is unusually well conserved in a region of high plasticity. Phylogenetic analyses show that Legionella GI-T4SSs are substantially divergent from other members of this subtype of T4SS and represent a novel clade of GI-T4SSs only found in this genus. The GI-T4SS was found to be under purifying selection, suggesting it is functional and may play an important role in the evolution and adaptation of Legionella. Like other GI-T4SSs, the Legionella clusters are also associated with ICEs, but lack the typical integration and replication modules of related ICEs. The absence of complete replication and DNA pre-processing modules, together with the presence of Legionella-specific regulatory elements, suggest the Legionella GI-T4SS-associated ICE is unique and may employ novel mechanisms of regulation, maintenance and excision. The Legionella GI-T4SS cluster was found to be associated with several cargo genes, including numerous antibiotic resistance and virulence factors, which may confer a fitness benefit to the organism. 
The in-silico characterisation of this new T4SS furthers our understanding of the diversity of secretion systems involved in the frequent horizontal gene transfers that allow Legionella to adapt to and exploit diverse environmental niches.

  13. A Distinct and Divergent Lineage of Genomic Island-Associated Type IV Secretion Systems in Legionella

    PubMed Central

    Wee, Bryan A.; Woolfit, Megan; Beatson, Scott A.; Petty, Nicola K.

    2013-01-01

    Legionella encodes multiple classes of Type IV Secretion Systems (T4SSs), including the Dot/Icm protein secretion system that is essential for intracellular multiplication in amoebal and human hosts. Other T4SSs not essential for virulence are thought to facilitate the acquisition of niche-specific adaptation genes including the numerous effector genes that are a hallmark of this genus. Previously, we identified two novel gene clusters in the draft genome of Legionella pneumophila strain 130b that encode homologues of a subtype of T4SS, the genomic island-associated T4SS (GI-T4SS), usually associated with integrative and conjugative elements (ICE). In this study, we performed genomic analyses of 14 homologous GI-T4SS clusters found in eight publicly available Legionella genomes and show that this cluster is unusually well conserved in a region of high plasticity. Phylogenetic analyses show that Legionella GI-T4SSs are substantially divergent from other members of this subtype of T4SS and represent a novel clade of GI-T4SSs only found in this genus. The GI-T4SS was found to be under purifying selection, suggesting it is functional and may play an important role in the evolution and adaptation of Legionella. Like other GI-T4SSs, the Legionella clusters are also associated with ICEs, but lack the typical integration and replication modules of related ICEs. The absence of complete replication and DNA pre-processing modules, together with the presence of Legionella-specific regulatory elements, suggest the Legionella GI-T4SS-associated ICE is unique and may employ novel mechanisms of regulation, maintenance and excision. The Legionella GI-T4SS cluster was found to be associated with several cargo genes, including numerous antibiotic resistance and virulence factors, which may confer a fitness benefit to the organism. 
The in-silico characterisation of this new T4SS furthers our understanding of the diversity of secretion systems involved in the frequent horizontal gene transfers that allow Legionella to adapt to and exploit diverse environmental niches. PMID:24358157

  14. A Functionally Conserved Gene Regulatory Network Module Governing Olfactory Neuron Diversity.

    PubMed

    Li, Qingyun; Barish, Scott; Okuwa, Sumie; Maciejewski, Abigail; Brandt, Alicia T; Reinhold, Dominik; Jones, Corbin D; Volkan, Pelin Cayirlioglu

    2016-01-01

    Sensory neuron diversity is required for organisms to decipher complex environmental cues. In Drosophila, the olfactory environment is detected by 50 different olfactory receptor neuron (ORN) classes that are clustered in combinations within distinct sensilla subtypes. Each sensilla subtype houses stereotypically clustered 1-4 ORN identities that arise through asymmetric divisions from a single multipotent sensory organ precursor (SOP). How each class of SOPs acquires a unique differentiation potential that accounts for ORN diversity is unknown. Previously, we reported that a critical component of the SOP diversification program, Rotund (Rn), increases ORN diversity by generating novel developmental trajectories from existing precursors within each independent sensilla type lineage. Here, we show that Rn, along with BarH1/H2 (Bar), Bric-à-brac (Bab), Apterous (Ap) and Dachshund (Dac), constitutes a transcription factor (TF) network that patterns the developing olfactory tissue. This network was previously shown to pattern the segmentation of the leg, which suggests that this network is functionally conserved. In antennal imaginal discs, precursors with diverse ORN differentiation potentials are selected from concentric rings defined by unique combinations of these TFs along the proximodistal axis of the developing antennal disc. The combinatorial code that demarcates each precursor field is set up by cross-regulatory interactions among different factors within the network. Modifications of this network lead to predictable changes in the diversity of sensilla subtypes and ORN pools. In light of our data, we propose a molecular map that defines each unique SOP fate. Our results highlight the importance of the early prepatterning gene regulatory network as a modulator of SOP and terminally differentiated ORN diversity. Finally, our model illustrates how conserved developmental strategies are used to generate neuronal diversity.

  15. Function and Regulation of Ferredoxins in the Cyanobacterium, Synechocystis PCC6803: Recent Advances

    PubMed Central

    Cassier-Chauvat, Corinne; Chauvat, Franck

    2014-01-01

    Ferredoxins (Fed), occurring in most organisms, are small proteins that use their iron-sulfur cluster to distribute electrons to various metabolic pathways, likely including hydrogen production. Here, we summarize the current knowledge on ferredoxins in cyanobacteria, the prokaryotes regarded as important producers of the oxygenic atmosphere and biomass for the food chain, as well as promising cell factories for biofuel production. Most studies of ferredoxins were performed in the model strain, Synechocystis PCC6803, which possesses nine highly-conserved ferredoxins encoded by monocistronic or operonic genes, some of which are localized in conserved genome regions. Fed1, encoded by a light-inducible gene, is a highly abundant protein essential to photosynthesis. Fed2-Fed9, encoded by genes differently regulated by trophic conditions, are low-abundant proteins that play prominent roles in the tolerance to environmental stresses. Concerning the selectivity/redundancy of ferredoxin, we report that Fed1, Fed7 and Fed9 belong to ferredoxin-glutaredoxin-thioredoxin crosstalk pathways operating in the protection against oxidative and metal stresses. Furthermore, Fed7 specifically interacts with a DnaJ-like protein, an interaction that has been conserved in photosynthetic eukaryotes in the form of a composite protein comprising DnaJ- and Fed7-like domains. Fed9 specifically interacts with the Flv3 flavodiiron protein acting in the photoreduction of O2 to H2O. PMID:25387163

  16. Plant polycistronic precursors containing non-homologous microRNAs target transcripts encoding functionally related proteins

    PubMed Central

    2009-01-01

    Background: MicroRNAs (miRNAs) are endogenous single-stranded small RNAs that regulate the expression of specific mRNAs involved in diverse biological processes. In plants, miRNAs are generally encoded as a single species in independent transcriptional units, referred to as MIRNA genes, in contrast to animal miRNAs, which are frequently clustered. Results: We performed a comparative genomic analysis in three model plants (rice, poplar and Arabidopsis) and characterized miRNA clusters containing two to eight miRNA species. These clusters usually encode miRNAs of the same family and certain clusters share a common evolutionary origin across monocot and dicot lineages. In addition, we identified miRNA clusters harboring miRNAs with unrelated sequences that are usually not evolutionarily conserved. Strikingly, non-homologous miRNAs from the same cluster were predicted to target transcripts encoding related proteins. At least four Arabidopsis non-homologous clusters were expressed as single transcriptional units. Overexpression of one of these polycistronic precursors, producing Ath-miR859 and Ath-miR774, led to the DCL1-dependent accumulation of both miRNAs and down-regulation of their different mRNA targets encoding F-box proteins. Conclusions: In addition to polycistronic precursors carrying related miRNAs, plants also contain precursors allowing coordinated expression of non-homologous miRNAs to co-regulate functionally related target transcripts. This mechanism paves the way for using polycistronic MIRNA precursors as a new molecular tool for plant biologists to simultaneously control the expression of different genes. PMID:19951405

  17. Ergot Alkaloids and their Hallucinogenic Potential in Morning Glories.

    PubMed

    Steiner, Ulrike; Leistner, Eckhard

    2018-03-02

    Naturally occurring and semisynthetic ergot alkaloids play a role in health care or as recreational drugs in Western and indigenous Mexican societies. Evidence is summarized that ergot alkaloids present in Central American Convolvulaceae like Turbina corymbosa, Ipomoea violacea, and Ipomoea asarifolia are colonized by different species of a newly described clavicipitaceous fungal genus named Periglandula. The fungi are associated with peltate glandular trichomes on the adaxial leaf surface of its host plants. The Periglandula fungi are not yet culturable in vitro but were demonstrated to have the capacity to synthesize ergot alkaloids. The alkaloids do not remain in the fungal mycelium but are translocated via the glandular trichomes into their plant host. Both fungi and host benefit from a symbiotic lifestyle. In evolutionary terms the alkaloid biosynthetic gene cluster in the Periglandula/Ipomoea symbiosis is likely to have a conserved (basic) structure while biosynthetic ergot gene clusters within the genera Claviceps and Epichloe were under ecological selection for alkaloid diversification. Georg Thieme Verlag KG Stuttgart · New York.

  18. Pan-Genomic Analysis Provides Insights into the Genomic Variation and Evolution of Salmonella Paratyphi A

    PubMed Central

    Chen, Chunxia; Cui, Xiaoying; Yu, Jun; Xiao, Jingfa; Kan, Biao

    2012-01-01

    Salmonella Paratyphi A (S. Paratyphi A) is a highly adapted, human-specific pathogen that causes paratyphoid fever. Cases of paratyphoid fever have recently been increasing, and the disease is becoming a major public health concern, especially in Eastern and Southern Asia. To investigate the genomic variation and evolution of S. Paratyphi A, a pan-genomic analysis was performed on five newly sequenced S. Paratyphi A strains and two other reference strains. A whole genome comparison revealed that the seven genomes are collinear and that their organization is highly conserved. The high rate of substitutions in part of the core genome indicates that there are frequent homologous recombination events. Based on the changes in the pan-genome size and cluster number (both in the core functional genes and core pseudogenes), it can be inferred that the sharply increasing number of pseudogene clusters may have strong correlation with the inactivation of functional genes, and indicates that the S. Paratyphi A genome is being degraded. PMID:23028950

  19. A proposed model for the flowering signaling pathway of sugarcane under photoperiodic control.

    PubMed

    Coelho, C P; Costa Netto, A P; Colasanti, J; Chalfun-Júnior, A

    2013-04-25

    Molecular analysis of floral induction in Arabidopsis has identified several flowering time genes related to 4 response networks defined by the autonomous, gibberellin, photoperiod, and vernalization pathways. Although grass flowering processes include ancestral functions shared by both mono- and dicots, they have developed their own mechanisms to transmit floral induction signals. Despite its high production capacity and its important role in biofuel production, almost no information is available about the flowering process in sugarcane. We searched the Sugarcane Expressed Sequence Tags database to look for elements of the flowering signaling pathway under photoperiodic control. Sequences showing significant similarity to flowering time genes of other species were clustered, annotated, and analyzed for conserved domains. Multiple alignments comparing the sequences found in the sugarcane database and those from other species were performed and their phylogenetic relationship assessed using the MEGA 4.0 software. Electronic Northerns were run with Cluster and TreeView programs, allowing us to identify putative members of the photoperiod-controlled flowering pathway of sugarcane.

  20. Structural, evolutionary and genetic analysis of the histidine biosynthetic "core" in the genus Burkholderia.

    PubMed

    Papaleo, Maria Cristiana; Russo, Edda; Fondi, Marco; Emiliani, Giovanni; Frandi, Antonio; Brilli, Matteo; Pastorelli, Roberta; Fani, Renato

    2009-12-01

    In this work a detailed analysis of the structure, the expression and the organization of his genes belonging to the core of histidine biosynthesis (hisBHAF) in 40 newly determined and 13 available sequences of Burkholderia strains was carried out. Data obtained revealed a strong conservation of the structure and organization of these genes through the entire genus. The phylogenetic analysis showed the monophyletic origin of this gene cluster and indicated that it did not undergo horizontal gene transfer events. The analysis of the intergenic regions, based on the substitution rate, entropy plot and bendability suggested the existence of a putative transcription promoter upstream of hisB, that was supported by the genetic analysis that showed that this cluster was able to complement Escherichia coli hisA, hisB, and hisF mutations. Moreover, a preliminary transcriptional analysis and the analysis of microarray data revealed that the expression of the his core was constitutive. These findings are in agreement with the fact that the entire Burkholderia his operon is heterogeneous, in that it contains "alien" genes apparently not involved in histidine biosynthesis. Besides, they also support the idea that the proteobacterial his operon was piece-wisely assembled, i.e. through accretion of smaller units containing only some of the genes (eventually together with their own promoters) involved in this biosynthetic route. The correlation existing between the structure, organization and regulation of his "core" genes and the function(s) they perform in cellular metabolism is discussed.

  1. Different Type 1 Fimbrial Genes and Tropisms of Commensal and Potentially Pathogenic Actinomyces spp. with Different Salivary Acidic Proline-Rich Protein and Statherin Ligand Specificities

    PubMed Central

    Li, Tong; Khah, Massoud Kheir; Slavnic, Snjezana; Johansson, Ingegerd; Strömberg, Nicklas

    2001-01-01

    Actinomyces spp. exhibit type 1 fimbria-mediated adhesion to salivary acidic proline-rich proteins (PRPs) and statherin ligands. Actinomyces spp. with different animal and tissue origins belong to three major adhesion types as relates to ligand specificity and type 1 fimbria genes. (i) In preferential acidic-PRP binding, strains of Actinomyces naeslundii genospecies 1 and 2 from human and monkey mouths displayed at least three ligand specificities characterized by preferential acidic-PRP binding. Slot blot DNA hybridization showed seven highly conserved type 1 fimbria genes (orf1- to -6 and fimP) in genospecies 1 and 2 strains, except that orf5 and orf3 were divergent in genospecies 1. (ii) In preferential statherin binding, oral Actinomyces viscosus strains of rat and hamster origin (and strain 19246 from a human case of actinomycosis) bound statherin preferentially. DNA hybridization and characterization of the type 1 fimbria genes from strain 19246 revealed a homologous gene cluster of four open reading frames (orfA to -C and fimP). Bioinformatics suggested sortase (orfB, orf4, and part of orf5), prepilin peptidase (orfC and orf6), fimbria subunit (fimP), and usher- and autotransporter-like (orfA and orf1 to -3) functions. Those gene regions corresponding to orf3 and orf5 were divergent, those corresponding to orf2, orf1, and fimP were moderately conserved, and those corresponding to orf4 and orf6 were highly conserved. Restriction fragment length polymorphism analyses using a fimP probe separated human and monkey and rat and hamster strains into phylogenetically different groups. (iii) In statherin-specific binding, strains of A. naeslundii genospecies 1 from septic and other human infections displayed a low-avidity binding to statherin. Only the orf4 and orf6 gene regions were highly conserved. 
Finally, rat saliva devoid of statherin bound bacterial strains avidly irrespective of ligand specificity, and specific antisera detected either type 1, type 2, or both types of fimbria on the investigated Actinomyces strains. PMID:11705891

  2. Patterns of genetic differentiation at MHC class I genes and microsatellites identify conservation units in the giant panda.

    PubMed

    Zhu, Ying; Wan, Qiu-Hong; Yu, Bin; Ge, Yun-Fa; Fang, Sheng-Guo

    2013-10-22

    Evaluating patterns of genetic variation is important to identify conservation units (i.e., evolutionarily significant units [ESUs], management units [MUs], and adaptive units [AUs]) in endangered species. While neutral markers could be used to infer population history, their application in the estimation of adaptive variation is limited. The capacity to adapt to various environments is vital for the long-term survival of endangered species. Hence, analysis of adaptive loci, such as the major histocompatibility complex (MHC) genes, is critical for conservation genetics studies. Here, we investigated 4 classical MHC class I genes (Aime-C, Aime-F, Aime-I, and Aime-L) and 8 microsatellites to infer patterns of genetic variation in the giant panda (Ailuropoda melanoleuca) and to further define conservation units. Overall, we identified 24 haplotypes (9 for Aime-C, 1 for Aime-F, 7 for Aime-I, and 7 for Aime-L) from 218 individuals obtained from 6 populations of giant panda. We found that the Xiaoxiangling population had the highest genetic variation at microsatellites among the 6 giant panda populations and higher genetic variation at Aime-MHC class I genes than other larger populations (Qinling, Qionglai, and Minshan populations). Differentiation index (FST)-based phylogenetic and Bayesian clustering analyses for Aime-MHC-I and microsatellite loci both supported that most populations were highly differentiated. The Qinling population was the most genetically differentiated. The giant panda showed a relatively higher level of genetic diversity at MHC class I genes compared with endangered felids. Using all of the loci, we found that the 6 giant panda populations fell into 2 ESUs: Qinling and non-Qinling populations. We defined 3 MUs based on microsatellites: Qinling, Minshan-Qionglai, and Daxiangling-Xiaoxiangling-Liangshan. We also recommended 3 possible AUs based on MHC loci: Qinling, Minshan-Qionglai, and Daxiangling-Xiaoxiangling-Liangshan. 
Furthermore, we recommend that a captive breeding program be considered for the Qinling panda population.
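
    The record above relies on the differentiation index (FST) without stating how it was estimated. As a hedged illustration only, the sketch below uses the textbook heterozygosity form FST = (HT - HS) / HT with equally weighted populations; the estimator actually used in the study may differ, and the function names and allele frequencies are hypothetical.

```python
def heterozygosity(freqs):
    """Expected heterozygosity 1 - sum(p_i^2) for one set of allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def fst(pop_freqs):
    """FST = (HT - HS) / HT for one locus.

    pop_freqs: one list of allele frequencies per population, all in the
    same allele order. Populations are weighted equally (an assumption).
    """
    n_pops = len(pop_freqs)
    n_alleles = len(pop_freqs[0])
    # HS: mean within-population expected heterozygosity.
    h_s = sum(heterozygosity(f) for f in pop_freqs) / n_pops
    # HT: expected heterozygosity of the pooled (mean) allele frequencies.
    mean_freqs = [sum(f[a] for f in pop_freqs) / n_pops for a in range(n_alleles)]
    h_t = heterozygosity(mean_freqs)
    if h_t == 0.0:
        return 0.0  # a monomorphic locus carries no differentiation signal
    return (h_t - h_s) / h_t


# Hypothetical biallelic locus in two populations.
print(fst([[0.8, 0.2], [0.2, 0.8]]))
```

    Values near 0 indicate populations sharing similar allele frequencies, and values near 1 indicate populations fixed for different alleles.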

  3. Integrating microarray analysis and the soybean genome to understand the soybeans iron deficiency response

    PubMed Central

    2009-01-01

    Background: Soybeans grown in the upper Midwestern United States often suffer from iron deficiency chlorosis, which results in yield loss at the end of the season. To better understand the effect of iron availability on soybean yield, we identified genes in two near isogenic lines with changes in expression patterns when plants were grown in iron sufficient and iron deficient conditions. Results: Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) grown under Fe-sufficient and Fe-limited conditions were analyzed and compared using the Affymetrix® GeneChip® Soybean Genome Array. There were 835 candidate genes in the Clark (PI548553) genotype and 200 candidate genes in the IsoClark (PI547430) genotype putatively involved in soybean's iron stress response. Of these candidate genes, fifty-eight genes in the Clark genotype were identified with a genetic location within known iron efficiency QTL and 21 in the IsoClark genotype. The arrays also identified 170 single feature polymorphisms (SFPs) specific to either Clark or IsoClark. A sliding window analysis of the microarray data and the 7X genome assembly coupled with an iterative model of the data showed the candidate genes are clustered in the genome. An analysis of 5' untranslated regions in the promoter of candidate genes identified 11 conserved motifs in 248 differentially expressed genes, all from the Clark genotype, representing 129 clusters identified earlier, confirming the cluster analysis results. Conclusion: These analyses have identified the first genes with expression patterns that are affected by iron stress and are located within QTL specific to iron deficiency stress. The genetic location and promoter motif analysis results support the hypothesis that the differentially expressed genes are co-regulated. The combined results of all analyses lead us to postulate iron inefficiency in soybean is a result of a mutation in a transcription factor(s), which controls the expression of genes required in inducing an iron stress response. PMID:19678937
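
    The sliding window analysis mentioned in the record above can be sketched as follows. This is a minimal illustration, not the authors' pipeline (which also used an iterative model not described here): it counts candidate-gene start positions in overlapping windows along one chromosome and reports the dense windows. The window size, step, threshold and positions are hypothetical.

```python
from bisect import bisect_left


def window_counts(positions, window, step):
    """Count positions in each half-open window [start, start + window)."""
    pos = sorted(positions)
    if not pos:
        return []
    counts = []
    start = 0
    while start <= pos[-1]:
        lo = bisect_left(pos, start)           # first position >= start
        hi = bisect_left(pos, start + window)  # first position >= window end
        counts.append((start, hi - lo))
        start += step
    return counts


def dense_windows(positions, window, step, min_genes):
    """Windows holding at least `min_genes` candidate genes."""
    return [(s, n) for s, n in window_counts(positions, window, step) if n >= min_genes]


# Hypothetical gene start coordinates (bp) on one chromosome.
genes = [100, 150, 180, 5000, 9000, 9050]
print(dense_windows(genes, window=1000, step=500, min_genes=2))
```

    Whether a dense window reflects real clustering would still need a comparison against the background gene density, for example by permuting gene labels.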

  4. Cloned Erwinia chrysanthemi out genes enable Escherichia coli to selectively secrete a diverse family of heterologous proteins to its milieu.

    PubMed Central

    He, S Y; Lindeberg, M; Chatterjee, A K; Collmer, A

    1991-01-01

    The out genes of the enterobacterial plant pathogen Erwinia chrysanthemi are responsible for the efficient extracellular secretion of multiple plant cell wall-degrading enzymes, including four isozymes of pectate lyase, exo-poly-alpha-D-galacturonosidase, pectin methylesterase, and cellulase. Out- mutants of Er. chrysanthemi are unable to export any of these proteins beyond the periplasm and are severely reduced in virulence. We have cloned out genes from Er. chrysanthemi in the stable, low-copy-number cosmid pCPP19 by complementing several transposon-induced mutations. The cloned out genes were clustered in a 12-kilobase chromosomal DNA region, complemented all existing out mutations in Er. chrysanthemi EC16, and enabled Escherichia coli strains to efficiently secrete the extracellular pectic enzymes produced from cloned Er. chrysanthemi genes, while retaining the periplasmic marker protein beta-lactamase. DNA sequencing of a 2.4-kilobase EcoRI fragment within the out cluster revealed four genes arranged colinearly and sharing substantial similarity with the Klebsiella pneumoniae genes pulH, pulI, pulJ, and pulK, which are necessary for pullulanase secretion. However, K. pneumoniae cells harboring the cloned Er. chrysanthemi pelE gene were unable to secrete the Erwinia pectate lyase. Furthermore, the Er. chrysanthemi Out system was unable to secrete an extracellular pectate lyase encoded by a gene from a closely related plant pathogen, Erwinia carotovora ssp. carotovora. The results suggest that these enterobacteria secrete polysaccharidases by a conserved mechanism whose protein-recognition capacities have diverged. PMID:1992458

  5. Identification of Methyl Halide-Utilizing Genes in the Methyl Bromide-Utilizing Bacterial Strain IMB-1 Suggests a High Degree of Conservation of Methyl Halide-Specific Genes in Gram-Negative Bacteria

    USGS Publications Warehouse

    Woodall, C.A.; Warner, K.L.; Oremland, R.S.; Murrell, J.C.; McDonald, I.R.

    2001-01-01

    Strain IMB-1, an aerobic methylotrophic member of the alpha subgroup of the Proteobacteria, can grow with methyl bromide as a sole carbon and energy source. A single cmu gene cluster was identified in IMB-1 that contained six open reading frames: cmuC, cmuA, orf146, paaE, hutI, and partial metF. CmuA from IMB-1 has high sequence homology to the methyltransferase CmuA from Methylobacterium chloromethanicum and Hyphomicrobium chloromethanicum and contains a C-terminal corrinoid-binding motif and an N-terminal methyl-transferase motif. However, cmuB, identified in M. chloromethanicum and H. chloromethanicum, was not detected in IMB-1.

  6. Cloning and expression of an iron-containing superoxide dismutase in the parasitic protist, Trichomonas vaginalis.

    PubMed

    Viscogliosi, E; Delgado-Viscogliosi, P; Gerbod, D; Dauchez, M; Gratepanche, S; Alix, A J; Dive, D

    1998-04-01

    A superoxide dismutase (SOD) gene of the parasitic protist Trichomonas vaginalis was cloned, sequenced, expressed in Escherichia coli, and its gene product characterized. It is an iron-containing dimeric protein with a monomeric mass of 22,067 Da. Southern blot analyses suggested the presence of seven iron-containing (FeSOD) gene copies. Hydrophobic cluster analysis revealed some peculiarities in the 2D structure of the FeSOD from T. vaginalis and a strong structural conservation between prokaryotic and eukaryotic FeSODs. Phylogenetic reconstruction of the SOD sequences confirmed the dichotomy between FeSODs and manganese-containing SODs. FeSODs of protists appeared to group together with homologous proteobacterial enzymes, suggesting a possible origin of eukaryotic FeSODs through an endosymbiotic event.

  7. AID/APOBEC cytosine deaminase induces genome-wide kataegis

    PubMed Central

    2012-01-01

    Clusters of localized hypermutation in human breast cancer genomes, named “kataegis” (from the Greek for thunderstorm), are hypothesized to result from multiple cytosine deaminations catalyzed by AID/APOBEC proteins. However, a direct link between APOBECs and kataegis is still lacking. We have sequenced the genomes of yeast mutants induced in diploids by expression of the gene for PmCDA1, a hypermutagenic deaminase from sea lamprey. Analysis of the distribution of 5,138 induced mutations revealed localized clusters very similar to those found in tumors. Our data provide evidence that unleashed cytosine deaminase activity is an evolutionarily conserved, prominent source of genome-wide kataegis events. Reviewers: This article was reviewed by Professor Sandor Pongor, Professor Shamil R. Sunyaev, and Dr Vladimir Kuznetsov. PMID:23249472
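
    The idea of "localized clusters" of mutations in the abstract above can be illustrated with a minimal sketch. It groups mutation coordinates on one chromosome into runs whose neighbours lie close together. The function name, the 1,000 bp gap, and the minimum of five mutations per cluster are illustrative assumptions; the abstract does not state the criteria the authors used.

```python
from typing import List, Tuple


def find_mutation_clusters(
    positions: List[int],
    max_gap: int = 1000,   # assumed threshold, not taken from the abstract
    min_size: int = 5,     # assumed threshold, not taken from the abstract
) -> List[Tuple[int, int, int]]:
    """Return (start, end, count) for each run of mutations on one chromosome
    in which every consecutive pair is at most max_gap bp apart."""
    clusters: List[Tuple[int, int, int]] = []
    if not positions:
        return clusters

    ordered = sorted(positions)
    run = [ordered[0]]
    for pos in ordered[1:]:
        if pos - run[-1] <= max_gap:
            # Still within the same dense run of mutations.
            run.append(pos)
        else:
            # Gap too large: close the current run and start a new one.
            if len(run) >= min_size:
                clusters.append((run[0], run[-1], len(run)))
            run = [pos]
    if len(run) >= min_size:
        clusters.append((run[0], run[-1], len(run)))
    return clusters


if __name__ == "__main__":
    demo = [100, 400, 900, 1500, 2100, 50000, 120000, 120300]
    print(find_mutation_clusters(demo))
```

    With the toy coordinates in the demo, only the first five positions form a cluster; the isolated mutation and the closely spaced pair fall below the assumed minimum size.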

  8. Arm-specific dynamics of chromosome evolution in malaria mosquitoes

    PubMed Central

    2011-01-01

    Background: The malaria mosquito species of subgenus Cellia have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in Anopheles gambiae and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus Cellia nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints. Results: To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among Anopheles gambiae, Anopheles funestus, and Anopheles stephensi. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of An. gambiae and on the homologous arms of An. funestus and An. stephensi. Conclusions: Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionarily conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms. PMID:21473772
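
    Comparing gene orders to find conserved blocks, as described in the abstract above, can be sketched in a few lines. The example below is a simplified illustration, not the authors' pipeline: it assumes each ortholog appears once per species, ignores strand, and reports only runs of genes that are directly adjacent in both orders (in the same or the inverted direction). The function and gene names are hypothetical.

```python
from typing import Dict, List


def conserved_blocks(
    order_a: List[str], order_b: List[str], min_len: int = 2
) -> List[List[str]]:
    """Return maximal runs of genes from order_a that are also consecutive
    in order_b, either in the same direction or fully inverted."""
    pos_b: Dict[str, int] = {gene: i for i, gene in enumerate(order_b)}
    blocks: List[List[str]] = []
    current: List[str] = []
    step = 0  # +1: same direction, -1: inverted, 0: not yet determined

    def flush() -> None:
        # Keep the current run only if it is long enough to count as a block.
        if len(current) >= min_len:
            blocks.append(list(current))

    for gene in order_a:
        if gene not in pos_b:
            # No ortholog in species B: this ends any open block.
            flush()
            current, step = [], 0
            continue
        if current:
            delta = pos_b[gene] - pos_b[current[-1]]
            if delta in (1, -1) and step in (0, delta):
                current.append(gene)
                step = delta
                continue
            flush()
        current, step = [gene], 0
    flush()
    return blocks


if __name__ == "__main__":
    species_a = ["g1", "g2", "g3", "g4", "g5", "g6", "g7"]
    species_b = ["g3", "g2", "g1", "g9", "g5", "g6", "g4"]
    print(conserved_blocks(species_a, species_b))
```

    In the toy input, g1-g2-g3 is recovered as an inverted block and g5-g6 as a collinear block, while g4 and g7 mark breakpoints. A real analysis would also need to handle gene orientation, paralogs, and small gaps inside blocks.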

  9. Identification of Abundantly Expressed Novel and Conserved Genes from the Infective Larval Stage of Toxocara canis by an Expressed Sequence Tag Strategy

    PubMed Central

    Tetteh, Kevin K. A.; Loukas, Alex; Tripp, Cindy; Maizels, Rick M.

    1999-01-01

    Larvae of Toxocara canis, a nematode parasite of dogs, infect humans, causing visceral and ocular larva migrans. In noncanid hosts, larvae neither grow nor differentiate but endure in a state of arrested development. Reasoning that parasite protein production is orientated to immune evasion, we undertook a random sequencing project from a larval cDNA library to characterize the most highly expressed transcripts. In all, 266 clones were sequenced, most from both 3′ and 5′ ends, and similarity searches against GenBank protein and dbEST nucleotide databases were conducted. Cluster analyses showed that 128 distinct gene products had been found, all but 3 of which represented newly identified genes. Ninety-five genes were represented by a single clone, but seven transcripts were present at high frequencies, each composing >2% of all clones sequenced. These high-abundance transcripts include a mucin and a C-type lectin, which are both major excretory-secretory antigens released by parasites. Four highly expressed novel gene transcripts, termed ant (abundant novel transcript) genes, were found. Together, these four genes comprised 18% of all cDNA clones isolated, but no similar sequences occur in the Caenorhabditis elegans genome. While the coding regions of the four genes are dissimilar, their 3′ untranslated tracts have significant homology in nucleotide sequence. The discovery of these abundant, parasite-specific genes of newly identified lectins and mucins, as well as a range of conserved and novel proteins, provides defined candidates for future analysis of the molecular basis of immune evasion by T. canis. PMID:10456930

  10. Genome-based classification of micromonosporae with a focus on their biotechnological and ecological potential.

    PubMed

    Carro, Lorena; Nouioui, Imen; Sangal, Vartul; Meier-Kolthoff, Jan P; Trujillo, Martha E; Montero-Calasanz, Maria Del Carmen; Sahin, Nevzat; Smith, Darren Lee; Kim, Kristi E; Peluso, Paul; Deshpande, Shweta; Woyke, Tanja; Shapiro, Nicole; Kyrpides, Nikos C; Klenk, Hans-Peter; Göker, Markus; Goodfellow, Michael

    2018-01-11

    There is a need to clarify relationships within the actinobacterial genus Micromonospora, the type genus of the family Micromonosporaceae, given its biotechnological and ecological importance. Here, draft genomes of 40 Micromonospora type strains and two non-type strains are made available through the Genomic Encyclopedia of Bacteria and Archaea project and used to generate a phylogenomic tree which showed they could be assigned to well supported phyletic lines that were not evident in corresponding trees based on single and concatenated sequences of conserved genes. DNA G+C ratios derived from genome sequences showed that corresponding data from species descriptions were imprecise. Emended descriptions include precise base composition data and approximate genome sizes of the type strains. antiSMASH analyses of the draft genomes show that micromonosporae have a previously unrealised potential to synthesize novel specialized metabolites. Close to one thousand biosynthetic gene clusters were detected, including NRPS, PKS, terpene and siderophore clusters that were discontinuously distributed, thereby opening up the prospect of prioritising gifted strains for natural product discovery. The distribution of key stress-related genes provides an insight into how micromonosporae adapt to key environmental variables. Genes associated with plant interactions highlight the potential use of micromonosporae in agriculture and biotechnology.

  11. Large-Scale Phylogenetic Classification of Fungal Chitin Synthases and Identification of a Putative Cell-Wall Metabolism Gene Cluster in Aspergillus Genomes

    PubMed Central

    Pacheco-Arjona, Jose Ramon; Ramirez-Prado, Jorge Humberto

    2014-01-01

    The cell wall is a protective and versatile structure distributed in all fungi. The component responsible for its rigidity is chitin, a product of chitin synthase (Chsp) enzymes. There are seven classes of chitin synthase genes (CHS) and the amount and type encoded in fungal genomes vary considerably from one species to another. Previous Chsp sequence analyses focused on their study as individual units, regardless of genomic context. The identification of blocks of conserved genes between genomes can provide important clues about the interactions and localization of chitin synthases. In the present study, we carried out an in silico search of all putative Chsp encoded in 54 full fungal genomes, encompassing 21 orders from five phyla. Phylogenetic studies of these Chsp were able to confidently classify 347 out of the 369 Chsp identified (94%). Patterns in the distribution of Chsp related to taxonomy were identified, the most prominent being related to the type of fungal growth. More importantly, a synteny analysis for genomic blocks centered on class IV Chsp (the most abundant and widely distributed Chsp class) identified a putative cell wall metabolism gene cluster in members of the genus Aspergillus, the first such association reported for any fungal genome. PMID:25148134

  12. Comparative Microbial Modules Resource: Generation and Visualization of Multi-species Biclusters

    PubMed Central

    Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard

    2011-01-01

    The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures – results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation. PMID:22144874

  13. Comparative microbial modules resource: generation and visualization of multi-species biclusters.

    PubMed

    Kacmarczyk, Thadeous; Waltman, Peter; Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard

    2011-12-01

    The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures - results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation. © 2011 Kacmarczyk et al.

  14. Sequencing of the amylopullulanase (apu) gene of Thermoanaerobacter ethanolicus 39E, and identification of the active site by site-directed mutagenesis.

    PubMed

    Mathupala, S P; Lowe, S E; Podkovyrov, S M; Zeikus, J G

    1993-08-05

    The complete nucleotide sequence of the gene encoding the dual active amylopullulanase of Thermoanaerobacter ethanolicus 39E (formerly Clostridium thermohydrosulfuricum) was determined. The structural gene (apu) contained a single open reading frame 4443 base pairs in length, corresponding to 1481 amino acids, with an estimated molecular weight of 162,780. Analysis of the deduced sequence of apu with sequences of alpha-amylases and alpha-1,6 debranching enzymes enabled the identification of four conserved regions putatively involved in substrate binding and in catalysis. The conserved regions were localized within a 2.9-kilobase pair gene fragment, which encoded a M(r) 100,000 protein that maintained the dual activities and thermostability of the native enzyme. The catalytic residues of amylopullulanase were tentatively identified by using hydrophobic cluster analysis for comparison of amino acid sequences of amylopullulanase and other amylolytic enzymes. Asp597, Glu626, and Asp703 were individually modified to their respective amide form, or the alternate acid form, and in all cases both alpha-amylase and pullulanase activities were lost, suggesting the possible involvement of 3 residues in a catalytic triad, and the presence of a putative single catalytic site within the enzyme. These findings substantiate amylopullulanase as a new type of amylosaccharidase.
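
    The sizes quoted in the abstract above can be checked with simple arithmetic: an open reading frame of 4443 bp holds 4443 / 3 = 1481 codons, and dividing the estimated molecular weight by that count gives an average residue mass near the commonly cited figure of roughly 110 Da. The short sketch below does this calculation. The function names are hypothetical, and the abstract does not say whether the stop codon is included in the 4443 bp.

```python
def codons_in_orf(length_bp: int) -> int:
    """Number of complete codons in an open reading frame of length_bp bases."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return length_bp // 3


def mean_residue_mass(molecular_weight: float, residues: int) -> float:
    """Average mass per residue in daltons."""
    return molecular_weight / residues


if __name__ == "__main__":
    n_codons = codons_in_orf(4443)            # ORF length from the abstract
    avg = mean_residue_mass(162_780, n_codons)  # molecular weight from the abstract
    print(n_codons, round(avg, 1))
```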

  15. Database resources of the National Center for Biotechnology Information.

    PubMed

    Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian

    2011-01-01

    In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

  16. Identification of innate lymphoid cells in single-cell RNA-Seq data.

    PubMed

    Suffiotti, Madeleine; Carmona, Santiago J; Jandus, Camilla; Gfeller, David

    2017-07-01

    Innate lymphoid cells (ILCs) consist of natural killer (NK) cells and non-cytotoxic ILCs that are broadly classified into ILC1, ILC2, and ILC3 subtypes. These cells recently emerged as important early effectors of innate immunity for their roles in tissue homeostasis and inflammation. Over the last few years, ILCs have been extensively studied in mouse and human at the functional and molecular level, including gene expression profiling. However, sorting ILCs with flow cytometry for gene expression analysis is a delicate and time-consuming process. Here we propose and validate a novel framework for studying ILCs at the transcriptomic level using single-cell RNA-Seq data. Our approach combines unsupervised clustering and a new cell type classifier trained on mouse ILC gene expression data. We show that this approach can accurately identify different ILCs, especially ILC2 cells, in human lymphocyte single-cell RNA-Seq data. Our new model relies only on genes conserved across vertebrates, thereby making it in principle applicable in any vertebrate species. Considering the rapid increase in throughput of single-cell RNA-Seq technology, our work provides a computational framework for studying ILC2 cells in single-cell transcriptomic data and may help explore their conservation in distant vertebrate species.

  17. Genome sequence of the lager brewing yeast, an interspecies hybrid.

    PubMed

    Nakao, Yoshihiro; Kanamori, Takeshi; Itoh, Takehiko; Kodama, Yukiko; Rainieri, Sandra; Nakamura, Norihisa; Shimonaga, Tomoko; Hattori, Masahira; Ashikari, Toshihiko

    2009-04-01

    This work presents the genome sequencing of the lager brewing yeast (Saccharomyces pastorianus) Weihenstephan 34/70, a strain widely used in lager beer brewing. The 25 Mb genome comprises two nuclear sub-genomes originating from Saccharomyces cerevisiae and Saccharomyces bayanus and one circular mitochondrial genome originating from S. bayanus. Thirty-six different types of chromosomes were found including eight chromosomes with translocations between the two sub-genomes, whose breakpoints are within the orthologous open reading frames. Several gene loci responsible for typical lager brewing yeast characteristics such as maltotriose uptake and sulfite production have been increased in number by chromosomal rearrangements. Despite an overall high degree of conservation of the synteny with S. cerevisiae and S. bayanus, the syntenies were not well conserved in the sub-telomeric regions that contain lager brewing yeast characteristic and specific genes. Deletion of larger chromosomal regions, a massive unilateral decrease of the ribosomal DNA cluster and bilateral truncations of over 60 genes reflect a post-hybridization evolution process. Truncations and deletions of less efficient maltose and maltotriose uptake genes may indicate the result of adaptation to brewing. The genome sequence of this interspecies hybrid yeast provides a new tool for better understanding of lager brewing yeast behavior in industrial beer production.

  18. Genome Sequence of the Lager Brewing Yeast, an Interspecies Hybrid

    PubMed Central

    Nakao, Yoshihiro; Kanamori, Takeshi; Itoh, Takehiko; Kodama, Yukiko; Rainieri, Sandra; Nakamura, Norihisa; Shimonaga, Tomoko; Hattori, Masahira; Ashikari, Toshihiko

    2009-01-01

    This work presents the genome sequencing of the lager brewing yeast (Saccharomyces pastorianus) Weihenstephan 34/70, a strain widely used in lager beer brewing. The 25 Mb genome comprises two nuclear sub-genomes originating from Saccharomyces cerevisiae and Saccharomyces bayanus and one circular mitochondrial genome originating from S. bayanus. Thirty-six different types of chromosomes were found including eight chromosomes with translocations between the two sub-genomes, whose breakpoints are within the orthologous open reading frames. Several gene loci responsible for typical lager brewing yeast characteristics such as maltotriose uptake and sulfite production have been increased in number by chromosomal rearrangements. Despite an overall high degree of conservation of the synteny with S. cerevisiae and S. bayanus, the syntenies were not well conserved in the sub-telomeric regions that contain lager brewing yeast characteristic and specific genes. Deletion of larger chromosomal regions, a massive unilateral decrease of the ribosomal DNA cluster and bilateral truncations of over 60 genes reflect a post-hybridization evolution process. Truncations and deletions of less efficient maltose and maltotriose uptake genes may indicate the result of adaptation to brewing. The genome sequence of this interspecies hybrid yeast provides a new tool for better understanding of lager brewing yeast behavior in industrial beer production. PMID:19261625

  19. The mouse genome displays highly dynamic populations of KRAB-zinc finger protein genes and related genetic units

    PubMed Central

    Kauzlaric, Annamaria; Ecco, Gabriela; Cassano, Marco; Duc, Julien; Imbeault, Michael; Trono, Didier

    2017-01-01

    KRAB-containing poly-zinc finger proteins (KZFPs) constitute the largest family of transcription factors encoded by mammalian genomes, and growing evidence indicates that they fulfill functions critical to both embryonic development and maintenance of adult homeostasis. KZFP genes underwent broad and independent waves of expansion in many higher vertebrate lineages, yet comprehensive studies of members harbored by a given species are scarce. Here we present a thorough analysis of KZFP genes and related units in the murine genome. We first identified about twice as many elements as previously annotated as either KZFP genes or pseudogenes, notably by assigning to this family an entity formerly considered as a large group of Satellite repeats. We then could delineate an organization in clusters distributed throughout the genome, with signs of recombination, translocation, duplication and seeding of new sites by retrotransposition of KZFP genes and related genetic units (KZFP/rGUs). Moreover, we harvested evidence indicating that closely related paralogs had evolved through both drifting and shifting of sequences encoding zinc finger arrays. Finally, we could demonstrate that the KAP1-SETDB1 repressor complex tames the expression of KZFP/rGUs within clusters, yet that the primary targets of this regulation are not the KZFP/rGUs themselves but enhancers contained in neighboring endogenous retroelements and that, underneath, KZFPs conserve highly individualized patterns of expression. PMID:28334004

  20. Towards a physical map of the fertility genes on the heterochromatic Y chromosome of Drosophila hydei: families of repetitive sequences transcribed on the lampbrush loops Nooses and Threads are organized in extended clusters of several hundred kilobases.

    PubMed

    Trapitz, P; Glätzer, K H; Bünemann, H

    1992-11-01

    The understanding of structure and function of the so-called fertility genes of Drosophila is very limited due to their unusual size--several megabases--and their location on the heterochromatic Y chromosome. Since mapping of these genes has mainly been done by classical cytogenetic analyses using a small number of cytologically visible lampbrush loops as the sole markers for particular fertility genes, the resolution of the genetic map of the Y chromosome is restricted to 3-5 Mb. Here we demonstrate that a substantially finer subdivision of the megabase-sized fertility genes in the subtelomeric regions of the Y chromosome of Drosophila hydei can be achieved by a combination of digestion with restriction enzymes having 6 bp recognition sequences, and pulsed field gel electrophoresis. The physical subdivision is based upon large conserved fragments of repetitive DNA in the size range from 50 up to 1600 kb and refers to the long-range organization of several families of repetitive DNA involved in Y chromosomal transcription processes in primary spermatocytes. We conclude from our results that at least five different families of repetitive DNA specifically transcribed on the lampbrush loops nooses and threads are organized as extended clusters of several hundred kb, essentially free of interspersed non-repetitive sequences.

  1. High-quality draft genome sequence of Rhizobium mesoamericanum strain STM6155, a Mimosa pudica microsymbiont from New Caledonia

    DOE PAGES

    Klonowska, Agnieszka; López-López, Aline; Moulin, Lionel; ...

    2017-01-17

    Rhizobium mesoamericanum STM6155 (INSCD=ATYY01000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as an effective nitrogen fixing microsymbiont of the legume Mimosa pudica L. STM6155 was isolated in 2009 from a nodule of the trap host M. pudica grown in nickel-rich soil collected near Mont Dore, New Caledonia. R. mesoamericanum STM6155 was selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) genome sequencing project. Here we describe the symbiotic properties of R. mesoamericanum STM6155, together with its genome sequence information and annotation. The 6,927,906 bp high-quality draft genome is arranged into 147 scaffolds of 152 contigs containing 6855 protein-coding genes and 71 RNA-only encoding genes. Strain STM6155 forms an ANI clique (ID 2435) with the sequenced R. mesoamericanum strain STM3625, and the nodulation genes are highly conserved in these strains and the type strain of Rhizobium grahamii CCGE501T. Within the STM6155 genome, we have identified a chr chromate efflux gene cluster of six genes arranged into two putative operons and we postulate that this cluster is important for the survival of STM6155 in ultramafic soils containing high concentrations of chromate.

  2. High-quality draft genome sequence of Rhizobium mesoamericanum strain STM6155, a Mimosa pudica microsymbiont from New Caledonia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Klonowska, Agnieszka; López-López, Aline; Moulin, Lionel

    Rhizobium mesoamericanum STM6155 (INSCD=ATYY01000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as an effective nitrogen fixing microsymbiont of the legume Mimosa pudica L. STM6155 was isolated in 2009 from a nodule of the trap host M. pudica grown in nickel-rich soil collected near Mont Dore, New Caledonia. R. mesoamericanum STM6155 was selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) genome sequencing project. Here we describe the symbiotic properties of R. mesoamericanum STM6155, together with its genome sequence information and annotation. The 6,927,906 bp high-quality draft genome is arranged into 147 scaffolds of 152 contigs containing 6855 protein-coding genes and 71 RNA-only encoding genes. Strain STM6155 forms an ANI clique (ID 2435) with the sequenced R. mesoamericanum strain STM3625, and the nodulation genes are highly conserved in these strains and the type strain of Rhizobium grahamii CCGE501T. Within the STM6155 genome, we have identified a chr chromate efflux gene cluster of six genes arranged into two putative operons and we postulate that this cluster is important for the survival of STM6155 in ultramafic soils containing high concentrations of chromate.

  3. The mouse genome displays highly dynamic populations of KRAB-zinc finger protein genes and related genetic units.

    PubMed

    Kauzlaric, Annamaria; Ecco, Gabriela; Cassano, Marco; Duc, Julien; Imbeault, Michael; Trono, Didier

    2017-01-01

    KRAB-containing poly-zinc finger proteins (KZFPs) constitute the largest family of transcription factors encoded by mammalian genomes, and growing evidence indicates that they fulfill functions critical to both embryonic development and maintenance of adult homeostasis. KZFP genes underwent broad and independent waves of expansion in many higher vertebrate lineages, yet comprehensive studies of members harbored by a given species are scarce. Here we present a thorough analysis of KZFP genes and related units in the murine genome. We first identified about twice as many elements as previously annotated as either KZFP genes or pseudogenes, notably by assigning to this family an entity formerly considered as a large group of Satellite repeats. We then could delineate an organization in clusters distributed throughout the genome, with signs of recombination, translocation, duplication and seeding of new sites by retrotransposition of KZFP genes and related genetic units (KZFP/rGUs). Moreover, we harvested evidence indicating that closely related paralogs had evolved through both drifting and shifting of sequences encoding zinc finger arrays. Finally, we could demonstrate that the KAP1-SETDB1 repressor complex tames the expression of KZFP/rGUs within clusters, yet that the primary targets of this regulation are not the KZFP/rGUs themselves but enhancers contained in neighboring endogenous retroelements and that, underneath, KZFPs conserve highly individualized patterns of expression.

  4. Burkholderia mallei tssM Encodes a Putative Deubiquitinase That Is Secreted and Expressed inside Infected RAW 264.7 Murine Macrophages

    PubMed Central

    Shanks, John; Burtnick, Mary N.; Brett, Paul J.; Waag, David M.; Spurgers, Kevin B.; Ribot, Wilson J.; Schell, Mark A.; Panchal, Rekha G.; Gherardini, Frank C.; Wilkinson, Keith D.; DeShazer, David

    2009-01-01

    Burkholderia mallei, a category B biothreat agent, is a facultative intracellular pathogen that causes the zoonotic disease glanders. The B. mallei VirAG two-component regulatory system activates the transcription of ∼60 genes, including a large virulence gene cluster encoding a type VI secretion system (T6SS). The B. mallei tssM gene encodes a putative ubiquitin-specific protease that is physically linked to, and transcriptionally coregulated with, the T6SS gene cluster. Mass spectrometry and immunoblot analysis demonstrated that TssM was secreted in a virAG-dependent manner in vitro. Surprisingly, the T6SS was found to be dispensable for the secretion of TssM. The C-terminal half of TssM, which contains Cys and His box motifs conserved in eukaryotic deubiquitinases, was purified and biochemically characterized. Recombinant TssM hydrolyzed multiple ubiquitinated substrates and the cysteine at position 102 was critical for enzymatic activity. The tssM gene was expressed within 1 h after uptake of B. mallei into RAW 264.7 murine macrophages, suggesting that the TssM deubiquitinase is produced in this intracellular niche. Although the physiological substrate(s) is currently unknown, the TssM deubiquitinase may provide B. mallei a selective advantage in the intracellular environment during infection. PMID:19168747

  5. Histidine Kinase-Mediated Production and Autoassembly of Porphyromonas gingivalis Fimbriae

    PubMed Central

    Nishikawa, Kiyoshi; Duncan, Margaret J.

    2010-01-01

    Porphyromonas gingivalis, a Gram-negative oral anaerobe, is strongly associated with chronic adult periodontitis, and it utilizes FimA fimbriae to persistently colonize and evade host defenses in the periodontal crevice. The FimA-related gene cluster (the fim gene cluster) is positively regulated by the FimS-FimR two-component system. In this study, comparative analyses between fimbriate type strain ATCC 33277 and fimbria-deficient strain W83 revealed differences in their fimS loci, which encode FimS histidine kinase. Using a reciprocal gene exchange system, we established that FimS from W83 is malfunctional. Complementation analysis with chimeric fimS constructs revealed that W83 FimS has a defective kinase domain due to a truncated conserved G3 box motif that provides an ATP-binding pocket. The introduction of the functional fimS from 33277 restored the production, but not polymerization, of endogenous FimA subunits in W83. Further analyses with a fimA-exchanged W83 isogenic strain showed that even the fimbria-deficient W83 retains the ability to polymerize FimA from 33277, indicating the assembly of mature FimA by a primary structure-dependent mechanism. It also was shown that the substantial expression of 33277-type FimA fimbriae in the W83 derivative requires the introduction and expression of the functional 33277 fimS. These findings indicate that FimSR is the unique and universal regulatory system that activates the fim gene cluster in a fimA genotype-independent manner. PMID:20118268

  6. Database resources of the National Center for Biotechnology Information.

    PubMed

    2016-01-04

    The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank(®) nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (PubMed Central (PMC), Bookshelf and PubReader), health (ClinVar, dbGaP, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen), genomes (BioProject, Assembly, Genome, BioSample, dbSNP, dbVar, Epigenomics, the Map Viewer, Nucleotide, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser and the Trace Archive), genes (Gene, Gene Expression Omnibus (GEO), HomoloGene, PopSet and UniGene), proteins (Protein, the Conserved Domain Database (CDD), COBALT, Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB) and Protein Clusters) and chemicals (Biosystems and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for most of these databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  7. Database resources of the National Center for Biotechnology Information.

    PubMed

    2015-01-01

    The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank(®) nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (Bookshelf, PubMed Central (PMC) and PubReader); medical genetics (ClinVar, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen); genes and genomics (BioProject, BioSample, dbSNP, dbVar, Epigenomics, Gene, Gene Expression Omnibus (GEO), Genome, HomoloGene, the Map Viewer, Nucleotide, PopSet, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser, Trace Archive and UniGene); and proteins and chemicals (Biosystems, COBALT, the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB), Protein Clusters, Protein and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for many of these databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  8. Hyperrecombination in Streptococcus pneumoniae Depends on an Atypical mutY Homologue

    PubMed Central

    Samrakandi, Moulay Mustapha; Pasta, Franck

    2000-01-01

    The unusual behavior of the mutation ami36, which generates hyperrecombination in two point crosses, was previously attributed to a localized conversion process changing A/G mispairs into CG pairs. Although the mechanism was found to be dependent on the DNA polymerase I, the specific function responsible for this correction was still unknown. Analysis of the pneumococcal genome sequence has revealed the presence of an open reading frame homologous to the gene mutY of Escherichia coli. The gene mutY encodes an adenine glycosylase active on A/G and A/7,8-dihydro-8-oxoguanine (8-OxoG) mismatches, inducing their repair to CG and C/8-OxoG, respectively. Here we report that disrupting the pneumococcal mutY homologue abolishes the hyperrecombination induced by ami36 and leads to a mutator phenotype specifically enhancing AT-to-CG transversions. The deduced amino acid sequence of the pneumococcal MutY protein reveals the absence of four cysteines, highly conserved in the endonuclease III/MutY glycosylase family, which ligate a [4Fe-4S]2+ cluster. The actual function of this cluster is still intriguing, inasmuch as we show that the pneumococcal gene complements a mutY strain of E. coli. PMID:10852864

  9. Hyperrecombination in Streptococcus pneumoniae depends on an atypical mutY homologue.

    PubMed

    Samrakandi, M M; Pasta, F

    2000-06-01

    The unusual behavior of the mutation ami36, which generates hyperrecombination in two point crosses, was previously attributed to a localized conversion process changing A/G mispairs into CG pairs. Although the mechanism was found to be dependent on the DNA polymerase I, the specific function responsible for this correction was still unknown. Analysis of the pneumococcal genome sequence has revealed the presence of an open reading frame homologous to the gene mutY of Escherichia coli. The gene mutY encodes an adenine glycosylase active on A/G and A/7,8-dihydro-8-oxoguanine (8-OxoG) mismatches, inducing their repair to CG and C/8-OxoG, respectively. Here we report that disrupting the pneumococcal mutY homologue abolishes the hyperrecombination induced by ami36 and leads to a mutator phenotype specifically enhancing AT-to-CG transversions. The deduced amino acid sequence of the pneumococcal MutY protein reveals the absence of four cysteines, highly conserved in the endonuclease III/MutY glycosylase family, which ligate a [4Fe-4S](2+) cluster. The actual function of this cluster is still intriguing, inasmuch as we show that the pneumococcal gene complements a mutY strain of E. coli.

  10. Clustered metallothionein genes are co-regulated in rice and ectopic expression of OsMT1e-P confers multiple abiotic stress tolerance in tobacco via ROS scavenging

    PubMed Central

    2012-01-01

    Background Metallothioneins (MT) are low molecular weight, cysteine rich metal binding proteins, found across genera and species, but their function(s) in abiotic stress tolerance are not well documented. Results We have characterized a rice MT gene, OsMT1e-P, isolated from a subtractive library generated from a stressed salinity tolerant rice genotype, Pokkali. Bioinformatics analysis of the rice genome sequence revealed that this gene belongs to a multigenic family, which consists of 13 genes with 15 protein products. OsMT1e-P is located on chromosome XI, away from the majority of other type I genes that are clustered on chromosome XII. Various members of this MT gene cluster showed a tight co-regulation pattern under several abiotic stresses. Sequence analysis revealed the presence of conserved cysteine residues in OsMT1e-P protein. Salinity stress was found to regulate the transcript abundance of OsMT1e-P in a developmental and organ specific manner. Using transgenic approach, we found a positive correlation between ectopic expression of OsMT1e-P and stress tolerance. Our experiments further suggest ROS scavenging to be the possible mechanism for multiple stress tolerance conferred by OsMT1e-P. Conclusion We present an overview of MTs, describing their gene structure, genome localization and expression patterns under salinity and development in rice. We have found that ectopic expression of OsMT1e-P enhances tolerance towards multiple abiotic stresses in transgenic tobacco and the resultant plants could survive and set viable seeds under saline conditions. Taken together, the experiments presented here have indicated that ectopic expression of OsMT1e-P protects against oxidative stress primarily through efficient scavenging of reactive oxygen species. PMID:22780875

  11. Clustered metallothionein genes are co-regulated in rice and ectopic expression of OsMT1e-P confers multiple abiotic stress tolerance in tobacco via ROS scavenging.

    PubMed

    Kumar, Gautam; Kushwaha, Hemant Ritturaj; Panjabi-Sabharwal, Vaishali; Kumari, Sumita; Joshi, Rohit; Karan, Ratna; Mittal, Shweta; Pareek, Sneh L Singla; Pareek, Ashwani

    2012-07-10

    Metallothioneins (MT) are low molecular weight, cysteine rich metal binding proteins, found across genera and species, but their function(s) in abiotic stress tolerance are not well documented. We have characterized a rice MT gene, OsMT1e-P, isolated from a subtractive library generated from a stressed salinity tolerant rice genotype, Pokkali. Bioinformatics analysis of the rice genome sequence revealed that this gene belongs to a multigenic family, which consists of 13 genes with 15 protein products. OsMT1e-P is located on chromosome XI, away from the majority of other type I genes that are clustered on chromosome XII. Various members of this MT gene cluster showed a tight co-regulation pattern under several abiotic stresses. Sequence analysis revealed the presence of conserved cysteine residues in OsMT1e-P protein. Salinity stress was found to regulate the transcript abundance of OsMT1e-P in a developmental and organ specific manner. Using transgenic approach, we found a positive correlation between ectopic expression of OsMT1e-P and stress tolerance. Our experiments further suggest ROS scavenging to be the possible mechanism for multiple stress tolerance conferred by OsMT1e-P. We present an overview of MTs, describing their gene structure, genome localization and expression patterns under salinity and development in rice. We have found that ectopic expression of OsMT1e-P enhances tolerance towards multiple abiotic stresses in transgenic tobacco and the resultant plants could survive and set viable seeds under saline conditions. Taken together, the experiments presented here have indicated that ectopic expression of OsMT1e-P protects against oxidative stress primarily through efficient scavenging of reactive oxygen species.

  12. Genome-Wide Identification, Evolutionary Expansion, and Expression Profile of Homeodomain-Leucine Zipper Gene Family in Poplar (Populus trichocarpa)

    PubMed Central

    Hu, Ruibo; Chi, Xiaoyuan; Chai, Guohua; Kong, Yingzhen; He, Guo; Wang, Xiaoyu; Shi, Dachuan; Zhang, Dongyuan; Zhou, Gongke

    2012-01-01

    Background Homeodomain-leucine zipper (HD-ZIP) proteins are plant-specific transcriptional factors known to play crucial roles in plant development. Although sequence phylogeny analysis of Populus HD-ZIPs was carried out in a previous study, no systematic analysis incorporating genome organization, gene structure, and expression compendium has been conducted in model tree species Populus thus far. Principal Findings In this study, a comprehensive analysis of Populus HD-ZIP gene family was performed. Sixty-three full-length HD-ZIP genes were found in Populus genome. These Populus HD-ZIP genes were phylogenetically clustered into four distinct subfamilies (HD-ZIP I–IV) and predominately distributed across 17 linkage groups (LG). Fifty genes from 25 Populus paralogous pairs were located in the duplicated blocks of Populus genome and then preferentially retained during the sequential evolutionary courses. Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus HD-ZIP gene family. Microarray analysis has shown that 21 Populus paralogous pairs have been differentially expressed across different tissues and under various stresses, with five paralogous pairs showing nearly identical expression patterns, 13 paralogous pairs being partially redundant and three paralogous pairs diversifying significantly. Quantitative real-time RT-PCR (qRT-PCR) analysis performed on 16 selected Populus HD-ZIP genes in different tissues and under both drought and salinity stresses confirms their tissue-specific and stress-inducible expression patterns. Conclusions Genomic organizations indicated that segmental duplications contributed significantly to the expansion of Populus HD-ZIP gene family. Exon/intron organization and conserved motif composition of Populus HD-ZIPs are highly conservative in the same subfamily, suggesting the members in the same subfamilies may also have conservative functionalities. 
    Microarray and qRT-PCR analyses showed that 89% (56 out of 63) of Populus HD-ZIPs were duplicate genes that might have been retained by substantial subfunctionalization. Taken together, these observations may lay the foundation for future functional analysis of Populus HD-ZIP genes to unravel their biological roles. PMID:22359569

  13. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    PubMed

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within the immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes is therefore a crucial step in revealing the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and also provide novel insight into the evolutionary acquisition of certain biological functions. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E < 10^-5) are included in 27 clusters. Five clusters are associated with metabolism, containing P450 genes restricted to the Brassica family and predicted to be involved in secondary metabolism. Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.
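
The comparison described in this abstract (Pearson correlation of neighboring genes versus randomly selected gene pairs, with FDR control) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the function names, the empirical p-value step, and the choice of the Benjamini-Hochberg procedure are all assumptions, since the abstract does not say which FDR method was used. Rows of `expr` are assumed to be genes in chromosomal order, columns are expression datasets, and no gene is assumed to have constant expression.

```python
import numpy as np

def _standardize(expr):
    """Center each gene (row) and scale it to unit length."""
    z = np.asarray(expr, dtype=float)
    z = z - z.mean(axis=1, keepdims=True)
    return z / np.linalg.norm(z, axis=1, keepdims=True)

def neighbor_correlations(expr):
    """Pearson r between each gene and the next gene along the chromosome."""
    z = _standardize(expr)
    return np.sum(z[:-1] * z[1:], axis=1)

def random_pair_correlations(expr, n_pairs, rng):
    """Pearson r for randomly drawn, non-adjacent gene pairs (the null)."""
    z = _standardize(expr)
    n = z.shape[0]
    i = rng.integers(0, n, size=n_pairs)
    j = rng.integers(0, n, size=n_pairs)
    keep = np.abs(i - j) > 1  # drop self-pairs and direct neighbors
    return np.sum(z[i[keep]] * z[j[keep]], axis=1)

def empirical_pvalues(observed, null):
    """Fraction of null correlations at least as large as each observed r."""
    null_sorted = np.sort(null)
    idx = np.searchsorted(null_sorted, observed, side="left")
    n_ge = len(null_sorted) - idx
    return (n_ge + 1) / (len(null_sorted) + 1)

def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted p-values (an assumed FDR procedure)."""
    p = np.asarray(pvals, dtype=float)
    m = len(p)
    order = np.argsort(p)
    ranked = p[order] * m / np.arange(1, m + 1)
    ranked = np.minimum.accumulate(ranked[::-1])[::-1]
    q = np.empty(m)
    q[order] = np.minimum(ranked, 1.0)
    return q

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    expr = rng.normal(size=(200, 50))          # synthetic data, illustration only
    expr[101] = expr[100] + 0.1 * rng.normal(size=50)  # one co-expressed pair
    r = neighbor_correlations(expr)
    null = random_pair_correlations(expr, 5000, rng)
    q = benjamini_hochberg(empirical_pvalues(r, null))
    print("neighbor pairs with q < 0.1:", np.flatnonzero(q < 0.1))
```

Runs of adjacent significant pairs would then be merged into candidate clusters; that merging step is omitted here.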

  14. Conservation of an Intact vif Gene of Human Immunodeficiency Virus Type 1 during Maternal-Fetal Transmission

    PubMed Central

    Yedavalli, Venkat R. K.; Chappey, Colombe; Matala, Erik; Ahmad, Nafees

    1998-01-01

    The human immunodeficiency virus type 1 (HIV-1) vif gene is conserved among most lentiviruses, suggesting that vif is important for natural infection. To determine whether an intact vif gene is positively selected during mother-to-infant transmission, we analyzed vif sequences from five infected mother-infant pairs following perinatal transmission. The coding potential of the vif open reading frame directly derived from uncultured peripheral blood mononuclear cell DNA was maintained in most of the 78,912 bp sequenced. We found that 123 of the 137 clones analyzed showed an 89.8% frequency of intact vif open reading frames. There was a low degree of heterogeneity of vif genes within mothers, within infants, and between epidemiologically linked mother-infant pairs. The distances between vif sequences were greater in epidemiologically unlinked individuals than in epidemiologically linked mother-infant pairs. Furthermore, the epidemiologically linked mother-infant pair vif sequences displayed similar patterns that were not seen in vif sequences from epidemiologically unlinked individuals. The functional domains, including the two cysteines at positions 114 and 133, a serine phosphorylation site at position 144, and the C-terminal basic amino acids essential for vif protein function, were highly conserved in most of the sequences. Phylogenetic analyses of 137 mother-infant pair vif sequences and 187 other available vif sequences from HIV-1 databases revealed distinct clusters for vif sequences from each mother-infant pair and for other vif sequences. Taken together, these findings suggest that vif plays an important role in HIV-1 infection and replication in mothers and their perinatally infected infants. PMID:9445004

  15. Gene Discovery in the Apicomplexa as Revealed by EST Sequencing and Assembly of a Comparative Gene Database

    PubMed Central

    Li, Li; Brunk, Brian P.; Kissinger, Jessica C.; Pape, Deana; Tang, Keliang; Cole, Robert H.; Martin, John; Wylie, Todd; Dante, Mike; Fogarty, Steven J.; Howe, Daniel K.; Liberator, Paul; Diaz, Carmen; Anderson, Jennifer; White, Michael; Jerome, Maria E.; Johnson, Emily A.; Radke, Jay A.; Stoeckert, Christian J.; Waterston, Robert H.; Clifton, Sandra W.; Roos, David S.; Sibley, L. David

    2003-01-01

    Large-scale EST sequencing projects for several important parasites within the phylum Apicomplexa were undertaken for the purpose of gene discovery. Included were several parasites of medical importance (Plasmodium falciparum, Toxoplasma gondii) and others of veterinary importance (Eimeria tenella, Sarcocystis neurona, and Neospora caninum). A total of 55,192 ESTs, deposited into dbEST/GenBank, were included in the analyses. The resulting sequences have been clustered into nonredundant gene assemblies and deposited into a relational database that supports a variety of sequence and text searches. This database has been used to compare the gene assemblies using BLAST similarity comparisons to the public protein databases to identify putative genes. Of these new entries, ∼15%–20% represent putative homologs with a conservative cutoff of p < 10^-9, thus identifying many conserved genes that are likely to share common functions with other well-studied organisms. Gene assemblies were also used to identify strain polymorphisms, examine stage-specific expression, and identify gene families. An interesting class of genes that are confined to members of this phylum and not shared by plants, animals, or fungi, was identified. These genes likely mediate the novel biological features of members of the Apicomplexa and hence offer great potential for biological investigation and as possible therapeutic targets. [The sequence data from this study have been submitted to the dbEST division of GenBank (Toxoplasma gondii, Plasmodium falciparum, Sarcocystis neurona, Eimeria tenella, and Neospora caninum).] PMID:12618375

  16. Two Cathelicidin Genes Are Present in both Rainbow Trout (Oncorhynchus mykiss) and Atlantic Salmon (Salmo salar)

    PubMed Central

    Chang, Chin-I; Zhang, Yong-An; Zou, Jun; Nie, Pin; Secombes, Christopher J.

    2006-01-01

    Further to the previous finding of the rainbow trout rtCATH_1 gene, this paper describes three more cathelicidin genes found in salmonids: two in Atlantic salmon, named asCATH_1 and asCATH_2, and one in rainbow trout, named rtCATH_2. All the three new salmonid cathelicidin genes share the common characteristics of mammalian cathelicidin genes, such as consisting of four exons and possessing a highly conserved preproregion and four invariant cysteines clustered in the C-terminal region of the cathelin-like domain. The asCATH_1 gene is homologous to the rainbow trout rtCATH_1 gene, in that it possesses three repeat motifs of TGGGGGTGGC in exon IV and two cysteine residues in the predicted mature peptide, while the asCATH_2 gene and rtCATH_2 gene are homologues of each other, with 96% nucleotide identity. Salmonid cathelicidins possess the same elastase-sensitive residue, threonine, as hagfish cathelicidins and the rabbit CAP18 molecule. The cleavage site of the four salmonid cathelicidins is within a conserved amino acid motif of QKIRTRR, which is at the beginning of the sequence encoded by exon IV. Two 36-residue peptides corresponding to the core part of rtCATH_1 and rtCATH_2 were chemically synthesized and shown to exhibit potent antimicrobial activity. rtCATH_2 was expressed constitutively in gill, head kidney, intestine, skin and spleen, while the expression of rtCATH_1 was inducible in gill, head kidney, and spleen after bacterial challenge. Four cathelicidin genes have now been characterized in salmonids and two were identified in hagfish, confirming that cathelicidin genes evolved early and are likely present in all vertebrates. PMID:16377685

  17. Function analysis of 5'-UTR of the cellulosomal xyl-doc cluster in Clostridium papyrosolvens.

    PubMed

    Zou, Xia; Ren, Zhenxing; Wang, Na; Cheng, Yin; Jiang, Yuanyuan; Wang, Yan; Xu, Chenggang

    2018-01-01

    Anaerobic, mesophilic, and cellulolytic Clostridium papyrosolvens produces an efficient cellulolytic extracellular complex named cellulosome that hydrolyzes plant cell wall polysaccharides into simple sugars. Its genome harbors two long cellulosomal clusters: the cip-cel operon encoding major cellulosome components (including scaffolding) and the xyl-doc gene cluster encoding hemicellulases. Compared with work on the cip-cel operon, there are far fewer studies on xyl-doc, mainly due to its rare occurrence in cellulolytic clostridia. Sequence analysis of xyl-doc revealed that it harbors a 5' untranslated region (5'-UTR) which potentially plays a role in the regulation of downstream gene expression. Here, we analyzed the function of the 5'-UTR of the xyl-doc cluster in C. papyrosolvens in vivo via transformation technology developed in this study. We first developed an electrotransformation method for C. papyrosolvens DSM 2782 before analyzing the 5'-UTR of the xyl-doc cluster. In the optimized condition, a field with an intensity of 7.5-9.0 kV/cm was applied as a 5 ms pulse to a cuvette (0.2 cm gap) containing a mixture of plasmid and late-exponential-phase cells suspended in a sucrose-containing buffer. Afterwards, the putative promoter and the 5'-UTR of the xyl-doc cluster were determined by sequence alignment. The results indicate that xyl-doc possesses a long conserved 5'-UTR with a complex secondary structure encompassing at least two perfect stem-loops, which are potential candidates for controlling transcriptional termination. In the last step, we employed an oxygen-independent flavin-based fluorescent protein (FbFP) as a quantitative reporter to analyze promoter activity and 5'-UTR function in vivo. This revealed that the 5'-UTR significantly blocked transcription of downstream genes, but corn stover can relieve its suppression. In the present study, our results demonstrated that the 5'-UTR of the cellulosomal xyl-doc cluster blocks the transcriptional activity of the promoter. However, some substrates, such as corn stover, can relieve the repression exerted by the 5'-UTR. Thus, it is speculated that the 5'-UTR of xyl-doc is a putative riboswitch that regulates the expression of downstream cellulosomal genes, which helps in understanding the complex regulation of the cellulosome.
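
As a worked check of the electroporation setting quoted in this abstract, the pulse voltage follows from field strength times electrode gap. The sketch below assumes a uniform field across the cuvette, and the function name is hypothetical; it simply shows that 7.5-9.0 kV/cm across a 0.2 cm gap corresponds to roughly 1.5-1.8 kV.

```python
def pulse_voltage_kv(field_kv_per_cm, gap_cm):
    """Voltage (kV) needed for a given field strength across a cuvette gap.

    Assumes a uniform field, so V = E * d.
    """
    return field_kv_per_cm * gap_cm

if __name__ == "__main__":
    gap_cm = 0.2  # cuvette gap stated in the abstract
    for field in (7.5, 9.0):  # field strengths stated in the abstract
        print(f"{field} kV/cm -> {pulse_voltage_kv(field, gap_cm):.2f} kV")
```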

  18. Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection.

    PubMed Central

    Meyers, B C; Shen, K A; Rohani, P; Gaut, B S; Michelmore, R W

    1998-01-01

    Disease resistance genes in plants are often found in complex multigene families. The largest known cluster of disease resistance specificities in lettuce contains the RGC2 family of genes. We compared the sequences of nine full-length genomic copies of RGC2 representing the diversity in the cluster to determine the structure of genes within this family and to examine the evolution of its members. The transcribed regions range from at least 7.0 to 13.1 kb, and the cDNAs contain deduced open reading frames of approximately 5.5 kb. The predicted RGC2 proteins contain a nucleotide binding site and irregular leucine-rich repeats (LRRs) that are characteristic of resistance genes cloned from other species. Unique features of the RGC2 gene products include a bipartite LRR region with >40 repeats. At least eight members of this family are transcribed. The level of sequence diversity between family members varied in different regions of the gene. The ratio of nonsynonymous (Ka) to synonymous (Ks) nucleotide substitutions was lowest in the region encoding the nucleotide binding site, which is the presumed effector domain of the protein. The LRR-encoding region showed an alternating pattern of conservation and hypervariability. This alternating pattern of variation was also found in all comparisons within families of resistance genes cloned from other species. The Ka/Ks ratios indicate that diversifying selection has resulted in increased variation at these codons. The patterns of variation support the predicted structure of LRR regions with solvent-exposed hypervariable residues that are potentially involved in binding pathogen-derived ligands. PMID:9811792

  19. Comparative bioinformatics, temporal and spatial expression analyses of Ixodes scapularis organic anion transporting polypeptides

    PubMed Central

    Radulović, Željko; Porter, Lindsay M.; Kim, Tae K.; Mulenga, Albert

    2015-01-01

    Organic anion-transporting polypeptides (Oatps) are an integral part of the detoxification mechanism in vertebrates and invertebrates. These cell surface proteins are involved in mediating the sodium-independent uptake and/or distribution of a broad array of organic amphipathic compounds and xenobiotic drugs. This study describes bioinformatics and biological characterization of 9 Oatp sequences in the Ixodes scapularis genome. These sequences have been annotated on the basis of 12 transmembrane domains, consensus motif D-X-RW-(I,V)-GAWW-X-G-(F,L)-L, and 11 conserved cysteine amino acid residues in the large extracellular loop 5 that characterize the Oatp superfamily. Ixodes scapularis Oatps may regulate non-redundant cross-tick species conserved functions in that they did not cluster as a monolithic group on the phylogeny tree and that they have orthologs in other ticks. Phylogeny clustering patterns also suggest that some tick Oatp sequences transport substrates that are similar to those of body louse, mosquito, eye worm, and filarial worm Oatps. Semi-quantitative RT-PCR analysis demonstrated that all 9 I. scapularis Oatp sequences were expressed during tick feeding. Ixodes scapularis Oatp genes potentially regulate functions during early and/or late-stage tick feeding as revealed by normalized mRNA profiles. Normalized transcript abundance indicates that I. scapularis Oatp genes are strongly expressed in unfed ticks during the first 24 h of feeding and/or at the end of the tick feeding process. Except for 2 I. scapularis Oatps, which were expressed in the salivary glands and ovaries, all other genes were expressed in all tested organs, suggesting the significance of I. scapularis Oatps in maintaining tick homeostasis. Different I. scapularis Oatp mRNA expression patterns were detected and discussed with reference to different physiological states of unfed and feeding ticks. PMID:24582512
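
The consensus motif quoted in this abstract, D-X-RW-(I,V)-GAWW-X-G-(F,L)-L, can be turned into a regular expression for scanning protein sequences. The sketch below is an illustration only: the helper names are hypothetical, X is assumed to mean any single residue, a parenthesized group is assumed to mean a choice of one residue, and the test sequence is made up rather than taken from the Ixodes scapularis genome.

```python
import re

def consensus_to_regex(consensus):
    """Convert a dash-separated consensus motif into a regex pattern.

    Assumed notation: 'X' is any one residue, '(I,V)' is one residue
    from the listed alternatives, anything else is a literal run.
    """
    parts = []
    for token in consensus.split("-"):
        if token == "X":
            parts.append("[A-Z]")
        elif token.startswith("(") and token.endswith(")"):
            parts.append("[" + token[1:-1].replace(",", "") + "]")
        else:
            parts.append(re.escape(token))
    return "".join(parts)

def find_motif(consensus, sequence):
    """Return (start, matched_text) for each non-overlapping motif hit."""
    pattern = re.compile(consensus_to_regex(consensus))
    return [(m.start(), m.group()) for m in pattern.finditer(sequence)]

if __name__ == "__main__":
    motif = "D-X-RW-(I,V)-GAWW-X-G-(F,L)-L"
    toy_sequence = "MKDTRWIGAWWLGFLAA"  # made-up sequence, not a real Oatp
    print(consensus_to_regex(motif))
    print(find_motif(motif, toy_sequence))
```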

  20. The prokaryotic antecedents of the ubiquitin-signaling system and the early evolution of ubiquitin-like β-grasp domains

    PubMed Central

    Iyer, Lakshminarayan M; Burroughs, A Maxwell; Aravind, L

    2006-01-01

    Background Ubiquitin (Ub)-mediated signaling is one of the hallmarks of all eukaryotes. Prokaryotic homologs of Ub (ThiS and MoaD) and E1 ligases have been studied in relation to sulfur incorporation reactions in thiamine and molybdenum/tungsten cofactor biosynthesis. However, there is no evidence for entire protein modification systems with Ub-like proteins and deconjugation by deubiquitinating enzymes in prokaryotes. Hence, the evolutionary assembly of the eukaryotic Ub-signaling apparatus remains unclear. Results We systematically analyzed prokaryotic Ub-related β-grasp fold proteins using sensitive sequence profile searches and structural analysis. Consequently, we identified novel Ub-related proteins beyond the characterized ThiS, MoaD, TGS, and YukD domains. To understand their functional associations, we sought and recovered several conserved gene neighborhoods and domain architectures. These included novel associations involving diverse sulfur metabolism proteins, siderophore biosynthesis and the gene encoding the transfer mRNA binding protein SmpB, as well as domain fusions between Ub-like domains and PIN-domain related RNAses. Most strikingly, we found conserved gene neighborhoods in phylogenetically diverse bacteria combining genes for JAB domains (the primary de-ubiquitinating isopeptidases of the proteasomal complex), along with E1-like adenylating enzymes and different Ub-related proteins. Further sequence analysis of other conserved genes in these neighborhoods revealed several Ub-conjugating enzyme/E2-ligase related proteins. Genes for an Ub-like protein and a JAB domain peptidase were also found in the tail assembly gene cluster of certain caudate bacteriophages. Conclusion These observations imply that members of the Ub family had already formed strong functional associations with E1-like proteins, UBC/E2-related proteins, and JAB peptidases in the bacteria. 
    Several of these Ub-like proteins and the associated protein families are likely to function together in signaling systems just as in eukaryotes. PMID:16859499

  1. Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes

    PubMed Central

    Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.

    2016-01-01

    The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800

  2. Specific resistances against Pseudomonas syringae effectors AvrB and AvrRpm1 have evolved differently in common bean, soybean, and Arabidopsis

    PubMed Central

    Chen, Nicolas W. G.; Sévignac, Mireille; Thareau, Vincent; Magdelenat, Ghislaine; David, Perrine; Ashfield, Tom; Innes, Roger W.; Geffroy, Valérie

    2010-01-01

    Summary In plants, the evolution of specific resistance is poorly understood. Pseudomonas syringae effectors AvrB and AvrRpm1 are recognized by phylogenetically distinct resistance (R) proteins in Arabidopsis (Brassicaceae) and soybean (Glycine max, Fabaceae). In soybean, these resistances are encoded by two tightly linked R genes Rpg1-b and Rpg1-r. To study the evolution of these specific resistances, we investigated AvrB- and AvrRpm1-induced responses in common bean (Phaseolus vulgaris, Fabaceae). Common bean genotypes of various geographical origins were inoculated with P. syringae strains expressing AvrB or AvrRpm1. A common bean recombinant-inbred-line (RIL) population was used to map R genes to AvrRpm1. No common bean genotypes recognized AvrB. By contrast, multiple genotypes responded to AvrRpm1, and two independent R genes conferring AvrRpm1-specific resistance were mapped to the ends of linkage group B11 (Rpsar-1) and B8 (Rpsar-2). Rpsar-1 is located in a region syntenic with the soybean Rpg1 cluster. However, mapping of specific Rpg1 homologous genes suggests that AvrRpm1 recognition evolved independently in common bean and soybean. The conservation of genomic position of AvrRpm1-specific genes between soybean and common bean suggests a model whereby specific clusters of R genes are predisposed to evolve recognition of the same effector molecules. PMID:20561214

  3. Description of an orthologous cluster of ochratoxin A biosynthetic genes in Aspergillus and Penicillium species. A comparative analysis.

    PubMed

    Gil-Serna, Jessica; García-Díaz, Marta; González-Jaén, María Teresa; Vázquez, Covadonga; Patiño, Belén

    2018-03-02

    Ochratoxin A (OTA), which is produced by several Aspergillus and Penicillium species, is one of the most important mycotoxins due to its toxic properties and worldwide distribution. The knowledge of OTA biosynthetic genes and understanding of the mechanisms involved in their regulation are essential. In this work, we obtained a clear picture of biosynthetic gene organization in the main OTA-producing Aspergillus and Penicillium species (A. steynii, A. westerdijkiae, A. niger, A. carbonarius and P. nordicum) using complete genome sequences obtained in this work or previously available on databases. The results revealed a region containing five ORFs which predicted five proteins: halogenase, bZIP transcription factor, cytochrome P450 monooxygenase, non-ribosomal peptide synthetase and polyketide synthase in all the five species. Genetic synteny was conserved in both Penicillium and Aspergillus species although genomic location seemed to be different since the clusters presented different flanking regions (except for A. steynii and A. westerdijkiae); these observations support the hypothesis of the orthology of this genomic region and that it might have been acquired by horizontal transfer. New real-time RT-PCR assays for quantification of the expression of these OTA biosynthetic genes were developed. In all species, the five genes were consistently expressed in OTA-producing strains in permissive conditions. These protocols might favour future studies on the regulation of biosynthetic genes in order to develop new efficient control methods to avoid OTA entering the food chain. Copyright © 2018 Elsevier B.V. All rights reserved.

  4. Functional conservation of Gsdma cluster genes specifically duplicated in the mouse genome.

    PubMed

    Tanaka, Shigekazu; Mizushina, Youichi; Kato, Yoriko; Tamura, Masaru; Shiroishi, Toshihiko

    2013-10-03

    Mouse Gasdermin A3 (Gsdma3) is the causative gene for dominant skin mutations exhibiting alopecia. Mouse has two other Gsdma3-related genes, Gsdma and Gsdma2, whereas human and rat have only one related gene. To date, no skin mutation has been reported for human GSDMA and rat Gsdma as well as mouse Gsdma and Gsdma2. Therefore, it is possible that only Gsdma3 has gain-of-function type mutations to cause dominant skin phenotype. To elucidate functional divergence among the Gsdma-related genes in mice, and to infer the function of the human and rat orthologs, we examined in vivo function of mouse Gsdma by generating Gsdma knockout mice and transgenic mice that overexpress wild-type Gsdma or Gsdma harboring a point mutation (Alanine339Threonine). The Gsdma knockout mice show no visible phenotype, indicating that Gsdma is not essential for differentiation of epidermal cells and maintenance of the hair cycle, and that Gsdma is expressed specifically both in the inner root sheath of hair follicles and in suprabasal cell layers, whereas Gsdma3 is expressed only in suprabasal layers. By contrast, both types of the transgenic mice exhibited epidermal hyperplasia resembling the Gsdma3 mutations, although the phenotype depended on the genetic background. These results indicate that the mouse Gsdma and Gsdma3 genes share common function to regulate epithelial maintenance and/or homeostasis, and suggest that the function of human GSDMA and rat Gsdma, which are orthologs of mouse Gsdma, is conserved as well.

  5. Gene family size conservation is a good indicator of evolutionary rates.

    PubMed

    Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen

    2010-08-01

    The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
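
The grouping used in the abstract above (duplicate families with size conservation, singletons, and duplicate families without size conservation) can be illustrated with a minimal sketch. It assumes that "family size conservation" means an identical member count in human, chimpanzee and macaque; the abstract does not spell out the criterion, and the family names and counts below are invented for illustration only.

```python
# Hypothetical copy numbers per gene family, ordered (human, chimpanzee, macaque).
# These values are made up; they are not data from the study.
family_sizes = {
    "FAM_A": (3, 3, 3),
    "FAM_B": (2, 2, 4),
    "FAM_C": (1, 1, 1),
    "FAM_D": (5, 4, 4),
}


def classify(sizes):
    """Assign a family to one of the three groups discussed in the abstract.

    Assumption: a family is size-conserved when every species has the same
    number of members; a size-conserved family with one member is a singleton.
    """
    if len(set(sizes)) != 1:
        return "not size-conserved"
    return "singleton" if sizes[0] == 1 else "size-conserved duplicate"


groups = {name: classify(sizes) for name, sizes in family_sizes.items()}
for name, group in groups.items():
    print(f"{name}: {group}")
```

Evolutionary rates would then be compared between these groups; that comparison is not sketched here because the abstract gives no rate values.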

  6. Genome-wide analysis of family-1 UDP glycosyltransferases (UGT) and identification of UGT genes for FHB resistance in wheat (Triticum aestivum L.).

    PubMed

    He, Yi; Ahmad, Dawood; Zhang, Xu; Zhang, Yu; Wu, Lei; Jiang, Peng; Ma, Hongxiang

    2018-04-19

    Fusarium head blight (FHB), a devastating disease in wheat worldwide, results in yield losses and the accumulation of mycotoxins, such as deoxynivalenol (DON), in infected grains. DON also facilitates the pathogen colonization and spread of FHB symptoms during disease development. UDP-glycosyltransferase enzymes (UGTs) are known to contribute to detoxification and enhance FHB resistance by glycosylating DON into DON-3-glucoside (D3G) in wheat. However, a comprehensive investigation of wheat (Triticum aestivum) UGT genes is still lacking. In this study, we carried out a genome-wide analysis of family-1 UDP glycosyltransferases in wheat based on the PSPG conserved box that resulted in the identification of 179 putative UGT genes. The identified genes were clustered into 16 major phylogenetic groups with a lack of phylogenetic group K. The UGT genes were invariably distributed among all the chromosomes of the 3 genomes. At least 10 intron insertion events were found in the UGT sequences, where intron 4 was observed as the most conserved intron. The expression analysis of the wheat UGT genes using both online microarray data and quantitative real-time PCR verification suggested the distinct role of UGT genes in different tissues and developmental stages. The expression of many UGT genes was up-regulated after Fusarium graminearum inoculation, and six of the genes were further verified by RT-qPCR. We identified 179 UGT genes from wheat using the available sequenced wheat genome. This study provides useful insight into the phylogenetic structure, distribution, and expression patterns of family-1 UDP glycosyltransferases in wheat. The results also offer a foundation for future work aimed at elucidating the molecular mechanisms underlying the resistance to FHB and DON accumulation.

  7. Blueprint for a minimal photoautotrophic cell: conserved and variable genes in Synechococcus elongatus PCC 7942

    PubMed Central

    2011-01-01

    Background Simpler biological systems should be easier to understand and to engineer towards pre-defined goals. One way to achieve biological simplicity is through genome minimization. Here we looked for genomic islands in the freshwater cyanobacterium Synechococcus elongatus PCC 7942 (genome size 2.7 Mb) that could be used as targets for deletion. We also looked for conserved genes that might be essential for cell survival. Results By using a combination of methods we identified 170 xenologs, 136 ORFans and 1401 core genes in the genome of S. elongatus PCC 7942. These represent 6.5%, 5.2% and 53.6% of the annotated genes respectively. We considered that genes in genomic islands could be found if they showed a combination of: a) unusual G+C content; b) unusual phylogenetic similarity; and/or c) a small number of the highly iterated palindrome 1 (HIP1) motif plus an unusual codon usage. The origin of the largest genomic island by horizontal gene transfer (HGT) could be corroborated by lack of coverage among metagenomic sequences from a freshwater microbialite. Evidence is also presented that xenologous genes tend to cluster in operons. Interestingly, most genes coding for proteins with a diguanylate cyclase domain are predicted to be xenologs, suggesting a role for horizontal gene transfer in the evolution of Synechococcus sensory systems. Conclusions Our estimates of genomic islands in PCC 7942 are larger than those predicted by other published methods like SIGI-HMM. Our results set a guide to non-essential genes in S. elongatus PCC 7942 indicating a path towards the engineering of a model photoautotrophic bacterial cell. PMID:21226929
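
The counts and percentages quoted in the abstract above should all refer to the same number of annotated genes. The short check below is a sketch that uses only the figures stated in the abstract; the implied total is an inference from those figures, not a number reported in the record.

```python
# Counts and percentages exactly as quoted in the abstract above.
reported = {
    "xenologs": (170, 6.5),
    "ORFans": (136, 5.2),
    "core genes": (1401, 53.6),
}


def implied_total(count, percent):
    """Total number of annotated genes implied by one count/percentage pair."""
    return count / (percent / 100.0)


implied = {name: implied_total(c, p) for name, (c, p) in reported.items()}
for name, total in implied.items():
    print(f"{name}: implies about {total:.0f} annotated genes")

# If the three figures share one denominator, the estimates should nearly agree;
# small differences are expected because the percentages are rounded.
spread = max(implied.values()) - min(implied.values())
print(f"spread between estimates: {spread:.1f} genes")
```

All three pairs point to roughly 2,600 annotated genes, so the reported figures are mutually consistent.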

  8. Conservation value of clustered housing developments.

    PubMed

    Lenth, Buffy A; Knight, Richard L; Gilgert, Wendell C

    2006-10-01

    Traditionally, exurban lands in Colorado have been subdivided into a grid of parcels ranging from 2 to 16 ha. From an ecological perspective, this dispersed pattern of development effectively maximizes the individual influence of each home on the land. Clustered housing developments, designed to maximize open space, are assumed to benefit plant and wildlife communities of conservation interest. They have become a popular alternative for rural development despite the lack of empirical evidence demonstrating their conservation benefits. To better inform rural land-use planning, we evaluated clustered housing developments by comparing their spatial pattern with that of dispersed housing developments and by comparing their conservation value with that of both dispersed housing developments and undeveloped areas in Boulder County, Colorado. We used four indicators to assess conservation value: (1) densities of songbirds, (2) nest density and survival of ground-nesting birds, (3) presence of mammals, and (4) percent cover and proportion of native and non-native plant species. Clustered and dispersed housing developments did not differ on the majority of variables we examined. Both types of housing development had significantly higher densities of non-native and human-commensal species and significantly lower densities of native and human-sensitive species than undeveloped areas. More rigorous ecological guidelines and planning on a regional scale may help create clustered developments with higher conservation value.

  9. Fractal Clustering and Knowledge-driven Validation Assessment for Gene Expression Profiling.

    PubMed

    Wang, Lu-Yong; Balasubramanian, Ammaiappan; Chakraborty, Amit; Comaniciu, Dorin

    2005-01-01

    DNA microarray experiments generate a substantial amount of information about global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. A number of clustering techniques have been introduced. However, these traditional methods mainly utilize a shape-based assumption or some distance metric to cluster the points in multi-dimensional linear Euclidean space. Their results show poor consistency with the functional annotation of genes in previous validation studies. From a different perspective, we propose a fractal clustering method to cluster genes using intrinsic (fractal) dimension from modern geometry. This method clusters points in such a way that points in the same cluster are more self-affine among themselves than to the points in other clusters. We assess this method using annotation-based validation assessment for gene clusters. It shows that this method is superior to other traditional methods in identifying functionally related gene groups.

  10. Transcriptional Analysis and Subcellular Protein Localization Reveal Specific Features of the Essential WalKR System in Staphylococcus aureus

    PubMed Central

    Poupel, Olivier; Moyat, Mati; Groizeleau, Julie; Antunes, Luísa C. S.; Gribaldo, Simonetta; Msadek, Tarek; Dubrac, Sarah

    2016-01-01

    The WalKR two-component system, controlling cell wall metabolism, is highly conserved among Bacilli and essential for cell viability. In Staphylococcus aureus, walR and walK are followed by three genes of unknown function: walH, walI and walJ. Sequence analysis and transcript mapping revealed a unique genetic structure for this locus in S. aureus: the last gene of the locus, walJ, is transcribed independently, whereas transcription of the tetra-cistronic walRKHI operon occurred from two independent promoters located upstream from walR. Protein topology analysis and protein-protein interactions in E. coli as well as subcellular localization in S. aureus allowed us to show that WalH and WalI are membrane-bound proteins, which associate with WalK to form a complex at the cell division septum. While these interactions suggest that WalH and WalI play a role in activity of the WalKR regulatory pathway, deletion of walH and/or walI did not have a major effect on genes whose expression is strongly dependent on WalKR or on associated phenotypes. No effect of WalH or WalI was seen on tightly controlled WalKR regulon genes such as sle1 or saouhsc_00773, which encodes a CHAP-domain amidase. Of the genes encoding the two major S. aureus autolysins, AtlA and Sle1, only transcription of atlA was increased in the ΔwalH or ΔwalI mutants. Likewise, bacterial autolysis was not increased in the absence of WalH and/or WalI and biofilm formation was lowered rather than increased. Our results suggest that contrary to their major role as WalK inhibitors in B. subtilis, the WalH and WalI proteins have evolved a different function in S. aureus, where they are more accessory. A phylogenomic analysis shows a striking conservation of the 5 gene wal cluster along the evolutionary history of Bacilli, supporting the key importance of this signal transduction system, and indicating that the walH and walI genes were lost in the ancestor of Streptococcaceae, leading to their atypical 3 wal gene cluster, walRKJ. PMID:26999783

  11. Genome Sequencing of Sulfolobus sp. A20 from Costa Rica and Comparative Analyses of the Putative Pathways of Carbon, Nitrogen, and Sulfur Metabolism in Various Sulfolobus Strains.

    PubMed

    Dai, Xin; Wang, Haina; Zhang, Zhenfeng; Li, Kuan; Zhang, Xiaoling; Mora-López, Marielos; Jiang, Chengying; Liu, Chang; Wang, Li; Zhu, Yaxin; Hernández-Ascencio, Walter; Dong, Zhiyang; Huang, Li

    2016-01-01

    The genome of Sulfolobus sp. A20 isolated from a hot spring in Costa Rica was sequenced. This circular genome of the strain is 2,688,317 bp in size and 34.8% in G+C content, and contains 2591 open reading frames (ORFs). Strain A20 shares ~95.6% identity at the 16S rRNA gene sequence level and <30% DNA-DNA hybridization (DDH) values with the most closely related known Sulfolobus species (i.e., Sulfolobus islandicus and Sulfolobus solfataricus), suggesting that it represents a novel Sulfolobus species. Comparison of the genome of strain A20 with those of the type strains of S. solfataricus, Sulfolobus acidocaldarius, S. islandicus, and Sulfolobus tokodaii, which were isolated from geographically separated areas, identified 1801 genes conserved among all Sulfolobus species analyzed (core genes). Comparative genome analyses show that central carbon metabolism in Sulfolobus is highly conserved, and enzymes involved in the Entner-Doudoroff pathway, the tricarboxylic acid cycle and the CO2 fixation pathways are predominantly encoded by the core genes. All Sulfolobus species encode genes required for the conversion of ammonium into glutamate/glutamine. Some Sulfolobus strains have gained the ability to utilize additional nitrogen sources such as nitrate (i.e., S. islandicus strain REY15A, LAL14/1, M14.25, and M16.27) or urea (i.e., S. islandicus HEV10/4, S. tokodaii strain7, and S. metallicus DSM 6482). The strategies for sulfur metabolism are most diverse and least understood. S. tokodaii encodes sulfur oxygenase/reductase (SOR), whereas both S. islandicus and S. solfataricus contain genes for sulfur reductase (SRE). However, neither SOR nor SRE genes exist in the genome of strain A20, raising the possibility that an unknown pathway for the utilization of elemental sulfur may be present in the strain. The ability of Sulfolobus to utilize nitrate or sulfur is encoded by a gene cluster flanked by IS elements or their remnants. These clusters appear to have become fixed at a specific genomic site in some strains and lost in other strains during the course of evolution. The versatility in nitrogen and sulfur metabolism may represent adaptation of Sulfolobus to thriving in different habitats.

  12. Genomic Comparison of Non-Typhoidal Salmonella enterica Serovars Typhimurium, Enteritidis, Heidelberg, Hadar and Kentucky Isolates from Broiler Chickens.

    PubMed

    Dhanani, Akhilesh S; Block, Glenn; Dewar, Ken; Forgetta, Vincenzo; Topp, Edward; Beiko, Robert G; Diarra, Moussa S

    2015-01-01

    Non-typhoidal Salmonella enterica serovars, associated with different foods including poultry products, are important causes of bacterial gastroenteritis worldwide. The colonization of the chicken gut by S. enterica could result in the contamination of the environment and food chain. The aim of this study was to compare the genomes of 25 S. enterica serovars isolated from broiler chicken farms to assess their intra- and inter-genetic variability, with a focus on virulence and antibiotic resistance characteristics. The genomes of 25 S. enterica isolates covering five serovars (ten Typhimurium including three monophasic 4,[5],12:i:, four Enteritidis, three Hadar, four Heidelberg and four Kentucky) were sequenced. Most serovars were clustered in strongly supported phylogenetic clades, except for isolates of serovar Enteritidis that were scattered throughout the tree. Plasmids of varying sizes were detected in several isolates independently of serovars. Genes associated with the IncF plasmid and the IncI1 plasmid were identified in twelve and four isolates, respectively, while genes associated with the IncQ plasmid were found in one isolate. The presence of numerous genes associated with Salmonella pathogenicity islands (SPIs) was also confirmed. Components of the type III and IV secretion systems (T3SS and T4SS) varied in different isolates, which could explain, in part, differences in their pathogenicity in humans and/or persistence in broilers. Conserved clusters of genes in the T3SS were detected that could be used in designing effective strategies (diagnostic, vaccination or treatments) to combat Salmonella. Antibiotic resistance genes (CMY, aadA, ampC, florR, sul1, sulI, tetAB, and srtA) and class I integrons were detected in resistant isolates while all isolates carried multidrug efflux pump systems regardless of their antibiotic susceptibility profile. This study showed that the predominant Salmonella serovars in broiler chickens harbor genes encoding adhesins, flagellar proteins, T3SS, iron acquisition systems, and antibiotic and metal resistance genes that may explain their pathogenicity, colonization ability and persistence in chickens. The existence of mobile genetic elements indicates that isolates from a given serovar could acquire and transfer genetic material. Conserved genes in the T3SS and T4SS that we have identified are promising candidates for identification of diagnostic, antimicrobial or vaccine targets for the control of Salmonella in broiler chickens.

  13. Genome Sequencing of Sulfolobus sp. A20 from Costa Rica and Comparative Analyses of the Putative Pathways of Carbon, Nitrogen, and Sulfur Metabolism in Various Sulfolobus Strains

    PubMed Central

    Dai, Xin; Wang, Haina; Zhang, Zhenfeng; Li, Kuan; Zhang, Xiaoling; Mora-López, Marielos; Jiang, Chengying; Liu, Chang; Wang, Li; Zhu, Yaxin; Hernández-Ascencio, Walter; Dong, Zhiyang; Huang, Li

    2016-01-01

    The genome of Sulfolobus sp. A20 isolated from a hot spring in Costa Rica was sequenced. This circular genome of the strain is 2,688,317 bp in size and 34.8% in G+C content, and contains 2591 open reading frames (ORFs). Strain A20 shares ~95.6% identity at the 16S rRNA gene sequence level and <30% DNA-DNA hybridization (DDH) values with the most closely related known Sulfolobus species (i.e., Sulfolobus islandicus and Sulfolobus solfataricus), suggesting that it represents a novel Sulfolobus species. Comparison of the genome of strain A20 with those of the type strains of S. solfataricus, Sulfolobus acidocaldarius, S. islandicus, and Sulfolobus tokodaii, which were isolated from geographically separated areas, identified 1801 genes conserved among all Sulfolobus species analyzed (core genes). Comparative genome analyses show that central carbon metabolism in Sulfolobus is highly conserved, and enzymes involved in the Entner-Doudoroff pathway, the tricarboxylic acid cycle and the CO2 fixation pathways are predominantly encoded by the core genes. All Sulfolobus species encode genes required for the conversion of ammonium into glutamate/glutamine. Some Sulfolobus strains have gained the ability to utilize additional nitrogen sources such as nitrate (i.e., S. islandicus strain REY15A, LAL14/1, M14.25, and M16.27) or urea (i.e., S. islandicus HEV10/4, S. tokodaii strain7, and S. metallicus DSM 6482). The strategies for sulfur metabolism are most diverse and least understood. S. tokodaii encodes sulfur oxygenase/reductase (SOR), whereas both S. islandicus and S. solfataricus contain genes for sulfur reductase (SRE). However, neither SOR nor SRE genes exist in the genome of strain A20, raising the possibility that an unknown pathway for the utilization of elemental sulfur may be present in the strain. The ability of Sulfolobus to utilize nitrate or sulfur is encoded by a gene cluster flanked by IS elements or their remnants. These clusters appear to have become fixed at a specific genomic site in some strains and lost in other strains during the course of evolution. The versatility in nitrogen and sulfur metabolism may represent adaptation of Sulfolobus to thriving in different habitats. PMID:27965637

  14. Transcriptional Analysis and Subcellular Protein Localization Reveal Specific Features of the Essential WalKR System in Staphylococcus aureus.

    PubMed

    Poupel, Olivier; Moyat, Mati; Groizeleau, Julie; Antunes, Luísa C S; Gribaldo, Simonetta; Msadek, Tarek; Dubrac, Sarah

    2016-01-01

    The WalKR two-component system, controlling cell wall metabolism, is highly conserved among Bacilli and essential for cell viability. In Staphylococcus aureus, walR and walK are followed by three genes of unknown function: walH, walI and walJ. Sequence analysis and transcript mapping revealed a unique genetic structure for this locus in S. aureus: the last gene of the locus, walJ, is transcribed independently, whereas transcription of the tetra-cistronic walRKHI operon occurred from two independent promoters located upstream from walR. Protein topology analysis and protein-protein interactions in E. coli as well as subcellular localization in S. aureus allowed us to show that WalH and WalI are membrane-bound proteins, which associate with WalK to form a complex at the cell division septum. While these interactions suggest that WalH and WalI play a role in activity of the WalKR regulatory pathway, deletion of walH and/or walI did not have a major effect on genes whose expression is strongly dependent on WalKR or on associated phenotypes. No effect of WalH or WalI was seen on tightly controlled WalKR regulon genes such as sle1 or saouhsc_00773, which encodes a CHAP-domain amidase. Of the genes encoding the two major S. aureus autolysins, AtlA and Sle1, only transcription of atlA was increased in the ΔwalH or ΔwalI mutants. Likewise, bacterial autolysis was not increased in the absence of WalH and/or WalI and biofilm formation was lowered rather than increased. Our results suggest that contrary to their major role as WalK inhibitors in B. subtilis, the WalH and WalI proteins have evolved a different function in S. aureus, where they are more accessory. A phylogenomic analysis shows a striking conservation of the 5 gene wal cluster along the evolutionary history of Bacilli, supporting the key importance of this signal transduction system, and indicating that the walH and walI genes were lost in the ancestor of Streptococcaceae, leading to their atypical 3 wal gene cluster, walRKJ.

  15. ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation.

    PubMed

    Kristensen, David M; Wolf, Yuri I; Koonin, Eugene V

    2017-01-04

    The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genomes that meet the objective criteria of a high degree of local gene order (synteny) and a small number of synonymous substitutions in the protein-coding genes. As such, each ATGC is suited for analysis of microevolutionary variations within a cohesive group of organisms (e.g. species), whereas the entire collection of ATGCs is useful for macroevolutionary studies. The ATGC database includes many forms of pre-computed data, in particular ATGC-COGs (Clusters of Orthologous Genes), multiple sequence alignments, a set of 'index' orthologs representing the most well-conserved members of each ATGC-COG, the phylogenetic tree of the organisms within each ATGC, etc. Although the ATGC database contains several million proteins from thousands of genomes organized into hundreds of clusters (roughly a 4-fold increase since the last version of the ATGC database), it is now built with completely automated methods and will be regularly updated following new releases of the NCBI RefSeq database. The ATGC database is hosted jointly at the University of Iowa at dmk-brain.ecn.uiowa.edu/ATGC/ and the NCBI at ftp.ncbi.nlm.nih.gov/pub/kristensen/ATGC/atgc_home.html. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  16. Phylogenetic analysis of Helicobacter pylori cagA gene of Turkish isolates and the association with gastric pathology

    PubMed Central

    2013-01-01

    Background The cagA gene is one of the important virulence factors of Helicobacter pylori. The diversity of the cagA 5′ conserved region is thought to reflect the phylogenetic relationships between different H. pylori isolates and their association with peptic ulceration. Significant geographical differences among isolates have been reported. The aim of this study is to compare Turkish H. pylori isolates with isolates from different geographical locations and to correlate the association with peptic ulceration. Methods A total of 52 isolates, of which 19 were Turkish and 33 were from other geographic locations, were studied. Gastric antral biopsies collected from 19 Turkish patients (gastritis = 12, ulcer = 7) were used to amplify the cagA 5′ region by PCR, followed by DNA sequencing. Results The phylogenetic tree displayed 3 groups: A) a mix of 2 sub-groups “Asian” and “African/Anatolian/Asian/European”, B) “Anatolian/European” and C) “American-Indian”. Turkish H. pylori isolates clustered in the mixed sub-group A were mostly from gastritis patients while those clustered in group B were from peptic ulcer patients. A phylogenetic tree constructed for our Turkish isolates detected distinctive features among those from gastritis and ulcer patients. We have found that 2/3 of the gastritis isolates were clustered alone while 1/3 was clustered together with the ulcer isolates. Several amino acids were found to be shared between the latter groups but not with the first group of gastritis isolates. Conclusions This study provided an additional insight into the profile of our cagA gene which implies a relationship in geographic locations of the isolates. PMID:24245965

  17. Divergence of Erv1-Associated Mitochondrial Import and Export Pathways in Trypanosomes and Anaerobic Protists

    PubMed Central

    Basu, Somsuvro; Leonard, Joanne C.; Desai, Nishal; Mavridou, Despoina A. I.; Tang, Kong Ho; Goddard, Alan D.

    2013-01-01

    In yeast (Saccharomyces cerevisiae) and animals, the sulfhydryl oxidase Erv1 functions with Mia40 in the import and oxidative folding of numerous cysteine-rich proteins in the mitochondrial intermembrane space (IMS). Erv1 is also required for Fe-S cluster assembly in the cytosol, which uses at least one mitochondrially derived precursor. Here, we characterize an essential Erv1 orthologue from the protist Trypanosoma brucei (TbERV1), which naturally lacks a Mia40 homolog. We report kinetic parameters for physiologically relevant oxidants cytochrome c and O2, unexpectedly find O2 and cytochrome c are reduced simultaneously, and demonstrate that efficient reduction of O2 by TbERV1 is not dependent upon a simple O2 channel defined by conserved histidine and tyrosine residues. Massive mitochondrial swelling following TbERV1 RNA interference (RNAi) provides evidence that trypanosome Erv1 functions in IMS protein import despite the natural absence of the key player in the yeast and animal import pathways, Mia40. This suggests significant evolutionary divergence from a recently established paradigm in mitochondrial cell biology. Phylogenomic profiling of genes also points to a conserved role for TbERV1 in cytosolic Fe-S cluster assembly. Conversely, loss of genes implicated in precursor delivery for cytosolic Fe-S assembly in Entamoeba, Trichomonas, and Giardia suggests fundamental differences in intracellular trafficking pathways for activated iron or sulfur species in anaerobic versus aerobic eukaryotes. PMID:23264646

  18. Genetic diversity, population structure and marker-trait associations for agronomic and grain traits in wild diploid wheat Triticum urartu.

    PubMed

    Wang, Xin; Luo, Guangbin; Yang, Wenlong; Li, Yiwen; Sun, Jiazhu; Zhan, Kehui; Liu, Dongcheng; Zhang, Aimin

    2017-07-01

    Wild diploid wheat, Triticum urartu (T. urartu) is the progenitor of bread wheat, and understanding its genetic diversity and genome function will provide considerable reference for dissecting genomic information of common wheat. In this study, we investigated the morphological and genetic diversity and population structure of 238 T. urartu accessions collected from different geographic regions. This collection had 19.37 alleles per SSR locus and its polymorphic information content (PIC) value was 0.76, and the PIC and Nei's gene diversity (GD) of high-molecular-weight glutenin subunits (HMW-GSs) were 0.86 and 0.88, respectively. UPGMA clustering analysis indicated that the 238 T. urartu accessions could be classified into two subpopulations, of which Cluster I contained accessions from Eastern Mediterranean coast and those from Mesopotamia and Transcaucasia belonged to Cluster II. The wide range of genetic diversity along with the manageable number of accessions makes it one of the best collections for mining valuable genes based on marker-trait association. Significant associations were observed between simple sequence repeats (SSR) or HMW-GSs and six morphological traits: heading date (HD), plant height (PH), spike length (SPL), spikelet number per spike (SPLN), tiller angle (TA) and grain length (GL). Our data demonstrated that SSRs and HMW-GSs were useful markers for identification of beneficial genes controlling important traits in T. urartu, and subsequently for their conservation and future utilization, which may be useful for genetic improvement of the cultivated hexaploid wheat.
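
    The PIC and gene diversity values quoted above are both functions of allele frequencies at a locus. The following is a minimal sketch, not the authors' pipeline: it assumes the uncorrected form of Nei's gene diversity (1 minus the sum of squared allele frequencies) and the standard Botstein-style PIC formula. The function names and the toy frequencies are hypothetical.

```python
from math import isclose


def gene_diversity(freqs):
    """Nei's gene diversity (uncorrected): 1 - sum(p_i^2)."""
    if not isclose(sum(freqs), 1.0, abs_tol=1e-9):
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """Polymorphic information content:
    1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    n = len(freqs)
    cross = sum(
        2.0 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(n)
        for j in range(i + 1, n)
    )
    return gene_diversity(freqs) - cross


# Toy locus (hypothetical): two equally frequent alleles.
toy_locus = [0.5, 0.5]
print(gene_diversity(toy_locus))  # 0.5
print(pic(toy_locus))             # 0.375
```

    PIC is never larger than gene diversity for the same locus, and the gap shrinks as the number of alleles grows.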

  19. Finding gene clusters for a replicated time course study

    PubMed Central

    2014-01-01

    Background Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. Findings In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. Conclusions The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism. PMID:24460656

  20. The genome sequence and effector complement of the flax rust pathogen Melampsora lini.

    PubMed

    Nemri, Adnane; Saunders, Diane G O; Anderson, Claire; Upadhyaya, Narayana M; Win, Joe; Lawrence, Gregory J; Jones, David A; Kamoun, Sophien; Ellis, Jeffrey G; Dodds, Peter N

    2014-01-01

    Rust fungi cause serious yield reductions on crops, including wheat, barley, soybean, coffee, and represent real threats to global food security. Of these fungi, the flax rust pathogen Melampsora lini has been developed most extensively over the past 80 years as a model to understand the molecular mechanisms that underpin pathogenesis. During infection, M. lini secretes virulence effectors to promote disease. The number of these effectors, their function and their degree of conservation across rust fungal species is unknown. To assess this, we sequenced and assembled de novo the genome of M. lini isolate CH5 into 21,130 scaffolds spanning 189 Mbp (scaffold N50 of 31 kbp). Global analysis of the DNA sequence revealed that repetitive elements, primarily retrotransposons, make up at least 45% of the genome. Using ab initio predictions, transcriptome data and homology searches, we identified 16,271 putative protein-coding genes. An analysis pipeline was then implemented to predict the effector complement of M. lini and compare it to that of the poplar rust, wheat stem rust and wheat stripe rust pathogens to identify conserved and species-specific effector candidates. Previous knowledge of four cloned M. lini avirulence effector proteins and two basidiomycete effectors was used to optimize parameters of the effector prediction pipeline. Markov clustering based on sequence similarity was performed to group effector candidates from all four rust pathogens. Clusters containing at least one member from M. lini were further analyzed and prioritized based on features including expression in isolated haustoria and infected leaf tissue and conservation across rust species. Herein, we describe 200 of 940 clusters that ranked highest on our priority list, representing 725 flax rust candidate effectors. 
    Our findings on this important model rust species provide insight into how effectors of rust fungi are conserved across species and how they may act to promote infection on their hosts.
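
    The scaffold N50 reported in this abstract is a standard assembly statistic: the length of the scaffold at which the running total of scaffold lengths, taken from longest to shortest, first reaches half of the assembly size. A minimal sketch of that calculation follows; the function name and the toy lengths are hypothetical and are not taken from the M. lini assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths."""
    if not lengths:
        raise ValueError("at least one scaffold length is required")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first scaffold that brings the running total to at least
        # half of the assembly size defines the N50.
        if running >= half_total:
            return length


# Toy assembly (hypothetical lengths, arbitrary units); total is 300.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # 70
```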

  1. A Locus Encoding Variable Defense Systems against Invading DNA Identified in Streptococcus suis

    PubMed Central

    Okura, Masatoshi; Nozawa, Takashi; Watanabe, Takayasu; Murase, Kazunori; Nakagawa, Ichiro; Takamatsu, Daisuke; Osaki, Makoto; Sekizaki, Tsutomu; Gottschalk, Marcelo; Hamada, Shigeyuki

    2017-01-01

    Streptococcus suis, an important zoonotic pathogen, is known to have an open pan-genome and to develop a competent state. In S. suis, limited genetic lineages are suggested to be associated with zoonosis. However, little is known about the evolution of diversified lineages and their respective phenotypic or ecological characteristics. In this study, we performed comparative genome analyses of S. suis, with a focus on the competence genes, mobile genetic elements, and genetic elements related to various defense systems against exogenous DNAs (defense elements) that are associated with gene gain/loss/exchange mediated by horizontal DNA movements and their restrictions. Our genome analyses revealed a conserved competence-inducing peptide type (pherotype) of the competence system and large-scale genome rearrangements in certain clusters based on the genome phylogeny of 58 S. suis strains. Moreover, the profiles of the defense elements were similar or identical to each other among the strains belonging to the same genomic clusters. Our findings suggest that these genetic characteristics of each cluster might exert specific effects on the phenotypic or ecological differences between the clusters. We also found certain loci that shift several types of defense elements in S. suis. Of note, one of these loci is a previously unrecognized variable region in bacteria, at which strains of distinct clusters code for different and various defense elements. This locus might represent a novel defense mechanism that has evolved through an arms race between bacteria and invading DNAs, mediated by mobile genetic elements and genetic competence. PMID:28379509

  2. Diversity of Two-Domain Laccase-Like Multicopper Oxidase Genes in Streptomyces spp.: Identification of Genes Potentially Involved in Extracellular Activities and Lignocellulose Degradation during Composting of Agricultural Waste

    PubMed Central

    Lu, Lunhui; Zhang, Jiachao; Chen, Anwei; Chen, Ming; Jiang, Min; Yuan, Yujie; Wu, Haipeng; Lai, Mingyong; He, Yibin

    2014-01-01

    Traditional three-domain fungal and bacterial laccases have been extensively studied for their significance in various biotechnological applications. Growing molecular evidence points to a wide occurrence of more recently recognized two-domain laccase-like multicopper oxidase (LMCO) genes in Streptomyces spp. However, the current knowledge about their ecological role and distribution in natural or artificial ecosystems is insufficient. The aim of this study was to investigate the diversity and composition of Streptomyces two-domain LMCO genes in agricultural waste composting, which will contribute to the understanding of the ecological function of Streptomyces two-domain LMCOs with potential extracellular activity and ligninolytic capacity. A new specific PCR primer pair was designed to target the two conserved copper binding regions of Streptomyces two-domain LMCO genes. The obtained sequences mainly clustered with Streptomyces coelicolor, Streptomyces violaceusniger, and Streptomyces griseus. Gene libraries retrieved from six composting samples revealed high diversity and a rapid succession of Streptomyces two-domain LMCO genes during composting. The obtained sequence types cluster in 8 distinct clades, most of which are homologous with Streptomyces two-domain LMCO genes, but the sequences of clades III and VIII do not match with any reference sequence of known streptomycetes. Both lignocellulose degradation rates and phenol oxidase activity at pH 8.0 in the composting process were found to be positively associated with the abundance of Streptomyces two-domain LMCO genes. These observations provide important clues that Streptomyces two-domain LMCOs are potentially involved in bacterial extracellular phenol oxidase activities and lignocellulose breakdown during agricultural waste composting. PMID:24657870

  3. The COG database: an updated version includes eukaryotes

    PubMed Central

    Tatusov, Roman L; Fedorova, Natalie D; Jackson, John D; Jacobs, Aviva R; Kiryutin, Boris; Koonin, Eugene V; Krylov, Dmitri M; Mazumder, Raja; Mekhedov, Sergei L; Nikolskaya, Anastasia N; Rao, B Sridhar; Smirnov, Sergei; Sverdlov, Alexander V; Vasudevan, Sona; Wolf, Yuri I; Yin, Jodie J; Natale, Darren A

    2003-01-01

    Background The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. 
    This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. Conclusion The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies. PMID:12969510

  4. Characterization and Comparative Profiling of MiRNA Transcriptomes in Bighead Carp and Silver Carp

    PubMed Central

    Chi, Wei; Tong, Chaobo; Gan, Xiaoni; He, Shunping

    2011-01-01

    MicroRNAs (miRNAs) are small non-coding RNA molecules that are processed from large ‘hairpin’ precursors and function as post-transcriptional regulators of target genes. Although many individual miRNAs have recently been extensively studied, there has been very little research on miRNA transcriptomes in teleost fishes. By using high throughput sequencing technology, we have identified 167 and 166 conserved miRNAs (belonging to 108 families) in bighead carp (Hypophthalmichthys nobilis) and silver carp (Hypophthalmichthys molitrix), respectively. We compared the expression patterns of conserved miRNAs by means of hierarchical clustering analysis and log2 ratio. Results indicated that there is not a strong correlation between sequence conservation and expression conservation; most of these miRNAs have similar expression patterns. However, high expression differences were also identified for several individual miRNAs. Several miRNA* sequences were also found in our dataset and some of them may have regulatory functions. Two computational strategies were used to identify novel miRNAs from un-annotated data in the two carps. The first strategy, based on the zebrafish genome, identified 8 and 22 novel miRNAs in bighead carp and silver carp, respectively. We postulate that these miRNAs should also exist in the zebrafish, but the methodologies used have not allowed for their detection. In the second strategy we obtained several carp-specific miRNAs, 31 in bighead carp and 32 in silver carp, which showed low expression. Gain and loss of family members were observed in several miRNA families, which suggests that duplication of animal miRNA genes may occur through evolutionary processes which are similar to those of the protein-coding genes. PMID:21858165

  5. Regulation of the cnr Cobalt and Nickel Resistance Determinant from Ralstonia sp. Strain CH34†

    PubMed Central

    Grass, Gregor; Große, Cornelia; Nies, Dietrich H.

    2000-01-01

    Ralstonia sp. strain CH34 is resistant to nickel and cobalt cations. Resistance is mediated by the cnr determinant located on plasmid pMOL28. The cnr genes are organized in two clusters, cnrYXH and cnrCBA. As revealed by reverse transcriptase PCR and primer extension, transcription from these operons is initiated from promoters located upstream of the cnrY and cnrC genes. These two promoters exhibit conserved sequences at the −10 (CCGTATA) and −35 (CRAGGGGRAG) regions. The CnrH gene product, which is required for expression of both operons, is a sigma factor belonging to the sigma L family, whose activity seems to be governed by the membrane-bound CnrY and CnrX gene products in response to Ni2+. Half-maximal activation from the cnrCBA operon was determined by using appropriate lacZ gene fusions and was shown to occur at an Ni2+ concentration of about 50 μM. PMID:10671463
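
    The −35 consensus quoted above (CRAGGGGRAG) uses the IUPAC ambiguity code R, which stands for a purine (A or G). A minimal sketch of how such a consensus can be turned into a regular expression and located in a sequence is given below. The helper names, the toy sequence and its 17-nt spacer are hypothetical illustrations, not data from the cnr determinant.

```python
import re

# Subset of IUPAC nucleotide codes; R (purine) is the only ambiguity
# code needed for the consensus sequences quoted in the abstract.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}


def consensus_to_regex(consensus):
    """Compile an IUPAC consensus string into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus))


def find_sites(consensus, sequence):
    """Return 0-based start positions of non-overlapping matches."""
    return [m.start() for m in consensus_to_regex(consensus).finditer(sequence)]


# Toy promoter (hypothetical): 2-nt flank, a -35-like box, an arbitrary
# 17-nt spacer, a -10-like box, and a 2-nt flank.
toy = "TT" + "CAAGGGGGAG" + "ACGTACGTACGTACGTA" + "CCGTATA" + "GG"

print(find_sites("CRAGGGGRAG", toy))  # [2]
print(find_sites("CCGTATA", toy))     # [29]
```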

  6. Genomic Identification and Analysis of Shared Cis-regulator Elements in a Developmentally Critical homeobox Cluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chris Amemiya

    2003-04-01

    The goals of this project were to isolate, characterize, and sequence the Dlx3/Dlx7 bigene cluster from twelve different species of mammals. The Dlx3 and Dlx7 genes are known to encode homeobox transcription factors involved in patterning of structures in the vertebrate jaw as well as vertebrate limbs. Genomic sequences from the respective taxa will subsequently be compared in order to identify conserved non-coding sequences that are potential cis-regulatory elements. Based on the comparisons they will fashion transgenic mouse experiments to functionally test the strength of the potential cis-regulatory elements. A goal of the project is to attempt to identify those elements that may function in coordinately regulating both Dlx3 and Dlx7 functions.

  7. Multiconstrained gene clustering based on generalized projections

    PubMed Central

    2010-01-01

    Background Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple constraints for an optimal clustering solution still remains an unsolved problem. Results We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints of different nature without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence in more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions The POCS-based MGC method can successfully combine multiple constraints of different nature for gene clustering. Also, the proposed GLL is an effective performance measure for the soft clustering solutions. PMID:20356386

  8. A human DAZ transgene confers partial rescue of the mouse Dazl null phenotype

    PubMed Central

    Slee, R.; Grimes, B.; Speed, R. M.; Taggart, M.; Maguire, S. M.; Ross, A.; McGill, N. I.; Saunders, P. T. K.; Cooke, H. J.

    1999-01-01

    In a subset of infertile men, a spectrum of spermatogenic defects ranging from a complete absence of germ cells (Sertoli cell only) to oligozoospermia is associated with microdeletions of the DAZ (deleted in azoospermia) gene cluster on human distal Yq. DAZ encodes a testis-specific protein with RNA-binding potential recently derived from a single-copy gene DAZL1 (DAZ-like) on chromosome 3. Y chromosomal DAZ homologues are confined to humans and higher primates. It remains unclear which function unique to higher primate spermatogenesis DAZ may serve, and the functional status of the gene recently has been questioned. To assess the extent of functional conservation we have tested the capacity of a human DAZ gene contained in a 225-kb yeast artificial chromosome to complement the sterile phenotype of the Dazl null mouse (Dazl−/−), which is characterized by severe germ-cell depletion and meiotic failure. Although Dazl−/− mice remained infertile when the DAZ transgene was introduced, histological examination revealed a partial and variable rescue of the mutant phenotype, manifest as a pronounced increase in the germ cell population of the seminiferous tubules and survival to the pachytene stage of meiosis. As well as constituting definitive proof of the spermatogenic role of the DAZ gene product, these findings confirm the high degree of functional conservation between the DAZ and DAZL1 genes, suggesting they may constitute a single target for contraceptive intervention and raising the possibility of therapeutic up-regulation of the DAZL1 gene in infertile men. PMID:10393944

  9. Characterization of Erwinia chrysanthemi by pectinolytic isozyme polymorphism and restriction fragment length polymorphism analysis of PCR-amplified fragments of pel genes.

    PubMed Central

    Nassar, A; Darrasse, A; Lemattre, M; Kotoujansky, A; Dervin, C; Vedel, R; Bertheau, Y

    1996-01-01

    Conserved regions about 420 bp long of the pelADE cluster specific to Erwinia chrysanthemi were amplified by PCR and used to differentiate 78 strains of E. chrysanthemi that were obtained from different hosts and geographical areas. No PCR products were obtained from DNA samples extracted from other pectinolytic and nonpectinolytic species and genera. The pel fragments amplified from the E. chrysanthemi strains studied were compared by performing a restriction fragment length polymorphism (RFLP) analysis. On the basis of similarity coefficients derived from the RFLP analysis, the strains were separated into 16 PCR RFLP patterns grouped in six clusters. These clusters appeared to be correlated with other infraspecific levels of E. chrysanthemi classification, such as pathovar and biovar, and occasionally with geographical origin. Moreover, the clusters correlated well with the polymorphism of pectate lyase and pectin methylesterase isoenzymes. While the pectin methylesterase profiles correlated with host monocot-dicot classification, the pectate lyase polymorphism might reflect the cell wall microdomains of the plants belonging to these classes. PMID:8779560

  10. Mutagenesis of NosM Leader Peptide Reveals Important Elements in Nosiheptide Biosynthesis

    PubMed Central

    Jin, Liang; Wu, Xuri; Xue, Yanjiu; Jin, Yue; Wang, Shuzhen

    2016-01-01

    ABSTRACT Nosiheptide, a typical member of the ribosomally synthesized and posttranslationally modified peptides (RiPPs), exhibits potent activity against multidrug-resistant Gram-positive bacterial pathogens. The precursor peptide of nosiheptide (NosM) is comprised of a leader peptide with 37 amino acids and a core peptide containing 13 amino acids. To pinpoint elements in the leader peptide that are essential for nosiheptide biosynthesis, a collection of mutants with unique sequence features, including N- and C-terminal motifs, peptide length, and specific sites in the leader peptide, was generated by mutagenesis in vivo. The effects of various mutants on nosiheptide biosynthesis were evaluated. In addition to the necessity of a conserved motif LEIS box, native length and the N-terminal 12 amino acid residues were indispensable, and single-site substitutions of these 12 amino acid residues resulted in changes ranging from a greater-than-5-fold decrease to a 2-fold increase of nosiheptide production, depending on the sites and substituted residues. Moreover, although the C-terminal motif is not conserved, significant effects of this portion on nosiheptide production were also evident. Taken together, the present results further highlight the importance of the leader peptide in nosiheptide biosynthesis, and provide new insights into the diversity and specificity of leader peptides in the biosynthesis of various RiPPs. IMPORTANCE As a representative thiopeptide, nosiheptide exhibits excellent antibacterial activity. Although the biosynthetic gene cluster and several modification steps have been revealed, the presence and roles of the leader peptide within the precursor peptide of the nosiheptide gene cluster remain elusive. Thus, identification of specific elements in the leader peptide can significantly facilitate the genetic manipulation of the gene cluster for increasing nosiheptide production or generating diverse analogues. 
    Given the complexity of the biosynthetic process, the instability of the leader peptide, and the unavailability of intermediates, cocrystallization of intermediates, leader peptide, and modification enzymes is currently not feasible. Therefore, a mutagenesis approach was used to construct a series of leader peptide mutants to uncover a number of crucial and characteristic elements affecting nosiheptide biosynthesis, which moves a considerable distance toward a thorough understanding of the biosynthetic machinery for thiopeptides. PMID:27913416

  11. Nucleotide sequence analysis reveals linked N-acetyl hydrolase, thioesterase, transport, and regulatory genes encoded by the bialaphos biosynthetic gene cluster of Streptomyces hygroscopicus.

    PubMed Central

    Raibaud, A; Zalacain, M; Holt, T G; Tizard, R; Thompson, C J

    1991-01-01

    Nucleotide sequence analysis of a 5,000-bp region of the bialaphos antibiotic production (bap) gene cluster defined five open reading frames (ORFs) which predicted structural genes in the order bah, ORF1, ORF2, and ORF3 followed by the regulatory gene, brpA (H. Anzai, T. Murakami, S. Imai, A. Satoh, K. Nagaoka, and C.J. Thompson, J. Bacteriol. 169:3482-3488, 1987). The four structural genes were translationally coupled and apparently cotranscribed from an undefined promoter(s) under the positive control of the brpA gene product. S1 mapping experiments indicated that brpA was transcribed by two promoters (brpAp1 and brpAp2) which initiate transcription 150 and 157 bp upstream of brpA within an intergenic region and at least one promoter further upstream within the bap gene cluster (brpAp3). All three transcripts were present at low levels during exponential growth and increased just before the stationary phase. The levels of the brpAp3 band continued to increase at the onset of stationary phase, whereas brpAp1- and brpAp2-protected fragments showed no further change. BrpA contained a possible helix-turn-helix motif at its C terminus which was similar to the C-terminal regulatory motif found in the receiver component of a family of two-component transcriptional activator proteins. This motif was not associated with the N-terminal domain conserved in other members of the family. The structural gene cluster sequenced began with bah, encoding a bialaphos acetylhydrolase which removes the N-acetyl group from bialaphos as one of the final steps in the biosynthetic pathway. The observation that Bah was similar to a rat and to a bacterial (Acinetobacter calcoaceticus) lipase probably reflects the fact that the ester bonds of triglycerides and the amide bond linking acetate to phosphinothricin are similar and hydrolysis is catalyzed by structurally related enzymes. 
    This was followed by two regions encoding ORF1 and ORF2 which were similar to each other (48% nucleotide identity, 31% amino acid identity), as well as to GrsT, a protein encoded by a gene located adjacent to gramicidin S synthetase in Bacillus brevis, and to vertebrate (mallard duck and rat) thioesterases. The amino acid sequence and hydrophobicity profile of ORF3 indicated that it was related to a family of membrane transport proteins. It was strikingly similar to the citrate uptake protein encoded by the transposon Tn3411. PMID:2066341

  12. Motif-independent prediction of a secondary metabolism gene cluster using comparative genomics: application to sequenced genomes of Aspergillus and ten other filamentous fungal species.

    PubMed

    Takeda, Itaru; Umemura, Myco; Koike, Hideaki; Asai, Kiyoshi; Machida, Masayuki

    2014-08-01

    Despite their biological importance, a significant number of genes for secondary metabolite biosynthesis (SMB) remain undetected due largely to the fact that they are highly diverse and are not expressed under a variety of cultivation conditions. Several software tools including SMURF and antiSMASH have been developed to predict fungal SMB gene clusters by finding core genes encoding polyketide synthase, nonribosomal peptide synthetase and dimethylallyltryptophan synthase as well as several others typically present in the cluster. In this work, we have devised a novel comparative genomics method to identify SMB gene clusters that is independent of motif information of the known SMB genes. The method detects SMB gene clusters by searching for a similar order of genes and their presence in nonsyntenic blocks. With this method, we were able to identify many known SMB gene clusters with the core genes in the genomic sequences of 10 filamentous fungi. Furthermore, we have also detected SMB gene clusters without core genes, including the kojic acid biosynthesis gene cluster of Aspergillus oryzae. By varying the detection parameters of the method, a significant difference in the sequence characteristics was detected between the genes residing inside the clusters and those outside the clusters. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  13. Phylogeography of the Koala, (Phascolarctos cinereus), and Harmonising Data to Inform Conservation

    PubMed Central

    Neaves, Linda E.; Frankham, Greta J.; Dennison, Siobhan; FitzGibbon, Sean; Flannagan, Cheyne; Gillett, Amber; Hynes, Emily; Handasyde, Kathrine; Helgen, Kristofer M.; Tsangaras, Kyriakos; Greenwood, Alex D.; Eldridge, Mark D. B.; Johnson, Rebecca N.

    2016-01-01

    The Australian continent exhibits complex biogeographic patterns but studies of the impacts of Pleistocene climatic oscillation on the mesic environments of the Southern Hemisphere are limited. The koala (Phascolarctos cinereus), one of Australia’s most iconic species, was historically widely distributed throughout much of eastern Australia but currently represents a complex conservation challenge. To better understand the challenges to koala genetic health, we assessed the phylogeographic history of the koala. Variation in the maternally inherited mitochondrial DNA (mtDNA) Control Region (CR) was examined in 662 koalas sampled throughout their distribution. In addition, koala CR haplotypes accessioned to Genbank were evaluated and consolidated. A total of 53 unique CR haplotypes have been isolated from koalas to date (including 15 haplotypes novel to this study). The relationships among koala CR haplotypes were indicative of a single Evolutionary Significant Unit and do not support the recognition of subspecies, but were separated into four weakly differentiated lineages which correspond to three geographic clusters: a central lineage, a southern lineage and two northern lineages co-occurring north of Brisbane. The three geographic clusters were separated by known Pleistocene biogeographic barriers: the Brisbane River Valley and Clarence River Valley, although there was evidence of mixing amongst clusters. While there is evidence for historical connectivity, current koala populations exhibit greater structure, suggesting habitat fragmentation may have restricted female-mediated gene flow. Since mtDNA data informs conservation planning, we provide a summary of existing CR haplotypes, standardise nomenclature and make recommendations for future studies to harmonise existing datasets. This holistic approach is critical to ensuring management is effective and small scale local population studies can be integrated into a wider species context. PMID:27588685

  14. Genome-wide identification of physically clustered genes suggests chromatin-level co-regulation in male reproductive development in Arabidopsis thaliana

    PubMed Central

    Reimegård, Johan; Kundu, Snehangshu; Pendle, Ali; Irish, Vivian F.; Shaw, Peter

    2017-01-01

    Co-expression of physically linked genes occurs surprisingly frequently in eukaryotes. Such chromosomal clustering may confer a selective advantage as it enables coordinated gene regulation at the chromatin level. We studied the chromosomal organization of genes involved in male reproductive development in Arabidopsis thaliana. We developed an in-silico tool to identify physical clusters of co-regulated genes from gene expression data. We identified 17 clusters (96 genes) involved in stamen development and acting downstream of the transcriptional activator MS1 (MALE STERILITY 1), which contains a PHD domain associated with chromatin re-organization. The clusters exhibited little gene homology or promoter element similarity, and largely overlapped with reported repressive histone marks. Experiments on a subset of the clusters suggested a link between expression activation and chromatin conformation: qRT-PCR and mRNA in situ hybridization showed that the clustered genes were up-regulated within 48 h after MS1 induction; out of 14 chromatin-remodeling mutants studied, expression of clustered genes was consistently down-regulated only in hta9/hta11, previously associated with metabolic cluster activation; DNA fluorescence in situ hybridization confirmed that transcriptional activation of the clustered genes was correlated with open chromatin conformation. Stamen development thus appears to involve transcriptional activation of physically clustered genes through chromatin de-condensation. PMID:28175342

  15. Genomic Regions in Local Endangered Sheep Encode Potentially Favorable Genes.

    PubMed

    Moioli, Bianca; Steri, Roberto; Catillo, Gennaro

    2018-01-02

    The economic evaluation of farm animal genetic resources plays a key role in developing conservation programs. However, to date, the link between diversity as assessed by neutral genetic markers and the functional diversity is not yet understood. Two genome-wide comparisons, using over 44,000 Single Nucleotide Polymorphisms, identified the markers with the highest difference in allele frequency between the Alpago endangered breed and two clusters, composed of four specialized dairy sheep, and four meat breeds respectively. The genes in proximity of these markers were mapped to known pathways of the Gene Ontology to determine which ones were most represented. Our results indicated that the differences of the Alpago breed from the more productive sheep rely upon genes involved in cellular defense and repair mechanisms. A higher number of different markers and genes were detected in the comparison with the specialized dairy sheep. These genes play a role in complex biological processes: metabolic, homeostatic, neurological system, and macromolecular organization; such processes may possibly explain the evolution of gene function as a result of selection to improve milk yield.

  16. Valine/isoleucine variants drive selective pressure in the VP1 sequence of EV-A71 enteroviruses.

    PubMed

    Duy, Nghia Ngu; Huong, Le Thi Thanh; Ravel, Patrice; Huong, Le Thi Song; Dwivedi, Ankit; Sessions, October Michael; Hou, Yan'An; Chua, Robert; Kister, Guilhem; Afelt, Aneta; Moulia, Catherine; Gubler, Duane J; Thiem, Vu Dinh; Thanh, Nguyen Thi Hien; Devaux, Christian; Duong, Tran Nhu; Hien, Nguyen Tran; Cornillot, Emmanuel; Gavotte, Laurent; Frutos, Roger

    2017-05-08

    In 2011-2012, Northern Vietnam experienced its first large scale hand foot and mouth disease (HFMD) epidemic. In 2011, a major HFMD epidemic was also reported in South Vietnam with fatal cases. This 2011-2012 outbreak was the first one to occur in North Vietnam providing grounds to study the etiology, origin and dynamic of the disease. We report here the analysis of the VP1 gene of strains isolated throughout North Vietnam during the 2011-2012 outbreak and before. The VP1 gene of 106 EV-A71 isolates from North Vietnam and 2 from Central Vietnam were sequenced. Sequence alignments were analyzed at the nucleic acid and protein level. Gene polymorphism was also analyzed. A Factorial Correspondence Analysis was performed to correlate amino acid mutations with clinical parameters. The sequences were distributed into four phylogenetic clusters. Three clusters corresponded to the subgenogroup C4 and the last one corresponded to the subgenogroup C5. Each cluster displayed different polymorphism characteristics. Proteins were highly conserved but three sites bearing only Isoleucine (I) or Valine (V) were characterized. The isoleucine/valine variability matched the clusters. Spatiotemporal analysis of the I/V variants showed that all variants which emerged in 2011 and then in 2012 were not the same but were all present in the region prior to the 2011-2012 outbreak. Some correlation was found between certain I/V variants and ethnicity and severity. The 2011-2012 outbreak was not caused by an exogenous strain coming from South Vietnam or elsewhere but by strains already present and circulating at low level in North Vietnam. However, what triggered the outbreak remains unclear. A selective pressure is applied on I/V variants which matches the genetic clusters. I/V variants were shown on other viruses to correlate with pathogenicity. This should be investigated in EV-A71. I/V variants are an easy and efficient way to survey and identify circulating EV-A71 strains.

  17. Constrained clusters of gene expression profiles with pathological features.

    PubMed

    Sese, Jun; Kurokawa, Yukinori; Monden, Morito; Kato, Kikuya; Morishita, Shinichi

    2004-11-22

    Gene expression profiles should be useful in distinguishing variations in disease, since they reflect accurately the status of cells. The primary clustering of gene expression reveals the genotypes that are responsible for the proximity of members within each cluster, while further clustering elucidates the pathological features of the individual members of each cluster. However, since the first clustering process and the second classification step, in which the features are associated with clusters, are performed independently, the initial set of clusters may omit genes that are associated with pathologically meaningful features. Therefore, it is important to devise a way of identifying gene expression clusters that are associated with pathological features. We present the novel technique of 'itemset constrained clustering' (IC-Clustering), which computes the optimal cluster that maximizes the interclass variance of gene expression between groups, which are divided according to the restriction that only divisions that can be expressed using common features are allowed. This constraint automatically labels each cluster with a set of pathological features which characterize that cluster. When applied to liver cancer datasets, IC-Clustering revealed informative gene expression clusters, which could be annotated with various pathological features, such as 'tumor' and 'man', or 'except tumor' and 'normal liver function'. In contrast, the k-means method overlooked these clusters.

  18. Genetic diversity, population structure and subdivision of local Balkan pig breeds in Austria, Croatia, Serbia and Bosnia-Herzegovina and its practical value in conservation programs.

    PubMed

    Druml, Thomas; Salajpal, Kresimir; Dikic, Maria; Urosevic, Miroslav; Grilz-Seger, Gertrud; Baumung, Roswitha

    2012-03-01

    At present the Croatian Turopolje pig population comprises about 157 breeding animals. In Austria, 324 Turopolje pigs originating from six Croatian founder animals are registered. Multiple bottlenecks have occurred in this population, one major one rather recently and several more older and moderate ones. In addition, it has been subdivided into three subpopulations, one in Austria and two in Croatia, with restricted gene flow. These specificities explain the delicate situation of this endangered Croatian lard-type pig breed. In order to identify candidate breeding animals or gene pools for future conservation breeding programs, we studied the genetic diversity and population structure of this breed using microsatellite data from 197 individuals belonging to five different breeds. The genetic diversity of the Turopolje pig is dramatically low, with observed heterozygosity values ranging from 0.38 to 0.57. Split into three populations since 1994, two genetic clusters could be identified: one highly conserved Croatian gene pool in Turopoljski Lug and the "Posavina" gene pool mainly present in the Austrian population. The second Croatian subpopulation in Lonjsko Polje in the Posavina region shows a constant gene flow from the Turopoljski Lug animals. One practical conclusion is that it is necessary to develop a "Posavina" boar line to preserve the "Posavina" gene pool and constitute a corresponding population in Croatia. Animals of the highly inbred herd in Turopoljski Lug should not be crossed with animals of other populations since they represent a specific phenotype-genotype combination. However, to increase the genetic diversity of this herd, a program to optimize its sex ratio should be carried out, as was done in the Austrian population where the level of heterozygosity has remained moderate despite its heavy bottleneck in 1994. © 2012 Druml et al; licensee BioMed Central Ltd.

  19. Annotation of the Transcriptome from Taenia pisiformis and Its Comparative Analysis with Three Taeniidae Species

    PubMed Central

    Yang, Deying; Fu, Yan; Wu, Xuhang; Xie, Yue; Nie, Huaming; Chen, Lin; Nong, Xiang; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yan, Ning; Zhang, Runhui; Zheng, Wanpeng; Yang, Guangyou

    2012-01-01

    Background: Taenia pisiformis is one of the most common intestinal tapeworms and can cause infections in canines. Adult T. pisiformis (canines as definitive hosts) and Cysticercus pisiformis (rabbits as intermediate hosts) cause significant health problems to the host and considerable socio-economic losses as a consequence. No complete genomic data regarding T. pisiformis are currently available in public databases. RNA-seq provides an effective approach to analyze the eukaryotic transcriptome to generate large functional gene datasets that can be used for further studies. Methodology/Principal Findings: In this study, 2.67 million sequencing clean reads and 72,957 unigenes were generated using the RNA-seq technique. Based on a sequence similarity search with known proteins, a total of 26,012 unigenes (no redundancy) were identified after quality control procedures via the alignment of four databases. Overall, 15,920 unigenes were mapped to 203 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Through analyzing the glycolysis/gluconeogenesis and axonal guidance pathways, we achieved an in-depth understanding of the biochemistry of T. pisiformis. Here, we selected four unigenes at random and obtained their full-length cDNA clones using RACE PCR. Functional distribution characteristics were gained through comparing four cestode species (72,957 unigenes of T. pisiformis, 30,700 ESTs of T. solium, 1,058 ESTs of Eg+Em [conserved ESTs between Echinococcus granulosus and Echinococcus multilocularis]), with the cluster of orthologous groups (COG) and gene ontology (GO) functional classification systems. Furthermore, the conserved common genes in these four cestode species were obtained and aligned by the KEGG database. Conclusion: This study provides an extensive transcriptome dataset obtained from the deep sequencing of T. pisiformis in a non-model whole genome. The identification of conserved genes may provide novel approaches for potential drug targets and vaccinations against cestode infections. Research can now accelerate into the functional genomics, immunity and gene expression profiles of cestode species. PMID:22514598

  20. Diametrical clustering for identifying anti-correlated gene clusters.

    PubMed

    Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

    2003-09-01

    Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive: genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i) re-partitioning the genes and (ii) computing the dominant singular vector of each gene cluster, each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteasome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.
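
    The two alternating steps named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each profile is mean-centred and scaled to unit length, that re-partitioning assigns a gene to the prototype with the largest squared inner product (so correlated and anti-correlated genes land together), and that prototypes are seeded deterministically; the function names are hypothetical.

```python
import math


def normalize(v):
    """Centre a profile on its mean and scale it to unit length."""
    mean = sum(v) / len(v)
    centred = [x - mean for x in v]
    norm = math.sqrt(sum(x * x for x in centred))
    return [x / norm for x in centred] if norm > 0 else centred


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def dominant_vector(rows, iters=100):
    """Power iteration for the dominant eigenvector of sum_i x_i x_i^T,
    i.e. the dominant singular vector of the cluster's profile matrix."""
    v = rows[0][:]
    for _ in range(iters):
        w = [0.0] * len(v)
        for r in rows:
            c = dot(r, v)
            for k in range(len(w)):
                w[k] += c * r[k]
        norm = math.sqrt(dot(w, w))
        if norm == 0:
            break
        v = [x / norm for x in w]
    return v


def diametrical_clustering(profiles, k, max_iter=50):
    data = [normalize(p) for p in profiles]
    # Seeding is an assumption of this sketch: start from the first profile,
    # then repeatedly add the profile least aligned with existing prototypes.
    prototypes = [data[0][:]]
    while len(prototypes) < k:
        far = min(data, key=lambda x: max(dot(x, p) ** 2 for p in prototypes))
        prototypes.append(far[:])
    labels = None
    for _ in range(max_iter):
        # Step (i): re-partition by squared similarity to each prototype.
        new_labels = [
            max(range(k), key=lambda j: dot(x, prototypes[j]) ** 2) for x in data
        ]
        if new_labels == labels:
            break
        labels = new_labels
        # Step (ii): recompute each prototype as the dominant singular vector.
        for j in range(k):
            members = [x for x, lab in zip(data, labels) if lab == j]
            if members:
                prototypes[j] = dominant_vector(members)
    return labels
```

    Because the assignment uses the squared inner product, a rising profile such as [1, 2, 3, 4] and its mirror image [4, 3, 2, 1] receive the same label, which is the behaviour the abstract describes as a 'diametric' cluster.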

  1. Patterns of amino acid conservation in human and animal immunodeficiency viruses.

    PubMed

    Voitenko, Olga S; Dhroso, Andi; Feldmann, Anna; Korkin, Dmitry; Kalinina, Olga V

    2016-09-01

    Due to their high genomic variability, RNA viruses and retroviruses present a unique opportunity for detailed study of molecular evolution. Lentiviruses, with HIV being a notable example, are one of the best studied viral groups: hundreds of thousands of sequences are available together with experimentally resolved three-dimensional structures for most viral proteins. In this work, we use these data to study specific patterns of evolution of the viral proteins, and their relationship to protein interactions and immunogenicity. We propose a method for identification of two types of surface residue clusters with abnormal conservation: extremely conserved and extremely variable clusters. We identify them on the surface of proteins from HIV and other animal immunodeficiency viruses. Both types of clusters are overrepresented on the interaction interfaces of viral proteins with other proteins, nucleic acids or low molecular-weight ligands, both in the viral particle and between the virus and its host. In the immunodeficiency viruses, the interaction interfaces are not more conserved than the corresponding proteins on average, and we show that extremely conserved clusters coincide with protein-protein interaction hotspots, predicted as the residues with the largest energetic contribution to the interaction. Extremely variable clusters have been identified here for the first time. In the HIV-1 envelope protein gp120, they overlap with known antigenic sites. These antigenic sites also contain many residues from extremely conserved clusters, hence representing a unique interacting interface enriched both in extremely conserved and in extremely variable clusters of residues. This observation may have important implications for antiretroviral vaccine development. A Python package is available at https://bioinf.mpi-inf.mpg.de/publications/viral-ppi-pred/. Contact: voitenko@mpi-inf.mpg.de or kalinina@mpi-inf.mpg.de. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.

  2. Function and Evolution of DNA Methylation in Nasonia vitripennis

    PubMed Central

    Wang, Xu; Wheeler, David; Avery, Amanda; Rago, Alfredo; Choi, Jeong-Hyeon; Colbourne, John K.; Clark, Andrew G.; Werren, John H.

    2013-01-01

    The parasitoid wasp Nasonia vitripennis is an emerging genetic model for functional analysis of DNA methylation. Here, we characterize genome-wide methylation at a base-pair resolution, and compare these results to gene expression across five developmental stages and to methylation patterns reported in other insects. An accurate assessment of DNA methylation across the genome is accomplished using bisulfite sequencing of adult females from a highly inbred line. One-third of genes show extensive methylation over the gene body, yet methylated DNA is not found in non-coding regions and rarely in transposons. Methylated genes occur in small clusters across the genome. Methylation demarcates exon-intron boundaries, with elevated levels over exons, primarily in the 5′ regions of genes. It is also elevated near the sites of translational initiation and termination, with reduced levels in 5′ and 3′ UTRs. Methylated genes have higher median expression levels and lower expression variation across development stages than non-methylated genes. There is no difference in frequency of differential splicing between methylated and non-methylated genes, and as yet no established role for methylation in regulating alternative splicing in Nasonia. Phylogenetic comparisons indicate that many genes maintain methylation status across long evolutionary time scales. Nasonia methylated genes are more likely to be conserved in insects, but even those that are not conserved show broader expression across development than comparable non-methylated genes. Finally, examination of duplicated genes shows that those paralogs that have lost methylation in the Nasonia lineage following gene duplication evolve more rapidly, show decreased median expression levels, and increased specialization in expression across development. Methylation of Nasonia genes signals constitutive transcription across developmental stages, whereas non-methylated genes show more dynamic developmental expression patterns. We speculate that loss of methylation may result in increased developmental specialization in evolution and acquisition of methylation may lead to broader constitutive expression. PMID:24130511

  3. Comparative and genetic analysis of the four sequenced Paenibacillus polymyxa genomes reveals a diverse metabolism and conservation of genes relevant to plant-growth promotion and competitiveness.

    PubMed

    Eastman, Alexander W; Heinrichs, David E; Yuan, Ze-Chun

    2014-10-03

    Members of the genus Paenibacillus are important plant growth-promoting rhizobacteria that can serve as bio-reactors. Paenibacillus polymyxa promotes the growth of a variety of economically important crops. Our lab recently completed the genome sequence of Paenibacillus polymyxa CR1. As of January 2014, four P. polymyxa genomes have been completely sequenced but no comparative genomic analyses have been reported. Here we report the comparative and genetic analyses of four sequenced P. polymyxa genomes, which revealed a significantly conserved core genome. Complex metabolic pathways and regulatory networks were highly conserved and allow P. polymyxa to rapidly respond to dynamic environmental cues. Genes responsible for phytohormone synthesis, phosphate solubilization, iron acquisition, transcriptional regulation, σ-factors, stress responses, transporters and biomass degradation were well conserved, indicating an intimate association with plant hosts and the rhizosphere niche. In addition, genes responsible for antimicrobial resistance and non-ribosomal peptide/polyketide synthesis are present in both the core and accessory genome of each strain. Comparative analyses also reveal variations in the accessory genome, including large plasmids present in strains M1 and SC2. Furthermore, a considerable number of strain-specific genes and genomic islands are irregularly distributed throughout each genome. Although a variety of plant-growth promoting traits are encoded by all strains, only P. polymyxa CR1 encodes the unique nitrogen fixation cluster found in other Paenibacillus sp. Our study revealed that genomic loci relevant to host interaction and ecological fitness are highly conserved within the P. polymyxa genomes analysed, despite variations in the accessory genome. This work suggests that plant-growth promotion by P. polymyxa is mediated largely through phytohormone production, increased nutrient availability and bio-control mechanisms. This study provides an in-depth understanding of the genome architecture of this species, thus facilitating future genetic engineering and applications in agriculture, industry and medicine. Furthermore, this study highlights the current gap in our understanding of complex plant biomass metabolism in Gram-positive bacteria.

  4. Nitrogen transporter and assimilation genes exhibit developmental stage-selective expression in maize (Zea mays L.) associated with distinct cis-acting promoter motifs.

    PubMed

    Liseron-Monfils, Christophe; Bi, Yong-Mei; Downs, Gregory S; Wu, Wenqing; Signorelli, Tara; Lu, Guangwen; Chen, Xi; Bondo, Eddie; Zhu, Tong; Lukens, Lewis N; Colasanti, Joseph; Rothstein, Steven J; Raizada, Manish N

    2013-10-01

    Nitrogen is considered the most limiting nutrient for maize (Zea mays L.), but there is limited understanding of the regulation of nitrogen-related genes during maize development. An Affymetrix 82K maize array was used to analyze the expression of ≤ 46 unique nitrogen uptake and assimilation probes in 50 maize tissues from seedling emergence to 31 d after pollination. Four nitrogen-related expression clusters were identified in roots and shoots corresponding to, or overlapping, juvenile, adult, and reproductive phases of development. Quantitative real time PCR data was consistent with the existence of these distinct expression clusters. Promoters corresponding to each cluster were screened for over-represented cis-acting elements. The 8-bp distal motif of the Arabidopsis 43-bp nitrogen response element (NRE) was over-represented in nitrogen-related maize gene promoters. This conserved motif, referred to here as NRE43-d8, was previously shown to be critical for nitrate-activated transcription of nitrate reductase (NIA1) and nitrite reductase (NIR1) by the NIN-LIKE PROTEIN 6 (NLP6) in Arabidopsis. Here, NRE43-d8 was over-represented in the promoters of maize nitrate and ammonium transporter genes, specifically those that showed peak expression during early-stage vegetative development. This result predicts an expansion of the NRE-NLP6 regulon and suggests that it may have a developmental component in maize. We also report leaf expression of putative orthologs of nitrite transporters (NiTR1), a transporter not previously reported in maize. We conclude by discussing how each of the four transcriptional modules may be responsible for the different nitrogen uptake and assimilation requirements of leaves and roots at different stages of maize development.

  5. Multigenome analysis identifies a worldwide distributed epidemic Legionella pneumophila clone that emerged within a highly diverse species

    PubMed Central

    Cazalet, Christel; Jarraud, Sophie; Ghavi-Helm, Yad; Kunst, Frank; Glaser, Philippe; Etienne; Buchrieser, Carmen

    2008-01-01

    Genomics can provide the basis for understanding the evolution of emerging, lethal human pathogens such as Legionella pneumophila, the causative agent of Legionnaires’ disease. This bacterium replicates within amoebae and persists in the environment as a free-living microbe. Among the many Legionella species described, L. pneumophila is associated with 90% of human disease and within the 15 serogroups (Sg), L. pneumophila Sg1 causes over 84% of Legionnaires’ disease worldwide. Why L. pneumophila Sg1 is so predominant is unknown. Here, we report the first comprehensive screen of the gene content of 217 L. pneumophila and 32 non-L. pneumophila strains isolated from humans and the environment using a Legionella DNA-array. Strikingly, we uncovered a high conservation of virulence- and eukaryotic-like genes, indicating strong environmental selection pressures for their preservation. No specific hybridization profile differentiated clinical and environmental strains or strains of different serogroups. Surprisingly, the gene cluster coding the determinants of the core and the O side-chain synthesis of the lipopolysaccharide (LPS cluster) determining Sg1 was present in diverse genomic backgrounds, strongly implicating the LPS of Sg1 itself as a principal cause of the high prevalence of Sg1 strains in human disease and suggesting that the LPS cluster can be transferred horizontally. Genomic analysis also revealed that L. pneumophila is a genetically diverse species, in part due to horizontal gene transfer of mobile genetic elements among L. pneumophila strains, but also between different Legionella species. However, the genomic background also plays a role in disease causation as demonstrated by the identification of a globally distributed epidemic strain exhibiting the genotype of the sequenced L. pneumophila strain Paris. PMID:18256241

  6. The recurrent SET-NUP214 fusion as a new HOXA activation mechanism in pediatric T-cell acute lymphoblastic leukemia

    PubMed Central

    Van Vlierberghe, Pieter; van Grotel, Martine; Tchinda, Joëlle; Lee, Charles; Beverloo, H. Berna; van der Spek, Peter J.; Stubbs, Andrew; Cools, Jan; Nagata, Kyosuke; Fornerod, Maarten; Buijs-Gladdines, Jessica; Horstmann, Martin; van Wering, Elisabeth R.; Soulier, Jean; Pieters, Rob

    2008-01-01

    T-cell acute lymphoblastic leukemia (T-ALL) is mostly characterized by specific chromosomal abnormalities, some occurring in a mutually exclusive manner that possibly delineate specific T-ALL subgroups. One subgroup, including MLL-rearranged, CALM-AF10 or inv (7)(p15q34) patients, is characterized by elevated expression of HOXA genes. Using a gene expression–based clustering analysis of 67 T-ALL cases with recurrent molecular genetic abnormalities and 25 samples lacking apparent aberrations, we identified 5 new patients with elevated HOXA levels. Using microarray-based comparative genomic hybridization (array-CGH), a cryptic and recurrent deletion, del (9)(q34.11q34.13), was exclusively identified in 3 of these 5 patients. This deletion results in a conserved SET-NUP214 fusion product, which was also identified in the T-ALL cell line LOUCY. SET-NUP214 binds in the promoter regions of specific HOXA genes, where it interacts with CRM1 and DOT1L, which may transcriptionally activate specific members of the HOXA cluster. Targeted inhibition of SET-NUP214 by siRNA abolished expression of HOXA genes, inhibited proliferation, and induced differentiation in LOUCY but not in other T-ALL lines. We conclude that SET-NUP214 may contribute to the pathogenesis of T-ALL by enforcing T-cell differentiation arrest. PMID:18299449

  7. Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B.

    PubMed

    Jeong, Haeyoung; Lim, Jong-Min; Park, Jihye; Sim, Young Mi; Choi, Han-Gu; Lee, Jungho; Jeong, Won-Joong

    2014-04-16

    Chlorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variabilis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Helicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 supports membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chlorellales as a cis-splice form and that the cis-splicing intron was inherited by recent Chlorellales and was recently trans-spliced in Helicosporidium.

  8. Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B

    PubMed Central

    2014-01-01

    Background: Chlorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variabilis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. Results: The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Helicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 supports membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. Conclusions: The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast genome phylogeny supports monophyly of the seven investigated members of Chlorellales. The presence of the cox1 intron at 721 in all four investigated Chlorellales taxa indicates that the cox1 intron had been introduced in early Chlorellales as a cis-splice form and that the cis-splicing intron was inherited by recent Chlorellales and was recently trans-spliced in Helicosporidium. PMID:24735464

  9. Mood stabilizing drugs regulate transcription of immune, neuronal and metabolic pathway genes in Drosophila.

    PubMed

    Herteleer, L; Zwarts, L; Hens, K; Forero, D; Del-Favero, J; Callaerts, P

    2016-05-01

    Lithium and valproate (VPA) are drugs used in the management of bipolar disorder. Even though they reportedly act on various pathways, the transcriptional targets relevant for disease mechanism and therapeutic effect remain unclear. Furthermore, multiple studies used lymphoblasts of bipolar patients as a cellular proxy, but it remains unclear whether peripheral cells provide a good readout for the effects of these drugs in the brain. We used cultured Drosophila cells and adult flies to analyze the transcriptional effects of lithium and VPA and define mechanistic pathways. Transcriptional profiles were determined for Drosophila S2 cells and adult fly heads following lithium or VPA treatment. Gene ontology categories were identified using the DAVID functional annotation tool with a cut-off of p < 0.05. Significantly enriched GO terms were clustered using REVIGO and DAVID functional annotation clustering. Significance of overlap between transcript lists was determined with a Fisher's exact hypergeometric test. Treatment of cultured cells and adult flies with lithium and VPA induces transcriptional responses in genes with similar ontology, most prominently immune response, neuronal development, neuronal function, and metabolism. (i) Transcriptional effects of lithium and VPA in Drosophila S2 cells and heads show significant overlap. (ii) The overlap between transcriptional alterations in peripheral versus neuronal cells at the single gene level is negligible, but at the gene ontology and pathway level considerable overlap can be found. (iii) Lithium and VPA act on evolutionarily conserved pathways in Drosophila and mammalian models.
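
    The overlap test mentioned in this abstract can be illustrated with a minimal sketch. It assumes a one-sided hypergeometric (Fisher's exact) test on the overlap of two gene lists drawn from a common background; the function name and the example counts are hypothetical and are not taken from the study.

```python
from math import comb


def overlap_p_value(universe_size, list_a_size, list_b_size, overlap):
    """One-sided probability of seeing at least `overlap` shared genes.

    Assumption: list B is a random draw of `list_b_size` genes from a
    background of `universe_size` genes, of which `list_a_size` belong
    to list A. The overlap then follows a hypergeometric distribution.
    """
    max_overlap = min(list_a_size, list_b_size)
    total = comb(universe_size, list_b_size)
    # Sum the upper tail: P(X = k) for k = overlap .. max_overlap.
    tail = sum(
        comb(list_a_size, k) * comb(universe_size - list_a_size, list_b_size - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total


# Hypothetical counts, for illustration only.
p = overlap_p_value(universe_size=10, list_a_size=5, list_b_size=5, overlap=5)
print(p)
```

    Exact integer arithmetic via `math.comb` avoids rounding problems in the tail sum; for genome-scale lists a statistics library would normally be used instead.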

  10. A Novel Sterol Desaturase-Like Protein Promoting Dealkylation of Phytosterols in Tetrahymena thermophila

    PubMed Central

    Tomazic, Mariela L.; Najle, Sebastián R.; Nusblat, Alejandro D.; Uttaro, Antonio D.; Nudel, Clara B.

    2011-01-01

    The gene TTHERM_00438800 (DES24) from the ciliate Tetrahymena thermophila encodes a protein with three conserved histidine clusters, typical of the fatty acid hydroxylase superfamily. Despite its high similarity to sterol desaturase-like enzymes, the phylogenetic analysis groups Des24p in a separate cluster more related to bacterial than to eukaryotic proteins, suggesting a possible horizontal gene transfer event. A somatic knockout of DES24 revealed that the gene encodes a protein, Des24p, which is involved in the dealkylation of phytosterols. Knocked-out mutants were unable to eliminate the C-24 ethyl group from C29 sterols, whereas the ability to introduce other modifications, such as desaturations at positions C-5(6), C-7(8), and C-22(23), was not altered. Although C-24 dealkylations have been described in other organisms, such as insects, neither the enzymes nor the corresponding genes have been identified to date. Therefore, this is the first identification of a gene involved in sterol dealkylation. Moreover, the knockout mutant and wild-type strain differed significantly in growth and morphology only when cultivated with C29 sterols; under this culture condition, a change from the typical pear-like shape to a round shape and an alteration in the regulation of tetrahymanol biosynthesis were observed. Sterol analysis upon culture with various substrates and inhibitors indicates that the removal of the C-24 ethyl group in Tetrahymena may proceed by a mechanism different from the one currently known. PMID:21257793

  11. Characterization of class II β chain major histocompatibility complex genes in a family of Hawaiian honeycreepers: 'amakihi (Hemignathus virens).

    PubMed

    Jarvi, Susan I; Bianchi, Kiara R; Farias, Margaret Em; Txakeeyang, Ann; McFarland, Thomas; Belcaid, Mahdi; Asano, Ashley

    2016-07-01

    Hawaiian honeycreepers (Drepanidinae) have evolved in the absence of mosquitoes for over five million years. Through human activity, mosquitoes were introduced to the Hawaiian archipelago less than 200 years ago. Mosquito-vectored diseases such as avian malaria caused by Plasmodium relictum and Avipoxviruses have greatly impacted these vulnerable species. Susceptibility to these diseases is variable among and within species. Due to their function in adaptive immunity, the role of major histocompatibility complex genes (Mhc) in disease susceptibility is under investigation. In this study, we evaluate gene organization and levels of diversity of Mhc class II β chain genes (exon 2) in a captive-reared family of Hawaii 'amakihi (Hemignathus virens). A total of 233 sequences (173 bp) were obtained by PCR+1 amplification and cloning, and 5720 sequences were generated by Roche 454 pyrosequencing. We report a total of 17 alleles originating from a minimum of 14 distinct loci. We detected three linkage groups that appear to represent three distinct haplotypes. Phylogenetic analysis revealed one variable cluster resembling classical Mhc sequences (DAB) and one highly conserved, low variability cluster resembling non-classical Mhc sequences (DBB). High net evolutionary divergence values between DAB and DBB resemble that seen between chicken BLB system and YLB system genes. High amino acid identity among non-classical alleles from 12 species of passerines (DBB) and four species of Galliformes (YLB) was found, suggesting that these non-classical passerine sequences may be related to the Galliforme YLB sequences.

  12. Genome-scale analysis of anaerobic benzoate and phenol metabolism in the hyperthermophilic archaeon Ferroglobus placidus

    PubMed Central

    Holmes, Dawn E; Risso, Carla; Smith, Jessica A; Lovley, Derek R

    2012-01-01

    Insight into the mechanisms for the anaerobic metabolism of aromatic compounds by the hyperthermophilic archaeon Ferroglobus placidus is expected to improve understanding of the degradation of aromatics in hot (>80° C) environments and to identify enzymes that might have biotechnological applications. Analysis of the F. placidus genome revealed genes predicted to encode enzymes homologous to those previously identified as having a role in benzoate and phenol metabolism in mesophilic bacteria. Surprisingly, F. placidus lacks genes for an ATP-independent class II benzoyl-CoA (coenzyme A) reductase (BCR) found in all strictly anaerobic bacteria, but has instead genes coding for a bzd-type ATP-consuming class I BCR, similar to those found in facultative bacteria. The lower portion of the benzoate degradation pathway appears to be more similar to that found in the phototroph Rhodopseudomonas palustris, than the pathway reported for all heterotrophic anaerobic benzoate degraders. Many of the genes predicted to be involved in benzoate metabolism were found in one of two gene clusters. Genes for phenol carboxylation proceeding through a phenylphosphate intermediate were identified in a single gene cluster. Analysis of transcript abundance with a whole-genome microarray and quantitative reverse transcriptase polymerase chain reaction demonstrated that most of the genes predicted to be involved in benzoate or phenol metabolism had higher transcript abundance during growth on those substrates vs growth on acetate. These results suggest that the general strategies for benzoate and phenol metabolism are highly conserved between microorganisms living in moderate and hot environments, and that anaerobic metabolism of aromatic compounds might be analyzed in a wide range of environments with similar molecular targets. PMID:21776029

  13. A Cluster of Cuticle Protein Genes of Drosophila melanogaster at 65A: Sequence, Structure and Evolution

    PubMed Central

    Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.

    1997-01-01

    A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064

  14. The highly conserved MraZ protein is a transcriptional regulator in Escherichia coli

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eraso, Jesus M.; Markillie, Lye Meng; Mitchell, Hugh D.

    2014-05-05

    The mraZ and mraW genes are highly conserved in bacteria, both in sequence and location at the head of the division and cell wall (dcw) gene cluster. Although MraZ has structural similarity to the AbrB transition state regulator and the MazE antitoxin, and MraW is known to methylate ribosomal RNA, mraZ and mraW null mutants have no detectable growth phenotype in any species tested to date, hampering progress in understanding their physiological role. Here we show that overproduction of Escherichia coli MraZ perturbs cell division and the cell envelope, is more lethal at high levels or in minimal growth medium, and that MraW antagonizes these effects. MraZ-GFP localizes to the nucleoid, suggesting that it binds DNA. Indeed, purified MraZ directly binds a region upstream from its own promoter containing three direct repeats to regulate its own expression and that of downstream cell division and cell wall genes. MraZ-LacZ fusions are repressed by excess MraZ but not when DNA binding by MraZ is inhibited. RNAseq analysis indicates that MraZ is a global transcriptional regulator with numerous targets in addition to dcw genes. One of these targets, mioC, is directly bound by MraZ in a region with three direct repeats.

  15. vasa and piwi are required for mitotic integrity in early embryogenesis in the spider Parasteatoda tepidariorum.

    PubMed

    Schwager, Evelyn E; Meng, Yue; Extavour, Cassandra G

    2015-06-15

    Studies in vertebrate and invertebrate model organisms on the molecular basis of primordial germ cell (PGC) specification have revealed that metazoans can specify their germ line either early in development by maternally transmitted cytoplasmic factors (inheritance), or later in development by signaling factors from neighboring tissues (induction). Regardless of the mode of PGC specification, once animal germ cells are specified, they invariably express a number of highly conserved genes. These include vasa and piwi, which can play essential roles in any or all of PGC specification, development, or gametogenesis. Although the arthropods are the most speciose animal phylum, to date there have been no functional studies of conserved germ line genes in species of the most basally branching arthropod clade, the chelicerates (which includes spiders, scorpions, and horseshoe crabs). Here we present the first such study by using molecular and functional tools to examine germ line development and the roles of vasa and piwi orthologues in the common house spider Parasteatoda (formerly Achaearanea) tepidariorum. We use transcript and protein expression patterns of Pt-vasa and Pt-piwi to show that primordial germ cells (PGCs) in the spider arise during late embryogenesis. Neither Pt-vasa nor Pt-piwi gene products are localized asymmetrically to any embryonic region before PGCs emerge as paired segmental clusters in opisthosomal segments 2-6 at late germ band stages. RNA interference studies reveal that both genes are required maternally for egg laying, mitotic progression in early embryos, and embryonic survival. Our results add to the growing body of evidence that vasa and piwi can play important roles in somatic development, and provide evidence for a previously hypothesized conserved role for vasa in cell cycle progression. Copyright © 2014 Elsevier Inc. All rights reserved.

  16. Comparative Genomics of Syntrophic Branched-Chain Fatty Acid Degrading Bacteria

    PubMed Central

    Narihiro, Takashi; Nobu, Masaru K.; Tamaki, Hideyuki; Kamagata, Yoichi; Sekiguchi, Yuji; Liu, Wen-Tso

    2016-01-01

    The syntrophic degradation of branched-chain fatty acids (BCFAs) such as 2-methylbutyrate and isobutyrate is an essential step in the production of methane from proteins/amino acids in anaerobic ecosystems. While a few syntrophic BCFA-degrading bacteria have been isolated, their metabolic pathways in BCFA and short-chain fatty acid (SCFA) degradation as well as energy conservation systems remain unclear. In an attempt to identify these pathways, we herein performed comparative genomics of three syntrophic bacteria: 2-methylbutyrate-degrading “Syntrophomonas wolfei subsp. methylbutyratica” strain JCM 14075T (=4J5T), isobutyrate-degrading Syntrophothermus lipocalidus strain TGB-C1T, and non-BCFA-metabolizing S. wolfei subsp. wolfei strain GöttingenT. We demonstrated that 4J5 and TGB-C1 both encode multiple genes/gene clusters involved in β-oxidation, as observed in the Göttingen genome, which has multiple copies of genes associated with butyrate degradation. The 4J5 genome possesses phylogenetically distinct β-oxidation genes, which may be involved in 2-methylbutyrate degradation. In addition, these Syntrophomonadaceae strains harbor various hydrogen/formate generation systems (i.e., electron-bifurcating hydrogenase, formate dehydrogenase, and membrane-bound hydrogenase) and energy-conserving electron transport systems, including electron transfer flavoprotein (ETF)-linked acyl-CoA dehydrogenase, ETF-linked iron-sulfur binding reductase, ETF dehydrogenase (FixABCX), and flavin oxidoreductase-heterodisulfide reductase (Flox-Hdr). Unexpectedly, the TGB-C1 genome encodes a nitrogenase complex, which may function as an alternative H2 generation mechanism. These results suggest that the BCFA-degrading syntrophic strains 4J5 and TGB-C1 possess specific β-oxidation-related enzymes for BCFA oxidation as well as appropriate energy conservation systems to perform thermodynamically unfavorable syntrophic metabolism. PMID:27431485

  17. Characterization of G2P[4] rotavirus strains associated with increased detection in Australian states using the RotaTeq® vaccine during the 2010-2011 surveillance period.

    PubMed

    Donato, Celeste M; Zhang, Zheng Andrew; Donker, Nicole C; Kirkwood, Carl D

    2014-12-01

    The introduction of rotavirus vaccines Rotarix® and RotaTeq® into the Australian National Immunisation Program in July 2007 has resulted in a dramatic decrease in the burden of rotavirus disease. G2P[4] strains became the dominant genotype Australia-wide during the 2010-2011 surveillance period and, for the first time since vaccine introduction, a higher proportion were isolated in jurisdictions using RotaTeq® vaccine compared to locations using Rotarix®. Phylogenetic analysis of the VP7 gene of 32 G2P[4] strains identified six genetic clusters; these distinct clusters were also observed in the VP4 gene for a subset of 12 strains. The whole genome was determined for a representative strain of clusters A (RVA/Human-wt/AUS/SA066/2010/G2P[4]), B (RVA/Human-wt/AUS/WAPC703/2010/G2P[4]), C (RVA/Human-wt/AUS/MON008/2010/G2P[4]) and E (RVA/Human-wt/AUS/RCH041/2010/G2P[4]). All of the strains possessed the archetypal DS-1 like genome constellation G2-P[4]-I2-R2-C2-M2-A2-N2-T2-E2-H2. Three of the strains, SA066, MON008 and WAPC703, clustered together and were distinct from RCH041 for all 11 genes. The VP7 genes of 31/32 of the strains characterized in this study possessed five conserved amino acid substitutions when compared to the G2 VP7 gene present in the RotaTeq® vaccine. Three of the substitutions were in the VP7 antigenic regions A and C; the substitutions A87T, D96N and S213D have been reported in the majority of G2P[4] strains circulating globally over the previous decade. These changes may have improved the ability of strains to circulate in settings of high vaccine use. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship

    NASA Astrophysics Data System (ADS)

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  19. Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing

    PubMed Central

    Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng

    2017-01-01

    Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure. PMID:28277933

  20. Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing.

    PubMed

    Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng

    2017-10-03

    Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure.

  1. In silico cloning, expression of Rieske-like apoprotein gene and protein subcellular localization in the Pacific oyster, Crassostrea gigas.

    PubMed

    He, Xiaocui; Zhang, Yang; Yu, Ziniu

    2010-10-01

    The Rieske protein gene of the Pacific oyster Crassostrea gigas was obtained by in silico cloning for the first time, and its expression profile and subcellular localization were determined. The full-length cDNA of Cgisp is 985 bp in length and contains 5'- and 3'-untranslated regions of 35 and 161 bp, respectively, with an open reading frame of 786 bp encoding a protein of 262 amino acids. The predicted molecular weight of 30 kDa of Cgisp protein was verified by prokaryotic expression. Conserved Rieske [2Fe-2S] cluster binding sites and highly matched-pair tertiary structure with 3CWB_E (Gallus gallus) were revealed by homologous analysis and molecular modeling. Eleven putative SNP sites and two conserved hexapeptide sequences, box I (THLGC) and II (PCHGS), were detected by multiple alignments. Real-time PCR analysis showed that Cgisp is expressed in a wide range of tissues, with adductor muscle exhibiting the highest expression level, suggesting a biological function in energy transduction. GFP tagging of Cgisp indicated a mitochondrial localization, further confirming its physiological function.

  2. Analysis of a new homozygous deletion in the tumor suppressor region at 3p12.3 reveals two novel intronic noncoding RNA genes.

    PubMed

    Angeloni, Debora; ter Elst, Arja; Wei, Ming Hui; van der Veen, Anneke Y; Braga, Eleonora A; Klimov, Eugene A; Timmer, Tineke; Korobeinikova, Luba; Lerman, Michael I; Buys, Charles H C M

    2006-07-01

    Homozygous deletions or loss of heterozygosity (LOH) at human chromosome band 3p12 are consistent features of lung and other malignancies, suggesting the presence of a tumor suppressor gene(s) (TSG) at this location. Only one gene has been cloned thus far from the overlapping region deleted in lung and breast cancer cell lines U2020, NCI H2198, and HCC38. It is DUTT1 (Deleted in U Twenty Twenty), also known as ROBO1, FLJ21882, and SAX3, according to HUGO. DUTT1, the human ortholog of the fly gene ROBO, has homology with NCAM proteins. Extensive analyses of DUTT1 in lung cancer have not revealed any mutations, suggesting that another gene(s) at this location could be of importance in lung cancer initiation and progression. Here, we report the discovery of a new, small, homozygous deletion in the small cell lung cancer (SCLC) cell line GLC20, nested in the overlapping, critical region. The deletion was delineated using several polymorphic markers and three overlapping P1 phage clones. Fiber-FISH experiments revealed the deletion was approximately 130 kb. Comparative genomic sequence analysis uncovered short sequence elements highly conserved among mammalian genomes and the chicken genome. The discovery of two EST clusters within the deleted region led to the isolation of two noncoding RNA (ncRNA) genes. These were subsequently found differentially expressed in various tumors when compared to their normal tissues. The ncRNA and other highly conserved sequence elements in the deleted region may represent miRNA targets of importance in cancer initiation or progression. Published 2006 Wiley-Liss, Inc.

  3. Genome-Wide Distribution, Organisation and Functional Characterization of Disease Resistance and Defence Response Genes across Rice Species

    PubMed Central

    Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj

    2015-01-01

    The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R-genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyantha genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows a high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and the rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species. PMID:25902056

  4. Global transcriptomics analysis of the Desulfovibrio vulgaris change from syntrophic growth with Methanosarcina barkeri to sulfidogenic metabolism

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Plugge, Caroline M.; Scholten, Johannes C.; Culley, David E.

    2010-09-01

    Desulfovibrio vulgaris is a metabolically flexible microorganism. It can use sulfate as electron acceptor to catabolize a variety of substrates, or in the absence of sulfate can utilize organic acids and alcohols by forming a syntrophic association with a hydrogen-scavenging partner to relieve inhibition by hydrogen. These alternative metabolic types increase the chance of survival for D. vulgaris in environments where one of the potential external electron acceptors becomes depleted. In this work, whole-genome D. vulgaris microarrays were used to determine relative transcript levels as D. vulgaris shifted its metabolism from syntroph in a lactate-oxidizing dual-culture with Methanosarcina barkeri to a sulfidogenic metabolism. Syntrophic dual-cultures were grown in two independent chemostats and perturbation was introduced after six volume changes with the addition of sulfate. The results showed that 132 genes were differentially expressed in D. vulgaris 2 hours after addition of sulfate. Functional analyses suggested that genes involved in cell envelope and energy metabolism were the most regulated when comparing syntrophic and sulfidogenic metabolism. Up-regulation was observed for genes encoding ATPase and the membrane-integrated energy conserving hydrogenase (Ech) when cells shifted to a sulfidogenic metabolism. A five-gene cluster encoding several lipo- and membrane-bound proteins was down-regulated when cells were shifted to a sulfidogenic metabolism. Interestingly, this gene cluster has orthologs found only in another syntrophic bacterium Syntrophobacter fumaroxidans and four recently sequenced Desulfovibrio strains. This study also identified several novel c-type cytochrome encoding genes which may be involved in the sulfidogenic metabolism.

  5. Identification of lethal cluster of genes in the yeast transcription network

    NASA Astrophysics Data System (ADS)

    Rho, K.; Jeong, H.; Kahng, B.

    2006-05-01

    Identification of essential or lethal genes would be one of the ultimate goals in drug designs. Here we introduce an in silico method to select the cluster with a high population of lethal genes, called lethal cluster, through microarray assay. We construct a gene transcription network based on the microarray expression level. Links are added one by one in the descending order of the Pearson correlation coefficients between two genes. As the link density p increases, two meaningful link densities pm and ps are observed. At pm, which is smaller than the percolation threshold, the number of disconnected clusters is maximum, and the lethal genes are highly concentrated in a certain cluster that needs to be identified. Thus the deletion of all genes in that cluster could efficiently lead to a lethal inviable mutant. This lethal cluster can be identified by an in silico method. As p increases further beyond the percolation threshold, the power law behavior in the degree distribution of a giant cluster appears at ps. We measure the degree of each gene at ps. With the information pertaining to the degrees of each gene at ps, we return to the point pm and calculate the mean degree of genes of each cluster. We find that the lethal cluster has the largest mean degree.
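
    The link-addition procedure described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes that a "cluster" means a connected group of at least two genes, it uses a union-find structure to track clusters, and the function names and toy expression profiles are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def cluster_counts(profiles):
    """Add links in descending order of correlation and record, after each
    link, the number of clusters (assumed here: components with >= 2 genes)."""
    genes = list(profiles)
    parent = {g: g for g in genes}
    size = {g: 1 for g in genes}

    def find(g):
        # Follow parent pointers to the cluster root, halving the path.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    pairs = sorted(
        combinations(genes, 2),
        key=lambda p: pearson(profiles[p[0]], profiles[p[1]]),
        reverse=True,
    )
    n_clusters, counts = 0, []
    for a, b in pairs:
        ra, rb = find(a), find(b)
        if ra != rb:
            if size[ra] == 1 and size[rb] == 1:
                n_clusters += 1  # two isolated genes form a new cluster
            elif size[ra] > 1 and size[rb] > 1:
                n_clusters -= 1  # two existing clusters merge into one
            parent[ra] = rb
            size[rb] += size[ra]
        counts.append(n_clusters)
    return counts


# Toy expression profiles, for illustration only.
toy = {"A": [1, 2, 3, 4], "B": [1, 2, 3, 5], "C": [4, 3, 2, 1], "D": [5, 3, 2, 1]}
counts = cluster_counts(toy)
# Link density at which the cluster count peaks (p_m in the abstract's notation).
p_m = (counts.index(max(counts)) + 1) / len(counts)
print(counts, p_m)
```

    Under these assumptions, the position of the maximum in `counts` corresponds to the link density p_m; computing the mean degree of each cluster at that point would require additionally storing the edge list, which is omitted here for brevity.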

  6. Molecular and phylogenetic characterization of the sieve element occlusion gene family in Fabaceae and non-Fabaceae plants.

    PubMed

    Rüping, Boris; Ernst, Antonia M; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A

    2010-10-08

    The phloem of dicotyledonous plants contains specialized P-proteins (phloem proteins) that accumulate during sieve element differentiation and remain parietally associated with the cisternae of the endoplasmic reticulum in mature sieve elements. Wounding causes P-protein filaments to accumulate at the sieve plates and block the translocation of photosynthate. Specialized, spindle-shaped P-proteins known as forisomes that undergo reversible calcium-dependent conformational changes have evolved exclusively in the Fabaceae. Recently, the molecular characterization of three genes encoding forisome components in the model legume Medicago truncatula (MtSEO1, MtSEO2 and MtSEO3; SEO = sieve element occlusion) was reported, but little is known about the molecular characteristics of P-proteins in non-Fabaceae. We performed a comprehensive genome-wide comparative analysis by screening the M. truncatula, Glycine max, Arabidopsis thaliana, Vitis vinifera and Solanum phureja genomes, and a Malus domestica EST library for homologs of MtSEO1, MtSEO2 and MtSEO3 and identified numerous novel SEO genes in Fabaceae and even non-Fabaceae plants, which do not possess forisomes. Even in Fabaceae some SEO genes appear to not encode forisome components. All SEO genes have a similar exon-intron structure and are expressed predominantly in the phloem. Phylogenetic analysis revealed the presence of several subgroups with Fabaceae-specific subgroups containing all of the known as well as newly identified forisome component proteins. We constructed Hidden Markov Models that identified three conserved protein domains, which characterize SEO proteins when present in combination. In addition, one common and three subgroup specific protein motifs were found in the amino acid sequences of SEO proteins. SEO genes are organized in genomic clusters and the conserved synteny allowed us to identify several M. truncatula vs G. max orthologs as well as paralogs within the G. max genome. 
The unexpected occurrence of forisome-like genes in non-Fabaceae plants may indicate that these proteins encode species-specific P-proteins, which is backed up by the phloem-specific expression profiles. The conservation of gene structure, the presence of specific motifs and domains and the genomic synteny argue for a common phylogenetic origin of forisomes and other P-proteins.

  7. Functional analysis of the upstream regulatory region of chicken miR-17-92 cluster.

    PubMed

    Cheng, Min; Zhang, Wen-jian; Xing, Tian-yu; Yan, Xiao-hong; Li, Yu-mao; Li, Hui; Wang, Ning

    2016-08-01

    The miR-17-92 cluster plays important roles in cell proliferation, differentiation, apoptosis, animal development and tumorigenesis. The transcriptional regulation of the miR-17-92 cluster has been extensively studied in mammals, but not in birds. To date, the genomic structure of the avian miR-17-92 cluster has not been fully determined. The promoter location and sequence of the miR-17-92 cluster have not been determined, due to the existence of a genomic gap sequence upstream of the miR-17-92 cluster in all the birds whose genomes have been sequenced. In this study, genome walking was used to close the genomic gap upstream of the chicken miR-17-92 cluster. In addition, bioinformatics analysis, reporter gene assay and truncation mutagenesis were used to investigate the functional role of the genomic gap sequence. Genome walking analysis showed that the gap region was 1704 bp long, and its GC content was 80.11%. Bioinformatics analysis showed that in the gap region, there was a 200 bp conserved sequence among the 10 species tested (Gallus gallus, Homo sapiens, Pan troglodytes, Bos taurus, Sus scrofa, Rattus norvegicus, Mus musculus, Possum, Danio rerio, Rana nigromaculata), which is the core promoter region of the mammalian miR-17-92 host gene (MIR17HG). A promoter luciferase reporter gene vector of the gap region was constructed and a reporter assay was performed. The result showed that the promoter activity of pGL3-cMIR17HG (-4228/-2506) was 417 times that of the negative control (empty pGL3 basic vector), suggesting that the chicken miR-17-92 cluster promoter exists in the gap region. To gain further insight into the promoter structure, two different truncations of the cloned gap sequence were generated by PCR. One had a truncation of 448 bp at the 5'-end and the other had a truncation of 894 bp at the 3'-end. Further reporter analysis showed that compared with the promoter activity of pGL3-cMIR17HG (-4228/-2506), the reporter activities of the 5'-end truncation and the 3'-end truncation were reduced by 19.82% and 60.14%, respectively. These data demonstrated that the important promoter region of the chicken miR-17-92 cluster is located in the -3400/-2506 bp region. Our results lay the foundation for revealing the transcriptional regulatory mechanisms of the chicken miR-17-92 cluster.
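
    The coordinate arithmetic behind the truncation result can be checked with a short sketch. The helper names `truncate` and `relative_activity` are hypothetical and introduced only for illustration; the fragment coordinates, truncation lengths and percentage reductions are taken from the abstract, and the convention that positions are negative upstream offsets is an assumption.

```python
def truncate(fragment, five_prime=0, three_prime=0):
    """Trim a (start, end) fragment given as negative upstream positions.

    The 5' end is the more negative coordinate. Trimming the 5' end moves
    the start toward zero; trimming the 3' end moves the end away from zero.
    """
    start, end = fragment
    new_start = start + five_prime
    new_end = end - three_prime
    if new_start >= new_end:
        raise ValueError("truncation removes the whole fragment")
    return (new_start, new_end)


def relative_activity(percent_reduction):
    """Fraction of full-length reporter activity that remains."""
    return 1.0 - percent_reduction / 100.0


full = (-4228, -2506)                           # cloned fragment from the abstract
five_trunc = truncate(full, five_prime=448)     # 5'-end truncation of 448 bp
three_trunc = truncate(full, three_prime=894)   # 3'-end truncation of 894 bp

# Region that is present in the full fragment but lost in the 3' truncation.
removed_by_three_trunc = (three_trunc[1], full[1])

print(five_trunc, three_trunc, removed_by_three_trunc)
print(relative_activity(19.82), relative_activity(60.14))
```

    Under these assumptions, the segment lost in the 3'-end truncation is -3400/-2506, the same region the abstract identifies as important, and that construct shows the larger drop in activity (60.14%).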

  8. Genetic interrelations in the actinomycin biosynthetic gene clusters of Streptomyces antibioticus IMRU 3720 and Streptomyces chrysomallus ATCC11523, producers of actinomycin X and actinomycin C

    PubMed Central

    Crnovčić, Ivana; Rückert, Christian; Semsary, Siamak; Lang, Manuel; Kalinowski, Jörn; Keller, Ullrich

    2017-01-01

    Sequencing the actinomycin (acm) biosynthetic gene cluster of Streptomyces antibioticus IMRU 3720, which produces actinomycin X (Acm X), revealed 20 genes organized into a highly similar framework as in the bi-armed acm C biosynthetic gene cluster of Streptomyces chrysomallus but without an attached additional extra arm of orthologues as in the latter. Curiously, the extra arm of the S. chrysomallus gene cluster turned out to perfectly match the single arm of the S. antibioticus gene cluster in the same order of orthologues including the presence of two pseudogenes, scacmM and scacmN, encoding a cytochrome P450 and its ferredoxin, respectively. Orthologues of the latter genes were both missing in the principal arm of the S. chrysomallus acm C gene cluster. All orthologues of the extra arm showed a G+C content different from that of their counterparts in the principal arm. Moreover, the similarities of translation products from the extra arm were all higher to the corresponding translation products of orthologue genes from the S. antibioticus acm X gene cluster than to those encoded by the principal arm of their own gene cluster. This suggests that the duplicated structure of the S. chrysomallus acm C biosynthetic gene cluster evolved from previous fusion between two one-armed acm gene clusters each from a different genetic background. However, while scacmM and scacmN in the extra arm of the S. chrysomallus acm C gene cluster are mutated and therefore are non-functional, their orthologues saacmM and saacmN in the S. antibioticus acm C gene cluster show no defects seemingly encoding active enzymes with functions specific for Acm X biosynthesis. Both acm biosynthetic gene clusters lack a kynurenine-3-monooxygenase gene necessary for biosynthesis of 3-hydroxy-4-methylanthranilic acid, the building block of the Acm chromophore, which suggests participation of a genome-encoded relevant monooxygenase during Acm biosynthesis in both S. chrysomallus and S. antibioticus. PMID:28435299

  10. Analyses of the NAC transcription factor gene family in Gossypium raimondii Ulbr.: chromosomal location, structure, phylogeny, and expression patterns.

    PubMed

    Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu

    2013-07-01

    NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.

  11. Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters.

    PubMed

    Lukashin, A V; Fuchs, R

    2001-05-01

    Cluster analysis of genome-wide expression data from DNA microarray hybridization studies has proved to be a useful tool for identifying biologically relevant groupings of genes and samples. In the present paper, we focus on several important issues related to clustering algorithms that have not yet been fully studied. We describe a simple and robust algorithm for the clustering of temporal gene expression profiles that is based on the simulated annealing procedure. In general, this algorithm is guaranteed to eventually find the globally optimal distribution of genes over clusters. We introduce an iterative scheme that serves to evaluate quantitatively the optimal number of clusters for each specific data set. The scheme is based on standard approaches used in regular statistical tests. The basic idea is to organize the search for the optimal number of clusters simultaneously with the optimization of the distribution of genes over clusters. The efficiency of the proposed algorithm has been evaluated by means of a reverse engineering experiment, that is, a situation in which the correct distribution of genes over clusters is known a priori. The employment of this statistically rigorous test has shown that our algorithm places more than 90% of genes into the correct clusters. Finally, the algorithm has been tested on real gene expression data (expression changes during yeast cell cycle) for which the fundamental patterns of gene expression and the assignment of genes to clusters are well understood from numerous previous studies.
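
    As a rough illustration of clustering by simulated annealing, the sketch below reassigns one profile at a time and accepts or rejects the move with the Metropolis rule. It is not the authors' algorithm: the cost function (within-cluster sum of squares), the geometric cooling schedule, and all names and parameter values are assumptions, the number of clusters `k` is fixed, and the iterative scheme for choosing the number of clusters is not reproduced.

```python
import math
import random


def within_cluster_cost(profiles, labels, k):
    """Sum of squared distances from each profile to its cluster centroid."""
    dim = len(profiles[0])
    cost = 0.0
    for c in range(k):
        members = [p for p, lab in zip(profiles, labels) if lab == c]
        if not members:
            continue  # an empty cluster contributes nothing
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        cost += sum((p[d] - centroid[d]) ** 2 for p in members for d in range(dim))
    return cost


def anneal_clusters(profiles, k, steps=5000, t_start=1.0, cooling=0.999, seed=0):
    """Assign profiles to k clusters by simulated annealing (illustrative only)."""
    rng = random.Random(seed)
    labels = [rng.randrange(k) for _ in profiles]
    cost = within_cluster_cost(profiles, labels, k)
    best_labels, best_cost = labels[:], cost
    temp = t_start
    for _ in range(steps):
        i = rng.randrange(len(profiles))
        old = labels[i]
        labels[i] = rng.randrange(k)  # propose moving one profile
        new_cost = within_cluster_cost(profiles, labels, k)
        delta = new_cost - cost
        # Metropolis rule: keep improvements, occasionally keep worse moves.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            cost = new_cost
            if cost < best_cost:
                best_labels, best_cost = labels[:], cost
        else:
            labels[i] = old  # reject the move
        temp *= cooling  # geometric cooling schedule (an assumption)
    return best_labels, best_cost


# Toy temporal profiles: two clearly separated groups of three.
profiles = [
    (0.0, 0.0, 0.0), (0.1, 0.0, 0.1), (0.0, 0.1, 0.0),
    (5.0, 5.0, 5.0), (5.1, 5.0, 4.9), (5.0, 4.9, 5.0),
]
labels, cost = anneal_clusters(profiles, k=2)
print(labels, cost)
```

    Tracking the best assignment seen so far is a convenience of this sketch; recomputing the full cost on every move keeps the code short but would be too slow for genome-scale data.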

  12. New Insights into the Diversity of the Genus Faecalibacterium.

    PubMed

    Benevides, Leandro; Burman, Sriti; Martin, Rebeca; Robert, Véronique; Thomas, Muriel; Miquel, Sylvie; Chain, Florian; Sokol, Harry; Bermudez-Humaran, Luis G; Morrison, Mark; Langella, Philippe; Azevedo, Vasco A; Chatel, Jean-Marc; Soares, Siomar

    2017-01-01

    Faecalibacterium prausnitzii is a commensal bacterium, ubiquitous in the gastrointestinal tracts of animals and humans. This species is a functionally important member of the microbiota and studies suggest it has an impact on the physiology and health of the host. F. prausnitzii is the only identified species in the genus Faecalibacterium, but a recent study clustered strains of this species in two different phylogroups. Here, we propose the existence of distinct species in this genus through the use of comparative genomics. Briefly, we performed analyses of 16S rRNA gene phylogeny, phylogenomics, whole genome Multi-Locus Sequence Typing (wgMLST), Average Nucleotide Identity (ANI), gene synteny, and pangenome to better elucidate the phylogenetic relationships among strains of Faecalibacterium. For this, we used 12 newly sequenced, assembled, and curated genomes of F. prausnitzii, which were isolated from feces of healthy volunteers from France and Australia, and combined these with published data from 5 strains downloaded from public databases. The phylogenetic analysis of the 16S rRNA sequences, together with the wgMLST profiles and a phylogenomic tree based on comparisons of genome similarity, all supported the clustering of Faecalibacterium strains in different genospecies. Additionally, the global analysis of gene synteny among all strains showed a highly fragmented profile, whereas the intra-cluster analyses revealed larger and more conserved collinear blocks. Finally, ANI analysis substantiated the presence of three distinct clusters (A, B, and C) composed of five, four, and four strains, respectively. The pangenome analysis of each cluster corroborated the classification of these clusters into three distinct species, each containing less variability than that found within the global pangenome of all strains. Here, we propose that comparison of pangenome subsets and their associated α values may be used as an alternative approach, together with ANI, in the in silico classification of new species. Altogether, our results provide evidence not only for the reconsideration of the phylogenetic and genomic relatedness among strains currently assigned to F. prausnitzii, but also the need for lineage (strain-based) differentiation of this taxon to better define how specific members might be associated with positive or negative host interactions.

  13. Specific DNA binding of the two chicken Deformed family homeodomain proteins, Chox-1.4 and Chox-a.

    PubMed Central

    Sasaki, H; Yokoyama, E; Kuroiwa, A

    1990-01-01

    The cDNA clones encoding two chicken Deformed (Dfd) family homeobox containing genes Chox-1.4 and Chox-a were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression. PMID:1970866

  14. Nimrod, a putative phagocytosis receptor with EGF repeats in Drosophila plasmatocytes.

    PubMed

    Kurucz, Eva; Márkus, Róbert; Zsámboki, János; Folkl-Medzihradszky, Katalin; Darula, Zsuzsanna; Vilmos, Péter; Udvardy, Andor; Krausz, Ildikó; Lukacsovich, Tamás; Gateff, Elisabeth; Zettervall, Carl-Johan; Hultmark, Dan; Andó, István

    2007-04-03

    The hemocytes, the blood cells of Drosophila, participate in the humoral and cellular immune defense reactions against microbes and parasites [1-8]. The plasmatocytes, one class of hemocytes, are phagocytically active and play an important role in immunity and development by removing microorganisms as well as apoptotic cells. On the surface of circulating and sessile plasmatocytes, we have now identified a protein, Nimrod C1 (NimC1), which is involved in the phagocytosis of bacteria. Suppression of NimC1 expression in plasmatocytes inhibited the phagocytosis of Staphylococcus aureus. Conversely, overexpression of NimC1 in S2 cells stimulated the phagocytosis of both S. aureus and Escherichia coli. NimC1 is a 90-100 kDa single-pass transmembrane protein with ten characteristic EGF-like repeats (NIM repeats). The nimC1 gene is part of a cluster of ten related nimrod genes at 34E on chromosome 2, and similar clusters of nimrod-like genes are conserved in other insects such as Anopheles and Apis. The Nimrod proteins are related to other putative phagocytosis receptors such as Eater and Draper from D. melanogaster and CED-1 from C. elegans. Together, they form a superfamily that also includes proteins that are encoded in the human genome.

  15. Transcriptome Analysis of Aspergillus flavus Reveals veA-Dependent Regulation of Secondary Metabolite Gene Clusters, Including the Novel Aflavarin Cluster

    PubMed Central

    Cary, J. W.; Han, Z.; Yin, Y.; Lohmar, J. M.; Shantappa, S.; Harris-Coward, P. Y.; Mack, B.; Ehrlich, K. C.; Wei, Q.; Arroyo-Manzanares, N.; Uka, V.; Vanhaecke, L.; Bhatnagar, D.; Yu, J.; Nierman, W. C.; Johns, M. A.; Sorensen, D.; Shen, H.; De Saeger, S.; Diana Di Mavungu, J.

    2015-01-01

    The global regulatory veA gene governs development and secondary metabolism in numerous fungal species, including Aspergillus flavus. This is especially relevant since A. flavus infects crops of agricultural importance worldwide, contaminating them with potent mycotoxins. The most well-known are aflatoxins, which are cytotoxic and carcinogenic polyketide compounds. The production of aflatoxins and the expression of genes implicated in the production of these mycotoxins are veA dependent. The genes responsible for the synthesis of aflatoxins are clustered, a signature common for genes involved in fungal secondary metabolism. Studies of the A. flavus genome revealed many gene clusters possibly connected to the synthesis of secondary metabolites. Many of these metabolites are still unknown, or the association between a known metabolite and a particular gene cluster has not yet been established. In the present transcriptome study, we show that veA is necessary for the expression of a large number of genes. Twenty-eight out of the predicted 56 secondary metabolite gene clusters include at least one gene that is differentially expressed depending on presence or absence of veA. One of the clusters under the influence of veA is cluster 39. The absence of veA results in a downregulation of the five genes found within this cluster. Interestingly, our results indicate that the cluster is expressed mainly in sclerotia. Chemical analysis of sclerotial extracts revealed that cluster 39 is responsible for the production of aflavarin. PMID:26209694

  16. Functional Conservation of Gsdma Cluster Genes Specifically Duplicated in the Mouse Genome

    PubMed Central

    Tanaka, Shigekazu; Mizushina, Youichi; Kato, Yoriko; Tamura, Masaru; Shiroishi, Toshihiko

    2013-01-01

    Mouse Gasdermin A3 (Gsdma3) is the causative gene for dominant skin mutations exhibiting alopecia. Mouse has two other Gsdma3-related genes, Gsdma and Gsdma2, whereas human and rat have only one related gene. To date, no skin mutation has been reported for human GSDMA and rat Gsdma as well as mouse Gsdma and Gsdma2. Therefore, it is possible that only Gsdma3 has gain-of-function type mutations to cause dominant skin phenotype. To elucidate functional divergence among the Gsdma-related genes in mice, and to infer the function of the human and rat orthologs, we examined in vivo function of mouse Gsdma by generating Gsdma knockout mice and transgenic mice that overexpress wild-type Gsdma or Gsdma harboring a point mutation (Alanine339Threonine). The Gsdma knockout mice show no visible phenotype, indicating that Gsdma is not essential for differentiation of epidermal cells and maintenance of the hair cycle, and that Gsdma is expressed specifically both in the inner root sheath of hair follicles and in suprabasal cell layers, whereas Gsdma3 is expressed only in suprabasal layers. By contrast, both types of the transgenic mice exhibited epidermal hyperplasia resembling the Gsdma3 mutations, although the phenotype depended on the genetic background. These results indicate that the mouse Gsdma and Gsdma3 genes share a common function to regulate epithelial maintenance and/or homeostasis, and suggest that the function of human GSDMA and rat Gsdma, which are orthologs of mouse Gsdma, is conserved as well. PMID:23979942

  17. Coordinating cell cycle-regulated histone gene expression through assembly and function of the Histone Locus Body

    PubMed Central

    Duronio, Robert J.; Marzluff, William F.

    2017-01-01

    Metazoan replication-dependent (RD) histone genes encode the only known cellular mRNAs that are not polyadenylated. These mRNAs end instead in a conserved stem-loop, which is formed by an endonucleolytic cleavage of the pre-mRNA. The genes for all 5 histone proteins are clustered in all metazoans and coordinately regulated with high levels of expression during S phase. Production of histone mRNAs occurs in a nuclear body called the Histone Locus Body (HLB), a subdomain of the nucleus defined by a concentration of factors necessary for histone gene transcription and pre-mRNA processing. These factors include the scaffolding protein NPAT, essential for histone gene transcription, and FLASH and U7 snRNP, both essential for histone pre-mRNA processing. Histone gene expression is activated by Cyclin E/Cdk2-mediated phosphorylation of NPAT at the G1-S transition. The concentration of factors within the HLB couples transcription with pre-mRNA processing, enhancing the efficiency of histone mRNA biosynthesis. PMID:28059623

  18. Activation of a C. elegans Antennapedia homologue in migrating cells controls their direction of migration.

    PubMed

    Salser, S J; Kenyon, C

    1992-01-16

    Anterior-posterior patterning in insects, vertebrates and nematodes involves members of conserved Antennapedia-class homeobox gene clusters (HOM-C) that are thought to give specific body regions their identities. The effects of these genes on region-specific body structures have been described extensively, particularly in Drosophila, but little is known about how HOM-C genes affect the behaviours of cells that migrate into their domains of function. In Caenorhabditis elegans, the Antennapedia-like HOM-C gene mab-5 not only specifies postembryonic fates of cells in a posterior body region, but also influences the migration of mesodermal and neural cells that move through this region. Here we show that as one neuroblast migrates into this posterior region, it switches on mab-5 gene expression; mab-5 then acts as a developmental switch to control the migratory behaviour of the neuroblast descendants. HOM-C genes can therefore not only direct region-specific patterns of cell division and differentiation, but can also act within migrating cells to programme region-specific migratory behaviour.

  19. 5S rRNA Promoter for Guide RNA Expression Enabled Highly Efficient CRISPR/Cas9 Genome Editing in Aspergillus niger.

    PubMed

    Zheng, Xiaomei; Zheng, Ping; Zhang, Kun; Cairns, Timothy C; Meyer, Vera; Sun, Jibin; Ma, Yanhe

    2018-04-30

    The CRISPR/Cas9 system is a revolutionary genome editing tool. However, in eukaryotes, search and optimization of a suitable promoter for guide RNA expression is a significant technical challenge. Here we used the industrially important fungus, Aspergillus niger, to demonstrate that the 5S rRNA gene, which is both highly conserved and efficiently expressed in eukaryotes, can be used as a guide RNA promoter. The gene editing system was established with 100% rates of precision gene modifications among dozens of transformants using short (40-bp) homologous donor DNA. This system was also applicable for generation of designer chromosomes, as evidenced by deletion of a 48 kb gene cluster required for biosynthesis of the mycotoxin fumonisin B1. Moreover, this system also facilitated simultaneous mutagenesis of multiple genes in A. niger. We anticipate that the use of the 5S rRNA gene as guide RNA promoter can broadly be applied for engineering highly efficient eukaryotic CRISPR/Cas9 toolkits. Additionally, the system reported here will enable development of designer chromosomes in model and industrially important fungi.

  20. A tripartite clustering analysis on microRNA, gene and disease model.

    PubMed

    Shen, Chengcheng; Liu, Ying

    2012-02-01

    Alteration of gene expression in response to regulatory molecules or mutations could lead to different diseases. MicroRNAs (miRNAs) have been discovered to be involved in regulation of gene expression and a wide variety of diseases. In a tripartite biological network of human miRNAs, their predicted target genes and the diseases caused by altered expressions of these genes, valuable knowledge about the pathogenicity of miRNAs, involved genes and related disease classes can be revealed by co-clustering miRNAs, target genes and diseases simultaneously. Tripartite co-clustering can lead to more informative results than traditional co-clustering with only two kinds of members and pass the hidden relational information along the relation chain by considering multi-type members. Here we report a spectral co-clustering algorithm for k-partite graph to find clusters with heterogeneous members. We use the method to explore the potential relationships among miRNAs, genes and diseases. The clusters obtained from the algorithm have significantly higher density than randomly selected clusters, which means members in the same cluster are more likely to have common connections. Results also show that miRNAs in the same family based on the hairpin sequences tend to belong to the same cluster. We also validate the clustering results by checking the correlation of enriched gene functions and disease classes in the same cluster. Finally, widely studied miR-17-92 and its paralogs are analyzed as a case study to reveal that genes and diseases co-clustered with the miRNAs are in accordance with current research findings.

  1. Heterologous expression of pikromycin biosynthetic gene cluster using Streptomyces artificial chromosome system.

    PubMed

    Pyeon, Hye-Rim; Nah, Hee-Ju; Kang, Seung-Hoon; Choi, Si-Sun; Kim, Eung-Soo

    2017-05-31

    Heterologous expression of biosynthetic gene clusters of natural microbial products has become an essential strategy for titer improvement and pathway engineering of various potentially-valuable natural products. A Streptomyces artificial chromosomal conjugation vector, pSBAC, was previously successfully applied for precise cloning and tandem integration of a large polyketide tautomycetin (TMC) biosynthetic gene cluster (Nah et al. in Microb Cell Fact 14(1):1, 2015), implying that this strategy could be employed to develop a custom overexpression scheme of natural product pathway clusters present in actinomycetes. To validate the pSBAC system as a generally-applicable heterologous overexpression system for a large-sized polyketide biosynthetic gene cluster in Streptomyces, another model polyketide compound, the pikromycin biosynthetic gene cluster, was precisely cloned and heterologously expressed using the pSBAC system. A unique HindIII restriction site was precisely inserted at one of the border regions of the pikromycin biosynthetic gene cluster within the chromosome of Streptomyces venezuelae, followed by site-specific recombination of pSBAC into the flanking region of the pikromycin gene cluster. Unlike the previous cloning process, one HindIII site integration step was skipped through pSBAC modification. pPik001, a pSBAC containing the pikromycin biosynthetic gene cluster, was directly introduced into two heterologous hosts, Streptomyces lividans and Streptomyces coelicolor, resulting in the production of 10-deoxymethynolide, a major pikromycin derivative. When two entire pikromycin biosynthetic gene clusters were tandemly introduced into the S. lividans chromosome, overproduction of 10-deoxymethynolide and the presence of pikromycin, which was previously not detected, were both confirmed. Moreover, comparative qRT-PCR results confirmed that the transcription of pikromycin biosynthetic genes was significantly upregulated in S. lividans containing tandem clusters of pikromycin biosynthetic gene clusters. The 60 kb pikromycin biosynthetic gene cluster was isolated in a single integration pSBAC vector. Introduction of the pikromycin biosynthetic gene cluster into the pikromycin non-producing strains resulted in higher pikromycin production. The utility of the pSBAC system as a precise cloning tool for large-sized biosynthetic gene clusters was verified through heterologous expression of the pikromycin biosynthetic gene cluster. Moreover, this pSBAC-driven heterologous expression strategy was confirmed to be an ideal approach for production of low and inconsistent natural products such as pikromycin in S. venezuelae, implying that this strategy could be employed for development of a custom overexpression scheme of natural product biosynthetic gene clusters in actinomycetes.

  2. From hormones to secondary metabolism: the emergence of metabolic gene clusters in plants.

    PubMed

    Chu, Hoi Yee; Wegel, Eva; Osbourn, Anne

    2011-04-01

    Gene clusters for the synthesis of secondary metabolites are a common feature of microbial genomes. Well-known examples include clusters for the synthesis of antibiotics in actinomycetes, and also for the synthesis of antibiotics and toxins in filamentous fungi. Until recently it was thought that genes for plant metabolic pathways were not clustered, and this is certainly true in many cases; however, five plant secondary metabolic gene clusters have now been discovered, all of them implicated in synthesis of defence compounds. An obvious assumption might be that these eukaryotic gene clusters have arisen by horizontal gene transfer from microbes, but there is compelling evidence to indicate that this is not the case. This raises intriguing questions about how widespread such clusters are, what the significance of clustering is, why genes for some metabolic pathways are clustered and those for others are not, and how these clusters form. In answering these questions we may hope to learn more about mechanisms of genome plasticity and adaptive evolution in plants. It is noteworthy that for the five plant secondary metabolic gene clusters reported so far, the enzymes for the first committed steps all appear to have been recruited directly or indirectly from primary metabolic pathways involved in hormone synthesis. This may or may not turn out to be a common feature of plant secondary metabolic gene clusters as new clusters emerge. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.

  3. Assembly and features of secondary metabolite biosynthetic gene clusters in Streptomyces ansochromogenes.

    PubMed

    Zhong, Xingyu; Tian, Yuqing; Niu, Guoqing; Tan, Huarong

    2013-07-01

    A draft genome sequence of Streptomyces ansochromogenes 7100 was generated using 454 sequencing technology. In combination with local BLAST searches and gap filling techniques, a comprehensive antiSMASH-based method was adopted to assemble the secondary metabolite biosynthetic gene clusters in the draft genome of S. ansochromogenes. A total of at least 35 putative gene clusters were identified and assembled. Transcriptional analysis showed that 20 of the 35 gene clusters were expressed in either or all of the three different media tested, whereas the other 15 gene clusters were silent in all three different media. This study provides a comprehensive method to identify and assemble secondary metabolite biosynthetic gene clusters in draft genomes of Streptomyces, and will significantly promote functional studies of these secondary metabolite biosynthetic gene clusters.

  4. Supervised group Lasso with applications to microarray data analysis

    PubMed Central

    Ma, Shuangge; Song, Xiao; Huang, Jian

    2007-01-01

    Background: A tremendous amount of effort has been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results: We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion: We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods. PMID:17316436

  5. Modularity of Plant Metabolic Gene Clusters: A Trio of Linked Genes That Are Collectively Required for Acylation of Triterpenes in Oat

    PubMed Central

    Mugford, Sam T.; Louveau, Thomas; Melton, Rachel; Qi, Xiaoquan; Bakht, Saleha; Hill, Lionel; Tsurushima, Tetsu; Honkanen, Suvi; Rosser, Susan J.; Lomonossoff, George P.; Osbourn, Anne

    2013-01-01

    Operon-like gene clusters are an emerging phenomenon in the field of plant natural products. The genes encoding some of the best-characterized plant secondary metabolite biosynthetic pathways are scattered across plant genomes. However, an increasing number of gene clusters encoding the synthesis of diverse natural products have recently been reported in plant genomes. These clusters have arisen through the neo-functionalization and relocation of existing genes within the genome, and not by horizontal gene transfer from microbes. The reasons for clustering are not yet clear, although this form of gene organization is likely to facilitate co-inheritance and co-regulation. Oats (Avena spp) synthesize antimicrobial triterpenoids (avenacins) that provide protection against disease. The synthesis of these compounds is encoded by a gene cluster. Here we show that a module of three adjacent genes within the wider biosynthetic gene cluster is required for avenacin acylation. Through the characterization of these genes and their encoded proteins we present a model of the subcellular organization of triterpenoid biosynthesis. PMID:23532069

  6. Genome sequence, comparative analysis and haplotype structure of the domestic dog.

    PubMed

    Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

    2005-12-08

    Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

  7. Cloning and sequencing of the Thermoanaerobacterium saccharolyticum B6A-RI apu gene and purification and characterization of the amylopullulanase from Escherichia coli.

    PubMed

    Ramesh, M V; Podkovyrov, S M; Lowe, S E; Zeikus, J G

    1994-01-01

    The amylopullulanase gene (apu) of the thermophilic anaerobic bacterium Thermoanaerobacterium saccharolyticum B6A-RI was cloned into Escherichia coli. The complete nucleotide sequence of the gene was determined. It encoded a protein consisting of 1,288 amino acids with a signal peptide of 35 amino acids. The enzyme purified from E. coli was a monomer with an M(r) of 142,000 +/- 2,000 and had the same catalytic and thermal characteristics as the native glycoprotein from T. saccharolyticum B6A. Linear alignment and the hydrophobic cluster analysis were used to compare this amylopullulanase with other amylolytic enzymes. Both methods revealed strictly conserved amino acid residues among these enzymes, and it is proposed that Asp-594, Asp-700, and Glu-623 are a putative catalytic triad of the T. saccharolyticum B6A-RI amylopullulanase.

  8. Cloning and sequencing of the Thermoanaerobacterium saccharolyticum B6A-RI apu gene and purification and characterization of the amylopullulanase from Escherichia coli.

    PubMed Central

    Ramesh, M V; Podkovyrov, S M; Lowe, S E; Zeikus, J G

    1994-01-01

    The amylopullulanase gene (apu) of the thermophilic anaerobic bacterium Thermoanaerobacterium saccharolyticum B6A-RI was cloned into Escherichia coli. The complete nucleotide sequence of the gene was determined. It encoded a protein consisting of 1,288 amino acids with a signal peptide of 35 amino acids. The enzyme purified from E. coli was a monomer with an M(r) of 142,000 +/- 2,000 and had the same catalytic and thermal characteristics as the native glycoprotein from T. saccharolyticum B6A. Linear alignment and the hydrophobic cluster analysis were used to compare this amylopullulanase with other amylolytic enzymes. Both methods revealed strictly conserved amino acid residues among these enzymes, and it is proposed that Asp-594, Asp-700, and Glu-623 are a putative catalytic triad of the T. saccharolyticum B6A-RI amylopullulanase. PMID:8117096

  9. Avian genomics lends insights into endocrine function in birds.

    PubMed

    Mello, C V; Lovell, P V

    2018-01-15

    The genomics era has brought along the completed sequencing of a large number of bird genomes that cover a broad range of the avian phylogenetic tree (>30 orders), leading to major novel insights into avian biology and evolution. Among recent findings, the discovery that birds lack a large number of protein coding genes that are organized in highly conserved syntenic clusters in other vertebrates is very intriguing, given the physiological importance of many of these genes. A considerable number of them play prominent endocrine roles, suggesting that birds evolved compensatory genetic or physiological mechanisms that allowed them to survive and thrive in spite of these losses. While further studies are needed to establish the exact extent of avian gene losses, these findings point to birds as potentially highly relevant model organisms for exploring the genetic basis and possible therapeutic approaches for a wide range of endocrine functions and disorders. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. A Gene Transfer Agent and a Dynamic Repertoire of Secretion Systems Hold the Keys to the Explosive Radiation of the Emerging Pathogen Bartonella

    PubMed Central

    Guy, Lionel; Nystedt, Björn; Toft, Christina; Zaremba-Niedzwiedzka, Katarzyna; Berglund, Eva C.; Granberg, Fredrik; Näslund, Kristina; Eriksson, Ann-Sofie; Andersson, Siv G. E.

    2013-01-01

    Gene transfer agents (GTAs) randomly transfer short fragments of a bacterial genome. A novel putative GTA was recently discovered in the mouse-infecting bacterium Bartonella grahamii. Although GTAs are widespread in phylogenetically diverse bacteria, their role in evolution is largely unknown. Here, we present a comparative analysis of 16 Bartonella genomes ranging from 1.4 to 2.6 Mb in size, including six novel genomes from Bartonella isolated from a cow, two moose, two dogs, and a kangaroo. A phylogenetic tree inferred from 428 orthologous core genes indicates that the deadly human pathogen B. bacilliformis is related to the ruminant-adapted clade, rather than being the earliest diverging species in the genus as previously thought. A gene flux analysis identified 12 genes for a GTA and a phage-derived origin of replication as the most conserved innovations. These are located in a region of a few hundred kb that also contains 8 insertions of gene clusters for type III, IV, and V secretion systems, and genes for putatively secreted molecules such as cholera-like toxins. The phylogenies indicate a recent transfer of seven genes in the virB gene cluster for a type IV secretion system from a cat-adapted B. henselae to a dog-adapted B. vinsonii strain. We show that the B. henselae GTA is functional and can transfer genes in vitro. We suggest that the maintenance of the GTA is driven by selection to increase the likelihood of horizontal gene transfer and argue that this process is beneficial at the population level, by facilitating adaptive evolution of the host-adaptation systems and thereby expansion of the host range size. The process counters gene loss and forces all cells to contribute to the production of the GTA and the secreted molecules. The results advance our understanding of the role that GTAs play for the evolution of bacterial genomes. PMID:23555299

  11. Evolutionary expansion and divergence in a large family of primate-specific zinc finger transcription factor genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hamilton, A T; Huntley, S; Tran-Gyamfi, M

    Although most genes are conserved as one-to-one orthologs in different mammalian orders, certain gene families have evolved to comprise different numbers and types of protein-coding genes through independent series of gene duplications, divergence and gene loss in each evolutionary lineage. One such family encodes KRAB-zinc finger (KRAB-ZNF) genes, which are likely to function as transcriptional repressors. One KRAB-ZNF subfamily, the ZNF91 clade, has expanded specifically in primates to comprise more than 110 loci in the human genome, yielding large gene clusters in human chromosomes 19 and 7 and smaller clusters or isolated copies at other chromosomal locations. Although phylogenetic analysis indicates that many of these genes arose before the split between old world monkeys and new world monkeys, the ZNF91 subfamily has continued to expand and diversify throughout the evolution of apes and humans. The paralogous loci are distinguished by sequence divergence within their zinc finger arrays, indicating a selection for proteins with different DNA binding specificities. RT-PCR and in situ hybridization data show that some of these ZNF genes can have tissue-specific expression patterns; however, many KRAB-ZNFs that are near-ubiquitous could also be playing very specific roles in halting target pathways in all tissues except for a few, where the target is released by the absence of its repressor. The number of variant KRAB-ZNF proteins is increased not only because of the large number of loci, but also because many loci can produce multiple splice variants, which because of the modular structure of these genes may have separate and perhaps even conflicting regulatory roles. The lineage-specific duplication and rapid divergence of this family of transcription factor genes suggests a role in determining species-specific biological differences and the evolution of novel primate traits.

  12. A gene transfer agent and a dynamic repertoire of secretion systems hold the keys to the explosive radiation of the emerging pathogen Bartonella.

    PubMed

    Guy, Lionel; Nystedt, Björn; Toft, Christina; Zaremba-Niedzwiedzka, Katarzyna; Berglund, Eva C; Granberg, Fredrik; Näslund, Kristina; Eriksson, Ann-Sofie; Andersson, Siv G E

    2013-03-01

    Gene transfer agents (GTAs) randomly transfer short fragments of a bacterial genome. A novel putative GTA was recently discovered in the mouse-infecting bacterium Bartonella grahamii. Although GTAs are widespread in phylogenetically diverse bacteria, their role in evolution is largely unknown. Here, we present a comparative analysis of 16 Bartonella genomes ranging from 1.4 to 2.6 Mb in size, including six novel genomes from Bartonella isolated from a cow, two moose, two dogs, and a kangaroo. A phylogenetic tree inferred from 428 orthologous core genes indicates that the deadly human pathogen B. bacilliformis is related to the ruminant-adapted clade, rather than being the earliest diverging species in the genus as previously thought. A gene flux analysis identified 12 genes for a GTA and a phage-derived origin of replication as the most conserved innovations. These are located in a region of a few hundred kb that also contains 8 insertions of gene clusters for type III, IV, and V secretion systems, and genes for putatively secreted molecules such as cholera-like toxins. The phylogenies indicate a recent transfer of seven genes in the virB gene cluster for a type IV secretion system from a cat-adapted B. henselae to a dog-adapted B. vinsonii strain. We show that the B. henselae GTA is functional and can transfer genes in vitro. We suggest that the maintenance of the GTA is driven by selection to increase the likelihood of horizontal gene transfer and argue that this process is beneficial at the population level, by facilitating adaptive evolution of the host-adaptation systems and thereby expansion of the host range size. The process counters gene loss and forces all cells to contribute to the production of the GTA and the secreted molecules. The results advance our understanding of the role that GTAs play for the evolution of bacterial genomes.

  13. Anhydrobiosis vs. aging: comparative genomics of protein repair L-isoaspartyl methyltransferases in the sleeping chironomid.

    NASA Astrophysics Data System (ADS)

    Gusev, Oleg; Kikawada, Takahiro; Shagimardanova, Elena; Suetsugu, Yoshitaka; Ayupov, Rustam

    The origin of anhydrobiosis in the larvae of the sleeping chironomid Polypedilum vanderplanki represents a unique example of a set of evolutionary events in a single species that resulted in the acquisition of a new ability allowing survival in an extremely changeable environment. Complex comparative analysis of the genome of P. vanderplanki resulted in the discovery of a set of features, including the existence of a set of unique clusters of genes contributing to desiccation resistance. Surprisingly, in several cases, the genes mainly contributing to the formation of the molecular shield in the larvae are sleeping chironomid-specific and have no homology with genes from other insects, including P. nubifer, a chironomid from the same genus. Protein L-isoaspartyl methyltransferase (PIMT) acts on proteins that have been non-enzymatically damaged due to age, and partially restores aspartic residues, extending the life of the polypeptides. PIMT is a highly conserved enzyme present in nearly all eukaryotes and microorganisms, mostly in a single copy (or in a few isoforms in certain plants and some bacteria). While conducting a comparative analysis of the genomes of two chironomid midge species that differ in their ability to withstand complete water loss, we noticed that the structure and number of PIMT-coding genes in the desiccation-resistant (anhydrobiotic) midge (Polypedilum vanderplanki, Pv) differ from those of the common desiccation-sensitive midge (Polypedilum nubifer, Pn) and the rest of insects. Both species have a clear orthologous PIMT shared by all insects. At the same time, in contrast to Pn, which has only one PIMT gene (PnPimt-1), the Pv genome contains 12 additional genes paralogous to Pimt1 (PvPimt-2-12), presumably coding functional PIMT proteins, which are arranged in a single cluster. Remarkably, the location of PvPimt-1 in the Pv genome is different from that of the rest of the Pimt-like genes. The PvPimt-1 gene is ubiquitously expressed during the life cycle, but expression of PvPimt2-12 is limited to the egg and larval stages. Finally, the expression of the Pimt1 gene in both chironomids was not changed in response to desiccation, while the clustered PvPimt2-12 genes showed strong up-regulation in response to water loss and other abiotic stresses. The abundance of PvPimt2-12 mRNAs was maximal in anhydrobiotic larvae, which resembles the case of plant seeds, where accumulation of PIMT provides additional protection for proteins during long dry storage. The predicted proteins of PvPimt2-12 contain a conserved L-isoaspartyl methyltransferase functional domain. At the same time, the length and structure of the N- and C-termini of the predicted proteins show significant variation, suggesting different substrate preferences or other specific properties of the different Pv-PIMT proteins. Furthermore, the multi-member family in Pv is the first observation of drastic expansion and evolution of Pimt genes in general, and particularly in a single insect species. This work was supported by the Russian Foundation for Basic Research (No. 12-08-33157 mol_a_ved and No. 14-04-01657_A).

  14. Prokaryotic Gene Clusters: A Rich Toolbox for Synthetic Biology

    PubMed Central

    Fischbach, Michael; Voigt, Christopher A.

    2014-01-01

    Bacteria construct elaborate nanostructures, obtain nutrients and energy from diverse sources, synthesize complex molecules, and implement signal processing to react to their environment. These complex phenotypes require the coordinated action of multiple genes, which are often encoded in a contiguous region of the genome, referred to as a gene cluster. Gene clusters sometimes contain all of the genes necessary and sufficient for a particular function. As an evolutionary mechanism, gene clusters facilitate the horizontal transfer of the complete function between species. Here, we review recent work on a number of clusters whose functions are relevant to biotechnology. Engineering these clusters has been hindered by their regulatory complexity, the need to balance the expression of many genes, and a lack of tools to design and manipulate DNA at this scale. Advances in synthetic biology will enable the large-scale bottom-up engineering of the clusters to optimize their functions, wake up cryptic clusters, or to transfer them between organisms. Understanding and manipulating gene clusters will move towards an era of genome engineering, where multiple functions can be “mixed-and-matched” to create a designer organism. PMID:21154668

  15. GraphTeams: a method for discovering spatial gene clusters in Hi-C sequencing data.

    PubMed

    Schulz, Tizian; Stoye, Jens; Doerr, Daniel

    2018-05-08

    Hi-C sequencing offers novel, cost-effective means to study the spatial conformation of chromosomes. We use data obtained from Hi-C experiments to provide new evidence for the existence of spatial gene clusters. These are sets of genes with associated functionality that exhibit close proximity to each other in the spatial conformation of chromosomes across several related species. We present the first gene cluster model capable of handling spatial data. Our model generalizes a popular computational model for gene cluster prediction, called δ-teams, from sequences to graphs. Following previous lines of research, we subsequently extend our model to allow for several vertices being associated with the same label. The model, called δ-teams with families, is particularly suitable for our application as it enables handling of gene duplicates. We develop algorithmic solutions for both models. We implemented the algorithm for discovering δ-teams with families and integrated it into a fully automated workflow for discovering gene clusters in Hi-C data, called GraphTeams. We applied it to human and mouse data to find intra- and interchromosomal gene cluster candidates. The results include intrachromosomal clusters that seem to exhibit a closer proximity in space than on their chromosomal DNA sequence. We further discovered interchromosomal gene clusters that contain genes from different chromosomes within the human genome, but are located on a single chromosome in mouse. By identifying δ-teams with families, we provide a flexible model to discover gene cluster candidates in Hi-C data. Our analysis of Hi-C data from human and mouse reveals several known gene clusters (thus validating our approach), but also a few sparsely studied or possibly unknown gene cluster candidates that could be the source of further experimental investigations.
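
    The abstract above does not spell out the δ-teams model, so the following is only a minimal sketch of the sequence-based idea that GraphTeams generalizes. It assumes the common definition in which a gene set is a δ-set if, in every genome, the sorted positions of its genes never leave a gap larger than δ (a δ-team would then be a maximal such set). The function names, the toy genomes, and the use of integer positions are illustrative assumptions, not taken from the GraphTeams software.

```python
from typing import Dict, Iterable, List


def is_delta_chain(positions: Dict[str, int], genes: Iterable[str], delta: int) -> bool:
    """True if the genes, sorted by position in one genome, have no gap > delta."""
    pos = sorted(positions[g] for g in genes)
    return all(b - a <= delta for a, b in zip(pos, pos[1:]))


def is_delta_set(genomes: List[Dict[str, int]], genes: Iterable[str], delta: int) -> bool:
    """True if the gene set forms a delta-chain in every genome (assumed definition)."""
    genes = list(genes)  # allow any iterable to be reused across genomes
    return all(is_delta_chain(p, genes, delta) for p in genomes)


# Hypothetical toy genomes: gene name -> position on the sequence.
genome_a = {"g1": 1, "g2": 2, "g3": 4, "g4": 9}
genome_b = {"g3": 10, "g1": 11, "g2": 13, "g4": 30}

print(is_delta_set([genome_a, genome_b], ["g1", "g2", "g3"], delta=2))
print(is_delta_set([genome_a, genome_b], ["g1", "g2", "g3", "g4"], delta=2))
```

    In the graph setting described in the abstract, the position difference would presumably be replaced by a distance between vertices in a Hi-C contact graph; that step is not sketched here.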

  16. Functional clustering of time series gene expression data by Granger causality

    PubMed Central

    2012-01-01

    Background: A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results: In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions: This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425
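
    To make the Granger idea concrete, here is a minimal pairwise sketch, not the set-based method of the study above. It assumes a lag of one time step, centered series, and an F-type statistic comparing a model that predicts a gene from its own past with one that also uses the past of a candidate cause. All function names and the simulated series are hypothetical.

```python
import random


def _center(series):
    """Subtract the mean so the regressions can omit an intercept."""
    mean = sum(series) / len(series)
    return [v - mean for v in series]


def _rss_one(target, own):
    """Residual sum of squares of the least-squares fit target ~ a * own."""
    a = sum(o * t for o, t in zip(own, target)) / sum(o * o for o in own)
    return sum((t - a * o) ** 2 for o, t in zip(own, target))


def _rss_two(target, own, other):
    """Residual sum of squares of target ~ a * own + b * other (normal equations)."""
    s11 = sum(o * o for o in own)
    s22 = sum(c * c for c in other)
    s12 = sum(o * c for o, c in zip(own, other))
    s1t = sum(o * t for o, t in zip(own, target))
    s2t = sum(c * t for c, t in zip(other, target))
    det = s11 * s22 - s12 * s12
    a = (s1t * s22 - s12 * s2t) / det
    b = (s11 * s2t - s12 * s1t) / det
    return sum((t - a * o - b * c) ** 2 for o, c, t in zip(own, other, target))


def granger_f(cause, effect):
    """F-type statistic for 'cause helps predict effect one step ahead'."""
    cause, effect = _center(cause), _center(effect)
    target, own, other = effect[1:], effect[:-1], cause[:-1]
    rss_restricted = _rss_one(target, own)
    rss_full = _rss_two(target, own, other)
    return (rss_restricted - rss_full) / (rss_full / (len(target) - 2))


# Simulated expression series: gene_y follows gene_x with a one-step delay.
rng = random.Random(0)
gene_x = [rng.gauss(0, 1) for _ in range(200)]
gene_y = [0.0] + [0.8 * gene_x[t - 1] + 0.1 * rng.gauss(0, 1) for t in range(1, 200)]

f_xy = granger_f(gene_x, gene_y)  # x -> y: expected to be large
f_yx = granger_f(gene_y, gene_x)  # y -> x: expected to be small
print(f_xy > f_yx)
```

    A clustering step in the spirit of the abstract could then treat such statistics as edge weights between genes; how the study defines and thresholds those intensities is not stated here.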

  17. Fine-Scale Analysis Reveals Cryptic Landscape Genetic Structure in Desert Tortoises

    PubMed Central

    Latch, Emily K.; Boarman, William I.; Walde, Andrew; Fleischer, Robert C.

    2011-01-01

    Characterizing the effects of landscape features on genetic variation is essential for understanding how landscapes shape patterns of gene flow and spatial genetic structure of populations. Most landscape genetics studies have focused on patterns of gene flow at a regional scale. However, the genetic structure of populations at a local scale may be influenced by a unique suite of landscape variables that have little bearing on connectivity patterns observed at broader spatial scales. We investigated fine-scale spatial patterns of genetic variation and gene flow in relation to features of the landscape in desert tortoise (Gopherus agassizii), using 859 tortoises genotyped at 16 microsatellite loci with associated data on geographic location, sex, elevation, slope, and soil type, and spatial relationship to putative barriers (power lines, roads). We used spatially explicit and non-explicit Bayesian clustering algorithms to partition the sample into discrete clusters, and characterize the relationships between genetic distance and ecological variables to identify factors with the greatest influence on gene flow at a local scale. Desert tortoises exhibit weak genetic structure at a local scale, and we identified two subpopulations across the study area. Although genetic differentiation between the subpopulations was low, our landscape genetic analysis identified both natural (slope) and anthropogenic (roads) landscape variables that have significantly influenced gene flow within this local population. We show that desert tortoise movements at a local scale are influenced by features of the landscape, and that these features are different than those that influence gene flow at larger scales. Our findings are important for desert tortoise conservation and management, particularly in light of recent translocation efforts in the region. 
    More generally, our results indicate that recent landscape changes can affect gene flow at a local scale and that their effects can be detected almost immediately. PMID:22132143

  18. Fine-scale analysis reveals cryptic landscape genetic structure in desert tortoises.

    PubMed

    Latch, Emily K; Boarman, William I; Walde, Andrew; Fleischer, Robert C

    2011-01-01

    Characterizing the effects of landscape features on genetic variation is essential for understanding how landscapes shape patterns of gene flow and spatial genetic structure of populations. Most landscape genetics studies have focused on patterns of gene flow at a regional scale. However, the genetic structure of populations at a local scale may be influenced by a unique suite of landscape variables that have little bearing on connectivity patterns observed at broader spatial scales. We investigated fine-scale spatial patterns of genetic variation and gene flow in relation to features of the landscape in desert tortoise (Gopherus agassizii), using 859 tortoises genotyped at 16 microsatellite loci with associated data on geographic location, sex, elevation, slope, and soil type, and spatial relationship to putative barriers (power lines, roads). We used spatially explicit and non-explicit Bayesian clustering algorithms to partition the sample into discrete clusters, and characterize the relationships between genetic distance and ecological variables to identify factors with the greatest influence on gene flow at a local scale. Desert tortoises exhibit weak genetic structure at a local scale, and we identified two subpopulations across the study area. Although genetic differentiation between the subpopulations was low, our landscape genetic analysis identified both natural (slope) and anthropogenic (roads) landscape variables that have significantly influenced gene flow within this local population. We show that desert tortoise movements at a local scale are influenced by features of the landscape, and that these features are different than those that influence gene flow at larger scales. Our findings are important for desert tortoise conservation and management, particularly in light of recent translocation efforts in the region. 
    More generally, our results indicate that recent landscape changes can affect gene flow at a local scale and that their effects can be detected almost immediately.

  19. Genetic Screening Strategy for Rapid Access to Polyether Ionophore Producers and Products in Actinomycetes

    PubMed Central

    Wang, Hao; Liu, Ning; Xi, Lijun; Rong, Xiaoying; Ruan, Jisheng; Huang, Ying

    2011-01-01

    Polyether ionophores are a unique class of polyketides with broad-spectrum activity and outstanding potency for the control of drug-resistant bacteria and parasites, and they are produced exclusively by actinomycetes. A special epoxidase gene encoding a critical tailoring enzyme involved in the biosynthesis of these compounds has been found in all five of the complete gene clusters of polyether ionophores published so far. To detect potential producer strains of these antibiotics, a pair of degenerate primers was designed according to the conserved regions of the five known polyether epoxidases. A total of 44 putative polyether epoxidase gene-positive strains were obtained by the PCR-based screening of 1,068 actinomycetes isolated from eight different habitats and 236 reference strains encompassing eight major families of Actinomycetales. The isolates spanned a wide taxonomic diversity based on 16S rRNA gene analysis, and actinomycetes isolated from acidic soils seemed to be a promising source of polyether ionophores. Four genera were detected to contain putative polyether epoxidases, including Micromonospora, which has not previously been reported to produce polyether ionophores. The designed primers also detected putative epoxidase genes from diverse known producer strains that produce polyether ionophores unrelated to the five published gene clusters. Moreover, phylogenetic and chemical analyses showed a strong correlation between the sequence of polyether epoxidases and the structure of encoded polyethers. Thirteen positive isolates were proven to be polyether ionophore producers as expected, and two new analogues were found. These results demonstrate the feasibility of using this epoxidase gene screening strategy to aid the rapid identification of known products and the discovery of unknown polyethers in actinomycetes. PMID:21421776
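
    The screening above relies on degenerate primers, that is, primers written with IUPAC ambiguity codes so that one primer matches several variants of a conserved region. The sketch below illustrates that matching logic only. The primer and template sequences are invented for illustration and are not the primers used in the study; the function names are likewise hypothetical.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def primer_matches(primer: str, site: str) -> bool:
    """True if every base of the site is allowed by the primer code at that position."""
    return len(primer) == len(site) and all(
        base in IUPAC[code] for code, base in zip(primer.upper(), site.upper())
    )


def find_sites(primer: str, template: str) -> list:
    """Start indices (0-based) of all template windows the degenerate primer matches."""
    k = len(primer)
    return [i for i in range(len(template) - k + 1)
            if primer_matches(primer, template[i:i + k])]


def degeneracy(primer: str) -> int:
    """Number of distinct plain sequences the degenerate primer stands for."""
    count = 1
    for code in primer.upper():
        count *= len(IUPAC[code])
    return count


# Hypothetical example: one degenerate 8-mer, one made-up template strand.
primer = "GGNTAYCC"
template = "ATGGCTACCCGGATATCCTT"
print(find_sites(primer, template))  # [2, 10]
print(degeneracy(primer))            # 8
```

    A real screen would also consider the reverse-complement strand, mismatch tolerance, and annealing conditions, none of which are modeled in this sketch.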

  20. Genetic screening strategy for rapid access to polyether ionophore producers and products in actinomycetes.

    PubMed

    Wang, Hao; Liu, Ning; Xi, Lijun; Rong, Xiaoying; Ruan, Jisheng; Huang, Ying

    2011-05-01

    Polyether ionophores are a unique class of polyketides with broad-spectrum activity and outstanding potency for the control of drug-resistant bacteria and parasites, and they are produced exclusively by actinomycetes. A special epoxidase gene encoding a critical tailoring enzyme involved in the biosynthesis of these compounds has been found in all five of the complete gene clusters of polyether ionophores published so far. To detect potential producer strains of these antibiotics, a pair of degenerate primers was designed according to the conserved regions of the five known polyether epoxidases. A total of 44 putative polyether epoxidase gene-positive strains were obtained by the PCR-based screening of 1,068 actinomycetes isolated from eight different habitats and 236 reference strains encompassing eight major families of Actinomycetales. The isolates spanned a wide taxonomic diversity based on 16S rRNA gene analysis, and actinomycetes isolated from acidic soils seemed to be a promising source of polyether ionophores. Four genera were detected to contain putative polyether epoxidases, including Micromonospora, which has not previously been reported to produce polyether ionophores. The designed primers also detected putative epoxidase genes from diverse known producer strains that produce polyether ionophores unrelated to the five published gene clusters. Moreover, phylogenetic and chemical analyses showed a strong correlation between the sequence of polyether epoxidases and the structure of encoded polyethers. Thirteen positive isolates were proven to be polyether ionophore producers as expected, and two new analogues were found. These results demonstrate the feasibility of using this epoxidase gene screening strategy to aid the rapid identification of known products and the discovery of unknown polyethers in actinomycetes.
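
    The abstract above says that a pair of degenerate primers was designed from the conserved regions of the five known polyether epoxidases, but it gives neither the primer sequences nor the design procedure. The sketch below only illustrates the general idea of collapsing aligned sequences into a degenerate consensus with IUPAC ambiguity codes. The function names and the five short example sequences are hypothetical and are not taken from the paper; the code assumes gap-free alignments of equal length containing only A, C, G and T.

```python
# IUPAC nucleotide ambiguity codes, keyed by the set of bases seen in a column.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def degenerate_consensus(aligned):
    """Collapse equal-length, gap-free DNA sequences into one IUPAC string."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    columns = zip(*(seq.upper() for seq in aligned))
    return "".join(IUPAC[frozenset(col)] for col in columns)


def degeneracy(primer):
    """Count the distinct plain sequences encoded by a degenerate primer."""
    bases_per_code = {code: len(bases) for bases, code in IUPAC.items()}
    total = 1
    for code in primer.upper():
        total *= bases_per_code[code]
    return total


# Hypothetical conserved-region fragments (made up for illustration only).
example_region = [
    "GGCACCGGCATC",
    "GGCACGGGCATC",
    "GGTACCGGCGTC",
    "GGCACCGGCATC",
    "GGCACCGGCATC",
]

if __name__ == "__main__":
    primer = degenerate_consensus(example_region)
    print(primer, degeneracy(primer))
```

    In practice primer design also weighs melting temperature, codon usage and total degeneracy, none of which this sketch models.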
