Sample records for duplicate gene expression

  1. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    PubMed

    Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

    2016-11-04

    Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types, duplication ages and co-expression consequences.

  2. Modes of gene duplication contribute differently to genetic novelty and redundancy, but show parallels across divergent angiosperms.

    PubMed

    Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P; Feltus, F Alex; Paterson, Andrew H

    2011-01-01

    Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution.

  3. Modes of Gene Duplication Contribute Differently to Genetic Novelty and Redundancy, but Show Parallels across Divergent Angiosperms

    PubMed Central

    Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P.; Feltus, F. Alex; Paterson, Andrew H.

    2011-01-01

    Background Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. Results In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Conclusion Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution. PMID:22164235

  4. Gene duplication, silencing and expression alteration govern the molecular evolution of PRC2 genes in plants.

    PubMed

    Furihata, Hazuka Y; Suenaga, Kazuya; Kawanabe, Takahiro; Yoshida, Takanori; Kawabe, Akira

    2016-10-13

    PRC2 genes were analyzed for their number of gene duplications, d N /d S ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquired functional roles in the endosperm.

  5. Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae).

    PubMed

    Baker, Richard H; Narechania, Apurva; Johns, Philip M; Wilkinson, Gerald S

    2012-08-19

    Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict.

  6. Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae)

    PubMed Central

    Baker, Richard H.; Narechania, Apurva; Johns, Philip M.; Wilkinson, Gerald S.

    2012-01-01

    Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict. PMID:22777023

  7. Evolution of vertebrate central nervous system is accompanied by novel expression changes of duplicate genes.

    PubMed

    Chen, Yuan; Ding, Yun; Zhang, Zuming; Wang, Wen; Chen, Jun-Yuan; Ueno, Naoto; Mao, Bingyu

    2011-12-20

    The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes. Copyright © 2011. Published by Elsevier Ltd.

  8. Inferring evolution of gene duplicates using probabilistic models and nonparametric belief propagation.

    PubMed

    Zeng, Jia; Hannenhalli, Sridhar

    2013-01-01

    Gene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. In turn, gene expression evolution is a critical component of overall functional evolution of paralogs. Inferring evolutionary history of gene expression among paralogs is therefore a problem of considerable interest. It also represents significant challenges. The standard approaches of evolutionary reconstruction assume that at an internal node of the duplication tree, the two duplicates evolve independently. However, because of various selection pressures functional evolution of the two paralogs may be coupled. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF). Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. These two interrelated problems of inferring gene expression and evolutionary fates of gene duplicates have not been studied together previously and motivate the present study. Here we propose a novel probabilistic framework and algorithm to simultaneously infer (i) ancestral gene expression and (ii) the likely fate (SF, NF, CF) at each duplication event during the evolution of gene family. Using tissue-specific gene expression data, we develop a nonparametric belief propagation (NBP) algorithm to predict the ancestral expression level as a proxy for function, and describe a novel probabilistic model that relates the predicted and known expression levels to the possible evolutionary fates. We validate our model using simulation and then apply it to a genome-wide set of gene duplicates in human. Our results suggest that SF tends to be more frequent at the earlier stage of gene family expansion, while NF occurs more frequently later on.

  9. Complexity of Gene Expression Evolution after Duplication: Protein Dosage Rebalancing

    PubMed Central

    Rogozin, Igor B.

    2014-01-01

    Ongoing debates about functional importance of gene duplications have been recently intensified by a heated discussion of the “ortholog conjecture” (OC). Under the OC, which is central to functional annotation of genomes, orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of gene ontology (GO) annotations and expression profiles, among within-species paralogs compared to orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. Subsequent studies suggested that the OC appears to be generally valid when applied to mammalian evolution but the complete picture of evolution of gene expression also has to incorporate lineage-specific aspects of paralogy. The observed complexity of gene expression evolution after duplication can be explained through selection for gene dosage effect combined with the duplication-degeneration-complementation model. This paper discusses expression divergence of recent duplications occurring before functional divergence of proteins encoded by duplicate genes. PMID:25197576

  10. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates

    PubMed Central

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-01-01

    Abstract The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. PMID:28981708

  11. Dating and functional characterization of duplicated genes in the apple (Malus domestica Borkh.) by analyzing EST data.

    PubMed

    Sanzol, Javier

    2010-05-14

    Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae), a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to gamma-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82% of duplicated genes, including 80% of young paralogs, showed uncorrelated expression profiles, suggesting extensive subfunctionalization and a role of gene duplication in the acquisition of novel patterns of gene expression. This study reports a genome-wide analysis of the mode of gene duplication in the apple, and provides evidence for its role in genome functional diversification by characterising three major processes: selective retention of paralogs, amplification of gene families, and changes in gene expression.

  12. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

    PubMed

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-11-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. Duplication and expression of CYC2-like genes in the origin and maintenance of corolla zygomorphy in Lamiales.

    PubMed

    Zhong, Jinshun; Kellogg, Elizabeth A

    2015-01-01

    Duplication, retention, and expression of CYCLOIDEA2 (CYC2)-like genes are thought to affect evolution of corolla symmetry. However, exactly what and how changes in CYC2-like genes correlate with the origin of corolla zygomorphy are poorly understood. We inferred and calibrated a densely sampled phylogeny of CYC2-like genes across the Lamiales and examined their expression in early diverging (EDL) and higher core clades (HCL). CYC2-like genes duplicated extensively in Lamiales, at least six times in core Lamiales (CL) around the Cretaceous-Paleogene (K-Pg) boundary, and seven more in EDL relatively more recently. Nested duplications and losses of CYC2-like paralogs are pervasive but may not correlate with transitions in corolla symmetry. We found evidence for dN/dS (ω) variation following gene duplications. CYC2-like paralogs in HCL show differential expression with higher expression in adaxial petals. Asymmetric expression but not recurrent duplication of CYC2-like genes correlates with the origin of corolla zygomorphy. Changes in both cis-regulatory and coding domains of CYC2-like genes are probably crucial for the evolution of corolla zygomorphy. Multiple selection regimes appear likely to play important roles in gene retention. The parallel duplications of CYC2-like genes are after the initial diversification of bumble bees and Euglossine bees. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  14. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders

    PubMed Central

    Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.

    2016-01-01

    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576

  15. Evolution of developmental roles of Pax2/5/8 paralogs after independent duplication in urochordate and vertebrate lineages.

    PubMed

    Bassham, Susan; Cañestro, Cristian; Postlethwait, John H

    2008-08-22

    Gene duplication provides opportunities for lineage diversification and evolution of developmental novelties. Duplicated genes generally either disappear by accumulation of mutations (nonfunctionalization), or are preserved either by the origin of positively selected functions in one or both duplicates (neofunctionalization), or by the partitioning of original gene subfunctions between the duplicates (subfunctionalization). The Pax2/5/8 family of important developmental regulators has undergone parallel expansion among chordate groups. After the divergence of urochordate and vertebrate lineages, two rounds of independent gene duplications resulted in the Pax2, Pax5, and Pax8 genes of most vertebrates (the sister group of the urochordates), and an additional duplication provided the pax2a and pax2b duplicates in teleost fish. Separate from the vertebrate genome expansions, a duplication also created two Pax2/5/8 genes in the common ancestor of ascidian and larvacean urochordates. To better understand mechanisms underlying the evolution of duplicated genes, we investigated, in the larvacean urochordate Oikopleura dioica, the embryonic gene expression patterns of Pax2/5/8 paralogs. We compared the larvacean and ascidian expression patterns to infer modular subfunctions present in the single pre-duplication Pax2/5/8 gene of stem urochordates, and we compared vertebrate and urochordate expression to infer the suite of Pax2/5/8 gene subfunctions in the common ancestor of olfactores (vertebrates + urochordates). Expression pattern differences of larvacean and ascidian Pax2/5/8 orthologs in the endostyle, pharynx and hindgut suggest that some ancestral gene functions have been partitioned differently to the duplicates in the two urochordate lineages. Novel expression in the larvacean heart may have resulted from the neofunctionalization of a Pax2/5/8 gene in the urochordates. Expression of larvacean Pax2/5/8 in the endostyle, in sites of epithelial remodeling, and in sensory tissues evokes like functions of Pax2, Pax5 and Pax8 in vertebrate embryos, and may indicate ancient origins for these functions in the chordate common ancestor. Comparative analysis of expression patterns of chordate Pax2/5/8 duplicates, rooted on the single-copy Pax2/5/8 gene of amphioxus, whose lineage diverged basally among chordates, provides new insights into the evolution and development of the heart, thyroid, pharynx, stomodeum and placodes in chordates; supports the controversial conclusion that the atrial siphon of ascidians and the otic placode in vertebrates are homologous; and backs the notion that Pax2/5/8 functioned in ancestral chordates to engineer epithelial fusions and perforations, including gill slit openings.

  16. Gene duplication and the evolution of phenotypic diversity in insect societies.

    PubMed

    Chau, Linh M; Goodisman, Michael A D

    2017-12-01

    Gene duplication is an important evolutionary process thought to facilitate the evolution of phenotypic diversity. We investigated if gene duplication was associated with the evolution of phenotypic differences in a highly social insect, the honeybee Apis mellifera. We hypothesized that the genetic redundancy provided by gene duplication could promote the evolution of social and sexual phenotypes associated with advanced societies. We found a positive correlation between sociality and rate of gene duplications across the Apoidea, indicating that gene duplication may be associated with sociality. We also discovered that genes showing biased expression between A. mellifera alternative phenotypes tended to be found more frequently than expected among duplicated genes than singletons. Moreover, duplicated genes had higher levels of caste-, sex-, behavior-, and tissue-biased expression compared to singletons, as expected if gene duplication facilitated phenotypic differentiation. We also found that duplicated genes were maintained in the A. mellifera genome through the processes of conservation, neofunctionalization, and specialization, but not subfunctionalization. Overall, we conclude that gene duplication may have facilitated the evolution of social and sexual phenotypes, as well as tissue differentiation. Thus this study further supports the idea that gene duplication allows species to evolve an increased range of phenotypic diversity. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.

  17. Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.

    PubMed

    Gout, Jean-Francois; Lynch, Michael

    2015-08-01

    Whole-genome duplications (WGDs) have contributed to gene-repertoire enrichment in many eukaryotic lineages. However, most duplicated genes are eventually lost and it is still unclear why some duplicated genes are evolutionary successful whereas others quickly turn to pseudogenes. Here, we show that dosage constraints are major factors opposing post-WGD gene loss in several Paramecium species that share a common ancestral WGD. We propose a model where a majority of WGD-derived duplicates preserve their ancestral function and are retained to produce enough of the proteins performing this same ancestral function. Under this model, the expression level of individual duplicated genes can evolve neutrally as long as they maintain a roughly constant summed expression, and this allows random genetic drift toward uneven contributions of the two copies to total expression. Our analysis suggests that once a high level of imbalance is reached, which can require substantial lengths of time, the copy with the lowest expression level contributes a small enough fraction of the total expression that selection no longer opposes its loss. Extension of our analysis to yeast species sharing a common ancestral WGD yields similar results, suggesting that duplicated-gene retention for dosage constraints followed by divergence in expression level and eventual deterministic gene loss might be a universal feature of post-WGD evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. Evolution of developmental roles of Pax2/5/8 paralogs after independent duplication in urochordate and vertebrate lineages

    PubMed Central

    Bassham, Susan; Cañestro, Cristian; Postlethwait, John H

    2008-01-01

    Background Gene duplication provides opportunities for lineage diversification and evolution of developmental novelties. Duplicated genes generally either disappear by accumulation of mutations (nonfunctionalization), or are preserved either by the origin of positively selected functions in one or both duplicates (neofunctionalization), or by the partitioning of original gene subfunctions between the duplicates (subfunctionalization). The Pax2/5/8 family of important developmental regulators has undergone parallel expansion among chordate groups. After the divergence of urochordate and vertebrate lineages, two rounds of independent gene duplications resulted in the Pax2, Pax5, and Pax8 genes of most vertebrates (the sister group of the urochordates), and an additional duplication provided the pax2a and pax2b duplicates in teleost fish. Separate from the vertebrate genome expansions, a duplication also created two Pax2/5/8 genes in the common ancestor of ascidian and larvacean urochordates. Results To better understand mechanisms underlying the evolution of duplicated genes, we investigated, in the larvacean urochordate Oikopleura dioica, the embryonic gene expression patterns of Pax2/5/8 paralogs. We compared the larvacean and ascidian expression patterns to infer modular subfunctions present in the single pre-duplication Pax2/5/8 gene of stem urochordates, and we compared vertebrate and urochordate expression to infer the suite of Pax2/5/8 gene subfunctions in the common ancestor of olfactores (vertebrates + urochordates). Expression pattern differences of larvacean and ascidian Pax2/5/8 orthologs in the endostyle, pharynx and hindgut suggest that some ancestral gene functions have been partitioned differently to the duplicates in the two urochordate lineages. Novel expression in the larvacean heart may have resulted from the neofunctionalization of a Pax2/5/8 gene in the urochordates. Expression of larvacean Pax2/5/8 in the endostyle, in sites of epithelial remodeling, and in sensory tissues evokes like functions of Pax2, Pax5 and Pax8 in vertebrate embryos, and may indicate ancient origins for these functions in the chordate common ancestor. Conclusion Comparative analysis of expression patterns of chordate Pax2/5/8 duplicates, rooted on the single-copy Pax2/5/8 gene of amphioxus, whose lineage diverged basally among chordates, provides new insights into the evolution and development of the heart, thyroid, pharynx, stomodeum and placodes in chordates; supports the controversial conclusion that the atrial siphon of ascidians and the otic placode in vertebrates are homologous; and backs the notion that Pax2/5/8 functioned in ancestral chordates to engineer epithelial fusions and perforations, including gill slit openings. PMID:18721460

  19. The evolution of duplicate gene expression in mammalian organs

    PubMed Central

    Guschanski, Katerina; Warnefors, Maria; Kaessmann, Henrik

    2017-01-01

    Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates. PMID:28743766

  20. Duplicated growth hormone genes in a passerine bird, the jungle crow (Corvus macrorhynchos).

    PubMed

    Arai, Natsumi; Iigo, Masayuki

    2010-07-02

    Molecular cloning, molecular phylogeny, gene structure and expression analyses of growth hormone (GH) were performed in a passerine bird, the jungle crow (Corvus macrorhynchos). Unexpectedly, duplicated GH cDNA and genes were identified and designated as GH1A and GH1B. In silico analyses identified the zebra finch orthologs. Both GH genes encode 217 amino acid residues and consist of five exons and four introns, spanning 5.2 kbp in GH1A and 4.2 kbp in GH1B. Predicted GH proteins of the jungle crow and zebra finch contain four conserved cysteine residues, suggesting duplicated GH genes are functional. Molecular phylogenetic analysis revealed that duplication of GH genes occur after divergence of the passerine lineage from the other avian orders as has been suggested from partial genomic DNA sequences of passerine GH genes. RT-PCR analyses confirmed expression of GH1A and GH1B in the pituitary gland. In addition, GH1A gene is expressed in all the tissues examined. However, expression of GH1B is confined to several brain areas and blood cells. These results indicate that the regulatory mechanisms of duplicated GH genes are different and that duplicated GH genes exert both endocrine and autocrine/paracrine functions. Copyright 2010 Elsevier Inc. All rights reserved.

  1. Models for loosely linked gene duplicates suggest lengthy persistence of both copies.

    PubMed

    O'Hely, Martin; Wockner, Leesa

    2007-06-21

    Consider the appearance of a duplicate copy of a gene at a locus linked loosely, if at all, to the locus at which the gene is usually found. If all copies of the gene are subject to non-functionalizing mutations, then two fates are possible: loss of functional copies at the duplicate locus (loss of duplicate expression), or loss of functional copies at the original locus (map change). This paper proposes a simple model to address the probability of map change, the time taken for a map change and/or loss of duplicate expression, and considers where in the spectrum between loss of duplicate expression and map change such a duplicate complex is likely to be found. The findings are: the probability of map change is always half the reciprocal of the population size N, the time for a map change to occur is order NlogN generations, and that there is a marked tendency for duplicates to remain near equi-frequency with the gene at the original locus for a large portion of that time. This is in excellent agreement with simulations.

  2. A limited role for gene duplications in the evolution of platypus venom.

    PubMed

    Wong, Emily S W; Papenfuss, Anthony T; Whittington, Camilla M; Warren, Wesley C; Belov, Katherine

    2012-01-01

    Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the "venome" of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation.

  3. A Limited Role for Gene Duplications in the Evolution of Platypus Venom

    PubMed Central

    Wong, Emily S. W.; Papenfuss, Anthony T.; Whittington, Camilla M.; Warren, Wesley C.; Belov, Katherine

    2012-01-01

    Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the “venome” of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation. PMID:21816864

  4. Levels of duplicate gene expression in armoured catfishes.

    PubMed

    Dunham, R A; Philipp, D P; Whitt, G S

    1980-01-01

    Species of armoured catfishes differ significantly in their cellular DNA content and chromosome number. Starch gel electrophoresis of isozymes was used to determine whether each of 16 enzyme loci was expressed in a single or duplicate state. The percent of enzyme loci exhibiting duplicate locus expression in Corydoras aeneus, Corydoras julii, Corydoras melanistius, and Corydoras myersi was 37.5 percent, 18.75 percent, 12.5 percent, and 6.25 percent, respectively. The percentage of loci expressed in duplicate is higher in the species with higher haploid DNA contents, which are 4.4 pg, 3.0 pg, and 2.3 pg, respectively. These differences in DNA contents are also associated with differences in chromosome number. These data are consistent with the hypothesis that increases in DNA contents and enzyme loci occur both by tetraploidization and by regional gene duplication and that these increases are then followed by a partial loss of DNA and a reduction in the number of the duplicate isozyme loci expressed. Such analyses provide insight into the mechanisms of genome amplification and reduction as well as insights into the fats of duplicate genes.

  5. Evolutionary Diversification of Insect Innexins

    PubMed Central

    Hughes, Austin L.

    2014-01-01

    Abstract Phylogenetic analysis of insect innexins supported the hypothesis that six major clades of insect innexins arose by gene duplication prior to the origin of the endopterygote insects. Within one of the six clades (the Zpg Clade), two independent gene duplication events were inferred to have occurred in the lineage of Drosophila , after the most recent common ancestor of the dipteran families Culicidae and Drosophilidae. The relationships among this clades were poorly resolved, except for a sister relationship between ShakB and Ogre. Gene expression data from FlyAtlas supported the hypothesis that the latter gene duplication events gave rise to functional differentiation, with Zpg showing a high level of expression in ovary, and Inx5 and Inx6 showing a high level of expression in testis. Because unduplicated members of this clade in Bombyx mori and Anopheles gambiae showed high levels of expression in both ovary and tests, the expression patterns of the Drosophila members of this clade provide evidence of subdivision of an ancestral gene function after gene duplication. PMID:25502029

  6. Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

    PubMed Central

    Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

    2014-01-01

    It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342

  7. Divergence and evolution of cotton bHLH proteins from diploid to allotetraploid.

    PubMed

    Liu, Bingliang; Guan, Xueying; Liang, Wenhua; Chen, Jiedan; Fang, Lei; Hu, Yan; Guo, Wangzhen; Rong, Junkang; Xu, Guohua; Zhang, Tianzhen

    2018-02-23

    Polyploidy is considered a major driving force in genome expansion, yielding duplicated genes whose expression may be conserved or divergence as a consequence of polyploidization. We compared the genome sequences of tetraploid cotton (Gossypium hirsutum) and its two diploid progenitors, G. arboreum and G. raimondii, and found that the bHLH genes were conserved over the polyploidization. Oppositely, the expression of the homeolgous gene pairs was diversified. The biased homeologous proportion for bHLH family is significantly higher (64.6%) than the genome wide homeologous expression bias (40%). Compared with cacao (T. cacao), orthologous genes only accounted for a small proportion (41.7%) of whole cotton bHLHs family. The further Ks analysis indicated that bHLH genes underwent at least two distinct episodes of whole genome duplication: a recent duplication (1.0-60.0 million years ago, MYA, 0.005 < Ks < 0.312) and an old duplication (> 60.0 MYA, 0.312 < Ks < 3.0). The old duplication event might have played a key role in the expansion of the bHLH family. Both recent and old duplicated pairs (68.8%) showed a divergent expression profile, indicating specialized functions. The expression diversification of the duplicated genes suggested it might be a universal feature of the long-term evolution of cotton. Overview of cotton bHLH proteins indicated a conserved and divergent evolution from diploids to allotetraploid. Our results provided an excellent example for studying the long-term evolution of polyploidy.

  8. Analyses of the NAC transcription factor gene family in Gossypium raimondii Ulbr.: chromosomal location, structure, phylogeny, and expression patterns.

    PubMed

    Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu

    2013-07-01

    NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.

  9. Population Level Purifying Selection and Gene Expression Shape Subgenome Evolution in Maize.

    PubMed

    Pophaly, Saurabh D; Tellier, Aurélien

    2015-12-01

    The maize ancestor experienced a recent whole-genome duplication (WGD) followed by gene erosion which generated two subgenomes, the dominant subgenome (maize1) experiencing fewer deletions than maize2. We take advantage of available extensive polymorphism and gene expression data in maize to study purifying selection and gene expression divergence between WGD retained paralog pairs. We first report a strong correlation in nucleotide diversity between duplicate pairs, except for upstream regions. We then show that maize1 genes are under stronger purifying selection than maize2. WGD retained genes have higher gene dosage and biased Gene Ontologies consistent with previous studies. The relative gene expression of paralogs across tissues demonstrates that 98% of duplicate pairs have either subfunctionalized in a tissuewise manner or have diverged consistently in their expression thereby preventing functional complementation. Tissuewise subfunctionalization seems to be a hallmark of transcription factors, whereas consistent repression occurs for macromolecular complexes. We show that dominant gene expression is a strong determinant of the strength of purifying selection, explaining the inferred stronger negative selection on maize1 genes. We propose a novel expression-based classification of duplicates which is more robust to explain observed polymorphism patterns than the subgenome location. Finally, upstream regions of repressed genes exhibit an enrichment in transposable elements which indicates a possible mechanism for expression divergence. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots.

    PubMed

    Wang, Yupeng; Ficklin, Stephen P; Wang, Xiyin; Feltus, F Alex; Paterson, Andrew H

    2016-01-01

    Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.

  11. Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots

    PubMed Central

    Wang, Yupeng; Ficklin, Stephen P.; Wang, Xiyin; Feltus, F. Alex; Paterson, Andrew H.

    2016-01-01

    Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots. PMID:27195960

  12. Gene family size conservation is a good indicator of evolutionary rates.

    PubMed

    Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen

    2010-08-01

    The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.

  13. Identification of three duplicated Spin genes in medaka (Oryzias latipes).

    PubMed

    Wang, Xiao-Lei; Mei, Jie; Sun, Min; Hong, Yun-Han; Gui, Jian-Fang

    2005-05-09

    Gene and genomic duplications are very important and frequent events in fish evolution, and the divergence of duplicated genes in sequences and functions is a focus of research on gene evolution. Here, we report the identification and characterization of three duplicated Spindlin (Spin) genes from medaka (Oryzias latipes): OlSpinA, OlSpinB, and OlSpinC. Molecular cloning, genomic DNA Blast analysis and phylogenetic relationship analysis demonstrated that the three duplicated OlSpin genes should belong to gene duplication. Furthermore, Western blot analysis revealed significant expression differences of the three OlSpins among different tissues and during embryogenesis in medaka, and suggested that sequence and functional divergence might have occurred in evolution among them.

  14. Pericentromeric Effects Shape the Patterns of Divergence, Retention, and Expression of Duplicated Genes in the Paleopolyploid Soybean[C][W

    PubMed Central

    Du, Jianchang; Tian, Zhixi; Sui, Yi; Zhao, Meixia; Song, Qijian; Cannon, Steven B.; Cregan, Perry; Ma, Jianxin

    2012-01-01

    The evolutionary forces that govern the divergence and retention of duplicated genes in polyploids are poorly understood. In this study, we first investigated the rates of nonsynonymous substitution (Ka) and the rates of synonymous substitution (Ks) for a nearly complete set of genes in the paleopolyploid soybean (Glycine max) by comparing the orthologs between soybean and its progenitor species Glycine soja and then compared the patterns of gene divergence and expression between pericentromeric regions and chromosomal arms in different gene categories. Our results reveal strong associations between duplication status and Ka and gene expression levels and overall low Ks and low levels of gene expression in pericentromeric regions. It is theorized that deleterious mutations can easily accumulate in recombination-suppressed regions, because of Hill-Robertson effects. Intriguingly, the genes in pericentromeric regions—the cold spots for meiotic recombination in soybean—showed significantly lower Ka and higher levels of expression than their homoeologs in chromosomal arms. This asymmetric evolution of two members of individual whole genome duplication (WGD)-derived gene pairs, echoing the biased accumulation of singletons in pericentromeric regions, suggests that distinct genomic features between the two distinct chromatin types are important determinants shaping the patterns of divergence and retention of WGD-derived genes. PMID:22227891

  15. Genome-Wide Identification and Expression Analysis of NBS-Encoding Genes in Malus x domestica and Expansion of NBS Genes Family in Rosaceae

    PubMed Central

    Arya, Preeti; Kumar, Gulshan; Acharya, Vishal; Singh, Anil K.

    2014-01-01

    Nucleotide binding site leucine-rich repeats (NBS-LRR) disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR) and coiled coil (CC) (1∶1) was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR) revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple. PMID:25232838

  16. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function

    PubMed Central

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estelle; Bertrand, Daniel; Leonard, Sherry

    2011-01-01

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca++, that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [125I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. PMID:21718690

  17. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function.

    PubMed

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estele; Bertrand, Daniel; Leonard, Sherry

    2011-10-15

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca(2+), that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [(125)I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. Copyright © 2011 Elsevier Inc. All rights reserved.

  18. Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.

    PubMed

    Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H

    2009-07-01

    Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.

  19. Calcium-Activated Potassium (BK) Channels Are Encoded by Duplicate slo1 Genes in Teleost Fishes

    PubMed Central

    Deitcher, David L.; Bass, Andrew H.

    2009-01-01

    Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue. PMID:19321796

  20. Yeast Interspecies Comparative Proteomics Reveals Divergence in Expression Profiles and Provides Insights into Proteome Resource Allocation and Evolutionary Roles of Gene Duplication*

    PubMed Central

    Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke

    2016-01-01

    Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. PMID:26560065

  1. Yeast Interspecies Comparative Proteomics Reveals Divergence in Expression Profiles and Provides Insights into Proteome Resource Allocation and Evolutionary Roles of Gene Duplication.

    PubMed

    Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke

    2016-01-01

    Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  2. Genome-wide identification and expression analysis of sulfate transporter (SULTR) genes in potato (Solanum tuberosum L.).

    PubMed

    Vatansever, Recep; Koc, Ibrahim; Ozyigit, Ibrahim Ilker; Sen, Ugur; Uras, Mehmet Emin; Anjum, Naser A; Pereira, Eduarda; Filiz, Ertugrul

    2016-12-01

    Solanum tuberosum genome analysis revealed 12 StSULTR genes encoding 18 transcripts. Among genes annotated at group level ( StSULTR I-IV), group III members formed the largest SULTRs-cluster and were potentially involved in biotic/abiotic stress responses via various regulatory factors, and stress and signaling proteins. Employing bioinformatics tools, this study performed genome-wide identification and expression analysis of SULTR (StSULTR) genes in potato (Solanum tuberosum L.). Very strict homology search and subsequent domain verification with Hidden Markov Model revealed 12 StSULTR genes encoding 18 transcripts. StSULTR genes were mapped on seven S. tuberosum chromosomes. Annotation of StSULTR genes was also done as StSULTR I-IV at group level based mainly on the phylogenetic distribution with Arabidopsis SULTRs. Several tandem and segmental duplications were identified between StSULTR genes. Among these duplications, Ka/Ks ratios indicated neutral nature of mutations that might not be causing any selection. Two segmental and one-tandem duplications were calculated to occur around 147.69, 180.80 and 191.00 million years ago (MYA), approximately corresponding to the time of monocot/dicot divergence. Two other segmental duplications were found to occur around 61.23 and 67.83 MYA, which is very close to the origination of monocotyledons. Most cis-regulatory elements in StSULTRs were found associated with major hormones (such as abscisic acid and methyl jasmonate), and defense and stress responsiveness. The cis-element distribution in duplicated gene pairs indicated the contribution of duplication events in conferring the neofunctionalization/s in StSULTR genes. Notably, RNAseq data analyses unveiled expression profiles of StSULTR genes under different stress conditions. In particular, expression profiles of StSULTR III members suggested their involvement in plant stress responses. Additionally, gene co-expression networks of these group members included various regulatory factors, stress and signaling proteins, and housekeeping and some other proteins with unknown functions.

  3. Diverse Cis-Regulatory Mechanisms Contribute to Expression Evolution of Tandem Gene Duplicates

    PubMed Central

    Baudouin-Gonzalez, Luís; Santos, Marília A; Tempesta, Camille; Sucena, Élio; Roch, Fernando; Tanaka, Kohtaro

    2017-01-01

    Abstract Pairs of duplicated genes generally display a combination of conserved expression patterns inherited from their unduplicated ancestor and newly acquired domains. However, how the cis-regulatory architecture of duplicated loci evolves to produce these expression patterns is poorly understood. We have directly examined the gene-regulatory evolution of two tandem duplicates, the Drosophila Ly6 genes CG9336 and CG9338, which arose at the base of the drosophilids between 40 and 60 Ma. Comparing the expression patterns of the two paralogs in four Drosophila species with that of the unduplicated ortholog in the tephritid Ceratitis capitata, we show that they diverged from each other as well as from the unduplicated ortholog. Moreover, the expression divergence appears to have occurred close to the duplication event and also more recently in a lineage-specific manner. The comparison of the tissue-specific cis-regulatory modules (CRMs) controlling the paralog expression in the four Drosophila species indicates that diverse cis-regulatory mechanisms, including the novel tissue-specific enhancers, differential inactivation, and enhancer sharing, contributed to the expression evolution. Our analysis also reveals a surprisingly variable cis-regulatory architecture, in which the CRMs driving conserved expression domains change in number, location, and specificity. Altogether, this study provides a detailed historical account that uncovers a highly dynamic picture of how the paralog expression patterns and their underlying cis-regulatory landscape evolve. We argue that our findings will encourage studying cis-regulatory evolution at the whole-locus level to understand how interactions between enhancers and other regulatory levels shape the evolution of gene expression. PMID:28961967

  4. Evolutionary history of glucose-6-phosphatase encoding genes in vertebrate lineages: towards a better understanding of the functions of multiple duplicates.

    PubMed

    Marandel, Lucie; Panserat, Stéphane; Plagnes-Juan, Elisabeth; Arbenoits, Eva; Soengas, José Luis; Bobe, Julien

    2017-05-02

    Glucose-6-phosphate (G6pc) is a key enzyme involved in the regulation of the glucose homeostasis. The present study aims at revisiting and clarifying the evolutionary history of g6pc genes in vertebrates. g6pc duplications happened by successive rounds of whole genome duplication that occurred during vertebrate evolution. g6pc duplicated before or around Osteichthyes/Chondrichthyes radiation, giving rise to g6pca and g6pcb as a consequence of the second vertebrate whole genome duplication. g6pca was lost after this duplication in Sarcopterygii whereas both g6pca and g6pcb then duplicated as a consequence of the teleost-specific whole genome duplication. One g6pca duplicate was lost after this duplication in teleosts. Similarly one g6pcb2 duplicate was lost at least in the ancestor of percomorpha. The analysis of the evolution of spatial expression patterns of g6pc genes in vertebrates showed that all g6pc were mainly expressed in intestine and liver whereas teleost-specific g6pcb2 genes were mainly and surprisingly expressed in brain and heart. g6pcb2b, one gene previously hypothesised to be involved in the glucose intolerant phenotype in trout, was unexpectedly up-regulated (as it was in liver) by carbohydrates in trout telencephalon without showing significant changes in other brain regions. This up-regulation is in striking contrast with expected glucosensing mechanisms suggesting that its positive response to glucose relates to specific unknown processes in this brain area. Our results suggested that the fixation and the divergence of g6pc duplicated genes during vertebrates' evolution may lead to adaptive novelty and probably to the emergence of novel phenotypes related to glucose homeostasis.

  5. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    PubMed

    Guo, Yong; Qiu, Li-Juan

    2013-01-01

    The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  6. Genome-wide analysis of soybean HD-Zip gene family and expression profiling under salinity and drought treatments.

    PubMed

    Chen, Xue; Chen, Zhu; Zhao, Hualin; Zhao, Yang; Cheng, Beijiu; Xiang, Yan

    2014-01-01

    Homeodomain-leucine zipper (HD-Zip) proteins, a group of homeobox transcription factors, participate in various aspects of normal plant growth and developmental processes as well as environmental responses. To date, no overall analysis or expression profiling of the HD-Zip gene family in soybean (Glycine max) has been reported. An investigation of the soybean genome revealed 88 putative HD-Zip genes. These genes were classified into four subfamilies, I to IV, based on phylogenetic analysis. In each subfamily, the constituent parts of gene structure and motif were relatively conserved. A total of 87 out of 88 genes were distributed unequally on 20 chromosomes with 36 segmental duplication events, indicating that segmental duplication is important for the expansion of the HD-Zip family. Analysis of the Ka/Ks ratios showed that the duplicated genes of the HD-Zip family basically underwent purifying selection with restrictive functional divergence after the duplication events. Analysis of expression profiles showed that 80 genes differentially expressed across 14 tissues, and 59 HD-Zip genes are differentially expressed under salinity and drought stress, with 20 paralogous pairs showing nearly identical expression patterns and three paralogous pairs diversifying significantly under drought stress. Quantitative real-time RT-PCR (qRT-PCR) analysis of six paralogous pairs of 12 selected soybean HD-Zip genes under both drought and salinity stress confirmed their stress-inducible expression patterns. This study presents a thorough overview of the soybean HD-Zip gene family and provides a new perspective on the evolution of this gene family. The results indicate that HD-Zip family genes may be involved in many plant responses to stress conditions. Additionally, this study provides a solid foundation for uncovering the biological roles of HD-Zip genes in soybean growth and development.

  7. Gene Duplication and Evolutionary Innovations in Hemoglobin-Oxygen Transport

    PubMed Central

    2016-01-01

    During vertebrate evolution, duplicated hemoglobin (Hb) genes diverged with respect to functional properties as well as the developmental timing of expression. For example, the subfamilies of genes that encode the different subunit chains of Hb are ontogenetically regulated such that functionally distinct Hb isoforms are expressed during different developmental stages. In some vertebrate taxa, functional differentiation between co-expressed Hb isoforms may also contribute to physiologically important divisions of labor. PMID:27053736

  8. Evolution and Expression Patterns of TCP Genes in Asparagales

    PubMed Central

    Madrigal, Yesenia; Alzate, Juan F.; Pabón-Mora, Natalia

    2017-01-01

    CYCLOIDEA-like genes are involved in the symmetry gene network, limiting cell proliferation in the dorsal regions of bilateral flowers in core eudicots. CYC-like and closely related TCP genes (acronym for TEOSINTE BRANCHED1, CYCLOIDEA, and PROLIFERATION CELL FACTOR) have been poorly studied in Asparagales, the largest order of monocots that includes both bilateral flowers in Orchidaceae (ca. 25.000 spp) and radially symmetrical flowers in Hypoxidaceae (ca. 200 spp). With the aim of assessing TCP gene evolution in the Asparagales, we isolated TCP-like genes from publicly available databases and our own transcriptomes of Cattleya trianae (Orchidaceae) and Hypoxis decumbens (Hypoxidaceae). Our matrix contains 452 sequences representing the three major clades of TCP genes. Besides the previously identified CYC specific core eudicot duplications, our ML phylogenetic analyses recovered an early CIN-like duplication predating all angiosperms, two CIN-like Asparagales-specific duplications and a duplication prior to the diversification of Orchidoideae and Epidendroideae. In addition, we provide evidence of at least three duplications of PCF-like genes in Asparagales. While CIN-like and PCF-like genes have multiplied in Asparagales, likely enhancing the genetic network for cell proliferation, CYC-like genes remain as single, shorter copies with low expression. Homogeneous expression of CYC-like genes in the labellum as well as the lateral petals suggests little contribution to the bilateral perianth in C. trianae. CIN-like and PCF-like gene expression suggests conserved roles in cell proliferation in leaves, sepals and petals, carpels, ovules and fruits in Asparagales by comparison with previously reported functions in core eudicots and monocots. This is the first large scale analysis of TCP-like genes in Asparagales that will serve as a platform for in-depth functional studies in emerging model monocots. PMID:28144250

  9. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus

    PubMed Central

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342

  10. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus.

    PubMed

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus.

  11. Genome-Wide Investigation and Expression Profiling of HD-Zip Transcription Factors in Foxtail Millet (Setaria italica L.).

    PubMed

    Chai, Wenbo; Si, Weina; Ji, Wei; Qin, Qianqian; Zhao, Manli; Jiang, Haiyang

    2018-01-01

    HD-Zip proteins represent the major transcription factors in higher plants, playing essential roles in plant development and stress responses. Foxtail millet is a crop to investigate the systems biology of millet and biofuel grasses and the HD-Zip gene family has not been studied in foxtail millet. For further investigation of the expression profile of the HD-Zip gene family in foxtail millet, a comprehensive genome-wide expression analysis was conducted in this study. We found 47 protein-encoding genes in foxtail millet using BLAST search tools; the putative proteins were classified into four subfamilies, namely, subfamilies I, II, III, and IV. Gene structure and motif analysis indicate that the genes in one subfamily were conserved. Promotor analysis showed that HD-Zip gene was involved in abiotic stress. Duplication analysis revealed that 8 (~17%) hdz genes were tandemly duplicated and 28 (58%) were segmentally duplicated; purifying duplication plays important roles in gene expansion. Microsynteny analysis revealed the maximum relationship in foxtail millet-sorghum and foxtail millet-rice. Expression profiling upon the abiotic stresses of drought and high salinity and the biotic stress of ABA revealed that some genes regulated responses to drought and salinity stresses via an ABA-dependent process, especially sihdz29 and sihdz45. Our study provides new insight into evolutionary and functional analyses of HD-Zip genes involved in environmental stress responses in foxtail millet.

  12. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution

    PubMed Central

    Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.

    2015-01-01

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392

  13. Genome evolution and speciation genetics of clawed frogs (Xenopus and Silurana).

    PubMed

    Evans, Ben J

    2008-05-01

    Speciation of clawed frogs occurred through bifurcation and reticulation of evolutionary lineages, and resulted in extant species with different ploidy levels. Duplicate gene evolution and expression in these animals provides a unique perspective into the earliest genomic transformations after vertebrate whole genome duplication (WGD) and suggests that functional constraints are relaxed compared to before duplication but still consistently strong for millions of years following WGD. Additionally, extensive quantitative expression divergence between duplicate genes occurred after WGD. Diversification of clawed frogs was potentially catalyzed by transposition and divergent resolution--processes that occur through different genetic mechanisms but that have analogous implications for genome structure. How sex determination is maintained after genome duplication is fundamental to our understanding of why allopolyploidization is so prevalent in this group, and why clawed frogs violate Haldane's Rule for hybrid sterility. Future studies of expression subfunctionalization in polyploids will shed light on the role and purviews of cis- and trans-regulatory elements in gene regulation.

  14. Sequence divergence in the 3'-untranslated region has an effect on the subfunctionalization of duplicate genes.

    PubMed

    Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen

    2012-11-01

    Recent studies demonstrated that sequence divergence in both transcriptional regulatory region and coding region contributes to the subfunctionalization of duplicate gene. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may arise from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 expresses during blastula and gastrula stages and in adult retina but silences from segmentation stage to hatching stage, vsx1A1 starts expression from segmentation onward. Comparing to that zebrafish vsx1 expresses in all the developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are under going to share the functions of ancestral vsx1. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 remains but the 3'-UTR of vsx1A2 has lost the capability of mediating bipolar cell specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes. © 2012 WILEY PERIODICALS, INC.

  15. Heterogeneous conservation of Dlx paralog co-expression in jawed vertebrates.

    PubMed

    Debiais-Thibaud, Mélanie; Metcalfe, Cushla J; Pollack, Jacob; Germon, Isabelle; Ekker, Marc; Depew, Michael; Laurenti, Patrick; Borday-Birraux, Véronique; Casane, Didier

    2013-01-01

    The Dlx gene family encodes transcription factors involved in the development of a wide variety of morphological innovations that first evolved at the origins of vertebrates or of the jawed vertebrates. This gene family expanded with the two rounds of genome duplications that occurred before jawed vertebrates diversified. It includes at least three bigene pairs sharing conserved regulatory sequences in tetrapods and teleost fish, but has been only partially characterized in chondrichthyans, the third major group of jawed vertebrates. Here we take advantage of developmental and molecular tools applied to the shark Scyliorhinus canicula to fill in the gap and provide an overview of the evolution of the Dlx family in the jawed vertebrates. These results are analyzed in the theoretical framework of the DDC (Duplication-Degeneration-Complementation) model. The genomic organisation of the catshark Dlx genes is similar to that previously described for tetrapods. Conserved non-coding elements identified in bony fish were also identified in catshark Dlx clusters and showed regulatory activity in transgenic zebrafish. Gene expression patterns in the catshark showed that there are some expression sites with high conservation of the expressed paralog(s) and other expression sites with events of paralog sub-functionalization during jawed vertebrate diversification, resulting in a wide variety of evolutionary scenarios within this gene family. Dlx gene expression patterns in the catshark show that there has been little neo-functionalization in Dlx genes over gnathostome evolution. In most cases, one tandem duplication and two rounds of vertebrate genome duplication have led to at least six Dlx coding sequences with redundant expression patterns followed by some instances of paralog sub-functionalization. Regulatory constraints such as shared enhancers, and functional constraints including gene pleiotropy, may have contributed to the evolutionary inertia leading to high redundancy between gene expression patterns.

  16. Lineage-Specific Evolutionary Histories and Regulation of Major Starch Metabolism Genes during Banana Ripening

    PubMed Central

    Jourda, Cyril; Cardi, Céline; Gibert, Olivier; Giraldo Toro, Andrès; Ricci, Julien; Mbéguié-A-Mbéguié, Didier; Yahiaoui, Nabila

    2016-01-01

    Starch is the most widespread and abundant storage carbohydrate in plants. It is also a major feature of cultivated bananas as it accumulates to large amounts during banana fruit development before almost complete conversion to soluble sugars during ripening. Little is known about the structure of major gene families involved in banana starch metabolism and their evolution compared to other species. To identify genes involved in banana starch metabolism and investigate their evolutionary history, we analyzed six gene families playing a crucial role in plant starch biosynthesis and degradation: the ADP-glucose pyrophosphorylases (AGPases), starch synthases (SS), starch branching enzymes (SBE), debranching enzymes (DBE), α-amylases (AMY) and β-amylases (BAM). Using comparative genomics and phylogenetic approaches, these genes were classified into families and sub-families and orthology relationships with functional genes in Eudicots and in grasses were identified. In addition to known ancestral duplications shaping starch metabolism gene families, independent evolution in banana and grasses also occurred through lineage-specific whole genome duplications for specific sub-families of AGPase, SS, SBE, and BAM genes; and through gene-scale duplications for AMY genes. In particular, banana lineage duplications yielded a set of AGPase, SBE and BAM genes that were highly or specifically expressed in banana fruits. Gene expression analysis highlighted a complex transcriptional reprogramming of starch metabolism genes during ripening of banana fruits. A differential regulation of expression between banana gene duplicates was identified for SBE and BAM genes, suggesting that part of starch metabolism regulation in the fruit evolved in the banana lineage. PMID:27994606

  17. Heterogeneous expression pattern of tandem duplicated sHsps genes during fruit ripening in two tomato species

    NASA Astrophysics Data System (ADS)

    Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.

    2016-04-01

    The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.

  18. Evidence of function for conserved noncoding sequences in Arabidopsis thaliana.

    PubMed

    Spangler, Jacob B; Subramaniam, Sabarinath; Freeling, Michael; Feltus, F Alex

    2012-01-01

    • Whole genome duplication events provide a lineage with a large reservoir of genes that can be molded by evolutionary forces into phenotypes that fit alternative environments. A well-studied whole genome duplication, the α-event, occurred in an ancestor of the model plant Arabidopsis thaliana. Retained segments of the α-event have been defined in recent years in the form of duplicate protein coding sequences (α-pairs) and associated conserved noncoding DNA sequences (CNSs). Our aim was to identify any association between CNSs and α-pair co-functionality at the gene expression level. • Here, we tested for correlation between CNS counts and α-pair co-expression and expression intensity across nine expression datasets: aerial tissue, flowers, leaves, roots, rosettes, seedlings, seeds, shoots and whole plants. • We provide evidence for a putative regulatory role of the CNSs. The association of CNSs with α-pair co-expression and expression intensity varied by gene function, subgene position and the presence of transcription factor binding motifs. A range of possible CNS regulatory mechanisms, including intron-mediated enhancement, messenger RNA fold stability and transcriptional regulation, are discussed. • This study provides a framework to understand how CNS motifs are involved in the maintenance of gene expression after a whole genome duplication event. © 2011 The Authors. New Phytologist © 2011 New Phytologist Trust.

  19. Cdx ParaHox genes acquired distinct developmental roles after gene duplication in vertebrate evolution.

    PubMed

    Marlétaz, Ferdinand; Maeso, Ignacio; Faas, Laura; Isaacs, Harry V; Holland, Peter W H

    2015-08-01

    The functional consequences of whole genome duplications in vertebrate evolution are not fully understood. It remains unclear, for instance, why paralogues were retained in some gene families but extensively lost in others. Cdx homeobox genes encode conserved transcription factors controlling posterior development across diverse bilaterians. These genes are part of the ParaHox gene cluster. Multiple Cdx copies were retained after genome duplication, raising questions about how functional divergence, overlap, and redundancy respectively contributed to their retention and evolutionary fate. We examined the degree of regulatory and functional overlap between the three vertebrate Cdx genes using single and triple morpholino knock-down in Xenopus tropicalis followed by RNA-seq. We found that one paralogue, Cdx4, has a much stronger effect on gene expression than the others, including a strong regulatory effect on FGF and Wnt genes. Functional annotation revealed distinct and overlapping roles and subtly different temporal windows of action for each gene. The data also reveal a colinear-like effect of Cdx genes on Hox genes, with repression of Hox paralogy groups 1 and 2, and activation increasing from Hox group 5 to 11. We also highlight cases in which duplicated genes regulate distinct paralogous targets revealing pathway elaboration after whole genome duplication. Despite shared core pathways, Cdx paralogues have acquired distinct regulatory roles during development. This implies that the degree of functional overlap between paralogues is relatively low and that gene expression pattern alone should be used with caution when investigating the functional evolution of duplicated genes. We therefore suggest that developmental programmes were extensively rewired after whole genome duplication in the early evolution of vertebrates.

  20. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies

    PubMed Central

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D.

    2016-01-01

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella. We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes—and that the butterfly proboscis is involved in digestive enzyme production. PMID:27553646

  1. Duplication of 17(p11.2p11.2) in a male child with autism and severe language delay.

    PubMed

    Nakamine, Alisa; Ouchanov, Leonid; Jiménez, Patricia; Manghi, Elina R; Esquivel, Marcela; Monge, Silvia; Fallas, Marietha; Burton, Barbara K; Szomju, Barbara; Elsea, Sarah H; Marshall, Christian R; Scherer, Stephen W; McInnes, L Alison

    2008-03-01

    Duplications of 17(p11.2p11.2) have been associated with various behavioral manifestations including attention deficits, obsessive-compulsive symptoms, autistic traits, and language delay. We are conducting a genetic study of autism and are screening all cases for submicroscopic chromosomal abnormalities, in addition to standard karyotyping, and fragile X testing. Using array-based comparative genomic hybridization analysis of data from the Affymetrix GeneChip(R) Human Mapping Array set, we detected a duplication of approximately 3.3 Mb on chromosome 17p11.2 in a male child with autism and severe expressive language delay. The duplication was confirmed by measuring the copy number of genomic DNA using quantitative polymerase chain reaction. Gene expression analyses revealed increased expression of three candidate genes for the Smith-Magenis neurobehavioral phenotype, RAI1, DRG2, and RASD1, in transformed lymphocytes from Case 81A, suggesting gene dosage effects. Our results add to a growing body of evidence suggesting that duplications of 17(p11.2p11.2) result in language delay as well as autism and related phenotypes. As Smith-Magenis syndrome is also associated with language delay, a gene involved in acquisition of language may lie within this interval. Whether a parent of origin effect, gender of the case, the presence of allelic variation, or changes in expression of genes outside the breakpoints influence the resultant phenotype remains to be determined. (c) 2007 Wiley-Liss, Inc.

  2. Genome-wide identification, phylogeny, and gonadal expression of fox genes in Nile tilapia, Oreochromis niloticus.

    PubMed

    Yuan, Jing; Tao, Wenjing; Cheng, Yunying; Huang, Baofeng; Wang, Deshou

    2014-08-01

    The fox genes play important roles in various biological processes, including sexual development. In the present study, we isolated 65 fox genes, belonging to 18 subfamilies named A-R, from Nile tilapia through genome-wide screening. Twenty-four of them have two or three (foxm1) copies. Furthermore, 16, 25, 68, and 45 fox members were isolated from nematodes, protochordates, teleosts, and tetrapods, respectively. Phylogenetic analyses indicated fox gene family had undergone three expansions parallel to the three rounds of genome duplication during evolution. We also analyzed the clustered fox genes and found that apparent linkage duplication existed in teleosts, which further supported fish-specific genome duplication hypothesis. In addition, species- and lineage-specific duplication is another reason for fox gene family expansion. Based on the four pairs of XX and XY gonadal transcriptome data from four critical developmental stages, we analyzed the expression profile of all fox genes and identified sexually dimorphic fox genes at each stage. All fox genes were detected in gonads, with 15 of them at the background expression level (total read per kb per million reads, RPKM < 10), 29 at moderate expression level (10 < total RPKM < 100), and 21 at high expression level (total RPKM > 100). There are 27, 24, 28, and 9 sexually dimorphic fox genes at 5, 30, 90, and 180 days after hatching (dah), respectively. foxq1a, foxf1, foxr1, and foxr1 were identified as the most differentially expressed genes at each stage. foxl2 was characterized as XX-dominant gene, while foxd5, foxi3, foxn3, foxj1a, foxj3b, and foxo6b were characterized as XY-dominant genes. qPCR and in situ hybridization of foxh1 and foxj1a were performed to confirm the expression profiles and to validate the transcriptome data. Our results suggest that fox genes might play important roles in sex determination and gonadal development in teleosts.

  3. Genome-Wide Analysis of Soybean HD-Zip Gene Family and Expression Profiling under Salinity and Drought Treatments

    PubMed Central

    Chen, Xue; Chen, Zhu; Zhao, Hualin; Zhao, Yang; Cheng, Beijiu; Xiang, Yan

    2014-01-01

    Background Homeodomain-leucine zipper (HD-Zip) proteins, a group of homeobox transcription factors, participate in various aspects of normal plant growth and developmental processes as well as environmental responses. To date, no overall analysis or expression profiling of the HD-Zip gene family in soybean (Glycine max) has been reported. Methods and Findings An investigation of the soybean genome revealed 88 putative HD-Zip genes. These genes were classified into four subfamilies, I to IV, based on phylogenetic analysis. In each subfamily, the constituent parts of gene structure and motif were relatively conserved. A total of 87 out of 88 genes were distributed unequally on 20 chromosomes with 36 segmental duplication events, indicating that segmental duplication is important for the expansion of the HD-Zip family. Analysis of the Ka/Ks ratios showed that the duplicated genes of the HD-Zip family basically underwent purifying selection with restrictive functional divergence after the duplication events. Analysis of expression profiles showed that 80 genes differentially expressed across 14 tissues, and 59 HD-Zip genes are differentially expressed under salinity and drought stress, with 20 paralogous pairs showing nearly identical expression patterns and three paralogous pairs diversifying significantly under drought stress. Quantitative real-time RT-PCR (qRT-PCR) analysis of six paralogous pairs of 12 selected soybean HD-Zip genes under both drought and salinity stress confirmed their stress-inducible expression patterns. Conclusions This study presents a thorough overview of the soybean HD-Zip gene family and provides a new perspective on the evolution of this gene family. The results indicate that HD-Zip family genes may be involved in many plant responses to stress conditions. Additionally, this study provides a solid foundation for uncovering the biological roles of HD-Zip genes in soybean growth and development. PMID:24498296

  4. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

    PubMed

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

    2015-06-08

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. GENE-dosage effects on fitness in recent adaptive duplications: ace-1 in the mosquito Culex pipiens.

    PubMed

    Labbé, Pierrick; Milesi, Pascal; Yébakima, André; Pasteur, Nicole; Weill, Mylène; Lenormand, Thomas

    2014-07-01

    Gene duplications have long been advocated to contribute to the evolution of new functions. The role of selection in their early spread is more controversial. Unless duplications are favored for a direct benefit of increased expression, they are likely detrimental. In this article, we investigated the case of duplications favored because they combine already functionally divergent alleles. Their gene-dosage/fitness relations are poorly known because selection may operate on both overall expression and duplicates relative dosage. Using the well-documented case of Culex pipiens resistance to insecticides, we compared strains with various ace-1 allele combinations, including two duplicated alleles carrying both susceptible and resistant copies. The overall protein activity was nearly additive, but, surprisingly, fitness correlated better with the relative proportion of susceptible and resistant copies rather than any absolute measure of activity. Gene dosage is thus crucial, duplications stabilizing a "heterozygote" phenotype. It corroborates the view that these were favored because they fix a permanent heterosis, thereby solving the irreducible trade-off between resistance and synaptic transmission. Moreover, we showed that the contrasted successes of the two duplicated alleles in natural populations depend on genetic changes unrelated to ace-1, confirming the probable implication of recessive sublethal mutations linked to structural rearrangements in some duplications. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.

  6. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies.

    PubMed

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D

    2016-09-02

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Genome Duplication and Gene Loss Affect the Evolution of Heat Shock Transcription Factor Genes in Legumes

    PubMed Central

    Jin, Jing; Jin, Xiaolei; Jiang, Haiyang; Yan, Hanwei; Cheng, Beijiu

    2014-01-01

    Whole-genome duplication events (polyploidy events) and gene loss events have played important roles in the evolution of legumes. Here we show that the vast majority of Hsf gene duplications resulted from whole genome duplication events rather than tandem duplication, and significant differences in gene retention exist between species. By searching for intraspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found that genome duplications accounted for 42 of 46 Hsf-containing segments in Glycine max, while paired segments were rarely identified in Lotus japonicas, Medicago truncatula and Cajanus cajan. However, by comparing interspecies microsynteny, we determined that the great majority of Hsf-containing segments in Lotus japonicas, Medicago truncatula and Cajanus cajan show extensive conservation with the duplicated regions of Glycine max. These segments formed 17 groups of orthologous segments. These results suggest that these regions shared ancient genome duplication with Hsf genes in Glycine max, but more than half of the copies of these genes were lost. On the other hand, the Glycine max Hsf gene family retained approximately 75% and 84% of duplicated genes produced from the ancient genome duplication and recent Glycine-specific genome duplication, respectively. Continuous purifying selection has played a key role in the maintenance of Hsf genes in Glycine max. Expression analysis of the Hsf genes in Lotus japonicus revealed their putative involvement in multiple tissue-/developmental stages and responses to various abiotic stimuli. This study traces the evolution of Hsf genes in legume species and demonstrates that the rates of gene gain and loss are far from equilibrium in different species. PMID:25047803

  8. Duplication and diversification of the LEAFY HULL STERILE1 and Oryza sativa MADS5 SEPALLATA lineages in graminoid Poales

    PubMed Central

    2012-01-01

    Background Gene duplication and the subsequent divergence in function of the resulting paralogs via subfunctionalization and/or neofunctionalization is hypothesized to have played a major role in the evolution of plant form. The LEAFY HULL STERILE1 (LHS1) SEPALLATA (SEP) genes have been linked with the origin and diversification of the grass spikelet, but it is uncertain 1) when the duplication event that produced the LHS1 clade and its paralogous lineage Oryza sativa MADS5 (OSM5) occurred, and 2) how changes in gene structure and/or expression might have contributed to subfunctionalization and/or neofunctionalization in the two lineages. Methods Phylogenetic relationships among 84 SEP genes were estimated using Bayesian methods. RNA expression patterns were inferred using in situ hybridization. The patterns of protein sequence and RNA expression evolution were reconstructed using maximum parsimony (MP) and maximum likelihood (ML) methods, respectively. Results Phylogenetic analyses mapped the LHS1/OSM5 duplication event to the base of the grass family. MP character reconstructions estimated a change from cytosine to thymine in the first codon position of the first amino acid after the Zea mays MADS3 (ZMM3) domain converted a glutamine to a stop codon in the OSM5 ancestor following the LHS1/OSM5 duplication event. RNA expression analyses of OSM5 co-orthologs in Avena sativa, Chasmanthium latifolium, Hordeum vulgare, Pennisetum glaucum, and Sorghum bicolor followed by ML reconstructions of these data and previously published analyses estimated a complex pattern of gain and loss of LHS1 and OSM5 expression in different floral organs and different flowers within the spikelet or inflorescence. Conclusions Previous authors have reported that rice OSM5 and LHS1 proteins have different interaction partners indicating that the truncation of OSM5 following the LHS1/OSM5 duplication event has resulted in both partitioned and potentially novel gene functions. The complex pattern of OSM5 and LHS1 expression evolution is not consistent with a simple subfunctionalization model following the gene duplication event, but there is evidence of recent partitioning of OSM5 and LHS1 expression within different floral organs of A. sativa, C. latifolium, P. glaucum and S. bicolor, and between the upper and lower florets of the two-flowered maize spikelet. PMID:22340849

  9. Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis.

    PubMed

    Spangler, Jacob B; Feltus, Frank Alex

    2013-01-01

    Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.

  10. Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis

    PubMed Central

    Spangler, Jacob B.; Feltus, Frank Alex

    2013-01-01

    Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression. PMID:23675377

  11. Reconstructing the Evolutionary History of Paralogous APETALA1/FRUITFULL-Like Genes in Grasses (Poaceae)

    PubMed Central

    Preston, Jill C.; Kellogg, Elizabeth A.

    2006-01-01

    Gene duplication is an important mechanism for the generation of evolutionary novelty. Paralogous genes that are not silenced may evolve new functions (neofunctionalization) that will alter the developmental outcome of preexisting genetic pathways, partition ancestral functions (subfunctionalization) into divergent developmental modules, or function redundantly. Functional divergence can occur by changes in the spatio-temporal patterns of gene expression and/or by changes in the activities of their protein products. We reconstructed the evolutionary history of two paralogous monocot MADS-box transcription factors, FUL1 and FUL2, and determined the evolution of sequence and gene expression in grass AP1/FUL-like genes. Monocot AP1/FUL-like genes duplicated at the base of Poaceae and codon substitutions occurred under relaxed selection mostly along the branch leading to FUL2. Following the duplication, FUL1 was apparently lost from early diverging taxa, a pattern consistent with major changes in grass floral morphology. Overlapping gene expression patterns in leaves and spikelets indicate that FUL1 and FUL2 probably share some redundant functions, but that FUL2 may have become temporally restricted under partial subfunctionalization to particular stages of floret development. These data have allowed us to reconstruct the history of AP1/FUL-like genes in Poaceae and to hypothesize a role for this gene duplication in the evolution of the grass spikelet. PMID:16816429

  12. Comprehensive analysis of TCP transcription factors and their expression during cotton (Gossypium arboreum) fiber early development

    PubMed Central

    Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C.; Zhang, Baohong

    2016-01-01

    TCP proteins are plant-specific transcription factors implicated to perform a variety of physiological functions during plant growth and development. In the current study, we performed for the first time the comprehensive analysis of TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributing across all of the thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development. PMID:26857372

  13. Comprehensive analysis of TCP transcription factors and their expression during cotton (Gossypium arboreum) fiber early development.

    PubMed

    Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C; Zhang, Baohong

    2016-02-09

    TCP proteins are plant-specific transcription factors implicated to perform a variety of physiological functions during plant growth and development. In the current study, we performed for the first time the comprehensive analysis of TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributing across all of the thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development.

  14. Characterization of various promoter regions of the human DNA helicase-encoding genes and identification of duplicated ets (GGAA) motifs as an essential transcription regulatory element.

    PubMed

    Uchiumi, Fumiaki; Watanabe, Takeshi; Tanuma, Sei-ichi

    2010-05-15

    DNA helicases are important in the regulation of DNA transaction and thereby various cellular functions. In this study, we developed a cost-effective multiple DNA transfection assay with DEAE-dextran reagent and analyzed the promoter activities of the human DNA helicases. The 5'-flanking regions of the human DNA helicase-encoding genes were isolated and subcloned into luciferase (Luc) expression plasmids. They were coated onto 96-well plate and used for co-transfection with a renilla-Luc expression vector into various cells, and dual-Luc assays were performed. The profiles of promoter activities were dependent on cell lines used. Among these human DNA helicase genes, XPB, RecQL5, and RTEL promoters were activated during TPA-induced HL-60 cell differentiation. Interestingly, duplicated ets (GGAA) elements are commonly located around the transcription start sites of these genes. The duplicated GGAA motifs are also found in the promoters of DNA replication/repair synthesis factor genes including PARG, ATR, TERC, and Rb1. Mutation analyses suggested that the duplicated GGAA-motifs are necessary for the basal promoter activity in various cells and some of them positively respond to TPA in HL-60 cells. TPA-induced response of 44-bp in the RTEL promoter was attenuated by co-transfection of the PU.1 expression vector. These findings suggest that the duplicated ets motifs regulate DNA-repair associated gene expressions during macrophage-like differentiation of HL-60 cells. Copyright 2010 Elsevier Inc. All rights reserved.

  15. Early stages of functional diversification in the Rab GTPase gene family revealed by genomic and localization studies in Paramecium species

    PubMed Central

    Bright, Lydia J.; Gout, Jean-Francois; Lynch, Michael

    2017-01-01

    New gene functions arise within existing gene families as a result of gene duplication and subsequent diversification. To gain insight into the steps that led to the functional diversification of paralogues, we tracked duplicate retention patterns, expression-level divergence, and subcellular markers of functional diversification in the Rab GTPase gene family in three Paramecium aurelia species. After whole-genome duplication, Rab GTPase duplicates are more highly retained than other genes in the genome but appear to be diverging more rapidly in expression levels, consistent with early steps in functional diversification. However, by localizing specific Rab proteins in Paramecium cells, we found that paralogues from the two most recent whole-genome duplications had virtually identical localization patterns, and that less closely related paralogues showed evidence of both conservation and diversification. The functionally conserved paralogues appear to target to compartments associated with both endocytic and phagocytic recycling functions, confirming evolutionary and functional links between the two pathways in a divergent eukaryotic lineage. Because the functionally diversifying paralogues are still closely related to and derived from a clade of functionally conserved Rab11 genes, we were able to pinpoint three specific amino acid residues that may be driving the change in the localization and thus the function in these proteins. PMID:28251922

  16. Evolution and Expression of Tissue Globins in Ray-Finned Fishes.

    PubMed

    Gallagher, Michael D; Macqueen, Daniel J

    2017-01-01

    The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great-unrecognized diversity in the repertoire and expression of nonblood globins that has arisen during ray-finned fish evolution.

  17. Developmental expression of high molecular weight tropomyosin isoforms in Mesocestoides corti.

    PubMed

    Koziol, Uriel; Costábile, Alicia; Domínguez, María Fernanda; Iriarte, Andrés; Alvite, Gabriela; Kun, Alejandra; Castillo, Estela

    2011-02-01

    Tropomyosins are a family of actin-binding proteins with diverse roles in actin filament function. One of the best characterized roles is the regulation of muscle contraction. Tropomyosin isoforms can be generated from different genes, and from alternative promoters and alternative splicing from the same gene. In this work, we have isolated sequences for tropomyosin isoforms from the cestode Mesocestoides corti, and searched for tropomyosin genes and isoforms in other flatworms. Two genes are conserved in the cestodes M. corti and Echinococcus multilocularis, and in the trematode Schistosoma mansoni. Both genes have the same structure, and each gene gives rise to at least two different isoforms, a high molecular weight (HMW) and a low molecular weight (LMW) one. Because most exons are duplicated and spliced in a mutually exclusive fashion, isoforms from one gene only share one exon and are highly divergent. The gene duplication preceded the divergence of neodermatans and the planarian Schmidtea mediterranea. Further duplications occurred in Schmidtea, coupled to the selective loss of duplicated exons, resulting in genes that only code for HMW or LMW isoforms. A polyclonal antibody raised against a HMW tropomyosin from Echinococcus granulosus was demonstrated to specifically recognize HMW tropomyosin isoforms of M. corti, and used to study their expression during segmentation. HMW tropomyosins are expressed in muscle layers, with very low or absent levels in other tissues. No expression of HMW tropomyosins is present in early or late genital primordia, and expression only begins once muscle fibers develop in the genital ducts. Therefore, HMW tropomyosins are markers for the development of muscles during the final differentiation of genital primordia. Copyright © 2010 Elsevier B.V. All rights reserved.

  18. Restriction and Recruitment—Gene Duplication and the Origin and Evolution of Snake Venom Toxins

    PubMed Central

    Hargreaves, Adam D.; Swain, Martin T.; Hegarty, Matthew J.; Logan, Darren W.; Mulley, John F.

    2014-01-01

    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel combinations of transcription factor binding sites in upstream regulatory regions. Therefore, although this hypothesis concerning the evolution of snake venom is very unlikely and should be regarded with caution, it is nonetheless often assumed to be established fact, hindering research into the true origins of snake venom toxins. To critically evaluate this hypothesis, we have generated transcriptomic data for body tissues and salivary and venom glands from five species of venomous and nonvenomous reptiles. Our comparative transcriptomic analysis of these data reveals that snake venom does not evolve through the hypothesized process of duplication and recruitment of genes encoding body proteins. Indeed, our results show that many proposed venom toxins are in fact expressed in a wide variety of body tissues, including the salivary gland of nonvenomous reptiles and that these genes have therefore been restricted to the venom gland following duplication, not recruited. Thus, snake venom evolves through the duplication and subfunctionalization of genes encoding existing salivary proteins. These results highlight the danger of the elegant and intuitive “just-so story” in evolutionary biology. PMID:25079342

  19. The low-recombining pericentromeric region of barley restricts gene diversity and evolution but not gene expression

    PubMed Central

    Baker, Katie; Bayer, Micha; Cook, Nicola; Dreißig, Steven; Dhillon, Taniya; Russell, Joanne; Hedley, Pete E; Morris, Jenny; Ramsay, Luke; Colas, Isabelle; Waugh, Robbie; Steffenson, Brian; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J

    2014-01-01

    The low-recombining pericentromeric region of the barley genome contains roughly a quarter of the genes of the species, embedded in low-recombining DNA that is rich in repeats and repressive chromatin signatures. We have investigated the effects of pericentromeric region residency upon the expression, diversity and evolution of these genes. We observe no significant difference in average transcript level or developmental RNA specificity between the barley pericentromeric region and the rest of the genome. In contrast, all of the evolutionary parameters studied here show evidence of compromised gene evolution in this region. First, genes within the pericentromeric region of wild barley show reduced diversity and significantly weakened purifying selection compared with the rest of the genome. Second, gene duplicates (ohnolog pairs) derived from the cereal whole-genome duplication event ca. 60MYa have been completely eliminated from the barley pericentromeric region. Third, local gene duplication in the pericentromeric region is reduced by 29% relative to the rest of the genome. Thus, the pericentromeric region of barley is a permissive environment for gene expression but has restricted gene evolution in a sizeable fraction of barley's genes. PMID:24947331

  20. Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution.

    PubMed

    Acharya, Debarun; Ghosh, Tapash C

    2016-01-22

    Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution.

  1. Revisiting the phosphatidylethanolamine-binding protein (PEBP) gene family reveals cryptic FLOWERING LOCUS T gene homologs in gymnosperms and sheds new light on functional evolution.

    PubMed

    Liu, Yan-Yan; Yang, Ke-Zhen; Wei, Xiao-Xin; Wang, Xiao-Quan

    2016-11-01

    Angiosperms and gymnosperms are two major groups of extant seed plants. It has been suggested that gymnosperms lack FLOWERING LOCUS T (FT), a key integrator at the core of flowering pathways in angiosperms. Taking advantage of newly released gymnosperm genomes, we revisited the evolutionary history of the plant phosphatidylethanolamine-binding protein (PEBP) gene family through phylogenetic reconstruction. Expression patterns in three gymnosperm taxa and heterologous expression in Arabidopsis were studied to investigate the functions of gymnosperm FT-like and TERMINAL FLOWER 1 (TFL1)-like genes. Phylogenetic reconstruction suggests that an ancient gene duplication predating the divergence of seed plants gave rise to the FT and TFL1 genes. Expression patterns indicate that gymnosperm TFL1-like genes play a role in the reproductive development process, while GymFT1 and GymFT2, the FT-like genes resulting from a duplication event in the common ancestor of gymnosperms, function in both growth rhythm and sexual development pathways. When expressed in Arabidopsis, both spruce FT-like and TFL1-like genes repressed flowering. Our study demonstrates that gymnosperms do have FT-like and TFL1-like genes. Frequent gene and genome duplications contributed significantly to the expansion of the plant PEBP gene family. The expression patterns of gymnosperm PEBP genes provide novel insight into the functional evolution of this gene family. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  2. Functional characterization of duplicated Suppressor of Overexpression of Constans 1-like genes in petunia.

    PubMed

    Preston, Jill C; Jorgensen, Stacy A; Jha, Suryatapa G

    2014-01-01

    Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae), many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene Suppressor Of Overexpression of Constans 1 (SOC1) in the short-lived perennial Petunia hybrida (petunia, Solanaceae). Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes Unshaven (UNS) and Floral Binding Protein 21 (FBP21), but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods.

  3. Functional Characterization of Duplicated SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1-Like Genes in Petunia

    PubMed Central

    Preston, Jill C.; Jorgensen, Stacy A.; Jha, Suryatapa G.

    2014-01-01

    Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae), many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) in the short-lived perennial Petunia hybrida (petunia, Solanaceae). Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes UNSHAVEN (UNS) and FLORAL BINDING PROTEIN 21 (FBP21), but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods. PMID:24787903

  4. Evolution of homeobox genes.

    PubMed

    Holland, Peter W H

    2013-01-01

    Many homeobox genes encode transcription factors with regulatory roles in animal and plant development. Homeobox genes are found in almost all eukaryotes, and have diversified into 11 gene classes and over 100 gene families in animal evolution, and 10 to 14 gene classes in plants. The largest group in animals is the ANTP class which includes the well-known Hox genes, plus other genes implicated in development including ParaHox (Cdx, Xlox, Gsx), Evx, Dlx, En, NK4, NK3, Msx, and Nanog. Genomic data suggest that the ANTP class diversified by extensive tandem duplication to generate a large array of genes, including an NK gene cluster and a hypothetical ProtoHox gene cluster that duplicated to generate Hox and ParaHox genes. Expression and functional data suggest that NK, Hox, and ParaHox gene clusters acquired distinct roles in patterning the mesoderm, nervous system, and gut. The PRD class is also diverse and includes Pax2/5/8, Pax3/7, Pax4/6, Gsc, Hesx, Otx, Otp, and Pitx genes. PRD genes are not generally arranged in ancient genomic clusters, although the Dux, Obox, and Rhox gene clusters arose in mammalian evolution as did several non-clustered PRD genes. Tandem duplication and genome duplication expanded the number of homeobox genes, possibly contributing to the evolution of developmental complexity, but homeobox gene loss must not be ignored. Evolutionary changes to homeobox gene expression have also been documented, including Hox gene expression patterns shifting in concert with segmental diversification in vertebrates and crustaceans, and deletion of a Pitx1 gene enhancer in pelvic-reduced sticklebacks. WIREs Dev Biol 2013, 2:31-45. doi: 10.1002/wdev.78 For further resources related to this article, please visit the WIREs website. The author declares that he has no conflicts of interest. Copyright © 2012 Wiley Periodicals, Inc.

  5. Gene and domain duplication in the chordate Otx gene family: insights from amphioxus Otx.

    PubMed

    Williams, N A; Holland, P W

    1998-05-01

    We report the genomic organization and deduced protein sequence of a cephalochordate member of the Otx homeobox gene family (AmphiOtx) and show its probable single-copy state in the genome. We also present molecular phylogenetic analysis indicating that there was single ancestral Otx gene in the first chordates which was duplicated in the vertebrate lineage after it had split from the lineage leading to the cephalochordates. Duplication of a C-terminal protein domain has occurred specifically in the vertebrate lineage, strengthening the case for a single Otx gene in an ancestral chordate whose gene structure has been retained in an extant cephalochordate. Comparative analysis of protein sequences and published gene expression patterns suggest that the ancestral chordate Otx gene had roles in patterning the anterior mesendoderm and central nervous system. These roles were elaborated following Otx gene duplication in vertebrates, accompanied by regulatory and structural divergence, particularly of Otx1 descendant genes.

  6. Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates.

    PubMed

    Bao, Yongbo; Xu, Fei; Shimeld, Sebastian M

    2017-04-01

    The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix-loop-helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56-88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Hereditary mixed polyposis syndrome is caused by a 40-kb upstream duplication that leads to increased and ectopic expression of the BMP antagonist GREM1.

    PubMed

    Jaeger, Emma; Leedham, Simon; Lewis, Annabelle; Segditsas, Stefania; Becker, Martin; Cuadrado, Pedro Rodenas; Davis, Hayley; Kaur, Kulvinder; Heinimann, Karl; Howarth, Kimberley; East, James; Taylor, Jenny; Thomas, Huw; Tomlinson, Ian

    2012-05-06

    Hereditary mixed polyposis syndrome (HMPS) is characterized by apparent autosomal dominant inheritance of multiple types of colorectal polyp, with colorectal carcinoma occurring in a high proportion of affected individuals. Here, we use genetic mapping, copy-number analysis, exclusion of mutations by high-throughput sequencing, gene expression analysis and functional assays to show that HMPS is caused by a duplication spanning the 3' end of the SCG5 gene and a region upstream of the GREM1 locus. This unusual mutation is associated with increased allele-specific GREM1 expression. Whereas GREM1 is expressed in intestinal subepithelial myofibroblasts in controls, GREM1 is predominantly expressed in the epithelium of the large bowel in individuals with HMPS. The HMPS duplication contains predicted enhancer elements; some of these interact with the GREM1 promoter and can drive gene expression in vitro. Increased GREM1 expression is predicted to cause reduced bone morphogenetic protein (BMP) pathway activity, a mechanism that also underlies tumorigenesis in juvenile polyposis of the large bowel.

  8. Age distribution patterns of human gene families: divergent for Gene Ontology categories and concordant between different subcellular localizations.

    PubMed

    Liu, Gangbiao; Zou, Yangyun; Cheng, Qiqun; Zeng, Yanwu; Gu, Xun; Su, Zhixi

    2014-04-01

    The age distribution of gene duplication events within the human genome exhibits two waves of duplications along with an ancient component. However, because of functional constraint differences, genes in different functional categories might show dissimilar retention patterns after duplication. It is known that genes in some functional categories are highly duplicated in the early stage of vertebrate evolution. However, the correlations of the age distribution pattern of gene duplication between the different functional categories are still unknown. To investigate this issue, we developed a robust pipeline to date the gene duplication events in the human genome. We successfully estimated about three-quarters of the duplication events within the human genome, along with the age distribution pattern in each Gene Ontology (GO) slim category. We found that some GO slim categories show different distribution patterns when compared to the whole genome. Further hierarchical clustering of the GO slim functional categories enabled grouping into two main clusters. We found that human genes located in the duplicated copy number variant regions, whose duplicate genes have not been fixed in the human population, were mainly enriched in the groups with a high proportion of recently duplicated genes. Moreover, we used a phylogenetic tree-based method to date the age of duplications in three signaling-related gene superfamilies: transcription factors, protein kinases and G-protein coupled receptors. These superfamilies were expressed in different subcellular localizations. They showed a similar age distribution as the signaling-related GO slim categories. We also compared the differences between the age distributions of gene duplications in multiple subcellular localizations. We found that the distribution patterns of the major subcellular localizations were similar to that of the whole genome. This study revealed the whole picture of the evolution patterns of gene functional categories in the human genome.

  9. Female Behaviour Drives Expression and Evolution of Gustatory Receptors in Butterflies

    PubMed Central

    Briscoe, Adriana D.; Macias-Muñoz, Aide; Kozak, Krzysztof M.; Walters, James R.; Yuan, Furong; Jamie, Gabriel A.; Martin, Simon H.; Dasmahapatra, Kanchon K.; Ferguson, Laura C.; Mallet, James; Jacquin-Joly, Emmanuelle; Jiggins, Chris D.

    2013-01-01

    Secondary plant compounds are strong deterrents of insect oviposition and feeding, but may also be attractants for specialist herbivores. These insect-plant interactions are mediated by insect gustatory receptors (Grs) and olfactory receptors (Ors). An analysis of the reference genome of the butterfly Heliconius melpomene, which feeds on passion-flower vines (Passiflora spp.), together with whole-genome sequencing within the species and across the Heliconius phylogeny has permitted an unprecedented opportunity to study the patterns of gene duplication and copy-number variation (CNV) among these key sensory genes. We report in silico gene predictions of 73 Gr genes in the H. melpomene reference genome, including putative CO2, sugar, sugar alcohol, fructose, and bitter receptors. The majority of these Grs are the result of gene duplications since Heliconius shared a common ancestor with the monarch butterfly or the silkmoth. Among Grs but not Ors, CNVs are more common within species in those gene lineages that have also duplicated over this evolutionary time-scale, suggesting ongoing rapid gene family evolution. Deep sequencing (∼1 billion reads) of transcriptomes from proboscis and labial palps, antennae, and legs of adult H. melpomene males and females indicates that 67 of the predicted 73 Gr genes and 67 of the 70 predicted Or genes are expressed in these three tissues. Intriguingly, we find that one-third of all Grs show female-biased gene expression (n = 26) and nearly all of these (n = 21) are Heliconius-specific Grs. In fact, a significant excess of Grs that are expressed in female legs but not male legs are the result of recent gene duplication. This difference in Gr gene expression diversity between the sexes is accompanied by a striking sexual dimorphism in the abundance of gustatory sensilla on the forelegs of H. melpomene, suggesting that female oviposition behaviour drives the evolution of new gustatory receptors in butterfly genomes. PMID:23950722

  10. Evolutionary origins of a novel host plant detoxification gene in butterflies.

    PubMed

    Fischer, Hanna M; Wheat, Christopher W; Heckel, David G; Vogel, Heiko

    2008-05-01

    Chemical interactions between plants and their insect herbivores provide an excellent opportunity to study the evolution of species interactions on a molecular level. Here, we investigate the molecular evolutionary events that gave rise to a novel detoxifying enzyme (nitrile-specifier protein [NSP]) in the butterfly family Pieridae, previously identified as a coevolutionary key innovation. By generating and sequencing expressed sequence tags, genomic libraries, and screening databases we found NSP to be a member of an insect-specific gene family, which we characterized and named the NSP-like gene family. Members consist of variable tandem repeats, are gut expressed, and are found across Insecta evolving in a dynamic, ongoing birth-death process. In the Lepidoptera, multiple copies of single-domain major allergen genes are present and originate via tandem duplications. Multiple domain genes are found solely within the brassicaceous-feeding Pieridae butterflies, one of them being NSP and another called major allergen (MA). Analyses suggest that NSP and its paralog MA have a unique single-domain evolutionary origin, being formed by intragenic domain duplication followed by tandem whole-gene duplication. Duplicates subsequently experienced a period of relaxed constraint followed by an increase in constraint, perhaps after neofunctionalization. NSP and its ortholog MA are still experiencing high rates of change, reflecting a dynamic evolution consistent with the known role of NSP in plant-insect interactions. Our results provide direct evidence to the hypothesis that gene duplication is one of the driving forces for speciation and adaptation, showing that both within- and whole-gene tandem duplications are a powerful force underlying evolutionary adaptation.

  11. Genomic Imprinting Was Evolutionarily Conserved during Wheat Polyploidization[OPEN

    PubMed Central

    Yang, Guanghui; Liu, Zhenshan; Gao, Lulu; Yu, Kuohai; Feng, Man; Peng, Huiru; Sun, Qixin; Ni, Zhongfu

    2018-01-01

    Genomic imprinting is an epigenetic phenomenon that causes genes to be differentially expressed depending on their parent of origin. To evaluate the evolutionary conservation of genomic imprinting and the effects of ploidy on this process, we investigated parent-of-origin-specific gene expression patterns in the endosperm of diploid (Aegilops spp), tetraploid, and hexaploid wheat (Triticum spp) at various stages of development via high-throughput transcriptome sequencing. We identified 91, 135, and 146 maternally or paternally expressed genes (MEGs or PEGs, respectively) in diploid, tetraploid, and hexaploid wheat, respectively, 52.7% of which exhibited dynamic expression patterns at different developmental stages. Gene Ontology enrichment analysis suggested that MEGs and PEGs were involved in metabolic processes and DNA-dependent transcription, respectively. Nearly half of the imprinted genes exhibited conserved expression patterns during wheat hexaploidization. In addition, 40% of the homoeolog pairs originating from whole-genome duplication were consistently maternally or paternally biased in the different subgenomes of hexaploid wheat. Furthermore, imprinted expression was found for 41.2% and 50.0% of homolog pairs that evolved by tandem duplication after genome duplication in tetraploid and hexaploid wheat, respectively. These results suggest that genomic imprinting was evolutionarily conserved between closely related Triticum and Aegilops species and in the face of polyploid hybridization between species in these genera. PMID:29298834

  12. Neofunctionalization of Duplicated P450 Genes Drives the Evolution of Insecticide Resistance in the Brown Planthopper.

    PubMed

    Zimmer, Christoph T; Garrood, William T; Singh, Kumar Saurabh; Randall, Emma; Lueke, Bettina; Gutbrod, Oliver; Matthiesen, Svend; Kohler, Maxie; Nauen, Ralf; Davies, T G Emyr; Bass, Chris

    2018-01-22

    Gene duplication is a major source of genetic variation that has been shown to underpin the evolution of a wide range of adaptive traits [1, 2]. For example, duplication or amplification of genes encoding detoxification enzymes has been shown to play an important role in the evolution of insecticide resistance [3-5]. In this context, gene duplication performs an adaptive function as a result of its effects on gene dosage and not as a source of functional novelty [3, 6-8]. Here, we show that duplication and neofunctionalization of a cytochrome P450, CYP6ER1, led to the evolution of insecticide resistance in the brown planthopper. Considerable genetic variation was observed in the coding sequence of CYP6ER1 in populations of brown planthopper collected from across Asia, but just two sequence variants are highly overexpressed in resistant strains and metabolize imidacloprid. Both variants are characterized by profound amino-acid alterations in substrate recognition sites, and the introduction of these mutations into a susceptible P450 sequence is sufficient to confer resistance. CYP6ER1 is duplicated in resistant strains with individuals carrying paralogs with and without the gain-of-function mutations. Despite numerical parity in the genome, the susceptible and mutant copies exhibit marked asymmetry in their expression with the resistant paralogs overexpressed. In the primary resistance-conferring CYP6ER1 variant, this results from an extended region of novel sequence upstream of the gene that provides enhanced expression. Our findings illustrate the versatility of gene duplication in providing opportunities for functional and regulatory innovation during the evolution of an adaptive trait. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.

  13. Evolution and Expression of Tissue Globins in Ray-Finned Fishes

    PubMed Central

    Gallagher, Michael D.

    2017-01-01

    The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great-unrecognized diversity in the repertoire and expression of nonblood globins that has arisen during ray-finned fish evolution. PMID:28173090

  14. Comparative and Evolutionary Analysis of the HES/HEY Gene Family Reveal Exon/Intron Loss and Teleost Specific Duplication Events

    PubMed Central

    Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan

    2012-01-01

    Background HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. Methods and Findings In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Conclusions Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication. PMID:22808219

  15. Comparative and evolutionary analysis of the HES/HEY gene family reveal exon/intron loss and teleost specific duplication events.

    PubMed

    Zhou, Mi; Yan, Jun; Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan

    2012-01-01

    HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication.

  16. Evolution of the APETALA2 Gene Lineage in Seed Plants.

    PubMed

    Zumajo-Cardona, Cecilia; Pabón-Mora, Natalia

    2016-07-01

    Gene duplication is a fundamental source of functional evolutionary change and has been associated with organismal diversification and the acquisition of novel features. The APETALA2/ETHYLENE RESPONSIVE ELEMENT-BINDING FACTOR (AP2/ERF) genes are exclusive to vascular plants and have been classified into the AP2-like and ERF-like clades. The AP2-like clade includes the AINTEGUMENTA (ANT) and the euAPETALA2 (euAP2) genes, both regulated by miR172 Arabidopsis has two paralogs in the euAP2 clade, namely APETALA2 (AP2) and TARGET OF EAT3 (TOE3) that control flowering time, meristem determinacy, sepal and petal identity and fruit development. euAP2 genes are likely functionally divergent outside Brassicaceae, as they control fruit development in tomato, and regulate inflorescence meristematic activity in maize. We studied the evolution and expression patterns of euAP2/TOE3 genes to assess large scale and local duplications and evaluate protein motifs likely related with functional changes across seed plants. We sampled euAP2/TOE3 genes from vascular plants and have found three major duplications and a few taxon-specific duplications. Here, we report conserved and new motifs across euAP2/TOE3 proteins and conclude that proteins predating the Brassicaceae duplication are more similar to AP2 than TOE3. Expression data show a shift from restricted expression in leaves, carpels, and fruits in non-core eudicots and asterids to a broader expression of euAP2 genes in leaves, all floral organs and fruits in rosids. Altogether, our data show a functional trend where the canonical A-function (sepal and petal identity) is exclusive to Brassicaceae and it is likely not maintained outside of rosids. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  17. Hox gene duplications correlate with posterior heteronomy in scorpions

    PubMed Central

    Sharma, Prashant P.; Schwager, Evelyn E.; Extavour, Cassandra G.; Wheeler, Ward C.

    2014-01-01

    The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions. PMID:25122224

  18. Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species.

    PubMed

    Emms, David M; Covshoff, Sarah; Hibberd, Julian M; Kelly, Steven

    2016-07-01

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. C4 photosynthesis, gene duplication, gene families, parallel evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species

    PubMed Central

    Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; Kelly, Steven

    2016-01-01

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. Key words: C4 photosynthesis, gene duplication, gene families, parallel evolution. PMID:27016024

  20. Evolution of developmental regulation in the vertebrate FgfD subfamily.

    PubMed

    Jovelin, Richard; Yan, Yi-Lin; He, Xinjun; Catchen, Julian; Amores, Angel; Canestro, Cristian; Yokoi, Hayato; Postlethwait, John H

    2010-01-15

    Fibroblast growth factors (Fgfs) encode small signaling proteins that help regulate embryo patterning. Fgfs fall into seven families, including FgfD. Nonvertebrate chordates have a single FgfD gene; mammals have three (Fgf8, Fgf17, and Fgf18); and teleosts have six (fgf8a, fgf8b, fgf17, fgf18a, fgf18b, and fgf24). What are the evolutionary processes that led to the structural duplication and functional diversification of FgfD genes during vertebrate phylogeny? To study this question, we investigated conserved syntenies, patterns of gene expression, and the distribution of conserved noncoding elements (CNEs) in FgfD genes of stickleback and zebrafish, and compared them with data from cephalochordates, urochordates, and mammals. Genomic analysis suggests that Fgf8, Fgf17, Fgf18, and Fgf24 arose in two rounds of whole genome duplication at the base of the vertebrate radiation; that fgf8 and fgf18 duplications occurred at the base of the teleost radiation; and that Fgf24 is an ohnolog that was lost in the mammalian lineage. Expression analysis suggests that ancestral subfunctions partitioned between gene duplicates and points to the evolution of novel expression domains. Analysis of CNEs, at least some of which are candidate regulatory elements, suggests that ancestral CNEs partitioned between gene duplicates. These results help explain the evolutionary pathways by which the developmentally important family of FgfD molecules arose and the deduced principles that guided FgfD evolution are likely applicable to the evolution of developmental regulation in many vertebrate multigene families. (c) 2009 Wiley-Liss, Inc.

  1. Evidence for the involvement of Globosa-like gene duplications and expression divergence in the evolution of floral morphology in the Zingiberales.

    PubMed

    Bartlett, Madelaine E; Specht, Chelsea D

    2010-07-01

    *The MADS box transcription factor family has long been identified as an important contributor to the control of floral development. It is often hypothesized that the evolution of floral development across angiosperms and within specific lineages may occur as a result of duplication, functional diversification, and changes in regulation of MADS box genes. Here we examine the role of Globosa (GLO)-like genes, members of the B-class MADS box gene lineage, in the evolution of floral development within the monocot order Zingiberales. *We assessed changes in perianth and stamen whorl morphology in a phylogenetic framework. We identified GLO homologs (ZinGLO1-4) from 50 Zingiberales species and investigated the evolution of this gene lineage. Expression of two GLO homologs was assessed in Costus spicatus and Musa basjoo. *Based on the phylogenetic data and expression results, we propose several family-specific losses and gains of GLO homologs that appear to be associated with key morphological changes. The GLO-like gene lineage has diversified concomitant with the evolution of the dimorphic perianth and the staminodial labellum. *Duplications and expression divergence within the GLO-like gene lineage may have played a role in floral diversification in the Zingiberales.

  2. Comparative genomics of duplicate γ-glutamyl transferase genes in teleosts: medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), green spotted pufferfish (Tetraodon nigroviridis), fugu (Takifugu rubripes), and zebrafish (Danio rerio).

    PubMed

    Law, Sheran Hiu Wan; Redelings, Benjamin David; Kullman, Seth William

    2012-01-15

    The availability of multiple teleost (bony fish) genomes is providing unprecedented opportunities to understand the diversity and function of gene duplication events using comparative genomics. Here we examine multiple paralogous genes of γ-glutamyl transferase (GGT) in several distantly related teleost species including medaka, stickleback, green spotted pufferfish, fugu, and zebrafish. Through mining genome databases, we have identified multiple GGT orthologs. Duplicate (paralogous) GGT sequences for GGT1 (GGT1 a and b), GGTL1 (GGTL1 a and b), and GGTL3 (GGTL3 a and b) were identified for each species. Phylogenetic analysis suggests that GGTs are ancient proteins conserved across most metazoan phyla and those paralogous GGTs in teleosts likely arose from the serial 3R genome duplication events. A third GGTL1 gene (GGTL1c) was found in green spotted pufferfish; however, this gene is not present in medaka, stickleback, or fugu. Similarly, one or both paralogs of GGTL3 appear to have been lost in green spotted pufferfish, fugu, and zebrafish. Syntenic relationships were highly maintained between duplicated teleost chromosomes, among teleosts and across ray-finned (Actinopterygii) and lobe-finned (Sarcopterygii) species. To assess subfunction partitioning, six medaka GGT genes were cloned and assessed for developmental and tissue-specific expression. On the basis of these data, we propose a modification of the "duplication-degeneration-complementation" model of subfunction partitioning where quantitative differences rather than absolute differences in gene expression are observed between gene paralogs. Our results demonstrate that multiple GGT genes have been retained within teleost genomes. Questions remain, however, regarding the functional roles of multiple GGTs in these species. Copyright © 2011 Wiley Periodicals, Inc., A Wiley Company.

  3. An epigenetic state associated with areas of gene duplication

    PubMed Central

    Gimelbrant, Alexander A.; Chess, Andrew

    2006-01-01

    Asynchronous DNA replication is an epigenetically determined feature found in all cases of monoallelic expression, including genomic imprinting, X-inactivation, and random monoallelic expression of autosomal genes such as immunoglobulins and olfactory receptor genes. Most genes of the latter class were identified in experiments focused on genes functioning in the chemosensory and immune systems. We performed an unbiased survey of asynchronous replication in the mouse genome, excluding known asynchronously replicated genes. Fully 10% (eight of 80) of the genes tested exhibited asynchronous replication. A common feature of the newly identified asynchronously replicated areas is their proximity to areas of tandem gene duplication. Testing of other clustered areas supported the idea that such regions are enriched with asynchronously replicated genes. PMID:16687731

  4. Positive selection on the K domain of the AGAMOUS protein in the Zingiberales suggests a mechanism for the evolution of androecial morphology.

    PubMed

    Almeida, Ana Maria R; Yockteng, Roxana; Otoni, Wagner C; Specht, Chelsea D

    2015-01-01

    The ABC model of flower development describes the molecular basis for specification of floral organ identity in model eudicots such as Arabidopsis and Antirrhinum. According to this model, expression of C-class genes is linked to stamen and gynoecium organ identity. The Zingiberales is an order of tropical monocots in which the evolution of floral morphology is characterized by a marked increase in petaloidy in the androecium. Petaloidy is a derived characteristic of the ginger families and seems to have arisen in the common ancestor of the ginger clade. We hypothesize that duplication of the C-class AGAMOUS (AG) gene followed by divergence of the duplicated AG copies during the diversification of the ginger clade lineages explains the evolution of petaloidy in the androecium. In order to address this hypothesis, we carried out phylogenetic analyses of the AG gene family across the Zingiberales and investigated patterns of gene expression within the androecium. Phylogenetic analysis supports a scenario in which Zingiberales-specific AG genes have undergone at least one round of duplication. Gene duplication was immediately followed by divergence of the retained copies. In particular, we detect positive selection in the third alpha-helix of the K domain of Zingiberales AGAMOUS copy 1 (ZinAG-1). A single fixed amino acid change is observed in ZinAG-1 within the ginger clade when compared to the banana grade. Expression analyses of AG and APETALA1/FRUITFULL (AP1/FUL) in Musa basjoo is similar to A- and C-class gene expressions in the Arabidopsis thaliana model, while Costus spicatus exhibits simultaneous expression of AG and AP1/FUL in most floral organs. We propose that this novel expression pattern could be correlated with the evolution of androecial petaloidy within the Zingiberales. Our results present an intricate story in which duplication of the AG lineage has lead to the retention of at least two diverged Zingiberales-specific copies, ZinAG-1 and Zingiberales AGAMOUS copy 2 (ZinAG-2). Positive selection on ZinAG-1 residues suggests a mechanism by which AG gene divergence may explain observed morphological changes in Zingiberales flowers. Expression data provides preliminary support for the proposed mechanism, although further studies are required to fully test this hypothesis.

  5. Unique Temporal Expression of Triplicated Long-Wavelength Opsins in Developing Butterfly Eyes

    PubMed Central

    Arikawa, Kentaro; Iwanaga, Tomoyuki; Wakakuwa, Motohiro; Kinoshita, Michiyo

    2017-01-01

    Following gene duplication events, the expression patterns of the resulting gene copies can often diverge both spatially and temporally. Here we report on gene duplicates that are expressed in distinct but overlapping patterns, and which exhibit temporally divergent expression. Butterflies have sophisticated color vision and spectrally complex eyes, typically with three types of heterogeneous ommatidia. The eyes of the butterfly Papilio xuthus express two green- and one red-absorbing visual pigment, which came about via gene duplication events, in addition to one ultraviolet (UV)- and one blue-absorbing visual pigment. We localized mRNAs encoding opsins of these visual pigments in developing eye disks throughout the pupal stage. The mRNAs of the UV and blue opsin are expressed early in pupal development (pd), specifying the type of the ommatidium in which they appear. Red sensitive photoreceptors first express a green opsin mRNA, which is replaced later by the red opsin mRNA. Broadband photoreceptors (that coexpress the green and red opsins) first express the green opsin mRNA, later change to red opsin mRNA and finally re-express the green opsin mRNA in addition to the red mRNA. Such a unique temporal and spatial expression pattern of opsin mRNAs may reflect the evolution of visual pigments and provide clues toward understanding how the spectrally complex eyes of butterflies evolved. PMID:29238294

  6. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae.

    PubMed

    Shen, Danyu; Liu, Tingli; Ye, Wenwu; Liu, Li; Liu, Peihan; Wu, Yuren; Wang, Yuanchao; Dou, Daolong

    2013-01-01

    Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN), or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG), and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD) and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.

  7. Retention of duplicated ITAM-containing transmembrane signaling subunits in the tetraploid amphibian species Xenopus laevis

    PubMed Central

    Guselnikov, S.V.; Grayfer, L.; De Jesús Andino, F.; Rogozin, I.B.; Robert, J.; Taranin, A.V.

    2015-01-01

    The ITAM-bearing transmembrane signaling subunits (TSS) are indispensable components of activating leukocyte receptor complexes. The TSS-encoding genes map to paralogous chromosomal regions, which are thought to arise from ancient genome tetraploidization(s). To assess a possible role of tetraploidization in the TSS evolution, we studied TSS and other functionally linked genes in the amphibian species Xenopus laevis whose genome was duplicated about 40 MYR ago. We found that X. laevis has retained a duplicated set of sixteen TSS genes, all except one being transcribed. Furthermore, duplicated TCRα loci and genes encoding TSS-coupling protein kinases have also been retained. No clear evidence for functional divergence of the TSS paralogs was obtained from gene expression and sequence analyses. We suggest that the main factor of maintenance of duplicated TSS genes in X. laevis was a protein dosage effect and that this effect might have facilitated the TSS set expansion in early vertebrates. PMID:26170006

  8. Saccharomyces cerevisiae Bat1 and Bat2 Aminotransferases Have Functionally Diverged from the Ancestral-Like Kluyveromyces lactis Orthologous Enzyme

    PubMed Central

    Colón, Maritrini; Hernández, Fabiola; López, Karla; Quezada, Héctor; González, James; López, Geovani; Aranda, Cristina; González, Alicia

    2011-01-01

    Background Gene duplication is a key evolutionary mechanism providing material for the generation of genes with new or modified functions. The fate of duplicated gene copies has been amply discussed and several models have been put forward to account for duplicate conservation. The specialization model considers that duplication of a bifunctional ancestral gene could result in the preservation of both copies through subfunctionalization, resulting in the distribution of the two ancestral functions between the gene duplicates. Here we investigate whether the presumed bifunctional character displayed by the single branched chain amino acid aminotransferase present in K. lactis has been distributed in the two paralogous genes present in S. cerevisiae, and whether this conservation has impacted S. cerevisiae metabolism. Principal Findings Our results show that the KlBat1 orthologous BCAT is a bifunctional enzyme, which participates in the biosynthesis and catabolism of branched chain aminoacids (BCAAs). This dual role has been distributed in S. cerevisiae Bat1 and Bat2 paralogous proteins, supporting the specialization model posed to explain the evolution of gene duplications. BAT1 is highly expressed under biosynthetic conditions, while BAT2 expression is highest under catabolic conditions. Bat1 and Bat2 differential relocalization has favored their physiological function, since biosynthetic precursors are generated in the mitochondria (Bat1), while catabolic substrates are accumulated in the cytosol (Bat2). Under respiratory conditions, in the presence of ammonium and BCAAs the bat1Δ bat2Δ double mutant shows impaired growth, indicating that Bat1 and Bat2 could play redundant roles. In K. lactis wild type growth is independent of BCAA degradation, since a Klbat1Δ mutant grows under this condition. Conclusions Our study shows that BAT1 and BAT2 differential expression and subcellular relocalization has resulted in the distribution of the biosynthetic and catabolic roles of the ancestral BCAT in two isozymes improving BCAAs metabolism and constituting an adaptation to facultative metabolism. PMID:21267457

  9. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.

  10. Genome-wide characterization of GRAS family genes in Medicago truncatula reveals their evolutionary dynamics and functional diversification

    PubMed Central

    Zhang, Hailing; Cao, Yingping; Shang, Chen; Li, Jikai; Wang, Jianli; Wu, Zhenying; Ma, Lichao; Qi, Tianxiong; Fu, Chunxiang; Hu, Baozhong

    2017-01-01

    The GRAS gene family is a large plant-specific family of transcription factors that are involved in diverse processes during plant development. Medicago truncatula is an ideal model plant for genetic research in legumes, and specifically for studying nodulation, which is crucial for nitrogen fixation. In this study, 59 MtGRAS genes were identified and classified into eight distinct subgroups based on phylogenetic relationships. Motifs located in the C-termini were conserved across the subgroups, while motifs in the N-termini were subfamily specific. Gene duplication was the main evolutionary force for MtGRAS expansion, especially proliferation of the LISCL subgroup. Seventeen duplicated genes showed strong effects of purifying selection and diverse expression patterns, highlighting their functional importance and diversification after duplication. Thirty MtGRAS genes, including NSP1 and NSP2, were preferentially expressed in nodules, indicating possible roles in the process of nodulation. A transcriptome study, combined with gene expression analysis under different stress conditions, suggested potential functions of MtGRAS genes in various biological pathways and stress responses. Taken together, these comprehensive analyses provide basic information for understanding the potential functions of GRAS genes, and will facilitate further discovery of MtGRAS gene functions. PMID:28945786

  11. Wild tobacco genomes reveal the evolution of nicotine biosynthesis.

    PubMed

    Xu, Shuqing; Brockmöller, Thomas; Navarro-Quezada, Aura; Kuhl, Heiner; Gase, Klaus; Ling, Zhihao; Zhou, Wenwu; Kreitzer, Christoph; Stanke, Mario; Tang, Haibao; Lyons, Eric; Pandey, Priyanka; Pandey, Shree P; Timmermann, Bernd; Gaquerel, Emmanuel; Baldwin, Ian T

    2017-06-06

    Nicotine, the signature alkaloid of Nicotiana species responsible for the addictive properties of human tobacco smoking, functions as a defensive neurotoxin against attacking herbivores. However, the evolution of the genetic features that contributed to the assembly of the nicotine biosynthetic pathway remains unknown. We sequenced and assembled genomes of two wild tobaccos, Nicotiana attenuata (2.5 Gb) and Nicotiana obtusifolia (1.5 Gb), two ecological models for investigating adaptive traits in nature. We show that after the Solanaceae whole-genome triplication event, a repertoire of rapidly expanding transposable elements (TEs) bloated these Nicotiana genomes, promoted expression divergences among duplicated genes, and contributed to the evolution of herbivory-induced signaling and defenses, including nicotine biosynthesis. The biosynthetic machinery that allows for nicotine synthesis in the roots evolved from the stepwise duplications of two ancient primary metabolic pathways: the polyamine and nicotinamide adenine dinucleotide (NAD) pathways. In contrast to the duplication of the polyamine pathway that is shared among several solanaceous genera producing polyamine-derived tropane alkaloids, we found that lineage-specific duplications within the NAD pathway and the evolution of root-specific expression of the duplicated Solanaceae-specific ethylene response factor that activates the expression of all nicotine biosynthetic genes resulted in the innovative and efficient production of nicotine in the genus Nicotiana Transcription factor binding motifs derived from TEs may have contributed to the coexpression of nicotine biosynthetic pathway genes and coordinated the metabolic flux. Together, these results provide evidence that TEs and gene duplications facilitated the emergence of a key metabolic innovation relevant to plant fitness.

  12. Global spread and genetic variants of the two CYP9M10 haplotype forms associated with insecticide resistance in Culex quinquefasciatus Say.

    PubMed

    Itokawa, K; Komagata, O; Kasai, S; Kawada, H; Mwatele, C; Dida, G O; Njenga, S M; Mwandawiro, C; Tomita, T

    2013-09-01

    Insecticide resistance develops as a genetic factor (allele) conferring lower susceptibility to insecticides proliferates within a target insect population under strong positive selection. Intriguingly, a resistance allele pre-existing in a population often bears a series of further adaptive allelic variants through new mutations. This phenomenon occasionally results in replacement of the predominating resistance allele by fitter new derivatives, and consequently, development of greater resistance at the population level. The overexpression of the cytochrome P450 gene CYP9M10 is associated with pyrethroid resistance in the southern house mosquito Culex quinquefasciatus. Previously, we have found two genealogically related overexpressing CYP9M10 haplotypes, which differ in gene copy number (duplicated and non-duplicated). The duplicated haplotype was derived from the non-duplicated overproducer probably recently. In the present study, we investigated allelic series of CYP9M10 involved in three C. quinquefasciatus laboratory colonies recently collected from three different localities. Duplicated and non-duplicated overproducing haplotypes coexisted in African and Asian colonies indicating a global distribution of both haplotype lineages. The duplicated haplotypes both in the Asian and African colonies were associated with higher expression levels and stronger resistance than non-duplicated overproducing haplotypes. There were slight variation in expression level among the non-duplicated overproducing haplotypes. The nucleotide sequences in coding and upstream regions among members of this group also showed a little diversity. Non-duplicated overproducing haplotypes with relatively higher expression were genealogically closer to the duplicated haplotypes than the other non-duplicated overproducing haplotypes, suggesting multiple cis-acting mutations before duplication.

  13. The Human CHRNA7 and CHRFAM7A Genes: A Review of the Genetics, Regulation, and Function

    PubMed Central

    Sinkus, Melissa L.; Graw, Sharon; Freedman, Robert; Ross, Randal G.; Lester, Henry A.; Leonard, Sherry

    2015-01-01

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is ubiquitously expressed in both the central nervous system and in the periphery. CHRNA7 is genetically linked to multiple disorders with cognitive deficits, including schizophrenia, bipolar disorder, ADHD, epilepsy, Alzheimer’s disease, and Rett syndrome. The regulation of CHRNA7 is complex; more than a dozen mechanisms are known, one of which is a partial duplication of the parent gene. Exons 5-10 of CHRNA7 on chromosome 15 were duplicated and inserted 1.6 Mb upstream of CHRNA7, interrupting an earlier partial duplication of two other genes. The chimeric CHRFAM7A gene product, dupα7, assembles with α7 subunits, resulting in a dominant negative regulation of function. The duplication is human specific, occurring neither in primates nor in rodents. The duplicated α7 sequence in exons 5-10 of CHRFAM7A is almost identical to CHRNA7, and thus is not completely queried in high throughput genetic studies (GWAS). Further, pre-clinical animal models of the α7nAChR utilized in drug development research do not have CHRFAM7A (dupα7) and cannot fully model human drug responses. The wide expression of CHRNA7, its multiple functions and modes of regulation present challenges for study of this gene in disease. PMID:25701707

  14. Analysis of LMNB1 Duplications in Autosomal Dominant Leukodystrophy Provides Insights into Duplication Mechanisms and Allele-Specific Expression

    PubMed Central

    Giorgio, Elisa; Rolyan, Harshvardhan; Kropp, Laura; Chakka, Anish Baswanth; Yatsenko, Svetlana; Gregorio, Eleonora Di; Lacerenza, Daniela; Vaula, Giovanna; Talarico, Flavia; Mandich, Paola; Toro, Camilo; Pierre, Eleonore Eymard; Labauge, Pierre; Capellari, Sabina; Cortelli, Pietro; Vairo, Filippo Pinto; Miguel, Diego; Stubbolo, Danielle; Marques, Lourenco Charles; Gahl, William; Boespflug-Tanguy, Odile; Melberg, Atle; Hassin-Baer, Sharon; Cohen, Oren S; Pjontek, Rastislav; Grau, Armin; Klopstock, Thomas; Fogel, Brent; Meijer, Inge; Rouleau, Guy; Bouchard, Jean-Pierre L; Ganapathiraju, Madhavi; Vanderver, Adeline; Dahl, Niklas; Hobson, Grace; Brusco, Alfredo; Brussino, Alessandro; Padiath, Quasar Saleem

    2013-01-01

    ABSTRACT Autosomal dominant leukodystrophy (ADLD) is an adult onset demyelinating disorder that is caused by duplications of the lamin B1 (LMNB1) gene. However, as only a few cases have been analyzed in detail, the mechanisms underlying LMNB1 duplications are unclear. We report the detailed molecular analysis of the largest collection of ADLD families studied, to date. We have identified the minimal duplicated region necessary for the disease, defined all the duplication junctions at the nucleotide level and identified the first inverted LMNB1 duplication. We have demonstrated that the duplications are not recurrent; patients with identical duplications share the same haplotype, likely inherited from a common founder and that the duplications originated from intrachromosomal events. The duplication junction sequences indicated that nonhomologous end joining or replication-based mechanisms such fork stalling and template switching or microhomology-mediated break induced repair are likely to be involved. LMNB1 expression was increased in patients’ fibroblasts both at mRNA and protein levels and the three LMNB1 alleles in ADLD patients show equal expression, suggesting that regulatory regions are maintained within the rearranged segment. These results have allowed us to elucidate duplication mechanisms and provide insights into allele-specific LMNB1 expression levels. PMID:23649844

  15. Developmental and Environmental Regulation of Aquaporin Gene Expression across Populus Species: Divergence or Redundancy?

    PubMed Central

    Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène

    2013-01-01

    Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected. PMID:23393587

  16. Developmental and environmental regulation of Aquaporin gene expression across Populus species: divergence or redundancy?

    PubMed

    Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène

    2013-01-01

    Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected.

  17. Transducin Duplicates in the Zebrafish Retina and Pineal Complex: Differential Specialisation after the Teleost Tetraploidisation

    PubMed Central

    Lagman, David; Callado-Pérez, Amalia; Franzén, Ilkin E.

    2015-01-01

    Gene duplications provide raw materials that can be selected for functional adaptations by evolutionary mechanisms. We describe here the results of 350 million years of evolution of three functionally related gene families: the alpha, beta and gamma subunits of transducins, the G protein involved in vision. Early vertebrate tetraploidisations resulted in separate transducin heterotrimers: gnat1/gnb1/gngt1 for rods, and gnat2/gnb3/gngt2 for cones. The teleost-specific tetraploidisation generated additional duplicates for gnb1, gnb3 and gngt2. We report here that the duplicates have undergone several types of subfunctionalisation or neofunctionalisation in the zebrafish. We have found that gnb1a and gnb1b are co-expressed at different levels in rods; gnb3a and gnb3b have undergone compartmentalisation restricting gnb3b to the dorsal and medial retina, however, gnb3a expression was detected only at very low levels in both larvae and adult retina; gngt2b expression is restricted to the dorsal and medial retina, whereas gngt2a is expressed ventrally. This dorsoventral distinction could be an adaptation to protect the lower part of the retina from intense light damage. The ontogenetic analysis shows earlier onset of expression in the pineal complex than in the retina, in accordance with its earlier maturation. Additionally, gnb1a but not gnb1b is expressed in the pineal complex, and gnb3b and gngt2b are transiently expressed in the pineal during ontogeny, thus showing partial temporal subfunctionalisation. These retina-pineal distinctions presumably reflect their distinct functional roles in vision and circadian rhythmicity. In summary, this study describes several functional differences between transducin gene duplicates resulting from the teleost-specific tetraploidisation. PMID:25806532

  18. Structure and transcriptional regulation of the major intrinsic protein gene family in grapevine.

    PubMed

    Wong, Darren Chern Jan; Zhang, Li; Merlin, Isabelle; Castellarin, Simone D; Gambetta, Gregory A

    2018-04-11

    The major intrinsic protein (MIP) family is a family of proteins, including aquaporins, which facilitate water and small molecule transport across plasma membranes. In plants, MIPs function in a huge variety of processes including water transport, growth, stress response, and fruit development. In this study, we characterize the structure and transcriptional regulation of the MIP family in grapevine, describing the putative genome duplication events leading to the family structure and characterizing the family's tissue and developmental specific expression patterns across numerous preexisting microarray and RNAseq datasets. Gene co-expression network (GCN) analyses were carried out across these datasets and the promoters of each family member were analyzed for cis-regulatory element structure in order to provide insight into their transcriptional regulation. A total of 29 Vitis vinifera MIP family members (excluding putative pseudogenes) were identified of which all but two were mapped onto Vitis vinifera chromosomes. In this study, segmental duplication events were identified for five plasma membrane intrinsic protein (PIP) and four tonoplast intrinsic protein (TIP) genes, contributing to the expansion of PIPs and TIPs in grapevine. Grapevine MIP family members have distinct tissue and developmental expression patterns and hierarchical clustering revealed two primary groups regardless of the datasets analyzed. Composite microarray and RNA-seq gene co-expression networks (GCNs) highlighted the relationships between MIP genes and functional categories involved in cell wall modification and transport, as well as with other MIPs revealing a strong co-regulation within the family itself. Some duplicated MIP family members have undergone sub-functionalization and exhibit distinct expression patterns and GCNs. Cis-regulatory element (CRE) analyses of the MIP promoters and their associated GCN members revealed enrichment for numerous CREs including AP2/ERFs and NACs. Combining phylogenetic analyses, gene expression profiling, gene co-expression network analyses, and cis-regulatory element enrichment, this study provides a comprehensive overview of the structure and transcriptional regulation of the grapevine MIP family. The study highlights the duplication and sub-functionalization of the family, its strong coordinated expression with genes involved in growth and transport, and the putative classes of TFs responsible for its regulation.

  19. Assessment and Reconstruction of Novel HSP90 Genes: Duplications, Gains and Losses in Fungal and Animal Lineages

    PubMed Central

    Pantzartzi, Chrysoula N.; Drosopoulou, Elena; Scouras, Zacharias G.

    2013-01-01

    Hsp90s, members of the Heat Shock Protein class, protect the structure and function of proteins and play a significant task in cellular homeostasis and signal transduction. In order to determine the number of hsp90 gene copies and encoded proteins in fungal and animal lineages and through that key duplication events that this family has undergone, we collected and evaluated Hsp90 protein sequences and corresponding Expressed Sequence Tags and analyzed available genomes from various taxa. We provide evidence for duplication events affecting either single species or wider taxonomic groups. With regard to Fungi, duplicated genes have been detected in several lineages. In invertebrates, we demonstrate key duplication events in certain clades of Arthropoda and Mollusca, and a possible gene loss event in a hymenopteran family. Finally, we infer that the duplication event responsible for the two (a and b) isoforms in vertebrates occurred probably shortly after the split of Hyperoartia and Gnathostomata. PMID:24066039

  20. Genomic Characterization of Variable Surface Antigens Reveals a Telomere Position Effect as a Prerequisite for RNA Interference-Mediated Silencing in Paramecium tetraurelia

    PubMed Central

    Baranasic, Damir; Oppermann, Timo; Cheaib, Miriam; Cullum, John; Schmidt, Helmut

    2014-01-01

    ABSTRACT Antigenic or phenotypic variation is a widespread phenomenon of expression of variable surface protein coats on eukaryotic microbes. To clarify the mechanism behind mutually exclusive gene expression, we characterized the genetic properties of the surface antigen multigene family in the ciliate Paramecium tetraurelia and the epigenetic factors controlling expression and silencing. Genome analysis indicated that the multigene family consists of intrachromosomal and subtelomeric genes; both classes apparently derive from different gene duplication events: whole-genome and intrachromosomal duplication. Expression analysis provides evidence for telomere position effects, because only subtelomeric genes follow mutually exclusive transcription. Microarray analysis of cultures deficient in Rdr3, an RNA-dependent RNA polymerase, in comparison to serotype-pure wild-type cultures, shows cotranscription of a subset of subtelomeric genes, indicating that the telomere position effect is due to a selective occurrence of Rdr3-mediated silencing in subtelomeric regions. We present a model of surface antigen evolution by intrachromosomal gene duplication involving the maintenance of positive selection of structurally relevant regions. Further analysis of chromosome heterogeneity shows that alternative telomere addition regions clearly affect transcription of closely related genes. Consequently, chromosome fragmentation appears to be of crucial importance for surface antigen expression and evolution. Our data suggest that RNAi-mediated control of this genetic network by trans-acting RNAs allows rapid epigenetic adaptation by phenotypic variation in combination with long-term genetic adaptation by Darwinian evolution of antigen genes. PMID:25389173

  1. Genome-Wide Analysis of the AP2/ERF Gene Family in Physic Nut and Overexpression of the JcERF011 Gene in Rice Increased Its Sensitivity to Salinity Stress

    PubMed Central

    Tang, Yuehui; Qin, Shanshan; Guo, Yali; Chen, Yanbo; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2016-01-01

    The AP2/ERF transcription factors play crucial roles in plant growth, development and responses to biotic and abiotic stresses. A total of 119 AP2/ERF genes (JcAP2/ERFs) have been identified in the physic nut genome; they include 16 AP2, 4 RAV, 1 Soloist, and 98 ERF genes. Phylogenetic analysis indicated that physic nut AP2 genes could be divided into 3 subgroups, while ERF genes could be classed into 11 groups or 43 subgroups. The AP2/ERF genes are non-randomly distributed across the 11 linkage groups of the physic nut genome and retain many duplicates which arose from ancient duplication events. The expression patterns of several JcAP2/ERF duplicates in the physic nut showed differences among four tissues (root, stem, leaf, and seed), and 38 JcAP2/ERF genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots according to analysis of digital gene expression tag data. The expression of JcERF011 was downregulated by salinity stress in physic nut roots. Overexpression of the JcERF011 gene in rice plants increased its sensitivity to salinity stress. The increased expression levels of several salt tolerance-related genes were impaired in the JcERF011-overexpressing plants under salinity stress. PMID:26943337

  2. Genome-Wide Analysis of the AP2/ERF Gene Family in Physic Nut and Overexpression of the JcERF011 Gene in Rice Increased Its Sensitivity to Salinity Stress.

    PubMed

    Tang, Yuehui; Qin, Shanshan; Guo, Yali; Chen, Yanbo; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2016-01-01

    The AP2/ERF transcription factors play crucial roles in plant growth, development and responses to biotic and abiotic stresses. A total of 119 AP2/ERF genes (JcAP2/ERFs) have been identified in the physic nut genome; they include 16 AP2, 4 RAV, 1 Soloist, and 98 ERF genes. Phylogenetic analysis indicated that physic nut AP2 genes could be divided into 3 subgroups, while ERF genes could be classed into 11 groups or 43 subgroups. The AP2/ERF genes are non-randomly distributed across the 11 linkage groups of the physic nut genome and retain many duplicates which arose from ancient duplication events. The expression patterns of several JcAP2/ERF duplicates in the physic nut showed differences among four tissues (root, stem, leaf, and seed), and 38 JcAP2/ERF genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots according to analysis of digital gene expression tag data. The expression of JcERF011 was downregulated by salinity stress in physic nut roots. Overexpression of the JcERF011 gene in rice plants increased its sensitivity to salinity stress. The increased expression levels of several salt tolerance-related genes were impaired in the JcERF011-overexpressing plants under salinity stress.

  3. Genomic Imprinting Was Evolutionarily Conserved during Wheat Polyploidization.

    PubMed

    Yang, Guanghui; Liu, Zhenshan; Gao, Lulu; Yu, Kuohai; Feng, Man; Yao, Yingyin; Peng, Huiru; Hu, Zhaorong; Sun, Qixin; Ni, Zhongfu; Xin, Mingming

    2018-01-01

    Genomic imprinting is an epigenetic phenomenon that causes genes to be differentially expressed depending on their parent of origin. To evaluate the evolutionary conservation of genomic imprinting and the effects of ploidy on this process, we investigated parent-of-origin-specific gene expression patterns in the endosperm of diploid ( Aegilops spp), tetraploid, and hexaploid wheat ( Triticum spp) at various stages of development via high-throughput transcriptome sequencing. We identified 91, 135, and 146 maternally or paternally expressed genes (MEGs or PEGs, respectively) in diploid, tetraploid, and hexaploid wheat, respectively, 52.7% of which exhibited dynamic expression patterns at different developmental stages. Gene Ontology enrichment analysis suggested that MEGs and PEGs were involved in metabolic processes and DNA-dependent transcription, respectively. Nearly half of the imprinted genes exhibited conserved expression patterns during wheat hexaploidization. In addition, 40% of the homoeolog pairs originating from whole-genome duplication were consistently maternally or paternally biased in the different subgenomes of hexaploid wheat. Furthermore, imprinted expression was found for 41.2% and 50.0% of homolog pairs that evolved by tandem duplication after genome duplication in tetraploid and hexaploid wheat, respectively. These results suggest that genomic imprinting was evolutionarily conserved between closely related Triticum and Aegilops species and in the face of polyploid hybridization between species in these genera. © 2018 American Society of Plant Biologists. All rights reserved.

  4. Root hairs, trichomes and the evolution of duplicate genes.

    PubMed

    Kellogg, E A

    2001-12-01

    The MYB-class proteins WEREWOLF and GLABRA1 are functionally interchangeable, even though one is normally expressed solely in roots and the other only in shoots. This shows that their different functions are the result of the modification of cis-regulatory sequences over evolutionary time. The two genes thus provide an example of morphological diversification created by gene duplication and changes in regulation.

  5. Ancient Expansion of the Hox Cluster in Lepidoptera Generated Four Homeobox Genes Implicated in Extra-Embryonic Tissue Formation

    PubMed Central

    Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.

    2014-01-01

    Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of large clade Ditrysia; Shx are not found in a caddisfly and a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822

  6. Genome-wide identification and evolution of the PIN-FORMED (PIN) gene family in Glycine max.

    PubMed

    Liu, Yuan; Wei, Haichao

    2017-07-01

    Soybean (Glycine max) is one of the most important crop plants. Wild and cultivated soybean varieties have significant differences worth further investigation, such as plant morphology, seed size, and seed coat development; these characters may be related to auxin biology. The PIN gene family encodes essential transport proteins in cell-to-cell auxin transport, but little research on soybean PIN genes (GmPIN genes) has been done, especially with respect to the evolution and differences between wild and cultivated soybean. In this study, we retrieved 23 GmPIN genes from the latest updated G. max genome database; six GmPIN protein sequences were changed compared with the previous database. Based on the Plant Genome Duplication Database, 18 GmPIN genes have been involved in segment duplication. Three pairs of GmPIN genes arose after the second soybean genome duplication, and six occurred after the first genome duplication. The duplicated GmPIN genes retained similar expression patterns. All the duplicated GmPIN genes experienced purifying selection (K a /K s < 1) to prevent accumulation of non-synonymous mutations and thus remained more similar. In addition, we also focused on the artificial selection of the soybean PIN genes. Five artificially selected GmPIN genes were identified by comparing the genome sequence of 17 wild and 14 cultivated soybean varieties. Our research provides useful and comprehensive basic information for understanding GmPIN genes.

  7. Independent and parallel evolution of new genes by gene duplication in two origins of C4 photosynthesis provides new insight into the mechanism of phloem loading in C4 species

    DOE PAGES

    Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; ...

    2016-03-24

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes ismore » enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Moreover, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species.« less

  8. Comprehensive Genome-Wide Survey, Genomic Constitution and Expression Profiling of the NAC Transcription Factor Family in Foxtail Millet (Setaria italica L.)

    PubMed Central

    Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B., Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj

    2013-01-01

    The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants. PMID:23691254

  9. Comprehensive genome-wide survey, genomic constitution and expression profiling of the NAC transcription factor family in foxtail millet (Setaria italica L.).

    PubMed

    Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B, Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj

    2013-01-01

    The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.

  10. Segmental duplications and evolutionary acquisition of UV damage response in the SPATA31 gene family of primates and humans.

    PubMed

    Bekpen, Cemalettin; Künzel, Sven; Xie, Chen; Eaaswarkhanth, Muthukrishnan; Lin, Yen-Lung; Gokcumen, Omer; Akdis, Cezmi A; Tautz, Diethard

    2017-03-06

    Segmental duplications are an abundant source for novel gene functions and evolutionary adaptations. This mechanism of generating novelty was very active during the evolution of primates particularly in the human lineage. Here, we characterize the evolution and function of the SPATA31 gene family (former designation FAM75A), which was previously shown to be among the gene families with the strongest signal of positive selection in hominoids. The mouse homologue for this gene family is a single copy gene expressed during spermatogenesis. We show that in primates, the SPATA31 gene duplicated into SPATA31A and SPATA31C types and broadened the expression into many tissues. Each type became further segmentally duplicated in the line towards humans with the largest number of full-length copies found for SPATA31A in humans. Copy number estimates of SPATA31A based on digital PCR show an average of 7.5 with a range of 5-11 copies per diploid genome among human individuals. The primate SPATA31 genes also acquired new protein domains that suggest an involvement in UV response and DNA repair. We generated antibodies and show that the protein is re-localized from the nucleolus to the whole nucleus upon UV-irradiation suggesting a UV damage response. We used CRISPR/Cas mediated mutagenesis to knockout copies of the gene in human primary fibroblast cells. We find that cell lines with reduced functional copies as well as naturally occurring low copy number HFF cells show enhanced sensitivity towards UV-irradiation. The acquisition of new SPATA31 protein functions and its broadening of expression may be related to the evolution of the diurnal life style in primates that required a higher UV tolerance. The increased segmental duplications in hominoids as well as its fast evolution suggest the acquisition of further specific functions particularly in humans.

  11. Hox genes and chordate evolution.

    PubMed

    Holland, P W; Garcia-Fernàndez, J

    1996-02-01

    Hox genes are implicated in the control of axial patterning during embryonic development of many, perhaps all, animals. Here we review recent data on Hox gene diversity, genomic organization, and embryonic expression in chordates (including tunicates, amphioxus, hagfish, lampreys, teleosts) plus their putative sister group, the hemichordates. We consider the potential of comparative Hox gene data to resolve some outstanding controversies in chordate phylogeny. The use of Hox gene expression patterns to identify homologies between body plans both within the vertebrates and between the chordate subphyla is also discussed. Homology between the vertebrate hindbrain and an extensive region of amphioxus neural tube is suggested by comparison of Hox-3 homologues and strengthened by new data on amphioxus Hox-1 gene expression reported here. Finally, we give two examples of how Hox genes are giving glimpses into chordate developmental evolution. The first relates changes in Hox gene expression to transposition of vertebral of vertebral identities; the second describes a correlation between vertebrate origins and Hox gene cluster duplication. We suggest that the simultaneous duplication of many classes of genes, often interacting in gene networks, allowed the elaboration of new developmental control mechanisms at vertebrate origins.

  12. Xp11.2 microduplications including IQSEC2, TSPYL2 and KDM5C genes in patients with neurodevelopmental disorders

    PubMed Central

    Moey, Ching; Hinze, Susan J; Brueton, Louise; Morton, Jenny; McMullan, Dominic J; Kamien, Benjamin; Barnett, Christopher P; Brunetti-Pierri, Nicola; Nicholl, Jillian; Gecz, Jozef; Shoubridge, Cheryl

    2016-01-01

    Copy number variations are a common cause of intellectual disability (ID). Determining the contribution of copy number variants (CNVs), particularly gains, to disease remains challenging. Here, we report four males with ID with sub-microscopic duplications at Xp11.2 and review the few cases with overlapping duplications reported to date. We established the extent of the duplicated regions in each case encompassing a minimum of three known disease genes TSPYL2, KDM5C and IQSEC2 with one case also duplicating the known disease gene HUWE1. Patients with a duplication encompassing TSPYL2, KDM5C and IQSEC2 without gains of nearby SMC1A and HUWE1 genes have not been reported thus far. All cases presented with ID and significant deficits of speech development. Some patients also manifested behavioral disturbances such as hyperactivity and attention-deficit/hyperactivity disorder. Lymphoblastic cell lines from patients show markedly elevated levels of TSPYL2, KDM5C and SMC1A, transcripts consistent with the extent of their CNVs. The duplicated region in our patients contains several genes known to escape X-inactivation, including KDM5C, IQSEC2 and SMC1A. In silico analysis of expression data in selected gene expression omnibus series indicates that dosage of these genes, especially IQSEC2, is similar in males and females despite the fact they escape from X-inactivation in females. Taken together, the data suggest that gains in Xp11.22 including IQSEC2 cause ID and are associated with hyperactivity and attention-deficit/hyperactivity disorder, and are likely to be dosage-sensitive in males. PMID:26059843

  13. Xp11.2 microduplications including IQSEC2, TSPYL2 and KDM5C genes in patients with neurodevelopmental disorders.

    PubMed

    Moey, Ching; Hinze, Susan J; Brueton, Louise; Morton, Jenny; McMullan, Dominic J; Kamien, Benjamin; Barnett, Christopher P; Brunetti-Pierri, Nicola; Nicholl, Jillian; Gecz, Jozef; Shoubridge, Cheryl

    2016-03-01

    Copy number variations are a common cause of intellectual disability (ID). Determining the contribution of copy number variants (CNVs), particularly gains, to disease remains challenging. Here, we report four males with ID with sub-microscopic duplications at Xp11.2 and review the few cases with overlapping duplications reported to date. We established the extent of the duplicated regions in each case encompassing a minimum of three known disease genes TSPYL2, KDM5C and IQSEC2 with one case also duplicating the known disease gene HUWE1. Patients with a duplication encompassing TSPYL2, KDM5C and IQSEC2 without gains of nearby SMC1A and HUWE1 genes have not been reported thus far. All cases presented with ID and significant deficits of speech development. Some patients also manifested behavioral disturbances such as hyperactivity and attention-deficit/hyperactivity disorder. Lymphoblastic cell lines from patients show markedly elevated levels of TSPYL2, KDM5C and SMC1A, transcripts consistent with the extent of their CNVs. The duplicated region in our patients contains several genes known to escape X-inactivation, including KDM5C, IQSEC2 and SMC1A. In silico analysis of expression data in selected gene expression omnibus series indicates that dosage of these genes, especially IQSEC2, is similar in males and females despite the fact they escape from X-inactivation in females. Taken together, the data suggest that gains in Xp11.22 including IQSEC2 cause ID and are associated with hyperactivity and attention-deficit/hyperactivity disorder, and are likely to be dosage-sensitive in males.

  14. Application of community phylogenetic approaches to understand gene expression: differential exploration of venom gene space in predatory marine gastropods.

    PubMed

    Chang, Dan; Duda, Thomas F

    2014-06-05

    Predatory marine gastropods of the genus Conus exhibit substantial variation in venom composition both within and among species. Apart from mechanisms associated with extensive turnover of gene families and rapid evolution of genes that encode venom components ('conotoxins'), the evolution of distinct conotoxin expression patterns is an additional source of variation that may drive interspecific differences in the utilization of species' 'venom gene space'. To determine the evolution of expression patterns of venom genes of Conus species, we evaluated the expression of A-superfamily conotoxin genes of a set of closely related Conus species by comparing recovered transcripts of A-superfamily genes that were previously identified from the genomes of these species. We modified community phylogenetics approaches to incorporate phylogenetic history and disparity of genes and their expression profiles to determine patterns of venom gene space utilization. Less than half of the A-superfamily gene repertoire of these species is expressed, and only a few orthologous genes are coexpressed among species. Species exhibit substantially distinct expression strategies, with some expressing sets of closely related loci ('under-dispersed' expression of available genes) while others express sets of more disparate genes ('over-dispersed' expression). In addition, expressed genes show higher dN/dS values than either unexpressed or ancestral genes; this implies that expression exposes genes to selection and facilitates rapid evolution of these genes. Few recent lineage-specific gene duplicates are expressed simultaneously, suggesting that expression divergence among redundant gene copies may be established shortly after gene duplication. Our study demonstrates that venom gene space is explored differentially by Conus species, a process that effectively permits the independent and rapid evolution of venoms in these species.

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes ismore » enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Moreover, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species.« less

  16. Evolutionary characterization and transcript profiling of β-tubulin genes in flax (Linum usitatissimum L.) during plant development.

    PubMed

    Gavazzi, Floriana; Pigna, Gaia; Braglia, Luca; Gianì, Silvia; Breviario, Diego; Morello, Laura

    2017-12-08

    Microtubules, polymerized from alpha and beta-tubulin monomers, play a fundamental role in plant morphogenesis, determining the cell division plane, the direction of cell expansion and the deposition of cell wall material. During polarized pollen tube elongation, microtubules serve as tracks for vesicular transport and deposition of proteins/lipids at the tip membrane. Such functions are controlled by cortical microtubule arrays. Aim of this study was to first characterize the flax β-tubulin family by sequence and phylogenetic analysis and to investigate differential expression of β-tubulin genes possibly related to fibre elongation and to flower development. We report the cloning and characterization of the complete flax β-tubulin gene family: exon-intron organization, duplicated gene comparison, phylogenetic analysis and expression pattern during stem and hypocotyl elongation and during flower development. Sequence analysis of the fourteen expressed β-tubulin genes revealed that the recent whole genome duplication of the flax genome was followed by massive retention of duplicated tubulin genes. Expression analysis showed that β-tubulin mRNA profiles gradually changed along with phloem fibre development in both the stem and hypocotyl. In flowers, changes in relative tubulin transcript levels took place at anthesis in anthers, but not in carpels. Phylogenetic analysis supports the origin of extant plant β-tubulin genes from four ancestral genes pre-dating angiosperm separation. Expression analysis suggests that particular tubulin subpopulations are more suitable to sustain different microtubule functions such as cell elongation, cell wall thickening or pollen tube growth. Tubulin genes possibly related to different microtubule functions were identified as candidate for more detailed studies.

  17. Adaptations to High Salt in a Halophilic Protist: Differential Expression and Gene Acquisitions through Duplications and Gene Transfers

    PubMed Central

    Harding, Tommy; Roger, Andrew J.; Simpson, Alastair G. B.

    2017-01-01

    The capacity of halophiles to thrive in extreme hypersaline habitats derives partly from the tight regulation of ion homeostasis, the salt-dependent adjustment of plasma membrane fluidity, and the increased capability to manage oxidative stress. Halophilic bacteria, and archaea have been intensively studied, and substantial research has been conducted on halophilic fungi, and the green alga Dunaliella. By contrast, there have been very few investigations of halophiles that are phagotrophic protists, i.e., protozoa. To gather fundamental knowledge about salt adaptation in these organisms, we studied the transcriptome-level response of Halocafeteria seosinensis (Stramenopiles) grown under contrasting salinities. We provided further evolutionary context to our analysis by identifying genes that underwent recent duplications. Genes that were highly responsive to salinity variations were involved in stress response (e.g., chaperones), ion homeostasis (e.g., Na+/H+ transporter), metabolism and transport of lipids (e.g., sterol biosynthetic genes), carbohydrate metabolism (e.g., glycosidases), and signal transduction pathways (e.g., transcription factors). A significantly high proportion (43%) of duplicated genes were also differentially expressed, accentuating the importance of gene expansion in adaptation by H. seosinensis to high salt environments. Furthermore, we found two genes that were lateral acquisitions from bacteria, and were also highly up-regulated and highly expressed at high salt, suggesting that this evolutionary mechanism could also have facilitated adaptation to high salt. We propose that a transition toward high-salt adaptation in the ancestors of H. seosinensis required the acquisition of new genes via duplication, and some lateral gene transfers (LGTs), as well as the alteration of transcriptional programs, leading to increased stress resistance, proper establishment of ion gradients, and modification of cell structure properties like membrane fluidity. PMID:28611746

  18. Digital gene expression analysis with sample multiplexing and PCR duplicate detection: A straightforward protocol.

    PubMed

    Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph

    2016-01-01

    Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.

  19. Whole-Genome Duplication and the Functional Diversification of Teleost Fish Hemoglobins

    PubMed Central

    Opazo, Juan C.; Butts, G. Tyler; Nery, Mariana F.; Storz, Jay F.; Hoffmann, Federico G.

    2013-01-01

    Subsequent to the two rounds of whole-genome duplication that occurred in the common ancestor of vertebrates, a third genome duplication occurred in the stem lineage of teleost fishes. This teleost-specific genome duplication (TGD) is thought to have provided genetic raw materials for the physiological, morphological, and behavioral diversification of this highly speciose group. The extreme physiological versatility of teleost fish is manifest in their diversity of blood–gas transport traits, which reflects the myriad solutions that have evolved to maintain tissue O2 delivery in the face of changing metabolic demands and environmental O2 availability during different ontogenetic stages. During the course of development, regulatory changes in blood–O2 transport are mediated by the expression of multiple, functionally distinct hemoglobin (Hb) isoforms that meet the particular O2-transport challenges encountered by the developing embryo or fetus (in viviparous or oviparous species) and in free-swimming larvae and adults. The main objective of the present study was to assess the relative contributions of whole-genome duplication, large-scale segmental duplication, and small-scale gene duplication in producing the extraordinary functional diversity of teleost Hbs. To accomplish this, we integrated phylogenetic reconstructions with analyses of conserved synteny to characterize the genomic organization and evolutionary history of the globin gene clusters of teleosts. These results were then integrated with available experimental data on functional properties and developmental patterns of stage-specific gene expression. Our results indicate that multiple α- and β-globin genes were present in the common ancestor of gars (order Lepisoteiformes) and teleosts. The comparative genomic analysis revealed that teleosts possess a dual set of TGD-derived globin gene clusters, each of which has undergone lineage-specific changes in gene content via repeated duplication and deletion events. Phylogenetic reconstructions revealed that paralogous genes convergently evolved similar functional properties in different teleost lineages. Consistent with other recent studies of globin gene family evolution in vertebrates, our results revealed evidence for repeated evolutionary transitions in the developmental regulation of Hb synthesis. PMID:22949522

  20. The cytochrome P450 2AA gene cluster in zebrafish (Danio rerio): Expression of CYP2AA1 and CYP2AA2 and response to phenobarbital-type inducers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kubota, Akira; Bainy, Afonso C.D.; Departamento de Bioquímica, CCB, Universidade Federal de Santa Catarina, Florianopolis, SC 88040-900

    2013-10-01

    The cytochrome P450 (CYP) 2 gene family is the largest and most diverse CYP gene family in vertebrates. In zebrafish, we have identified 10 genes in a new subfamily, CYP2AA, which does not show orthology to any human or other mammalian CYP genes. Here we report evolutionary and structural relationships of the 10 CYP2AA genes and expression of the first two genes, CYP2AA1 and CYP2AA2. Parsimony reconstruction of the tandem duplication pattern for the CYP2AA cluster suggests that CYP2AA1, CYP2AA2 and CYP2AA3 likely arose in the earlier duplication events and thus are most diverged in function from the other CYP2AAs.more » On the other hand, CYP2AA8 and CYP2AA9 are genes that arose in the latest duplication event, implying functional similarity between these two CYPs. A molecular model of CYP2AA1 showing the sequence conservation across the CYP2AA cluster reveals that the regions with the highest variability within the cluster map onto CYP2AA1 near the substrate access channels, suggesting differing substrate specificities. Zebrafish CYP2AA1 transcript was expressed predominantly in the intestine, while CYP2AA2 was most highly expressed in the kidney, suggesting differing roles in physiology. In the liver CYP2AA2 expression but not that of CYP2AA1, was increased by 1,4-bis [2-(3,5-dichloropyridyloxy)] benzene (TCPOBOP) and, to a lesser extent, by phenobarbital (PB). In contrast, pregnenolone 16α-carbonitrile (PCN) increased CYP2AA1 expression, but not CYP2AA2 in the liver. The results identify a CYP2 subfamily in zebrafish that includes genes apparently induced by PB-type chemicals and PXR agonists, the first concrete in vivo evidence for a PB-type response in fish. - Highlights: • A tandemly duplicated cluster of ten CYP2AA genes was described in zebrafish. • Parsimony and duplication analyses suggest pathways to CYP2AA diversity. • Homology models reveal amino acid positions possibly related to functional diversity. • The CYP2AA locus does not share synteny with any CYP2 subfamily in mammals. • Induction of CYP2AA1 and CYP2AA2 indicates a phenobarbital-type response in fish.« less

  1. Increased expression of LD1 genes transcribed by RNA polymerase I in Leishmania donovani as a result of duplication into the rRNA gene locus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lodes, M.J.; Merlin, G.; DeVos, T.

    1995-12-01

    This report investigates the duplication of two LD1 genes into the rRNA locus and the resultant transcription by RNA polymerase I, which has a faster transcription rate than that of RNA polymerase II. This was conducted using a 2.2-Mb chromosome in Leishmania donovani. 55 refs., 6 figs.

  2. Salmo salar and Esox lucius full-length cDNA sequences reveal changes in evolutionary pressures on a post-tetraploidization genome

    PubMed Central

    2010-01-01

    Background Salmonids are one of the most intensely studied fish, in part due to their economic and environmental importance, and in part due to a recent whole genome duplication in the common ancestor of salmonids. This duplication greatly impacts species diversification, functional specialization, and adaptation. Extensive new genomic resources have recently become available for Atlantic salmon (Salmo salar), but documentation of allelic versus duplicate reference genes remains a major uncertainty in the complete characterization of its genome and its evolution. Results From existing expressed sequence tag (EST) resources and three new full-length cDNA libraries, 9,057 reference quality full-length gene insert clones were identified for Atlantic salmon. A further 1,365 reference full-length clones were annotated from 29,221 northern pike (Esox lucius) ESTs. Pairwise dN/dS comparisons within each of 408 sets of duplicated salmon genes using northern pike as a diploid out-group show asymmetric relaxation of selection on salmon duplicates. Conclusions 9,057 full-length reference genes were characterized in S. salar and can be used to identify alleles and gene family members. Comparisons of duplicated genes show that while purifying selection is the predominant force acting on both duplicates, consistent with retention of functionality in both copies, some relaxation of pressure on gene duplicates can be identified. In addition, there is evidence that evolution has acted asymmetrically on paralogs, allowing one of the pair to diverge at a faster rate. PMID:20433749

  3. Genome-wide identification and characterization of Glyceraldehyde-3-phosphate dehydrogenase genes family in wheat (Triticum aestivum).

    PubMed

    Zeng, Lingfeng; Deng, Rong; Guo, Ziping; Yang, Shushen; Deng, Xiping

    2016-03-16

    Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is a central enzyme in glycolysi, we performed genome-wide identification of GAPDH genes in wheat and analyzed their structural characteristics and expression patterns under abiotic stress in wheat. A total of 22 GAPDH genes were identified in wheat cv. Chinese spring; the phylogenetic and structure analysis showed that these GAPDH genes could be divided into four distinct subfamilies. The expression profiles of GAPDH genes showed tissue specificity all over plant development stages. The qRT-PCR results revealed that wheat GAPDHs were involved in several abiotic stress response. Wheat carried 22 GAPDH genes, representing four types of plant GAPDHs (gapA/B, gapC, gapCp and gapN). Whole genome duplication and segmental duplication might account for the expansion of wheat GAPDHs. Expression analysis implied that GAPDHs play roles in plants abiotic stress tolerance.

  4. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.

    PubMed

    Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H

    2016-04-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.

  5. The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups.

    PubMed

    Yin, Guangjun; Xu, Hongliang; Xiao, Shuyang; Qin, Yajuan; Li, Yaxuan; Yan, Yueming; Hu, Yingkao

    2013-10-03

    WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have different evolutionary rates. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean.

  6. Reciprocal Loss of CArG-Boxes and Auxin Response Elements Drives Expression Divergence of MPF2-Like MADS-Box Genes Controlling Calyx Inflation

    PubMed Central

    Khan, Muhammad Ramzan; Hu, Jinyong; Ali, Ghulam Muhammad

    2012-01-01

    Expression divergence is thought to be a hallmark of functional diversification between homologs post duplication. Modification in regulatory elements has been invoked to explain expression divergence after duplication for several MADS-box genes, however, verification of reciprocal loss of cis-regulatory elements is lacking in plants. Here, we report that the evolution of MPF2-like genes has entailed degenerative mutations in a core promoter CArG-box and an auxin response factor (ARF) binding element in the large 1st intron in the coding region. Previously, MPF2-like genes were duplicated into MPF2-like-A and -B through genome duplication in Withania and Tubocapsicum (Withaninae). The calyx of Withania grows exorbitantly after pollination unlike Tubocapsicum, where it degenerates. Besides inflated calyx syndrome formation, MPF2-like transcription factors are implicated in functions both during the vegetative and reproductive development as well as in phase transition. MPF2-like-A of Withania (WSA206) is strongly expressed in sepals, while MPF2-like-B (WSB206) is not. Interestingly, their combined expression patterns seem to replicate the pattern of their closely related hypothetical progenitors from Vassobia and Physalis. Using phylogenetic shadowing, site-directed mutagenesis and motif swapping, we could show that the loss of a conserved CArG-box in MPF2-like-B of Withania is responsible for impeding its expression in sepals. Conversely, loss of an ARE in MPF2-like-A relaxed the constraint on expression in sepals. Thus, the ARE is an active suppressor of MPF2-like gene expression in sepals, which in contrast is activated via the CArG-box. The observed expression divergence in MPF2-like genes due to reciprocal loss of cis-regulatory elements has added to genetic and phenotypic variations in the Withaninae and enhanced the potential of natural selection for the adaptive evolution of ICS. Moreover, these results provide insight into the interplay of floral developmental and hormonal pathways during ICS development and add to the understanding of the importance of polyploidy in plants. PMID:22900049

  7. Comparative genomic analysis of the WRKY III gene family in populus, grape, arabidopsis and rice.

    PubMed

    Wang, Yiyi; Feng, Lin; Zhu, Yuxin; Li, Yuan; Yan, Hanwei; Xiang, Yan

    2015-09-08

    WRKY III genes have significant functions in regulating plant development and resistance. In plant, WRKY gene family has been studied in many species, however, there still lack a comprehensive analysis of WRKY III genes in the woody plant species poplar, three representative lineages of flowering plant species are incorporated in most analyses: Arabidopsis (a model plant for annual herbaceous dicots), grape (one model plant for perennial dicots) and Oryza sativa (a model plant for monocots). In this study, we identified 10, 6, 13 and 28 WRKY III genes in the genomes of Populus trichocarpa, grape (Vitis vinifera), Arabidopsis thaliana and rice (Oryza sativa), respectively. Phylogenetic analysis revealed that the WRKY III proteins could be divided into four clades. By microsynteny analysis, we found that the duplicated regions were more conserved between poplar and grape than Arabidopsis or rice. We dated their duplications by Ks analysis of Populus WRKY III genes and demonstrated that all the blocks were formed after the divergence of monocots and dicots. Strong purifying selection has played a key role in the maintenance of WRKY III genes in Populus. Tissue expression analysis of the WRKY III genes in Populus revealed that five were most highly expressed in the xylem. We also performed quantitative real-time reverse transcription PCR analysis of WRKY III genes in Populus treated with salicylic acid, abscisic acid and polyethylene glycol to explore their stress-related expression patterns. This study highlighted the duplication and diversification of the WRKY III gene family in Populus and provided a comprehensive analysis of this gene family in the Populus genome. Our results indicated that the majority of WRKY III genes of Populus was expanded by large-scale gene duplication. The expression pattern of PtrWRKYIII gene identified that these genes play important roles in the xylem during poplar growth and development, and may play crucial role in defense to drought stress. Our results presented here may aid in the selection of appropriate candidate genes for further characterization of their biological functions in poplar.

  8. Expression, subcellular localization, and cis-regulatory structure of duplicated phytoene synthase genes in melon (Cucumis melo L.).

    PubMed

    Qin, Xiaoqiong; Coku, Ardian; Inoue, Kentaro; Tian, Li

    2011-10-01

    Carotenoids perform many critical functions in plants, animals, and humans. It is therefore important to understand carotenoid biosynthesis and its regulation in plants. Phytoene synthase (PSY) catalyzes the first committed and rate-limiting step in carotenoid biosynthesis. While PSY is present as a single copy gene in Arabidopsis, duplicated PSY genes have been identified in many economically important monocot and dicot crops. CmPSY1 was previously identified from melon (Cucumis melo L.), but was not functionally characterized. We isolated a second PSY gene, CmPSY2, from melon in this work. CmPSY2 possesses a unique intron/exon structure that has not been observed in other plant PSYs. Both CmPSY1 and CmPSY2 are functional in vitro, but exhibit distinct expression patterns in different melon tissues and during fruit development, suggesting differential regulation of the duplicated melon PSY genes. In vitro chloroplast import assays verified the plastidic localization of CmPSY1 and CmPSY2 despite the lack of an obvious plastid target peptide in CmPSY2. Promoter motif analysis of the duplicated melon and tomato PSY genes and the Arabidopsis PSY revealed distinctive cis-regulatory structures of melon PSYs and identified gibberellin-responsive motifs in all PSYs except for SlPSY1, which has not been reported previously. Overall, these data provide new insights into the evolutionary history of plant PSY genes and the regulation of PSY expression by developmental and environmental signals that may involve different regulatory networks.

  9. Characterization and Comparison of the CPK Gene Family in the Apple (Malus × domestica) and Other Rosaceae Species and Its Response to Alternaria alternata Infection.

    PubMed

    Wei, Menghan; Wang, Sanhong; Dong, Hui; Cai, Binhua; Tao, Jianmin

    2016-01-01

    As one of the Ca2+ sensors, calcium-dependent protein kinase (CPK) plays vital roles in immune and stress signaling, growth and development, and hormone responses, etc. Recently, the whole genome of apple (Malus × domestica), pear (Pyrus communis), peach (Prunus persica), plum (Prunus mume) and strawberry (Fragaria vesca) in Rosaceae family has been fully sequenced. However, little is known about the CPK gene family in these Rosaceae species. In this study, 123 CPK genes were identified from five Rosaceae species, including 37 apple CPKs, 37 pear CPKs, 17 peach CPKs, 16 strawberry CPKs, and 16 plum CPKs. Based on the phylogenetic tree topology and structural characteristics, we divided the CPK gene family into 4 distinct subfamilies: Group I, II, III, and IV. Whole-genome duplication (WGD) or segmental duplication played vital roles in the expansion of the CPK in these Rosaceae species. Most of segmental duplication pairs in peach and plum may have arisen from the γ triplication (~140 million years ago [MYA]), while in apple genome, many duplicated genes may have been derived from a recent WGD (30~45 MYA). Purifying selection also played a critical role in the function evolution of CPK family genes. Expression of apple CPK genes in response to apple pathotype of Alternaria alternata was verified by analysis of quantitative real-time RT-PCR (qPCR). Expression data demonstrated that CPK genes in apple might have evolved independently in different biological contexts. The analysis of evolution history and expression profile laid a foundation for further examining the function and complexity of the CPK gene family in Rosaceae.

  10. Characterization and Comparison of the CPK Gene Family in the Apple (Malus × domestica) and Other Rosaceae Species and Its Response to Alternaria alternata Infection

    PubMed Central

    Wei, Menghan; Wang, Sanhong; Dong, Hui; Cai, Binhua; Tao, Jianmin

    2016-01-01

    As one of the Ca2+ sensors, calcium-dependent protein kinase (CPK) plays vital roles in immune and stress signaling, growth and development, and hormone responses, etc. Recently, the whole genome of apple (Malus × domestica), pear (Pyrus communis), peach (Prunus persica), plum (Prunus mume) and strawberry (Fragaria vesca) in Rosaceae family has been fully sequenced. However, little is known about the CPK gene family in these Rosaceae species. In this study, 123 CPK genes were identified from five Rosaceae species, including 37 apple CPKs, 37 pear CPKs, 17 peach CPKs, 16 strawberry CPKs, and 16 plum CPKs. Based on the phylogenetic tree topology and structural characteristics, we divided the CPK gene family into 4 distinct subfamilies: Group I, II, III, and IV. Whole-genome duplication (WGD) or segmental duplication played vital roles in the expansion of the CPK in these Rosaceae species. Most of segmental duplication pairs in peach and plum may have arisen from the γ triplication (~140 million years ago [MYA]), while in apple genome, many duplicated genes may have been derived from a recent WGD (30~45 MYA). Purifying selection also played a critical role in the function evolution of CPK family genes. Expression of apple CPK genes in response to apple pathotype of Alternaria alternata was verified by analysis of quantitative real-time RT-PCR (qPCR). Expression data demonstrated that CPK genes in apple might have evolved independently in different biological contexts. The analysis of evolution history and expression profile laid a foundation for further examining the function and complexity of the CPK gene family in Rosaceae. PMID:27186637

  11. Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species.

    PubMed

    Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping

    2018-04-23

    The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .

  12. Genome-Wide Identification, Characterization and Phylogenetic Analysis of ATP-Binding Cassette (ABC) Transporter Genes in Common Carp (Cyprinus carpio).

    PubMed

    Liu, Xiang; Li, Shangqi; Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A; Xu, Peng

    2016-01-01

    The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp.

  13. Genome-Wide Identification, Characterization and Phylogenetic Analysis of ATP-Binding Cassette (ABC) Transporter Genes in Common Carp (Cyprinus carpio)

    PubMed Central

    Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A.

    2016-01-01

    The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp. PMID:27058731

  14. Genome wide identification, phylogeny, and expression of bone morphogenetic protein genes in tetraploidized common carp (Cyprinus carpio).

    PubMed

    Chen, Lin; Dong, Chuanju; Kong, Shengnan; Zhang, Jiangfan; Li, Xuejun; Xu, Peng

    2017-09-05

    Bone morphogenetic proteins (Bmps) are a group of signaling molecules known to play important roles during formation and maintenance of various organs, not only bone, but also muscle, blood and so on. Common carp (Cyprinus carpio) is one of the most intensively studied fish due to its economic and environmental importance. Besides, common carp has encountered an additional round of whole genome duplication (WGD) compared with many closely related diploid teleost, which make it one of the most important models for genome evolutionary studies in teleost. Comprehensive genome resources of common carp have been developed recently, which facilitate the thorough characterization of bmp gene family in the tetraploidized common carp genome. We identified a total of 44 bmps from the common carp genome, which are twice as many as that of zebrafish. Phylogenetic analysis revealed that most of bmps are highly conserved. Comparative analysis was performed across six typical vertebrate genomes. It appeared that all the bmp genes in common carp were duplicated. Obviously, the expansion of the bmp gene family in common carp was due to the latest additional round of whole genome duplication and made it more abundant than other diploid teleosts. Expression signatures were assessed in major tissues, including gill, intestine, liver, spleen, skin, heart, gonad, muscle, kidney, head kidney, brain and blood, which demonstrated the comprehensive expression profiles of bmp genes in the tetraploidized genome. Significant gene expression divergences were observed which revealed substantial functional divergences of those duplicated bmp genes post the latest WGD event. The conserved synteny blocks of bmp5s revealed the genome rearrangement of common carp post the 4R WGD. The whole set of bmp gene family in common carp provides insight into gene fate of tetraploidized common carp genome post recent WGD. Copyright © 2017. Published by Elsevier B.V.

  15. Cloning and characterization of two duplicated interleukin-17A/F2 genes in common carp (Cyprinus carpio L.): Transcripts expression and bioactivity of recombinant IL-17A/F2.

    PubMed

    Li, Hongxia; Yu, Juhua; Li, Jianlin; Tang, Yongkai; Yu, Fan; Zhou, Jie; Yu, Wenjuan

    2016-04-01

    Interleukin-17 (IL-17) plays an important role in inflammation and host defense in mammals. In this study, we identified two duplicated IL-17A/F2 genes in the common carp (Cyprinus carpio) (ccIL-17A/F2a and ccIL-17A/F2b), putative encoded proteins contain 140 amino acids (aa) with conserved IL-17 family motifs. Expression analysis revealed high constitutive expression of ccIL-17A/F2s in mucosal tissues, including gill, skin and intestine, their expression could be induced by Aeromonas hydrophila, suggesting a potential role in mucosal immunity. Recombinant ccIL-17A/F2a protein (rccIL-17A/F2a) produced in Escherichia coli could induce the expression of proinflammatory cytokines (IL-1β) and the antimicrobial peptides S100A1, S100A10a and S100A10b in the primary kidney in a dose- and time-dependent manner. Above findings suggest that ccIL-17A/F2 plays an important role in both proinflammatory and innate immunity. Two duplicated ccIL-17A/F2s showed different expression level with ccIL-17A/F2a higher than b, comparison of two 5' regulatory regions indicated the length from anticipated promoter to transcriptional start site (TSS) and putative transcription factor binding site (TFBS) were different. Promoter activity of ccIL-17A/F2a was 2.5 times of ccIL-17A/F2b which consistent with expression results of two genes. These suggest mutations in 5'regulatory region contributed to the differentiation of duplicated genes. To our knowledge, this is the first report to analyze 5'regulatory region of piscine IL-17 family genes. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Adaptive expansion of the maize maternally expressed gene (Meg) family involves changes in expression patterns and protein secondary structures of its members

    PubMed Central

    2014-01-01

    Background The Maternally expressed gene (Meg) family is a locally-duplicated gene family of maize which encodes cysteine-rich proteins (CRPs). The founding member of the family, Meg1, is required for normal development of the basal endosperm transfer cell layer (BETL) and is involved in the allocation of maternal nutrients to growing seeds. Despite the important roles of Meg1 in maize seed development, the evolutionary history of the Meg cluster and the activities of the duplicate genes are not understood. Results In maize, the Meg gene cluster resides in a 2.3 Mb-long genomic region that exhibits many features of non-centromeric heterochromatin. Using phylogenetic reconstruction and syntenic alignments, we identified the pedigree of the Meg family, in which 11 of its 13 members arose in maize after allotetraploidization ~4.8 mya. Phylogenetic and population-genetic analyses identified possible signatures suggesting recent positive selection in Meg homologs. Structural analyses of the Meg proteins indicated potentially adaptive changes in secondary structure from α-helix to β-strand during the expansion. Transcriptomic analysis of the maize endosperm indicated that 6 Meg genes are selectively activated in the BETL, and younger Meg genes are more active than older ones. In endosperms from B73 by Mo17 reciprocal crosses, most Meg genes did not display parent-specific expression patterns. Conclusions Recently-duplicated Meg genes have different protein secondary structures, and their expressions in the BETL dominate over those of older members. Together with the signs of positive selections in the young Meg genes, these results suggest that the expansion of the Meg family involves potentially adaptive transitions in which new members with novel functions prevailed over older members. PMID:25084677

  17. Characterization and expression of the ABC family (G group) in 'Dangshansuli' pear (Pyrus bretschneideri Rehd.) and its russet mutant.

    PubMed

    Hou, Zhaoqi; Jia, Bing; Li, Fei; Liu, Pu; Liu, Li; Ye, Zhenfeng; Zhu, Liwu; Wang, Qi; Heng, Wei

    2018-01-01

    The plant genes encoding ABCGs that have been identified to date play a role in suberin formation in response to abiotic and biotic stress. In the present study, 80 ABCG genes were identified in 'Dangshansuli' Chinese white pear and designated as PbABCGs. Based on the structural characteristics and phylogenetic analysis, the PbABCG family genes could be classified into seven main groups: classes A-G. Segmental and dispersed duplications were the primary forces underlying the PbABCG gene family expansion in 'Dangshansuli' pear. Most of the PbABCG duplicated gene pairs date to the recent whole-genome duplication that occurred 30~45 million years ago. Purifying selection has also played a critical role in the evolution of the ABCG genes. Ten PbABCG genes screened in the transcriptome of 'Dangshansuli' pear and its russet mutant 'Xiusu' were validated, and the expression levels of the PbABCG genes exhibited significant differences at different stages. The results presented here will undoubtedly be useful for better understanding of the complexity of the PbABCG gene family and will facilitate the functional characterization of suberin formation in the russet mutant.

  18. Sequencing of Pax6 Loci from the Elephant Shark Reveals a Family of Pax6 Genes in Vertebrate Genomes, Forged by Ancient Duplications and Divergences

    PubMed Central

    Gautier, Philippe; Loosli, Felix; Tay, Boon-Hui; Tay, Alice; Murdoch, Emma; Coutinho, Pedro; van Heyningen, Veronica; Brenner, Sydney; Venkatesh, Byrappa; Kleinjan, Dirk A.

    2013-01-01

    Pax6 is a developmental control gene essential for eye development throughout the animal kingdom. In addition, Pax6 plays key roles in other parts of the CNS, olfactory system, and pancreas. In mammals a single Pax6 gene encoding multiple isoforms delivers these pleiotropic functions. Here we provide evidence that the genomes of many other vertebrate species contain multiple Pax6 loci. We sequenced Pax6-containing BACs from the cartilaginous elephant shark (Callorhinchus milii) and found two distinct Pax6 loci. Pax6.1 is highly similar to mammalian Pax6, while Pax6.2 encodes a paired-less Pax6. Using synteny relationships, we identify homologs of this novel paired-less Pax6.2 gene in lizard and in frog, as well as in zebrafish and in other teleosts. In zebrafish two full-length Pax6 duplicates were known previously, originating from the fish-specific genome duplication (FSGD) and expressed in divergent patterns due to paralog-specific loss of cis-elements. We show that teleosts other than zebrafish also maintain duplicate full-length Pax6 loci, but differences in gene and regulatory domain structure suggest that these Pax6 paralogs originate from a more ancient duplication event and are hence renamed as Pax6.3. Sequence comparisons between mammalian and elephant shark Pax6.1 loci highlight the presence of short- and long-range conserved noncoding elements (CNEs). Functional analysis demonstrates the ancient role of long-range enhancers for Pax6 transcription. We show that the paired-less Pax6.2 ortholog in zebrafish is expressed specifically in the developing retina. Transgenic analysis of elephant shark and zebrafish Pax6.2 CNEs with homology to the mouse NRE/Pα internal promoter revealed highly specific retinal expression. Finally, morpholino depletion of zebrafish Pax6.2 resulted in a “small eye” phenotype, supporting a role in retinal development. In summary, our study reveals that the pleiotropic functions of Pax6 in vertebrates are served by a divergent family of Pax6 genes, forged by ancient duplication events and by independent, lineage-specific gene losses. PMID:23359656

  19. Duplicated Enhancer Region Increases Expression of CTSB and Segregates with Keratolytic Winter Erythema in South African and Norwegian Families.

    PubMed

    Ngcungcu, Thandiswa; Oti, Martin; Sitek, Jan C; Haukanes, Bjørn I; Linghu, Bolan; Bruccoleri, Robert; Stokowy, Tomasz; Oakeley, Edward J; Yang, Fan; Zhu, Jiang; Sultan, Marc; Schalkwijk, Joost; van Vlijmen-Willems, Ivonne M J J; von der Lippe, Charlotte; Brunner, Han G; Ersland, Kari M; Grayson, Wayne; Buechmann-Moller, Stine; Sundnes, Olav; Nirmala, Nanguneri; Morgan, Thomas M; van Bokhoven, Hans; Steen, Vidar M; Hull, Peter R; Szustakowski, Joseph; Staedtler, Frank; Zhou, Huiqing; Fiskerstrand, Torunn; Ramsay, Michele

    2017-05-04

    Keratolytic winter erythema (KWE) is a rare autosomal-dominant skin disorder characterized by recurrent episodes of palmoplantar erythema and epidermal peeling. KWE was previously mapped to 8p23.1-p22 (KWE critical region) in South African families. Using targeted resequencing of the KWE critical region in five South African families and SNP array and whole-genome sequencing in two Norwegian families, we identified two overlapping tandem duplications of 7.67 kb (South Africans) and 15.93 kb (Norwegians). The duplications segregated with the disease and were located upstream of CTSB, a gene encoding cathepsin B, a cysteine protease involved in keratinocyte homeostasis. Included in the 2.62 kb overlapping region of these duplications is an enhancer element that is active in epidermal keratinocytes. The activity of this enhancer correlated with CTSB expression in normal differentiating keratinocytes and other cell lines, but not with FDFT1 or NEIL2 expression. Gene expression (qPCR) analysis and immunohistochemistry of the palmar epidermis demonstrated significantly increased expression of CTSB, as well as stronger staining of cathepsin B in the stratum granulosum of affected individuals than in that of control individuals. Analysis of higher-order chromatin structure data and RNA polymerase II ChIA-PET data from MCF-7 cells did not suggest remote effects of the enhancer. In conclusion, KWE in South African and Norwegian families is caused by tandem duplications in a non-coding genomic region containing an active enhancer element for CTSB, resulting in upregulation of this gene in affected individuals. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  20. Genome-Wide Distribution, Organisation and Functional Characterization of Disease Resistance and Defence Response Genes across Rice Species

    PubMed Central

    Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj

    2015-01-01

    The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R -genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyanth a genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species. PMID:25902056

  1. Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon.

    PubMed

    Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek

    2017-08-01

    While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  2. Ancestral genomic duplication of the insulin gene in tilapia: An analysis of possible implications for clinical islet xenotransplantation using donor islets from transgenic tilapia expressing a humanized insulin gene.

    PubMed

    Hrytsenko, Olga; Pohajdak, Bill; Wright, James R

    2016-07-03

    Tilapia, a teleost fish, have multiple large anatomically discrete islets which are easy to harvest, and when transplanted into diabetic murine recipients, provide normoglycemia and mammalian-like glucose tolerance profiles. Tilapia insulin differs structurally from human insulin which could preclude their use as islet donors for xenotransplantation. Therefore, we produced transgenic tilapia with islets expressing a humanized insulin gene. It is now known that fish genomes may possess an ancestral duplication and so tilapia may have a second insulin gene. Therefore, we cloned, sequenced, and characterized the tilapia insulin 2 transcript and found that its expression is negligible in islets, is not islet-specific, and would not likely need to be silenced in our transgenic fish.

  3. Ancestral genomic duplication of the insulin gene in tilapia: An analysis of possible implications for clinical islet xenotransplantation using donor islets from transgenic tilapia expressing a humanized insulin gene

    PubMed Central

    Hrytsenko, Olga; Pohajdak, Bill; Wright, James R.

    2016-01-01

    ABSTRACT Tilapia, a teleost fish, have multiple large anatomically discrete islets which are easy to harvest, and when transplanted into diabetic murine recipients, provide normoglycemia and mammalian-like glucose tolerance profiles. Tilapia insulin differs structurally from human insulin which could preclude their use as islet donors for xenotransplantation. Therefore, we produced transgenic tilapia with islets expressing a humanized insulin gene. It is now known that fish genomes may possess an ancestral duplication and so tilapia may have a second insulin gene. Therefore, we cloned, sequenced, and characterized the tilapia insulin 2 transcript and found that its expression is negligible in islets, is not islet-specific, and would not likely need to be silenced in our transgenic fish. PMID:27222321

  4. Expression of HOXB genes is significantly different in acute myeloid leukemia with a partial tandem duplication of MLL vs. a MLL translocation: a cross-laboratory study.

    PubMed

    Liu, Hsi-Che; Shih, Lee-Yung; May Chen, Mei-Ju; Wang, Chien-Chih; Yeh, Ting-Chi; Lin, Tung-Huei; Chen, Chien-Yu; Lin, Chih-Jen; Liang, Der-Cherng

    2011-05-01

    In acute myeloid leukemia (AML), the mixed lineage leukemia (MLL) gene may be rearranged to generate a partial tandem duplication (PTD), or fused to partner genes through a chromosomal translocation (tMLL). In this study, we first explored the differentially expressed genes between MLL-PTD and tMLL using gene expression profiling of our cohort (15 MLL-PTD and 10 tMLL) and one published data set. The top 250 probes were chosen from each set, resulting in 29 common probes (21 unique genes) to both sets. The selected genes include four HOXB genes, HOXB2, B3, B5, and B6. The expression values of these HOXB genes significantly differ between MLL-PTD and tMLL cases. Clustering and classification analyses were thoroughly conducted to support our gene selection results. Second, as MLL-PTD, FLT3-ITD, and NPM1 mutations are identified in AML with normal karyotypes, we briefly studied their impact on the HOXB genes. Another contribution of this study is to demonstrate that using public data from other studies enriches samples for analysis and yields more conclusive results. 2011 Elsevier Inc. All rights reserved.

  5. Genomic analysis reveals extensive gene duplication within the bovine TRB locus

    PubMed Central

    Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan

    2009-01-01

    Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068

  6. Exonic duplication CNV of NDRG1 associated with autosomal-recessive HMSN-Lom/CMT4D.

    PubMed

    Okamoto, Yuji; Goksungur, Meryem Tuba; Pehlivan, Davut; Beck, Christine R; Gonzaga-Jauregui, Claudia; Muzny, Donna M; Atik, Mehmed M; Carvalho, Claudia M B; Matur, Zeliha; Bayraktar, Serife; Boone, Philip M; Akyuz, Kaya; Gibbs, Richard A; Battaloglu, Esra; Parman, Yesim; Lupski, James R

    2014-05-01

    Copy-number variations as a mutational mechanism contribute significantly to human disease. Approximately one-half of the patients with Charcot-Marie-Tooth (CMT) disease have a 1.4 Mb duplication copy-number variation as the cause of their neuropathy. However, non-CMT1A neuropathy patients rarely have causative copy-number variations, and to date, autosomal-recessive disease has not been associated with copy-number variation as a mutational mechanism. We performed Agilent 8 × 60 K array comparative genomic hybridization on DNA from 12 recessive Turkish families with CMT disease. Additional molecular studies were conducted to detect breakpoint junctions and to evaluate gene expression levels in a family in which we detected an intragenic duplication copy-number variation. We detected an ~6.25 kb homozygous intragenic duplication in NDRG1, a gene known to be causative for recessive HMSNL/CMT4D, in three individuals from a Turkish family with CMT neuropathy. Further studies showed that this intragenic copy-number variation resulted in a homozygous duplication of exons 6-8 that caused decreased mRNA expression of NDRG1. Exon-focused high-resolution array comparative genomic hybridization enables the detection of copy-number variation carrier states in recessive genes, particularly small copy-number variations encompassing or disrupting single genes. In families for whom a molecular diagnosis has not been elucidated by conventional clinical assays, an assessment for copy-number variations in known CMT genes might be considered.

  7. Medicago truncatula contains a second gene encoding a plastid located glutamine synthetase exclusively expressed in developing seeds.

    PubMed

    Seabra, Ana R; Vieira, Cristina P; Cullimore, Julie V; Carvalho, Helena G

    2010-08-19

    Nitrogen is a crucial nutrient that is both essential and rate limiting for plant growth and seed production. Glutamine synthetase (GS), occupies a central position in nitrogen assimilation and recycling, justifying the extensive number of studies that have been dedicated to this enzyme from several plant sources. All plants species studied to date have been reported as containing a single, nuclear gene encoding a plastid located GS isoenzyme per haploid genome. This study reports the existence of a second nuclear gene encoding a plastid located GS in Medicago truncatula. This study characterizes a new, second gene encoding a plastid located glutamine synthetase (GS2) in M. truncatula. The gene encodes a functional GS isoenzyme with unique kinetic properties, which is exclusively expressed in developing seeds. Based on molecular data and the assumption of a molecular clock, it is estimated that the gene arose from a duplication event that occurred about 10 My ago, after legume speciation and that duplicated sequences are also present in closely related species of the Vicioide subclade. Expression analysis by RT-PCR and western blot indicate that the gene is exclusively expressed in developing seeds and its expression is related to seed filling, suggesting a specific function of the enzyme associated to legume seed metabolism. Interestingly, the gene was found to be subjected to alternative splicing over the first intron, leading to the formation of two transcripts with similar open reading frames but varying 5' UTR lengths, due to retention of the first intron. To our knowledge, this is the first report of alternative splicing on a plant GS gene. This study shows that Medicago truncatula contains an additional GS gene encoding a plastid located isoenzyme, which is functional and exclusively expressed during seed development. Legumes produce protein-rich seeds requiring high amounts of nitrogen, we postulate that this gene duplication represents a functional innovation of plastid located GS related to storage protein accumulation exclusive to legume seed metabolism.

  8. Evolution and functional divergence of NLRP genes in mammalian reproductive systems

    PubMed Central

    2009-01-01

    Background NLRPs (Nucleotide-binding oligomerization domain, Leucine rich Repeat and Pyrin domain containing Proteins) are members of NLR (Nod-like receptors) protein family. Recent researches have shown that NLRP genes play important roles in both mammalian innate immune system and reproductive system. Several of NLRP genes were shown to be specifically expressed in the oocyte in mammals. The aim of the present work was to study how these genes evolved and diverged after their duplication, as well as whether natural selection played a role during their evolution. Results By using in silico methods, we have evaluated the evolution and functional divergence of NLRP genes, in particular of mouse reproduction-related Nlrp genes. We found that (1) major NLRP genes have been duplicated before the divergence of mammals, with certain lineage-specific duplications in primates (NLRP7 and 11) and in rodents (Nlrp1, 4 and 9 duplicates); (2) tandem duplication events gave rise to a mammalian reproduction-related NLRP cluster including NLRP2, 4, 5, 7, 8, 9, 11, 13 and 14 genes; (3) the function of mammalian oocyte-specific NLRP genes (NLRP4, 5, 9 and 14) might have diverged during gene evolution; (4) recent segmental duplications concerning Nlrp4 copies and vomeronasal 1 receptor encoding genes (V1r) have been undertaken in the mouse; and (5) duplicates of Nlrp4 and 9 in the mouse might have been subjected to adaptive evolution. Conclusion In conclusion, this study brings us novel information on the evolution of mammalian reproduction-related NLRPs. On the one hand, NLRP genes duplicated and functionally diversified in mammalian reproductive systems (such as NLRP4, 5, 9 and 14). On the other hand, during evolution, different lineages adapted to develop their own NLRP genes, particularly in reproductive function (such as the specific expansion of Nlrp4 and Nlrp9 in the mouse). PMID:19682372

  9. Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species

    PubMed Central

    Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping

    2018-01-01

    The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice (Oryza sativa), maize (Zea mays), and Arabidopsis (Arabidopsis thaliana). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis, respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis, respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis. A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis, respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus, and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis. PMID:29690608

  10. Evolutionary Expansion of WRKY Gene Family in Banana and Its Expression Profile during the Infection of Root Lesion Nematode, Pratylenchus coffeae.

    PubMed

    Kaliyappan, Raja; Viswanathan, Sriram; Suthanthiram, Backiyarani; Subbaraya, Uma; Marimuthu Somasundram, Saraswathi; Muthu, Mayilvaganan

    2016-01-01

    The WRKY family of transcription factors orchestrate the reprogrammed expression of the complex network of defense genes at various biotic and abiotic stresses. Within the last 96 million years, three rounds of Musa polyploidization events had occurred from selective pressure causing duplication of MusaWRKYs with new activities. Here, we identified a total of 153 WRKY transcription factors available from the DH Pahang genome. Based on their phylogenetic relationship, the MusaWRKYs available with complete gene sequence were classified into the seven common WRKY sub-groups. Synteny analyses data revealed paralogous relationships, with 17 MusaWRKY gene pairs originating from the duplication events that had occurred within the Musa lineage. We also found 15 other MusaWRKY gene pairs originating from much older duplication events that had occurred along Arecales and Poales lineage of commelinids. Based on the synonymous and nonsynonymous substitution rates, the fate of duplicated MusaWRKY genes was predicted to have undergone sub-functionalization in which the duplicated gene copies retain a subset of the ancestral gene function. Also, to understand the regulatory roles of MusaWRKY during a biotic stress, Illumina sequencing was performed on resistant and susceptible cultivars during the infection of root lesion nematode, Pratylenchus coffeae. The differential WRKY gene expression analysis in nematode resistant and susceptible cultivars during challenged and unchallenged conditions had distinguished: 1) MusaWRKYs participating in general banana defense mechanism against P.coffeae common to both susceptible and resistant cultivars, 2) MusaWRKYs that may aid in the pathogen survival as suppressors of plant triggered immunity, 3) MusaWRKYs that may aid in the host defense as activators of plant triggered immunity and 4) cultivar specific MusaWRKY regulation. Mainly, MusaWRKY52, -69 and -92 are found to be P.coffeae specific and can act as activators or repressors in a defense pathway. Overall, this preliminary study in Musa provides the basis for understanding the evolution and regulatory mechanism of MusaWRKY during nematode stress.

  11. Evolutionary Expansion of WRKY Gene Family in Banana and Its Expression Profile during the Infection of Root Lesion Nematode, Pratylenchus coffeae

    PubMed Central

    Suthanthiram, Backiyarani; Subbaraya, Uma; Marimuthu Somasundram, Saraswathi; Muthu, Mayilvaganan

    2016-01-01

    The WRKY family of transcription factors orchestrate the reprogrammed expression of the complex network of defense genes at various biotic and abiotic stresses. Within the last 96 million years, three rounds of Musa polyploidization events had occurred from selective pressure causing duplication of MusaWRKYs with new activities. Here, we identified a total of 153 WRKY transcription factors available from the DH Pahang genome. Based on their phylogenetic relationship, the MusaWRKYs available with complete gene sequence were classified into the seven common WRKY sub-groups. Synteny analyses data revealed paralogous relationships, with 17 MusaWRKY gene pairs originating from the duplication events that had occurred within the Musa lineage. We also found 15 other MusaWRKY gene pairs originating from much older duplication events that had occurred along Arecales and Poales lineage of commelinids. Based on the synonymous and nonsynonymous substitution rates, the fate of duplicated MusaWRKY genes was predicted to have undergone sub-functionalization in which the duplicated gene copies retain a subset of the ancestral gene function. Also, to understand the regulatory roles of MusaWRKY during a biotic stress, Illumina sequencing was performed on resistant and susceptible cultivars during the infection of root lesion nematode, Pratylenchus coffeae. The differential WRKY gene expression analysis in nematode resistant and susceptible cultivars during challenged and unchallenged conditions had distinguished: 1) MusaWRKYs participating in general banana defense mechanism against P.coffeae common to both susceptible and resistant cultivars, 2) MusaWRKYs that may aid in the pathogen survival as suppressors of plant triggered immunity, 3) MusaWRKYs that may aid in the host defense as activators of plant triggered immunity and 4) cultivar specific MusaWRKY regulation. Mainly, MusaWRKY52, -69 and -92 are found to be P.coffeae specific and can act as activators or repressors in a defense pathway. Overall, this preliminary study in Musa provides the basis for understanding the evolution and regulatory mechanism of MusaWRKY during nematode stress. PMID:27603787

  12. Early stages of functional diversification in the Rab GTPase gene family revealed by genomic and localization studies in Paramecium species.

    PubMed

    Bright, Lydia J; Gout, Jean-Francois; Lynch, Michael

    2017-04-15

    New gene functions arise within existing gene families as a result of gene duplication and subsequent diversification. To gain insight into the steps that led to the functional diversification of paralogues, we tracked duplicate retention patterns, expression-level divergence, and subcellular markers of functional diversification in the Rab GTPase gene family in three Paramecium aurelia species. After whole-genome duplication, Rab GTPase duplicates are more highly retained than other genes in the genome but appear to be diverging more rapidly in expression levels, consistent with early steps in functional diversification. However, by localizing specific Rab proteins in Paramecium cells, we found that paralogues from the two most recent whole-genome duplications had virtually identical localization patterns, and that less closely related paralogues showed evidence of both conservation and diversification. The functionally conserved paralogues appear to target to compartments associated with both endocytic and phagocytic recycling functions, confirming evolutionary and functional links between the two pathways in a divergent eukaryotic lineage. Because the functionally diversifying paralogues are still closely related to and derived from a clade of functionally conserved Rab11 genes, we were able to pinpoint three specific amino acid residues that may be driving the change in the localization and thus the function in these proteins. © 2017 Bright et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  13. The homeologous Zea mays gigantea genes: characterization of expression and novel mutant alleles

    USDA-ARS?s Scientific Manuscript database

    The two homeologous Zea mays gigantea (gi) genes, gi1 and gi2, arose from the last genome duplication event in the maize lineage. Homologs of these genes in other species are required for correct circadian rhythms and proper regulation of growth and development. Here we characterized the expression ...

  14. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii.

    PubMed

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong

    2014-10-16

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.

  15. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii

    PubMed Central

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong

    2014-01-01

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260

  16. Genome-wide identification of the potato WRKY transcription factor family.

    PubMed

    Zhang, Chao; Wang, Dongdong; Yang, Chenghui; Kong, Nana; Shi, Zheng; Zhao, Peng; Nan, Yunyou; Nie, Tengkun; Wang, Ruoqiu; Ma, Haoli; Chen, Qin

    2017-01-01

    WRKY transcription factors play pivotal roles in regulation of stress responses. This study identified 79 WRKY genes in potato (Solanum tuberosum). Based on multiple sequence alignment and phylogenetic relationships, WRKY genes were classified into three major groups. The majority of WRKY genes belonged to Group II (52 StWRKYs), Group III had 14 and Group I consisted of 13. The phylogenetic tree further classified Group II into five sub-groups. All StWRKY genes except StWRKY79 were mapped on potato chromosomes, with eight tandem duplication gene pairs and seven segmental duplication gene pairs found from StWRKY family genes. The expression analysis of 22 StWRKYs showed their differential expression levels under various stress conditions. Cis-element prediction showed that a large number of elements related to drought, heat and salicylic acid were present in the promotor regions of StWRKY genes. The expression analysis indicated that seven StWRKYs seemed to respond to stress (heat, drought and salinity) and salicylic acid treatment. These genes are candidates for abiotic stress signaling for further research.

  17. Genome-wide identification of the potato WRKY transcription factor family

    PubMed Central

    Kong, Nana; Shi, Zheng; Zhao, Peng; Nan, Yunyou; Nie, Tengkun; Wang, Ruoqiu; Ma, Haoli

    2017-01-01

    WRKY transcription factors play pivotal roles in regulation of stress responses. This study identified 79 WRKY genes in potato (Solanum tuberosum). Based on multiple sequence alignment and phylogenetic relationships, WRKY genes were classified into three major groups. The majority of WRKY genes belonged to Group II (52 StWRKYs), Group III had 14 and Group I consisted of 13. The phylogenetic tree further classified Group II into five sub-groups. All StWRKY genes except StWRKY79 were mapped on potato chromosomes, with eight tandem duplication gene pairs and seven segmental duplication gene pairs found from StWRKY family genes. The expression analysis of 22 StWRKYs showed their differential expression levels under various stress conditions. Cis-element prediction showed that a large number of elements related to drought, heat and salicylic acid were present in the promotor regions of StWRKY genes. The expression analysis indicated that seven StWRKYs seemed to respond to stress (heat, drought and salinity) and salicylic acid treatment. These genes are candidates for abiotic stress signaling for further research. PMID:28727761

  18. A novel X-linked disorder with developmental delay and autistic features.

    PubMed

    Kaya, Namik; Colak, Dilek; Albakheet, Albandary; Al-Owain, Mohammad; Abu-Dheim, Nada; Al-Younes, Banan; Al-Zahrani, Jawaher; Mukaddes, Nahit M; Dervent, Aysin; Al-Dosari, Naji; Al-Odaib, Ali; Kayaalp, Inci V; Al-Sayed, Moeenaladin; Al-Hassnan, Zuhair; Nester, Michael J; Al-Dosari, Mohammad; Al-Dhalaan, Hesham; Chedrawi, Aziza; Gunoz, Hulya; Karakas, Bedri; Sakati, Nadia; Alkuraya, Fowzan S; Gascon, Generaso G; Ozand, Pinar T

    2012-04-01

    Genomic duplications that lead to autism and other human diseases are interesting pathological lesions since the underlying mechanism almost certainly involves dosage sensitive genes. We aim to understand a novel genomic disorder with profound phenotypic consequences, most notably global developmental delay, autism, psychosis, and anorexia nervosa. We evaluated the affected individuals, all maternally related, using childhood autism rating scale (CARS) and Vineland Adaptive scales, magnetic resonance imaging (MRI) and magnetic resonance spectroscopy (MRS) brain, electroencephalography (EEG), electromyography (EMG), muscle biopsy, high-resolution molecular karyotype arrays, Giemsa banding (G-banding) and fluorescent in situ hybridization (FISH) experiments, mitochondrial DNA (mtDNA) sequencing, X-chromosome inactivation study, global gene expression analysis on Epstein-Barr virus (EBV)-transformed lymphoblasts, and quantitative reverse-transcription polymerase chain reaction (qRT-PCR). We have identified a novel Xq12-q13.3 duplication in an extended family. Clinically normal mothers were completely skewed in favor of the normal chromosome X. Global transcriptional profiling of affected individuals and controls revealed significant alterations of genes and pathways in a pattern consistent with previous microarray studies of autism spectrum disorder patients. Moreover, expression analysis revealed copy number-dependent increased messenger RNA (mRNA) levels in affected patients compared to control individuals. A subset of differentially expressed genes was validated using qRT-PCR. Xq12-q13.3 duplication is a novel global developmental delay and autism-predisposing chromosomal aberration; pathogenesis of which may be mediated by increased dosage of genes contained in the duplication, including NLGN3, OPHN1, AR, EFNB1, TAF1, GJB1, and MED12. Copyright © 2011 American Neurological Association.

  19. Differential retention of metabolic genes following whole-genome duplication.

    PubMed

    Gout, Jean-François; Duret, Laurent; Kahn, Daniel

    2009-05-01

    Classical studies in Metabolic Control Theory have shown that metabolic fluxes usually exhibit little sensitivity to changes in individual enzyme activity, yet remain sensitive to global changes of all enzymes in a pathway. Therefore, little selective pressure is expected on the dosage or expression of individual metabolic genes, yet entire pathways should still be constrained. However, a direct estimate of this selective pressure had not been evaluated. Whole-genome duplications (WGDs) offer a good opportunity to address this question by analyzing the fates of metabolic genes during the massive gene losses that follow. Here, we take advantage of the successive rounds of WGD that occurred in the Paramecium lineage. We show that metabolic genes exhibit different gene retention patterns than nonmetabolic genes. Contrary to what was expected for individual genes, metabolic genes appeared more retained than other genes after the recent WGD, which was best explained by selection for gene expression operating on entire pathways. Metabolic genes also tend to be less retained when present at high copy number before WGD, contrary to other genes that show a positive correlation between gene retention and preduplication copy number. This is rationalized on the basis of the classical concave relationship relating metabolic fluxes with enzyme expression.

  20. The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups

    PubMed Central

    2013-01-01

    Background WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. Results We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. Conclusions In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have different evolutionary rates. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. PMID:24088323

  1. Case report of individual with cutaneous immunodeficiency and novel 1p36 duplication.

    PubMed

    Hatter, Alyn D; Soler, David C; Curtis, Christine; Cooper, Kevin D; McCormick, Thomas S

    2016-01-01

    Crusted or Norwegian scabies is an infectious skin dermatopathology usually associated with an underlying immunodeficiency condition. It is caused when the mite Sarcoptes scabiei infects the skin, and the immune system is unable to control its spread, leading to a massive hyperinfestation with a simultaneous inflammatory and hyperkeratotic reaction. This is the first report of a novel 1p36 duplication associated with a recurrent infection of crusted scabies. We describe a 34-year-old patient with a cutaneous immunodeficiency characterized by recurrent crusted scabies infestation, diffuse tinea, and recurrent staphylococcal cellulitis, who we suspected had an undiagnosed syndrome. The patient also suffered from mental retardation, renal failure, and premature senescence. A cytogenetic fluorescence in situ hybridization analysis revealed a 9.34 Mb duplication within the short (p) arm of chromosome 1, precisely from 1p36.11 to 1p36.21, with an adjacent 193 kb copy gain entirely within 1p36.11. In addition, chromosome 4 had a 906 kb gain in 4p16.1 and chromosome 9 had a 81 kb copy gain in 9p24.3. Over 100 genes localized within these duplicated regions. Gene expression array revealed 82 genes whose expression changed >1.5-fold compared to a healthy age-matched skin control, but among them only the lipolytic enzyme arylacetamide deacetylase-like 3 was found within the duplicated 1p36 region of chromosome 1. Although genetic duplications in the 1p36 region have been previously described, our report describes a novel duplicative variant within the 1p36 region. The patient did not have a past history of immunosuppression but was afflicted by a recurrent case of crusted scabies, raising the possibility that the recurrent infection was associated with the 1p36 genetic duplication. To our knowledge, the specific duplicated sequence between 1p36.11 and p36.21 found in our patient has never been previously reported. We reviewed and compared the clinical, genotyping, and gene microarray results of our patient in order to characterize this novel 1p36 duplication syndrome, which might have contributed to the recurrent scabies infection in this patient.

  2. Life-stage-associated remodelling of lipid metabolism regulation in Atlantic salmon.

    PubMed

    Gillard, Gareth; Harvey, Thomas N; Gjuvsland, Arne; Jin, Yang; Thomassen, Magny; Lien, Sigbjørn; Leaver, Michael; Torgersen, Jacob S; Hvidsten, Torgeir R; Vik, Jon Olav; Sandve, Simen R

    2018-03-01

    Atlantic salmon migrates from rivers to sea to feed, grow and develop gonads before returning to spawn in freshwater. The transition to marine habitats is associated with dramatic changes in the environment, including water salinity, exposure to pathogens and shift in dietary lipid availability. Many changes in physiology and metabolism occur across this life-stage transition, but little is known about the molecular nature of these changes. Here, we use a long-term feeding experiment to study transcriptional regulation of lipid metabolism in Atlantic salmon gut and liver in both fresh- and saltwater. We find that lipid metabolism becomes significantly less plastic to differences in dietary lipid composition when salmon transitions to saltwater and experiences increased dietary lipid availability. Expression of genes in liver relating to lipogenesis and lipid transport decreases overall and becomes less responsive to diet, while genes for lipid uptake in gut become more highly expressed. Finally, analyses of evolutionary consequences of the salmonid-specific whole-genome duplication on lipid metabolism reveal several pathways with significantly different (p < .05) duplicate retention or duplicate regulatory conservation. We also find a limited number of cases where the whole-genome duplication has resulted in an increased gene dosage. In conclusion, we find variable and pathway-specific effects of the salmonid genome duplication on lipid metabolism genes. A clear life-stage-associated shift in lipid metabolism regulation is evident, and we hypothesize this to be, at least partly, driven by nondietary factors such as the preparatory remodelling of gene regulation and physiology prior to sea migration. © 2018 John Wiley & Sons Ltd.

  3. Genome-Wide Analysis of NBS-LRR Genes in Sorghum Genome Revealed Several Events Contributing to NBS-LRR Gene Evolution in Grass Species

    PubMed Central

    Yang, Xiping; Wang, Jianping

    2016-01-01

    The nucleotide-binding site (NBS)–leucine-rich repeat (LRR) gene family is crucially important for offering resistance to pathogens. To explore evolutionary conservation and variability of NBS-LRR genes across grass species, we identified 88, 107, 24, and 44 full-length NBS-LRR genes in sorghum, rice, maize, and Brachypodium, respectively. A comprehensive analysis was performed on classification, genome organization, evolution, expression, and regulation of these NBS-LRR genes using sorghum as a representative of grass species. In general, the full-length NBS-LRR genes are highly clustered and duplicated in sorghum genome mainly due to local duplications. NBS-LRR genes have basal expression levels and are highly potentially targeted by miRNA. The number of NBS-LRR genes in the four grass species is positively correlated with the gene clustering rate. The results provided a valuable genomic resource and insights for functional and evolutionary studies of NBS-LRR genes in grass species. PMID:26792976

  4. Adaptation and evolution of deep-sea scale worms (Annelida: Polynoidae): insights from transcriptome comparison with a shallow-water species

    NASA Astrophysics Data System (ADS)

    Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M. Y.; Qian, Pei-Yuan; Qiu, Jian-Wen

    2017-04-01

    Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.).

  5. Adaptation and evolution of deep-sea scale worms (Annelida: Polynoidae): insights from transcriptome comparison with a shallow-water species

    PubMed Central

    Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M.Y.; Qian, Pei-Yuan; Qiu, Jian-Wen

    2017-01-01

    Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.). PMID:28397791

  6. A diffusion approach to approximating preservation probabilities for gene duplicates.

    PubMed

    O'Hely, Martin

    2006-08-01

    Consider a haploid population and, within its genome, a gene whose presence is vital for the survival of any individual. Each copy of this gene is subject to mutations which destroy its function. Suppose one member of the population somehow acquires a duplicate copy of the gene, where the duplicate is fully linked to the original gene's locus. Preservation is said to occur if eventually the entire population consists of individuals descended from this one which initially carried the duplicate. The system is modelled by a finite state-space Markov process which in turn is approximated by a diffusion process, whence an explicit expression for the probability of preservation is derived. The event of preservation can be compared to the fixation of a selectively neutral gene variant initially present in a single individual, the probability of which is the reciprocal of the population size. For very weak mutation, this and the probability of preservation are equal, while as mutation becomes stronger, the preservation probability tends to double this reciprocal. This is in excellent agreement with simulation studies.

  7. Genome specific PPARαB duplicates in salmonids and insights into estrogenic regulation in brown trout.

    PubMed

    Madureira, Tânia Vieira; Pinheiro, Ivone; de Paula Freire, Rafaelle; Rocha, Eduardo; Castro, Luis Filipe; Urbatzka, Ralph

    2017-06-01

    Peroxisome proliferator-activated receptors (PPARs) are key regulators of many processes in vertebrates, such as carbohydrate and lipid metabolism. PPARα, a member of the PPAR nuclear receptor gene subfamily (NR1C1), is involved in fatty acid metabolism, namely in peroxisomal β-oxidation. Two gene paralogues, pparαA and pparαB, were described in several teleost species with their origin dating back to the teleost-specific genome duplication (3R). Given the additional salmonid-specific genome duplication (4R), four genes could be theoretically anticipated for this gene subfamily. In this work, we examined the pparα gene repertoire in brown trout, Salmo trutta f. fario. Data disclosed two pparα-like sequences in brown trout. Phylogenetic analyses further revealed that the isolated genes are most likely genome pparαB duplicates, pparαBa and pparαBb, while pparαA is apparently absent in salmonids. Both genes showed a ubiquitous mRNA expression across a panel of 11 different organs. In vitro exposed primary brown trout hepatocytes strongly suggest that pparα gene paralogues are differently regulated by ethinylestradiol (EE2). PparαBb mRNA expression significantly decreased with dosage, reaching significance after exposure to 50μM EE2, while pparαBa mRNA increased, significant at 1μM EE2. The present data enhances the understanding of pparα function and evolution in teleost, and reinforces the evidence of a potential crosstalk between estrogenic and pparα signaling pathways. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Systematic Analysis of Sequences and Expression Patterns of Drought-Responsive Members of the HD-Zip Gene Family in Maize

    PubMed Central

    Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu

    2011-01-01

    Background Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). Methods and Findings In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Conclusions Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development. PMID:22164299

  9. Systematic analysis of sequences and expression patterns of drought-responsive members of the HD-Zip gene family in maize.

    PubMed

    Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu

    2011-01-01

    Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development.

  10. Evolutionary Patterns of RNA-Based Duplication in Non-Mammalian Chordates

    PubMed Central

    Li, Xin; Vibranovski, Maria D.; Gan, Xiaoni; Wang, Dengqiang; Wang, Wen; Long, Manyuan; He, Shunping

    2011-01-01

    The role of RNA-based duplication, or retroposition, in the evolution of new gene functions in mammals, plants, and Drosophila has been widely reported. However, little is known about RNA-based duplication in non-mammalian chordates. In this study, we screened ten non-mammalian chordate genomes for retrocopies and investigated their evolutionary patterns. We identified numerous retrocopies in these species. Examination of the age distribution of these retrocopies revealed no burst of young retrocopies in ancient chordate species. Upon comparing these non-mammalian chordate species to the mammalian species, we observed that a larger fraction of the non-mammalian retrocopies was under strong evolutionary constraints than mammalian retrocopies are, as evidenced by signals of purifying selection and expression profiles. For the Western clawed frog, Medaka, and Sea squirt, many retrogenes have evolved gonad and brain expression patterns, similar to what was observed in human. Testing of retrogene movement in the Medaka genome, where the nascent sex chrosomes have been well assembled, did not reveal any significant gene movement. Taken together, our analyses demonstrate that RNA-based duplication generates many functional genes and can make a significant contribution to the evolution of non-mammalian genomes. PMID:21779328

  11. Evolution and Expansion of the Prokaryote-Like Lipoxygenase Family in the Brown Alga Saccharina japonica

    PubMed Central

    Teng, Linhong; Han, Wentao; Fan, Xiao; Xu, Dong; Zhang, Xiaowen; Dittami, Simon M.; Ye, Naihao

    2017-01-01

    Lipoxygenase (LOX) plays important roles in fatty acid oxidation and lipid mediator biosynthesis. In this study, we give first insights into brown algal LOX evolution. Whole genome searches revealed four, three, and eleven LOXs in Ectocarpus siliculosus, Cladosiphon okamuranus, and Saccharina japonica, respectively. In phylogenetic analyses, LOXs from brown algae form a robust clade with those from prokaryotes, suggesting an ancestral origin and slow evolution. Brown algal LOXs were divided into two clades, C1 and C2 in a phylogenetic tree. Compared to the two species of Ectocarpales, LOX gene expansion occurred in the kelp S. japonica through tandem duplication and segmental duplication. Selection pressure analysis showed that LOX genes in brown algae have undergone strong purifying selection, while the selective constraint in the C2 clade was more relaxed than that in the C1 clade. Furthermore, within each clade, LOXs of S. japonica evolved under more relaxed selection constraints than E. siliculosus and C. okamuranus. Structural modeling showed that unlike LOXs of plants and animals, which contain a β barrel in the N-terminal part of the protein, LOXs in brown algae fold into a single domain. Analysis of previously published transcriptomic data showed that LOXs in E. siliculosus are responsive to hyposaline, hypersaline, oxidative, and copper stresses. Moreover, clear divergence of expression patterns was observed among different life stages, as well as between duplicate gene pairs. In E. siliculosus, all four LOXs are male-biased in immature gametophytes, and mature gametophytes showed significantly higher LOX mRNA levels than immature gametophytes and sporophytes. In S. japonica, however, our RNA-Seq data showed that most LOXs are highly expressed in sporophytes. Even the most recently duplicated gene pairs showed divergent expression patterns, suggesting that functional divergence has likely occurred since LOX genes duplicated, which potentially contributes to the production of various oxylipins in brown algae. PMID:29234336

  12. Genome-wide analysis of WRKY transcription factors in wheat (Triticum aestivum L.) and differential expression under water deficit condition.

    PubMed

    Ning, Pan; Liu, Congcong; Kang, Jingquan; Lv, Jinyin

    2017-01-01

    WRKY proteins, which comprise one of the largest transcription factor (TF) families in the plant kingdom, play crucial roles in plant development and stress responses. Despite several studies on WRKYs in wheat ( Triticum aestivum L.), functional annotation information about wheat WRKYs is limited. Here, 171 TaWRKY TFs were identified from the whole wheat genome and compared with proteins from 19 other species representing nine major plant lineages. A phylogenetic analysis, coupled with gene structure analysis and motif determination, divided these TaWRKYs into seven subgroups (Group I, IIa-e, and III). Chromosomal location showed that most TaWRKY genes were enriched on four chromosomes, especially on chromosome 3B. In addition, 85 (49.7%) genes were either tandem (5) or segmental duplication (80), which suggested that though tandem duplication has contributed to the expansion of TaWRKY family, segmental duplication probably played a more pivotal role. Analysis of cis -acting elements revealed putative functions of WRKYs in wheat during development as well as under numerous biotic and abiotic stresses. Finally, the expression of TaWRKY genes in flag leaves, glumes, and lemmas under water-deficit condition were analyzed. Results showed that different TaWRKY genes preferentially express in specific tissue during the grain-filling stage. Our results provide a more extensive insight on WRKY gene family in wheat, and also contribute to the screening of more candidate genes for further investigation on function characterization of WRKYs under various stresses.

  13. Soybean kinome: functional classification and gene expression patterns

    PubMed Central

    Liu, Jinyi; Chen, Nana; Grant, Joshua N.; Cheng, Zong-Ming (Max); Stewart, C. Neal; Hewezi, Tarek

    2015-01-01

    The protein kinase (PK) gene family is one of the largest and most highly conserved gene families in plants and plays a role in nearly all biological functions. While a large number of genes have been predicted to encode PKs in soybean, a comprehensive functional classification and global analysis of expression patterns of this large gene family is lacking. In this study, we identified the entire soybean PK repertoire or kinome, which comprised 2166 putative PK genes, representing 4.67% of all soybean protein-coding genes. The soybean kinome was classified into 19 groups, 81 families, and 122 subfamilies. The receptor-like kinase (RLK) group was remarkably large, containing 1418 genes. Collinearity analysis indicated that whole-genome segmental duplication events may have played a key role in the expansion of the soybean kinome, whereas tandem duplications might have contributed to the expansion of specific subfamilies. Gene structure, subcellular localization prediction, and gene expression patterns indicated extensive functional divergence of PK subfamilies. Global gene expression analysis of soybean PK subfamilies revealed tissue- and stress-specific expression patterns, implying regulatory functions over a wide range of developmental and physiological processes. In addition, tissue and stress co-expression network analysis uncovered specific subfamilies with narrow or wide interconnected relationships, indicative of their association with particular or broad signalling pathways, respectively. Taken together, our analyses provide a foundation for further functional studies to reveal the biological and molecular functions of PKs in soybean. PMID:25614662

  14. A second corticotropin-releasing hormone gene (CRH2) is conserved across vertebrate classes and expressed in the hindbrain of a basal neopterygian fish, the spotted gar (Lepisosteus oculatus).

    PubMed

    Grone, Brian P; Maruska, Karen P

    2015-05-01

    To investigate the origins of the vertebrate stress-response system, we searched sequenced vertebrate genomes for genes resembling corticotropin-releasing hormone (CRH). We found that vertebrate genomes possess, in addition to CRH, another gene that resembles CRH in sequence and syntenic environment. This paralogous gene was previously identified only in the elephant shark (a holocephalan), but we find it also in marsupials, monotremes, lizards, turtles, birds, and fishes. We examined the relationship of this second vertebrate CRH gene, which we name CRH2, to CRH1 (previously known as CRH) and urocortin1/urotensin1 (UCN1/UTS1) in primitive fishes, teleosts, and tetrapods. The paralogs CRH1 and CRH2 likely evolved via duplication of CRH during a whole-genome duplication early in the vertebrate lineage. CRH2 was subsequently lost in both teleost fishes and eutherian mammals but retained in other lineages. To determine where CRH2 is expressed relative to CRH1 and UTS1, we used in situ hybridization on brain tissue from spotted gar (Lepisosteus oculatus), a neopterygian fish closely related to teleosts. In situ hybridization revealed widespread distribution of both crh1 and uts1 in the brain. Expression of crh2 was restricted to the putative secondary gustatory/secondary visceral nucleus, which also expressed calcitonin-related polypeptide alpha (calca), a marker of parabrachial nucleus in mammals. Thus, the evolutionary history of CRH2 includes restricted expression in the brain, sequence changes, and gene loss, likely reflecting release of selective constraints following whole-genome duplication. The discovery of CRH2 opens many new possibilities for understanding the diverse functions of the CRH family of peptides across vertebrates. © 2015 Wiley Periodicals, Inc.

  15. Soybean (Glycine max) expansin gene superfamily origins: segmental and tandem duplication events followed by divergent selection among subfamilies

    PubMed Central

    2014-01-01

    Background Expansins are plant cell wall loosening proteins that are involved in cell enlargement and a variety of other developmental processes. The expansin superfamily contains four subfamilies; namely, α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB). Although the genome sequencing of soybeans is complete, our knowledge about the pattern of expansion and evolutionary history of soybean expansin genes remains limited. Results A total of 75 expansin genes were identified in the soybean genome, and grouped into four subfamilies based on their phylogenetic relationships. Structural analysis revealed that the expansin genes are conserved in each subfamily, but are divergent among subfamilies. Furthermore, in soybean and Arabidopsis, the expansin gene family has been mainly expanded through tandem and segmental duplications; however, in rice, segmental duplication appears to be the dominant process that generates this superfamily. The transcriptome atlas revealed notable differential expression in either transcript abundance or expression patterns under normal growth conditions. This finding was consistent with the differential distribution of the cis-elements in the promoter region, and indicated wide functional divergence in this superfamily. Moreover, some critical amino acids that contribute to functional divergence and positive selection were detected. Finally, site model and branch-site model analysis of positive selection indicated that the soybean expansin gene superfamily is under strong positive selection, and that divergent selection constraints might have influenced the evolution of the four subfamilies. Conclusion This study demonstrated that the soybean expansin gene superfamily has expanded through tandem and segmental duplication. Differential expression indicated wide functional divergence in this superfamily. Furthermore, positive selection analysis revealed that divergent selection constraints might have influenced the evolution of the four subfamilies. In conclusion, the results of this study contribute novel detailed information about the molecular evolution of the expansin gene superfamily in soybean. PMID:24720629

  16. Distinct cell-specific expression of homospermidine synthase involved in pyrrolizidine alkaloid biosynthesis in three species of the boraginales.

    PubMed

    Niemüller, Daniel; Reimann, Andreas; Ober, Dietrich

    2012-07-01

    Homospermidine synthase (HSS) is the first specific enzyme in pyrrolizidine alkaloid (PA) biosynthesis, a pathway involved in the plant's chemical defense. HSS has been shown to be recruited repeatedly by duplication of a gene involved in primary metabolism. Within the lineage of the Boraginales, only one gene duplication event gave rise to HSS. Here, we demonstrate that the tissue-specific expression of HSS in three boraginaceous species, Heliotropium indicum, Symphytum officinale, and Cynoglossum officinale, is unique with respect to plant organ, tissue, and cell type. Within H. indicum, HSS is expressed exclusively in nonspecialized cells of the lower epidermis of young leaves and shoots. In S. officinale, HSS expression has been detected in the cells of the root endodermis and in leaves directly underneath developing inflorescences. In young roots of C. officinale, HSS is detected only in cells of the endodermis, but in a later developmental stage, additionally in the pericycle. The individual expression patterns are compared with those within the Senecioneae lineage (Asteraceae), where HSS expression is reproducibly found in specific cells of the endodermis and the adjacent cortex parenchyma of the roots. The individual expression patterns within the Boraginales species are discussed as being a requirement for the successful recruitment of HSS after gene duplication. The diversity of HSS expression within this lineage adds a further facet to the already diverse patterns of expression that have been observed for HSS in other PA-producing plant lineages, making this PA-specific enzyme one of the most diverse expressed proteins described in the literature.

  17. Distinct Cell-Specific Expression of Homospermidine Synthase Involved in Pyrrolizidine Alkaloid Biosynthesis in Three Species of the Boraginales1[C][W][OA

    PubMed Central

    Niemüller, Daniel; Reimann, Andreas; Ober, Dietrich

    2012-01-01

    Homospermidine synthase (HSS) is the first specific enzyme in pyrrolizidine alkaloid (PA) biosynthesis, a pathway involved in the plant’s chemical defense. HSS has been shown to be recruited repeatedly by duplication of a gene involved in primary metabolism. Within the lineage of the Boraginales, only one gene duplication event gave rise to HSS. Here, we demonstrate that the tissue-specific expression of HSS in three boraginaceous species, Heliotropium indicum, Symphytum officinale, and Cynoglossum officinale, is unique with respect to plant organ, tissue, and cell type. Within H. indicum, HSS is expressed exclusively in nonspecialized cells of the lower epidermis of young leaves and shoots. In S. officinale, HSS expression has been detected in the cells of the root endodermis and in leaves directly underneath developing inflorescences. In young roots of C. officinale, HSS is detected only in cells of the endodermis, but in a later developmental stage, additionally in the pericycle. The individual expression patterns are compared with those within the Senecioneae lineage (Asteraceae), where HSS expression is reproducibly found in specific cells of the endodermis and the adjacent cortex parenchyma of the roots. The individual expression patterns within the Boraginales species are discussed as being a requirement for the successful recruitment of HSS after gene duplication. The diversity of HSS expression within this lineage adds a further facet to the already diverse patterns of expression that have been observed for HSS in other PA-producing plant lineages, making this PA-specific enzyme one of the most diverse expressed proteins described in the literature. PMID:22566491

  18. Expression analysis of genes encoding double B-box zinc finger proteins in maize.

    PubMed

    Li, Wenlan; Wang, Jingchao; Sun, Qi; Li, Wencai; Yu, Yanli; Zhao, Meng; Meng, Zhaodong

    2017-11-01

    The B-box proteins play key roles in plant development. The double B-box (DBB) family is one of the subfamily of the B-box family, with two B-box domains and without a CCT domain. In this study, 12 maize double B-box genes (ZmDBBs) were identified through a genome-wide survey. Phylogenetic analysis of DBB proteins from maize, rice, Sorghum bicolor, Arabidopsis, and poplar classified them into five major clades. Gene duplication analysis indicated that segmental duplications made a large contribution to the expansion of ZmDBBs. Furthermore, a large number of cis-acting regulatory elements related to plant development, response to light and phytohormone were identified in the promoter regions of the ZmDBB genes. The expression patterns of the ZmDBB genes in various tissues and different developmental stages demonstrated that ZmDBBs might play essential roles in plant development, and some ZmDBB genes might have unique function in specific developmental stages. In addition, several ZmDBB genes showed diurnal expression pattern. The expression levels of some ZmDBB genes changed significantly under light/dark treatment conditions and phytohormone treatments, implying that they might participate in light signaling pathway and hormone signaling. Our results will provide new information to better understand the complexity of the DBB gene family in maize.

  19. Evolution of the vertebrate Pax4/6 class of genes with focus on its novel member, the Pax10 gene.

    PubMed

    Feiner, Nathalie; Meyer, Axel; Kuraku, Shigehiro

    2014-06-19

    The members of the paired box (Pax) family regulate key developmental pathways in many metazoans as tissue-specific transcription factors. Vertebrate genomes typically possess nine Pax genes (Pax1-9), which are derived from four proto-Pax genes in the vertebrate ancestor that were later expanded through the so-called two-round (2R) whole-genome duplication. A recent study proposed that pax6a genes of a subset of teleost fishes (namely, acanthopterygians) are remnants of a paralog generated in the 2R genome duplication, to be renamed pax6.3, and reported one more group of vertebrate Pax genes (Pax6.2), most closely related to the Pax4/6 class. We propose to designate this new member Pax10 instead and reconstruct the evolutionary history of the Pax4/6/10 class with solid phylogenetic evidence. Our synteny analysis showed that Pax4, -6, and -10 originated in the 2R genome duplications early in vertebrate evolution. The phylogenetic analyses of relationships between teleost pax6a and other Pax4, -6, and -10 genes, however, do not support the proposed hypothesis of an ancient origin of the acanthopterygian pax6a genes in the 2R genome duplication. Instead, we confirmed the traditional scenario that the acanthopterygian pax6a is derived from the more recent teleost-specific genome duplication. Notably, Pax6 is present in all vertebrates surveyed to date, whereas Pax4 and -10 were lost multiple times in independent vertebrate lineages, likely because of their restricted expression patterns: Among Pax6-positive domains, Pax10 has retained expression in the adult retina alone, which we documented through in situ hybridization and quantitative reverse transcription polymerase chain reaction experiments on zebrafish, Xenopus, and anole lizard. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. β2-microglobulin gene duplication in cetartiodactyla remains intact only in pigs and possibly confers selective advantage to the species.

    PubMed

    Le, Thong Minh; Le, Quy Van Chanh; Truong, Dung Minh; Lee, Hye-Jeong; Choi, Min-Kyeung; Cho, Hyesun; Chung, Hak-Jae; Kim, Jin-Hoi; Do, Jeong-Tae; Song, Hyuk; Park, Chankyu

    2017-01-01

    Several β2-microglobulin (B2M) -bound protein complexes undertake key roles in various immune system pathways, including the neonatal Fc receptor (FcRn), cluster of differentiation 1 (CD1) protein, non-classical major histocompatibility complex (MHC), and well-known MHC class I molecules. Therefore, the duplication of B2M may lead to an increase in the biological competence of organisms to the environment. Based on the pig genome assembly SSC10.2, a segmental duplication of ~45.5 kb, encoding the entire B2M protein, was identified in pig chromosome 1. Through experimental validation, we confirmed the functional duplication of the B2M gene with a completely identical coding sequence between two copies in pigs. Considering the importance of B2M in the immune system, we performed the phylogenetic analysis of B2M duplication in ten mammalian species, confirming the presence of B2M duplication in cetartioldactyls, like cattle, sheep, goats, pigs and whales, but non-cetartiodactyl species, like mice, cats, dogs, horses, and humans. The density of long interspersed nuclear element (LINE) at the edges of duplicated blocks (39 to 66%) was found to be 2 to 3-fold higher than the average (20.12%) of the pig genome, suggesting its role in the duplication event. The B2M mRNA expression level in pigs was 12.71 and 7.57 times (2-ΔΔCt values) higher than humans and mice, respectively. However, we were unable to experimentally demonstrate the difference in the level of B2M protein because species specific anti-B2M antibodies are not available. We reported, for the first time, the functional duplication of the B2M gene in animals. The identification of partially remaining duplicated B2M sequences in the genomes of only cetartiodactyls indicates that the event was lineage specific. B2M duplication could be beneficial to the immune system of pigs by increasing the availability of MHC class I light chain protein, B2M, to complex with the proteins encoded by the relatively large number of MHC class I heavy chain genes in pigs. Further studies are necessary to address the biological meaning of increased expression of B2M.

  1. Mirror-image duplication of the primary axis and heart in Xenopus embryos by the overexpression of Msx-1 gene.

    PubMed

    Chen, Y; Solursh, M

    1995-10-01

    The Msx-1 gene (formerly known as Hox-7) is a member of a discrete subclass of homeobox-containing genes. Examination of the expression pattern of Msx-1 in murine and avian embryos suggests that this gene may be involved in the regionalization of the medio-lateral axis during earlier development. We have examined the possible functions of Xenopus Msx-1 during early Xenopus embryonic development by overexpression of the Msx-1 gene. Overexpression of Msx-1 causes a left-right mirror-image duplication of primary axial structures, including notochord, neural tube, somites, suckers, and foregut. The embryonic developing heart is also mirror-image duplicated, including looping directions and polarity. These results indicate that Msx-1 may be involved in the mesoderm formation as well as left-right patterning in the early Xenopus embryonic development.

  2. Ascorbate peroxidase-related (APx-R) is not a duplicable gene.

    PubMed

    Dunand, Christophe; Mathé, Catherine; Lazzarotto, Fernanda; Margis, Rogério; Margis-Pinheiro, Marcia

    2011-12-01

    Phylogenetic, genomic and functional analyses have allowed the identification of a new class of putative heme peroxidases, so called APx-R (APx-Related). These new class, mainly present in the green lineage (including green algae and land plants), can also be detected in other unicellular chloroplastic organisms. Except for recent polyploid organisms, only single-copy of APx-R gene was detected in each genome, suggesting that the majority of the APx-R extra-copies were lost after chromosomal or segmental duplications. In a similar way, most APx-R co-expressed genes in Arabidopsis genome do not have conserved extra-copies after chromosomal duplications and are predicted to be localized in organelles, as are the APx-R. The member of this gene network can be considered as unique gene, well conserved through the evolution due to a strong negative selection pressure and a low evolution rate. © 2011 Landes Bioscience

  3. Expansion of banana (Musa acuminata) gene families involved in ethylene biosynthesis and signalling after lineage-specific whole-genome duplications.

    PubMed

    Jourda, Cyril; Cardi, Céline; Mbéguié-A-Mbéguié, Didier; Bocs, Stéphanie; Garsmeur, Olivier; D'Hont, Angélique; Yahiaoui, Nabila

    2014-05-01

    Whole-genome duplications (WGDs) are widespread in plants, and three lineage-specific WGDs occurred in the banana (Musa acuminata) genome. Here, we analysed the impact of WGDs on the evolution of banana gene families involved in ethylene biosynthesis and signalling, a key pathway for banana fruit ripening. Banana ethylene pathway genes were identified using comparative genomics approaches and their duplication modes and expression profiles were analysed. Seven out of 10 banana ethylene gene families evolved through WGD and four of them (1-aminocyclopropane-1-carboxylate synthase (ACS), ethylene-insensitive 3-like (EIL), ethylene-insensitive 3-binding F-box (EBF) and ethylene response factor (ERF)) were preferentially retained. Banana orthologues of AtEIN3 and AtEIL1, two major genes for ethylene signalling in Arabidopsis, were particularly expanded. This expansion was paralleled by that of EBF genes which are responsible for control of EIL protein levels. Gene expression profiles in banana fruits suggested functional redundancy for several MaEBF and MaEIL genes derived from WGD and subfunctionalization for some of them. We propose that EIL and EBF genes were co-retained after WGD in banana to maintain balanced control of EIL protein levels and thus avoid detrimental effects of constitutive ethylene signalling. In the course of evolution, subfunctionalization was favoured to promote finer control of ethylene signalling. © 2014 CIRAD New Phytologist © 2014 New Phytologist Trust.

  4. The Rice B-Box Zinc Finger Gene Family: Genomic Identification, Characterization, Expression Profiling and Diurnal Analysis

    PubMed Central

    Huang, Jianyan; Zhao, Xiaobo; Weng, Xiaoyu; Wang, Lei; Xie, Weibo

    2012-01-01

    Background The B-box (BBX) -containing proteins are a class of zinc finger proteins that contain one or two B-box domains and play important roles in plant growth and development. The Arabidopsis BBX gene family has recently been re-identified and renamed. However, there has not been a genome-wide survey of the rice BBX (OsBBX) gene family until now. Methodology/Principal Findings In this study, we identified 30 rice BBX genes through a comprehensive bioinformatics analysis. Each gene was assigned a uniform nomenclature. We described the chromosome localizations, gene structures, protein domains, phylogenetic relationship, whole life-cycle expression profile and diurnal expression patterns of the OsBBX family members. Based on the phylogeny and domain constitution, the OsBBX gene family was classified into five subfamilies. The gene duplication analysis revealed that only chromosomal segmental duplication contributed to the expansion of the OsBBX gene family. The expression profile of the OsBBX genes was analyzed by Affymetrix GeneChip microarrays throughout the entire life-cycle of rice cultivar Zhenshan 97 (ZS97). In addition, microarray analysis was performed to obtain the expression patterns of these genes under light/dark conditions and after three phytohormone treatments. This analysis revealed that the expression patterns of the OsBBX genes could be classified into eight groups. Eight genes were regulated under the light/dark treatments, and eleven genes showed differential expression under at least one phytohormone treatment. Moreover, we verified the diurnal expression of the OsBBX genes using the data obtained from the Diurnal Project and qPCR analysis, and the results indicated that many of these genes had a diurnal expression pattern. Conclusions/Significance The combination of the genome-wide identification and the expression and diurnal analysis of the OsBBX gene family should facilitate additional functional studies of the OsBBX genes. PMID:23118960

  5. Phylogenetic analysis of the “ECE” (CYC/TB1) clade reveals duplications predating the core eudicots

    PubMed Central

    Howarth, Dianella G.; Donoghue, Michael J.

    2006-01-01

    Flower symmetry is of special interest in understanding angiosperm evolution and ecology. Evidence from the Antirrhineae (snapdragon and relatives) indicates that several TCP gene-family transcription factors, especially CYCLOIDEA (CYC) and DICHOTOMA (DICH), play a role in specifying dorsal identity in the corolla and androecium of monosymmetric (bilateral) flowers. Studies of rosid and asterid angiosperms suggest that orthologous TCP genes may be important in dorsal identity, but there has been no broad phylogenetic context to determine copy number or orthology. Here, we compare published data from rosids and asterids with newly collected data from ranunculids, caryophyllids, Saxifragales, and Asterales to ascertain the phylogenetic placement of major duplications in the “ECE” (CYC/TB1) clade of TCP transcription factors. Bayesian analyses indicate that there are three major copies of “CYC” in the ECE clade, and that duplications leading to these copies predate the core eudicots. CYC1 contains no subsequent duplications and may not be expressed in floral tissue. CYC3 exhibits similar patterns of duplication to CYC2 in several groups. Using RT-PCR, we show that, in flowers of Lonicera morrowii (Caprifoliaceae), DipsCYC2B is expressed in the four dorsal petals and not in the ventral petal. DipsCYC3B is expressed in flower and petal primordia, possibly most strongly in the ventral petal. PMID:16754863

  6. Genome-wide analysis of the WRKY gene family in cotton.

    PubMed

    Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

    2014-12-01

    WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.

  7. Rapid bursts of androgen-binding protein (Abp) gene duplication occurred independently in diverse mammals

    PubMed Central

    2008-01-01

    Background The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) α, β and γ subunits. Further investigation of 14 α-like (Abpa) and 13 β- or γ-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Results Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. Conclusion We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification. PMID:18269759

  8. Rapid bursts of androgen-binding protein (Abp) gene duplication occurred independently in diverse mammals.

    PubMed

    Laukaitis, Christina M; Heger, Andreas; Blakley, Tyler D; Munclinger, Pavel; Ponting, Chris P; Karn, Robert C

    2008-02-12

    The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) alpha, beta and gamma subunits. Further investigation of 14 alpha-like (Abpa) and 13 beta- or gamma-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification.

  9. Genomic characterization, phylogenetic comparison and differential expression of the cyclic nucleotide-gated channels gene family in pear (Pyrus bretchneideri Rehd.).

    PubMed

    Chen, Jianqing; Yin, Hao; Gu, Jinping; Li, Leiting; Liu, Zhe; Jiang, Xueting; Zhou, Hongsheng; Wei, Shuwei; Zhang, Shaoling; Wu, Juyou

    2015-01-01

    The cyclic nucleotide-gated channel (CNGC) family is involved in the uptake of various cations, such as Ca(2+), to regulate plant growth and respond to biotic and abiotic stresses. However, there is far less information about this family in woody plants such as pear. Here, we provided a genome-wide identification and analysis of the CNGC gene family in pear. Phylogenetic analysis showed that the 21 pear CNGC genes could be divided into five groups (I, II, III, IVA and IVB). The majority of gene duplications in pear appeared to have been caused by segmental duplication and occurred 32.94-39.14 million years ago. Evolutionary analysis showed that positive selection had driven the evolution of pear CNGCs. Motif analyses showed that Group I CNGCs generally contained 26 motifs, which was the greatest number of motifs in all CNGC groups. Among these, eight motifs were shared by each group, suggesting that these domains play a conservative role in CNGC activity. Tissue-specific expression analysis indicated that functional diversification of the duplicated CNGC genes was a major feature of long-term evolution. Our results also suggested that the P-S6 and PBC & hinge domains had co-evolved during the evolution. These results provide valuable information to increase our understanding of the function, evolution and expression analyses of the CNGC gene family in higher plants. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. F-box genes: Genome-wide expansion, evolution and their contribution to pollen growth in pear (Pyrus bretschneideri).

    PubMed

    Wang, Guo-Ming; Yin, Hao; Qiao, Xin; Tan, Xu; Gu, Chao; Wang, Bao-Hua; Cheng, Rui; Wang, Ying-Zhen; Zhang, Shao-Ling

    2016-12-01

    F-box gene family, as one of the largest gene families in plants, plays crucial roles in regulating plant development, reproduction, cellular protein degradation and responses to biotic and abiotic stresses. However, comprehensive analysis of the F-box gene family in pear (Pyrus bretschneideri Rehd.) and other Rosaceae species has not been reported yet. Herein, we identified a total of 226 full-length F-box genes in pear for the first time. And these genes were further divided into various subgroups based on specific domains and phylogenetic analysis. Intriguingly, we observed that whole-genome duplication and dispersed duplication have a major contribution to F-box family expansion. Furthermore, the dynamic evolution for different modes of gene duplication was dissected. Interestingly, we found that dispersed and tandem duplicate have been evolving at a high rate. In addition, we found that F-box genes exhibited functional specificity based on GO analysis, and most of the F-box genes were significantly enriched in the protein binding (GO: 0005515) term, supporting that F-box genes might play a critical role for gene regulation in pear. Transcriptome and digital expression profiles revealed that F-box genes are involved in the development of multiple pear tissues. Overall, these results will set stage for elaborating the biological role of F-box genes in pear and other plants. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  11. Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate

    PubMed Central

    Adomako-Ankomah, Yaw; English, Elizabeth D.; Danielson, Jeffrey J.; Pernas, Lena F.; Parker, Michelle L.; Boulanger, Martin J.; Dubey, Jitender P.; Boyle, Jon P.

    2016-01-01

    In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum. Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA+ paralogs. Additionally, we found that exogenous expression of an HMA+ paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. PMID:26920761

  12. Recurrent 15q11.2 BP1-BP2 microdeletions and microduplications in the etiology of neurodevelopmental disorders.

    PubMed

    Picinelli, Chiara; Lintas, Carla; Piras, Ignazio Stefano; Gabriele, Stefano; Sacco, Roberto; Brogna, Claudia; Persico, Antonio Maria

    2016-12-01

    Rare and common CNVs can contribute to the etiology of neurodevelopmental disorders. One of the recurrent genomic aberrations associated with these phenotypes and proposed as a susceptibility locus is the 15q11.2 BP1-BP2 CNV encompassing TUBGCP5, CYFIP1, NIPA2, and NIPA1. Characterizing by array-CGH a cohort of 243 families with various neurodevelopmental disorders, we identified five patients carrying the 15q11.2 duplication and one carrying the deletion. All CNVs were confirmed by qPCR and were inherited, except for one duplication where parents were not available. The phenotypic spectrum of CNV carriers was broad but mainly neurodevelopmental, in line with all four genes being implicated in axonal growth and neural connectivity. Phenotypically normal and mildly affected carriers complicate the interpretation of this aberration. This variability may be due to reduced penetrance or altered gene dosage on a particular genetic background. We evaluated the expression levels of the four genes in peripheral blood RNA and found the expected reduction in the deleted case, while duplicated carriers displayed high interindividual variability. These data suggest that differential expression of these genes could partially account for differences in clinical phenotypes, especially among duplication carriers. Furthermore, urinary Mg 2+ levels appear negatively correlated with NIPA2 gene copy number, suggesting they could potentially represent a useful biomarker, whose reliability will need replication in larger samples. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  13. The yeast Holliday junction resolvase, CCE1, can restore wild-type mitochondrial DNA to human cells carrying rearranged mitochondrial DNA.

    PubMed

    Sembongi, Hiroshi; Di Re, Miriam; Bokori-Brown, Monika; Holt, Ian J

    2007-10-01

    Rearrangements of mitochondrial DNA (mtDNA) are a well-recognized cause of human disease; deletions are more frequent, but duplications are more readily transmitted to offspring. In theory, partial duplications of mtDNA can be resolved to partially deleted and wild-type (WT) molecules, via homologous recombination. Therefore, the yeast CCE1 gene, encoding a Holliday junction resolvase, was introduced into cells carrying partially duplicated or partially triplicated mtDNA. Some cell lines carrying the CCE1 gene had substantial amounts of WT mtDNA suggesting that the enzyme can mediate intramolecular recombination in human mitochondria. However, high levels of expression of CCE1 frequently led to mtDNA loss, and so it is necessary to strictly regulate the expression of CCE1 in human cells to ensure the selection and maintenance of WT mtDNA.

  14. Long-read sequencing uncovers the adaptive topography of a carnivorous plant genome

    PubMed Central

    Lan, Tianying; Renner, Tanya; Ibarra-Laclette, Enrique; Farr, Kimberly M.; Chang, Tien-Hao; Cervantes-Pérez, Sergio Alan; Zheng, Chunfang; Sankoff, David; Tang, Haibao; Purbojati, Rikky W.; Putra, Alexander; Drautz-Moses, Daniela I.; Schuster, Stephan C.; Herrera-Estrella, Luis; Albert, Victor A.

    2017-01-01

    Utricularia gibba, the humped bladderwort, is a carnivorous plant that retains a tiny nuclear genome despite at least two rounds of whole genome duplication (WGD) since common ancestry with grapevine and other species. We used a third-generation genome assembly with several complete chromosomes to reconstruct the two most recent lineage-specific ancestral genomes that led to the modern U. gibba genome structure. Patterns of subgenome dominance in the most recent WGD, both architectural and transcriptional, are suggestive of allopolyploidization, which may have generated genomic novelty and led to instantaneous speciation. Syntenic duplicates retained in polyploid blocks are enriched for transcription factor functions, whereas gene copies derived from ongoing tandem duplication events are enriched in metabolic functions potentially important for a carnivorous plant. Among these are tandem arrays of cysteine protease genes with trap-specific expression that evolved within a protein family known to be useful in the digestion of animal prey. Further enriched functions among tandem duplicates (also with trap-enhanced expression) include peptide transport (intercellular movement of broken-down prey proteins), ATPase activities (bladder-trap acidification and transmembrane nutrient transport), hydrolase and chitinase activities (breakdown of prey polysaccharides), and cell-wall dynamic components possibly associated with active bladder movements. Whereas independently polyploid Arabidopsis syntenic gene duplicates are similarly enriched for transcriptional regulatory activities, Arabidopsis tandems are distinct from those of U. gibba, while still metabolic and likely reflecting unique adaptations of that species. Taken together, these findings highlight the special importance of tandem duplications in the adaptive landscapes of a carnivorous plant genome. PMID:28507139

  15. Genome-wide identification and expression analysis of the apple ASR gene family in response to Alternaria alternata f. sp. mali.

    PubMed

    Huang, Kaihui; Zhong, Yan; Li, Yingjun; Zheng, Dan; Cheng, Zong-Ming

    2016-10-01

    The ABA/water stress/ripening-induced (ASR) gene family exists universally in higher plants, and many ASR genes are up-regulated during periods of environmental stress and fruit ripening. Although a considerable amount of research has been performed investigating ASR gene response to abiotic stresses, relatively little is known about their roles in response to biotic stresses. In this report, we identified five ASR genes in apple (Malus × domestica) and explored their phylogenetic relationship, duplication events, and selective pressure. Five apple ASR genes (Md-ASR) were divided into two clades based on phylogenetic analysis. Species-specific duplication was detected in M. domestica ASR genes. Leaves of 'Golden delicious' and 'Starking' were infected with Alternaria alternata f. sp. mali, which causes apple blotch disease, and examined for the expression of the ASR genes in lesion areas during the first 72 h after inoculation. Md-ASR genes showed different expression patterns at different sampling times in 'Golden delicious' and 'Starking'. The activities of stress-related enzymes, peroxidase (POD), superoxide dismutase (SOD), catalase (CAT), phenylalanine ammonia lyase (PAL), and polyphenoloxidase (PPO), and the content of malondialdehyde (MDA) were also measured in different stages of disease development in two cultivars. The ASR gene expression patterns and theses physiological indexes for disease resistance suggested that Md-ASR genes are involved in biotic stress responses in apple.

  16. Case report of individual with cutaneous immunodeficiency and novel 1p36 duplication

    PubMed Central

    Hatter, Alyn D; Soler, David C; Curtis, Christine; Cooper, Kevin D; McCormick, Thomas S

    2016-01-01

    Introduction Crusted or Norwegian scabies is an infectious skin dermatopathology usually associated with an underlying immunodeficiency condition. It is caused when the mite Sarcoptes scabiei infects the skin, and the immune system is unable to control its spread, leading to a massive hyperinfestation with a simultaneous inflammatory and hyperkeratotic reaction. This is the first report of a novel 1p36 duplication associated with a recurrent infection of crusted scabies. Case report We describe a 34-year-old patient with a cutaneous immunodeficiency characterized by recurrent crusted scabies infestation, diffuse tinea, and recurrent staphylococcal cellulitis, who we suspected had an undiagnosed syndrome. The patient also suffered from mental retardation, renal failure, and premature senescence. A cytogenetic fluorescence in situ hybridization analysis revealed a 9.34 Mb duplication within the short (p) arm of chromosome 1, precisely from 1p36.11 to 1p36.21, with an adjacent 193 kb copy gain entirely within 1p36.11. In addition, chromosome 4 had a 906 kb gain in 4p16.1 and chromosome 9 had a 81 kb copy gain in 9p24.3. Over 100 genes localized within these duplicated regions. Gene expression array revealed 82 genes whose expression changed >1.5-fold compared to a healthy age-matched skin control, but among them only the lipolytic enzyme arylacetamide deacetylase-like 3 was found within the duplicated 1p36 region of chromosome 1. Discussion Although genetic duplications in the 1p36 region have been previously described, our report describes a novel duplicative variant within the 1p36 region. The patient did not have a past history of immunosuppression but was afflicted by a recurrent case of crusted scabies, raising the possibility that the recurrent infection was associated with the 1p36 genetic duplication. Conclusion To our knowledge, the specific duplicated sequence between 1p36.11 and p36.21 found in our patient has never been previously reported. We reviewed and compared the clinical, genotyping, and gene microarray results of our patient in order to characterize this novel 1p36 duplication syndrome, which might have contributed to the recurrent scabies infection in this patient. PMID:26834495

  17. Williams syndrome deletions and duplications: Genetic windows to understanding anxiety, sociality, autism, and schizophrenia.

    PubMed

    Crespi, Bernard J; Procyshyn, Tanya L

    2017-08-01

    We describe and evaluate an integrative hypothesis for helping to explain the major neurocognitive features of individuals with Williams syndrome region deletions and duplications. First, we demonstrate how the cognitive differences between Williams syndrome individuals, individuals with duplications of this region, and healthy individuals parallel the differences between individuals subject to effects of increased or decreased oxytocin. Second, we synthesize evidence showing that variation in expression of the gene GTF2I (General Transcription Factor II-I) underlies the primary social phenotypes of Williams syndrome and that common genetic variation in GTF2I mediates oxytocin reactivity, and its correlates, in healthy populations. Third, we describe findings relevant to the hypothesis that the GTF2I gene is subject to parent of origin effects whose behavioral expression fits with predictions from the kinship theory of genomic imprinting. Fourth, we describe how Williams syndrome can be considered, in part, as an autistic syndrome of Lorna Wing's 'active-but-odd' autism subtype, in contrast to associations of duplications with both schizophrenia and autism. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Identification, expression, and comparative genomic analysis of the IPT and CKX gene families in Chinese cabbage (Brassica rapa ssp. pekinensis)

    PubMed Central

    2013-01-01

    Background Cytokinins (CKs) have significant roles in various aspects of plant growth and development, and they are also involved in plant stress adaptations. The fine-tuning of the controlled CK levels in individual tissues, cells, and organelles is properly maintained by isopentenyl transferases (IPTs) and cytokinin oxidase/dehydrogenases (CKXs). Chinese cabbage is one of the most economically important vegetable crops worldwide. The whole genome sequencing of Brassica rapa enables us to perform the genome-wide identification and functional analysis of the IPT and CKX gene families. Results In this study, a total of 13 BrIPT genes and 12 BrCKX genes were identified. The gene structures, conserved domains and phylogenetic relationships were analyzed. The isoelectric point, subcellular localization and glycosylation sites of the proteins were predicted. Segmental duplicates were found in both BrIPT and BrCKX gene families. We also analyzed evolutionary patterns and divergence of the IPT and CKX genes in the Cruciferae family. The transcription levels of BrIPT and BrCKX genes were analyzed to obtain an initial picture of the functions of these genes. Abiotic stress elements related to adverse environmental stimuli were found in the promoter regions of BrIPT and BrCKX genes and they were confirmed to respond to drought and high salinity conditions. The effects of 6-BA and ABA on the expressions of BrIPT and BrCKX genes were also investigated. Conclusions The expansion of BrIPT and BrCKX genes after speciation from Arabidopsis thaliana is mainly attributed to segmental duplication events during the whole genome triplication (WGT) and substantial duplicated genes are lost during the long evolutionary history. Genes produced by segmental duplication events have changed their expression patterns or may adopted new functions and thus are obtained. BrIPT and BrCKX genes respond well to drought and high salinity stresses, and their transcripts are affected by exogenous hormones, such as 6-BA and ABA, suggesting their potential roles in abiotic stress conditions and regulatory mechanisms of plant hormone homeostasis. The appropriate modulation of endogenous CKs levels by IPT and CKX genes is a promising approach for developing economically important high-yielding and high-quality stress-tolerant crops in agriculture. PMID:24001366

  19. A dynamic history of gene duplications and losses characterizes the evolution of the SPARC family in eumetazoans.

    PubMed

    Bertrand, Stephanie; Fuentealba, Jaime; Aze, Antoine; Hudson, Clare; Yasuo, Hitoyoshi; Torrejon, Marcela; Escriva, Hector; Marcellini, Sylvain

    2013-04-22

    The vertebrates share the ability to produce a skeleton made of mineralized extracellular matrix. However, our understanding of the molecular changes that accompanied their emergence remains scarce. Here, we describe the evolutionary history of the SPARC (secreted protein acidic and rich in cysteine) family, because its vertebrate orthologues are expressed in cartilage, bones and teeth where they have been proposed to bind calcium and act as extracellular collagen chaperones, and because further duplications of specific SPARC members produced the small calcium-binding phosphoproteins (SCPP) family that is crucial for skeletal mineralization to occur. Both phylogeny and synteny conservation analyses reveal that, in the eumetazoan ancestor, a unique ancestral gene duplicated to give rise to SPARC and SPARCB described here for the first time. Independent losses have eliminated one of the two paralogues in cnidarians, protostomes and tetrapods. Hence, only non-tetrapod deuterostomes have conserved both genes. Remarkably, SPARC and SPARCB paralogues are still linked in the amphioxus genome. To shed light on the evolution of the SPARC family members in chordates, we performed a comprehensive analysis of their embryonic expression patterns in amphioxus, tunicates, teleosts, amphibians and mammals. Our results show that in the chordate lineage SPARC and SPARCB family members were recurrently recruited in a variety of unrelated tissues expressing collagen genes. We propose that one of the earliest steps of skeletal evolution involved the co-expression of SPARC paralogues with collagenous proteins.

  20. The pineapple genome and the evolution of CAM photosynthesis.

    PubMed

    Ming, Ray; VanBuren, Robert; Wai, Ching Man; Tang, Haibao; Schatz, Michael C; Bowers, John E; Lyons, Eric; Wang, Ming-Li; Chen, Jung; Biggers, Eric; Zhang, Jisen; Huang, Lixian; Zhang, Lingmao; Miao, Wenjing; Zhang, Jian; Ye, Zhangyao; Miao, Chenyong; Lin, Zhicong; Wang, Hao; Zhou, Hongye; Yim, Won C; Priest, Henry D; Zheng, Chunfang; Woodhouse, Margaret; Edger, Patrick P; Guyot, Romain; Guo, Hao-Bo; Guo, Hong; Zheng, Guangyong; Singh, Ratnesh; Sharma, Anupma; Min, Xiangjia; Zheng, Yun; Lee, Hayan; Gurtowski, James; Sedlazeck, Fritz J; Harkess, Alex; McKain, Michael R; Liao, Zhenyang; Fang, Jingping; Liu, Juan; Zhang, Xiaodan; Zhang, Qing; Hu, Weichang; Qin, Yuan; Wang, Kai; Chen, Li-Yu; Shirley, Neil; Lin, Yann-Rong; Liu, Li-Yu; Hernandez, Alvaro G; Wright, Chris L; Bulone, Vincent; Tuskan, Gerald A; Heath, Katy; Zee, Francis; Moore, Paul H; Sunkar, Ramanjulu; Leebens-Mack, James H; Mockler, Todd; Bennetzen, Jeffrey L; Freeling, Michael; Sankoff, David; Paterson, Andrew H; Zhu, Xinguang; Yang, Xiaohan; Smith, J Andrew C; Cushman, John C; Paull, Robert E; Yu, Qingyi

    2015-12-01

    Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues. CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.

  1. History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis.

    PubMed

    Gazave, Eve; Guillou, Aurélien; Balavoine, Guillaume

    2014-01-01

    The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway. We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models. Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages.

  2. Papain-like cysteine proteases in Carica papaya: lineage-specific gene duplication and expansion.

    PubMed

    Liu, Juan; Sharma, Anupma; Niewiara, Marie Jamille; Singh, Ratnesh; Ming, Ray; Yu, Qingyi

    2018-01-06

    Papain-like cysteine proteases (PLCPs), a large group of cysteine proteases structurally related to papain, play important roles in plant development, senescence, and defense responses. Papain, the first cysteine protease whose structure was determined by X-ray crystallography, plays a crucial role in protecting papaya from herbivorous insects. Except the four major PLCPs purified and characterized in papaya latex, the rest of the PLCPs in papaya genome are largely unknown. We identified 33 PLCP genes in papaya genome. Phylogenetic analysis clearly separated plant PLCP genes into nine subfamilies. PLCP genes are not equally distributed among the nine subfamilies and the number of PLCPs in each subfamily does not increase or decrease proportionally among the seven selected plant species. Papaya showed clear lineage-specific gene expansion in the subfamily III. Interestingly, all four major PLCPs purified from papaya latex, including papain, chymopapain, glycyl endopeptidase and caricain, were grouped into the lineage-specific expansion branch in the subfamily III. Mapping PLCP genes on chromosomes of five plant species revealed that lineage-specific expansions of PLCP genes were mostly derived from tandem duplications. We estimated divergence time of papaya PLCP genes of subfamily III. The major duplication events leading to lineage-specific expansion of papaya PLCP genes in subfamily III were estimated at 48 MYA, 34 MYA, and 16 MYA. The gene expression patterns of the papaya PLCP genes in different tissues were assessed by transcriptome sequencing and qRT-PCR. Most of the papaya PLCP genes of subfamily III expressed at high levels in leaf and green fruit tissues. Tandem duplications played the dominant role in affecting copy number of PLCPs in plants. Significant variations in size of the PLCP subfamilies among species may reflect genetic adaptation of plant species to different environments. The lineage-specific expansion of papaya PLCPs of subfamily III might have been promoted by the continuous reciprocal selective effects of herbivore attack and plant defense.

  3. Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals.

    PubMed

    Patel, Vidushi S; Cooper, Steven J B; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer A M

    2008-07-25

    Vertebrate alpha (alpha)- and beta (beta)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the alpha- and beta-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil beta-globin gene (omega) in the marsupial alpha-cluster, however, suggested that duplication of the alpha-beta cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous alpha- and beta-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. The platypus alpha-globin cluster (chromosome 21) contains embryonic and adult alpha- globin genes, a beta-like omega-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-zeta-zeta'-alphaD-alpha3-alpha2-alpha1-omega-GBY-3'. The platypus beta-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-epsilon-beta-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate alpha-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal beta-globin clusters are embedded in olfactory genes. Thus, the mammalian alpha- and beta-globin clusters are orthologous to the bird alpha- and beta-globin clusters respectively. We propose that alpha- and beta-globin clusters evolved from an ancient MPG-C16orf35-alpha-beta-GBY-LUC7L arrangement 410 million years ago. A copy of the original beta (represented by omega in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of beta-globin genes with different expression profiles in different lineages.

  4. Genome-Wide Identification, Evolutionary Expansion, and Expression Profile of Homeodomain-Leucine Zipper Gene Family in Poplar (Populus trichocarpa)

    PubMed Central

    Hu, Ruibo; Chi, Xiaoyuan; Chai, Guohua; Kong, Yingzhen; He, Guo; Wang, Xiaoyu; Shi, Dachuan; Zhang, Dongyuan; Zhou, Gongke

    2012-01-01

    Background Homeodomain-leucine zipper (HD-ZIP) proteins are plant-specific transcriptional factors known to play crucial roles in plant development. Although sequence phylogeny analysis of Populus HD-ZIPs was carried out in a previous study, no systematic analysis incorporating genome organization, gene structure, and expression compendium has been conducted in model tree species Populus thus far. Principal Findings In this study, a comprehensive analysis of Populus HD-ZIP gene family was performed. Sixty-three full-length HD-ZIP genes were found in Populus genome. These Populus HD-ZIP genes were phylogenetically clustered into four distinct subfamilies (HD-ZIP I–IV) and predominately distributed across 17 linkage groups (LG). Fifty genes from 25 Populus paralogous pairs were located in the duplicated blocks of Populus genome and then preferentially retained during the sequential evolutionary courses. Genomic organization analyses indicated that purifying selection has played a pivotal role in the retention and maintenance of Populus HD-ZIP gene family. Microarray analysis has shown that 21 Populus paralogous pairs have been differentially expressed across different tissues and under various stresses, with five paralogous pairs showing nearly identical expression patterns, 13 paralogous pairs being partially redundant and three paralogous pairs diversifying significantly. Quantitative real-time RT-PCR (qRT-PCR) analysis performed on 16 selected Populus HD-ZIP genes in different tissues and under both drought and salinity stresses confirms their tissue-specific and stress-inducible expression patterns. Conclusions Genomic organizations indicated that segmental duplications contributed significantly to the expansion of Populus HD-ZIP gene family. Exon/intron organization and conserved motif composition of Populus HD-ZIPs are highly conservative in the same subfamily, suggesting the members in the same subfamilies may also have conservative functionalities. Microarray and qRT-PCR analyses showed that 89% (56 out of 63) of Populus HD-ZIPs were duplicate genes that might have been retained by substantial subfunctionalization. Taken together, these observations may lay the foundation for future functional analysis of Populus HD-ZIP genes to unravel their biological roles. PMID:22359569

  5. The Goddard and Saturn Genes Are Essential for Drosophila Male Fertility and May Have Arisen De Novo

    PubMed Central

    Gubala, Anna M.; Schmitz, Jonathan F.; Kearns, Michael J.; Vinh, Tery T.; Bornberg-Bauer, Erich; Wolfner, Mariana F.

    2017-01-01

    New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. PMID:28104747

  6. Genome-wide analysis of WRKY gene family in Cucumis sativus

    PubMed Central

    2011-01-01

    Background WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. Results We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Conclusions Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes. PMID:21955985

  7. Genome-wide analysis of WRKY gene family in Cucumis sativus.

    PubMed

    Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan

    2011-09-28

    WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.

  8. Molecular evolution of a chordate specific family of G protein-coupled receptors

    PubMed Central

    2011-01-01

    Background Chordate evolution is a history of innovations that is marked by physical and behavioral specializations, which led to the development of a variety of forms from a single ancestral group. Among other important characteristics, vertebrates obtained a well developed brain, anterior sensory structures, a closed circulatory system and gills or lungs as blood oxygenation systems. The duplication of pre-existing genes had profound evolutionary implications for the developmental complexity in vertebrates, since mutations modifying the function of a duplicated protein can lead to novel functions, improving the evolutionary success. Results We analyzed here the evolution of the GPRC5 family of G protein-coupled receptors by comprehensive similarity searches and found that the receptors are only present in chordates and that the size of the receptor family expanded, likely due to genome duplication events in the early history of vertebrate evolution. We propose that a single GPRC5 receptor coding gene originated in a stem chordate ancestor and gave rise by duplication events to a gene family comprising three receptor types (GPRC5A-C) in vertebrates, and a fourth homologue present only in mammals (GPRC5D). Additional duplications of GPRC5B and GPRC5C sequences occurred in teleost fishes. The finding that the expression patterns of the receptors are evolutionarily conserved indicates an important biological function of these receptors. Moreover, we found that expression of GPRC5B is regulated by vitamin A in vivo, confirming previous findings that linked receptor expression to retinoic acid levels in tumor cell lines and strengthening the link between the receptor expression and the development of a complex nervous system in chordates, known to be dependent on retinoic acid signaling. Conclusions GPRC5 receptors, a class of G protein-coupled receptors with unique sequence characteristics, may represent a molecular novelty that helped non-chordates to become chordates. PMID:21827690

  9. Plants with double genomes might have had a better chance to survive the Cretaceous–Tertiary extinction event

    PubMed Central

    Fawcett, Jeffrey A.; Maere, Steven; Van de Peer, Yves

    2009-01-01

    Most flowering plants have been shown to be ancient polyploids that have undergone one or more whole genome duplications early in their evolution. Furthermore, many different plant lineages seem to have experienced an additional, more recent genome duplication. Starting from paralogous genes lying in duplicated segments or identified in large expressed sequence tag collections, we dated these youngest duplication events through penalized likelihood phylogenetic tree inference. We show that a majority of these independent genome duplications are clustered in time and seem to coincide with the Cretaceous–Tertiary (KT) boundary. The KT extinction event is the most recent mass extinction caused by one or more catastrophic events such as a massive asteroid impact and/or increased volcanic activity. These events are believed to have generated global wildfires and dust clouds that cut off sunlight during long periods of time resulting in the extinction of ≈60% of plant species, as well as a majority of animals, including dinosaurs. Recent studies suggest that polyploid species can have a higher adaptability and increased tolerance to different environmental conditions. We propose that polyploidization may have contributed to the survival and propagation of several plant lineages during or following the KT extinction event. Due to advantages such as altered gene expression leading to hybrid vigor and an increased set of genes and alleles available for selection, polyploid plants might have been better able to adapt to the drastically changed environment 65 million years ago. PMID:19325131

  10. Genome-wide analysis and expression profiling suggest diverse roles of GH3 genes during development and abiotic stress responses in legumes

    PubMed Central

    Singh, Vikash K.; Jain, Mukesh; Garg, Rohini

    2014-01-01

    Growth hormone auxin regulates various cellular processes by altering the expression of diverse genes in plants. Among various auxin-responsive genes, GH3 genes maintain endogenous auxin homeostasis by conjugating excess of auxin with amino acids. GH3 genes have been characterized in many plant species, but not in legumes. In the present work, we identified members of GH3 gene family and analyzed their chromosomal distribution, gene structure, gene duplication and phylogenetic analysis in different legumes, including chickpea, soybean, Medicago, and Lotus. A comprehensive expression analysis in different vegetative and reproductive tissues/stages revealed that many of GH3 genes were expressed in a tissue-specific manner. Notably, chickpea CaGH3-3, soybean GmGH3-8 and -25, and Lotus LjGH3-4, -5, -9 and -18 genes were up-regulated in root, indicating their putative role in root development. In addition, chickpea CaGH3-1 and -7, and Medicago MtGH3-7, -8, and -9 were found to be highly induced under drought and/or salt stresses, suggesting their role in abiotic stress responses. We also observed the examples of differential expression pattern of duplicated GH3 genes in soybean, indicating their functional diversification. Furthermore, analyses of three-dimensional structures, active site residues and ligand preferences provided molecular insights into function of GH3 genes in legumes. The analysis presented here would help in investigation of precise function of GH3 genes in legumes during development and stress conditions. PMID:25642236

  11. Naturally Occurring Deletion Mutants of the Pig-Specific, Intestinal Crypt Epithelial Cell Protein CLCA4b without Apparent Phenotype

    PubMed Central

    Plog, Stephanie; Klymiuk, Nikolai; Binder, Stefanie; Van Hook, Matthew J.; Thoreson, Wallace B.; Gruber, Achim D.; Mundhenk, Lars

    2015-01-01

    The human CLCA4 (chloride channel regulator, calcium-activated) modulates the intestinal phenotype of cystic fibrosis (CF) patients via an as yet unknown pathway. With the generation of new porcine CF models, species-specific differences between human modifiers of CF and their porcine orthologs are considered critical for the translation of experimental data. Specifically, the porcine ortholog to the human CF modulator gene CLCA4 has recently been shown to be duplicated into two separate genes, CLCA4a and CLCA4b. Here, we characterize the duplication product, CLCA4b, in terms of its genomic structure, tissue and cellular expression patterns as well as its in vitro electrophysiological properties. The CLCA4b gene is a pig-specific duplication product of the CLCA4 ancestor and its protein is exclusively expressed in small and large intestinal crypt epithelial cells, a niche specifically occupied by no other porcine CLCA family member. Surprisingly, a unique deleterious mutation of the CLCA4b gene is spread among modern and ancient breeds in the pig population, but this mutation did not result in an apparent phenotype in homozygously affected animals. Electrophysiologically, neither the products of the wild type nor of the mutated CLCA4b genes were able to evoke a calcium-activated anion conductance, a consensus feature of other CLCA proteins. The apparently pig-specific duplication of the CLCA4 gene with unique expression of the CLCA4b protein variant in intestinal crypt epithelial cells where the porcine CFTR is also present raises the question of whether it may modulate the porcine CF phenotype. Moreover, the naturally occurring null variant of CLCA4b will be valuable for the understanding of CLCA protein function and their relevance in modulating the CF phenotype. PMID:26474299

  12. Breakpoint analysis of the pericentric inversion between chimpanzee chromosome 10 and the homologous chromosome 12 in humans.

    PubMed

    Kehrer-Sawatzki, H; Sandig, C A; Goidts, V; Hameister, H

    2005-01-01

    During this study, we analysed the pericentric inversion that distinguishes human chromosome 12 (HSA12) from the homologous chimpanzee chromosome (PTR10). Two large chimpanzee-specific duplications of 86 and 23 kb were observed in the breakpoint regions, which most probably occurred associated with the inversion. The inversion break in PTR10p caused the disruption of the SLCO1B3 gene in exon 11. However, the 86-kb duplication includes the functional SLCO1B3 locus, which is thus retained in the chimpanzee, although inverted to PTR10q. The second duplication spans 23 kb and does not contain expressed sequences. Eleven genes map to a region of about 1 Mb around the breakpoints. Six of these eleven genes are not among the differentially expressed genes as determined previously by comparing the human and chimpanzee transcriptome of fibroblast cell lines, blood leukocytes, liver and brain samples. These findings imply that the inversion did not cause major expression differences of these genes. Comparative FISH analysis with BACs spanning the inversion breakpoints in PTR on metaphase chromosomes of gorilla (GGO) confirmed that the pericentric inversion of the chromosome 12 homologs in GGO and PTR have distinct breakpoints and that humans retain the ancestral arrangement. These findings coincide with the trend observed in hominoid karyotype evolution that humans have a karyotype close to an ancestral one, while African great apes present with more derived chromosome arrangements. Copyright (c) 2005 S. Karger AG, Basel.

  13. The duplication mutation of Quebec platelet disorder dysregulates PLAU, but not C10orf55, selectively increasing production of normal PLAU transcripts by megakaryocytes but not granulocytes.

    PubMed

    Hayward, Catherine P M; Liang, Minggao; Tasneem, Subia; Soomro, Asim; Waye, John S; Paterson, Andrew D; Rivard, Georges E; Wilson, Michael D

    2017-01-01

    Quebec Platelet disorder (QPD) is a unique bleeding disorder that markedly increases urokinase plasminogen activator (uPA) in megakaryocytes and platelets but not in plasma or urine. The cause is tandem duplication of a 78 kb region of chromosome 10 containing PLAU (the uPA gene) and C10orf55, a gene of unknown function. QPD increases uPA in platelets and megakaryocytes >100 fold, far more than expected for a gene duplication. To investigate the tissue-specific effect that PLAU duplication has on gene expression and transcript structure in QPD, we tested if QPD leads to: 1) overexpression of normal or unique PLAU transcripts; 2) increased uPA in leukocytes; 3) altered levels of C10orf55 mRNA and/or protein in megakaryocytes and leukocytes; and 4) global changes in megakaryocyte gene expression. Primary cells and cultured megakaryocytes from donors were prepared for quantitative reverse polymerase chain reaction analyses, RNA-seq and protein expression analyses. Rapidly isolated blood leukocytes from QPD subjects showed only a 3.9 fold increase in PLAU transcript levels, in keeping with the normal to minimally increased uPA in affinity purified, QPD leukocytes. All subjects had more uPA in granulocytes than monocytes and minimal uPA in lymphocytes. QPD leukocytes expressed PLAU alleles in proportions consistent with an extra copy of PLAU on the disease chromosome, unlike QPD megakaryocytes. QPD PLAU transcripts were consistent with reference gene models, with a much higher proportion of reads originating from the disease chromosome in megakaryocytes than granulocytes. QPD and control megakaryocytes contained minimal reads for C10orf55, and C10orf55 protein was not increased in QPD megakaryocytes or platelets. Finally, our QPD megakaryocyte transcriptome analysis revealed a global down regulation of the interferon type 1 pathway. We suggest that the low endogenous levels of uPA in blood are actively regulated, and that the regulatory mechanisms are disrupted in QPD in a megakaryocyte-specific manner.

  14. The duplication mutation of Quebec platelet disorder dysregulates PLAU, but not C10orf55, selectively increasing production of normal PLAU transcripts by megakaryocytes but not granulocytes

    PubMed Central

    Soomro, Asim; Waye, John S.; Paterson, Andrew D.; Rivard, Georges E.; Wilson, Michael D.

    2017-01-01

    Quebec Platelet disorder (QPD) is a unique bleeding disorder that markedly increases urokinase plasminogen activator (uPA) in megakaryocytes and platelets but not in plasma or urine. The cause is tandem duplication of a 78 kb region of chromosome 10 containing PLAU (the uPA gene) and C10orf55, a gene of unknown function. QPD increases uPA in platelets and megakaryocytes >100 fold, far more than expected for a gene duplication. To investigate the tissue-specific effect that PLAU duplication has on gene expression and transcript structure in QPD, we tested if QPD leads to: 1) overexpression of normal or unique PLAU transcripts; 2) increased uPA in leukocytes; 3) altered levels of C10orf55 mRNA and/or protein in megakaryocytes and leukocytes; and 4) global changes in megakaryocyte gene expression. Primary cells and cultured megakaryocytes from donors were prepared for quantitative reverse polymerase chain reaction analyses, RNA-seq and protein expression analyses. Rapidly isolated blood leukocytes from QPD subjects showed only a 3.9 fold increase in PLAU transcript levels, in keeping with the normal to minimally increased uPA in affinity purified, QPD leukocytes. All subjects had more uPA in granulocytes than monocytes and minimal uPA in lymphocytes. QPD leukocytes expressed PLAU alleles in proportions consistent with an extra copy of PLAU on the disease chromosome, unlike QPD megakaryocytes. QPD PLAU transcripts were consistent with reference gene models, with a much higher proportion of reads originating from the disease chromosome in megakaryocytes than granulocytes. QPD and control megakaryocytes contained minimal reads for C10orf55, and C10orf55 protein was not increased in QPD megakaryocytes or platelets. Finally, our QPD megakaryocyte transcriptome analysis revealed a global down regulation of the interferon type 1 pathway. We suggest that the low endogenous levels of uPA in blood are actively regulated, and that the regulatory mechanisms are disrupted in QPD in a megakaryocyte-specific manner. PMID:28301587

  15. Spotting and validation of a genome wide oligonucleotide chip with duplicate measurement of each gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thomassen, Mads; Skov, Vibe; Eiriksdottir, Freyja

    2006-06-16

    The quality of DNA microarray based gene expression data relies on the reproducibility of several steps in a microarray experiment. We have developed a spotted genome wide microarray chip with oligonucleotides printed in duplicate in order to minimise undesirable biases, thereby optimising detection of true differential expression. The validation study design consisted of an assessment of the microarray chip performance using the MessageAmp and FairPlay labelling kits. Intraclass correlation coefficient (ICC) was used to demonstrate that MessageAmp was significantly more reproducible than FairPlay. Further examinations with MessageAmp revealed the applicability of the system. The linear range of the chips wasmore » three orders of magnitude, the precision was high, as 95% of measurements deviated less than 1.24-fold from the expected value, and the coefficient of variation for relative expression was 13.6%. Relative quantitation was more reproducible than absolute quantitation and substantial reduction of variance was attained with duplicate spotting. An analysis of variance (ANOVA) demonstrated no significant day-to-day variation.« less

  16. Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

    PubMed

    van der Ley, P

    1988-11-01

    Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.

  17. Genome-wide identification, splicing, and expression analysis of the myosin gene family in maize (Zea mays)

    PubMed Central

    Wang, Guifeng; Zhong, Mingyu; Wang, Gang; Song, Rentao

    2014-01-01

    The actin-based myosin system is essential for the organization and dynamics of the endomembrane system and transport network in plant cells. Plants harbour two unique myosin groups, class VIII and class XI, and the latter is structurally and functionally analogous to the animal and fungal class V myosin. Little is known about myosins in grass, even though grass includes several agronomically important cereal crops. Here, we identified 14 myosin genes from the genome of maize (Zea mays). The relatively larger sizes of maize myosin genes are due to their much longer introns, which are abundant in transposable elements. Phylogenetic analysis indicated that maize myosin genes could be classified into class VIII and class XI, with three and 11 members, respectively. Apart from subgroup XI-F, the remaining subgroups were duplicated at least in one analysed lineage, and the duplication events occurred more extensively in Arabidopsis than in maize. Only two pairs of maize myosins were generated from segmental duplication. Expression analysis revealed that most maize myosin genes were expressed universally, whereas a few members (XI-1, -6, and -11) showed an anther-specific pattern, and many underwent extensive alternative splicing. We also found a short transcript at the O1 locus, which conceptually encoded a headless myosin that most likely functions at the transcriptional level rather than via a dominant-negative mechanism at the translational level. Together, these data provide significant insights into the evolutionary and functional characterization of maize myosin genes that could transfer to the identification and application of homologous myosins of other grasses. PMID:24363426

  18. Structure and vascular tissue expression of duplicated TERMINAL EAR1-like paralogues in poplar.

    PubMed

    Charon, Céline; Vivancos, Julien; Mazubert, Christelle; Paquet, Nicolas; Pilate, Gilles; Dron, Michel

    2010-02-01

    TERMINAL EAR1-like (TEL) genes encode putative RNA-binding proteins only found in land plants. Previous studies suggested that they may regulate tissue and organ initiation in Poaceae. Two TEL genes were identified in both Populus trichocarpa and the hybrid aspen Populus tremula x P. alba, named, respectively, PoptrTEL1-2 and PtaTEL1-2. The analysis of the organisation around the PoptrTEL genes in the P. trichocarpa genome and the estimation of the synonymous substitution rate for PtaTEL1-2 genes indicate that the paralogous link between these two Populus TEL genes probably results from the Salicoid large-scale gene-duplication event. Phylogenetic analyses confirmed their orthology link with the other TEL genes. The expression pattern of both PtaTEL genes appeared to be restricted to the mother cells of the plant body: leaf founder cells, leaf primordia, axillary buds and root differentiating tissues, as well as to mother cells of vascular tissues. Most interestingly, PtaTEL1-2 transcripts were found in differentiating cells of secondary xylem and phloem, but probably not in the cambium itself. Taken together, these results indicate specific expression of the TEL genes in differentiating cells controlling tissue and organ development in Populus (and other Angiosperm species).

  19. Function and Evolution of DNA Methylation in Nasonia vitripennis

    PubMed Central

    Wang, Xu; Wheeler, David; Avery, Amanda; Rago, Alfredo; Choi, Jeong-Hyeon; Colbourne, John K.; Clark, Andrew G.; Werren, John H.

    2013-01-01

    The parasitoid wasp Nasonia vitripennis is an emerging genetic model for functional analysis of DNA methylation. Here, we characterize genome-wide methylation at a base-pair resolution, and compare these results to gene expression across five developmental stages and to methylation patterns reported in other insects. An accurate assessment of DNA methylation across the genome is accomplished using bisulfite sequencing of adult females from a highly inbred line. One-third of genes show extensive methylation over the gene body, yet methylated DNA is not found in non-coding regions and rarely in transposons. Methylated genes occur in small clusters across the genome. Methylation demarcates exon-intron boundaries, with elevated levels over exons, primarily in the 5′ regions of genes. It is also elevated near the sites of translational initiation and termination, with reduced levels in 5′ and 3′ UTRs. Methylated genes have higher median expression levels and lower expression variation across development stages than non-methylated genes. There is no difference in frequency of differential splicing between methylated and non-methylated genes, and as yet no established role for methylation in regulating alternative splicing in Nasonia. Phylogenetic comparisons indicate that many genes maintain methylation status across long evolutionary time scales. Nasonia methylated genes are more likely to be conserved in insects, but even those that are not conserved show broader expression across development than comparable non-methylated genes. Finally, examination of duplicated genes shows that those paralogs that have lost methylation in the Nasonia lineage following gene duplication evolve more rapidly, show decreased median expression levels, and increased specialization in expression across development. Methylation of Nasonia genes signals constitutive transcription across developmental stages, whereas non-methylated genes show more dynamic developmental expression patterns. We speculate that loss of methylation may result in increased developmental specialization in evolution and acquisition of methylation may lead to broader constitutive expression. PMID:24130511

  20. Differential mantle transcriptomics and characterization of growth-related genes in the diploid and triploid pearl oyster Pinctada fucata.

    PubMed

    Guan, Yunyan; He, Maoxian; Wu, Houbo

    2017-06-01

    To explore the molecular mechanism of triploidy effect in the pearl oyster Pinctada fucata, two RNA-seq libraries were constructed from the mantle tissue of diploids and triploids by Roche-454 massive parallel pyrosequencing. The identification of differential expressed genes (DEGs) between diploid and triploid may reveal the molecular mechanism of triploidy effect. In this study, 230 down-regulated and 259 up-regulated DEGs were obtained by comparison between diploid and triploid libraries. The gene ontology and KEGG pathway analysis revealed more functional activation in triploids and it may due to the duplicated gene expression in transcriptional level during whole genome duplication (WGD). To confirm the sequencing data, a set of 11 up-regulated genes related to growth and development control and regulation were analyzed by RT-qPCR in independent experiment. According to the validation and annotation of these genes, it is hypothesized that the set of up-regulated expressed genes had the correlated expression pattern involved in shell building or other interactive probable functions during triploidization. The up- regulation of growth-related genes may support the classic hypotheses of 'energy redistribution' from early research. The results provide valuable resources to understand the molecular mechanism of triploidy effect in both shell building and producing high-quality seawater pearls. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. The early stages of duplicate gene evolution

    PubMed Central

    Moore, Richard C.; Purugganan, Michael D.

    2003-01-01

    Gene duplications are one of the primary driving forces in the evolution of genomes and genetic systems. Gene duplicates account for 8–20% of the genes in eukaryotic genomes, and the rates of gene duplication are estimated at between 0.2% and 2% per gene per million years. Duplicate genes are believed to be a major mechanism for the establishment of new gene functions and the generation of evolutionary novelty, yet very little is known about the early stages of the evolution of duplicated gene pairs. It is unclear, for example, to what extent selection, rather than neutral genetic drift, drives the fixation and early evolution of duplicate loci. Analysis of recently duplicated genes in the Arabidopsis thaliana genome reveals significantly reduced species-wide levels of nucleotide polymorphisms in the progenitor and/or duplicate gene copies, suggesting that selective sweeps accompany the initial stages of the evolution of these duplicated gene pairs. Our results support recent theoretical work that indicates that fates of duplicate gene pairs may be determined in the initial phases of duplicate gene evolution and that positive selection plays a prominent role in the evolutionary dynamics of the very early histories of duplicate nuclear genes. PMID:14671323

  2. Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate.

    PubMed

    Adomako-Ankomah, Yaw; English, Elizabeth D; Danielson, Jeffrey J; Pernas, Lena F; Parker, Michelle L; Boulanger, Martin J; Dubey, Jitender P; Boyle, Jon P

    2016-05-01

    In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA(+) paralogs. Additionally, we found that exogenous expression of an HMA(+) paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. Copyright © 2016 by the Genetics Society of America.

  3. Evolution of Gene Duplication in Plants1[OPEN

    PubMed Central

    2016-01-01

    Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. PMID:27288366

  4. Evolution of Gene Duplication in Plants.

    PubMed

    Panchy, Nicholas; Lehti-Shiu, Melissa; Shiu, Shin-Han

    2016-08-01

    Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. © 2016 American Society of Plant Biologists. All Rights Reserved.

  5. High mature grain phytase activity in the Triticeae has evolved by duplication followed by neofunctionalization of the purple acid phosphatase phytase (PAPhy) gene

    PubMed Central

    Brinch-Pedersen, Henrik

    2013-01-01

    The phytase activity in food and feedstuffs is an important nutritional parameter. Members of the Triticeae tribe accumulate purple acid phosphatase phytases (PAPhy) during grain filling. This accumulation elevates mature grain phytase activities (MGPA) up to levels between ~650 FTU/kg for barley and 6000 FTU/kg for rye. This is notably more than other cereals. For instance, rice, maize, and oat have MGPAs below 100 FTU/kg. The cloning and characterization of the PAPhy gene complement from wheat, barley, rye, einkorn, and Aegilops tauschii is reported here. The Triticeae PAPhy genes generally consist of a set of paralogues, PAPhy_a and PAPhy_b, and have been mapped to Triticeae chromosomes 5 and 3, respectively. The promoters share a conserved core but the PAPhy_a promoter have acquired a novel cis-acting regulatory element for expression during grain filling while the PAPhy_b promoter has maintained the archaic function and drives expression during germination. Brachypodium is the only sequenced Poaceae sharing the PAPhy duplication. As for the Triticeae, the duplication is reflected in a high MGPA of ~4200 FTU/kg in Brachypodium. The sequence conservation of the paralogous loci on Brachypodium chromosomes 1 and 2 does not extend beyond the PAPhy gene. The results indicate that a single-gene segmental duplication may have enabled the evolution of high MGPA by creating functional redundancy of the parent PAPhy gene. This implies that similar MGPA levels may be out of reach in breeding programs for some Poaceae, e.g. maize and rice, whereas Triticeae breeders should focus on PAPhy_a. PMID:23918958

  6. Molecular genetic basis of pod corn (Tunicate maize)

    PubMed Central

    Wingen, Luzie U.; Münster, Thomas; Faigl, Wolfram; Deleu, Wim; Sommer, Hans; Saedler, Heinz; Theißen, Günter

    2012-01-01

    Pod corn is a classic morphological mutant of maize in which the mature kernels of the cob are covered by glumes, in contrast to generally grown maize varieties in which kernels are naked. Pod corn, known since pre-Columbian times, is the result of a dominant gain-of-function mutation at the Tunicate (Tu) locus. Some classic articles of 20th century maize genetics reported that the mutant Tu locus is complex, but molecular details remained elusive. Here, we show that pod corn is caused by a cis-regulatory mutation and duplication of the ZMM19 MADS-box gene. Although the WT locus contains a single-copy gene that is expressed in vegetative organs only, mutation and duplication of ZMM19 in Tu lead to ectopic expression of the gene in the inflorescences, thus conferring vegetative traits to reproductive organs. PMID:22517751

  7. Profiling of gene duplication patterns of sequenced teleost genomes: evidence for rapid lineage-specific genome expansion mediated by recent tandem duplications.

    PubMed

    Lu, Jianguo; Peatman, Eric; Tang, Haibao; Lewis, Joshua; Liu, Zhanjiang

    2012-06-15

    Gene duplication has had a major impact on genome evolution. Localized (or tandem) duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks), and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting their origin of independent and continuous duplication. This is particularly true for the zebrafish genome. Further analysis of the duplicated gene sets indicated that a significant portion of duplicated genes in the zebrafish genome were of recent, lineage-specific duplication events. Most strikingly, a subset of duplicated genes is enriched among the recently duplicated genes involved in immune or sensory response pathways. Such findings demonstrated the significance of continuous gene duplication as well as that of whole genome duplication in the course of genome evolution.

  8. Expansion and stress responses of the AP2/EREBP superfamily in cotton.

    PubMed

    Liu, Chunxiao; Zhang, Tianzhen

    2017-01-31

    The allotetraploid cotton originated from one hybridization event between an extant progenitor of Gosssypium herbaceum (A 1 ) or G. arboreum (A 2 ) and another progenitor, G. raimondii Ulbrich (D 5 ) 1-1.5 million years ago (Mya). The APETALA2/ethylene-responsive element binding protein (AP2/EREBP) transcription factors constitute one of the largest and most conserved gene families in plants. They are characterized by their AP2 domain, which comprises 60-70 amino acids, and are classified into four main subfamilies: the APETALA2 (AP2), Related to ABI3/VP1 (RAV), Dehydration-Responsive Element Binding protein (DREB) and Ethylene-Responsive Factor (ERF) subfamilies. The AP2/EREBP genes play crucial roles in plant growth, development and biotic and abiotic stress responses. Hence, understanding the molecular characteristics of cotton stress tolerance and gene family expansion would undoubtedly facilitate cotton resistance breeding and evolution research. A total of 269 AP2/EREBP genes were identified in the G. raimondii (D5) cotton genome. The protein domain architecture and intron/exon structure are simple and relatively conserved within each subfamily. They are distributed throughout all chromosomes but are clustered on various chromosomes due to genomic tandem duplication. We identified 73 tandem duplicated genes and 221 segmental duplicated gene pairs which contributed to the expansion of AP2/EREBP superfamily. Of them, tandem duplication was the most important force of the expansion of the B3 group. Transcriptome analysis showed that 504 AP2/EREBP genes were expressed in at least one tested G. hirsutum TM-1 tissues. In G. hirsutum, 151 non-repeated genes of the DREB and ERF subfamily genes were responsive to different stresses: 132 genes were induced by cold, 63 genes by drought and 94 genes by heat. qRT-PCR confirmed that 13 GhDREB and 15 GhERF genes were induced by cold and/or drought. No transcripts detected for 53 of the 111 tandem duplicated genes in TM-1. In addition, some homoeologous genes showed biased expression toward either A-or D-subgenome. The AP2/EREBP genes were obviously expanded in Gossypium. The GhDREB and GhERF genes play crucial roles in cotton stress responses. Our genome-wide analysis of AP2/EREBP genes in cotton provides valuable information for characterizing the molecular functions of AP2/EREBP genes and reveals insights into their evolution in polyploid plants.

  9. Expansion by whole genome duplication and evolution of the sox gene family in teleost fish

    PubMed Central

    Naville, Magali; Volff, Jean-Nicolas

    2017-01-01

    It is now recognized that several rounds of whole genome duplication (WGD) have occurred during the evolution of vertebrates, but the link between WGDs and phenotypic diversification remains unsolved. We have investigated in this study the impact of the teleost-specific WGD on the evolution of the sox gene family in teleostean fishes. The sox gene family, which encodes for transcription factors, has essential role in morphology, physiology and behavior of vertebrates and teleosts, the current largest group of vertebrates. We have first redrawn the evolution of all sox genes identified in eleven teleost genomes using a comparative genomic approach including phylogenetic and synteny analyses. We noticed, compared to tetrapods, an important expansion of the sox family: 58% (11/19) of sox genes are duplicated in teleost genomes. Furthermore, all duplicated sox genes, except sox17 paralogs, are derived from the teleost-specific WGD. Then, focusing on five sox genes, analyzing the evolution of coding and non-coding sequences, as well as the expression patterns in fish embryos and adult tissues, we demonstrated that these paralogs followed lineage-specific evolutionary trajectories in teleost genomes. This work, based on whole genome data from multiple teleostean species, supports the contribution of WGDs to the expansion of gene families, as well as to the emergence of genomic differences between lineages that might promote genetic and phenotypic diversity in teleosts. PMID:28738066

  10. Birth of a new gene on the Y chromosome of Drosophila melanogaster

    PubMed Central

    Carvalho, Antonio Bernardo; Vicoso, Beatriz; Russo, Claudia A. M.; Swenor, Bonnielin; Clark, Andrew G.

    2015-01-01

    Contrary to the pattern seen in mammalian sex chromosomes, where most Y-linked genes have X-linked homologs, the Drosophila X and Y chromosomes appear to be unrelated. Most of the Y-linked genes have autosomal paralogs, so autosome-to-Y transposition must be the main source of Drosophila Y-linked genes. Here we show how these genes were acquired. We found a previously unidentified gene (flagrante delicto Y, FDY) that originated from a recent duplication of the autosomal gene vig2 to the Y chromosome of Drosophila melanogaster. Four contiguous genes were duplicated along with vig2, but they became pseudogenes through the accumulation of deletions and transposable element insertions, whereas FDY remained functional, acquired testis-specific expression, and now accounts for ∼20% of the vig2-like mRNA in testis. FDY is absent in the closest relatives of D. melanogaster, and DNA sequence divergence indicates that the duplication to the Y chromosome occurred ∼2 million years ago. Thus, FDY provides a snapshot of the early stages of the establishment of a Y-linked gene and demonstrates how the Drosophila Y has been accumulating autosomal genes. PMID:26385968

  11. Duplication of Dio3 genes in teleost fish and their divergent expression in skin during flatfish metamorphosis.

    PubMed

    Alves, R N; Cardoso, J C R; Harboe, T; Martins, R S T; Manchado, M; Norberg, B; Power, D M

    2017-05-15

    Deiodinase 3 (Dio3) plays an essential role during early development in vertebrates by controlling tissue thyroid hormone (TH) availability. The Atlantic halibut (Hippoglossus hippoglossus) possesses duplicate dio3 genes (dio3a and dio3b). Expression analysis indicates that dio3b levels change in abocular skin during metamorphosis and this suggests that this enzyme is associated with the divergent development of larval skin to the juvenile phenotype. In larvae exposed to MMI, a chemical that inhibits TH production, expression of dio3b in ocular skin is significantly up-regulated suggesting that THs normally modulate this genes expression during this developmental event. The molecular basis for divergent dio3a and dio3b expression and responsiveness to MMI treatment is explained by the multiple conserved TREs in the proximal promoter region of teleost dio3b and their absence from the promoter of dio3a. We propose that the divergent expression of dio3 in ocular and abocular skin during halibut metamorphosis contributes to the asymmetric pigment development in response to THs. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Duplication and Whorl-Specific Down-Regulation of the Obligate AP3-PI Heterodimer Genes Explain the Origin of Paeonia lactiflora Plants with Spontaneous Corolla Mutation.

    PubMed

    Gong, Pichang; Ao, Xiang; Liu, Gaixiu; Cheng, Fangyun; He, Chaoying

    2017-03-01

    Herbaceous peony (Paeonia lactiflora) is a globally important ornamental plant. Spontaneous floral mutations occur frequently during cultivation, and are selected as a way to release new cultivars, but the underlying evolutionary developmental genetics remain largely elusive. Here, we investigated a collection of spontaneous corolla mutational plants (SCMPs) whose other floral organs were virtually unaffected. Unlike the corolla in normal plants (NPs) that withered soon after fertilization, the transformed corolla (petals) in SCMPs was greenish and persistent similar to the calyx (sepals). Epidermal cellular morphology of the SCMP corolla was also similar to that of calyx cells, further suggesting a sepaloid corolla in SCMPs. Ten floral MADS-box genes from these Paeonia plants were comparatively characterized with respect to sequence and expression. Codogenic sequence variation of these MADS-box genes was not linked to corolla changes in SCMPs. However, we found that both APETALA3 (AP3) and PISTILLATA (PI) lineages of B-class MADS-box genes were duplicated, and subsequent selective expression alterations of these genes were closely associated with the origin of SCMPs. AP3-PI obligate heterodimerization, essential for organ identity of corolla and stamens, was robustly detected. However, selective down-regulation of these duplicated genes might result in a reduction of this obligate heterodimer concentration in a corolla-specific manner, leading to the sepaloid corolla in SCMPs, thus representing a new sepaloid corolla model taking advantage of gene duplication. Our work suggests that modifying floral MADS-box genes could facilitate the breeding of novel cultivars with distinct floral morphology in ornamental plants, and also provides new insights into the functional evolution of the MADS-box genes in plants. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  13. History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis

    PubMed Central

    2014-01-01

    Background The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway. Results We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models. Conclusions Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages. PMID:25250171

  14. The Goddard and Saturn Genes Are Essential for Drosophila Male Fertility and May Have Arisen De Novo.

    PubMed

    Gubala, Anna M; Schmitz, Jonathan F; Kearns, Michael J; Vinh, Tery T; Bornberg-Bauer, Erich; Wolfner, Mariana F; Findlay, Geoffrey D

    2017-05-01

    New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Genome-wide identification and analysis of the SBP-box family genes in apple (Malus × domestica Borkh.).

    PubMed

    Li, Jun; Hou, Hongmin; Li, Xiaoqin; Xiang, Jiang; Yin, Xiangjing; Gao, Hua; Zheng, Yi; Bassett, Carole L; Wang, Xiping

    2013-09-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode a family of plant-specific transcription factors and play many crucial roles in plant development. In this study, 27 SBP-box gene family members were identified in the apple (Malus × domestica Borkh.) genome, 15 of which were suggested to be putative targets of MdmiR156. Plant SBPs were classified into eight groups according to the phylogenetic analysis of SBP-domain proteins. Gene structure, gene chromosomal location and synteny analyses of MdSBP genes within the apple genome demonstrated that tandem and segmental duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of the SBP-box gene family in apple. Additionally, synteny analysis between apple and Arabidopsis indicated that several paired homologs of MdSBP and AtSPL genes were located in syntenic genomic regions. Tissue-specific expression analysis of MdSBP genes in apple demonstrated their diversified spatiotemporal expression patterns. Most MdmiR156-targeted MdSBP genes, which had relatively high transcript levels in stems, leaves, apical buds and some floral organs, exhibited a more differential expression pattern than most MdmiR156-nontargeted MdSBP genes. Finally, expression analysis of MdSBP genes in leaves upon various plant hormone treatments showed that many MdSBP genes were responsive to different plant hormones, indicating that MdSBP genes may be involved in responses to hormone signaling during stress or in apple development. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  16. Phylogenetic analysis of IDD gene family and characterization of its expression in response to flower induction in Malus.

    PubMed

    Fan, Sheng; Zhang, Dong; Xing, Libo; Qi, Siyan; Du, Lisha; Wu, Haiqin; Shao, Hongxia; Li, Youmei; Ma, Juanjuan; Han, Mingyu

    2017-08-01

    Although INDETERMINATE DOMAIN (IDD) genes encoding specific plant transcription factors have important roles in plant growth and development, little is known about apple IDD (MdIDD) genes and their potential functions in the flower induction. In this study, we identified 20 putative IDD genes in apple and named them according to their chromosomal locations. All identified MdIDD genes shared a conserved IDD domain. A phylogenetic analysis separated MdIDDs and other plant IDD genes into four groups. Bioinformatic analysis of chemical characteristics, gene structure, and prediction of protein-protein interactions demonstrated the functional and structural diversity of MdIDD genes. To further uncover their potential functions, we performed analysis of tandem, synteny, and gene duplications, which indicated several paired homologs of IDD genes between apple and Arabidopsis. Additionally, genome duplications also promoted the expansion and evolution of the MdIDD genes. Quantitative real-time PCR revealed that all the MdIDD genes showed distinct expression levels in five different tissues (stems, leaves, buds, flowers, and fruits). Furthermore, the expression levels of candidate MdIDD genes were also investigated in response to various circumstances, including GA treatment (decreased the flowering rate), sugar treatment (increased the flowering rate), alternate-bearing conditions, and two varieties with different-flowering intensities. Parts of them were affected by exogenous treatments and showed different expression patterns. Additionally, changes in response to alternate-bearing and different-flowering varieties of apple trees indicated that they were also responsive to flower induction. Taken together, our comprehensive analysis provided valuable information for further analysis of IDD genes aiming at flower induction.

  17. Evolutionary and Expression Analyses of the Apple Basic Leucine Zipper Transcription Factor Family

    PubMed Central

    Zhao, Jiao; Guo, Rongrong; Guo, Chunlei; Hou, Hongmin; Wang, Xiping; Gao, Hua

    2016-01-01

    Transcription factors (TFs) play essential roles in the regulatory networks controlling many developmental processes in plants. Members of the basic leucine (Leu) zipper (bZIP) TF family, which is unique to eukaryotes, are involved in regulating diverse processes, including flower and vascular development, seed maturation, stress signaling, and defense responses to pathogens. The bZIP proteins have a characteristic bZIP domain composed of a DNA-binding basic region and a Leu zipper dimerization region. In this study, we identified 112 apple (Malus domestica Borkh) bZIP TF-encoding genes, termed MdbZIP genes. Synteny analysis indicated that segmental and tandem duplication events, as well as whole genome duplication, have contributed to the expansion of the apple bZIP family. The family could be divided into 11 groups based on structural features of the encoded proteins, as well as on the phylogenetic relationship of the apple bZIP proteins to those of the model plant Arabidopsis thaliana (AtbZIP genes). Synteny analysis revealed that several paired MdbZIP genes and AtbZIP gene homologs were located in syntenic genomic regions. Furthermore, expression analyses of group A MdbZIP genes showed distinct expression levels in 10 different organs. Moreover, changes in these expression profiles in response to abiotic stress conditions and various hormone treatments identified MdbZIP genes that were responsive to high salinity and drought, as well as to different phytohormones. PMID:27066030

  18. Evolutionary and Expression Analyses of the Apple Basic Leucine Zipper Transcription Factor Family.

    PubMed

    Zhao, Jiao; Guo, Rongrong; Guo, Chunlei; Hou, Hongmin; Wang, Xiping; Gao, Hua

    2016-01-01

    Transcription factors (TFs) play essential roles in the regulatory networks controlling many developmental processes in plants. Members of the basic leucine (Leu) zipper (bZIP) TF family, which is unique to eukaryotes, are involved in regulating diverse processes, including flower and vascular development, seed maturation, stress signaling, and defense responses to pathogens. The bZIP proteins have a characteristic bZIP domain composed of a DNA-binding basic region and a Leu zipper dimerization region. In this study, we identified 112 apple (Malus domestica Borkh) bZIP TF-encoding genes, termed MdbZIP genes. Synteny analysis indicated that segmental and tandem duplication events, as well as whole genome duplication, have contributed to the expansion of the apple bZIP family. The family could be divided into 11 groups based on structural features of the encoded proteins, as well as on the phylogenetic relationship of the apple bZIP proteins to those of the model plant Arabidopsis thaliana (AtbZIP genes). Synteny analysis revealed that several paired MdbZIP genes and AtbZIP gene homologs were located in syntenic genomic regions. Furthermore, expression analyses of group A MdbZIP genes showed distinct expression levels in 10 different organs. Moreover, changes in these expression profiles in response to abiotic stress conditions and various hormone treatments identified MdbZIP genes that were responsive to high salinity and drought, as well as to different phytohormones.

  19. Diversification of CYCLOIDEA expression in the evolution of bilateral flower symmetry in Caprifoliaceae and Lonicera (Dipsacales)

    PubMed Central

    Howarth, Dianella G.; Martins, Tiago; Chimney, Edward; Donoghue, Michael J.

    2011-01-01

    Background and Aims The expression of floral symmetry genes is examined in the CYCLOIDEA lineage following duplication, and these are linked to changes in flower morphology. The study focuses on Dipsacales, comparing DipsCYC2 gene expression in Viburnum (radially symmetrical Adoxaceae) to members of early-diverging lineages of the bilaterally symmetrical Caprifoliaceae (Diervilla and Lonicera). Methods Floral tissue from six species, which included dorsal, lateral and ventral regions of the corolla, was dissected. RNA was extracted from these tissues and each copy of DipsCYC2 was amplified with reverse transcriptase PCR. Key Results Members of DipsCYC2 were expressed across the corolla in the radially symmetrical Viburnum plicatum. A shift to bilaterally symmetrical flowers at the base of the Caprifoliaceae was accompanied by a duplication of the DipsCYC2 gene, resulting in DipsCYC2A and DipsCYC2B, and by loss of expression of both of these copies in the ventral petal. In Lonicera (Caprifolieae), there is a shift from flowers with two dorsally and three ventrally oriented corolla lobes to a clear differentiation of dorsal, lateral and ventral lobes. This shift entailed a decoupling of expression of DipsCYC2A and DipsCYC2B; DipsCYC2B continues to be expressed in the dorsal and lateral lobes, while DipsCYC2A expression is restricted to just the two dorsal lobes. A reversion to more radially symmetrical flowers within Lonicera was accompanied by a re-expansion of expression of both DipsCYC2A and DipsCYC2B. Conclusions The transition to bilateral symmetry in Caprifoliaceae involved: (a) duplication of an ancestral DipsCYC2 gene; (b) the loss of expression of both of these copies in the ventral petal; and (c) changes in the zone of expression, with one copy continuing to be expressed across the dorsal and lateral petals, and the other copy becoming restricted in expression to the dorsal corolla lobes. PMID:21478175

  20. Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

    PubMed

    Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui

    2016-01-01

    WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.

  1. Ancient Duplications and Expression Divergence in the Globin Gene Superfamily of Vertebrates: Insights from the Elephant Shark Genome and Transcriptome

    PubMed Central

    Opazo, Juan C.; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F.

    2015-01-01

    Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. PMID:25743544

  2. Genotype-phenotype characterization in 13 individuals with chromosome Xp11.22 duplications.

    PubMed

    Grams, Sarah E; Argiropoulos, Bob; Lines, Matthew; Chakraborty, Pranesh; Mcgowan-Jordan, Jean; Geraghty, Michael T; Tsang, Marilyn; Eswara, Marthand; Tezcan, Kamer; Adams, Kelly L; Linck, Leesa; Himes, Patricia; Kostiner, Dana; Zand, Dina J; Stalker, Heather; Driscoll, Daniel J; Huang, Taosheng; Rosenfeld, Jill A; Li, Xu; Chen, Emily

    2016-04-01

    We report 13 new individuals with duplications in Xp11.22-p11.23. The index family has one male and two female members in three generations with mild-severe intellectual disability (ID), speech delay, dysmorphic features, early puberty, constipation, and/or hand and foot abnormalities. Affected individuals were found to have two small duplications in Xp11.22 at nucleotide position (hg19) 50,112,063-50,456,458 bp (distal) and 53,160,114-53,713,154 bp (proximal). Collectively, these two regions include 14 RefSeq genes, prompting collection of a larger cohort of patients, in an attempt to delineate critical genes associated with the observed phenotype. In total, we have collected data on nine individuals with duplications overlapping the distal duplication region containing SHROOM4 and DGKK and eight individuals overlapping the proximal region including HUWE1. Duplications of HUWE1 have been previously associated with non-syndromic ID. Our data, with previously published reports, suggest that duplications involving SHROOM4 and DGKK may represent a new syndromic X-linked ID critical region associated with mild to severe ID, speech delay +/- dysarthria, attention deficit disorder, precocious puberty, constipation, and motor delay. We frequently observed foot abnormalities, 5th finger clinodactyly, tapering fingers, constipation, and exercise intolerance in patients with duplications of these two genes. Regarding duplications including the proximal region, our observations agree with previous studies, which have found associations with intellectual disability. In addition, expressive language delay, failure to thrive, motor delay, and 5th finger clinodactyly were also frequently observed in patients with the proximal duplication. © 2015 Wiley Periodicals, Inc.

  3. Segmental duplications: evolution and impact among the current Lepidoptera genomes.

    PubMed

    Zhao, Qian; Ma, Dongna; Vasseur, Liette; You, Minsheng

    2017-07-06

    Structural variation among genomes is now viewed to be as important as single nucleoid polymorphisms in influencing the phenotype and evolution of a species. Segmental duplication (SD) is defined as segments of DNA with homologous sequence. Here, we performed a systematic analysis of segmental duplications (SDs) among five lepidopteran reference genomes (Plutella xylostella, Danaus plexippus, Bombyx mori, Manduca sexta and Heliconius melpomene) to understand their potential impact on the evolution of these species. We find that the SDs content differed substantially among species, ranging from 1.2% of the genome in B. mori to 15.2% in H. melpomene. Most SDs formed very high identity (similarity higher than 90%) blocks but had very few large blocks. Comparative analysis showed that most of the SDs arose after the divergence of each linage and we found that P. xylostella and H. melpomene showed more duplications than other species, suggesting they might be able to tolerate extensive levels of variation in their genomes. Conserved ancestral and species specific SD events were assessed, revealing multiple examples of the gain, loss or maintenance of SDs over time. SDs content analysis showed that most of the genes embedded in SDs regions belonged to species-specific SDs ("Unique" SDs). Functional analysis of these genes suggested their potential roles in the lineage-specific evolution. SDs and flanking regions often contained transposable elements (TEs) and this association suggested some involvement in SDs formation. Further studies on comparison of gene expression level between SDs and non-SDs showed that the expression level of genes embedded in SDs was significantly lower, suggesting that structure changes in the genomes are involved in gene expression differences in species. The results showed that most of the SDs were "unique SDs", which originated after species formation. Functional analysis suggested that SDs might play different roles in different species. Our results provide a valuable resource beyond the genetic mutation to explore the genome structure for future Lepidoptera research.

  4. Positive selection and ancient duplications in the evolution of class B floral homeotic genes of orchids and grasses

    PubMed Central

    Mondragón-Palomino, Mariana; Hiese, Luisa; Härter, Andrea; Koch, Marcus A; Theißen, Günter

    2009-01-01

    Background Positive selection is recognized as the prevalence of nonsynonymous over synonymous substitutions in a gene. Models of the functional evolution of duplicated genes consider neofunctionalization as key to the retention of paralogues. For instance, duplicate transcription factors are specifically retained in plant and animal genomes and both positive selection and transcriptional divergence appear to have played a role in their diversification. However, the relative impact of these two factors has not been systematically evaluated. Class B MADS-box genes, comprising DEF-like and GLO-like genes, encode developmental transcription factors essential for establishment of perianth and male organ identity in the flowers of angiosperms. Here, we contrast the role of positive selection and the known divergence in expression patterns of genes encoding class B-like MADS-box transcription factors from monocots, with emphasis on the family Orchidaceae and the order Poales. Although in the monocots these two groups are highly diverse and have a strongly canalized floral morphology, there is no information on the role of positive selection in the evolution of their distinctive flower morphologies. Published research shows that in Poales, class B-like genes are expressed in stamens and in lodicules, the perianth organs whose identity might also be specified by class B-like genes, like the identity of the inner tepals of their lily-like relatives. In orchids, however, the number and pattern of expression of class B-like genes have greatly diverged. Results The DEF-like genes from Orchidaceae form four well-supported, ancient clades of orthologues. In contrast, orchid GLO-like genes form a single clade of ancient orthologues and recent paralogues. DEF-like genes from orchid clade 2 (OMADS3-like genes) are under less stringent purifying selection than the other orchid DEF-like and GLO-like genes. In comparison with orchids, purifying selection was less stringent in DEF-like and GLO-like genes from Poales. Most importantly, positive selection took place before the major organ reduction and losses in the floral axis that eventually yielded the zygomorphic grass floret. Conclusion In DEF-like genes of Poales, positive selection on the region mediating interactions with other proteins or DNA could have triggered the evolution of the regulatory mechanisms behind the development of grass-specific reproductive structures. Orchidaceae show a different trend, where gene duplication and transcriptional divergence appear to have played a major role in the canalization and modularization of perianth development. PMID:19383167

  5. Regulatory divergence of homeologous Atlantic salmon elovl5 genes following the salmonid-specific whole-genome duplication.

    PubMed

    Carmona-Antoñanzas, Greta; Zheng, Xiaozhong; Tocher, Douglas R; Leaver, Michael J

    2016-10-10

    Fatty acyl elongase 5 (elovl5) is a critical enzyme in the vertebrate biosynthetic pathway which produces the physiologically essential long-chain polyunsaturated fatty acids (LC-PUFA), docosahexenoic acid (DHA), and eicosapentenoic acid (EPA) from 18 carbon fatty acids precursors. In contrast to most other vertebrates, Atlantic salmon possess two copies of elovl5 (elovl5a and elovl5b) as a result of a whole-genome duplication (WGD) which occurred at the base of the salmonid lineage. WGDs have had a major influence on vertebrate evolution, providing extra genetic material, enabling neofunctionalization to accelerate adaptation and speciation. However, little is known about the mechanisms by which such duplicated homeologous genes diverge. Here we show that homeologous Atlantic salmon elovl5a and elovl5b genes have been asymmetrically colonised by transposon-like elements. Identical locations and identities of insertions are also present in the rainbow trout duplicate elovl5 genes, but not in the nearest extant representative preduplicated teleost, the northern pike. Both elovl5 salmon duplicates possessed conserved regulatory elements that promoted Srebp1- and Srebp2-dependent transcription, and differences in the magnitude of Srebp response between promoters could be attributed to a tandem duplication of SRE and NF-Y cofactor binding sites in elovl5b. Furthermore, an insertion in the promoter region of elovl5a confers responsiveness to Lxr/Rxr transcriptional activation. Our results indicate that most, but not all, transposon mobilisation into elovl5 genes occurred after the split from the common ancestor of pike and salmon, but before more recent salmonid speciations, and that divergence of elovl5 regulatory regions have enabled neofuntionalization by promoting differential expression of these homeologous genes. Copyright © 2016 Elsevier B.V. All rights reserved.

  6. Recent gene duplication and subfunctionalization produced a mitochondrial GrpE, the nucleotide exchange factor of the Hsp70 complex, specialized in thermotolerance to chronic heat stress in Arabidopsis.

    PubMed

    Hu, Catherine; Lin, Siou-ying; Chi, Wen-tzu; Charng, Yee-yung

    2012-02-01

    The duplication and divergence of heat stress (HS) response genes might help plants adapt to varied HS conditions, but little is known on the topic. Here, we examined the evolution and function of Arabidopsis (Arabidopsis thaliana) mitochondrial GrpE (Mge) proteins. GrpE acts as a nucleotide-exchange factor in the Hsp70/DnaK chaperone machinery. Genomic data show that AtMge1 and AtMge2 arose from a recent whole-genome duplication event. Phylogenetic analysis indicated that duplication and preservation of Mges occurred independently in many plant species, which suggests a common tendency in the evolution of the genes. Intron retention contributed to the divergence of the protein structure of Mge paralogs in higher plants. In both Arabidopsis and tomato (Solanum lycopersicum), Mge1 is induced by ultraviolet B light and Mge2 is induced by heat, which suggests regulatory divergence of the genes. Consistently, AtMge2 but not AtMge1 is under the control of HsfA1, the master regulator of the HS response. Heterologous expression of AtMge2 but not AtMge1 in the temperature-sensitive Escherichia coli grpE mutant restored its growth at 43°C. Arabidopsis T-DNA knockout lines under different HS regimes revealed that Mge2 is specifically required for tolerating prolonged exposure to moderately high temperature, as compared with the need of the heat shock protein 101 and the HS-associated 32-kD protein for short-term extreme heat. Therefore, with duplication and subfunctionalization, one copy of the Arabidopsis Mge genes became specialized in a distinct type of HS. We provide direct evidence supporting the connection between gene duplication and adaptation to environmental stress.

  7. Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility.

    PubMed

    Nuttle, Xander; Giannuzzi, Giuliana; Duyzend, Michael H; Schraiber, Joshua G; Narvaiza, Iñigo; Sudmant, Peter H; Penn, Osnat; Chiatante, Giorgia; Malig, Maika; Huddleston, John; Benner, Chris; Camponeschi, Francesca; Ciofi-Baffoni, Simone; Stessman, Holly A F; Marchetto, Maria C N; Denman, Laura; Harshman, Lana; Baker, Carl; Raja, Archana; Penewit, Kelsi; Janke, Nicolette; Tang, W Joyce; Ventura, Mario; Banci, Lucia; Antonacci, Francesca; Akey, Joshua M; Amemiya, Chris T; Gage, Fred H; Reymond, Alexandre; Eichler, Evan E

    2016-08-11

    Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.

  8. The Use of Duplication-Generating Rearrangements for Studying Heterokaryon Incompatibility Genes in Neurospora

    PubMed Central

    Perkins, David D.

    1975-01-01

    Heterokaryon (vegetative) incompatibility, governing the fusion of somatic hyphal filaments to form stable heterokaryons, is of interest because of its widespread occurrence in fungi and its bearing on cellular recognition. Conventional investigations of the genetic basis of heterokaryon incompatibility in N. crassa are difficult because in commonly used stocks differences are present at several het loci, all with similar incompatibility phenotypes. This difficulty is overcome by using duplications (partial diploids) that are unlikely to contain more than one het locus. A phenotypically expressed incompatibility reaction occurs when unlike het alleles are present within the same somatic nucleus, and this parallels the heterokaryon incompatibility reaction that occurs when unlike alleles in different haploid nuclei are introduced into the same somatic hypha by mycelial fusion.—Nontandem duplications were used to confirm that the incompatibility reactions in heterokaryons and in duplications are alternate expressions of the same genes. This was demonstrated for three loci which had previously been established by conventional heterokaryon tests—het-e, het-c and mt. These were each obtained in duplications as recombinant meiotic segregants from crosses heterozygous for duplication-generating chromosome rearrangements. The particular method of producing the duplications is irrelevant so long as the incompatibility alleles are heterozygous.—The duplication technique has made it possible to determine easily the het-e and het-c genotypes of numerous laboratory and wild strains of unknown constitution. In laboratory strains both loci are represented simply by two alleles. Analysis of het-c is more complicated in some wild strains, where differences have been demonstrated at one or more additional het loci within the duplication used and multiple allelism is also possible.—The results show that the duplication method can be used to identify and map additional vegetative incompatibility loci, without the necessity of heterokaryon tests. PMID:124288

  9. Japanese neuropathy patients with peripheral myelin protein-22 gene aneuploidy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lebo, R.V.; Li, L.Y.; Flandermeyer, R.R.

    1994-09-01

    Peripheral myelin protein (PMP-22) gene aneuploidy results in Charcot-Marie-Tooth disease Type 1A (CMT1A) and the Hereditary Neuropathy with Liability to Pressure Palsy (HNPP) in Japanese patients as well as Caucasian Americans. Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, results when expression of one of at least seven genes is defective. CMT1A, about half of all CMT mutations, is usually associated with a duplication spanning the peripheral myelin protein-22 gene on distal chromosome band 17p11.2. Autosomal dominant HNPP (hereditary pressure and sensory neuropathy, HPSN) results from a deletion of the CMT1A gene region. Multicolor in situ hybridization with PMP-22 genemore » region probe characterized HNPP deletion reliably and detected all different size duplications reported previously. In summary, 72% of 28 Japanese CMT1 (HMSNI) patients tested had the CMT1A duplication, while none of the CMT2 (HMSNII) or CMT3 (HMSNIII) patients had a duplication. Three cases of HNPP were identified by deletion of the CMT1A gene region on chromosome 17p. HNPP and CMT1A have been reported to result simultaneously from the same unequal recombination event. The lower frequency of HNPP compared to CMT1A suggests that HNPP patients have a lower reproductive fitness than CMT1A patients. This result, along with a CMT1A duplication found in an Asian Indian family, demonstrates the broad geographic distribution and high frequency of PMP-22 gene aneuploidy.« less

  10. UGT74S1 is the key player in controlling secoisolariciresinol diglucoside (SDG) formation in flax.

    PubMed

    Fofana, Bourlaye; Ghose, Kaushik; McCallum, Jason; You, Frank M; Cloutier, Sylvie

    2017-02-02

    Flax lignan, commonly known as secoisolariciresinol (SECO) diglucoside (SDG), has recently been reported with health-promoting activities, including its positive impact in metabolic diseases. However, not much was reported on the biosynthesis of SDG and its monoglucoside (SMG) until lately. Flax UGT74S1 was recently reported to sequentially glucosylate SECO into SMG and SDG in vitro. However, whether this gene is the only UGT achieving SECO glucosylation in flax was not known. Flax genome-wide mining for UGTs was performed. Phylogenetic and gene duplication analyses, heterologous gene expression and enzyme assays were conducted to identify family members closely related to UGT74S1 and to establish their roles in SECO glucosylation. A total of 299 different UGTs were identified, of which 241 (81%) were duplicated. Flax UGTs diverged 2.4-153.6 MYA and 71% were found to be under purifying selection pressure. UGT74S1, a single copy gene located on chromosome 7, displayed no evidence of duplication and was deemed to be under positive selection pressure. The phylogenetic analysis identified four main clusters where cluster 4, which included UGT74S1, was the most diverse. The duplicated UGT74S4 and UGT74S3, located on chromosomes 8 and 14, respectively, were the most closely related to UGT74S1 and were differentially expressed in different tissues. Heterologous expression levels of UGT74S1, UGT74S4 and UGT74S3 proteins were similar but UGT74S4 and UGT74S3 glucosylation activity towards SECO was seven fold less than UGT74S1. In addition, they both failed to produce SDG, suggesting neofunctionalization following their divergence from UGT74S1. We showed that UGT74S1 is closely related to two duplicated genes, UGT74S4 and UGT74S3 which, unlike UGT74S1, failed to glucosylate SMG into SDG. The study suggests that UGT74S1 may be the key player in controlling SECO glucosylation into SDG in flax although its closely related genes may also contribute to a minor extent in supplying the SMG precursor to UGT74S1.

  11. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms.

    PubMed

    Li, Zhen; Defoort, Jonas; Tasdighian, Setareh; Maere, Steven; Van de Peer, Yves; De Smet, Riet

    2016-02-01

    Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of "gene duplicability" is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. © 2016 American Society of Plant Biologists. All rights reserved.

  12. Expansion of Genes Encoding piRNA-Associated Argonaute Proteins in the Pea Aphid: Diversification of Expression Profiles in Different Plastic Morphs

    PubMed Central

    Lu, Hsiao-ling; Tanguy, Sylvie; Rispe, Claude; Gauthier, Jean-Pierre; Walsh, Tom; Gordon, Karl; Edwards, Owain; Tagu, Denis; Chang, Chun-che; Jaubert-Possamai, Stéphanie

    2011-01-01

    Piwi-interacting RNAs (piRNAs) are known to regulate transposon activity in germ cells of several animal models that propagate sexually. However, the role of piRNAs during asexual reproduction remains almost unknown. Aphids that can alternate sexual and asexual reproduction cycles in response to seasonal changes of photoperiod provide a unique opportunity to study piRNAs and the piRNA pathway in both reproductive modes. Taking advantage of the recently sequenced genome of the pea aphid Acyrthosiphon pisum, we found an unusually large lineage-specific expansion of genes encoding the Piwi sub-clade of Argonaute proteins. In situ hybridisation showed differential expressions between the duplicated piwi copies: while Api-piwi2 and Api-piwi6 are “specialised” in germ cells their most closely related copy, respectively Api-piwi5 and Api-piwi3, are expressed in the somatic cells. The differential expression was also identified in duplicated ago3: Api-ago3a in germ cells and Api-ago3b in somatic cells. Moreover, analyses of expression profiles of the expanded piwi and ago3 genes by semi-quantitative RT-PCR showed that expressions varied according to the reproductive types. These specific expression patterns suggest that expanded aphid piwi and ago3 genes have distinct roles in asexual and sexual reproduction. PMID:22162754

  13. Specific duplication and dorsoventrally asymmetric expression patterns of Cycloidea-like genes in zygomorphic species of Ranunculaceae.

    PubMed

    Jabbour, Florian; Cossard, Guillaume; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine

    2014-01-01

    Floral bilateral symmetry (zygomorphy) has evolved several times independently in angiosperms from radially symmetrical (actinomorphic) ancestral states. Homologs of the Antirrhinum majus Cycloidea gene (Cyc) have been shown to control floral symmetry in diverse groups in core eudicots. In the basal eudicot family Ranunculaceae, there is a single evolutionary transition from actinomorphy to zygomorphy in the stem lineage of the tribe Delphinieae. We characterized Cyc homologs in 18 genera of Ranunculaceae, including the four genera of Delphinieae, in a sampling that represents the floral morphological diversity of this tribe, and reconstructed the evolutionary history of this gene family in Ranunculaceae. Within each of the two RanaCyL (Ranunculaceae Cycloidea-like) lineages previously identified, an additional duplication possibly predating the emergence of the Delphinieae was found, resulting in up to four gene copies in zygomorphic species. Expression analyses indicate that the RanaCyL paralogs are expressed early in floral buds and that the duration of their expression varies between species and paralog class. At most one RanaCyL paralog was expressed during the late stages of floral development in the actinomorphic species studied whereas all paralogs from the zygomorphic species were expressed, composing a species-specific identity code for perianth organs. The contrasted asymmetric patterns of expression observed in the two zygomorphic species is discussed in relation to their distinct perianth architecture.

  14. Specific Duplication and Dorsoventrally Asymmetric Expression Patterns of Cycloidea-Like Genes in Zygomorphic Species of Ranunculaceae

    PubMed Central

    Jabbour, Florian; Cossard, Guillaume; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine

    2014-01-01

    Floral bilateral symmetry (zygomorphy) has evolved several times independently in angiosperms from radially symmetrical (actinomorphic) ancestral states. Homologs of the Antirrhinum majus Cycloidea gene (Cyc) have been shown to control floral symmetry in diverse groups in core eudicots. In the basal eudicot family Ranunculaceae, there is a single evolutionary transition from actinomorphy to zygomorphy in the stem lineage of the tribe Delphinieae. We characterized Cyc homologs in 18 genera of Ranunculaceae, including the four genera of Delphinieae, in a sampling that represents the floral morphological diversity of this tribe, and reconstructed the evolutionary history of this gene family in Ranunculaceae. Within each of the two RanaCyL (Ranunculaceae Cycloidea-like) lineages previously identified, an additional duplication possibly predating the emergence of the Delphinieae was found, resulting in up to four gene copies in zygomorphic species. Expression analyses indicate that the RanaCyL paralogs are expressed early in floral buds and that the duration of their expression varies between species and paralog class. At most one RanaCyL paralog was expressed during the late stages of floral development in the actinomorphic species studied whereas all paralogs from the zygomorphic species were expressed, composing a species-specific identity code for perianth organs. The contrasted asymmetric patterns of expression observed in the two zygomorphic species is discussed in relation to their distinct perianth architecture. PMID:24752428

  15. The house spider genome reveals an ancient whole-genome duplication during arachnid evolution.

    PubMed

    Schwager, Evelyn E; Sharma, Prashant P; Clarke, Thomas; Leite, Daniel J; Wierschin, Torsten; Pechmann, Matthias; Akiyama-Oda, Yasuko; Esposito, Lauren; Bechsgaard, Jesper; Bilde, Trine; Buffry, Alexandra D; Chao, Hsu; Dinh, Huyen; Doddapaneni, HarshaVardhan; Dugan, Shannon; Eibner, Cornelius; Extavour, Cassandra G; Funch, Peter; Garb, Jessica; Gonzalez, Luis B; Gonzalez, Vanessa L; Griffiths-Jones, Sam; Han, Yi; Hayashi, Cheryl; Hilbrant, Maarten; Hughes, Daniel S T; Janssen, Ralf; Lee, Sandra L; Maeso, Ignacio; Murali, Shwetha C; Muzny, Donna M; Nunes da Fonseca, Rodrigo; Paese, Christian L B; Qu, Jiaxin; Ronshaugen, Matthew; Schomburg, Christoph; Schönauer, Anna; Stollewerk, Angelika; Torres-Oliva, Montserrat; Turetzek, Natascha; Vanthournout, Bram; Werren, John H; Wolff, Carsten; Worley, Kim C; Bucher, Gregor; Gibbs, Richard A; Coddington, Jonathan; Oda, Hiroki; Stanke, Mario; Ayoub, Nadia A; Prpic, Nikola-Michael; Flot, Jean-François; Posnien, Nico; Richards, Stephen; McGregor, Alistair P

    2017-07-31

    The duplication of genes can occur through various mechanisms and is thought to make a major contribution to the evolutionary diversification of organisms. There is increasing evidence for a large-scale duplication of genes in some chelicerate lineages including two rounds of whole genome duplication (WGD) in horseshoe crabs. To investigate this further, we sequenced and analyzed the genome of the common house spider Parasteatoda tepidariorum. We found pervasive duplication of both coding and non-coding genes in this spider, including two clusters of Hox genes. Analysis of synteny conservation across the P. tepidariorum genome suggests that there has been an ancient WGD in spiders. Comparison with the genomes of other chelicerates, including that of the newly sequenced bark scorpion Centruroides sculpturatus, suggests that this event occurred in the common ancestor of spiders and scorpions, and is probably independent of the WGDs in horseshoe crabs. Furthermore, characterization of the sequence and expression of the Hox paralogs in P. tepidariorum suggests that many have been subject to neo-functionalization and/or sub-functionalization since their duplication. Our results reveal that spiders and scorpions are likely the descendants of a polyploid ancestor that lived more than 450 MYA. Given the extensive morphological diversity and ecological adaptations found among these animals, rivaling those of vertebrates, our study of the ancient WGD event in Arachnopulmonata provides a new comparative platform to explore common and divergent evolutionary outcomes of polyploidization events across eukaryotes.

  16. DDC and COBL, flanking the imprinted GRB10 gene on 7p12, are biallelically expressed.

    PubMed

    Hitchins, Megan P; Bentley, Louise; Monk, David; Beechey, Colin; Peters, Jo; Kelsey, Gavin; Ishino, Fumitoshi; Preece, Michael A; Stanier, Philip; Moore, Gudrun E

    2002-12-01

    Maternal duplication of human 7p11.2-p13 has been associated with Silver-Russell syndrome (SRS) in two familial cases. GRB10 is the only imprinted gene identified within this region to date. GRB10 demonstrates an intricate tissue- and isoform-specific imprinting profile in humans, with paternal expression in fetal brain and maternal expression of one isoform in skeletal muscle. The mouse homolog is maternally transcribed. The GRB10 protein is a potent growth inhibitor and represents a candidate for SRS, which is characterized by pre- and postnatal growth retardation and a spectrum of additional dysmorphic features. Since imprinted genes tend to be grouped in clusters, we investigated the imprinting status of the dopa-decarboxylase gene (DDC) and the Cordon-bleu gene (COBL) which flank GRB10 within the 7p11.2-p13 SRS duplicated region. Although both genes were found to replicate asynchronously, suggestive of imprinting, SNP expression analyses showed that neither gene was imprinted in multiple human fetal tissues. The mouse homologues, Ddc and Cobl, which map to the homologous imprinted region on proximal Chr 11, were also biallelically expressed in mice with uniparental maternal or paternal inheritance of this region. With the intent of using mouse Grb10 as an imprinted control, biallelic expression was consistently observed in fetal, postnatal, and adult brain of these mice, in contrast to the maternal-specific transcription previously demonstrated in brain in inter-specific F1 progeny. This may be a further example of over-expression of maternally derived transcripts in inter-specific mouse crosses. GRB10 remains the only imprinted gene identified within 7p11.2-p13.

  17. [Diagnostic value of MYB protein expression in adenoid cystic carcinoma and status of MYB gene copy number].

    PubMed

    Huo, Zhen; Zeng, Xuan; Wu, Shafei; Wu, Huanwen; Meng, Yunxiao; Liu, Yuanyuan; Luo, Yufeng; Cao, Jinling; Liang, Zhiyong

    2015-08-01

    To explore the diagnostic value of MYB protein expression for adenoid cystic carcinoma and its differential diagnosis from other salivary gland tumors, and to further investigate the status of MYB gene copy number. MYB expression was studied by immunohistochemistry in 34 adenoid cystic carcinomas, 55 non-adenoid cystic carcinomas (other salivary gland tumors) including 10 pleomorphic adenomas, 10 basal cell adenomas, 10 epithelial-myoepithelial carcinomas, 9 basal cell adenocarcinomas, 8 mucoepidermoid carcinomas, 4 carcinoma in pleomorphic adenomas, and 4 polymorphous low-grade adenocarcinoma. MYB gene copy number status was detected by FISH in MYB protein-positive cases. 82.4% (28/34) of adenoid cystic carcinomas were MYB protein-positive, compared with 9.1% (5/55) of non-adenoid cystic carcinomas, and the difference between the two groups was statistically significant (P < 0.01). 2/18 of adenoid cystic carcinomas had duplication of MYB gene by FISH, and all non-adenoid cystic carcinomas were negative although the difference was not statistically significant (P = 0.435). MYB protein expression is a useful diagnostic marker for adenoid cystic carcinomas in its separation from other salivary gland tumors. In addition, duplication of MYB gene is no a major mechanism for the MYB protein overexpression.

  18. Isolation of the Ascobolus Immersus Spore Color Gene B2 and Study in Single Cells of Gene Silencing by Methylation Induced Premeiotically

    PubMed Central

    Colot, V.; Rossignol, J. L.

    1995-01-01

    The ascomycete Ascobolus immersus has been extensively used as a model system for the genetic study of meiotic recombination. More recently, an epigenetic process, known as methylation induced premeiotically (MIP), that acts on duplicated sequences has been discovered in A. immersus and has raised a new interest in this fungus. To try and extend these studies, we have now cloned the A. immersus spore color gene b2, a well characterized recombination hot-spot. Isolation of the whole gene was verified by physical mapping of four large b2 alterations, followed by transformation and mutant rescue of a null b2 allele. Transformation was also used to duplicate b2 and subject it to MIP. As a result, we were able for the first time to observe gene silencing as early as just after meiosis and in single cells. Furthermore, we have found evidence for a modulating effect of MIP on b2 expression, depending on the region of the gene that is duplicated and hence subjected to MIP. PMID:8601475

  19. The pineapple genome and the evolution of CAM photosynthesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ming, Ray; VanBuren, Robert; Wai, Ching Man

    Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C 3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues.more » CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Lastly, we found pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C 3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.« less

  20. The pineapple genome and the evolution of CAM photosynthesis

    DOE PAGES

    Ming, Ray; VanBuren, Robert; Wai, Ching Man; ...

    2015-11-02

    Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C 3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues.more » CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Lastly, we found pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C 3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.« less

  1. Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.)

    PubMed Central

    Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

    2015-01-01

    The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut. PMID:26125188

  2. Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress

    PubMed Central

    Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming

    2017-01-01

    The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance. PMID:28417911

  3. Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress.

    PubMed

    Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming

    2017-04-12

    The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance.

  4. Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.).

    PubMed

    Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

    2015-01-01

    The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut.

  5. Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus.

    PubMed

    Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin

    2017-10-24

    The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed ( Brassica napus ). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B . napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B . napus and its parental lines and for molecular breeding studies of bZIP genes in B . napus .

  6. Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus

    PubMed Central

    Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Xu, Xinfu; Wang, Rui; Li, Jiana

    2017-01-01

    The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. PMID:29064393

  7. Two euAGAMOUS Genes Control C-Function in Medicago truncatula

    PubMed Central

    Gómez-Mena, Concepción; Constantin, Gabriela D.; Wen, Jiangqi; Mysore, Kirankumar S.; Lund, Ole S.; Johansen, Elisabeth; Beltrán, José Pío; Cañas, Luis A.

    2014-01-01

    C-function MADS-box transcription factors belong to the AGAMOUS (AG) lineage and specify both stamen and carpel identity and floral meristem determinacy. In core eudicots, the AG lineage is further divided into two branches, the euAG and PLE lineages. Functional analyses across flowering plants strongly support the idea that duplicated AG lineage genes have different degrees of subfunctionalization of the C-function. The legume Medicago truncatula contains three C-lineage genes in its genome: two euAG genes (MtAGa and MtAGb) and one PLENA-like gene (MtSHP). This species is therefore a good experimental system to study the effects of gene duplication within the AG subfamily. We have studied the respective functions of each euAG genes in M. truncatula employing expression analyses and reverse genetic approaches. Our results show that the M. truncatula euAG- and PLENA-like genes are an example of subfunctionalization as a result of a change in expression pattern. MtAGa and MtAGb are the only genes showing a full C-function activity, concomitant with their ancestral expression profile, early in the floral meristem, and in the third and fourth floral whorls during floral development. In contrast, MtSHP expression appears late during floral development suggesting it does not contribute significantly to the C-function. Furthermore, the redundant MtAGa and MtAGb paralogs have been retained which provides the overall dosage required to specify the C-function in M. truncatula. PMID:25105497

  8. Comparative methods for the analysis of gene-expression evolution: an example using yeast functional genomic data.

    PubMed

    Oakley, Todd H; Gu, Zhenglong; Abouheif, Ehab; Patel, Nipam H; Li, Wen-Hsiung

    2005-01-01

    Understanding the evolution of gene function is a primary challenge of modern evolutionary biology. Despite an expanding database from genomic and developmental studies, we are lacking quantitative methods for analyzing the evolution of some important measures of gene function, such as gene-expression patterns. Here, we introduce phylogenetic comparative methods to compare different models of gene-expression evolution in a maximum-likelihood framework. We find that expression of duplicated genes has evolved according to a nonphylogenetic model, where closely related genes are no more likely than more distantly related genes to share common expression patterns. These results are consistent with previous studies that found rapid evolution of gene expression during the history of yeast. The comparative methods presented here are general enough to test a wide range of evolutionary hypotheses using genomic-scale data from any organism.

  9. Genome-wide comparative analysis of NBS-encoding genes between Brassica species and Arabidopsis thaliana.

    PubMed

    Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi

    2014-01-03

    Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after split from A. thaliana. Here we present genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among 3 species suggested that orthologous genes in B. rapa species have undergone stronger negative selection than those in B .oleracea species. But for TNL type, there are no significant differences in the orthologous gene pairs between the two species. This study is first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome triplication analysis in B. oleracea, B. rapa and A. thaliana genomes, our study provides insight into the evolutionary history of NBS-encoding genes after divergence of A. thaliana and the Brassica lineage. These results together with expression pattern analysis of NBS-encoding orthologous genes provide useful resource for functional characterization of these genes and genetic improvement of relevant crops.

  10. Detection of a large duplication mutation in the myosin-binding protein C3 gene in a case of hypertrophic cardiomyopathy.

    PubMed

    Meyer, Thomas; Pankuweit, Sabine; Richter, Anette; Maisch, Bernhard; Ruppert, Volker

    2013-09-15

    Hypertrophic cardiomyopathy (HCM) is a cardiovascular disease with autosomal dominant inheritance caused by mutations in genes coding for sarcomeric and/or regulatory proteins expressed in cardiomyocytes. In a small cohort of HCM patients (n=8), we searched for mutations in the two most common genes responsible for HCM and found four missense mutations in the MYH7 gene encoding cardiac β-myosin heavy chain (R204H, M493V, R719W, and R870H) and three mutations in the myosin-binding protein C3 gene (MYBPC3) including one missense (A848V) and two frameshift mutations (c.3713delTG and c.702ins26bp). The c.702ins26bp insertion resulted from the duplication of a 26-bp fragment in a 54-year-old female HCM patient presenting with clinical signs of heart failure due to diastolic dysfunction. Although such large duplications (>10 bp) in the MYBPC3 gene are very rare and have been identified only in 4 families reported so far, the identical duplication mutation was found earlier in a Dutch patient, demonstrating that it may constitute a hitherto unknown founder mutation in central European populations. This observation underscores the significance of insertions into the coding sequence of the MYBPC3 gene for the development and pathogenesis of HCM. © 2013 Elsevier B.V. All rights reserved.

  11. Aldehyde Dehydrogenase Gene Superfamily in Populus: Organization and Expression Divergence between Paralogous Gene Pairs.

    PubMed

    Tian, Feng-Xia; Zang, Jian-Lei; Wang, Tan; Xie, Yu-Li; Zhang, Jin; Hu, Jian-Jun

    2015-01-01

    Aldehyde dehydrogenases (ALDHs) constitute a superfamily of NAD(P)+-dependent enzymes that catalyze the irreversible oxidation of a wide range of reactive aldehydes to their corresponding nontoxic carboxylic acids. ALDHs have been studied in many organisms from bacteria to mammals; however, no systematic analyses incorporating genome organization, gene structure, expression profiles, and cis-acting elements have been conducted in the model tree species Populus trichocarpa thus far. In this study, a comprehensive analysis of the Populus ALDH gene superfamily was performed. A total of 26 Populus ALDH genes were found to be distributed across 12 chromosomes. Genomic organization analysis indicated that purifying selection may have played a pivotal role in the retention and maintenance of PtALDH gene families. The exon-intron organizations of PtALDHs were highly conserved within the same family, suggesting that the members of the same family also may have conserved functionalities. Microarray data and qRT-PCR analysis indicated that most PtALDHs had distinct tissue-specific expression patterns. The specificity of cis-acting elements in the promoter regions of the PtALDHs and the divergence of expression patterns between nine paralogous PtALDH gene pairs suggested that gene duplications may have freed the duplicate genes from the functional constraints. The expression levels of some ALDHs were up- or down-regulated by various abiotic stresses, implying that the products of these genes may be involved in the adaptation of Populus to abiotic stresses. Overall, the data obtained from our investigation contribute to a better understanding of the complexity of the Populus ALDH gene superfamily and provide insights into the function and evolution of ALDH gene families in vascular plants.

  12. BcMF26a and BcMF26b Are Duplicated Polygalacturonase Genes with Divergent Expression Patterns and Functions in Pollen Development and Pollen Tube Formation in Brassica campestris

    PubMed Central

    Lyu, Meiling; Yu, Youjian; Jiang, Jingjing; Song, Limin; Liang, Ying; Ma, Zhiming; Xiong, Xingpeng; Cao, Jiashu

    2015-01-01

    Polygalacturonase (PG) is one of the cell wall hydrolytic enzymes involving in pectin degradation. A comparison of two highly conserved duplicated PG genes, namely, Brassica campestris Male Fertility 26a (BcMF26a) and BcMF26b, revealed the different features of their expression patterns and functions. We found that these two genes were orthologous genes of At4g33440, and they originated from a chromosomal segmental duplication. Although structurally similar, their regulatory and intron sequences largely diverged. QRT-PCR analysis showed that the expression level of BcMF26b was higher than that of BcMF26a in almost all the tested organs and tissues in Brassica campestris. Promoter activity analysis showed that, at reproductive development stages, BcMF26b promoter was active in tapetum, pollen grains, and pistils, whereas BcMF26a promoter was only active in pistils. In the subcellular localization experiment, BcMF26a and BcMF26b proteins could be localized to the cell wall. When the two genes were co-inhibited, pollen intine was formed abnormally and pollen tubes could not grow or stretch. Moreover, the knockout mutants of At4g33440 delayed the growth of pollen tubes. Therefore, BcMF26a/b can participate in the construction of pollen wall by modulating intine information and BcMF26b may play a major role in co-inhibiting transformed plants. PMID:26153985

  13. C2H2 type of zinc finger transcription factors in foxtail millet define response to abiotic stresses.

    PubMed

    Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Mishra, Awdhesh Kumar; Khandelwal, Rohit; Khan, Yusuf; Roy, Riti; Prasad, Manoj

    2014-09-01

    C2H2 type of zinc finger transcription factors (TFs) play crucial roles in plant stress response and hormone signal transduction. Hence considering its importance, genome-wide investigation and characterization of C2H2 zinc finger proteins were performed in Arabidopsis, rice and poplar but no such study was conducted in foxtail millet which is a C4 Panicoid model crop well known for its abiotic stress tolerance. The present study identified 124 C2H2-type zinc finger TFs in foxtail millet (SiC2H2) and physically mapped them onto the genome. The gene duplication analysis revealed that SiC2H2s primarily expanded in the genome through tandem duplication. The phylogenetic tree classified these TFs into five groups (I-V). Further, miRNAs targeting SiC2H2 transcripts in foxtail millet were identified. Heat map demonstrated differential and tissue-specific expression patterns of these SiC2H2 genes. Comparative physical mapping between foxtail millet SiC2H2 genes and its orthologs of sorghum, maize and rice revealed the evolutionary relationships of C2H2 type of zinc finger TFs. The duplication and divergence data provided novel insight into the evolutionary aspects of these TFs in foxtail millet and related grass species. Expression profiling of candidate SiC2H2 genes in response to salinity, dehydration and cold stress showed differential expression pattern of these genes at different time points of stresses.

  14. Alternative RNA processing events in human calcitonin/calcitonin gene-related peptide gene expression.

    PubMed Central

    Jonas, V; Lin, C R; Kawashima, E; Semon, D; Swanson, L W; Mermod, J J; Evans, R M; Rosenfeld, M G

    1985-01-01

    Two mRNAs generated as a consequence of alternative RNA processing events in expression of the human calcitonin gene encode the protein precursors of either calcitonin or calcitonin gene-related peptide (CGRP). Both calcitonin and CGRP RNAs and their encoded peptide products are expressed in the human pituitary and in medullary thyroid tumors. On the basis of sequence comparison, it is suggested that both the calcitonin and CGRP exons arose from a common primordial sequence, suggesting that duplication and rearrangement events are responsible for the generation of this complex transcription unit. Images PMID:3872459

  15. Gene copy number evolution during tetraploid cotton radiation.

    PubMed

    Rong, J; Feltus, F A; Liu, L; Lin, L; Paterson, A H

    2010-11-01

    After polyploid formation, retention or loss of duplicated genes is not random. Genes with some functional domains are convergently restored to 'singleton' state after many independent genome duplications, and have been referred to as 'duplication-resistant' (DR) genes. To further explore the timeframe for their restoration to the singleton state, 27 cotton homologs of genes found to be 'DR' in Arabidopsis were selected based on diagnostic Pfam domains. Their copy numbers were studied using southern hybridization and sequence analysis in five tetraploid species and their ancestral A and D genome diploids. DR genes had significantly lower copy number than gene families hybridizing to randomly selected cotton ESTs. Three DR genes showed complete loss of D genome-derived homoeologs in some or all tetraploid species. Prior analysis has shown gene loss in polyploid cotton to be rare, and herein only one randomly selected gene showed loss of a homoeolog in only one of the five tetraploid species (Gossypium mustelinum). BAC sequencing confirmed two cases of gene loss in tetraploid cotton. Divergence among 5' sequences of DR genes amplified from G. arboreum, G. raimondii, and Gossypioides kirkii was correlated with gene copy number. These results show that genes containing Pfam domains associated with duplication resistance in Arabidopsis have also been preferentially restored to low copy number after a more recent polyploidization event in cotton. In tetraploid cotton, genes from the progenitor D genome seem to experience more gene copy number divergence than genes from the A genome. Together with D subgenome-biased alterations in gene expression, perhaps gene loss may contribute to the relatively larger portion of quantitative trait variation attributable to D than A subgenome chromosomes of tetraploid cotton.

  16. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706).

    PubMed

    Krsticevic, Flavia J; Arce, Débora P; Ezpeleta, Joaquín; Tapia, Elizabeth

    2016-10-13

    In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. Copyright © 2016 Krsticevic et al.

  17. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706)

    PubMed Central

    Krsticevic, Flavia J.; Arce, Débora P.; Ezpeleta, Joaquín; Tapia, Elizabeth

    2016-01-01

    In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. PMID:27565886

  18. Genomic organization, phylogenetic comparison, and expression profiles of the SPL family genes and their regulation in soybean.

    PubMed

    Tripathi, Rajiv K; Goel, Ridhi; Kumari, Sweta; Dahuja, Anil

    2017-03-01

    SQUAMOSA Promoter-Binding Protein-Like (SPL) genes form a major family of plant-specific transcription factors and play an important role in plant growth and development. In this study, we report the identification of 41 SPL genes (GmSPLs) in the soybean genome. Phylogenetic analysis revealed that these genes were divided into five groups (groups 1-5). Further, exon/intron structure and motif composition revealed that the GmSPL genes are conserved within their same group. The N-terminal zinc finger 1 (Zn1) of the SBP domain was a CCCH (Cys3His1) and the C terminus zinc finger 2 (Zn2) was a CCHC (Cys2HisCys) type. The 41 GmSPL genes were distributed unevenly on 17 of the 20 chromosomes, with tandem and segmental duplication events. We found that segmental duplication has made an important contribution to soybean SPL gene family expansion. The Ka/Ks ratios revealed that the duplicated GmSPL genes evolved under the effect of purifying selection. In addition, 17 of the 41 GmSPLs were found as targets of miR156; these might be involved in their posttranscriptional regulation through miR156. Importantly, RLM-RACE analysis confirmed the GmmiR156-mediated cleavage of GmSPL2a transcript in 2-4 mm stage of soybean seed. Alternative splicing events in 9 GmSPLs were detected which produces transcripts and proteins of different lengths that may modulate protein signaling, binding, localization, stability, and other properties. Expression analysis of the soybean SPL genes in various tissues and different developmental stages of seed suggested distinct spatiotemporal patterns. Differences in the expression patterns of miR156-targeted and miR156-non-targeted soybean SPL genes suggest that miR156 plays key functions in soybean development. Our results provide an important foundation for further uncovering the crucial roles of GmSPLs in the development of soybean and other biological processes.

  19. Conserved structure and expression of hsp70 paralogs in teleost fishes.

    PubMed

    Metzger, David C H; Hemmer-Hansen, Jakob; Schulte, Patricia M

    2016-06-01

    The cytosolic 70KDa heat shock proteins (Hsp70s) are widely used as biomarkers of environmental stress in ecological and toxicological studies in fish. Here we analyze teleost genome sequences to show that two genes encoding inducible hsp70s (hsp70-1 and hsp70-2) are likely present in all teleost fish. Phylogenetic and synteny analyses indicate that hsp70-1 and hsp70-2 are distinct paralogs that originated prior to the diversification of the teleosts. The promoters of both genes contain a TATA box and conserved heat shock elements (HSEs), but unlike mammalian HSP70s, both genes contain an intron in the 5' UTR. The hsp70-2 gene has undergone tandem duplication in several species. In addition, many other teleost genome assemblies have multiple copies of hsp70-2 present on separate, small, genomic scaffolds. To verify that these represent poorly assembled tandem duplicates, we cloned the genomic region surrounding hsp70-2 in Fundulus heteroclitus and showed that the hsp70-2 gene copies that are on separate scaffolds in the genome assembly are arranged as tandem duplicates. Real-time quantitative PCR of F. heteroclitus genomic DNA indicates that four copies of the hsp70-2 gene are likely present in the F. heteroclitus genome. Comparison of expression patterns in F. heteroclitus and Gasterosteus aculeatus demonstrates that hsp70-2 has a higher fold increase than hsp70-1 following heat shock in gill but not in muscle tissue, revealing a conserved difference in expression patterns between isoforms and tissues. These data indicate that ecological and toxicological studies using hsp70 as a biomarker in teleosts should take this complexity into account. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. Saccharomyces cerevisiae ribosomal protein L37 is encoded by duplicate genes that are differentially expressed.

    PubMed

    Tornow, J; Santangelo, G M

    1994-06-01

    A duplicate copy of the RPL37A gene (encoding ribosomal protein L37) was cloned and sequenced. The coding region of RPL37B is very similar to that of RPL37A, with only one conservative amino-acid difference. However, the intron and flanking sequences of the two genes are extremely dissimilar. Disruption experiments indicate that the two loci are not functionally equivalent: disruption of RPL37B was insignificant, but disruption of RPL37A severely impaired the growth rate of the cell. When both RPL37 loci are disrupted, the cell is unable to grow at all, indicating that rpL37 is an essential protein. The functional disparity between the two RPL37 loci could be explained by differential gene expression. The results of two experiments support this idea: gene fusion of RPL37A to a reporter gene resulted in six-fold higher mRNA levels than was generated by the same reporter gene fused to RPL37B, and a modest increase in gene dosage of RPL37B overcame the lack of a functional RPL37A gene.

  1. Persons with Quebec platelet disorder have a tandem duplication of PLAU, the urokinase plasminogen activator gene.

    PubMed

    Paterson, Andrew D; Rommens, Johanna M; Bharaj, Bhupinder; Blavignac, Jessica; Wong, Isidro; Diamandis, Maria; Waye, John S; Rivard, Georges E; Hayward, Catherine P M

    2010-02-11

    Quebec platelet disorder (QPD) is an autosomal dominant bleeding disorder linked to a region on chromosome 10 that includes PLAU, the urokinase plasminogen activator gene. QPD increases urokinase plasminogen activator mRNA levels, particularly during megakaryocyte differentiation, without altering expression of flanking genes. Because PLAU sequence changes were excluded as the cause of this bleeding disorder, we investigated whether the QPD mutation involved PLAU copy number variation. All 38 subjects with QPD had a direct tandem duplication of a 78-kb genomic segment that includes PLAU. This mutation was specific to QPD as it was not present in any unaffected family members (n = 114), unrelated French Canadians (n = 221), or other persons tested (n = 90). This new information on the genetic mutation will facilitate diagnostic testing for QPD and studies of its pathogenesis and prevalence. QPD is the first bleeding disorder to be associated with a gene duplication event and a PLAU mutation.

  2. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms[OPEN

    PubMed Central

    Li, Zhen; Van de Peer, Yves; De Smet, Riet

    2016-01-01

    Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of “gene duplicability” is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. PMID:26744215

  3. Issues with RNA-seq analysis in non-model organisms: A salmonid example.

    PubMed

    Sundaram, Arvind; Tengs, Torstein; Grimholt, Unni

    2017-10-01

    High throughput sequencing (HTS) is useful for many purposes as exemplified by the other topics included in this special issue. The purpose of this paper is to look into the unique challenges of using this technology in non-model organisms where resources such as genomes, functional genome annotations or genome complexity provide obstacles not met in model organisms. To describe these challenges, we narrow our scope to RNA sequencing used to study differential gene expression in response to pathogen challenge. As a demonstration species we chose Atlantic salmon, which has a sequenced genome with poor annotation and an added complexity due to many duplicated genes. We find that our RNA-seq analysis pipeline deciphers between duplicates despite high sequence identity. However, annotation issues provide problems in linking differentially expressed genes to pathways. Also, comparing results between approaches and species are complicated due to lack of standardized annotation. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Conserved noncoding sequences conserve biological networks and influence genome evolution.

    PubMed

    Xie, Jianbo; Qian, Kecheng; Si, Jingna; Xiao, Liang; Ci, Dong; Zhang, Deqiang

    2018-05-01

    Comparative genomics approaches have identified numerous conserved cis-regulatory sequences near genes in plant genomes. Despite the identification of these conserved noncoding sequences (CNSs), our knowledge of their functional importance and selection remains limited. Here, we used a combination of DNA methylome analysis, microarray expression analyses, and functional annotation to study these sequences in the model tree Populus trichocarpa. Methylation in CG contexts and non-CG contexts was lower in CNSs, particularly CNSs in the 5'-upstream regions of genes, compared with other sites in the genome. We observed that CNSs are enriched in genes with transcription and binding functions, and this also associated with syntenic genes and those from whole-genome duplications, suggesting that cis-regulatory sequences play a key role in genome evolution. We detected a significant positive correlation between CNS number and protein interactions, suggesting that CNSs may have roles in the evolution and maintenance of biological networks. The divergence of CNSs indicates that duplication-degeneration-complementation drives the subfunctionalization of a proportion of duplicated genes from whole-genome duplication. Furthermore, population genomics confirmed that most CNSs are under strong purifying selection and only a small subset of CNSs shows evidence of adaptive evolution. These findings provide a foundation for future studies exploring these key genomic features in the maintenance of biological networks, local adaptation, and transcription.

  5. Molecular cloning, characterization and expression analysis of TLR9, MyD88 and TRAF6 genes in common carp (Cyprinus carpio).

    PubMed

    Kongchum, Pawapol; Hallerman, Eric M; Hulata, Gideon; David, Lior; Palti, Yniv

    2011-01-01

    Induction of innate immune pathways is critical for early host defense, but there is limited understanding of how teleost fishes recognize pathogen molecules and activate these pathways. In mammals, cells of the innate immune system detect pathogenic molecular structures using pattern recognition receptors (PRRs). TLR9 functions as a PRR that recognizes CpG motifs in bacterial and viral DNA and requires adaptor molecules MyD88 and TRAF6 for signal transduction. Here we report full-length cDNA isolation, structural characterization and tissue mRNA expression analysis of the common carp (cc) TLR9, MyD88 and TRAF6 gene orthologs. The ccTLR9 open-reading frame (ORF) is predicted to encode a 1064-amino acid (aa) protein. We found that MyD88 and TRAF6 genes are duplicated in common carp. This is the first report of TRAF6 duplication in a vertebrate genome and stronger evidence in support of MyD88 duplication is provided. The ccMyD88a and b ORFs are predicted to encode 288-aa and 284-aa peptides, respectively. They share 91% aa sequence identity between paralogs. The ccTRAF6a and b ORFs are both predicted to encode 543-aa peptides sharing 95% aa sequence identity between paralogs. The ccTLR9 gene is contained in a single large exon. The ccMyD88a and ccMyD88b coding sequences span five exons. The TRAF6b gene spans six exons. PCR amplification to obtain the entire coding sequence of ccTRAF6a gene was not successful. The 2104-bp fragment amplified covers the 3' end of the gene and it contains a partial sequence of one exon and three complete exons. The predicated protein domains of the ccTLR9, ccMyD88 and ccTRAF6 are conserved and resemble orthologs from other vertebrates. Real-time quantitative PCR assays of the ccTLR9, MyD88a and b, and TRAF6a and b gene transcripts in healthy common carp indicated that mRNA expression varied between tissues. Differential expression of duplicate copies were found for ccMyD88 and ccTRAF6 in white and red muscle tissues, suggesting that paralogs may have evolved and attained a new function. The genomic information we describe in this paper provides evidence of sequence and structural conservation of immune response genes in common carp. Published by Elsevier Ltd.

  6. Silver-Russell syndrome and Beckwith-Wiedemann syndrome phenotypes associated with 11p duplication in a single family.

    PubMed

    Cardarelli, Laura; Sparago, Angela; De Crescenzo, Agostina; Nalesso, Elisa; Zavan, Barbara; Cubellis, Maria Vittoria; Selicorni, Angelo; Cavicchioli, Paola; Pozzan, Giovanni Battista; Petrella, Marilena; Riccio, Andrea

    2010-01-01

    Genomic imprinting is an epigenetic phenomenon resulting in differential expression of maternal and paternal alleles of a subset of genes. In the mouse, mutation of imprinted genes often results in contrasting phenotypes, depending on parental origin. The overgrowth-associated Beckwith-Wiedemann syndrome (BWS) and the growth restriction-associated Silver-Russell syndrome (SRS) have been linked with a variety of epigenetic and genetic defects affecting a cluster of imprinted genes at chromosome 11p15.5. Paternally derived and maternally derived 11p15.5 duplications represent infrequent findings in BWS and SRS, respectively. Here, we report a case in which a 6.5 Mb duplication of 11p15.4-pter resulted in SRS and BWS phenotypes in a child and her mother, respectively. Molecular analyses demonstrated that the duplication involved the maternal chromosome 11p15 in the child and the paternal chromosome 11p15 in the mother. This observation provides a direct demonstration that SRS and BWS represent specular images, both at the clinical and molecular levels.

  7. Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals

    PubMed Central

    Patel, Vidushi S; Cooper, Steven JB; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer AM

    2008-01-01

    Background Vertebrate alpha (α)- and beta (β)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the α- and β-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil β-globin gene (ω) in the marsupial α-cluster, however, suggested that duplication of the α-β cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous α- and β-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. Results The platypus α-globin cluster (chromosome 21) contains embryonic and adult α- globin genes, a β-like ω-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-ζ-ζ'-αD-α3-α2-α1-ω-GBY-3'. The platypus β-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-ε-β-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate α-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal β-globin clusters are embedded in olfactory genes. Thus, the mammalian α- and β-globin clusters are orthologous to the bird α- and β-globin clusters respectively. Conclusion We propose that α- and β-globin clusters evolved from an ancient MPG-C16orf35-α-β-GBY-LUC7L arrangement 410 million years ago. A copy of the original β (represented by ω in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of β-globin genes with different expression profiles in different lineages. PMID:18657265

  8. A Theoretical Lower Bound for Selection on the Expression Levels of Proteins

    DOE PAGES

    Price, Morgan N.; Arkin, Adam P.

    2016-06-11

    We use simple models of the costs and benefits of microbial gene expression to show that changing a protein's expression away from its optimum by 2-fold should reduce fitness by at least [Formula: see text], where P is the fraction the cell's protein that the gene accounts for. As microbial genes are usually expressed at above 5 parts per million, and effective population sizes are likely to be above 10(6), this implies that 2-fold changes to gene expression levels are under strong selection, as [Formula: see text], where Ne is the effective population size and s is the selection coefficient.more » Thus, most gene duplications should be selected against. On the other hand, we predict that for most genes, small changes in the expression will be effectively neutral.« less

  9. A Theoretical Lower Bound for Selection on the Expression Levels of Proteins

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Price, Morgan N.; Arkin, Adam P.

    We use simple models of the costs and benefits of microbial gene expression to show that changing a protein's expression away from its optimum by 2-fold should reduce fitness by at least [Formula: see text], where P is the fraction the cell's protein that the gene accounts for. As microbial genes are usually expressed at above 5 parts per million, and effective population sizes are likely to be above 10(6), this implies that 2-fold changes to gene expression levels are under strong selection, as [Formula: see text], where Ne is the effective population size and s is the selection coefficient.more » Thus, most gene duplications should be selected against. On the other hand, we predict that for most genes, small changes in the expression will be effectively neutral.« less

  10. Valine-glutamine (VQ) motif coding genes are ancient and non-plant-specific with comprehensive expression regulation by various biotic and abiotic stresses.

    PubMed

    Jiang, Shu-Ye; Sevugan, Mayalagu; Ramachandran, Srinivasan

    2018-05-09

    Valine-glutamine (VQ) motif containing proteins play important roles in abiotic and biotic stress responses in plants. However, little is known about the origin and evolution as well as comprehensive expression regulation of the VQ gene family. In this study, we systematically surveyed this gene family in 50 plant genomes from algae, moss, gymnosperm and angiosperm and explored their presence in other species from animals, bacteria, fungi and viruses. No VQs were detected in all tested algae genomes and all genomes from moss, gymnosperm and angiosperm encode varying numbers of VQs. Interestingly, some of fungi, lower animals and bacteria also encode single to a few VQs. Thus, they are not plant-specific and should be regarded as an ancient family. Their family expansion was mainly due to segmental duplication followed by tandem duplication and mobile elements. Limited contribution of gene conversion was detected to the family evolution. Generally, VQs were very much conserved in their motif coding region and were under purifying selection. However, positive selection was also observed during species divergence. Many VQs were up- or down-regulated by various abiotic / biotic stresses and phytohormones in rice and Arabidopsis. They were also co-expressed with some of other stress-related genes. All of the expression data suggest a comprehensive expression regulation of the VQ gene family. We provide new insights into gene expansion, divergence, evolution and their expression regulation of this VQ family. VQs were detectable not only in plants but also in some of fungi, lower animals and bacteria, suggesting the evolutionary conservation and the ancient origin. Overall, VQs are non-plant-specific and play roles in abiotic / biotic responses or other biological processes through comprehensive expression regulation.

  11. Both mechanism and age of duplications contribute to biased gene retention patterns in plants.

    PubMed

    Rody, Hugo V S; Baute, Gregory J; Rieseberg, Loren H; Oliveira, Luiz O

    2017-01-06

    All extant seed plants are successful paleopolyploids, whose genomes carry duplicate genes that have survived repeated episodes of diploidization. However, the survival of gene duplicates is biased with respect to gene function and mechanism of duplication. Transcription factors, in particular, are reported to be preferentially retained following whole-genome duplications (WGDs), but disproportionately lost when duplicated by tandem events. An explanation for this pattern is provided by the Gene Balance Hypothesis (GBH), which posits that duplicates of highly connected genes are retained following WGDs to maintain optimal stoichiometry among gene products; but such connected gene duplicates are disfavored following tandem duplications. We used genomic data from 25 taxonomically diverse plant species to investigate the roles of duplication mechanism, gene function, and age of duplication in the retention of duplicate genes. Enrichment analyses were conducted to identify Gene Ontology (GO) functional categories that were overrepresented in either WGD or tandem duplications, or across ranges of divergence times. Tandem paralogs were much younger, on average, than WGD paralogs and the most frequently overrepresented GO categories were not shared between tandem and WGD paralogs. Transcription factors were overrepresented among ancient paralogs regardless of mechanism of origin or presence of a WGD. Also, in many cases, there was no bias toward transcription factor retention following recent WGDs. Both the fixation and the retention of duplicated genes in plant genomes are context-dependent events. The strong bias toward ancient transcription factor duplicates can be reconciled with the GBH if selection for optimal stoichiometry among gene products is strongest following the earliest polyploidization events and becomes increasingly relaxed as gene families expand.

  12. Genome-Wide Investigation and Expression Profiling of AP2/ERF Transcription Factor Superfamily in Foxtail Millet (Setaria italica L.)

    PubMed Central

    Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses. PMID:25409524

  13. Genome-wide investigation and expression profiling of AP2/ERF transcription factor superfamily in foxtail millet (Setaria italica L.).

    PubMed

    Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses.

  14. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.

    PubMed

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M

    2017-08-01

    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  15. Genome-wide analysis of the basic leucine zipper (bZIP) transcription factor gene family in six legume genomes.

    PubMed

    Wang, Zhihui; Cheng, Ke; Wan, Liyun; Yan, Liying; Jiang, Huifang; Liu, Shengyi; Lei, Yong; Liao, Boshou

    2015-12-10

    Plant bZIP proteins characteristically harbor a highly conserved bZIP domain with two structural features: a DNA-binding basic region and a leucine (Leu) zipper dimerization region. They have been shown to be diverse transcriptional regulators, playing crucial roles in plant development, physiological processes, and biotic/abiotic stress responses. Despite the availability of six completely sequenced legume genomes, a comprehensive investigation of bZIP family members in legumes has yet to be presented. In this study, we identified 428 bZIP genes encoding 585 distinct proteins in six legumes, Glycine max, Medicago truncatula, Phaseolus vulgaris, Cicer arietinum, Cajanus cajan, and Lotus japonicus. The legume bZIP genes were categorized into 11 groups according to their phylogenetic relationships with genes from Arabidopsis. Four kinds of intron patterns (a-d) within the basic and hinge regions were defined and additional conserved motifs were identified, both presenting high group specificity and supporting the group classification. We predicted the DNA-binding patterns and the dimerization properties, based on the characteristic features in the basic and hinge regions and the Leu zipper, respectively, which indicated that some highly conserved amino acid residues existed across each major group. The chromosome distribution and analysis for WGD-derived duplicated blocks revealed that the legume bZIP genes have expanded mainly by segmental duplication rather than tandem duplication. Expression data further revealed that the legume bZIP genes were expressed constitutively or in an organ-specific, development-dependent manner playing roles in multiple seed developmental stages and tissues. We also detected several key legume bZIP genes involved in drought- and salt-responses by comparing fold changes of expression values in drought-stressed or salt-stressed roots and leaves. In summary, this genome-wide identification, characterization and expression analysis of legume bZIP genes provides valuable information for understanding the molecular functions and evolution of the legume bZIP transcription factor family, and highlights potential legume bZIP genes involved in regulating tissue development and abiotic stress responses.

  16. Many gene and domain families have convergent fates following independent whole-genome duplication events in Arabidopsis, Oryza, Saccharomyces and Tetraodon.

    PubMed

    Paterson, Andrew H; Chapman, Brad A; Kissinger, Jessica C; Bowers, John E; Feltus, Frank A; Estill, James C

    2006-11-01

    Genome duplication is potentially a good source of new genes, but such genes take time to evolve. We have found a group of "duplication-resistant" genes, which have undergone convergent restoration to singleton status following several independent genome duplications. Restoration of duplication-resistant genes to singleton status could be important to long-term survival of a polyploid lineage. Angiosperms show more frequent polyploidization and a higher degree of duplicate gene preservation than other paleopolyploids, making them well-suited to further study of duplication-resistant genes.

  17. Intron-loss evolution of hatching enzyme genes in Teleostei

    PubMed Central

    2010-01-01

    Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321

  18. Fatty acid-binding protein genes of the ancient, air-breathing, ray-finned fish, spotted gar (Lepisosteus oculatus).

    PubMed

    Venkatachalam, Ananda B; Fontenot, Quenton; Farrara, Allyse; Wright, Jonathan M

    2018-03-01

    With the advent of high-throughput DNA sequencing technology, the genomic sequence of many disparate species has led to the relatively new discipline of genomics, the study of genome structure, function and evolution. Much work has been focused on the role of whole genome duplications (WGD) in the architecture of extant vertebrate genomes, particularly those of teleost fishes which underwent a WGD early in the teleost radiation >230 million years ago (mya). Our past work has focused on the fate of duplicated copies of a multigene family coding for the intracellular lipid-binding protein (iLBP) genes in the teleost fishes. To define the evolutionary processes that determined the fate of duplicated genes and generated the structure of extant fish genomes, however, requires comparative genomic analysis with a fish lineage that diverged before the teleost WGD, such as the spotted gar (Lepisosteus oculatus), an ancient, air-breathing, ray-finned fish. Here, we describe the genomic organization, chromosomal location and tissue-specific expression of a subfamily of the iLBP genes that code for fatty acid-binding proteins (Fabps) in spotted gar. Based on this work, we have defined the minimum suite of fabp genes prior to their duplication in the teleost lineages ~230-400 mya. Spotted gar, therefore, serves as an appropriate outgroup, or ancestral/ancient fish, that did not undergo the teleost-specific WGD. As such, analyses of the spatio-temporal regulation of spotted gar genes provides a foundation to determine whether the duplicated fabp genes have been retained in teleost genomes owing to either sub- or neofunctionalization. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. An amphioxus Msx gene expressed predominantly in the dorsal neural tube.

    PubMed

    Sharman, A C; Shimeld, S M; Holland, P W

    1999-04-01

    Genomic and cDNA clones of an Msx class homeobox gene were isolated from amphioxus (Branchiostoma floridae). The gene, AmphiMsx, is expressed in the neural plate from late gastrulation; in later embryos it is expressed in dorsal cells of the neural tube, excluding anterior and posterior regions, in an irregular reiterated pattern. There is transient expression in dorsal cells within somites, reminiscent of migrating neural crest cells of vertebrates. In larvae, mRNA is detected in two patches of anterior ectoderm proposed to be placodes. Evolutionary analyses show there is little phylogenetic information in Msx protein sequences; however, it is likely that duplication of Msx genes occurred in the vertebrate lineage.

  20. Genome of wild olive and the evolution of oil biosynthesis.

    PubMed

    Unver, Turgay; Wu, Zhangyan; Sterck, Lieven; Turktas, Mine; Lohaus, Rolf; Li, Zhen; Yang, Ming; He, Lijuan; Deng, Tianquan; Escalante, Francisco Javier; Llorens, Carlos; Roig, Francisco J; Parmaksiz, Iskender; Dundar, Ekrem; Xie, Fuliang; Zhang, Baohong; Ipek, Arif; Uranbey, Serkan; Erayman, Mustafa; Ilhan, Emre; Badad, Oussama; Ghazal, Hassan; Lightfoot, David A; Kasarla, Pavan; Colantonio, Vincent; Tombuloglu, Huseyin; Hernandez, Pilar; Mete, Nurengin; Cetin, Oznur; Van Montagu, Marc; Yang, Huanming; Gao, Qiang; Dorado, Gabriel; Van de Peer, Yves

    2017-10-31

    Here we present the genome sequence and annotation of the wild olive tree ( Olea europaea var. sylvestris ), called oleaster, which is considered an ancestor of cultivated olive trees. More than 50,000 protein-coding genes were predicted, a majority of which could be anchored to 23 pseudochromosomes obtained through a newly constructed genetic map. The oleaster genome contains signatures of two Oleaceae lineage-specific paleopolyploidy events, dated at ∼28 and ∼59 Mya. These events contributed to the expansion and neofunctionalization of genes and gene families that play important roles in oil biosynthesis. The functional divergence of oil biosynthesis pathway genes, such as FAD2 , SACPD, EAR , and ACPTE , following duplication, has been responsible for the differential accumulation of oleic and linoleic acids produced in olive compared with sesame, a closely related oil crop. Duplicated oleaster FAD2 genes are regulated by an siRNA derived from a transposable element-rich region, leading to suppressed levels of FAD2 gene expression. Additionally, neofunctionalization of members of the SACPD gene family has led to increased expression of SACPD2 , 3 , 5 , and 7 , consequently resulting in an increased desaturation of steric acid. Taken together, decreased FAD2 expression and increased SACPD expression likely explain the accumulation of exceptionally high levels of oleic acid in olive. The oleaster genome thus provides important insights into the evolution of oil biosynthesis and will be a valuable resource for oil crop genomics.

  1. Genome of wild olive and the evolution of oil biosynthesis

    PubMed Central

    Unver, Turgay; Wu, Zhangyan; Sterck, Lieven; Turktas, Mine; Lohaus, Rolf; Li, Zhen; Yang, Ming; He, Lijuan; Deng, Tianquan; Escalante, Francisco Javier; Llorens, Carlos; Roig, Francisco J.; Parmaksiz, Iskender; Dundar, Ekrem; Xie, Fuliang; Zhang, Baohong; Ipek, Arif; Uranbey, Serkan; Erayman, Mustafa; Ilhan, Emre; Badad, Oussama; Ghazal, Hassan; Lightfoot, David A.; Kasarla, Pavan; Colantonio, Vincent; Tombuloglu, Huseyin; Hernandez, Pilar; Mete, Nurengin; Cetin, Oznur; Van Montagu, Marc; Yang, Huanming; Gao, Qiang; Dorado, Gabriel; Van de Peer, Yves

    2017-01-01

    Here we present the genome sequence and annotation of the wild olive tree (Olea europaea var. sylvestris), called oleaster, which is considered an ancestor of cultivated olive trees. More than 50,000 protein-coding genes were predicted, a majority of which could be anchored to 23 pseudochromosomes obtained through a newly constructed genetic map. The oleaster genome contains signatures of two Oleaceae lineage-specific paleopolyploidy events, dated at ∼28 and ∼59 Mya. These events contributed to the expansion and neofunctionalization of genes and gene families that play important roles in oil biosynthesis. The functional divergence of oil biosynthesis pathway genes, such as FAD2, SACPD, EAR, and ACPTE, following duplication, has been responsible for the differential accumulation of oleic and linoleic acids produced in olive compared with sesame, a closely related oil crop. Duplicated oleaster FAD2 genes are regulated by an siRNA derived from a transposable element-rich region, leading to suppressed levels of FAD2 gene expression. Additionally, neofunctionalization of members of the SACPD gene family has led to increased expression of SACPD2, 3, 5, and 7, consequently resulting in an increased desaturation of steric acid. Taken together, decreased FAD2 expression and increased SACPD expression likely explain the accumulation of exceptionally high levels of oleic acid in olive. The oleaster genome thus provides important insights into the evolution of oil biosynthesis and will be a valuable resource for oil crop genomics. PMID:29078332

  2. Genome-wide survey and expression analysis of F-box genes in chickpea.

    PubMed

    Gupta, Shefali; Garg, Vanika; Kant, Chandra; Bhatia, Sabhyata

    2015-02-13

    The F-box genes constitute one of the largest gene families in plants involved in degradation of cellular proteins. F-box proteins can recognize a wide array of substrates and regulate many important biological processes such as embryogenesis, floral development, plant growth and development, biotic and abiotic stress, hormonal responses and senescence, among others. However, little is known about the F-box genes in the important legume crop, chickpea. The available draft genome sequence of chickpea allowed us to conduct a genome-wide survey of the F-box gene family in chickpea. A total of 285 F-box genes were identified in chickpea which were classified based on their C-terminal domain structures into 10 subfamilies. Thirteen putative novel motifs were also identified in F-box proteins with no known functional domain at their C-termini. The F-box genes were physically mapped on the 8 chickpea chromosomes and duplication events were investigated which revealed that the F-box gene family expanded largely due to tandem duplications. Phylogenetic analysis classified the chickpea F-box genes into 9 clusters. Also, maximum syntenic relationship was observed with soybean followed by Medicago truncatula, Lotus japonicus and Arabidopsis. Digital expression analysis of F-box genes in various chickpea tissues as well as under abiotic stress conditions utilizing the available chickpea transcriptome data revealed differential expression patterns with several F-box genes specifically expressing in each tissue, few of which were validated by using quantitative real-time PCR. The genome-wide analysis of chickpea F-box genes provides new opportunities for characterization of candidate F-box genes and elucidation of their function in growth, development and stress responses for utilization in chickpea improvement.

  3. Genome-wide analysis of basic helix-loop-helix (bHLH) transcription factors in Brachypodium distachyon.

    PubMed

    Niu, Xin; Guan, Yuxiang; Chen, Shoukun; Li, Haifeng

    2017-08-15

    As a superfamily of transcription factors (TFs), the basic helix-loop-helix (bHLH) proteins have been characterized functionally in many plants with a vital role in the regulation of diverse biological processes including growth, development, response to various stresses, and so on. However, no systemic analysis of the bHLH TFs has been reported in Brachypodium distachyon, an emerging model plant in Poaceae. A total of 146 bHLH TFs were identified in the Brachypodium distachyon genome and classified into 24 subfamilies. BdbHLHs in the same subfamily share similar protein motifs and gene structures. Gene duplication events showed a close relationship to rice, maize and sorghum, and segment duplications might play a key role in the expansion of this gene family. The amino acid sequence of the bHLH domains were quite conservative, especially Leu-27 and Leu-54. Based on the predicted binding activities, the BdbHLHs were divided into DNA binding and non-DNA binding types. According to the gene ontology (GO) analysis, BdbHLHs were speculated to function in homodimer or heterodimer manner. By integrating the available high throughput data in public database and results of quantitative RT-PCR, we found the expression profiles of BdbHLHs were different, implying their differentiated functions. One hundred fourty-six BdbHLHs were identified and their conserved domains, sequence features, phylogenetic relationship, chromosomal distribution, GO annotations, gene structures, gene duplication and expression profiles were investigated. Our findings lay a foundation for further evolutionary and functional elucidation of BdbHLH genes.

  4. Genome-wide identification, characterisation and expression analysis of the MADS-box gene family in Prunus mume.

    PubMed

    Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia

    2014-10-01

    MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might experience a species-specific evolution process in P. mume. The MADS-box gene family might experience an evolution process from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and the functions of duplicated genes diverged after the duplication events. In addition to its involvement in the development of female gametophytes, type I genes also play roles in male gametophytes development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes played in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.

  5. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress

    PubMed Central

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K.; Asif, Mehar H.

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively. PMID:27014321

  6. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress.

    PubMed

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K; Asif, Mehar H

    2016-01-01

    The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively.

  7. A genome-wide survey of homeodomain-leucine zipper genes and analysis of cold-responsive HD-Zip I members' expression in tomato.

    PubMed

    Zhang, Zhenzhu; Chen, Xiuling; Guan, Xin; Liu, Yang; Chen, Hongyu; Wang, Tingting; Mouekouba, Liana Dalcantara Ongouya; Li, Jingfu; Wang, Aoxue

    2014-01-01

    Homeodomain-leucine zipper (HD-Zip) proteins are a kind of transcriptional factors that play a vital role in plant growth and development. However, no detailed information of HD-Zip family in tomato has been reported till now. In this study, 51 HD-Zip genes (SlHZ01-51) in this family were identified and categorized into 4 classes by exon-intron and protein structure in tomato (Solanum lycopersicum) genome. The synthetical phylogenetic tree of tomato, Arabidopsis and rice HD-Zip genes were established for an insight into their evolutionary relationships and putative functions. The results showed that the contribution of segmental duplication was larger than that of tandem duplication for expansion and evolution of genes in this family of tomato. The expression profile results under abiotic stress suggested that all SlHZ I genes were responsive to cold stress. This study will provide a clue for the further investigation of functional identification and the role of tomato HD-Zip I subfamily in plant cold stress responses and developmental events.

  8. Genome-wide characterization of phenylalanine ammonia-lyase gene family in watermelon (Citrullus lanatus).

    PubMed

    Dong, Chun-Juan; Shang, Qing-Mao

    2013-07-01

    Phenylalanine ammonia-lyase (PAL), the first enzyme in the phenylpropanoid pathway, plays a critical role in plant growth, development, and adaptation. PAL enzymes are encoded by a gene family in plants. Here, we report a genome-wide search for PAL genes in watermelon. A total of 12 PAL genes, designated ClPAL1-12, are identified . Nine are arranged in tandem in two duplication blocks located on chromosomes 4 and 7, and the other three ClPAL genes are distributed as single copies on chromosomes 2, 3, and 8. Both the cDNA and protein sequences of ClPALs share an overall high identity with each other. A phylogenetic analysis places 11 of the ClPALs into a separate cucurbit subclade, whereas ClPAL2, which belongs to neither monocots nor dicots, may serve as an ancestral PAL in plants. In the cucurbit subclade, seven ClPALs form homologous pairs with their counterparts from cucumber. Expression profiling reveals that 11 of the ClPAL genes are expressed and show preferential expression in the stems and male and female flowers. Six of the 12 ClPALs are moderately or strongly expressed in the fruits, particularly in the pulp, suggesting the potential roles of PAL in the development of fruit color and flavor. A promoter motif analysis of the ClPAL genes implies redundant but distinctive cis-regulatory structures for stress responsiveness. Finally, duplication events during the evolution and expansion of the ClPAL gene family are discussed, and the relationships between the ClPAL genes and their cucumber orthologs are estimated.

  9. Molecular evolution and patterns of duplication in the SEP/AGL6-like lineage of the Zingiberales: a proposed mechanism for floral diversification.

    PubMed

    Yockteng, Roxana; Almeida, Ana M R; Morioka, Kelsie; Alvarez-Buylla, Elena R; Specht, Chelsea D

    2013-11-01

    The diversity of floral forms in the plant order Zingiberales has evolved through alterations in floral organ morphology. One striking alteration is the shift from fertile, filamentous stamens to sterile, laminar (petaloid) organs in the stamen whorls, attributed to specific pollination syndromes. Here, we examine the role of the SEPALLATA (SEP) genes, known to be important in regulatory networks underlying floral development and organ identity, in the evolution of development of the diverse floral organs phenotypes in the Zingiberales. Phylogenetic analyses show that the SEP-like genes have undergone several duplication events giving rise to multiple copies. Selection tests on the SEP-like genes indicate that the two copies of SEP3 have mostly evolved under balancing selection, probably due to strong functional restrictions as a result of their critical role in floral organ specification. In contrast, the two LOFSEP copies have undergone differential positive selection, indicating neofunctionalization. Reverse transcriptase-polymerase chain reaction, gene expression from RNA-seq data, and in situ hybridization analyses show that the recovered genes have differential expression patterns across the various whorls and organ types found in the Zingiberales. Our data also suggest that AGL6, sister to the SEP-like genes, may play an important role in stamen morphology in the Zingiberales. Thus, the SEP-like genes are likely to be involved in some of the unique morphogenetic patterns of floral organ development found among this diverse order of tropical monocots. This work contributes to a growing body of knowledge focused on understanding the role of gene duplications and the evolution of entire gene networks in the evolution of flower development.

  10. Duplicated Leptin Receptors in Two Species of Eel Bring New Insights into the Evolution of the Leptin System in Vertebrates

    PubMed Central

    Morini, Marina; Pasquier, Jérémy; Dirks, Ron; van den Thillart, Guido; Tomkiewicz, Jonna; Rousseau, Karine; Dufour, Sylvie; Lafont, Anne-Gaëlle

    2015-01-01

    Since its discovery in mammals as a key-hormone in reproduction and metabolism, leptin has been identified in an increasing number of tetrapods and teleosts. Tetrapods possess only one leptin gene, while most teleosts possess two leptin genes, as a result of the teleost third whole genome duplication event (3R). Leptin acts through a specific receptor (LEPR). In the European and Japanese eels, we identified two leptin genes, and for the first time in vertebrates, two LEPR genes. Synteny analyses indicated that eel LEPRa and LEPRb result from teleost 3R. LEPRb seems to have been lost in the teleost lineage shortly after the elopomorph divergence. Quantitative PCRs revealed a wide distribution of leptins and LEPRs in the European eel, including tissues involved in metabolism and reproduction. Noticeably, leptin1 was expressed in fat tissue, while leptin2 in the liver, reflecting subfunctionalization. Four-month fasting had no impact on the expression of leptins and LEPRs in control European eels. This might be related to the remarkable adaptation of silver eel metabolism to long-term fasting throughout the reproductive oceanic migration. In contrast, sexual maturation induced differential increases in the expression of leptins and LEPRs in the BPG-liver axis. Leptin2 was strikingly upregulated in the liver, the central organ of the reproductive metabolic challenge in teleosts. LEPRs were differentially regulated during sexual maturation, which may have contributed to the conservation of the duplicated LEPRs in this species. This suggests an ancient and positive role of the leptin system in the vertebrate reproductive function. This study brings new insights on the evolutionary history of the leptin system in vertebrates. Among extant vertebrates, the eel represents a unique case of duplicated leptins and leptin receptors as a result of 3R. PMID:25946034

  11. Zebrafish hox paralogue group 2 genes function redundantly as selector genes to pattern the second pharyngeal arch.

    PubMed

    Hunter, Michael P; Prince, Victoria E

    2002-07-15

    The pharyngeal arches are one of the defining features of the vertebrates, with the first arch forming the mandibles of the jaw and the second forming jaw support structures. The cartilaginous elements of each arch are formed from separate migratory neural crest cell streams, which derive from the dorsal aspect of the neural tube. The second and more posterior crest streams are characterized by specific Hox gene expression. The zebrafish has a larger overall number of Hox genes than the tetrapod vertebrates, as the result of a duplication event in its lineage. However, in both zebrafish and mouse, there are just two members of Hox paralogue group 2 (PG2): Hoxa2 and Hoxb2. Here, we show that morpholino-mediated "knock-down" of both zebrafish Hox PG2 genes results in major defects in second pharyngeal arch cartilages, involving replacement of ventral elements with a mirror-image duplication of first arch structures, and accompanying changes to pharyngeal musculature. In the mouse, null mutants of Hoxa2 have revealed that this single Hox gene is required for normal second arch patterning. By contrast, loss-of-function of either zebrafish Hox PG2 gene individually has no phenotypic consequence, showing that these two genes function redundantly to confer proper pattern to the second pharyngeal arch. We have also used hoxb1a mis-expression to induce localized ectopic expression of zebrafish Hox PG2 genes in the first arch; using this strategy, we find that ectopic expression of either Hox PG2 gene can confer second arch identity onto first arch structures, suggesting that the zebrafish Hox PG2 genes act as "selector genes." 2002 Elsevier Science (USA).

  12. Comparative Transcriptome Analyses Reveal Core Parasitism Genes and Suggest Gene Duplication and Repurposing as Sources of Structural Novelty

    PubMed Central

    Yang, Zhenzhen; Wafula, Eric K.; Honaas, Loren A.; Zhang, Huiting; Das, Malay; Fernandez-Aparicio, Monica; Huang, Kan; Bandaranayake, Pradeepa C.G.; Wu, Biao; Der, Joshua P.; Clarke, Christopher R.; Ralph, Paula E.; Landherr, Lena; Altman, Naomi S.; Timko, Michael P.; Yoder, John I.; Westwood, James H.; dePamphilis, Claude W.

    2015-01-01

    The origin of novel traits is recognized as an important process underlying many major evolutionary radiations. We studied the genetic basis for the evolution of haustoria, the novel feeding organs of parasitic flowering plants, using comparative transcriptome sequencing in three species of Orobanchaceae. Around 180 genes are upregulated during haustorial development following host attachment in at least two species, and these are enriched in proteases, cell wall modifying enzymes, and extracellular secretion proteins. Additionally, about 100 shared genes are upregulated in response to haustorium inducing factors prior to host attachment. Collectively, we refer to these newly identified genes as putative “parasitism genes.” Most of these parasitism genes are derived from gene duplications in a common ancestor of Orobanchaceae and Mimulus guttatus, a related nonparasitic plant. Additionally, the signature of relaxed purifying selection and/or adaptive evolution at specific sites was detected in many haustorial genes, and may play an important role in parasite evolution. Comparative analysis of gene expression patterns in parasitic and nonparasitic angiosperms suggests that parasitism genes are derived primarily from root and floral tissues, but with some genes co-opted from other tissues. Gene duplication, often taking place in a nonparasitic ancestor of Orobanchaceae, followed by regulatory neofunctionalization, was an important process in the origin of parasitic haustoria. PMID:25534030

  13. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-07-29

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Quantifying the major mechanisms of recent gene duplications in the human and mouse genomes: a novel strategy to estimate gene duplication rates

    PubMed Central

    Pan, Deng; Zhang, Liqing

    2007-01-01

    Background The rate of gene duplication is an important parameter in the study of evolution, but the influence of gene conversion and technical problems have confounded previous attempts to provide a satisfying estimate. We propose a new strategy to estimate the rate that involves separate quantification of the rates of two different mechanisms of gene duplication and subsequent combination of the two rates, based on their respective contributions to the overall gene duplication rate. Results Previous estimates of gene duplication rates are based on small gene families. Therefore, to assess the applicability of this to families of all sizes, we looked at both two-copy gene families and the entire genome. We studied unequal crossover and retrotransposition, and found that these mechanisms of gene duplication are largely independent and account for a substantial amount of duplicated genes. Unequal crossover contributed more to duplications in the entire genome than retrotransposition did, but this contribution was significantly less in two-copy gene families, and duplicated genes arising from this mechanism are more likely to be retained. Combining rates of duplication using the two mechanisms, we estimated the overall rates to be from approximately 0.515 to 1.49 × 10-3 per gene per million years in human, and from approximately 1.23 to 4.23 × 10-3 in mouse. The rates estimated from two-copy gene families are always lower than those from the entire genome, and so it is not appropriate to use small families to estimate the rate for the entire genome. Conclusion We present a novel strategy for estimating gene duplication rates. Our results show that different mechanisms contribute differently to the evolution of small and large gene families. PMID:17683522

  15. Eye development in the four-eyed fish Anableps anableps: cranial and retinal adaptations to simultaneous aerial and aquatic vision

    PubMed Central

    Perez, Louise N.; Lorena, Jamily; Costa, Carinne M.; Araujo, Maysa S.; Frota-Lima, Gabriela N.; Matos-Rodrigues, Gabriel E.; Martins, Rodrigo A. P.; Mattox, George M. T.

    2017-01-01

    The unique eyes of the four-eyed fish Anableps anableps have long intrigued biologists. Key features associated with the bulging eye of Anableps include the expanded frontal bone and the duplicated pupils and cornea. Furthermore, the Anableps retina expresses different photoreceptor genes in dorsal and ventral regions, potentially associated with distinct aerial and aquatic stimuli. To gain insight into the developmental basis of the Anableps unique eye, we examined neurocranium and eye ontogeny, as well as photoreceptor gene expression during larval stages. First, we described six larval stages during which duplication of eye structures occurs. Our osteological analysis of neurocranium ontogeny revealed another distinctive Anablepid feature: an ossified interorbital septum partially separating the orbital cavities. Furthermore, we identified the onset of differences in cell proliferation and cell layer density between dorsal and ventral regions of the retina. Finally, we show that differential photoreceptor gene expression in the retina initiates during development, suggesting that it is inherited and not environmentally determined. In sum, our results shed light on the ontogenetic steps leading to the highly derived Anableps eye. PMID:28381624

  16. Genome-wide identification of WRKY transcription factors in kiwifruit (Actinidia spp.) and analysis of WRKY expression in responses to biotic and abiotic stresses.

    PubMed

    Jing, Zhaobin; Liu, Zhande

    2018-04-01

    As one of the largest transcriptional factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, o total of 97 AcWRKY genes have been identified in the kiwifruit genome. An overview of these AcWRKY genes is analyzed, including the phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplication were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.

  17. Genome-Wide Identification of R2R3-MYB Genes and Expression Analyses During Abiotic Stress in Gossypium raimondii

    PubMed Central

    He, Qiuling; Jones, Don C.; Li, Wei; Xie, Fuliang; Ma, Jun; Sun, Runrun; Wang, Qinglian; Zhu, Shuijin; Zhang, Baohong

    2016-01-01

    The R2R3-MYB is one of the largest families of transcription factors, which have been implicated in multiple biological processes. There is great diversity in the number of R2R3-MYB genes in different plants. However, there is no report on genome-wide characterization of this gene family in cotton. In the present study, a total of 205 putative R2R3-MYB genes were identified in cotton D genome (Gossypium raimondii), that are much larger than that found in other cash crops with fully sequenced genomes. These GrMYBs were classified into 13 groups with the R2R3-MYB genes from Arabidopsis and rice. The amino acid motifs and phylogenetic tree were predicted and analyzed. The sequences of GrMYBs were distributed across 13 chromosomes at various densities. The results showed that the expansion of the G. Raimondii R2R3-MYB family was mainly attributable to whole genome duplication and segmental duplication. Moreover, the expression pattern of 52 selected GrMYBs and 46 GaMYBs were tested in roots and leaves under different abiotic stress conditions. The results revealed that the MYB genes in cotton were differentially expressed under salt and drought stress treatment. Our results will be useful for determining the precise role of the MYB genes during stress responses with crop improvement. PMID:27009386

  18. Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication.

    PubMed

    Chapman, Brad A; Bowers, John E; Feltus, Frank A; Paterson, Andrew H

    2006-02-21

    Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a "paleologous" locus than in "singleton" genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during "genomic turmoil" immediately after genome duplication but continues to act approximately 60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages.

  19. Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication

    PubMed Central

    Chapman, Brad A.; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.

    2006-01-01

    Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a “paleologous” locus than in “singleton” genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during “genomic turmoil” immediately after genome duplication but continues to act ≈60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages. PMID:16467140

  20. Directed evolution for thermostabilization of a hygromycin B phosphotransferase from Streptomyces hygroscopicus.

    PubMed

    Sugimoto, Naohisa; Takakura, Yasuaki; Shiraki, Kentaro; Honda, Shinya; Takaya, Naoki; Hoshino, Takayuki; Nakamura, Akira

    2013-01-01

    To obtain a selection marker gene functional in a thermophilic bacterium, Thermus thermophilus, an in vivo-directed evolutionary strategy was conducted on a hygromycin B phosphotransferase gene (hyg) from Streptomyces hygroscopicus. The expression of wild-type hyg in T. thermophilus provided hygromycin B (HygB) resistance up to 60 °C. Through selection of mutants showing HygB resistance at higher temperatures, eight amino acid substitutions and the duplication of three amino acids were identified. A variant containing seven substitutions and the duplication (HYG10) showed HygB resistance at a highest temperature of 74 °C. Biochemical and biophysical analyses of recombinant HYG and HYG10 revealed that HYG10 was in fact thermostabilized. Modeling of the three-dimensional structure of HYG10 suggests the possible roles of the various substitutions and the duplication on thermostabilization, of which three substitutions and the duplication located at the enzyme surface suggested that these mutations made the enzyme more hydrophilic and provided increased stability in aqueous solution.

  1. Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae)

    PubMed Central

    Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M

    2013-01-01

    Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455

  2. Severe Expressive-Language Delay Related to Duplication of the Williams–Beuren Locus

    PubMed Central

    Somerville, Martin J.; Mervis, Carolyn B.; Young, Edwin J.; Seo, Eul-Ju; del Campo, Miguel; Bamforth, Stephen; Peregrine, Ella; Loo, Wayne; Lilley, Margaret; Pérez-Jurado, Luis A.; Morris, Colleen A.; Scherer, Stephen W.; Osborne, Lucy R.

    2010-01-01

    SUMMARY The Williams–Beuren syndrome (WBS) locus, at 7q11.23, is prone to recurrent chromosomal rearrangements, including the microdeletion that causes WBS, a multisystem condition with characteristic cardiovascular, cognitive, and behavioral features. It is hypothesized that reciprocal duplications of the WBS interval should also occur, and here we present such a case description. The most striking phenotype was a severe delay in expressive speech, in contrast to the normal articulation and fluent expressive language observed in persons with WBS. Our results suggest that specific genes at 7q11.23 are exquisitely sensitive to dosage alterations that can influence human language and visuospatial capabilities. PMID:16236740

  3. To B or Not to B a flower: the role of DEFICIENS and GLOBOSA orthologs in the evolution of the angiosperms.

    PubMed

    Zahn, L M; Leebens-Mack, J; DePamphilis, C W; Ma, H; Theissen, G

    2005-01-01

    DEFICIENS (DEF) and GLOBOSA (GLO) function in petal and stamen organ identity in Antirrhinum and are orthologs of APETALA3 and PISTILLATA in Arabidopsis. These genes are known as B-function genes for their role in the ABC genetic model of floral organ identity. Phylogenetic analyses show that DEF and GLO are closely related paralogs, having originated from a gene duplication event after the separation of the lineages leading to the extant gymnosperms and the extant angiosperms. Several additional gene duplications followed, providing multiple potential opportunities for functional divergence. In most angiosperms studied to date, genes in the DEF/GLO MADS-box subfamily are expressed in the petals and stamens during flower development. However, in some angiosperms, the expression of DEF and GLO orthologs are occasionally observed in the first and fourth whorls of flowers or in nonfloral organs, where their function is unknown. In this article we review what is known about function, phylogeny, and expression in the DEF/GLO subfamily to examine their evolution in the angiosperms. Our analyses demonstrate that although the primary role of the DEF/GLO subfamily appears to be in specifying the stamens and inner perianth, several examples of potential sub- and neofunctionalization are observed.

  4. Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution

    PubMed Central

    Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.

    2014-01-01

    Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106

  5. Molecular Characterization of Soybean Pterocarpan 2-Dimethylallyltransferase in Glyceollin Biosynthesis: Local Gene and Whole-Genome Duplications of Prenyltransferase Genes Led to the Structural Diversity of Soybean Prenylated Isoflavonoids

    PubMed Central

    Yoneyama, Keisuke; Akashi, Tomoyoshi; Aoki, Toshio

    2016-01-01

    Soybean (Glycine max) accumulates several prenylated isoflavonoid phytoalexins, collectively referred to as glyceollins. Glyceollins (I, II, III, IV and V) possess modified pterocarpan skeletons with C5 moieties from dimethylallyl diphosphate, and they are commonly produced from (6aS, 11aS)-3,9,6a-trihydroxypterocarpan [(−)-glycinol]. The metabolic fate of (−)-glycinol is determined by the enzymatic introduction of a dimethylallyl group into C-4 or C-2, which is reportedly catalyzed by regiospecific prenyltransferases (PTs). 4-Dimethylallyl (−)-glycinol and 2-dimethylallyl (−)-glycinol are precursors of glyceollin I and other glyceollins, respectively. Although multiple genes encoding (−)-glycinol biosynthetic enzymes have been identified, those involved in the later steps of glyceollin formation mostly remain unidentified, except for (−)-glycinol 4-dimethylallyltransferase (G4DT), which is involved in glyceollin I biosynthesis. In this study, we identified four genes that encode isoflavonoid PTs, including (−)-glycinol 2-dimethylallyltransferase (G2DT), using homology-based in silico screening and biochemical characterization in yeast expression systems. Transcript analyses illustrated that changes in G2DT gene expression were correlated with the induction of glyceollins II, III, IV and V in elicitor-treated soybean cells and leaves, suggesting its involvement in glyceollin biosynthesis. Moreover, the genomic signatures of these PT genes revealed that G4DT and G2DT are paralogs derived from whole-genome duplications of the soybean genome, whereas other PT genes [isoflavone dimethylallyltransferase 1 (IDT1) and IDT2] were derived via local gene duplication on soybean chromosome 11. PMID:27986914

  6. Directed evolution induces tributyrin hydrolysis in a virulence factor of Xylella fastidiosa using a duplicated gene as a template.

    PubMed

    Gouran, Hossein; Chakraborty, Sandeep; Rao, Basuthkar J; Asgeirsson, Bjarni; Dandekar, Abhaya

    2014-01-01

    Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction.

  7. New insights into the nutritional regulation of gluconeogenesis in carnivorous rainbow trout (Oncorhynchus mykiss): a gene duplication trail.

    PubMed

    Marandel, Lucie; Seiliez, Iban; Véron, Vincent; Skiba-Cassy, Sandrine; Panserat, Stéphane

    2015-07-01

    The rainbow trout (Oncorhynchus mykiss) is considered to be a strictly carnivorous fish species that is metabolically adapted for high catabolism of proteins and low utilization of dietary carbohydrates. This species consequently has a "glucose-intolerant" phenotype manifested by persistent hyperglycemia when fed a high-carbohydrate diet. Gluconeogenesis in adult fish is also poorly, if ever, regulated by carbohydrates, suggesting that this metabolic pathway is involved in this specific phenotype. In this study, we hypothesized that the fate of duplicated genes after the salmonid-specific 4th whole genome duplication (Ss4R) may have led to adaptive innovation and that their study might provide new elements to enhance our understanding of gluconeogenesis and poor dietary carbohydrate use in this species. Our evolutionary analysis of gluconeogenic genes revealed that pck1, pck2, fbp1a, and g6pca were retained as singletons after Ss4r, while g6pcb1, g6pcb2, and fbp1b ohnolog pairs were maintained. For all genes, duplication may have led to sub- or neofunctionalization. Expression profiles suggest that the gluconeogenesis pathway remained active in trout fed a no-carbohydrate diet. When trout were fed a high-carbohydrate diet (30%), most of the gluconeogenic genes were non- or downregulated, except for g6pbc2 ohnologs, whose RNA levels were surprisingly increased. This study demonstrates that Ss4R in trout involved adaptive innovation via gene duplication and via the outcome of the resulting ohnologs. Indeed, maintenance of ohnologous g6pcb2 pair may contribute in a significant way to the glucose-intolerant phenotype of trout and may partially explain its poor use of dietary carbohydrates. Copyright © 2015 the American Physiological Society.

  8. Directed evolution induces tributyrin hydrolysis in a virulence factor of Xylella fastidiosa using a duplicated gene as a template

    PubMed Central

    Rao, Basuthkar J.; Asgeirsson, Bjarni; Dandekar, Abhaya

    2014-01-01

    Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction. PMID:25717364

  9. Diurnal Cycling Transcription Factors of Pineapple Revealed by Genome-Wide Annotation and Global Transcriptomic Analysis

    PubMed Central

    Sharma, Anupma; Wai, Ching Man; Ming, Ray

    2017-01-01

    Abstract Circadian clock provides fitness advantage by coordinating internal metabolic and physiological processes to external cyclic environments. Core clock components exhibit daily rhythmic changes in gene expression, and the majority of them are transcription factors (TFs) and transcription coregulators (TCs). We annotated 1,398 TFs from 67 TF families and 80 TCs from 20 TC families in pineapple, and analyzed their tissue-specific and diurnal expression patterns. Approximately 42% of TFs and 45% of TCs displayed diel rhythmic expression, including 177 TF/TCs cycling only in the nonphotosynthetic leaf tissue, 247 cycling only in the photosynthetic leaf tissue, and 201 cycling in both. We identified 68 TF/TCs whose cycling expression was tightly coupled between the photosynthetic and nonphotosynthetic leaf tissues. These TF/TCs likely coordinate key biological processes in pineapple as we demonstrated that this group is enriched in homologous genes that form the core circadian clock in Arabidopsis and includes a STOP1 homolog. Two lines of evidence support the important role of the STOP1 homolog in regulating CAM photosynthesis in pineapple. First, STOP1 responds to acidic pH and regulates a malate channel in multiple plant species. Second, the cycling expression pattern of the pineapple STOP1 and the diurnal pattern of malate accumulation in pineapple leaf are correlated. We further examined duplicate-gene retention and loss in major known circadian genes and refined their evolutionary relationships between pineapple and other plants. Significant variations in duplicate-gene retention and loss were observed for most clock genes in both monocots and dicots. PMID:28922793

  10. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

    PubMed Central

    Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H

    2014-01-01

    Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848

  11. Genome Mutational and Transcriptional Hotspots Are Traps for Duplicated Genes and Sources of Adaptations.

    PubMed

    Fares, Mario A; Sabater-Muñoz, Beatriz; Toft, Christina

    2017-05-01

    Gene duplication generates new genetic material, which has been shown to lead to major innovations in unicellular and multicellular organisms. A whole-genome duplication occurred in the ancestor of Saccharomyces yeast species but 92% of duplicates returned to single-copy genes shortly after duplication. The persisting duplicated genes in Saccharomyces led to the origin of major metabolic innovations, which have been the source of the unique biotechnological capabilities in the Baker's yeast Saccharomyces cerevisiae. What factors have determined the fate of duplicated genes remains unknown. Here, we report the first demonstration that the local genome mutation and transcription rates determine the fate of duplicates. We show, for the first time, a preferential location of duplicated genes in the mutational and transcriptional hotspots of S. cerevisiae genome. The mechanism of duplication matters, with whole-genome duplicates exhibiting different preservation trends compared to small-scale duplicates. Genome mutational and transcriptional hotspots are rich in duplicates with large repetitive promoter elements. Saccharomyces cerevisiae shows more tolerance to deleterious mutations in duplicates with repetitive promoter elements, which in turn exhibit higher transcriptional plasticity against environmental perturbations. Our data demonstrate that the genome traps duplicates through the accelerated regulatory and functional divergence of their gene copies providing a source of novel adaptations in yeast. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Variable Penetrance of the 15q11.2 BP1-BP2 Microduplication in a Family with Cognitive and Language Impairment

    PubMed Central

    Benítez-Burraco, Antonio; Barcos-Martínez, Montserrat; Espejo-Portero, Isabel; Jiménez-Romero, Salud

    2017-01-01

    The 15q11.2 BP1-BP2 region is found duplicated or deleted in people with cognitive, language, and behavioral impairment. We report on a family (a father and 3 male twin siblings) that presents with a duplication of the 15q11.2 BP1-BP2 region and a variable phenotype: the father and the fraternal twin are normal carriers, whereas the monozygotic twins exhibit severe language and cognitive delay as well as behavioral disturbances. The genes located within the duplicated region are involved in brain development and function, and some of them are related to language processing. The probands' phenotype may result from changes in the expression level of some of these genes important for cognitive development. PMID:28588435

  13. Ancient Duplications and Expression Divergence in the Globin Gene Superfamily of Vertebrates: Insights from the Elephant Shark Genome and Transcriptome.

    PubMed

    Opazo, Juan C; Lee, Alison P; Hoffmann, Federico G; Toloza-Villalobos, Jessica; Burmester, Thorsten; Venkatesh, Byrappa; Storz, Jay F

    2015-07-01

    Comparative analyses of vertebrate genomes continue to uncover a surprising diversity of genes in the globin gene superfamily, some of which have very restricted phyletic distributions despite their antiquity. Genomic analysis of the globin gene repertoire of cartilaginous fish (Chondrichthyes) should be especially informative about the duplicative origins and ancestral functions of vertebrate globins, as divergence between Chondrichthyes and bony vertebrates represents the most basal split within the jawed vertebrates. Here, we report a comparative genomic analysis of the vertebrate globin gene family that includes the complete globin gene repertoire of the elephant shark (Callorhinchus milii). Using genomic sequence data from representatives of all major vertebrate classes, integrated analyses of conserved synteny and phylogenetic relationships revealed that the last common ancestor of vertebrates possessed a repertoire of at least seven globin genes: single copies of androglobin and neuroglobin, four paralogous copies of globin X, and the single-copy progenitor of the entire set of vertebrate-specific globins. Combined with expression data, the genomic inventory of elephant shark globins yielded four especially surprising findings: 1) there is no trace of the neuroglobin gene (a highly conserved gene that is present in all other jawed vertebrates that have been examined to date), 2) myoglobin is highly expressed in heart, but not in skeletal muscle (reflecting a possible ancestral condition in vertebrates with single-circuit circulatory systems), 3) elephant shark possesses two highly divergent globin X paralogs, one of which is preferentially expressed in gonads, and 4) elephant shark possesses two structurally distinct α-globin paralogs, one of which is preferentially expressed in the brain. Expression profiles of elephant shark globin genes reveal distinct specializations of function relative to orthologs in bony vertebrates and suggest hypotheses about ancestral functions of vertebrate globins. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. Functional requirements driving the gene duplication in 12 Drosophila species.

    PubMed

    Zhong, Yan; Jia, Yanxiao; Gao, Yang; Tian, Dacheng; Yang, Sihai; Zhang, Xiaohui

    2013-08-15

    Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila.

  15. Genome-wide identification and analysis of the aldehyde dehydrogenase (ALDH) gene superfamily in apple (Malus × domestica Borkh.).

    PubMed

    Li, Xiaoqin; Guo, Rongrong; Li, Jun; Singer, Stacy D; Zhang, Yucheng; Yin, Xiangjing; Zheng, Yi; Fan, Chonghui; Wang, Xiping

    2013-10-01

    Aldehyde dehydrogenases (ALDHs) represent a protein superfamily encoding NAD(P)(+)-dependent enzymes that oxidize a wide range of endogenous and exogenous aliphatic and aromatic aldehydes. In plants, they are involved in many biological processes and play a role in the response to environmental stress. In this study, a total of 39 ALDH genes from ten families were identified in the apple (Malus × domestica Borkh.) genome. Synteny analysis of the apple ALDH (MdALDH) genes indicated that segmental and tandem duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of these gene families in apple. Moreover, synteny analysis between apple and Arabidopsis demonstrated that several MdALDH genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes appeared before the divergence of lineages that led to apple and Arabidopsis. In addition, phylogenetic analysis, as well as comparisons of exon-intron and protein structures, provided further insight into both their evolutionary relationships and their putative functions. Tissue-specific expression analysis of the MdALDH genes demonstrated diverse spatiotemporal expression patterns, while their expression profiles under abiotic stress and various hormone treatments indicated that many MdALDH genes were responsive to high salinity and drought, as well as different plant hormones. This genome-wide identification, as well as characterization of evolutionary relationships and expression profiles, of the apple MdALDH genes will not only be useful for the further analysis of ALDH genes and their roles in stress response, but may also aid in the future improvement of apple stress tolerance. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  16. The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons

    PubMed Central

    Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.

    2016-01-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095

  17. Expression and Sequence Evolution of Aromatase cyp19a1 and Other Sexual Development Genes in East African Cichlid Fishes

    PubMed Central

    Böhne, Astrid; Heule, Corina; Boileau, Nicolas; Salzburger, Walter

    2013-01-01

    Sex determination mechanisms are highly variable across teleost fishes and sexual development is often plastic. Nevertheless, downstream factors establishing the two sexes are presumably conserved. Here, we study sequence evolution and gene expression of core genes of sexual development in a prime model system in evolutionary biology, the East African cichlid fishes. Using the available five cichlid genomes, we test for signs of positive selection in 28 genes including duplicates from the teleost whole-genome duplication, and examine the expression of these candidate genes in three cichlid species. We then focus on a particularly striking case, the A- and B-copies of the aromatase cyp19a1, and detect different evolutionary trajectories: cyp19a1A evolved under strong positive selection, whereas cyp19a1B remained conserved at the protein level, yet is subject to regulatory changes at its transcription start sites. Importantly, we find shifts in gene expression in both copies. Cyp19a1 is considered the most conserved ovary-factor in vertebrates, and in all teleosts investigated so far, cyp19a1A and cyp19a1B are expressed in ovaries and the brain, respectively. This is not the case in cichlids, where we find new expression patterns in two derived lineages: the A-copy gained a novel testis-function in the Ectodine lineage, whereas the B-copy is overexpressed in the testis of the speciest-richest cichlid group, the Haplochromini. This suggests that even key factors of sexual development, including the sex steroid pathway, are not conserved in fish, supporting the idea that flexibility in sexual determination and differentiation may be a driving force of speciation. PMID:23883521

  18. A second visual rhodopsin gene, rh1-2, is expressed in zebrafish photoreceptors and found in other ray-finned fishes.

    PubMed

    Morrow, James M; Lazic, Savo; Dixon Fox, Monica; Kuo, Claire; Schott, Ryan K; de A Gutierrez, Eduardo; Santini, Francesco; Tropepe, Vincent; Chang, Belinda S W

    2017-01-15

    Rhodopsin (rh1) is the visual pigment expressed in rod photoreceptors of vertebrates that is responsible for initiating the critical first step of dim-light vision. Rhodopsin is usually a single copy gene; however, we previously discovered a novel rhodopsin-like gene expressed in the zebrafish retina, rh1-2, which we identified as a functional photosensitive pigment that binds 11-cis retinal and activates in response to light. Here, we localized expression of rh1-2 in the zebrafish retina to a subset of peripheral photoreceptor cells, which indicates a partially overlapping expression pattern with rh1 We also expressed, purified and characterized Rh1-2, including investigation of the stability of the biologically active intermediate. Using fluorescence spectroscopy, we found the half-life of the rate of retinal release of Rh1-2 following photoactivation to be more similar to that of the visual pigment rhodopsin than to the non-visual pigment exo-rhodopsin (exorh), which releases retinal around 5 times faster. Phylogenetic and molecular evolutionary analyses show that rh1-2 has ancient origins within teleost fishes, is under similar selective pressure to rh1, and likely experienced a burst of positive selection following its duplication and divergence from rh1 These findings indicate that rh1-2 is another functional visual rhodopsin gene, which contradicts the prevailing notion that visual rhodopsin is primarily found as a single copy gene within ray-finned fishes. The reasons for retention of this duplicate gene, as well as possible functional consequences for the visual system, are discussed. © 2017. Published by The Company of Biologists Ltd.

  19. UV induced foot duplication in regenerating hydra is mediated by metalloproteinases and modulation of the Wnt pathway.

    PubMed

    Krishnapati, Lakshmi-Surekha; Londhe, Rohini; Deoli, Vaishali; Barve, Apurva; Ghaskadbi, Saroj; Ghaskadbi, Surendra

    2016-01-01

    We have shown earlier that irradiation with UV induces duplication of foot in regenerating middle pieces of hydra. The present study was undertaken to elucidate the underlying mechanism(s) leading to this curious phenomenon. UV irradiation induced duplicated foot in about 30% of regenerating middle pieces. Metalloproteinases are important in foot formation, while Wnt pathway genes are important in head formation in hydra. The effect of UV irradiation on expression of these genes was studied by in situ hybridization and q-PCR. In whole polyps and middle pieces, UV irradiation led to up-regulation of HMP2 and HMMP, the two metalloproteinases involved in foot formation in hydra. HMP2 expression was significantly increased starting from 30 min post exposure to UV at 254 nm (500 J/m(2)), while HMMP showed significant up-regulation 6 h post UV exposure onwards. In middle pieces, increased expression of both metalloproteinases was observed only at 48 h. In whole polyps as well as in middle pieces, expression of Wnt3 and β-catenin was detected within 30 min of UV exposure and was accompanied by up-regulation of GSK3β, DKK3 and DKK1/2/4, inhibitors of the Wnt pathway. These conditions likely lead to inactivation of Wnt signaling. We therefore conclude that duplication of foot due to UV irradiation in regenerating middle pieces of hydra is a combined effect of up-regulation of metalloproteinases and inactivation of the Wnt pathway. Our results suggest that UV irradiation can be employed as a tool to understand patterning mechanisms during foot formation in hydra.

  20. Molecular cloning of doublesex genes of four cladocera (water flea) species.

    PubMed

    Toyota, Kenji; Kato, Yasuhiko; Sato, Masaru; Sugiura, Naomi; Miyagawa, Shinichi; Miyakawa, Hitoshi; Watanabe, Hajime; Oda, Shigeto; Ogino, Yukiko; Hiruta, Chizue; Mizutani, Takeshi; Tatarazako, Norihisa; Paland, Susanne; Jackson, Craig; Colbourne, John K; Iguchi, Taisen

    2013-04-10

    The gene doublesex (dsx) is known as a key factor regulating genetic sex determination in many organisms. We previously identified two dsx genes (DapmaDsx1 and DapmaDsx2) from a freshwater branchiopod crustacean, Daphnia magna, which are expressed in males but not in females. D. magna produces males by parthenogenesis in response to environmental cues (environmental sex determination) and we showed that DapmaDsx1 expression during embryonic stages is responsible for the male trait development. The D. magna dsx genes are thought to have arisen by a cladoceran-specific duplication; therefore, to investigate evolutionary conservation of sex specific expression of dsx genes and to further assess their functions in the environmental sex determination, we searched for dsx homologs in four closely related cladoceran species. We identified homologs of both dsx genes from, D. pulex, D. galeata, and Ceriodaphnia dubia, yet only a single dsx gene was found from Moina macrocopa. The deduced amino acid sequences of all 9 dsx homologs contained the DM and oligomerization domains, which are characteristic for all arthropod DSX family members. Molecular phylogenetic analysis suggested that the dsx gene duplication likely occurred prior to the divergence of these cladoceran species, because that of the giant tiger prawn Penaeus monodon is rooted ancestrally to both DSX1 and DSX2 of cladocerans. Therefore, this result also suggested that M. macrocopa lost dsx2 gene secondarily. Furthermore, all dsx genes identified in this study showed male-biased expression levels, yet only half of the putative 5' upstream regulatory elements are preserved in D. magna and D. pulex. The all dsx genes of five cladoceran species examined had similar amino acid structure containing highly conserved DM and oligomerization domains, and exhibited sexually dimorphic expression patterns, suggesting that these genes may have similar functions for environmental sex determination in cladocerans.

  1. Molecular cloning of doublesex genes of four cladocera (water flea) species

    PubMed Central

    2013-01-01

    Background The gene doublesex (dsx) is known as a key factor regulating genetic sex determination in many organisms. We previously identified two dsx genes (DapmaDsx1 and DapmaDsx2) from a freshwater branchiopod crustacean, Daphnia magna, which are expressed in males but not in females. D. magna produces males by parthenogenesis in response to environmental cues (environmental sex determination) and we showed that DapmaDsx1 expression during embryonic stages is responsible for the male trait development. The D. magna dsx genes are thought to have arisen by a cladoceran-specific duplication; therefore, to investigate evolutionary conservation of sex specific expression of dsx genes and to further assess their functions in the environmental sex determination, we searched for dsx homologs in four closely related cladoceran species. Results We identified homologs of both dsx genes from, D. pulex, D. galeata, and Ceriodaphnia dubia, yet only a single dsx gene was found from Moina macrocopa. The deduced amino acid sequences of all 9 dsx homologs contained the DM and oligomerization domains, which are characteristic for all arthropod DSX family members. Molecular phylogenetic analysis suggested that the dsx gene duplication likely occurred prior to the divergence of these cladoceran species, because that of the giant tiger prawn Penaeus monodon is rooted ancestrally to both DSX1 and DSX2 of cladocerans. Therefore, this result also suggested that M. macrocopa lost dsx2 gene secondarily. Furthermore, all dsx genes identified in this study showed male-biased expression levels, yet only half of the putative 5’ upstream regulatory elements are preserved in D. magna and D. pulex. Conclusions The all dsx genes of five cladoceran species examined had similar amino acid structure containing highly conserved DM and oligomerization domains, and exhibited sexually dimorphic expression patterns, suggesting that these genes may have similar functions for environmental sex determination in cladocerans. PMID:23575357

  2. The Interstitial Duplication 15q11.2-q13 Syndrome Includes Autism, Mild Facial Anomalies and a Characteristic EEG Signature

    PubMed Central

    Urraca, Nora; Cleary, Julie; Brewer, Victoria; Pivnick, Eniko K; McVicar, Kathryn; Thibert, Ronald L; Schanen, N Carolyn; Esmer, Carmen; Lamport, Dustin; Reiter, Lawrence T

    2013-01-01

    Chromosomal copy number variants (CNV) are the most common genetic lesion found in autism. Many autism-associated CNVs are duplications of chromosome 15q. Although most cases of interstitial (int) dup(15) that present clinically are de novo and maternally derived or inherited, both pathogenic and unaffected paternal duplications of 15q have been identified. We performed a phenotype/genotype analysis of individuals with interstitial 15q duplications to broaden our understanding of the 15q syndrome and investigate the contribution of 15q duplication to increased autism risk. All subjects were recruited solely on the basis of interstitial duplication 15q11.2-q13 status. Comparative array genome hybridization was used to determine the duplication size and boundaries while the methylation status of the maternally methylated small nuclear ribonucleoprotein polypeptide N gene was used to determine the parent of origin of the duplication. We determined the duplication size and parental origin for 14 int dup(15) subjects: 10 maternal and 4 paternal cases. The majority of int dup(15) cases recruited were maternal in origin, most likely due to our finding that maternal duplication was coincident with autism spectrum disorder. The size of the duplication did not correlate with the severity of the phenotype as established by Autism Diagnostic Observation Scale calibrated severity score. We identified phenotypes not comprehensively described before in this cohort including mild facial dysmorphism, sleep problems and an unusual electroencephalogram variant. Our results are consistent with the hypothesis that the maternally expressed ubiquitin protein ligase E3A gene is primarily responsible for the autism phenotype in int dup(15) since all maternal cases tested presented on the autism spectrum. PMID:23495136

  3. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members

    PubMed Central

    Yan, Xiping; Wang, Guosong; Liu, Hehe; Gan, Xiang; Zhang, Tao; Wang, Jiwen; Li, Liang

    2015-01-01

    Peroxisome proliferators-activated receptor (PPAR) gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors) domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors) are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3′ UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3′ UTR are essential for PPARs evolution and diversity functions acquired. PMID:25961030

  4. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members.

    PubMed

    Zhou, Tianyu; Yan, Xiping; Wang, Guosong; Liu, Hehe; Gan, Xiang; Zhang, Tao; Wang, Jiwen; Li, Liang

    2015-01-01

    Peroxisome proliferators-activated receptor (PPAR) gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors) domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors) are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3' UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3' UTR are essential for PPARs evolution and diversity functions acquired.

  5. Lineage-specific expansion and loss of tyrosinase genes across platyhelminths and their induction profiles in the carcinogenic oriental liver fluke, Clonorchis sinensis.

    PubMed

    Kim, Seon-Hee; Bae, Young-An

    2017-09-01

    Tyrosinase provides an essential activity during egg production in diverse platyhelminths by mediating sclerotization of eggshells. In this study, we investigated the genomic and evolutionary features of tyrosinases in parasitic platyhelminths whose genomic information is available. A pair of paralogous tyrosinases was detected in most trematodes, whereas they were lost in cyclophyllidean cestodes. A pseudophyllidean cestode displaying egg biology similar to that of trematodes possessed an orthologous gene. Interestingly, one of the paralogous tyrosinases appeared to have been multiplied into three copies in Clonorchis sinensis and Opisthorchis viverrini. In addition, a fifth tyrosinase gene that was minimally transcribed through all developmental stages was further detected in these opisthorchiid genomes. Phylogenetic analyses demonstrated that the tyrosinase gene has undergone duplication at least three times in platyhelminths. The additional opisthorchiid gene arose from the first duplication. A paralogous copy generated from these gene duplications, except for the last one, seemed to be lost in the major neodermatans lineages. In C. sinensis, tyrosinase gene expressions were initiated following sexual maturation and the levels were significantly enhanced by the presence of O2 and bile. Taken together, our data suggest that tyrosinase has evolved lineage-specifically across platyhelminths related to its copy number and induction mechanism.

  6. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

    PubMed

    Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

    2015-02-01

    WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during fiber development process. In conclusion, this study provides important information on the evolution and function of WRKY gene family in cotton species.

  7. Genome-wide survey and characterization of the WRKY gene family in Populus trichocarpa.

    PubMed

    He, Hongsheng; Dong, Qing; Shao, Yuanhua; Jiang, Haiyang; Zhu, Suwen; Cheng, Beijiu; Xiang, Yan

    2012-07-01

    WRKY transcription factors participate in diverse physiological and developmental processes in plants. They have highly conserved WRKYGQK amino acid sequences in their N-termini, followed by the novel zinc-finger-like motifs, Cys₂His₂ or Cys₂HisCys. To date, numerous WRKY genes have been identified and characterized in a number of herbaceous species. Survey and characterization of WRKY genes in a ligneous species would facilitate a better understanding of the evolutionary processes and functions of this gene family. In this study, 104 poplar WRKY genes (PtWRKY) were identified in the latest poplar genome sequence. According to their structural features, the predicted members were divided into the previously defined groups I-III, as described in rice. In addition, chromosomal localization of the genes demonstrated that there might be WRKY gene hot spots in 2.3 Mb regions on chromosome 14. Furthermore, approximately 83% (86 out of 104) WRKY genes participated in gene duplication events, including 69% (29 out of 42) gene pairs which exhibited segmental duplication. Using semi-quantitative RT-PCR, the expression patterns of subgroup III genes were investigated under different stresses [cold, drought, salinity and salicylic acid (SA)]. The data revealed that these genes presented different expression levels in response to various stress conditions. Expression analysis exhibited PtWRKY76 gene induced markedly in 0.1 mM SA or 25% PEG-6000 treatment. The results presented here provide a fundamental clue for cloning specific function genes in further studies and applications. This study identified 104 poplar WRKY genes and demonstrated WRKY gene hot spots on chromosome 14. Furthermore, semi-quantitative RT-PCR showed variable stress responses in subgroup III.

  8. FGFR3 gene mutation plus GRB10 gene duplication in a patient with achondroplasia plus growth delay with prenatal onset.

    PubMed

    Yuan, Haiming; Huang, Linhuan; Hu, Xizi; Li, Qian; Sun, Xiaofang; Xie, Yingjun; Kong, Shu; Wang, Xiaoman

    2016-07-02

    Achondroplasia is a well-defined and common bone dysplasia. Genotype- and phenotype-level correlations have been found between the clinical symptoms of achondroplasia and achondroplasia-specific FGFR3 mutations. A 2-year-old boy with clinical features consistent with achondroplasia and Silver-Russell syndrome-like symptoms was found to carry a mutation in the fibroblast growth factor receptor-3 (FGFR3) gene at c.1138G > A (p.Gly380Arg) and a de novo 574 kb duplication at chromosome 7p12.1 that involved the entire growth-factor receptor bound protein 10 (GRB10) gene. Using quantitative real-time PCR analysis, GRB10 was over-expressed, and, using enzyme-linked immunosorbent assays for IGF1 and IGF-binding protein-3 (IGFBP3), we found that IGF1 and IGFBP3 were low-expressed in this patient. We demonstrate that a combination of uncommon, rare and exceptional molecular defects related to the molecular bases of particular birth defects can be analyzed and diagnosed to potentially explain the observed variability in the combination of molecular defects.

  9. Sexual Dimorphism of Body Size Is Controlled by Dosage of the X-Chromosomal Gene Myc and by the Sex-Determining Gene tra in Drosophila.

    PubMed

    Mathews, Kristina Wehr; Cavegn, Margrith; Zwicky, Monica

    2017-03-01

    Drosophila females are larger than males. In this article, we describe how X -chromosome dosage drives sexual dimorphism of body size through two means: first, through unbalanced expression of a key X -linked growth-regulating gene, and second, through female-specific activation of the sex-determination pathway. X -chromosome dosage determines phenotypic sex by regulating the genes of the sex-determining pathway. In the presence of two sets of X -chromosome signal elements (XSEs), Sex-lethal ( Sxl ) is activated in female ( XX ) but not male ( XY ) animals. Sxl activates transformer ( tra ), a gene that encodes a splicing factor essential for female-specific development. It has previously been shown that null mutations in the tra gene result in only a partial reduction of body size of XX animals, which shows that other factors must contribute to size determination. We tested whether X dosage directly affects animal size by analyzing males with duplications of X -chromosomal segments. Upon tiling across the X chromosome, we found four duplications that increase male size by >9%. Within these, we identified several genes that promote growth as a result of duplication. Only one of these, Myc , was found not to be dosage compensated. Together, our results indicate that both Myc dosage and tra expression play crucial roles in determining sex-specific size in Drosophila larvae and adult tissue. Since Myc also acts as an XSE that contributes to tra activation in early development, a double dose of Myc in females serves at least twice in development to promote sexual size dimorphism. Copyright © 2017 by the Genetics Society of America.

  10. Two Functional Copies of the DGCR6 Gene Are Present on Human Chromosome 22q11 Due to a Duplication of an Ancestral Locus

    PubMed Central

    Edelmann, Lisa; Stankiewicz, Pavel; Spiteri, Elizabeth; Pandita, Raj K.; Shaffer, Lisa; Lupski, James; Morrow, Bernice E.

    2001-01-01

    The DGCR6 (DiGeorge critical region) gene encodes a putative protein with sequence similarity to gonadal (gdl), a Drosophila melanogaster gene of unknown function. We mapped the DGCR6 gene to chromosome 22q11 within a low copy repeat, termed sc11.1a, and identified a second copy of the gene, DGCR6L, within the duplicate locus, termed sc11.1b. Both sc11.1 repeats are deleted in most persons with velo-cardio-facial syndrome/DiGeorge syndrome (VCFS/DGS), and they map immediately adjacent and internal to the low copy repeats, termed LCR22, that mediate the deletions associated with VCFS/DGS. We sequenced genomic clones from both loci and determined that the putative initiator methionine is located further upstream than originally described, but in a position similar to the mouse and chicken orthologs. DGCR6L encodes a highly homologous, functional copy of DGCR6, with some base changes rendering amino acid differences. Expression studies of the two genes indicate that both genes are widely expressed in fetal and adult tissues. Evolutionary studies using FISH mapping in several different species of ape combined with sequence analysis of DGCR6 in a number of different primate species indicate that the duplication is at least 12 million years old and may date back to before the divergence of Catarrhines from Platyrrhines, 35 mya. These data suggest that there has been selective evolutionary pressure toward the functional maintenance of both paralogs. Interestingly, a full-length HERV-K provirus integrated into the sc11.1a locus after the divergence of chimpanzees and humans. PMID:11157784

  11. Gene Duplication and the Evolution of Hemoglobin Isoform Differentiation in Birds*

    PubMed Central

    Grispo, Michael T.; Natarajan, Chandrasekhar; Projecto-Garcia, Joana; Moriyama, Hideaki; Weber, Roy E.; Storz, Jay F.

    2012-01-01

    The majority of bird species co-express two functionally distinct hemoglobin (Hb) isoforms in definitive erythrocytes as follows: HbA (the major adult Hb isoform, with α-chain subunits encoded by the αA-globin gene) and HbD (the minor adult Hb isoform, with α-chain subunits encoded by the αD-globin gene). The αD-globin gene originated via tandem duplication of an embryonic α-like globin gene in the stem lineage of tetrapod vertebrates, which suggests the possibility that functional differentiation between the HbA and HbD isoforms may be attributable to a retained ancestral character state in HbD that harkens back to a primordial, embryonic function. To investigate this possibility, we conducted a combined analysis of protein biochemistry and sequence evolution to characterize the structural and functional basis of Hb isoform differentiation in birds. Functional experiments involving purified HbA and HbD isoforms from 11 different bird species revealed that HbD is characterized by a consistently higher O2 affinity in the presence of allosteric effectors such as organic phosphates and Cl− ions. In the case of both HbA and HbD, analyses of oxygenation properties under the two-state Monod-Wyman-Changeux allosteric model revealed that the pH dependence of Hb-O2 affinity stems primarily from changes in the O2 association constant of deoxy (T-state)-Hb. Ancestral sequence reconstructions revealed that the amino acid substitutions that distinguish the adult-expressed Hb isoforms are not attributable to the retention of an ancestral (pre-duplication) character state in the αD-globin gene that is shared with the embryonic α-like globin gene. PMID:22962007

  12. Gene duplication and the evolution of hemoglobin isoform differentiation in birds.

    PubMed

    Grispo, Michael T; Natarajan, Chandrasekhar; Projecto-Garcia, Joana; Moriyama, Hideaki; Weber, Roy E; Storz, Jay F

    2012-11-02

    The majority of bird species co-express two functionally distinct hemoglobin (Hb) isoforms in definitive erythrocytes as follows: HbA (the major adult Hb isoform, with α-chain subunits encoded by the α(A)-globin gene) and HbD (the minor adult Hb isoform, with α-chain subunits encoded by the α(D)-globin gene). The α(D)-globin gene originated via tandem duplication of an embryonic α-like globin gene in the stem lineage of tetrapod vertebrates, which suggests the possibility that functional differentiation between the HbA and HbD isoforms may be attributable to a retained ancestral character state in HbD that harkens back to a primordial, embryonic function. To investigate this possibility, we conducted a combined analysis of protein biochemistry and sequence evolution to characterize the structural and functional basis of Hb isoform differentiation in birds. Functional experiments involving purified HbA and HbD isoforms from 11 different bird species revealed that HbD is characterized by a consistently higher O(2) affinity in the presence of allosteric effectors such as organic phosphates and Cl(-) ions. In the case of both HbA and HbD, analyses of oxygenation properties under the two-state Monod-Wyman-Changeux allosteric model revealed that the pH dependence of Hb-O(2) affinity stems primarily from changes in the O(2) association constant of deoxy (T-state)-Hb. Ancestral sequence reconstructions revealed that the amino acid substitutions that distinguish the adult-expressed Hb isoforms are not attributable to the retention of an ancestral (pre-duplication) character state in the α(D)-globin gene that is shared with the embryonic α-like globin gene.

  13. Evolutionary changes in the notochord genetic toolkit: a comparative analysis of notochord genes in the ascidian Ciona and the larvacean Oikopleura.

    PubMed

    Kugler, Jamie E; Kerner, Pierre; Bouquet, Jean-Marie; Jiang, Di; Di Gregorio, Anna

    2011-01-20

    The notochord is a defining feature of the chordate clade, and invertebrate chordates, such as tunicates, are uniquely suited for studies of this structure. Here we used a well-characterized set of 50 notochord genes known to be targets of the notochord-specific Brachyury transcription factor in one tunicate, Ciona intestinalis (Class Ascidiacea), to begin determining whether the same genetic toolkit is employed to build the notochord in another tunicate, Oikopleura dioica (Class Larvacea). We identified Oikopleura orthologs of the Ciona notochord genes, as well as lineage-specific duplicates for which we determined the phylogenetic relationships with related genes from other chordates, and we analyzed their expression patterns in Oikopleura embryos. Of the 50 Ciona notochord genes that were used as a reference, only 26 had clearly identifiable orthologs in Oikopleura. Two of these conserved genes appeared to have undergone Oikopleura- and/or tunicate-specific duplications, and one was present in three copies in Oikopleura, thus bringing the number of genes to test to 30. We were able to clone and test 28 of these genes. Thirteen of the 28 Oikopleura orthologs of Ciona notochord genes showed clear expression in all or in part of the Oikopleura notochord, seven were diffusely expressed throughout the tail, six were expressed in tissues other than the notochord, while two probes did not provide a detectable signal at any of the stages analyzed. One of the notochord genes identified, Oikopleura netrin, was found to be unevenly expressed in notochord cells, in a pattern reminiscent of that previously observed for one of the Oikopleura Hox genes. A surprisingly high number of Ciona notochord genes do not have apparent counterparts in Oikopleura, and only a fraction of the evolutionarily conserved genes show clear notochord expression. This suggests that Ciona and Oikopleura, despite the morphological similarities of their notochords, have developed rather divergent sets of notochord genes after their split from a common tunicate ancestor. This study demonstrates that comparisons between divergent tunicates can lead to insights into the basic complement of genes sufficient for notochord development, and elucidate the constraints that control its composition.

  14. Evolutionary changes in the notochord genetic toolkit: a comparative analysis of notochord genes in the ascidian Ciona and the larvacean Oikopleura

    PubMed Central

    2011-01-01

    Background The notochord is a defining feature of the chordate clade, and invertebrate chordates, such as tunicates, are uniquely suited for studies of this structure. Here we used a well-characterized set of 50 notochord genes known to be targets of the notochord-specific Brachyury transcription factor in one tunicate, Ciona intestinalis (Class Ascidiacea), to begin determining whether the same genetic toolkit is employed to build the notochord in another tunicate, Oikopleura dioica (Class Larvacea). We identified Oikopleura orthologs of the Ciona notochord genes, as well as lineage-specific duplicates for which we determined the phylogenetic relationships with related genes from other chordates, and we analyzed their expression patterns in Oikopleura embryos. Results Of the 50 Ciona notochord genes that were used as a reference, only 26 had clearly identifiable orthologs in Oikopleura. Two of these conserved genes appeared to have undergone Oikopleura- and/or tunicate-specific duplications, and one was present in three copies in Oikopleura, thus bringing the number of genes to test to 30. We were able to clone and test 28 of these genes. Thirteen of the 28 Oikopleura orthologs of Ciona notochord genes showed clear expression in all or in part of the Oikopleura notochord, seven were diffusely expressed throughout the tail, six were expressed in tissues other than the notochord, while two probes did not provide a detectable signal at any of the stages analyzed. One of the notochord genes identified, Oikopleura netrin, was found to be unevenly expressed in notochord cells, in a pattern reminiscent of that previously observed for one of the Oikopleura Hox genes. Conclusions A surprisingly high number of Ciona notochord genes do not have apparent counterparts in Oikopleura, and only a fraction of the evolutionarily conserved genes show clear notochord expression. This suggests that Ciona and Oikopleura, despite the morphological similarities of their notochords, have developed rather divergent sets of notochord genes after their split from a common tunicate ancestor. This study demonstrates that comparisons between divergent tunicates can lead to insights into the basic complement of genes sufficient for notochord development, and elucidate the constraints that control its composition. PMID:21251251

  15. Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution.

    PubMed

    Gu, Xun; Wang, Yufeng; Gu, Jianying

    2002-06-01

    The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome.

  16. Detecting long tandem duplications in genomic sequences.

    PubMed

    Audemard, Eric; Schiex, Thomas; Faraut, Thomas

    2012-05-08

    Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication. In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS  <  1) and that it is also able to predict tandem duplications involving non coding elements such as pseudo-genes or RNA genes. ReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.

  17. Evolution of the PEBP gene family in plants: functional diversification in seed plant evolution.

    PubMed

    Karlgren, Anna; Gyllenstrand, Niclas; Källman, Thomas; Sundström, Jens F; Moore, David; Lascoux, Martin; Lagercrantz, Ulf

    2011-08-01

    The phosphatidyl ethanolamine-binding protein (PEBP) gene family is present in all eukaryote kingdoms, with three subfamilies identified in angiosperms (FLOWERING LOCUS T [FT], MOTHER OF FT AND TFL1 [MFT], and TERMINAL FLOWER1 [TFL1] like). In angiosperms, PEBP genes have been shown to function both as promoters and suppressors of flowering and to control plant architecture. In this study, we focus on previously uncharacterized PEBP genes from gymnosperms. Extensive database searches suggest that gymnosperms possess only two types of PEBP genes, MFT-like and a group that occupies an intermediate phylogenetic position between the FT-like and TFL1-like (FT/TFL1-like). Overexpression of Picea abies PEBP genes in Arabidopsis (Arabidopsis thaliana) suggests that the FT/TFL1-like genes (PaFTL1 and PaFTL2) code for proteins with a TFL1-like function. However, PaFTL1 and PaFTL2 also show highly divergent expression patterns. While the expression of PaFTL2 is correlated with annual growth rhythm and mainly confined to needles and vegetative and reproductive buds, the expression of PaFTL1 is largely restricted to microsporophylls of male cones. The P. abies MFT-like genes (PaMFT1 and PaMFT2) show a predominant expression during embryo development, a pattern that is also found for many MFT-like genes from angiosperms. P. abies PEBP gene expression is primarily detected in tissues undergoing physiological changes related to growth arrest and dormancy. A first duplication event resulting in two families of plant PEBP genes (MFT-like and FT/TFL1-like) seems to coincide with the evolution of seed plants, in which independent control of bud and seed dormancy was required, and the second duplication resulting in the FT-like and TFL1-like clades probably coincided with the evolution of angiosperms.

  18. Autopolyploidy genome duplication preserves other ancient genome duplications in Atlantic salmon (Salmo salar).

    PubMed

    Christensen, Kris A; Davidson, William S

    2017-01-01

    Salmonids (e.g. Atlantic salmon, Pacific salmon, and trouts) have a long legacy of genome duplication. In addition to three ancient genome duplications that all teleosts are thought to share, salmonids have had one additional genome duplication. We explored a methodology for untangling these duplications from each other to better understand them in Atlantic salmon. In this methodology, homeologous regions (paralogous/duplicated genomic regions originating from a whole genome duplication) from the most recent genome duplication were assumed to have duplicated genes at greater density and have greater sequence similarity. This assumption was used to differentiate duplicated gene pairs in Atlantic salmon that are either from the most recent genome duplication or from earlier duplications. From a comparison with multiple vertebrate species, it is clear that Atlantic salmon have retained more duplicated genes from ancient genome duplications than other vertebrates--often at higher density in the genome and containing fewer synonymous mutations. It may be that polysomic inheritance is the mechanism responsible for maintaining ancient gene duplicates in salmonids. Polysomic inheritance (when multiple chromosomes pair during meiosis) is thought to be relatively common in salmonids compared to other vertebrate species. These findings illuminate how genome duplications may not only increase the number of duplicated genes, but may also be involved in the maintenance of them from previous genome duplications as well.

  19. MECP2 duplications in six patients with complex sex chromosome rearrangements

    PubMed Central

    Breman, Amy M; Ramocki, Melissa B; Kang, Sung-Hae L; Williams, Misti; Freedenberg, Debra; Patel, Ankita; Bader, Patricia I; Cheung, Sau Wai

    2011-01-01

    Duplications of the Xq28 chromosome region resulting in functional disomy are associated with a distinct clinical phenotype characterized by infantile hypotonia, severe developmental delay, progressive neurological impairment, absent speech, and proneness to infections. Increased expression of the dosage-sensitive MECP2 gene is considered responsible for the severe neurological impairments observed in affected individuals. Although cytogenetically visible duplications of Xq28 are well documented in the published literature, recent advances using array comparative genomic hybridization (CGH) led to the detection of an increasing number of microduplications spanning MECP2. In rare cases, duplication results from intrachromosomal rearrangement between the X and Y chromosomes. We report six cases with sex chromosome rearrangements involving duplication of MECP2. Cases 1–4 are unbalanced rearrangements between X and Y, resulting in MECP2 duplication. The additional Xq material was translocated to Yp in three cases (cases 1–3), and to the heterochromatic region of Yq12 in one case (case 4). Cases 5 and 6 were identified by array CGH to have a loss in copy number at Xp and a gain in copy number at Xq28 involving the MECP2 gene. In both cases, fluorescent in situ hybridization (FISH) analysis revealed a recombinant X chromosome containing the duplicated material from Xq28 on Xp, resulting from a maternal pericentric inversion. These cases add to a growing number of MECP2 duplications that have been detected by array CGH, while demonstrating the value of confirmatory chromosome and FISH studies for the localization of the duplicated material and the identification of complex rearrangements. PMID:21119712

  20. Male sex in houseflies is determined by Mdmd, a paralog of the generic splice factor gene CWC22.

    PubMed

    Sharma, Akash; Heinze, Svenia D; Wu, Yanli; Kohlbrenner, Tea; Morilla, Ian; Brunner, Claudia; Wimmer, Ernst A; van de Zande, Louis; Robinson, Mark D; Beukeboom, Leo W; Bopp, Daniel

    2017-05-12

    Across species, animals have diverse sex determination pathways, each consisting of a hierarchical cascade of genes and its associated regulatory mechanism. Houseflies have a distinctive polymorphic sex determination system in which a dominant male determiner, the M-factor, can reside on any of the chromosomes. We identified a gene, Musca domestica male determiner ( Mdmd ), as the M-factor. Mdmd originated from a duplication of the spliceosomal factor gene CWC22 ( nucampholin ). Targeted Mdmd disruption results in complete sex reversal to fertile females because of a shift from male to female expression of the downstream genes transformer and doublesex The presence of Mdmd on different chromosomes indicates that Mdmd translocated to different genomic sites. Thus, an instructive signal in sex determination can arise by duplication and neofunctionalization of an essential splicing regulator. Copyright © 2017, American Association for the Advancement of Science.

  1. Genome-wide characterization of the β-1,3-glucanase gene family in Gossypium by comparative analysis

    PubMed Central

    Xu, Xiaoyang; Feng, Yue; Fang, Shuai; Xu, Jun; Wang, Xinyu; Guo, Wangzhen

    2016-01-01

    The β-1,3-glucanase gene family is involved in a wide range of plant developmental processes as well as pathogen defense mechanisms. Comprehensive analyses of β-1,3-glucanase genes (GLUs) have not been reported in cotton. Here, we identified 67, 68, 130 and 158 GLUs in four sequenced cotton species, G. raimondii (D5), G. arboreum (A2), G. hirsutum acc. TM-1 (AD1), and G. barbadense acc. 3–79 (AD2), respectively. Cotton GLUs can be classified into the eight subfamilies (A–H), and their protein domain architecture and intron/exon structure are relatively conserved within each subfamily. Sixty-seven GLUs in G. raimondii were anchored onto 13 chromosomes, with 27 genes involved in segmental duplications, and 13 in tandem duplications. Expression patterns showed highly developmental and spatial regulation of GLUs in TM-1. In particular, the expression of individual member of GLUs in subfamily E was limited to roots, leaves, floral organs or fibers. Members of subfamily E also showed more protein evolution and subgenome expression bias compared with members of other subfamilies. We clarified that GLU42 and GLU43 in subfamily E were preferentially expressed in root and leaf tissues and significantly upregulated after Verticillium dahliae inoculation. Silencing of GLU42 and GLU43 significantly increased the susceptibility of cotton to V. dahliae. PMID:27353015

  2. New genes from old: asymmetric divergence of gene duplicates and the evolution of development.

    PubMed

    Holland, Peter W H; Marlétaz, Ferdinand; Maeso, Ignacio; Dunwell, Thomas L; Paps, Jordi

    2017-02-05

    Gene duplications and gene losses have been frequent events in the evolution of animal genomes, with the balance between these two dynamic processes contributing to major differences in gene number between species. After gene duplication, it is common for both daughter genes to accumulate sequence change at approximately equal rates. In some cases, however, the accumulation of sequence change is highly uneven with one copy radically diverging from its paralogue. Such 'asymmetric evolution' seems commoner after tandem gene duplication than after whole-genome duplication, and can generate substantially novel genes. We describe examples of asymmetric evolution in duplicated homeobox genes of moths, molluscs and mammals, in each case generating new homeobox genes that were recruited to novel developmental roles. The prevalence of asymmetric divergence of gene duplicates has been underappreciated, in part, because the origin of highly divergent genes can be difficult to resolve using standard phylogenetic methods.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).

  3. Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates

    PubMed Central

    Kikuta, Hiroshi; Laplante, Mary; Navratilova, Pavla; Komisarczuk, Anna Z.; Engström, Pär G.; Fredman, David; Akalin, Altuna; Caccamo, Mario; Sealy, Ian; Howe, Kerstin; Ghislain, Julien; Pezeron, Guillaume; Mourrain, Philippe; Ellingsen, Staale; Oates, Andrew C.; Thisse, Christine; Thisse, Bernard; Foucher, Isabelle; Adolf, Birgit; Geling, Andrea; Lenhard, Boris; Becker, Thomas S.

    2007-01-01

    We report evidence for a mechanism for the maintenance of long-range conserved synteny across vertebrate genomes. We found the largest mammal-teleost conserved chromosomal segments to be spanned by highly conserved noncoding elements (HCNEs), their developmental regulatory target genes, and phylogenetically and functionally unrelated “bystander” genes. Bystander genes are not specifically under the control of the regulatory elements that drive the target genes and are expressed in patterns that are different from those of the target genes. Reporter insertions distal to zebrafish developmental regulatory genes pax6.1/2, rx3, id1, and fgf8 and miRNA genes mirn9-1 and mirn9-5 recapitulate the expression patterns of these genes even if located inside or beyond bystander genes, suggesting that the regulatory domain of a developmental regulatory gene can extend into and beyond adjacent transcriptional units. We termed these chromosomal segments genomic regulatory blocks (GRBs). After whole genome duplication in teleosts, GRBs, including HCNEs and target genes, were often maintained in both copies, while bystander genes were typically lost from one GRB, strongly suggesting that evolutionary pressure acts to keep the single-copy GRBs of higher vertebrates intact. We show that loss of bystander genes and other mutational events suffered by duplicated GRBs in teleost genomes permits target gene identification and HCNE/target gene assignment. These findings explain the absence of evolutionary breakpoints from large vertebrate chromosomal segments and will aid in the recognition of position effect mutations within human GRBs. PMID:17387144

  4. Characterization of DNA methyltransferase and demethylase genes in Fragaria vesca.

    PubMed

    Gu, Tingting; Ren, Shuai; Wang, Yuanhua; Han, Yuhui; Li, Yi

    2016-06-01

    DNA methylation is an epigenetic modification essential for gene regulations in plants, but understanding on how it is involved in fruit development, especially in non-climacteric fleshy fruit, is limited. The diploid woodland strawberry (Fragaria vesca) is an important model for non-climacteric fruit crops. In this study, we identified DNA methyltransferase genes and demethylase genes in Fragaria vesca and other angiosperm species. In accordance with previous studies, our phylogenetic analyses of those DNA methylation modifiers support the clustering of those genes into several classes. Our data indicate that whole-genome duplications and tandem duplications contributed to the expansion of those DNA methylation modifiers in angiosperms. We have further demonstrated that some DNA methylase and demethylase genes reach their highest expression levels in strawberry fleshy fruits when turning from white to red, suggesting that DNA methylation might undergo a dramatic change at the onset of fleshy fruit-ripening process. In addition, we have observed that expression of some DNA demethylase genes increases in response to various abiotic stresses including heat, cold, drought and salinity. Collectively, our study indicates a regulatory role of DNA methylation in the turning stage of non-climacteric fleshy fruit and responses to environment stimuli, and would facilitate functional studies of DNA methylation in the growth and development of non-climacteric fruits.

  5. Genetic diagnosis of Duchenne and Becker muscular dystrophy using next-generation sequencing technology: comprehensive mutational search in a single platform.

    PubMed

    Lim, Byung Chan; Lee, Seungbok; Shin, Jong-Yeon; Kim, Jong-Il; Hwang, Hee; Kim, Ki Joong; Hwang, Yong Seung; Seo, Jeong-Sun; Chae, Jong Hee

    2011-11-01

    Duchenne muscular dystrophy or Becker muscular dystrophy might be a suitable candidate disease for application of next-generation sequencing in the genetic diagnosis because the complex mutational spectrum and the large size of the dystrophin gene require two or more analytical methods and have a high cost. The authors tested whether large deletions/duplications or small mutations, such as point mutations or short insertions/deletions of the dystrophin gene, could be predicted accurately in a single platform using next-generation sequencing technology. A custom solution-based target enrichment kit was designed to capture whole genomic regions of the dystrophin gene and other muscular-dystrophy-related genes. A multiplexing strategy, wherein four differently bar-coded samples were captured and sequenced together in a single lane of the Illumina Genome Analyser, was applied. The study subjects were 25 16 with deficient dystrophin expression without a large deletion/duplication and 9 with a known large deletion/duplication. Nearly 100% of the exonic region of the dystrophin gene was covered by at least eight reads with a mean read depth of 107. Pathogenic small mutations were identified in 15 of the 16 patients without a large deletion/duplication. Using these 16 patients as the standard, the authors' method accurately predicted the deleted or duplicated exons in the 9 patients with known mutations. Inclusion of non-coding regions and paired-end sequence analysis enabled accurate identification by increasing the read depth and providing information about the breakpoint junction. The current method has an advantage for the genetic diagnosis of Duchenne muscular dystrophy and Becker muscular dystrophy wherein a comprehensive mutational search may be feasible using a single platform.

  6. Consensus properties and their large-scale applications for the gene duplication problem.

    PubMed

    Moon, Jucheol; Lin, Harris T; Eulenstein, Oliver

    2016-06-01

    Solving the gene duplication problem is a classical approach for species tree inference from gene trees that are confounded by gene duplications. This problem takes a collection of gene trees and seeks a species tree that implies the minimum number of gene duplications. Wilkinson et al. posed the conjecture that the gene duplication problem satisfies the desirable Pareto property for clusters. That is, for every instance of the problem, all clusters that are commonly present in the input gene trees of this instance, called strict consensus, will also be found in every solution to this instance. We prove that this conjecture does not generally hold. Despite this negative result we show that the gene duplication problem satisfies a weaker version of the Pareto property where the strict consensus is found in at least one solution (rather than all solutions). This weaker property contributes to our design of an efficient scalable algorithm for the gene duplication problem. We demonstrate the performance of our algorithm in analyzing large-scale empirical datasets. Finally, we utilize the algorithm to evaluate the accuracy of standard heuristics for the gene duplication problem using simulated datasets.

  7. Dact genes are chordate specific regulators at the intersection of Wnt and Tgf-β signaling pathways.

    PubMed

    Schubert, Frank Richard; Sobreira, Débora Rodrigues; Janousek, Ricardo Guerreiro; Alvares, Lúcia Elvira; Dietrich, Susanne

    2014-08-06

    Dacts are multi-domain adaptor proteins. They have been implicated in Wnt and Tgfβ signaling and serve as a nodal point in regulating many cellular activities. Dact genes have so far only been identified in bony vertebrates. Also, the number of Dact genes in a given species, the number and roles of protein motifs and functional domains, and the overlap of gene expression domains are all not clear. To address these problems, we have taken an evolutionary approach, screening for Dact genes in the animal kingdom and establishing their phylogeny and the synteny of Dact loci. Furthermore, we performed a deep analysis of the various Dact protein motifs and compared the expression patterns of different Dacts. Our study identified previously not recognized dact genes and showed that they evolved late in the deuterostome lineage. In gnathostomes, four Dact genes were generated by the two rounds of whole genome duplication in the vertebrate ancestor, with Dact1/3 and Dact2/4, respectively, arising from the two genes generated during the first genome duplication. In actinopterygians, a further dact4r gene arose from retrotranscription. The third genome duplication in the teleost ancestor, and subsequent gene loss in most gnathostome lineages left extant species with a subset of Dact genes. The distribution of functional domains suggests that the ancestral Dact function lied with Wnt signaling, and a role in Tgfβ signaling may have emerged with the Dact2/4 ancestor. Motif reduction, in particular in Dact4, suggests that this protein may counteract the function of the other Dacts. Dact genes were expressed in both distinct and overlapping domains, suggesting possible combinatorial function. The gnathostome Dact gene family comprises four members, derived from a chordate-specific ancestor. The ability to control Wnt signaling seems to be part of the ancestral repertoire of Dact functions, while the ability to inhibit Tgfβ signaling and to carry out specialized, ortholog-specific roles may have evolved later. The complement of Dact genes coexpressed in a tissue provides a complex way to fine-tune Wnt and Tgfβ signaling. Our work provides the basis for future structural and functional studies aimed at unraveling intracellular regulatory networks.

  8. PTGBase: an integrated database to study tandem duplicated genes in plants.

    PubMed

    Yu, Jingyin; Ke, Tao; Tehrim, Sadia; Sun, Fengming; Liao, Boshou; Hua, Wei

    2015-01-01

    Tandem duplication is a wide-spread phenomenon in plant genomes and plays significant roles in evolution and adaptation to changing environments. Tandem duplicated genes related to certain functions will lead to the expansion of gene families and bring increase of gene dosage in the form of gene cluster arrays. Many tandem duplication events have been studied in plant genomes; yet, there is a surprising shortage of efforts to systematically present the integration of large amounts of information about publicly deposited tandem duplicated gene data across the plant kingdom. To address this shortcoming, we developed the first plant tandem duplicated genes database, PTGBase. It delivers the most comprehensive resource available to date, spanning 39 plant genomes, including model species and newly sequenced species alike. Across these genomes, 54 130 tandem duplicated gene clusters (129 652 genes) are presented in the database. Each tandem array, as well as its member genes, is characterized in complete detail. Tandem duplicated genes in PTGBase can be explored through browsing or searching by identifiers or keywords of functional annotation and sequence similarity. Users can download tandem duplicated gene arrays easily to any scale, up to the complete annotation data set for an entire plant genome. PTGBase will be updated regularly with newly sequenced plant species as they become available. © The Author(s) 2015. Published by Oxford University Press.

  9. Genome-Wide Identification, Characterization, and Expression Profiling of Glutathione S-Transferase (GST) Family in Pumpkin Reveals Likely Role in Cold-Stress Tolerance

    PubMed Central

    Abdul Kayum, Md.; Nath, Ujjal Kumar; Park, Jong-In; Choi, Eung Kyoo; Song, Jae-Young; Kim, Hoy-Taek; Nou, Ill-Sup

    2018-01-01

    Plant growth and development can be adversely affected by cold stress, limiting productivity. The glutathione S-transferase (GST) family comprises important detoxifying enzymes, which play major roles in biotic and abiotic stress responses by reducing the oxidative damage caused by reactive oxygen species. Pumpkins (Cucurbita maxima) are widely grown, economically important, and nutritious; however, their yield can be severely affected by cold stress. The identification of putative candidate genes responsible for cold-stress tolerance, including the GST family genes, is therefore vital. For the first time, we identified 32 C. maxima GST (CmaGST) genes using a combination of bioinformatics approaches and characterized them by expression profiling. These CmaGST genes represent seven of the 14 known classes of plant GSTs, with 18 CmaGSTs categorized into the tau class. The CmaGSTs were distributed across 13 of pumpkin’s 20 chromosomes, with the highest numbers found on chromosomes 4 and 6. The large number of CmaGST genes resulted from gene duplication; 11 and 5 pairs of CmaGST genes were segmental- and tandem-duplicated, respectively. In addition, all CmaGST genes showed organ-specific expression. The expression of the putative GST genes in pumpkin was examined under cold stress in two lines with contrasting cold tolerance: cold-tolerant CP-1 (C. maxima) and cold-susceptible EP-1 (Cucurbita moschata). Seven genes (CmaGSTU3, CmaGSTU7, CmaGSTU8, CmaGSTU9, CmaGSTU11, CmaGSTU12, and CmaGSTU14) were highly expressed in the cold-tolerant line and are putative candidates for use in breeding cold-tolerant crop varieties. These results increase our understanding of the cold-stress-related functions of the GST family, as well as potentially enhancing pumpkin breeding programs. PMID:29439434

  10. Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database.

    PubMed

    Pasquier, Jeremy; Cabau, Cédric; Nguyen, Thaovi; Jouanno, Elodie; Severac, Dany; Braasch, Ingo; Journot, Laurent; Pontarotti, Pierre; Klopp, Christophe; Postlethwait, John H; Guiguen, Yann; Bobe, Julien

    2016-05-18

    With more than 30,000 species, ray-finned fish represent approximately half of vertebrates. The evolution of ray-finned fish was impacted by several whole genome duplication (WGD) events including a teleost-specific WGD event (TGD) that occurred at the root of the teleost lineage about 350 million years ago (Mya) and more recent WGD events in salmonids, carps, suckers and others. In plants and animals, WGD events are associated with adaptive radiations and evolutionary innovations. WGD-spurred innovation may be especially relevant in the case of teleost fish, which colonized a wide diversity of habitats on earth, including many extreme environments. Fish biodiversity, the use of fish models for human medicine and ecological studies, and the importance of fish in human nutrition, fuel an important need for the characterization of gene expression repertoires and corresponding evolutionary histories of ray-finned fish genes. To this aim, we performed transcriptome analyses and developed the PhyloFish database to provide (i) de novo assembled gene repertoires in 23 different ray-finned fish species including two holosteans (i.e. a group that diverged from teleosts before TGD) and 21 teleosts (including six salmonids), and (ii) gene expression levels in ten different tissues and organs (and embryos for many) in the same species. This resource was generated using a common deep RNA sequencing protocol to obtain the most exhaustive gene repertoire possible in each species that allows between-species comparisons to study the evolution of gene expression in different lineages. The PhyloFish database described here can be accessed and searched using RNAbrowse, a simple and efficient solution to give access to RNA-seq de novo assembled transcripts.

  11. Genome-wide comparative analysis of papain-like cysteine protease family genes in castor bean and physic nut.

    PubMed

    Zou, Zhi; Huang, Qixing; Xie, Guishui; Yang, Lifu

    2018-01-10

    Papain-like cysteine proteases (PLCPs) are a class of proteolytic enzymes involved in many plant processes. Compared with the extensive research in Arabidopsis thaliana, little is known in castor bean (Ricinus communis) and physic nut (Jatropha curcas), two Euphorbiaceous plants without any recent whole-genome duplication. In this study, a total of 26 or 23 PLCP genes were identified from the genomes of castor bean and physic nut respectively, which can be divided into nine subfamilies based on the phylogenetic analysis: RD21, CEP, XCP, XBCP3, THI, SAG12, RD19, ALP and CTB. Although most of them harbor orthologs in Arabidopsis, several members in subfamilies RD21, CEP, XBCP3 and SAG12 form new groups or subgroups as observed in other species, suggesting specific gene loss occurred in Arabidopsis. Recent gene duplicates were also identified in these two species, but they are limited to the SAG12 subfamily and were all derived from local duplication. Expression profiling revealed diverse patterns of different family members over various tissues. Furthermore, the evolution characteristics of PLCP genes were also compared and discussed. Our findings provide a useful reference to characterize PLCP genes and investigate the family evolution in Euphorbiaceae and species beyond.

  12. Molecular phylogeny and evolution of alcohol dehydrogenase (Adh) genes in legumes

    PubMed Central

    Fukuda, Tatsuya; Yokoyama, Jun; Nakamura, Toru; Song, In-Ja; Ito, Takuro; Ochiai, Toshinori; Kanno, Akira; Kameya, Toshiaki; Maki, Masayuki

    2005-01-01

    Background Nuclear genes determine the vast range of phenotypes that are responsible for the adaptive abilities of organisms in nature. Nevertheless, the evolutionary processes that generate the structures and functions of nuclear genes are only now be coming understood. The aim of our study is to isolate the alcohol dehydrogenase (Adh) genes in two distantly related legumes, and use these sequences to examine the molecular evolutionary history of this nuclear gene. Results We isolated the expressed Adh genes from two species of legumes, Sophora flavescens Ait. and Wisteria floribunda DC., by a RT-PCR based approach and found a new Adh locus in addition to homologues of the Adh genes found previously in legumes. To examine the evolution of these genes, we compared the species and gene trees and found gene duplication of the Adh loci in the legumes occurred as an ancient event. Conclusion This is the first report revealing that some legume species have at least two Adh gene loci belonging to separate clades. Phylogenetic analyses suggest that these genes resulted from relatively ancient duplication events. PMID:15836788

  13. Intercellular Variability in Protein Levels from Stochastic Expression and Noisy Cell Cycle Processes

    PubMed Central

    Soltani, Mohammad; Vargas-Garcia, Cesar A.; Antunes, Duarte; Singh, Abhyudai

    2016-01-01

    Inside individual cells, expression of genes is inherently stochastic and manifests as cell-to-cell variability or noise in protein copy numbers. Since proteins half-lives can be comparable to the cell-cycle length, randomness in cell-division times generates additional intercellular variability in protein levels. Moreover, as many mRNA/protein species are expressed at low-copy numbers, errors incurred in partitioning of molecules between two daughter cells are significant. We derive analytical formulas for the total noise in protein levels when the cell-cycle duration follows a general class of probability distributions. Using a novel hybrid approach the total noise is decomposed into components arising from i) stochastic expression; ii) partitioning errors at the time of cell division and iii) random cell-division events. These formulas reveal that random cell-division times not only generate additional extrinsic noise, but also critically affect the mean protein copy numbers and intrinsic noise components. Counter intuitively, in some parameter regimes, noise in protein levels can decrease as cell-division times become more stochastic. Computations are extended to consider genome duplication, where transcription rate is increased at a random point in the cell cycle. We systematically investigate how the timing of genome duplication influences different protein noise components. Intriguingly, results show that noise contribution from stochastic expression is minimized at an optimal genome-duplication time. Our theoretical results motivate new experimental methods for decomposing protein noise levels from synchronized and asynchronized single-cell expression data. Characterizing the contributions of individual noise mechanisms will lead to precise estimates of gene expression parameters and techniques for altering stochasticity to change phenotype of individual cells. PMID:27536771

  14. Evolution history of duplicated smad3 genes in teleost: insights from Japanese flounder, Paralichthys olivaceus

    PubMed Central

    Du, Xinxin; Liu, Yuezhong; Liu, Jinxiang; Zhang, Quanqi

    2016-01-01

    Following the two rounds of whole-genome duplication (WGD) during deuterosome evolution, a third genome duplication occurred in the ray-fined fish lineage and is considered to be responsible for the teleost-specific lineage diversification and regulation mechanisms. As a receptor-regulated SMAD (R-SMAD), the function of SMAD3 was widely studied in mammals. However, limited information of its role or putative paralogs is available in ray-finned fishes. In this study, two SMAD3 paralogs were first identified in the transcriptome and genome of Japanese flounder (Paralichthys olivaceus). We also explored SMAD3 duplication in other selected species. Following identification, genomic structure, phylogenetic reconstruction, and synteny analyses performed by MrBayes and online bioinformatic tools confirmed that smad3a/3b most likely originated from the teleost-specific WGD. Additionally, selection pressure analysis and expression pattern of the two genes performed by PAML and quantitative real-time PCR (qRT-PCR) revealed evidence of subfunctionalization of the two SMAD3 paralogs in teleost. Our results indicate that two SMAD3 genes originate from teleost-specific WGD, remain transcriptionally active, and may have likely undergone subfunctionalization. This study provides novel insights to the evolution fates of smad3a/3b and draws attentions to future function analysis of SMAD3 gene family. PMID:27703851

  15. Cloning of the Arabidopsis and Rice Formaldehyde Dehydrogenase Genes: Implications for the Origin of Plant Adh Enzymes

    PubMed Central

    Dolferus, R.; Osterman, J. C.; Peacock, W. J.; Dennis, E. S.

    1997-01-01

    This article reports the cloning of the genes encoding the Arabidopsis and rice class III ADH enzymes, members of the alcohol dehydrogenase or medium chain reductase/dehydrogenase superfamily of proteins with glutathione-dependent formaldehyde dehydrogenase activity (GSH-FDH). Both genes contain eight introns in exactly the same positions, and these positions are conserved in plant ethanol-active Adh genes (class P). These data provide further evidence that plant class P genes have evolved from class III genes by gene duplication and acquisition of new substrate specificities. The position of introns and similarities in the nucleic acid and amino acid sequences of the different classes of ADH enzymes in plants and humans suggest that plant and animal class III enzymes diverged before they duplicated to give rise to plant and animal ethanol-active ADH enzymes. Plant class P ADH enzymes have gained substrate specificities and evolved promoters with different expression properties, in keeping with their metabolic function as part of the alcohol fermentation pathway. PMID:9215914

  16. Diurnal Cycling Transcription Factors of Pineapple Revealed by Genome-Wide Annotation and Global Transcriptomic Analysis.

    PubMed

    Sharma, Anupma; Wai, Ching Man; Ming, Ray; Yu, Qingyi

    2017-09-01

    Circadian clock provides fitness advantage by coordinating internal metabolic and physiological processes to external cyclic environments. Core clock components exhibit daily rhythmic changes in gene expression, and the majority of them are transcription factors (TFs) and transcription coregulators (TCs). We annotated 1,398 TFs from 67 TF families and 80 TCs from 20 TC families in pineapple, and analyzed their tissue-specific and diurnal expression patterns. Approximately 42% of TFs and 45% of TCs displayed diel rhythmic expression, including 177 TF/TCs cycling only in the nonphotosynthetic leaf tissue, 247 cycling only in the photosynthetic leaf tissue, and 201 cycling in both. We identified 68 TF/TCs whose cycling expression was tightly coupled between the photosynthetic and nonphotosynthetic leaf tissues. These TF/TCs likely coordinate key biological processes in pineapple as we demonstrated that this group is enriched in homologous genes that form the core circadian clock in Arabidopsis and includes a STOP1 homolog. Two lines of evidence support the important role of the STOP1 homolog in regulating CAM photosynthesis in pineapple. First, STOP1 responds to acidic pH and regulates a malate channel in multiple plant species. Second, the cycling expression pattern of the pineapple STOP1 and the diurnal pattern of malate accumulation in pineapple leaf are correlated. We further examined duplicate-gene retention and loss in major known circadian genes and refined their evolutionary relationships between pineapple and other plants. Significant variations in duplicate-gene retention and loss were observed for most clock genes in both monocots and dicots. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. The grapevine kinome: annotation, classification and expression patterns in developmental processes and stress responses.

    PubMed

    Zhu, Kaikai; Wang, Xiaolong; Liu, Jinyi; Tang, Jun; Cheng, Qunkang; Chen, Jin-Gui; Cheng, Zong-Ming Max

    2018-01-01

    Protein kinases (PKs) have evolved as the largest family of molecular switches that regulate protein activities associated with almost all essential cellular functions. Only a fraction of plant PKs, however, have been functionally characterized even in model plant species. In the present study, the entire grapevine kinome was identified and annotated using the most recent version of the grapevine genome. A total of 1168 PK-encoding genes were identified and classified into 20 groups and 121 families, with the RLK-Pelle group being the largest, with 872 members. The 1168 kinase genes were unevenly distributed over all 19 chromosomes, and both tandem and segmental duplications contributed to the expansion of the grapevine kinome, especially of the RLK-Pelle group. Ka/Ks values indicated that most of the tandem and segmental duplication events were under purifying selection. The grapevine kinome families exhibited different expression patterns during plant development and in response to various stress treatments, with many being coexpressed. The comprehensive annotation of grapevine kinase genes, their patterns of expression and coexpression, and the related information facilitate a more complete understanding of the roles of various grapevine kinases in growth and development, responses to abiotic stress, and evolutionary history.

  18. Eye development in the four-eyed fish Anableps anableps: cranial and retinal adaptations to simultaneous aerial and aquatic vision.

    PubMed

    Perez, Louise N; Lorena, Jamily; Costa, Carinne M; Araujo, Maysa S; Frota-Lima, Gabriela N; Matos-Rodrigues, Gabriel E; Martins, Rodrigo A P; Mattox, George M T; Schneider, Patricia N

    2017-04-12

    The unique eyes of the four-eyed fish Anableps anableps have long intrigued biologists. Key features associated with the bulging eye of Anableps include the expanded frontal bone and the duplicated pupils and cornea. Furthermore, the Anableps retina expresses different photoreceptor genes in dorsal and ventral regions, potentially associated with distinct aerial and aquatic stimuli. To gain insight into the developmental basis of the Anableps unique eye, we examined neurocranium and eye ontogeny, as well as photoreceptor gene expression during larval stages. First, we described six larval stages during which duplication of eye structures occurs. Our osteological analysis of neurocranium ontogeny revealed another distinctive Anablepid feature: an ossified interorbital septum partially separating the orbital cavities. Furthermore, we identified the onset of differences in cell proliferation and cell layer density between dorsal and ventral regions of the retina. Finally, we show that differential photoreceptor gene expression in the retina initiates during development, suggesting that it is inherited and not environmentally determined. In sum, our results shed light on the ontogenetic steps leading to the highly derived Anableps eye. © 2017 The Author(s).

  19. Dose-sensitivity, conserved non-coding sequences, and duplicate gene retention through multiple tetraploidies in the grasses.

    PubMed

    Schnable, James C; Pedersen, Brent S; Subramaniam, Sabarinath; Freeling, Michael

    2011-01-01

    Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy.

  20. Dose–Sensitivity, Conserved Non-Coding Sequences, and Duplicate Gene Retention Through Multiple Tetraploidies in the Grasses

    PubMed Central

    Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael

    2011-01-01

    Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525

  1. Comparative Life Cycle Transcriptomics Revises Leishmania mexicana Genome Annotation and Links a Chromosome Duplication with Parasitism of Vertebrates

    PubMed Central

    Fiebig, Michael; Kelly, Steven; Gluenz, Eva

    2015-01-01

    Leishmania spp. are protozoan parasites that have two principal life cycle stages: the motile promastigote forms that live in the alimentary tract of the sandfly and the amastigote forms, which are adapted to survive and replicate in the harsh conditions of the phagolysosome of mammalian macrophages. Here, we used Illumina sequencing of poly-A selected RNA to characterise and compare the transcriptomes of L. mexicana promastigotes, axenic amastigotes and intracellular amastigotes. These data allowed the production of the first transcriptome evidence-based annotation of gene models for this species, including genome-wide mapping of trans-splice sites and poly-A addition sites. The revised genome annotation encompassed 9,169 protein-coding genes including 936 novel genes as well as modifications to previously existing gene models. Comparative analysis of gene expression across promastigote and amastigote forms revealed that 3,832 genes are differentially expressed between promastigotes and intracellular amastigotes. A large proportion of genes that were downregulated during differentiation to amastigotes were associated with the function of the motile flagellum. In contrast, those genes that were upregulated included cell surface proteins, transporters, peptidases and many uncharacterized genes, including 293 of the 936 novel genes. Genome-wide distribution analysis of the differentially expressed genes revealed that the tetraploid chromosome 30 is highly enriched for genes that were upregulated in amastigotes, providing the first evidence of a link between this whole chromosome duplication event and adaptation to the vertebrate host in this group. Peptide evidence for 42 proteins encoded by novel transcripts supports the idea of an as yet uncharacterised set of small proteins in Leishmania spp. with possible implications for host-pathogen interactions. PMID:26452044

  2. Induction of polyploidy by nuclear fusion mechanism upon decreased expression of the nuclear envelope protein LAP2β in the human osteosarcoma cell line U2OS.

    PubMed

    Ben-Shoshan, Shirley Oren; Simon, Amos J; Jacob-Hirsch, Jasmine; Shaklai, Sigal; Paz-Yaacov, Nurit; Amariglio, Ninette; Rechavi, Gideon; Trakhtenbrot, Luba

    2014-01-28

    Polyploidy has been recognized for many years as an important hallmark of cancer cells. Polyploid cells can arise through cell fusion, endoreplication and abortive cell cycle. The inner nuclear membrane protein LAP2β plays key roles in nuclear envelope breakdown and reassembly during mitosis, initiation of replication and transcriptional repression. Here we studied the function of LAP2β in the maintenance of cell ploidy state, a role which has not yet been assigned to this protein. By knocking down the expression of LAP2β, using both viral and non-viral RNAi approaches in osteosarcoma derived U2OS cells, we detected enlarged nuclear size, nearly doubling of DNA content and chromosomal duplications, as analyzed by fluorescent in situ hybridization and spectral karyotyping methodologies. Spectral karyotyping analyses revealed that near-hexaploid karyotypes of LAP2β knocked down cells consisted of not only seven duplicated chromosomal markers, as could be anticipated by genome duplication mechanism, but also of four single chromosomal markers. Furthermore, spectral karyotyping analysis revealed that both of two near-triploid U2OS sub-clones contained the seven markers that were duplicated in LAP2β knocked down cells, whereas the four single chromosomal markers were detected only in one of them. Gene expression profiling of LAP2β knocked down cells revealed that up to a third of the genes exhibiting significant changes in their expression are involved in cancer progression. Our results suggest that nuclear fusion mechanism underlies the polyploidization induction upon LAP2β reduced expression. Our study implies on a novel role of LAP2β in the maintenance of cell ploidy status. LAP2β depleted U2OS cells can serve as a model to investigate polyploidy and aneuploidy formation by nuclear fusion mechanism and its involvement in cancerogenesis.

  3. Expression analysis of vitellogenins in the workers of the red imported fire ant (Solenopsis invicta).

    PubMed

    Hawkings, Chloe; Tamborindeguy, Cecilia

    2018-01-01

    Vitellogenin has been proposed to regulate division of labor and social organization in social insects. The red imported fire ant ( Solenopsis invicta ) harbors four distinct, adjacent vitellogenin genes (Vg1, Vg2, Vg3, and Vg4). Contrary to honey bees that have a single Vg ortholog as well as potentially fertile nurses, and to other ant species that lay trophic eggs, S. invicta workers completely lack ovaries or the ability to lay eggs. This provides a unique model to investigate whether Vg duplication in S. invicta was followed by subfunctionalization to acquire non-reproductive functions and whether Vg was co-opted to regulate behavior within the worker caste. To investigate these questions, we compared the expression patterns of S. invicta Vg genes among workers from different morphological subcastes or performing different tasks. RT-qPCRs revealed higher relative expression of Vg1 in major workers compared to both medium and minor workers, and of Vg2 in major workers when compared to minor workers. Relative expression of Vg1 was also higher in carbohydrate foragers when compared to nurses and protein foragers. By contrast, the level of expression of Vg2, Vg3, and Vg4 were not significantly different among the workers performing the specific tasks. Additionally, we analyzed the relationship between the expression of the Vg genes and S-hydroprene, a juvenile hormone analog. No changes in Vg expression were recorded in workers 12 h after application of the analog. Our results suggest that in S. invicta the Vg gene underwent subfunctionalization after duplication to new functions based on the expression bias observed in these data. This may suggest an alternative and still unknown function for Vg in the workers that needs to be investigated further.

  4. Expression analysis of vitellogenins in the workers of the red imported fire ant (Solenopsis invicta)

    PubMed Central

    2018-01-01

    Vitellogenin has been proposed to regulate division of labor and social organization in social insects. The red imported fire ant (Solenopsis invicta) harbors four distinct, adjacent vitellogenin genes (Vg1, Vg2, Vg3, and Vg4). Contrary to honey bees that have a single Vg ortholog as well as potentially fertile nurses, and to other ant species that lay trophic eggs, S. invicta workers completely lack ovaries or the ability to lay eggs. This provides a unique model to investigate whether Vg duplication in S. invicta was followed by subfunctionalization to acquire non-reproductive functions and whether Vg was co-opted to regulate behavior within the worker caste. To investigate these questions, we compared the expression patterns of S. invicta Vg genes among workers from different morphological subcastes or performing different tasks. RT-qPCRs revealed higher relative expression of Vg1 in major workers compared to both medium and minor workers, and of Vg2 in major workers when compared to minor workers. Relative expression of Vg1 was also higher in carbohydrate foragers when compared to nurses and protein foragers. By contrast, the level of expression of Vg2, Vg3, and Vg4 were not significantly different among the workers performing the specific tasks. Additionally, we analyzed the relationship between the expression of the Vg genes and S-hydroprene, a juvenile hormone analog. No changes in Vg expression were recorded in workers 12 h after application of the analog. Our results suggest that in S. invicta the Vg gene underwent subfunctionalization after duplication to new functions based on the expression bias observed in these data. This may suggest an alternative and still unknown function for Vg in the workers that needs to be investigated further. PMID:29868280

  5. The evolutionary duplication and probable demise of an endodermal GATA factor in Caenorhabditis elegans.

    PubMed

    Fukushige, Tetsunari; Goszczynski, Barbara; Tian, Helen; McGhee, James D

    2003-10-01

    We describe the elt-4 gene from the nematode Caenorhabditis elegans. elt-4 is predicted to encode a very small (72 residues, 8.1 kD) GATA-type zinc finger transcription factor. The elt-4 gene is located approximately 5 kb upstream of the C. elegans elt-2 gene, which also encodes a GATA-type transcription factor; the zinc finger DNA-binding domains are highly conserved (24/25 residues) between the two proteins. The elt-2 gene is expressed only in the intestine and is essential for normal intestinal development. This article explores whether elt-4 also has a role in intestinal development. Reporter fusions to the elt-4 promoter or reporter insertions into the elt-4 coding regions show that elt-4 is indeed expressed in the intestine, beginning at the 1.5-fold stage of embryogenesis and continuing into adulthood. elt-4 reporter fusions are also expressed in nine cells of the posterior pharynx. Ectopic expression of elt-4 cDNA within the embryo does not cause detectable ectopic expression of biochemical markers of gut differentiation; furthermore, ectopic elt-4 expression neither inhibits nor enhances the ectopic marker expression caused by ectopic elt-2 expression. A deletion allele of elt-4 was isolated but no obvious phenotype could be detected, either in the gut or elsewhere; brood sizes, hatching efficiencies, and growth rates were indistinguishable from wild type. We found no evidence that elt-4 provided backup functions for elt-2. We used microarray analysis to search for genes that might be differentially expressed between L1 larvae of the elt-4 deletion strain and wild-type worms. Paired hybridizations were repeated seven times, allowing us to conclude, with some confidence, that no candidate target transcript could be identified as significantly up- or downregulated by loss of elt-4 function. In vitro binding experiments could not detect specific binding of ELT-4 protein to candidate binding sites (double-stranded oligonucleotides containing single or multiple WGATAR sequences); ELT-4 protein neither enhanced nor inhibited the strong sequence-specific binding of the ELT-2 protein. Whereas ELT-2 protein is a strong transcriptional activator in yeast, ELT-4 protein has no such activity under similar conditions, nor does it influence the transcriptional activity of coexpressed ELT-2 protein. Although an elt-2 homolog was easily identified in the genomic sequence of the related nematode C. briggsae, no elt-4 homolog could be identified. Analysis of the changes in silent third codon positions within the DNA-binding domains indicates that elt-4 arose as a duplication of elt-2, some 25-55 MYA. Thus, elt-4 has survived far longer than the average duplicated gene in C. elegans, even though no obvious biological function could be detected. elt-4 provides an interesting example of a tandemly duplicated gene that may originally have been the same size as elt-2 but has gradually been whittled down to its present size of little more than a zinc finger. Although elt-4 must confer (or must have conferred) some selective advantage to C. elegans, we suggest that its ultimate evolutionary fate will be disappearance from the C. elegans genome.

  6. Restriction of the Patau syndrome to duplication of 13q22{yields}q.32 and possible role of interphase nuclear structure

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Helali, A.N.; Jafolla, A.K.; Oumsiych, M.B.

    1994-09-01

    A 10-year-old white male presented with mild microcephaly, slight growth and psychomotor retardation, soft fleshy ears, and normal facial features except for thin lips. No other significant anomalies were reported except for tethered cord discovered at age 8 years. The karyotype was found to be 46,XY,der(18)t(13;18)(q32;p11.32)pat. The mild phenotype appears to be primarily due to the duplication of 13q32{yields}qter. None of the cardinal features of trisomy 13 are found in cases of duplication of bands 13q22 to qter. This case shows that Patau syndrome phenotype does not originate by duplication of 13q32{yields}qter and may thus be restricted to 13q22 tomore » 13q32. The variability in phenotypes points to an alternative explanation to the classical one of additive and interactive gene effects. This model involves effects of changes in chromosome position in the interphase nucleus on gene expression.« less

  7. Comparative Transcriptomic Analysis of Two Brassica napus Near-Isogenic Lines Reveals a Network of Genes That Influences Seed Oil Accumulation.

    PubMed

    Wang, Jingxue; Singh, Sanjay K; Du, Chunfang; Li, Chen; Fan, Jianchun; Pattanaik, Sitakanta; Yuan, Ling

    2016-01-01

    Rapeseed ( Brassica napus ) is an important oil seed crop, providing more than 13% of the world's supply of edible oils. An in-depth knowledge of the gene network involved in biosynthesis and accumulation of seed oil is critical for the improvement of B. napus . Using available genomic and transcriptomic resources, we identified 1,750 acyl-lipid metabolism (ALM) genes that are distributed over 19 chromosomes in the B . napus genome. B. rapa and B. oleracea , two diploid progenitors of B. napus , contributed almost equally to the ALM genes. Genome collinearity analysis demonstrated that the majority of the ALM genes have arisen due to genome duplication or segmental duplication events. In addition, we profiled the expression patterns of the ALM genes in four different developmental stages. Furthermore, we developed two B. napus near isogenic lines (NILs). The high oil NIL, YC13-559, accumulates significantly higher (∼10%) seed oil compared to the other, YC13-554. Comparative gene expression analysis revealed upregulation of lipid biosynthesis-related regulatory genes in YC13-559, including SHOOTMERISTEMLESS, LEAFY COTYLEDON 1 (LEC1), LEC2, FUSCA3, ABSCISIC ACID INSENSITIVE 3 (ABI3), ABI4, ABI5 , and WRINKLED1 , as well as structural genes, such as ACETYL-CoA CARBOXYLASE, ACYL-CoA DIACYLGLYCEROL ACYLTRANSFERASE , and LONG - CHAIN ACYL-CoA SYNTHETASES . We observed that several genes related to the phytohormones, gibberellins, jasmonate, and indole acetic acid, were differentially expressed in the NILs. Our findings provide a broad account of the numbers, distribution, and expression profiles of acyl-lipid metabolism genes, as well as gene networks that potentially control oil accumulation in B . napus seeds. The upregulation of key regulatory and structural genes related to lipid biosynthesis likely plays a major role for the increased seed oil in YC13-559.

  8. TTT and PIKK Complex Genes Reverted to Single Copy Following Polyploidization and Retain Function Despite Massive Retrotransposition in Maize.

    PubMed

    Garcia, Nelson; Messing, Joachim

    2017-01-01

    The TEL2, TTI1, and TTI2 proteins are co-chaperones for heat shock protein 90 (HSP90) to regulate the protein folding and maturation of phosphatidylinositol 3-kinase-related kinases (PIKKs). Referred to as the TTT complex, the genes that encode them are highly conserved from man to maize. TTT complex and PIKK genes exist mostly as single copy genes in organisms where they have been characterized. Members of this interacting protein network in maize were identified and synteny analyses were performed to study their evolution. Similar to other species, there is only one copy of each of these genes in maize which was due to a loss of the duplicated copy created by ancient allotetraploidy. Moreover, the retained copies of the TTT complex and the PIKK genes tolerated extensive retrotransposon insertion in their introns that resulted in increased gene lengths and gene body methylation, without apparent effect in normal gene expression and function. The results raise an interesting question on whether the reversion to single copy was due to selection against deleterious unbalanced gene duplications between members of the complex as predicted by the gene balance hypothesis, or due to neutral loss of extra copies. Uneven alteration of dosage either by adding extra copies or modulating gene expression of complex members is being proposed as a means to investigate whether the data supports the gene balance hypothesis or not.

  9. Regulation of the neuropathy-associated Pmp22 gene by a distal super-enhancer.

    PubMed

    Pantera, Harrison; Moran, John J; Hung, Holly A; Pak, Evgenia; Dutra, Amalia; Svaren, John

    2018-05-16

    Peripheral nerve myelination is adversely affected in the most common form of the hereditary peripheral neuropathy called Charcot-Marie-Tooth Disease. This form, classified as CMT1A, is caused by a 1.4 Mb duplication on chromosome 17, which includes the abundantly expressed Schwann cell myelin gene, Peripheral Myelin Protein 22 (PMP22). This is one of the most common copy number variants causing neurological disease. Overexpression of Pmp22 in rodent models recapitulates several aspects of neuropathy, and reduction of Pmp22 in such models results in amelioration of the neuropathy phenotype. Recently we identified a potential super-enhancer approximately 90-130 kb upstream of the Pmp22 transcription start sites. This super-enhancer encompasses a cluster of individual enhancers that have the acetylated histone H3K27 active enhancer mark, and coincides with smaller duplications identified in patients with milder CMT1A-like symptoms, where the PMP22 coding region itself was not part of the duplication. In this study, we have utilized genome editing to create a deletion of this super-enhancer to determine its role in Pmp22 regulation. Our data show a significant decrease in Pmp22 transcript expression using allele-specific internal controls. Moreover, the P2 promoter of the Pmp22 gene, which is used in other cell types, is affected, but we find that the Schwann cell-specific P1 promoter is disproportionately more sensitive to loss of the super-enhancer. These data show for the first time the requirement of these upstream enhancers for full Pmp22 expression.

  10. Insights into three whole-genome duplications gleaned from the Paramecium caudatum genome sequence.

    PubMed

    McGrath, Casey L; Gout, Jean-Francois; Doak, Thomas G; Yanagi, Akira; Lynch, Michael

    2014-08-01

    Paramecium has long been a model eukaryote. The sequence of the Paramecium tetraurelia genome reveals a history of three successive whole-genome duplications (WGDs), and the sequences of P. biaurelia and P. sexaurelia suggest that these WGDs are shared by all members of the aurelia species complex. Here, we present the genome sequence of P. caudatum, a species closely related to the P. aurelia species group. P. caudatum shares only the most ancient of the three WGDs with the aurelia complex. We found that P. caudatum maintains twice as many paralogs from this early event as the P. aurelia species, suggesting that post-WGD gene retention is influenced by subsequent WGDs and supporting the importance of selection for dosage in gene retention. The availability of P. caudatum as an outgroup allows an expanded analysis of the aurelia intermediate and recent WGD events. Both the Guanine+Cytosine (GC) content and the expression level of preduplication genes are significant predictors of duplicate retention. We find widespread asymmetrical evolution among aurelia paralogs, which is likely caused by gradual pseudogenization rather than by neofunctionalization. Finally, cases of divergent resolution of intermediate WGD duplicates between aurelia species implicate this process acts as an ongoing reinforcement mechanism of reproductive isolation long after a WGD event. Copyright © 2014 by the Genetics Society of America.

  11. Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize

    PubMed Central

    Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan

    2014-01-01

    Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.) and most of them were targeted to mitochondria or chloroplasts. Maize mTERF were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up or down-regulation expression of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, respectively, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683

  12. Evolution of a Cellular Immune Response in Drosophila: A Phenotypic and Genomic Comparative Analysis

    PubMed Central

    Salazar-Jaramillo, Laura; Paspati, Angeliki; van de Zande, Louis; Vermeulen, Cornelis Joseph; Schwander, Tanja; Wertheim, Bregje

    2014-01-01

    Understanding the genomic basis of evolutionary adaptation requires insight into the molecular basis underlying phenotypic variation. However, even changes in molecular pathways associated with extreme variation, gains and losses of specific phenotypes, remain largely uncharacterized. Here, we investigate the large interspecific differences in the ability to survive infection by parasitoids across 11 Drosophila species and identify genomic changes associated with gains and losses of parasitoid resistance. We show that a cellular immune defense, encapsulation, and the production of a specialized blood cell, lamellocytes, are restricted to a sublineage of Drosophila, but that encapsulation is absent in one species of this sublineage, Drosophila sechellia. Our comparative analyses of hemopoiesis pathway genes and of genes differentially expressed during the encapsulation response revealed that hemopoiesis-associated genes are highly conserved and present in all species independently of their resistance. In contrast, 11 genes that are differentially expressed during the response to parasitoids are novel genes, specific to the Drosophila sublineage capable of lamellocyte-mediated encapsulation. These novel genes, which are predominantly expressed in hemocytes, arose via duplications, whereby five of them also showed signatures of positive selection, as expected if they were recruited for new functions. Three of these novel genes further showed large-scale and presumably loss-of-function sequence changes in D. sechellia, consistent with the loss of resistance in this species. In combination, these convergent lines of evidence suggest that co-option of duplicated genes in existing pathways and subsequent neofunctionalization are likely to have contributed to the evolution of the lamellocyte-mediated encapsulation in Drosophila. PMID:24443439

  13. Evolution of a cellular immune response in Drosophila: a phenotypic and genomic comparative analysis.

    PubMed

    Salazar-Jaramillo, Laura; Paspati, Angeliki; van de Zande, Louis; Vermeulen, Cornelis Joseph; Schwander, Tanja; Wertheim, Bregje

    2014-02-01

    Understanding the genomic basis of evolutionary adaptation requires insight into the molecular basis underlying phenotypic variation. However, even changes in molecular pathways associated with extreme variation, gains and losses of specific phenotypes, remain largely uncharacterized. Here, we investigate the large interspecific differences in the ability to survive infection by parasitoids across 11 Drosophila species and identify genomic changes associated with gains and losses of parasitoid resistance. We show that a cellular immune defense, encapsulation, and the production of a specialized blood cell, lamellocytes, are restricted to a sublineage of Drosophila, but that encapsulation is absent in one species of this sublineage, Drosophila sechellia. Our comparative analyses of hemopoiesis pathway genes and of genes differentially expressed during the encapsulation response revealed that hemopoiesis-associated genes are highly conserved and present in all species independently of their resistance. In contrast, 11 genes that are differentially expressed during the response to parasitoids are novel genes, specific to the Drosophila sublineage capable of lamellocyte-mediated encapsulation. These novel genes, which are predominantly expressed in hemocytes, arose via duplications, whereby five of them also showed signatures of positive selection, as expected if they were recruited for new functions. Three of these novel genes further showed large-scale and presumably loss-of-function sequence changes in D. sechellia, consistent with the loss of resistance in this species. In combination, these convergent lines of evidence suggest that co-option of duplicated genes in existing pathways and subsequent neofunctionalization are likely to have contributed to the evolution of the lamellocyte-mediated encapsulation in Drosophila.

  14. Hsp70 gene expansions in the scallop Patinopecten yessoensis and their expression regulation after exposure to the toxic dinoflagellate Alexandrium catenella.

    PubMed

    Cheng, Jie; Xun, Xiaogang; Kong, Yifan; Wang, Shuyue; Yang, Zhihui; Li, Yajuan; Kong, Dexu; Wang, Shi; Zhang, Lingling; Hu, Xiaoli; Bao, Zhenmin

    2016-11-01

    Heat shock protein 70 (Hsp70s) family members are present in virtually all living organisms and perform a fundamental role against different types of environmental stressors and pathogenic organisms. Marine bivalves live in highly dynamic environments and may accumulate paralytic shellfish toxins (PSTs), a class of well-known neurotoxins closely associated with harmful algal blooms (HABs). Here, we provide a systematic analysis of Hsp70 genes (PyHsp70s) in the genome of Yesso scallop (Patinopecten yessoensis), an important aquaculture species in China, through in silico analysis using transcriptome and genome databases. Phylogenetic analyses indicated extensive expansion of Hsp70 genes from the Hspa12 sub-family in the Yesso scallop and also the bivalve lineages, with gene duplication events before or after the split between the Yesso scallop and the Pacific oyster. In addition, we determined the expression patterns of PyHsp70s after exposure to Alexandrium catenella, the dinoflagellate producing PSTs. Our results confirmed the inducible expression patterns of PyHsp70s under PSTs stress, and the responses to the toxic stress may have arisen through the adaptive recruitment of tandem duplication of Hsp70 genes. These findings provide a thorough overview of the evolution and modification of the Hsp70 family, which will gain insights into the functional characteristics of scallop Hsp70 genes in response to different stresses. Copyright © 2016. Published by Elsevier Ltd.

  15. Identification and molecular characterization of MYB Transcription Factor Superfamily in C4 model plant foxtail millet (Setaria italica L.).

    PubMed

    Muthamilarasan, Mehanathan; Khandelwal, Rohit; Yadav, Chandra Bhan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    MYB proteins represent one of the largest transcription factor families in plants, playing important roles in diverse developmental and stress-responsive processes. Considering its significance, several genome-wide analyses have been conducted in almost all land plants except foxtail millet. Foxtail millet (Setaria italica L.) is a model crop for investigating systems biology of millets and bioenergy grasses. Further, the crop is also known for its potential abiotic stress-tolerance. In this context, a comprehensive genome-wide survey was conducted and 209 MYB protein-encoding genes were identified in foxtail millet. All 209 S. italica MYB (SiMYB) genes were physically mapped onto nine chromosomes of foxtail millet. Gene duplication study showed that segmental- and tandem-duplication have occurred in genome resulting in expansion of this gene family. The protein domain investigation classified SiMYB proteins into three classes according to number of MYB repeats present. The phylogenetic analysis categorized SiMYBs into ten groups (I-X). SiMYB-based comparative mapping revealed a maximum orthology between foxtail millet and sorghum, followed by maize, rice and Brachypodium. Heat map analysis showed tissue-specific expression pattern of predominant SiMYB genes. Expression profiling of candidate MYB genes against abiotic stresses and hormone treatments using qRT-PCR revealed specific and/or overlapping expression patterns of SiMYBs. Taken together, the present study provides a foundation for evolutionary and functional characterization of MYB TFs in foxtail millet to dissect their functions in response to environmental stimuli.

  16. Identification of a novel Gig2 gene family specific to non-amniote vertebrates.

    PubMed

    Zhang, Yi-Bing; Liu, Ting-Kai; Jiang, Jun; Shi, Jun; Liu, Ying; Li, Shun; Gui, Jian-Fang

    2013-01-01

    Gig2 (grass carp reovirus (GCRV)-induced gene 2) is first identified as a novel fish interferon (IFN)-stimulated gene (ISG). Overexpression of a zebrafish Gig2 gene can protect cultured fish cells from virus infection. In the present study, we identify a novel gene family that is comprised of genes homologous to the previously characterized Gig2. EST/GSS search and in silico cloning identify 190 Gig2 homologous genes in 51 vertebrate species ranged from lampreys to amphibians. Further large-scale search of vertebrate and invertebrate genome databases indicate that Gig2 gene family is specific to non-amniotes including lampreys, sharks/rays, ray-finned fishes and amphibians. Phylogenetic analysis and synteny analysis reveal lineage-specific expansion of Gig2 gene family and also provide valuable evidence for the fish-specific genome duplication (FSGD) hypothesis. Although Gig2 family proteins exhibit no significant sequence similarity to any known proteins, a typical Gig2 protein appears to consist of two conserved parts: an N-terminus that bears very low homology to the catalytic domains of poly(ADP-ribose) polymerases (PARPs), and a novel C-terminal domain that is unique to this gene family. Expression profiling of zebrafish Gig2 family genes shows that some duplicate pairs have diverged in function via acquisition of novel spatial and/or temporal expression under stresses. The specificity of this gene family to non-amniotes might contribute to a large extent to distinct physiology in non-amniote vertebrates.

  17. The Ndst Gene Family in Zebrafish: Role of Ndst1b in Pharyngeal Arch Formation

    PubMed Central

    Haitina, Tatjana; Habicher, Judith; Ledin, Johan; Kjellén, Lena

    2015-01-01

    Heparan sulfate (HS) proteoglycans are ubiquitous components of the extracellular matrix and plasma membrane of metazoans. The sulfation pattern of the HS glycosaminoglycan chain is characteristic for each tissue and changes during development. The glucosaminyl N-deacetylase/N-sulfotransferase (NDST) enzymes catalyze N-deacetylation and N-sulfation during HS biosynthesis and have a key role in designing the sulfation pattern. We here report on the presence of five NDST genes in zebrafish. Zebrafish ndst1a, ndst1b, ndst2a and ndst2b represent duplicated mammalian orthologues of NDST1 and NDST2 that arose through teleost specific genome duplication. Interestingly, the single zebrafish orthologue ndst3, is equally similar to tetrapod Ndst3 and Ndst4. It is likely that a local duplication in the common ancestor of lobe-finned fish and tetrapods gave rise to these two genes. All zebrafish Ndst genes showed distinct but partially overlapping expression patterns during embryonic development. Morpholino knockdown of ndst1b resulted in delayed development, craniofacial cartilage abnormalities, shortened body and pectoral fin length, resembling some of the features of the Ndst1 mouse knockout. PMID:25767878

  18. House spider genome uncovers evolutionary shifts in the diversity and expression of black widow venom proteins associated with extreme toxicity.

    PubMed

    Gendreau, Kerry L; Haney, Robert A; Schwager, Evelyn E; Wierschin, Torsten; Stanke, Mario; Richards, Stephen; Garb, Jessica E

    2017-02-16

    Black widow spiders are infamous for their neurotoxic venom, which can cause extreme and long-lasting pain. This unusual venom is dominated by latrotoxins and latrodectins, two protein families virtually unknown outside of the black widow genus Latrodectus, that are difficult to study given the paucity of spider genomes. Using tissue-, sex- and stage-specific expression data, we analyzed the recently sequenced genome of the house spider (Parasteatoda tepidariorum), a close relative of black widows, to investigate latrotoxin and latrodectin diversity, expression and evolution. We discovered at least 47 latrotoxin genes in the house spider genome, many of which are tandem-arrayed. Latrotoxins vary extensively in predicted structural domains and expression, implying their significant functional diversification. Phylogenetic analyses show latrotoxins have substantially duplicated after the Latrodectus/Parasteatoda split and that they are also related to proteins found in endosymbiotic bacteria. Latrodectin genes are less numerous than latrotoxins, but analyses show their recruitment for venom function from neuropeptide hormone genes following duplication, inversion and domain truncation. While latrodectins and other peptides are highly expressed in house spider and black widow venom glands, latrotoxins account for a far smaller percentage of house spider venom gland expression. The house spider genome sequence provides novel insights into the evolution of venom toxins once considered unique to black widows. Our results greatly expand the size of the latrotoxin gene family, reinforce its narrow phylogenetic distribution, and provide additional evidence for the lateral transfer of latrotoxins between spiders and bacterial endosymbionts. Moreover, we strengthen the evidence for the evolution of latrodectin venom genes from the ecdysozoan Ion Transport Peptide (ITP)/Crustacean Hyperglycemic Hormone (CHH) neuropeptide superfamily. The lower expression of latrotoxins in house spiders relative to black widows, along with the absence of a vertebrate-targeting α-latrotoxin gene in the house spider genome, may account for the extreme potency of black widow venom.

  19. Disruption of rcsB by a duplicated sequence in a curli-producing Escherichia coli O157:H7 results in differential gene expression in relation to biofilm formation, stress responses and metabolism.

    PubMed

    Sharma, V K; Bayles, D O; Alt, D P; Looft, T; Brunelle, B W; Stasko, J A

    2017-03-08

    Escherichia coli O157:H7 (O157) strain 86-24, linked to a 1986 disease outbreak, displays curli- and biofilm-negative phenotypes that are correlated with the lack of Congo red (CR) binding and formation of white colonies (CR - ) on a CR-containing medium. However, on a CR medium this strain produces red isolates (CR + ) capable of producing curli fimbriae and biofilms. To identify genes controlling differential expression of curli fimbriae and biofilm formation, the RNA-Seq profile of a CR + isolate was compared to the CR - parental isolate. Of the 242 genes expressed differentially in the CR + isolate, 201 genes encoded proteins of known functions while the remaining 41 encoded hypothetical proteins. Among the genes with known functions, 149 were down- and 52 were up-regulated. Some of the upregulated genes were linked to biofilm formation through biosynthesis of curli fimbriae and flagella. The genes encoding transcriptional regulators, such as CsgD, QseB, YkgK, YdeH, Bdm, CspD, BssR and FlhDC, which modulate biofilm formation, were significantly altered in their expression. Several genes of the envelope stress (cpxP), heat shock (rpoH, htpX, degP), oxidative stress (ahpC, katE), nutrient limitation stress (phoB-phoR and pst) response pathways, and amino acid metabolism were downregulated in the CR + isolate. Many genes mediating acid resistance and colanic acid biosynthesis, which influence biofilm formation directly or indirectly, were also down-regulated. Comparative genomics of CR + and CR - isolates revealed the presence of a short duplicated sequence in the rcsB gene of the CR + isolate. The alignment of the amino acid sequences of RcsB of the two isolates showed truncation of RcsB in the CR + isolate at the insertion site of the duplicated sequence. Complementation of CR + isolate with rcsB of the CR - parent restored parental phenotypes to the CR + isolate. The results of this study indicate that RcsB is a global regulator affecting bacterial survival in growth-restrictive environments through upregulation of genes promoting biofilm formation while downregulating certain metabolic functions. Understanding whether rcsB inactivation enhances persistence and survival of O157 in carrier animals and the environment would be important in developing strategies for controlling this bacterial pathogen in these niches.

  20. Characterization and Evolution of Conserved MicroRNA through Duplication Events in Date Palm (Phoenix dactylifera)

    PubMed Central

    Yang, Yaodong; Mason, Annaliese S.; Lei, Xintao; Ma, Zilong

    2013-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events. PMID:23951162

  1. Characterization and evolution of conserved MicroRNA through duplication events in date palm (Phoenix dactylifera).

    PubMed

    Xiao, Yong; Xia, Wei; Yang, Yaodong; Mason, Annaliese S; Lei, Xintao; Ma, Zilong

    2013-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events.

  2. Comparative inference of duplicated genes produced by polyploidization in soybean genome.

    PubMed

    Yang, Yanmei; Wang, Jinpeng; Di, Jianyong

    2013-01-01

    Soybean (Glycine max) is one of the most important crop plants for providing protein and oil. It is important to investigate soybean genome for its economic and scientific value. Polyploidy is a widespread and recursive phenomenon during plant evolution, and it could generate massive duplicated genes which is an important resource for genetic innovation. Improved sequence alignment criteria and statistical analysis are used to identify and characterize duplicated genes produced by polyploidization in soybean. Based on the collinearity method, duplicated genes by whole genome duplication account for 70.3% in soybean. From the statistical analysis of the molecular distances between duplicated genes, our study indicates that the whole genome duplication event occurred more than once in the genome evolution of soybean, which is often distributed near the ends of chromosomes.

  3. Ncam1a and Ncam1b: two carriers of polysialic acid with different functions in the developing zebrafish nervous system.

    PubMed

    Langhauser, Melanie; Ustinova, Jana; Rivera-Milla, Eric; Ivannikov, Darja; Seidl, Carmen; Slomka, Christin; Finne, Jukka; Yoshihara, Yoshihiro; Bastmeyer, Martin; Bentrop, Joachim

    2012-02-01

    Polysialic acid (polySia) is mainly described as a glycan modification of the neural cell adhesion molecule NCAM1. PolySia-NCAM1 has multiple functions during the development of vertebrate nervous systems including axon extension and fasciculation. Phylogenetic analyses reveal the presence of two related gene clusters, NCAM1 and NCAM2, in tetrapods and fishes. Within the ncam1 cluster, teleost fishes express ncam1a (ncam) and ncam1b (pcam) as duplicated paralogs which arose from a second round of ray-finned fish-specific genome duplication. Tetrapods, in contrast, express a single NCAM1 gene. Using the zebrafish model, we identify Ncam1b as a novel major carrier of polySia in the nervous system. PolySia-Ncam1a is expressed predominantly in rostral regions of the developing nervous system, whereas polySia-Ncam1b prevails caudally. We show that ncam1a and ncam1b have different expression domains which only partially overlap. Furthermore, Ncam1a and Ncam1b and their polySia modifications serve different functions in axon guidance. Formation of the posterior commissure at the forebrain/midbrain junction requires polySia-Ncam1a on the axons for proper fasciculation, whereas Ncam1b, expressed by midbrain cell bodies, serves as an instructive guidance cue for the dorso-medially directed growth of axons. Spinal motor axons, on the other hand, depend on axonally expressed Ncam1b for correct growth toward their target region. Collectively, these findings suggest that the genome duplication in the teleost lineage has provided the basis for a functional diversification of polySia carriers in the nervous system.

  4. Evolution of herbivore-induced early defense signaling was shaped by genome-wide duplications in Nicotiana

    PubMed Central

    Zhou, Wenwu; Brockmöller, Thomas; Ling, Zhihao; Omdahl, Ashton; Baldwin, Ian T; Xu, Shuqing

    2016-01-01

    Herbivore-induced defenses are widespread, rapidly evolving and relevant for plant fitness. Such induced defenses are often mediated by early defense signaling (EDS) rapidly activated by the perception of herbivore associated elicitors (HAE) that includes transient accumulations of jasmonic acid (JA). Analyzing 60 HAE-induced leaf transcriptomes from closely-related Nicotiana species revealed a key gene co-expression network (M4 module) which is co-activated with the HAE-induced JA accumulations but is elicited independently of JA, as revealed in plants silenced in JA signaling. Functional annotations of the M4 module were consistent with roles in EDS and a newly identified hub gene of the M4 module (NaLRRK1) mediates a negative feedback loop with JA signaling. Phylogenomic analysis revealed preferential gene retention after genome-wide duplications shaped the evolution of HAE-induced EDS in Nicotiana. These results highlight the importance of genome-wide duplications in the evolution of adaptive traits in plants. DOI: http://dx.doi.org/10.7554/eLife.19531.001 PMID:27813478

  5. Multiple Genetic Mechanisms Contribute to Visual Sensitivity Variation in the Labridae

    PubMed Central

    Phillips, Genevieve A.C.; Carleton, Karen L.; Marshall, N. Justin

    2016-01-01

    Coral reefs are one of the most spectrally diverse environments, both in terms of habitat and animal color. Species identity, sex, and camouflage are drivers of the phenotypic diversity seen in coral reef fishes, but how the phenotypic diversity is reflected in the genotype remains to be answered. The labrids are a large, polyphyletic family of coral reef fishes that display a diverse range of colors, including developmental color morphs and extensive behavioral ecologies. Here, we assess the opsin sequence and expression diversity among labrids from the Great Barrier Reef, Australia. We found that labrids express a diverse palette of visual opsins, with gene duplications in both RH2 and LWS genes. The majority of opsins expressed were within the mid-to-long wavelength sensitive classes (RH2 and LWS). Three of the labrid species expressed SWS1 (ultra-violet sensitive) opsins with the majority expressing the violet-sensitive SWS2B gene and none expressing SWS2A. We used knowledge about spectral tuning sites to calculate approximate spectral sensitivities (λmax) for individual species’ visual pigments, which corresponded well with previously published λmax values for closely related species (SWS1: 356–370 nm; SWS2B: 421–451 nm; RH2B: 452–492 nm; RH2A: 516–528 nm; LWS1: 554–555 nm; LWS2: 561–562 nm). In contrast to the phenotypic diversity displayed via color patterns and feeding ecology, there was little amino acid diversity within the known opsin sequence tuning sites. However, gene duplications and differential expression provide alternative mechanisms for tuning visual pigments, resulting in variable visual sensitivities among labrid species. PMID:26464127

  6. The Evolution of Epigenetic Regulators CTCF and BORIS/CTCFL in Amniotes

    PubMed Central

    Hore, Timothy A.; Deakin, Janine E.; Marshall Graves, Jennifer A.

    2008-01-01

    CTCF is an essential, ubiquitously expressed DNA-binding protein responsible for insulator function, nuclear architecture, and transcriptional control within vertebrates. The gene CTCF was proposed to have duplicated in early mammals, giving rise to a paralogue called “brother of regulator of imprinted sites” (BORIS or CTCFL) with DNA binding capabilities similar to CTCF, but testis-specific expression in humans and mice. CTCF and BORIS have opposite regulatory effects on human cancer-testis genes, the anti-apoptotic BAG1 gene, the insulin-like growth factor 2/H19 imprint control region (IGF2/H19 ICR), and show mutually exclusive expression in humans and mice, suggesting that they are antagonistic epigenetic regulators. We discovered orthologues of BORIS in at least two reptilian species and found traces of its sequence in the chicken genome, implying that the duplication giving rise to BORIS occurred much earlier than previously thought. We analysed the expression of CTCF and BORIS in a range of amniotes by conventional and quantitative PCR. BORIS, as well as CTCF, was found widely expressed in monotremes (platypus) and reptiles (bearded dragon), suggesting redundancy or cooperation between these genes in a common amniote ancestor. However, we discovered that BORIS expression was gonad-specific in marsupials (tammar wallaby) and eutherians (cattle), implying that a functional change occurred in BORIS during the early evolution of therian mammals. Since therians show imprinting of IGF2 but other vertebrate taxa do not, we speculate that CTCF and BORIS evolved specialised functions along with the evolution of imprinting at this and other loci, coinciding with the restriction of BORIS expression to the germline and potential antagonism with CTCF. PMID:18769711

  7. The evolution of epigenetic regulators CTCF and BORIS/CTCFL in amniotes.

    PubMed

    Hore, Timothy A; Deakin, Janine E; Marshall Graves, Jennifer A

    2008-08-29

    CTCF is an essential, ubiquitously expressed DNA-binding protein responsible for insulator function, nuclear architecture, and transcriptional control within vertebrates. The gene CTCF was proposed to have duplicated in early mammals, giving rise to a paralogue called "brother of regulator of imprinted sites" (BORIS or CTCFL) with DNA binding capabilities similar to CTCF, but testis-specific expression in humans and mice. CTCF and BORIS have opposite regulatory effects on human cancer-testis genes, the anti-apoptotic BAG1 gene, the insulin-like growth factor 2/H19 imprint control region (IGF2/H19 ICR), and show mutually exclusive expression in humans and mice, suggesting that they are antagonistic epigenetic regulators. We discovered orthologues of BORIS in at least two reptilian species and found traces of its sequence in the chicken genome, implying that the duplication giving rise to BORIS occurred much earlier than previously thought. We analysed the expression of CTCF and BORIS in a range of amniotes by conventional and quantitative PCR. BORIS, as well as CTCF, was found widely expressed in monotremes (platypus) and reptiles (bearded dragon), suggesting redundancy or cooperation between these genes in a common amniote ancestor. However, we discovered that BORIS expression was gonad-specific in marsupials (tammar wallaby) and eutherians (cattle), implying that a functional change occurred in BORIS during the early evolution of therian mammals. Since therians show imprinting of IGF2 but other vertebrate taxa do not, we speculate that CTCF and BORIS evolved specialised functions along with the evolution of imprinting at this and other loci, coinciding with the restriction of BORIS expression to the germline and potential antagonism with CTCF.

  8. [Genome-wide identification and expression analysis of auxin-related gene families in grape].

    PubMed

    Yuan, Hua-zhao; Zhao, Mi-zhen; Wu, Wei-min; Yu, Hong-Mei; Qian, Ya-ming; Wang, Zhuang-wei; Wang, Xi-cheng

    2015-07-01

    The auxin response gene family adjusts the auxin balance and the growth hormone signaling pathways in plants. Using bioinformatics methods, the auxin-response genes from the grape genome database are identified and their chromosomal location, gene collinearity and phylogenetic analysis are performed. Probable genes include 25 AUX_IAA, 19 ARF, 9 GH3 and 42 LBD genes, which are unevenly distributed on all 19 chromosomes and some of them formed distinct tandem duplicate gene clusters. The available grape microarray databases show that all of the auxin-response genes are expressed in fruit and leaf buds, and significant overexpressed during fruit color-changing, bud break and bud dormancy periods. This paper provides a resource for functional studies of auxin-response genes in grape leaf and fruit development.

  9. Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

    PubMed Central

    Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789

  10. Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

    PubMed

    Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.

  11. Genome-wide identification and characterization of five MyD88 duplication genes in Yesso scallop (Patinopecten yessoensis) and expression changes in response to bacterial challenge.

    PubMed

    Ning, Xianhui; Wang, Ruijia; Li, Xue; Wang, Shuyue; Zhang, Mengran; Xing, Qiang; Sun, Yan; Wang, Shi; Zhang, Lingling; Hu, Xiaoli; Bao, Zhenmin

    2015-10-01

    Myeloid differentiation factor 88 (MyD88) is a pivotal adaptor in the TLR/IL-1R signaling pathway, which plays an important role in activating the innate immune system. Although MyD88 genes have been identified in a variety of species, they have not been systematically characterized in scallops. In this study, five MyD88 genes were identified in Yesso scallop (Patinopecten yessoensis), PyMyD88-1, PyMyD88-2a, PyMyD88-2b, PyMyD88-3 and PyMyD88-4, which consisted of two pairs of tandem duplications located on the same chromosome. To our knowledge, this is the largest number of MyD88 genes found in an invertebrate. Phylogenetic and protein structural analyses were carried out to determine the identities and evolutionary relationships of these genes. PyMyD88s have highly conserved structures compared to MyD88 genes from other invertebrate species, except for PyMyD88-4, which contains only a DD domain, suggesting the evolutionarily conserved form of this particular gene member. We investigated the expression profiles of PyMyD88 genes at different developmental stages and in healthy adult tissues and hemocytes after Micrococcus luteus and Vibrio anguillarum infection using quantitative real-time PCR (qRT-PCR). The expression of most PyMyD88s was significantly induced in the acute phase (3-6 h) after infection with both gram-positive (M. luteus) and gram-negative (V. anguillarum) bacteria, with much more dramatic changes in PyMyD88 expression being observed after V. anguillarum challenge. Collectively, the abundance of MyD88s and their specific expression patterns provide insight into their versatile roles in the response of the bivalve innate immune system to gram-negative bacterial pathogens. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Integrative transcriptome network analysis of iPSC-derived neurons from schizophrenia and schizoaffective disorder patients with 22q11.2 deletion.

    PubMed

    Lin, Mingyan; Pedrosa, Erika; Hrabovsky, Anastasia; Chen, Jian; Puliafito, Benjamin R; Gilbert, Stephanie R; Zheng, Deyou; Lachman, Herbert M

    2016-11-15

    Individuals with 22q11.2 Deletion Syndrome (22q11.2 DS) are a specific high-risk group for developing schizophrenia (SZ), schizoaffective disorder (SAD) and autism spectrum disorders (ASD). Several genes in the deleted region have been implicated in the development of SZ, e.g., PRODH and DGCR8. However, the mechanistic connection between these genes and the neuropsychiatric phenotype remains unclear. To elucidate the molecular consequences of 22q11.2 deletion in early neural development, we carried out RNA-seq analysis to investigate gene expression in early differentiating human neurons derived from induced pluripotent stem cells (iPSCs) of 22q11.2 DS SZ and SAD patients. Eight cases (ten iPSC-neuron samples in total including duplicate clones) and seven controls (nine in total including duplicate clones) were subjected to RNA sequencing. Using a systems level analysis, differentially expressed genes/gene-modules and pathway of interests were identified. Lastly, we related our findings from in vitro neuronal cultures to brain development by mapping differentially expressed genes to BrainSpan transcriptomes. We observed ~2-fold reduction in expression of almost all genes in the 22q11.2 region in SZ (37 genes reached p-value < 0.05, 36 of which reached a false discovery rate < 0.05). Outside of the deleted region, 745 genes showed significant differences in expression between SZ and control neurons (p < 0.05). Function enrichment and network analysis of the differentially expressed genes uncovered converging evidence on abnormal expression in key functional pathways, such as apoptosis, cell cycle and survival, and MAPK signaling in the SZ and SAD samples. By leveraging transcriptome profiles of normal human brain tissues across human development into adulthood, we showed that the differentially expressed genes converge on a sub-network mediated by CDC45 and the cell cycle, which would be disrupted by the 22q11.2 deletion during embryonic brain development, and another sub-network modulated by PRODH, which could contribute to disruption of brain function during adolescence. This study has provided evidence for disruption of potential molecular events in SZ patient with 22q11.2 deletion and related our findings from in vitro neuronal cultures to functional perturbations that can occur during brain development in SZ.

  13. Rapid evolution and copy number variation of primate RHOXF2, an X-linked homeobox gene involved in male reproduction and possibly brain function.

    PubMed

    Niu, Ao-lei; Wang, Yin-qiu; Zhang, Hui; Liao, Cheng-hong; Wang, Jin-kai; Zhang, Rui; Che, Jun; Su, Bing

    2011-10-12

    Homeobox genes are the key regulators during development, and they are in general highly conserved with only a few reported cases of rapid evolution. RHOXF2 is an X-linked homeobox gene in primates. It is highly expressed in the testicle and may play an important role in spermatogenesis. As male reproductive system is often the target of natural and/or sexual selection during evolution, in this study, we aim to dissect the pattern of molecular evolution of RHOXF2 in primates and its potential functional consequence. We studied sequences and copy number variation of RHOXF2 in humans and 16 nonhuman primate species as well as the expression patterns in human, chimpanzee, white-browed gibbon and rhesus macaque. The gene copy number analysis showed that there had been parallel gene duplications/losses in multiple primate lineages. Our evidence suggests that 11 nonhuman primate species have one RHOXF2 copy, and two copies are present in humans and four Old World monkey species, and at least 6 copies in chimpanzees. Further analysis indicated that the gene duplications in primates had likely been mediated by endogenous retrovirus (ERV) sequences flanking the gene regions. In striking contrast to non-human primates, humans appear to have homogenized their two RHOXF2 copies by the ERV-mediated non-allelic recombination mechanism. Coding sequence and phylogenetic analysis suggested multi-lineage strong positive selection on RHOXF2 during primate evolution, especially during the origins of humans and chimpanzees. All the 8 coding region polymorphic sites in human populations are non-synonymous, implying on-going selection. Gene expression analysis demonstrated that besides the preferential expression in the reproductive system, RHOXF2 is also expressed in the brain. The quantitative data suggests expression pattern divergence among primate species. RHOXF2 is a fast-evolving homeobox gene in primates. The rapid evolution and copy number changes of RHOXF2 had been driven by Darwinian positive selection acting on the male reproductive system and possibly also on the central nervous system, which sheds light on understanding the role of homeobox genes in adaptive evolution.

  14. Three neuropeptide Y receptor genes in the spiny dogfish, Squalus acanthias, support en bloc duplications in early vertebrate evolution.

    PubMed

    Salaneck, Erik; Ardell, David H; Larson, Earl T; Larhammar, Dan

    2003-08-01

    It has been debated whether the increase in gene number during early vertebrate evolution was due to multiple independent gene duplications or synchronous duplications of many genes. We describe here the cloning of three neuropeptide Y (NPY) receptor genes belonging to the Y1 subfamily in the spiny dogfish, Squalus acanthias, a cartilaginous fish. The three genes are orthologs of the mammalian subtypes Y1, Y4, and Y6, which are located in paralogous gene regions on different chromosomes in mammals. Thus, these genes arose by duplications of a chromosome region before the radiation of gnathostomes (jawed vertebrates). Estimates of duplication times from linearized trees together with evidence from other gene families supports two rounds of chromosome duplications or tetraploidizations early in vertebrate evolution. The anatomical distribution of mRNA was determined by reverse-transcriptase PCR and was found to differ from mammals, suggesting differential functional diversification of the new gene copies during the radiation of the vertebrate classes.

  15. Evolutionary change in the structure of the regulatory region that drives tissue and temporally regulated expression of alcohol dehydrogenase gene in Drosophila funebris.

    PubMed

    Amador, A; Papaceit, M; Juan, E

    2001-06-01

    The Adh locus of Drosophilidae is organized as a single gene transcribed from two spatially and temporally regulated promoters except in species of the repleta group, which have two single promoter genes. Here we show that in Drosophila funebris the Adh gene is transcribed from a single promoter, in both larva and adult, with qualitative and quantitative species specific-differences in tissue distribution. The gene is expressed in larval fat body but in other tissues such as gastric caeca, midgut and Malpighian tubules its expression is reduced compared to most Drosophilidae species, and in adults it is almost limited to the fat body. The comparative analysis of gene expression of two strains, which differ by a duplication, indicates that the cis elements necessary for this pattern of expression in larvae are included in the region of 1.55 kb upstream of the transcription initiation site. This new organization reveals the evolution of a different regulatory strategy to express the Adh gene in the subgenus Drosophila.

  16. Roles of the sister chromatid cohesion apparatus in gene expression, development, and human syndromes

    PubMed Central

    Dorsett, Dale

    2006-01-01

    The sister chromatid cohesion apparatus mediates physical pairing of duplicated chromosomes. This pairing is essential for appropriate distribution of chromosomes into the daughter cells upon cell division. Recent evidence shows that the cohesion apparatus, which is a significant structural component of chromosomes during interphase, also affects gene expression and development. The Cornelia de Lange (CdLS) and Roberts/SC phocomelia (RBS/SC) genetic syndromes in humans are caused by mutations affecting components of the cohesion apparatus. Studies in Drosophila suggest that effects on gene expression are most likely responsible for developmental alterations in CdLS. Effects on chromatid cohesion are apparent in RBS/SC syndrome, but data from yeast and Drosophila point to the likelihood that changes in expression of genes located in heterochromatin could contribute to the developmental deficits. PMID:16819604

  17. Human-Specific NOTCH2NL Genes Affect Notch Signaling and Cortical Neurogenesis.

    PubMed

    Fiddes, Ian T; Lodewijk, Gerrald A; Mooring, Meghan; Bosworth, Colleen M; Ewing, Adam D; Mantalas, Gary L; Novak, Adam M; van den Bout, Anouk; Bishara, Alex; Rosenkrantz, Jimi L; Lorig-Roach, Ryan; Field, Andrew R; Haeussler, Maximilian; Russo, Lotte; Bhaduri, Aparna; Nowakowski, Tomasz J; Pollen, Alex A; Dougherty, Max L; Nuttle, Xander; Addor, Marie-Claude; Zwolinski, Simon; Katzman, Sol; Kriegstein, Arnold; Eichler, Evan E; Salama, Sofie R; Jacobs, Frank M J; Haussler, David

    2018-05-31

    Genetic changes causing brain size expansion in human evolution have remained elusive. Notch signaling is essential for radial glia stem cell proliferation and is a determinant of neuronal number in the mammalian cortex. We find that three paralogs of human-specific NOTCH2NL are highly expressed in radial glia. Functional analysis reveals that different alleles of NOTCH2NL have varying potencies to enhance Notch signaling by interacting directly with NOTCH receptors. Consistent with a role in Notch signaling, NOTCH2NL ectopic expression delays differentiation of neuronal progenitors, while deletion accelerates differentiation into cortical neurons. Furthermore, NOTCH2NL genes provide the breakpoints in 1q21.1 distal deletion/duplication syndrome, where duplications are associated with macrocephaly and autism and deletions with microcephaly and schizophrenia. Thus, the emergence of human-specific NOTCH2NL genes may have contributed to the rapid evolution of the larger human neocortex, accompanied by loss of genomic stability at the 1q21.1 locus and resulting recurrent neurodevelopmental disorders. Copyright © 2018 Elsevier Inc. All rights reserved.

  18. The evolution of an osmotically inducible dps in the genus Streptomyces.

    PubMed

    Facey, Paul D; Hitchings, Matthew D; Williams, Jason S; Skibinski, David O F; Dyson, Paul J; Del Sol, Ricardo

    2013-01-01

    Dps proteins are found almost ubiquitously in bacterial genomes and there is now an appreciation of their multifaceted roles in various stress responses. Previous studies have shown that this family of proteins assemble into dodecamers and their quaternary structure is entirely critical to their function. Moreover, the numbers of dps genes per bacterial genome is variable; even amongst closely related species - however, for many genera this enigma is yet to be satisfactorily explained. We reconstruct the most probable evolutionary history of Dps in Streptomyces genomes. Typically, these bacteria encode for more than one Dps protein. We offer the explanation that variation in the number of dps per genome among closely related Streptomyces can be explained by gene duplication or lateral acquisition, and the former preceded a subsequent shift in expression patterns for one of the resultant paralogs. We show that the genome of S. coelicolor encodes for three Dps proteins including a tailless Dps. Our in vivo observations show that the tailless protein, unlike the other two Dps in S. coelicolor, does not readily oligomerise. Phylogenetic and bioinformatic analyses combined with expression studies indicate that in several Streptomyces species at least one Dps is significantly over-expressed during osmotic shock, but the identity of the ortholog varies. In silico analysis of dps promoter regions coupled with gene expression studies of duplicated dps genes shows that paralogous gene pairs are expressed differentially and this correlates with the presence of a sigB promoter. Lastly, we identify a rare novel clade of Dps and show that a representative of these proteins in S. coelicolor possesses a dodecameric quaternary structure of high stability.

  19. Genome-wide classification, evolutionary analysis and gene expression patterns of the kinome in Gossypium

    PubMed Central

    Yan, Jun; Li, Guilin; Guo, Xingqi; Li, Yang; Cao, Xuecheng

    2018-01-01

    The protein kinase (PK, kinome) family is one of the largest families in plants and regulates almost all aspects of plant processes, including plant development and stress responses. Despite their important functions, comprehensive functional classification, evolutionary analysis and expression patterns of the cotton PK gene family has yet to be performed on PK genes. In this study, we identified the cotton kinomes in the Gossypium raimondii, Gossypium arboretum, Gossypium hirsutum and Gossypium barbadense genomes and classified them into 7 groups and 122–24 subfamilies using software HMMER v3.0 scanning and neighbor-joining (NJ) phylogenetic analysis. Some conserved exon-intron structures were identified not only in cotton species but also in primitive plants, ferns and moss, suggesting the significant function and ancient origination of these PK genes. Collinearity analysis revealed that 16.6 million years ago (Mya) cotton-specific whole genome duplication (WGD) events may have played a partial role in the expansion of the cotton kinomes, whereas tandem duplication (TD) events mainly contributed to the expansion of the cotton RLK group. Synteny analysis revealed that tetraploidization of G. hirsutum and G. barbadense contributed to the expansion of G. hirsutum and G. barbadense PKs. Global expression analysis of cotton PKs revealed stress-specific and fiber development-related expression patterns, suggesting that many cotton PKs might be involved in the regulation of the stress response and fiber development processes. This study provides foundational information for further studies on the evolution and molecular function of cotton PKs. PMID:29768506

  20. Impact of gene gains, losses and duplication modes on the origin and diversification of vertebrates.

    PubMed

    Cañestro, Cristian; Albalat, Ricard; Irimia, Manuel; Garcia-Fernàndez, Jordi

    2013-02-01

    The study of the evolutionary origin of vertebrates has been linked to the study of genome duplications since Susumo Ohno suggested that the successful diversification of vertebrate innovations was facilitated by two rounds of whole-genome duplication (2R-WGD) in the stem vertebrate. Since then, studies on the functional evolution of many genes duplicated in the vertebrate lineage have provided the grounds to support experimentally this link. This article reviews cases of gene duplications derived either from the 2R-WGD or from local gene duplication events in vertebrates, analyzing their impact on the evolution of developmental innovations. We analyze how gene regulatory networks can be rewired by the activity of transposable elements after genome duplications, discuss how different mechanisms of duplication might affect the fate of duplicated genes, and how the loss of gene duplicates might influence the fate of surviving paralogs. We also discuss the evolutionary relationships between gene duplication and alternative splicing, in particular in the vertebrate lineage. Finally, we discuss the role that the 2R-WGD might have played in the evolution of vertebrate developmental gene networks, paying special attention to those related to vertebrate key features such as neural crest cells, placodes, and the complex tripartite brain. In this context, we argue that current evidences points that the 2R-WGD may not be linked to the origin of vertebrate innovations, but to their subsequent diversification in a broad variety of complex structures and functions that facilitated the successful transition from peaceful filter-feeding non-vertebrate ancestors to voracious vertebrate predators. Copyright © 2013 Elsevier Ltd. All rights reserved.

  1. Selection Shapes Transcriptional Logic and Regulatory Specialization in Genetic Networks.

    PubMed

    Fogelmark, Karl; Peterson, Carsten; Troein, Carl

    2016-01-01

    Living organisms need to regulate their gene expression in response to environmental signals and internal cues. This is a computational task where genes act as logic gates that connect to form transcriptional networks, which are shaped at all scales by evolution. Large-scale mutations such as gene duplications and deletions add and remove network components, whereas smaller mutations alter the connections between them. Selection determines what mutations are accepted, but its importance for shaping the resulting networks has been debated. To investigate the effects of selection in the shaping of transcriptional networks, we derive transcriptional logic from a combinatorially powerful yet tractable model of the binding between DNA and transcription factors. By evolving the resulting networks based on their ability to function as either a simple decision system or a circadian clock, we obtain information on the regulation and logic rules encoded in functional transcriptional networks. Comparisons are made between networks evolved for different functions, as well as with structurally equivalent but non-functional (neutrally evolved) networks, and predictions are validated against the transcriptional network of E. coli. We find that the logic rules governing gene expression depend on the function performed by the network. Unlike the decision systems, the circadian clocks show strong cooperative binding and negative regulation, which achieves tight temporal control of gene expression. Furthermore, we find that transcription factors act preferentially as either activators or repressors, both when binding multiple sites for a single target gene and globally in the transcriptional networks. This separation into positive and negative regulators requires gene duplications, which highlights the interplay between mutation and selection in shaping the transcriptional networks.

  2. Genetic control of ColE1 plasmid stability that is independent of plasmid copy number regulation.

    PubMed

    Standley, Melissa S; Million-Weaver, Samuel; Alexander, David L; Hu, Shuai; Camps, Manel

    2018-06-16

    ColE1-like plasmid vectors are widely used for expression of recombinant genes in E. coli. For these vectors, segregation of individual plasmids into daughter cells during cell division appears to be random, making them susceptible to loss over time when no mechanisms ensuring their maintenance are present. Here we use the plasmid pGFPuv in a recA relA strain as a sensitized model to study factors affecting plasmid stability in the context of recombinant gene expression. We find that in this model, plasmid stability can be restored by two types of genetic modifications to the plasmid origin of replication (ori) sequence: point mutations and a novel 269 nt duplication at the 5' end of the plasmid ori, which we named DAS (duplicated anti-sense) ori. Combinations of these modifications produce a range of copy numbers and of levels of recombinant expression. In direct contradiction with the classic random distribution model, we find no correlation between increased plasmid copy number and increased plasmid stability. Increased stability cannot be explained by reduced levels of recombinant gene expression either. Our observations would be more compatible with a hybrid clustered and free-distribution model, which has been recently proposed based on detection of individual plasmids in vivo using super-resolution fluorescence microscopy. This work suggests a role for the plasmid ori in the control of segregation of ColE1 plasmids that is distinct from replication initiation, opening the door for the genetic regulation of plasmid stability as a strategy aimed at enhancing large-scale recombinant gene expression or bioremediation.

  3. Molecular analyses of juvenile granulosa cell tumors bearing AKT1 mutations provide insights into tumor biology and therapeutic leads.

    PubMed

    Auguste, Aurélie; Bessière, Laurianne; Todeschini, Anne-Laure; Caburet, Sandrine; Sarnacki, Sabine; Prat, Jaime; D'angelo, Emanuela; De La Grange, Pierre; Ariste, Olivier; Lemoine, Fréderic; Legois, Bérangère; Sultan, Charles; Zider, Alain; Galmiche, Louise; Kalfa, Nicolas; Veitia, Reiner A

    2015-12-01

    Juvenile granulosa cell tumors (JGCTs) of the ovary are pediatric neoplasms representing 5% of all granulosa cell tumors (GCTs). Most GCTs are of adult type (AGCTs) and bear a mutation in the FOXL2 gene. The molecular basis of JGCTs is poorly understood, although mutations in the GNAS gene have been reported. We have detected in-frame duplications within the oncogene AKT1 in >60% of the JGCTs studied. Here, to evaluate the functional impact of these duplications and the existence of potential co-driver alterations, we have sequenced the transcriptome of four JGCTs and compared them with control transcriptomes. A search for gene variants detected only private alterations probably unrelated with tumorigenesis, suggesting that tandem duplications are the best candidates to underlie tumor formation in the absence of GNAS alterations. We previously showed that the duplications were specific to JGCTs. However, the screening of eight AGCTs samples without FOXL2 mutation showed the existence of an AKT1 duplication in one case, also having a stromal luteoma. The analysis of RNA-Seq data pinpointed a series of differentially expressed genes, involved in cytokine and hormone signaling and cell division-related processes. Further analyses pointed to the existence of a possible dedifferentiation process and suggested that most of the transcriptomic dysregulation might be mediated by a limited set of transcription factors perturbed by AKT1 activation. Finally, we show that commercially available AKT inhibitors can modulate the in vitro activity of various mutated forms. These results shed light on the pathogenesis of JGCTs and provide therapeutic leads for a targeted treatment. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  4. Evolution of AGL6-like MADS Box Genes in Grasses (Poaceae): Ovule Expression Is Ancient and Palea Expression Is New[W][OA

    PubMed Central

    Reinheimer, Renata; Kellogg, Elizabeth A.

    2009-01-01

    AGAMOUS-like6 (AGL6) genes encode MIKC-type MADS box transcription factors and are closely related to SEPALLATA and AP1/FUL-like genes. Here, we focus on the molecular evolution and expression of the AGL6-like genes in grasses. We have found that AGL6-like genes are expressed in ovules, lodicules (second whorl floral organs), paleas (putative first whorl floral organs), and floral meristems. Each of these expression domains was acquired at a different time in evolution, indicating that each represents a distinct function of the gene product and that the AGL6-like genes are pleiotropic. Expression in the inner integument of the ovule appears to be an ancient expression pattern corresponding to the expression of the gene in the megasporangium and integument in gymnosperms. Expression in floral meristems appears to have been acquired in the angiosperms and expression in second whorl organs in monocots. Early in grass evolution, AGL6-like orthologs acquired a new expression domain in the palea. Stamen expression is variable. Most grasses have a single AGL6-like gene (orthologous to the rice [Oryza sativa] gene MADS6). However, rice and other species of Oryza have a second copy (orthologous to rice MADS17) that appears to be the result of an ancient duplication. PMID:19749151

  5. Rapid Expansion of Immune-Related Gene Families in the House Fly, Musca domestica

    PubMed Central

    Lazzaro, Brian P.; Clark, Andrew G.

    2017-01-01

    Abstract The house fly, Musca domestica, occupies an unusual diversity of potentially septic niches compared with other sequenced Dipteran insects and is a vector of numerous diseases of humans and livestock. In the present study, we apply whole-transcriptome sequencing to identify genes whose expression is regulated in adult flies upon bacterial infection. We then combine the transcriptomic data with analysis of rates of gene duplication and loss to provide insight into the evolutionary dynamics of immune-related genes. Genes up-regulated after bacterial infection are biased toward being evolutionarily recent innovations, suggesting the recruitment of novel immune components in the M. domestica or ancestral Dipteran lineages. In addition, using new models of gene family evolution, we show that several different classes of immune-related genes, particularly those involved in either pathogen recognition or pathogen killing, are duplicating at a significantly accelerated rate on the M. domestica lineage relative to other Dipterans. Taken together, these results suggest that the M. domestica immune response includes an elevated diversity of genes, perhaps as a consequence of its lifestyle in septic environments. PMID:28087775

  6. Comprehensive analysis of MHC class I genes from the U-, S-, and Z-lineages in Atlantic salmon.

    PubMed

    Lukacs, Morten F; Harstad, Håvard; Bakke, Hege G; Beetz-Sargent, Marianne; McKinnel, Linda; Lubieniecki, Krzysztof P; Koop, Ben F; Grimholt, Unni

    2010-03-05

    We have previously sequenced more than 500 kb of the duplicated MHC class I regions in Atlantic salmon. In the IA region we identified the loci for the MHC class I gene Sasa-UBA in addition to a soluble MHC class I molecule, Sasa-ULA. A pseudolocus for Sasa-UCA was identified in the nonclassical IB region. Both regions contained genes for antigen presentation, as wells as orthologues to other genes residing in the human MHC region. The genomic localisation of two MHC class I lineages (Z and S) has been resolved. 7 BACs were sequenced using a combination of standard Sanger and 454 sequencing. The new sequence data extended the IA region with 150 kb identifying the location of one Z-lineage locus, ZAA. The IB region was extended with 350 kb including three new Z-lineage loci, ZBA, ZCA and ZDA in addition to a UGA locus. An allelic version of the IB region contained a functional UDA locus in addition to the UCA pseudolocus. Additionally a BAC harbouring two MHC class I genes (UHA) was placed on linkage group 14, while a BAC containing the S-lineage locus SAA (previously known as UAA) was placed on LG10. Gene expression studies showed limited expression range for all class I genes with exception of UBA being dominantly expressed in gut, spleen and gills, and ZAA with high expression in blood. Here we describe the genomic organization of MHC class I loci from the U-, Z-, and S-lineages in Atlantic salmon. Nine of the described class I genes are located in the extension of the duplicated IA and IB regions, while three class I genes are found on two separate linkage groups. The gene organization of the two regions indicates that the IB region is evolving at a different pace than the IA region. Expression profiling, polymorphic content, peptide binding properties and phylogenetic relationship show that Atlantic salmon has only one MHC class Ia gene (UBA), in addition to a multitude of nonclassical MHC class I genes from the U-, S- and Z-lineages.

  7. A Phylogenomic Investigation of CYCLOIDEA-Like TCP Genes in the Leguminosae1

    PubMed Central

    Citerne, Hélène L.; Luo, Da; Pennington, R. Toby; Coen, Enrico; Cronk, Quentin C.B.

    2003-01-01

    Numerous TCP genes (transcription factors with a TCP domain) occur in legumes. Genes of this class in Arabidopsis (TCP1) and snapdragon (Antirrhinum majus; CYCLOIDEA) have been shown to be asymmetrically expressed in developing floral primordia, and in snapdragon, they are required for floral zygomorphy (bilaterally symmetrical flowers). These genes are therefore particularly interesting in Leguminosae, a family that is thought to have evolved zygomorphy independently from other zygomorphic angiosperm lineages. Using a phylogenomic approach, we show that homologs of TCP1/CYCLOIDEA occur in legumes and may be divided into two main classes (LEGCYC group I and II), apparently the result of an early duplication, and each class is characterized by a typical amino acid signature in the TCP domain. Furthermore, group I genes in legumes may be divided into two subclasses (LEGCYC IA and IB), apparently the result of a duplication near the base of the papilionoid legumes or below. Most papilionoid legumes investigated have all three genes present (LEGCYC IA, IB, and II), inviting further work to investigate possible functional difference between the three types. However, within these three major gene groups, the precise relationships of the paralogs between species are difficult to determine probably because of a complex history of duplication and loss with lineage sorting or heterotachy (within-site rate variation) due to functional differentiation. The results illustrate both the potential and the difficulties of orthology determination in variable gene families, on which the phylogenomic approach to formulating hypotheses of function depends. PMID:12644657

  8. Duplication of an upstream silencer of FZP increases grain yield in rice.

    PubMed

    Bai, Xufeng; Huang, Yong; Hu, Yong; Liu, Haiyang; Zhang, Bo; Smaczniak, Cezary; Hu, Gang; Han, Zhongmin; Xing, Yongzhong

    2017-11-01

    Transcriptional silencer and copy number variants (CNVs) are associated with gene expression. However, their roles in generating phenotypes have not been well studied. Here we identified a rice quantitative trait locus, SGDP7 (Small Grain and Dense Panicle 7). SGDP7 is identical to FZP (FRIZZY PANICLE), which represses the formation of axillary meristems. The causal mutation of SGDP7 is an 18-bp fragment, named CNV-18bp, which was inserted ~5.3 kb upstream of FZP and resulted in a tandem duplication in the cultivar Chuan 7. The CNV-18bp duplication repressed FZP expression, prolonged the panicle branching period and increased grain yield by more than 15% through substantially increasing the number of spikelets per panicle (SPP) and slightly decreasing the 1,000-grain weight (TGW). The transcription repressor OsBZR1 binds the CGTG motifs in CNV-18bp and thereby represses FZP expression, indicating that CNV-18bp is the upstream silencer of FZP. These findings showed that the silencer CNVs coordinate a trade-off between SPP and TGW by fine-tuning FZP expression, and balancing the trade-off could enhance yield potential.

  9. A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors

    PubMed Central

    Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa

    2006-01-01

    Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets. PMID:16606702

  10. Diversification and Expression of the PIN, AUX/LAX, and ABCB Families of Putative Auxin Transporters in Populus

    PubMed Central

    Carraro, Nicola; Tisdale-Orr, Tracy Eizabeth; Clouse, Ronald Matthew; Knöller, Anne Sophie; Spicer, Rachel

    2012-01-01

    Intercellular transport of the plant hormone auxin is mediated by three families of membrane-bound protein carriers, with the PIN and ABCB families coding primarily for efflux proteins and the AUX/LAX family coding for influx proteins. In the last decade our understanding of gene and protein function for these transporters in Arabidopsis has expanded rapidly but very little is known about their role in woody plant development. Here we present a comprehensive account of all three families in the model woody species Populus, including chromosome distribution, protein structure, quantitative gene expression, and evolutionary relationships. The PIN and AUX/LAX gene families in Populus comprise 16 and 8 members respectively and show evidence for the retention of paralogs following a relatively recent whole genome duplication. There is also differential expression across tissues within many gene pairs. The ABCB family is previously undescribed in Populus and includes 20 members, showing a much deeper evolutionary history, including both tandem and whole genome duplication as well as probable gene loss. A striking number of these transporters are expressed in developing Populus stems and we suggest that evolutionary and structural relationships with known auxin transporters in Arabidopsis can point toward candidate genes for further study in Populus. This is especially important for the ABCBs, which is a large family and includes members in Arabidopsis that are able to transport other substrates in addition to auxin. Protein modeling, sequence alignment and expression data all point to ABCB1.1 as a likely auxin transport protein in Populus. Given that basipetal auxin flow through the cambial zone shapes the development of woody stems, it is important that we identify the full complement of genes involved in this process. This work should lay the foundation for studies targeting specific proteins for functional characterization and in situ localization. PMID:22645571

  11. A genome-wide analysis of the expansin genes in Malus × Domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Gao, Zheng; Chen, Changtian; Jiang, Zesheng; Shu, Huairui

    2014-04-01

    Expansins were first identified as cell wall-loosening proteins; they are involved in regulating cell expansion, fruits softening and many other physiological processes. However, our knowledge about the expansin family members and their evolutionary relationships in fruit trees, such as apple, is limited. In this study, we identified 41 members of the expansin gene family in the genome of apple (Malus × Domestica L. Borkh). Phylogenetic analysis revealed that expansin genes in apple could be divided into four subfamilies according to their gene structures and protein motifs. By phylogenetic analysis of the expansins in five plants (Arabidopsis, rice, poplar, grape and apple), the expansins were divided into 17 subgroups. Our gene duplication analysis revealed that whole-genome and chromosomal-segment duplications contributed to the expansion of Mdexpansins. The microarray and expressed sequence tag (EST) data showed that 34 Mdexpansin genes could be divided into five groups by the EST analysis; they may also play different roles during fruit development. An expression model for MdEXPA16 and MdEXPA20 showed their potential role in developing fruit. Overall, our study provides useful data and novel insights into the functions and regulatory mechanisms of the expansin genes in apple, as well as their evolution and divergence. As the first step towards genome-wide analysis of the expansin genes in apple, our results have established a solid foundation for future studies on the function of the expansin genes in fruit development.

  12. Clock genes and their genomic distributions in three species of salmonid fishes: Associations with genes regulating sexual maturation and cell cycling

    PubMed Central

    2010-01-01

    Background Clock family genes encode transcription factors that regulate clock-controlled genes and thus regulate many physiological mechanisms/processes in a circadian fashion. Clock1 duplicates and copies of Clock3 and NPAS2-like genes were partially characterized (genomic sequencing) and mapped using family-based indels/SNPs in rainbow trout (RT)(Oncorhynchus mykiss), Arctic charr (AC)(Salvelinus alpinus), and Atlantic salmon (AS)(Salmo salar) mapping panels. Results Clock1 duplicates mapped to linkage groups RT-8/-24, AC-16/-13 and AS-2/-18. Clock3/NPAS2-like genes mapped to RT-9/-20, AC-20/-43, and AS-5. Most of these linkage group regions containing the Clock gene duplicates were derived from the most recent 4R whole genome duplication event specific to the salmonids. These linkage groups contain quantitative trait loci (QTL) for life history and growth traits (i.e., reproduction and cell cycling). Comparative synteny analyses with other model teleost species reveal a high degree of conservation for genes in these chromosomal regions suggesting that functionally related or co-regulated genes are clustered in syntenic blocks. For example, anti-müllerian hormone (amh), regulating sexual maturation, and ornithine decarboxylase antizymes (oaz1 and oaz2), regulating cell cycling, are contained within these syntenic blocks. Conclusions Synteny analyses indicate that regions homologous to major life-history QTL regions in salmonids contain many candidate genes that are likely to influence reproduction and cell cycling. The order of these genes is highly conserved across the vertebrate species examined, and as such, these genes may make up a functional cluster of genes that are likely co-regulated. CLOCK, as a transcription factor, is found within this block and therefore has the potential to cis-regulate the processes influenced by these genes. Additionally, clock-controlled genes (CCGs) are located in other life-history QTL regions within salmonids suggesting that at least in part, trans-regulation of these QTL regions may also occur via Clock expression. PMID:20670436

  13. Gene Duplication, Population Genomics, and Species-Level Differentiation within a Tropical Mountain Shrub

    PubMed Central

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.

    2014-01-01

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  14. Both msa genes in Renibacterium salmoninarum are needed for full virulence in bacterial kidney disease

    USGS Publications Warehouse

    Coady, A.M.; Murray, A.L.; Elliott, D.G.; Rhodes, L.D.

    2006-01-01

    Renibacterium salmoninarum, a gram-positive diplococcobacillus that causes bacterial kidney disease among salmon and trout, has two chromosomal loci encoding the major soluble antigen (msa) gene. Because the MSA protein is widely suspected to be an important virulence factor, we used insertion-duplication mutagenesis to generate disruptions of either the msa1 or msa2 gene. Surprisingly, expression of MSA protein in broth cultures appeared unaffected. However, the virulence of either mutant in juvenile Chinook salmon (Oncorhynchus tshawytscha) by intraperitoneal challenge was severely attenuated, suggesting that disruption of the msa1 or msa2 gene affected in vivo expression. Copyright ?? 2006, American Society for Microbiology. All Rights Reserved.

  15. Genomic mechanisms accounting for the adaptation to parasitism in nematode-trapping fungi.

    PubMed

    Meerupati, Tejashwari; Andersson, Karl-Magnus; Friman, Eva; Kumar, Dharmendra; Tunlid, Anders; Ahrén, Dag

    2013-11-01

    Orbiliomycetes is one of the earliest diverging branches of the filamentous ascomycetes. The class contains nematode-trapping fungi that form unique infection structures, called traps, to capture and kill free-living nematodes. The traps have evolved differently along several lineages and include adhesive traps (knobs, nets or branches) and constricting rings. We show, by genome sequencing of the knob-forming species Monacrosporium haptotylum and comparison with the net-forming species Arthrobotrys oligospora, that two genomic mechanisms are likely to have been important for the adaptation to parasitism in these fungi. Firstly, the expansion of protein domain families and the large number of species-specific genes indicated that gene duplication followed by functional diversification had a major role in the evolution of the nematode-trapping fungi. Gene expression indicated that many of these genes are important for pathogenicity. Secondly, gene expression of orthologs between the two fungi during infection indicated that differential regulation was an important mechanism for the evolution of parasitism in nematode-trapping fungi. Many of the highly expressed and highly upregulated M. haptotylum transcripts during the early stages of nematode infection were species-specific and encoded small secreted proteins (SSPs) that were affected by repeat-induced point mutations (RIP). An active RIP mechanism was revealed by lack of repeats, dinucleotide bias in repeats and genes, low proportion of recent gene duplicates, and reduction of recent gene family expansions. The high expression and rapid divergence of SSPs indicate a striking similarity in the infection mechanisms of nematode-trapping fungi and plant and insect pathogens from the crown groups of the filamentous ascomycetes (Pezizomycotina). The patterns of gene family expansions in the nematode-trapping fungi were more similar to plant pathogens than to insect and animal pathogens. The observation of RIP activity in the Orbiliomycetes suggested that this mechanism was present early in the evolution of the filamentous ascomycetes.

  16. Genomic Mechanisms Accounting for the Adaptation to Parasitism in Nematode-Trapping Fungi

    PubMed Central

    Meerupati, Tejashwari; Andersson, Karl-Magnus; Friman, Eva; Kumar, Dharmendra; Tunlid, Anders; Ahrén, Dag

    2013-01-01

    Orbiliomycetes is one of the earliest diverging branches of the filamentous ascomycetes. The class contains nematode-trapping fungi that form unique infection structures, called traps, to capture and kill free-living nematodes. The traps have evolved differently along several lineages and include adhesive traps (knobs, nets or branches) and constricting rings. We show, by genome sequencing of the knob-forming species Monacrosporium haptotylum and comparison with the net-forming species Arthrobotrys oligospora, that two genomic mechanisms are likely to have been important for the adaptation to parasitism in these fungi. Firstly, the expansion of protein domain families and the large number of species-specific genes indicated that gene duplication followed by functional diversification had a major role in the evolution of the nematode-trapping fungi. Gene expression indicated that many of these genes are important for pathogenicity. Secondly, gene expression of orthologs between the two fungi during infection indicated that differential regulation was an important mechanism for the evolution of parasitism in nematode-trapping fungi. Many of the highly expressed and highly upregulated M. haptotylum transcripts during the early stages of nematode infection were species-specific and encoded small secreted proteins (SSPs) that were affected by repeat-induced point mutations (RIP). An active RIP mechanism was revealed by lack of repeats, dinucleotide bias in repeats and genes, low proportion of recent gene duplicates, and reduction of recent gene family expansions. The high expression and rapid divergence of SSPs indicate a striking similarity in the infection mechanisms of nematode-trapping fungi and plant and insect pathogens from the crown groups of the filamentous ascomycetes (Pezizomycotina). The patterns of gene family expansions in the nematode-trapping fungi were more similar to plant pathogens than to insect and animal pathogens. The observation of RIP activity in the Orbiliomycetes suggested that this mechanism was present early in the evolution of the filamentous ascomycetes. PMID:24244185

  17. Structural Divergence in Vertebrate Phylogeny of a Duplicated Prototype Galectin

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bhat, R.; Chakraborty, M.; Mian, I. S.

    Prototype galectins, endogenously expressed animal lectins with a single carbohydrate recognition domain, are well-known regulators of tissue properties such as growth and adhesion. The earliest discovered and best studied of the prototype galectins is Galectin-1 (Gal-1). In the Gallus gallus (chicken) genome, Gal-1 is represented by two homologs: Gal-1A and Gal-1B, with distinct biochemical properties, tissue expression, and developmental functions. We investigated the origin of the Gal-1A/Gal-1B divergence to gain insight into when their developmental functions originated and how they could have contributed to vertebrate phenotypic evolution. Sequence alignment and phylogenetic tree construction showed that the Gal-1A/Gal-1B divergence can bemore » traced back to the origin of the sauropsid lineage (consisting of extinct and extant reptiles and birds) although lineage-specific duplications also occurred in the amphibian and actinopterygian genomes. Gene synteny analysis showed that sauropsid gal-1b (the gene for Gal-1B) and its frog and actinopterygian gal-1 homologs share a similar chromosomal location, whereas sauropsid gal-1a has translocated to a new position. Surprisingly, we found that chicken Gal-1A, encoded by the translocated gal-1a, was more similar in its tertiary folding pattern than Gal-1B, encoded by the untranslocated gal-1b, to experimentally determined and predicted folds of nonsauropsid Gal-1s. This inference is consistent with our finding of a lower proportion of conserved residues in sauropsid Gal-1Bs, and evidence for positive selection of sauropsid gal-1b, but not gal-1a genes. We propose that the duplication and structural divergence of Gal-1B away from Gal-1A led to specialization in both expression and function in the sauropsid lineage.« less

  18. Structural Divergence in Vertebrate Phylogeny of a Duplicated Prototype Galectin

    DOE PAGES

    Bhat, R.; Chakraborty, M.; Mian, I. S.; ...

    2014-09-25

    Prototype galectins, endogenously expressed animal lectins with a single carbohydrate recognition domain, are well-known regulators of tissue properties such as growth and adhesion. The earliest discovered and best studied of the prototype galectins is Galectin-1 (Gal-1). In the Gallus gallus (chicken) genome, Gal-1 is represented by two homologs: Gal-1A and Gal-1B, with distinct biochemical properties, tissue expression, and developmental functions. We investigated the origin of the Gal-1A/Gal-1B divergence to gain insight into when their developmental functions originated and how they could have contributed to vertebrate phenotypic evolution. Sequence alignment and phylogenetic tree construction showed that the Gal-1A/Gal-1B divergence can bemore » traced back to the origin of the sauropsid lineage (consisting of extinct and extant reptiles and birds) although lineage-specific duplications also occurred in the amphibian and actinopterygian genomes. Gene synteny analysis showed that sauropsid gal-1b (the gene for Gal-1B) and its frog and actinopterygian gal-1 homologs share a similar chromosomal location, whereas sauropsid gal-1a has translocated to a new position. Surprisingly, we found that chicken Gal-1A, encoded by the translocated gal-1a, was more similar in its tertiary folding pattern than Gal-1B, encoded by the untranslocated gal-1b, to experimentally determined and predicted folds of nonsauropsid Gal-1s. This inference is consistent with our finding of a lower proportion of conserved residues in sauropsid Gal-1Bs, and evidence for positive selection of sauropsid gal-1b, but not gal-1a genes. We propose that the duplication and structural divergence of Gal-1B away from Gal-1A led to specialization in both expression and function in the sauropsid lineage.« less

  19. Opsins have evolved under the permanent heterozygote model: insights from phylotranscriptomics of Odonata.

    PubMed

    Suvorov, Anton; Jensen, Nicholas O; Sharkey, Camilla R; Fujimoto, M Stanley; Bodily, Paul; Wightman, Haley M Cahill; Ogden, T Heath; Clement, Mark J; Bybee, Seth M

    2017-03-01

    Gene duplication plays a central role in adaptation to novel environments by providing new genetic material for functional divergence and evolution of biological complexity. Several evolutionary models have been proposed for gene duplication to explain how new gene copies are preserved by natural selection, but these models have rarely been tested using empirical data. Opsin proteins, when combined with a chromophore, form a photopigment that is responsible for the absorption of light, the first step in the phototransduction cascade. Adaptive gene duplications have occurred many times within the animal opsins' gene family, leading to novel wavelength sensitivities. Consequently, opsins are an attractive choice for the study of gene duplication evolutionary models. Odonata (dragonflies and damselflies) have the largest opsin repertoire of any insect currently known. Additionally, there is tremendous variation in opsin copy number between species, particularly in the long-wavelength-sensitive (LWS) class. Using comprehensive phylotranscriptomic and statistical approaches, we tested various evolutionary models of gene duplication. Our results suggest that both the blue-sensitive (BS) and LWS opsin classes were subjected to strong positive selection that greatly weakens after multiple duplication events, a pattern that is consistent with the permanent heterozygote model. Due to the immense interspecific variation and duplicability potential of opsin genes among odonates, they represent a unique model system to test hypotheses regarding opsin gene duplication and diversification at the molecular level. © 2016 John Wiley & Sons Ltd.

  20. Adrenal GIPR expression and chromosome 19q13 microduplications in GIP-dependent Cushing's syndrome.

    PubMed

    Lecoq, Anne-Lise; Stratakis, Constantine A; Viengchareun, Say; Chaligné, Ronan; Tosca, Lucie; Deméocq, Vianney; Hage, Mirella; Berthon, Annabel; Faucz, Fabio R; Hanna, Patrick; Boyer, Hadrien-Gaël; Servant, Nicolas; Salenave, Sylvie; Tachdjian, Gérard; Adam, Clovis; Benhamo, Vanessa; Clauser, Eric; Guiochon-Mantel, Anne; Young, Jacques; Lombès, Marc; Bourdeau, Isabelle; Maiter, Dominique; Tabarin, Antoine; Bertherat, Jérôme; Lefebvre, Hervé; de Herder, Wouter; Louiset, Estelle; Lacroix, André; Chanson, Philippe; Bouligand, Jérôme; Kamenický, Peter

    2017-09-21

    GIP-dependent Cushing's syndrome is caused by ectopic expression of glucose-dependent insulinotropic polypeptide receptor (GIPR) in cortisol-producing adrenal adenomas or in bilateral macronodular adrenal hyperplasias. Molecular mechanisms leading to ectopic GIPR expression in adrenal tissue are not known. Here we performed molecular analyses on adrenocortical adenomas and bilateral macronodular adrenal hyperplasias obtained from 14 patients with GIP-dependent adrenal Cushing's syndrome and one patient with GIP-dependent aldosteronism. GIPR expression in all adenoma and hyperplasia samples occurred through transcriptional activation of a single allele of the GIPR gene. While no abnormality was detected in proximal GIPR promoter methylation, we identified somatic duplications in chromosome region 19q13.32 containing the GIPR locus in the adrenocortical lesions derived from 3 patients. In 2 adenoma samples, the duplicated 19q13.32 region was rearranged with other chromosome regions, whereas a single tissue sample with hyperplasia had a 19q duplication only. We demonstrated that juxtaposition with cis-acting regulatory sequences such as glucocorticoid response elements in the newly identified genomic environment drives abnormal expression of the translocated GIPR allele in adenoma cells. Altogether, our results provide insight into the molecular pathogenesis of GIP-dependent Cushing's syndrome, occurring through monoallelic transcriptional activation of GIPR driven in some adrenal lesions by structural variations.

  1. Identification, characterization, and expression analysis of calmodulin and calmodulin-like genes in grapevine (Vitis vinifera) reveal likely roles in stress responses.

    PubMed

    Vandelle, Elodie; Vannozzi, Alessandro; Wong, Darren; Danzi, Davide; Digby, Anne-Marie; Dal Santo, Silvia; Astegno, Alessandra

    2018-06-04

    Calcium (Ca 2+ ) is an ubiquitous key second messenger in plants, where it modulates many developmental and adaptive processes in response to various stimuli. Several proteins containing Ca 2+ binding domain have been identified in plants, including calmodulin (CaM) and calmodulin-like (CML) proteins, which play critical roles in translating Ca 2+ signals into proper cellular responses. In this work, a genome-wide analysis conducted in Vitis vinifera identified three CaM- and 62 CML-encoding genes. We assigned gene family nomenclature, analyzed gene structure, chromosomal location and gene duplication, as well as protein motif organization. The phylogenetic clustering revealed a total of eight subgroups, including one unique clade of VviCaMs distinct from VviCMLs. VviCaMs were found to contain four EF-hand motifs whereas VviCML proteins have one to five. Most of grapevine CML genes were intronless, while VviCaMs were intron rich. All the genes were well spread among the 19 grapevine chromosomes and displayed a high level of duplication. The expression profiling of VviCaM/VviCML genes revealed a broad expression pattern across all grape organs and tissues at various developmental stages, and a significant modulation in biotic stress-related responses. Our results highlight the complexity of CaM/CML protein family also in grapevine, supporting the versatile role of its different members in modulating cellular responses to various stimuli, in particular to biotic stresses. This work lays the foundation for further functional and structural studies on specific grapevine CaMs/CMLs in order to better understand the role of Ca 2+ -binding proteins in grapevine and to explore their potential for further biotechnological applications. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  2. Rice Phospholipase A Superfamily: Organization, Phylogenetic and Expression Analysis during Abiotic Stresses and Development

    PubMed Central

    Singh, Amarjeet; Baranwal, Vinay; Shankar, Alka; Kanwar, Poonam; Ranjan, Rajeev; Yadav, Sandeep; Pandey, Amita; Kapoor, Sanjay; Pandey, Girdhar K.

    2012-01-01

    Background Phospholipase A (PLA) is an important group of enzymes responsible for phospholipid hydrolysis in lipid signaling. PLAs have been implicated in abiotic stress signaling and developmental events in various plants species. Genome-wide analysis of PLA superfamily has been carried out in dicot plant Arabidopsis. A comprehensive genome-wide analysis of PLAs has not been presented yet in crop plant rice. Methodology/Principal Findings A comprehensive bioinformatics analysis identified a total of 31 PLA encoding genes in the rice genome, which are divided into three classes; phospholipase A1 (PLA1), patatin like phospholipases (pPLA) and low molecular weight secretory phospholipase A2 (sPLA2) based on their sequences and phylogeny. A subset of 10 rice PLAs exhibited chromosomal duplication, emphasizing the role of duplication in the expansion of this gene family in rice. Microarray expression profiling revealed a number of PLA members expressing differentially and significantly under abiotic stresses and reproductive development. Comparative expression analysis with Arabidopsis PLAs revealed a high degree of functional conservation between the orthologs in two plant species, which also indicated the vital role of PLAs in stress signaling and plant development across different plant species. Moreover, sub-cellular localization of a few candidates suggests their differential localization and functional role in the lipid signaling. Conclusion/Significance The comprehensive analysis and expression profiling would provide a critical platform for the functional characterization of the candidate PLA genes in crop plants. PMID:22363522

  3. Duplication of the EFNB1 Gene in Familial Hypertelorism: Imbalance in Ephrin-B1 Expression and Abnormal Phenotypes in Humans and Mice

    PubMed Central

    Babbs, Christian; Stewart, Helen S; Williams, Louise J; Connell, Lyndsey; Goriely, Anne; Twigg, Stephen RF; Smith, Kim; Lester, Tracy; Wilkie, Andrew OM

    2011-01-01

    Familial hypertelorism, characterized by widely spaced eyes, classically shows autosomal dominant inheritance (Teebi type), but some pedigrees are compatible with X-linkage. No mechanism has been described previously, but clinical similarity has been noted to craniofrontonasal syndrome (CFNS), which is caused by mutations in the X-linked EFNB1 gene. Here we report a family in which females in three generations presented with hypertelorism, but lacked either craniosynostosis or a grooved nasal tip, excluding CFNS. DNA sequencing of EFNB1 was normal, but further analysis revealed a duplication of 937 kb including EFNB1 and two flanking genes: PJA1 and STARD8. We found that the X chromosome bearing the duplication produces ∼1.6-fold more EFNB1 transcript than the normal X chromosome and propose that, in the context of X-inactivation, this difference in expression level of EFNB1 results in abnormal cell sorting leading to hypertelorism. To support this hypothesis, we provide evidence from a mouse model carrying a targeted human EFNB1 cDNA, that abnormal cell sorting occurs in the cranial region. Hence, we propose that X-linked cases resembling Teebi hypertelorism may have a similar mechanism to CFNS, and that cellular mosaicism for different levels of ephrin-B1 (as well as simple presence/absence) leads to craniofacial abnormalities. Hum Mutat 32:1–9, 2011. © 2011 Wiley-Liss, Inc. PMID:21542058

  4. Genome tailoring powered production of isobutanol in continuous CO2/H2 blend fermentation using engineered acetogen biocatalyst.

    PubMed

    Gak, Eugene; Tyurin, Michael; Kiriukhin, Michael

    2014-05-01

    The cell energy fraction that powered maintenance and expression of genes encoding pro-phage elements, pta-ack cluster, early sporulation, sugar ABC transporter periplasmic proteins, 6-phosphofructokinase, pyruvate kinase, and fructose-1,6-disphosphatase in acetogen Clostridium sp. MT871 was re-directed to power synthetic operon encoding isobutanol biosynthesis at the expense of these genes achieved via their elimination. Genome tailoring decreased cell duplication time by 7.0 ± 0.1 min (p < 0.05) compared to the parental strain, with intact genome and cell duplication time of 68 ± 1 min (p < 0.05). Clostridium sp. MT871 with tailored genome was UVC-mutated to withstand 6.1 % isobutanol in fermentation broth to prevent product inhibition in an engineered commercial biocatalyst producing 5 % (674.5 mM) isobutanol during two-step continuous fermentation of CO2/H2 gas blend. Biocatalyst Clostridium sp. MT871RG- 11IBR6 was engineered to express six copies of synthetic operon comprising optimized synthetic format dehydrogenase, pyruvate formate lyase, acetolactate synthase, acetohydroxyacid reductoisomerase, 2,3-dihydroxy-isovalerate dehydratase, branched-chain alpha-ketoacid decarboxylase gene, aldehyde dehydrogenase, and alcohol dehydrogenase, regaining cell duplication time of 68 ± 1 min (p < 0.05) for the parental strain. This is the first report on isobutanol production by an engineered acetogen biocatalyst suitable for commercial manufacturing of this chemical/fuel using continuous fermentation of CO2/H2 blend thus contributing to the reversal of global warming.

  5. Genome-Wide Analysis of C2H2 Zinc-Finger Family Transcription Factors and Their Responses to Abiotic Stresses in Poplar (Populus trichocarpa)

    PubMed Central

    Liu, Quangang; Wang, Zhanchao; Xu, Xuemei; Zhang, Haizhen; Li, Chenghao

    2015-01-01

    Background C2H2 zinc-finger (C2H2-ZF) proteins are a large gene family in plants that participate in various aspects of normal plant growth and development, as well as in biotic and abiotic stress responses. To date, no overall analysis incorporating evolutionary history and expression profiling of the C2H2-ZF gene family in model tree species poplar (Populus trichocarpa) has been reported. Principal Findings Here, we identified 109 full-length C2H2-ZF genes in P. trichocarpa, and classified them into four groups, based on phylogenetic analysis. The 109 C2H2-ZF genes were distributed unequally on 19 P. trichocarpa linkage groups (LGs), with 39 segmental duplication events, indicating that segmental duplication has been important in the expansion of the C2H2-ZF gene family. Promoter cis-element analysis indicated that most of the C2H2-ZF genes contain phytohormone or abiotic stress-related cis-elements. The expression patterns of C2H2-ZF genes, based on heatmap analysis, suggested that C2H2-ZF genes are involved in tissue and organ development, especially root and floral development. Expression analysis based on quantitative real-time reverse transcription polymerase chain reaction indicated that C2H2-ZF genes are significantly involved in drought, heat and salt response, possibly via different mechanisms. Conclusions This study provides a thorough overview of the P. trichocarpa C2H2-ZF gene family and presents a new perspective on the evolution of this gene family. In particular, some C2H2-ZF genes may be involved in environmental stress tolerance regulation. PtrZFP2, 19 and 95 showed high expression levels in leaves and/or roots under environmental stresses. Additionally, this study provided a solid foundation for studying the biological roles of C2H2-ZF genes in Populus growth and development. These results form the basis for further investigation of the roles of these candidate genes and for future genetic engineering and gene functional studies in Populus. PMID:26237514

  6. Genome-wide analysis of autophagy-associated genes in foxtail millet (Setaria italica L.) and characterization of the function of SiATG8a in conferring tolerance to nitrogen starvation in rice.

    PubMed

    Li, Weiwei; Chen, Ming; Wang, Erhui; Hu, Liqin; Hawkesford, Malcolm J; Zhong, Li; Chen, Zhu; Xu, Zhaoshi; Li, Liancheng; Zhou, Yongbin; Guo, Changhong; Ma, Youzhi

    2016-10-12

    Autophagy is a cellular degradation process that is highly evolutionarily-conserved in yeast, plants, and animals. In plants, autophagy plays important roles in regulating intracellular degradation and recycling of amino acids in response to nutrient starvation, senescence, and other environmental stresses. Foxtail millet (Setaria italica) has strong resistance to stresses and has been proposed as an ideal material for use in the study of the physiological mechanisms of abiotic stress tolerance in plants. Although the genome sequence of foxtail millet (Setaria italica) is available, the characteristics and functions of abiotic stress-related genes remain largely unknown for this species. A total of 37 putative ATG (autophagy-associated genes) genes in the foxtail millet genome were identified. Gene duplication analysis revealed that both segmental and tandem duplication events have played significant roles in the expansion of the ATG gene family in foxtail millet. Comparative synteny mapping between the genomes of foxtail millet and rice suggested that the ATG genes in both species have common ancestors, as their ATG genes were primarily located in similar syntenic regions. Gene expression analysis revealed the induced expression of 31 SiATG genes by one or more phytohormone treatments, 26 SiATG genes by drought, salt and cold, 24 SiATG genes by darkness and 25 SiATG genes by nitrogen starvation. Results of qRT-PCR showing that among 37 SiATG genes, the expression level of SiATG8a was the highest after nitrogen starvation treatment 24 h, suggesting its potential role in tolerance to nutrient starvation. Moreover, the heterologous expression of SiATG8a in rice improved nitrogen starvation tolerance. Compared to wild type rice, the transgenic rice performed better and had higher aboveground total nitrogen content when the plants were grown under nitrogen starvation conditions. Our results deepen understanding about the characteristics and functions of ATG genes in foxtail millet and also identify promising new genetic resources that should be of use in future efforts to develop varieties of foxtail millet and other crop species that have resistance to nitrogen deficiency stress.

  7. Restriction of Equine Infectious Anemia Virus by Equine APOBEC3 Cytidine Deaminases ▿ †

    PubMed Central

    Zielonka, Jörg; Bravo, Ignacio G.; Marino, Daniela; Conrad, Elea; Perković, Mario; Battenberg, Marion; Cichutek, Klaus; Münk, Carsten

    2009-01-01

    The mammalian APOBEC3 (A3) proteins comprise a multigene family of cytidine deaminases that act as potent inhibitors of retroviruses and retrotransposons. The A3 locus on the chromosome 28 of the horse genome contains multiple A3 genes: two copies of A3Z1, five copies of A3Z2, and a single copy of A3Z3, indicating a complex evolution of multiple gene duplications. We have cloned and analyzed for expression the different equine A3 genes and examined as well the subcellular distribution of the corresponding proteins. Additionally, we have tested the functional antiretroviral activity of the equine and of several of the human and nonprimate A3 proteins against the Equine infectious anemia virus (EIAV), the Simian immunodeficiency virus (SIV), and the Adeno-associated virus type 2 (AAV-2). Hematopoietic cells of horses express at least five different A3s: A3Z1b, A3Z2a-Z2b, A3Z2c-Z2d, A3Z2e, and A3Z3, whereas circulating macrophages, the natural target of EIAV, express only part of the A3 repertoire. The five A3Z2 tandem copies arose after three consecutive, recent duplication events in the horse lineage, after the split between Equidae and Carnivora. The duplicated genes show different antiviral activities against different viruses: equine A3Z3 and A3Z2c-Z2d are potent inhibitors of EIAV while equine A3Z1b, A3Z2a-Z2b, A3Z2e showed only weak anti-EIAV activity. Equine A3Z1b and A3Z3 restricted AAV and all equine A3s, except A3Z1b, inhibited SIV. We hypothesize that the horse A3 genes are undergoing a process of subfunctionalization in their respective viral specificities, which might provide the evolutionary advantage for keeping five copies of the original gene. PMID:19458006

  8. Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

    PubMed

    Hong, Jungeui; Gresham, David

    2017-11-01

    Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.

  9. Evolutionary diversification of galactinol synthases in Rosaceae: adaptive roles of galactinol and raffinose during apple bud dormancy.

    PubMed

    Falavigna, Vítor da Silveira; Porto, Diogo Denardi; Miotto, Yohanna Evelyn; Santos, Henrique Pessoa Dos; Oliveira, Paulo Ricardo Dias de; Margis-Pinheiro, Márcia; Pasquali, Giancarlo; Revers, Luís Fernando

    2018-01-24

    Galactinol synthase (GolS) is a key enzyme in the biosynthetic pathway of raffinose family oligosaccharides (RFOs), which play roles in carbon storage, signal transduction, and osmoprotection. The present work assessed the evolutionary history of GolS genes across the Rosaceae using several bioinformatic tools. Apple (Malus × domestica) GolS genes were transcriptionally characterized during bud dormancy, in parallel with galactinol and raffinose measurements. Additionally, MdGolS2, a candidate to regulate seasonal galactinol and RFO content during apple bud dormancy, was functionally characterized in Arabidopsis. Evolutionary analyses revealed that whole genome duplications have driven GolS gene evolution and diversification in Rosaceae speciation. The strong purifying selection identified in duplicated GolS genes suggests that differential gene expression might define gene function better than protein structure. Interestingly, MdGolS2 was differentially expressed during bud dormancy, concomitantly with the highest galactinol and raffinose levels. One of the intrinsic adaptive features of bud dormancy is limited availability of free water; therefore, we generated transgenic Arabidopsis plants expressing MdGolS2. They showed higher galactinol and raffinose contents and increased tolerance to water deficit. Our results suggest that MdGolS2 is the major GolS responsible for RFO accumulation during apple dormancy, and these carbohydrates help to protect dormant buds against limited water supply. © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  10. Identification and Molecular Characterization of MYB Transcription Factor Superfamily in C4 Model Plant Foxtail Millet (Setaria italica L.)

    PubMed Central

    Muthamilarasan, Mehanathan; Khandelwal, Rohit; Yadav, Chandra Bhan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj

    2014-01-01

    MYB proteins represent one of the largest transcription factor families in plants, playing important roles in diverse developmental and stress-responsive processes. Considering its significance, several genome-wide analyses have been conducted in almost all land plants except foxtail millet. Foxtail millet (Setaria italica L.) is a model crop for investigating systems biology of millets and bioenergy grasses. Further, the crop is also known for its potential abiotic stress-tolerance. In this context, a comprehensive genome-wide survey was conducted and 209 MYB protein-encoding genes were identified in foxtail millet. All 209 S. italica MYB (SiMYB) genes were physically mapped onto nine chromosomes of foxtail millet. Gene duplication study showed that segmental- and tandem-duplication have occurred in genome resulting in expansion of this gene family. The protein domain investigation classified SiMYB proteins into three classes according to number of MYB repeats present. The phylogenetic analysis categorized SiMYBs into ten groups (I - X). SiMYB-based comparative mapping revealed a maximum orthology between foxtail millet and sorghum, followed by maize, rice and Brachypodium. Heat map analysis showed tissue-specific expression pattern of predominant SiMYB genes. Expression profiling of candidate MYB genes against abiotic stresses and hormone treatments using qRT-PCR revealed specific and/or overlapping expression patterns of SiMYBs. Taken together, the present study provides a foundation for evolutionary and functional characterization of MYB TFs in foxtail millet to dissect their functions in response to environmental stimuli. PMID:25279462

  11. Identification of a pair of phospholipid:diacylglycerol acyltransferases from developing flax (Linum usitatissimum L.) seed catalyzing the selective production of trilinolenin.

    PubMed

    Pan, Xue; Siloto, Rodrigo M P; Wickramarathna, Aruna D; Mietkiewska, Elzbieta; Weselake, Randall J

    2013-08-16

    The oil from flax (Linum usitatissimum L.) has high amounts of α-linolenic acid (ALA; 18:3(cis)(Δ9,12,15)) and is one of the richest sources of omega-3 polyunsaturated fatty acids (ω-3-PUFAs). To produce ∼57% ALA in triacylglycerol (TAG), it is likely that flax contains enzymes that can efficiently transfer ALA to TAG. To test this hypothesis, we conducted a systematic characterization of TAG-synthesizing enzymes from flax. We identified several genes encoding acyl-CoA:diacylglycerol acyltransferases (DGATs) and phospholipid:diacylglycerol acyltransferases (PDATs) from the flax genome database. Due to recent genome duplication, duplicated gene pairs have been identified for all genes except DGAT2-2. Analysis of gene expression indicated that two DGAT1, two DGAT2, and four PDAT genes were preferentially expressed in flax embryos. Yeast functional analysis showed that DGAT1, DGAT2, and two PDAT enzymes restored TAG synthesis when produced recombinantly in yeast H1246 strain. The activity of particular PDAT enzymes (LuPDAT1 and LuPDAT2) was stimulated by the presence of ALA. Further seed-specific expression of flax genes in Arabidopsis thaliana indicated that DGAT1, PDAT1, and PDAT2 had significant effects on seed oil phenotype. Overall, this study indicated the existence of unique PDAT enzymes from flax that are able to preferentially catalyze the synthesis of TAG containing ALA acyl moieties. The identified LuPDATs may have practical applications for increasing the accumulation of ALA and other polyunsaturated fatty acids in oilseeds for food and industrial applications.

  12. Insights into the Ecology and Evolution of Polyploid Plants through Network Analysis.

    PubMed

    Gallagher, Joseph P; Grover, Corrinne E; Hu, Guanjing; Wendel, Jonathan F

    2016-06-01

    Polyploidy is a widespread phenomenon throughout eukaryotes, with important ecological and evolutionary consequences. Although genes operate as components of complex pathways and networks, polyploid changes in genes and gene expression have typically been evaluated as either individual genes or as a part of broad-scale analyses. Network analysis has been fruitful in associating genomic and other 'omic'-based changes with phenotype for many systems. In polyploid species, network analysis has the potential not only to facilitate a better understanding of the complex 'omic' underpinnings of phenotypic and ecological traits common to polyploidy, but also to provide novel insight into the interaction among duplicated genes and genomes. This adds perspective to the global patterns of expression (and other 'omic') change that accompany polyploidy and to the patterns of recruitment and/or loss of genes following polyploidization. While network analysis in polyploid species faces challenges common to other analyses of duplicated genomes, present technologies combined with thoughtful experimental design provide a powerful system to explore polyploid evolution. Here, we demonstrate the utility and potential of network analysis to questions pertaining to polyploidy with an example involving evolution of the transgressively superior cotton fibres found in polyploid Gossypium hirsutum. By combining network analysis with prior knowledge, we provide further insights into the role of profilins in fibre domestication and exemplify the potential for network analysis in polyploid species. © 2016 John Wiley & Sons Ltd.

  13. The nuclear OXPHOS genes in insecta: a common evolutionary origin, a common cis-regulatory motif, a common destiny for gene duplicates

    PubMed Central

    Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado

    2007-01-01

    Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839

  14. Conservation of the abscission signaling peptide IDA during Angiosperm evolution: withstanding genome duplications and gain and loss of the receptors HAE/HSL2

    PubMed Central

    Stø, Ida M.; Orr, Russell J. S.; Fooyontphanich, Kim; Jin, Xu; Knutsen, Jonfinn M. B.; Fischer, Urs; Tranbarger, Timothy J.; Nordal, Inger; Aalen, Reidunn B.

    2015-01-01

    The peptide INFLORESCENCE DEFICIENT IN ABSCISSION (IDA), which signals through the leucine-rich repeat receptor-like kinases HAESA (HAE) and HAESA-LIKE2 (HSL2), controls different cell separation events in Arabidopsis thaliana. We hypothesize the involvement of this signaling module in abscission processes in other plant species even though they may shed other organs than A. thaliana. As the first step toward testing this hypothesis from an evolutionarily perspective we have identified genes encoding putative orthologs of IDA and its receptors by BLAST searches of publically available protein, nucleotide and genome databases for angiosperms. Genes encoding IDA or IDA-LIKE (IDL) peptides and HSL proteins were found in all investigated species, which were selected as to represent each angiosperm order with available genomic sequences. The 12 amino acids representing the bioactive peptide in A. thaliana have virtually been unchanged throughout the evolution of the angiosperms; however, the number of IDL and HSL genes varies between different orders and species. The phylogenetic analyses suggest that IDA, HSL2, and the related HSL1 gene, were present in the species that gave rise to the angiosperms. HAE has arisen from HSL1 after a genome duplication that took place after the monocot—eudicots split. HSL1 has also independently been duplicated in the monocots, while HSL2 has been lost in gingers (Zingiberales) and grasses (Poales). IDA has been duplicated in eudicots to give rise to functionally divergent IDL peptides. We postulate that the high number of IDL homologs present in the core eudicots is a result of multiple whole genome duplications (WGD). We substantiate the involvement of IDA and HAE/HSL2 homologs in abscission by providing gene expression data of different organ separation events from various species. PMID:26579174

  15. Parental Origin of Interstitial Duplications at 15q11.2-q13.3 in Schizophrenia and Neurodevelopmental Disorders

    PubMed Central

    Isles, Anthony R.; Ingason, Andrés; Lowther, Chelsea; Gawlick, Micha; Stöber, Gerald; Potter, Harry; Georgieva, Lyudmila; Pizzo, Lucilla; Ozaki, Norio; Kushima, Itaru; Ikeda, Masashi; Iwata, Nakao; Levinson, Douglas F.; Gejman, Pablo V.; Shi, Jianxin; Sanders, Alan R.; Duan, Jubao; Sisodiya, Sanjay; Costain, Gregory; Degenhardt, Franziska; Giegling, Ina; Rujescu, Dan; Hreidarsson, Stefan J.; Saemundsen, Evald; Ahn, Joo Wook; Ogilvie, Caroline; Stefansson, Hreinn; Stefansson, Kari; O’Donovan, Michael C.; Owen, Michael J.; Bassett, Anne; Kirov, George

    2016-01-01

    Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally expressed imprinted genes in the contribution of Copy Number Variants (CNVs) at this interval to the incidence of psychotic illness. This work will have tangible benefits for patients with 15q11.2-q13.3 duplications by aiding genetic counseling. PMID:27153221

  16. Parental Origin of Interstitial Duplications at 15q11.2-q13.3 in Schizophrenia and Neurodevelopmental Disorders.

    PubMed

    Isles, Anthony R; Ingason, Andrés; Lowther, Chelsea; Walters, James; Gawlick, Micha; Stöber, Gerald; Rees, Elliott; Martin, Joanna; Little, Rosie B; Potter, Harry; Georgieva, Lyudmila; Pizzo, Lucilla; Ozaki, Norio; Aleksic, Branko; Kushima, Itaru; Ikeda, Masashi; Iwata, Nakao; Levinson, Douglas F; Gejman, Pablo V; Shi, Jianxin; Sanders, Alan R; Duan, Jubao; Willis, Joseph; Sisodiya, Sanjay; Costain, Gregory; Werge, Thomas M; Degenhardt, Franziska; Giegling, Ina; Rujescu, Dan; Hreidarsson, Stefan J; Saemundsen, Evald; Ahn, Joo Wook; Ogilvie, Caroline; Girirajan, Santhosh D; Stefansson, Hreinn; Stefansson, Kari; O'Donovan, Michael C; Owen, Michael J; Bassett, Anne; Kirov, George

    2016-05-01

    Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally expressed imprinted genes in the contribution of Copy Number Variants (CNVs) at this interval to the incidence of psychotic illness. This work will have tangible benefits for patients with 15q11.2-q13.3 duplications by aiding genetic counseling.

  17. An Autosomal Gene That Affects X Chromosome Expression and Sex Determination in CAENORHABDITIS ELEGANS

    PubMed Central

    Meneely, Philip M.; Wood, William B.

    1984-01-01

    Recessive mutant alleles at the autosomal dpy-21 locus of C. elegans cause a dumpy phenotype in XX animals but not in XO animals. This dumpy phenotype is characteristic of X chromosome aneuploids with higher than normal X to autosome ratios and is proposed to result from overexpression of X-linked genes. We have isolated a new dpy-21 allele that also causes partial hermaphroditization of XO males, without causing the dumpy phenotype. All dpy-21 alleles show hermaphroditization effects in XO males that carry a duplication of part of the X chromosome and also partially suppress a transformer (tra-1) mutation that converts XX animals into males. Experiments with a set of X chromosome duplications show that the defects of dpy-21 mutants can result from interaction with several different regions of the X chromosome. We propose that dpy-21 regulates X chromosome expression and may be involved in interpreting X chromosome dose for the developmental decisions of both sex determination and dosage compensation. PMID:6537930

  18. Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast

    PubMed Central

    Jeffares, Daniel C.; Jolly, Clemency; Hoti, Mimoza; Speed, Doug; Shaw, Liam; Rallis, Charalampos; Balloux, Francois; Dessimoz, Christophe; Bähler, Jürg; Sedlazeck, Fritz J.

    2017-01-01

    Large structural variations (SVs) within genomes are more challenging to identify than smaller genetic variants but may substantially contribute to phenotypic diversity and evolution. We analyse the effects of SVs on gene expression, quantitative traits and intrinsic reproductive isolation in the yeast Schizosaccharomyces pombe. We establish a high-quality curated catalogue of SVs in the genomes of a worldwide library of S. pombe strains, including duplications, deletions, inversions and translocations. We show that copy number variants (CNVs) show a variety of genetic signals consistent with rapid turnover. These transient CNVs produce stoichiometric effects on gene expression both within and outside the duplicated regions. CNVs make substantial contributions to quantitative traits, most notably intracellular amino acid concentrations, growth under stress and sugar utilization in winemaking, whereas rearrangements are strongly associated with reproductive isolation. Collectively, these findings have broad implications for evolution and for our understanding of quantitative traits including complex human diseases. PMID:28117401

  19. Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate

    PubMed Central

    Dehal, Paramvir; Boore, Jeffrey L

    2005-01-01

    The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish–tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage. PMID:16128622

  20. Comparative Analysis of Syntenic Genes in Grass Genomes Reveals Accelerated Rates of Gene Structure and Coding Sequence Evolution in Polyploid Wheat1[W][OA

    PubMed Central

    Akhunov, Eduard D.; Sehgal, Sunish; Liang, Hanquan; Wang, Shichen; Akhunova, Alina R.; Kaur, Gaganpreet; Li, Wanlong; Forrest, Kerrie L.; See, Deven; Šimková, Hana; Ma, Yaqin; Hayden, Matthew J.; Luo, Mingcheng; Faris, Justin D.; Doležel, Jaroslav; Gill, Bikram S.

    2013-01-01

    Cycles of whole-genome duplication (WGD) and diploidization are hallmarks of eukaryotic genome evolution and speciation. Polyploid wheat (Triticum aestivum) has had a massive increase in genome size largely due to recent WGDs. How these processes may impact the dynamics of gene evolution was studied by comparing the patterns of gene structure changes, alternative splicing (AS), and codon substitution rates among wheat and model grass genomes. In orthologous gene sets, significantly more acquired and lost exonic sequences were detected in wheat than in model grasses. In wheat, 35% of these gene structure rearrangements resulted in frame-shift mutations and premature termination codons. An increased codon mutation rate in the wheat lineage compared with Brachypodium distachyon was found for 17% of orthologs. The discovery of premature termination codons in 38% of expressed genes was consistent with ongoing pseudogenization of the wheat genome. The rates of AS within the individual wheat subgenomes (21%–25%) were similar to diploid plants. However, we uncovered a high level of AS pattern divergence between the duplicated homeologous copies of genes. Our results are consistent with the accelerated accumulation of AS isoforms, nonsynonymous mutations, and gene structure rearrangements in the wheat lineage, likely due to genetic redundancy created by WGDs. Whereas these processes mostly contribute to the degeneration of a duplicated genome and its diploidization, they have the potential to facilitate the origin of new functional variations, which, upon selection in the evolutionary lineage, may play an important role in the origin of novel traits. PMID:23124323

  1. Evolution of the shut-off steps of vertebrate phototransduction

    PubMed Central

    Patel, Hardip R.; Chuah, Aaron

    2018-01-01

    Different isoforms of the genes involved in phototransduction are expressed in vertebrate rod and cone photoreceptors, providing a unique example of parallel evolution via gene duplication. In this study, we determine the molecular phylogeny of the proteins underlying the shut-off steps of phototransduction in the agnathan and jawed vertebrate lineages. For the G-protein receptor kinases (GRKs), the GRK1 and GRK7 divisions arose prior to the divergence of tunicates, with further expansion during the two rounds of whole-genome duplication (2R); subsequently, jawed and agnathan vertebrates retained different subsets of three isoforms of GRK. For the arrestins, gene expansion occurred during 2R. Importantly, both for GRKs and arrestins, the respective rod isoforms did not emerge until the second round of 2R, just prior to the separation of jawed and agnathan vertebrates. For the triplet of proteins mediating shut-off of the G-protein transducin, RGS9 diverged from RGS11, probably at the second round of 2R, whereas Gβ5 and R9AP appear not to have undergone 2R expansion. Overall, our analysis provides a description of the duplications and losses of phototransduction shut-off genes that occurred during the transition from a chordate with only cone-like photoreceptors to an ancestral vertebrate with both cone- and rod-like photoreceptors. PMID:29321241

  2. Polymorphism, selection and tandem duplication of transferrin genes in Atlantic cod (Gadus morhua) - Conserved synteny between fish monolobal and tetrapod bilobal transferrin loci

    PubMed Central

    2011-01-01

    Background The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. Results The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. Conclusions The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in trans-Atlantic cod populations. PMID:21612617

  3. Polymorphism, selection and tandem duplication of transferrin genes in Atlantic cod (Gadus morhua)--conserved synteny between fish monolobal and tetrapod bilobal transferrin loci.

    PubMed

    Andersen, Øivind; De Rosa, Maria Cristina; Pirolli, Davide; Tooming-Klunderud, Ave; Petersen, Petra E; André, Carl

    2011-05-25

    The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in trans-Atlantic cod populations.

  4. Duplication of SOX9 is not a common cause of 46,XX testicular or 46,XX ovotesticular DSD.

    PubMed

    Seeherunvong, Tossaporn; Ukarapong, Supamit; McElreavey, Kenneth; Berkovitz, Gary D; Perera, Erasmo M

    2012-01-01

    Translocation of the SRY gene to the paternal X chromosome is the explanation for testis development in the majority of subjects with 46,XX testicular disorder of sexual development (DSD). However, nearly all subjects with 46,XX ovotesticular DSD and up to one third of subjects with 46,XX testicular DSD lack SRY. SRY-independent expression of SOX9 has been implicated in the etiology of testis development in some individuals. We amplified microsatellite markers in the region of SOX9 from a cohort of 30 subjects with either 46,XX testicular or 46,XX ovotesticular DSD to detect SOX9 duplications. Duplication of the SOX9 region in 17q was not detected in any subject. Duplication in the region of 17q that contains SOX9 is not a common cause of testis development in subjects with SRY-negative 46,XX testicular or ovotesticular DSD.

  5. A computational method for estimating the PCR duplication rate in DNA and RNA-seq experiments.

    PubMed

    Bansal, Vikas

    2017-03-14

    PCR amplification is an important step in the preparation of DNA sequencing libraries prior to high-throughput sequencing. PCR amplification introduces redundant reads in the sequence data and estimating the PCR duplication rate is important to assess the frequency of such reads. Existing computational methods do not distinguish PCR duplicates from "natural" read duplicates that represent independent DNA fragments and therefore, over-estimate the PCR duplication rate for DNA-seq and RNA-seq experiments. In this paper, we present a computational method to estimate the average PCR duplication rate of high-throughput sequence datasets that accounts for natural read duplicates by leveraging heterozygous variants in an individual genome. Analysis of simulated data and exome sequence data from the 1000 Genomes project demonstrated that our method can accurately estimate the PCR duplication rate on paired-end as well as single-end read datasets which contain a high proportion of natural read duplicates. Further, analysis of exome datasets prepared using the Nextera library preparation method indicated that 45-50% of read duplicates correspond to natural read duplicates likely due to fragmentation bias. Finally, analysis of RNA-seq datasets from individuals in the 1000 Genomes project demonstrated that 70-95% of read duplicates observed in such datasets correspond to natural duplicates sampled from genes with high expression and identified outlier samples with a 2-fold greater PCR duplication rate than other samples. The method described here is a useful tool for estimating the PCR duplication rate of high-throughput sequence datasets and for assessing the fraction of read duplicates that correspond to natural read duplicates. An implementation of the method is available at https://github.com/vibansal/PCRduplicates .

  6. Partial redundancy and functional specialization of E-class SEPALLATA genes in an early-diverging eudicot.

    PubMed

    Soza, Valerie L; Snelson, Corey D; Hewett Hazelton, Kristen D; Di Stilio, Verónica S

    2016-11-01

    Plant MADS-box genes have duplicated extensively, allegedly contributing to the immense diversity of floral form in angiosperms. In Arabidopsis thaliana (a core eudicot model plant), four SEPALLATA (SEP) genes comprise the E-class from the extended ABCE model of flower development. They are redundantly involved in the development of the four types of floral organs (sepals, petals, stamens and carpels) and in floral meristem determinacy. E-class genes have been examined in other core eudicots and monocots, but have been less investigated in non-core eudicots. Our goal was to functionally characterize the E-class genes in the early-diverging eudicot Thalictrum thalictroides (Ranunculaceae), whose flowers are apetalous. We identified four SEP orthologs, which when placed in a phylogenetic context, resulted from a major gene duplication event before the origin of angiosperms and a subsequent duplication at the origin of the Ranunculales. We used Virus-Induced Gene Silencing (VIGS) to down-regulate the three expressed paralogs individually and in combination to investigate their function and to determine the degree of conservation versus divergence of this important plant transcription factor. All loci were partially redundant in sepal and stamen identity and in promoting petaloidy of sepals, yet the SEP3 ortholog had a more pronounced role in carpel identity and development. The two other paralogs appear to have subfunctionalized in their cadastral roles to keep the boundaries between either sepal and stamen zones or stamen and carpel zones. Double knockdowns had enhanced phenotypes and the triple knockdown had an even more severe phenotype that included partial to complete homeotic conversion of stamens and carpels to sepaloid organs and green sepals, highlighting a role of E-class genes in petaloidy of sepals in this species. While no floral meristem determinacy defects were observed, this could be due to residual amounts of gene expression in the VIGS experiments being sufficient to perform this function or to the masking role of a redundant gene. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. The gene space in wheat: the complete γ-gliadin gene family from the wheat cultivar Chinese Spring.

    PubMed

    Anderson, Olin D; Huo, Naxin; Gu, Yong Q

    2013-06-01

    The complete set of unique γ-gliadin genes is described for the wheat cultivar Chinese Spring using a combination of expressed sequence tag (EST) and Roche 454 DNA sequences. Assemblies of Chinese Spring ESTs yielded 11 different γ-gliadin gene sequences. Two of the sequences encode identical polypeptides and are assumed to be the result of a recent gene duplication. One gene has a 3' coding mutation that changes the reading frame in the final eight codons. A second assembly of Chinese Spring γ-gliadin sequences was generated using Roche 454 total genomic DNA sequences. The 454 assembly confirmed the same 11 active genes as the EST assembly plus two pseudogenes not represented by ESTs. These 13 γ-gliadin sequences represent the complete unique set of γ-gliadin genes for cv Chinese Spring, although not ruled out are additional genes that are exact duplications of these 13 genes. A comparison with the ESTs of two other hexaploid cultivars (Butte 86 and Recital) finds that the most active genes are present in all three cultivars, with exceptions likely due to too few ESTs for detection in Butte 86 and Recital. A comparison of the numbers of ESTs per gene indicates differential levels of expression within the γ-gliadin gene family. Genome assignments were made for 6 of the 13 Chinese Spring γ-gliadin genes, i.e., one assignment from a match to two γ-gliadin genes found within a tetraploid wheat A genome BAC and four genes that match four distinct γ-gliadin sequences assembled from Roche 454 sequences from Aegilops tauschii, the hexaploid wheat D-genome ancestor.

  8. Hepatocellular carcinomas of the albumin SV40 T-antigen transgenic rat display fetal-like re-expression of lgf2 and deregulation of H19.

    PubMed

    Czarny, Matthew J; Babcock, Karlee; Baus, Rebecca M; Manoharan, Herbert; Pitot, Henry C

    2007-09-01

    Previous studies in our laboratory have shown that one of the earliest events during hepatocarcinogenesis in the albumin SV40 T antigen (Alb SV40 T Ag) transgenic rat is the duplication of chromosome 1q3.7-4.3, a region which contains the imprinted and coordinately regulated genes Igf2 and H19. We have also shown that this duplication is associated with the biallelic expression of the normally monoallelically-expressed H19. These results, however, are seemingly at odds with studies in the mouse that have shown a conservation of fetal regulatory patterns of these two genes in hepatic neoplasms. We therefore aimed in this study to determine the allelic origin of Igf2 expression in hepatocellular carcinomas of the Alb SV40 T Ag transgenic rat. Sprague-Dawley Alb SV40 T Ag transgenic rats and Brown Norway rats were reciprocally mated and the expression of Igf2 in hepatocellular carcinomas of the resulting F(1) transgene-positive female rats was analyzed by Northern blotting and RT-PCR. We determined that Igf2 was expressed exclusively from the paternal allele, which prompted the study (by the same methods) of the allelic origin of H19 in the same hepatocellular carcinomas in order to determine if the two genes remained coordinately regulated. Our results demonstrate fetal-like re-expression of Igf2 and deregulation of H19 in singular hepatocellular carcinomas of the rat. These results imply that another regulatory mechanism other than the generally accepted ICR/CTCF mechanism may play a role in the control of Igf2 and H19 expression. (c) 2007 Wiley-Liss, Inc.

  9. Diversification of Genes Encoding Granule-Bound Starch Synthase in Monocots and Dicots Is Marked by Multiple Genome-Wide Duplication Events

    PubMed Central

    Qiu, Wen-Ming; Li, Jing; Zhou, Hui; Zhang, Qiong; Guo, Wenwu; Zhu, Tingting; Peng, Junhua; Sun, Fengjie; Li, Shaohua; Korban, Schuyler S.; Han, Yuepeng

    2012-01-01

    Starch is one of the major components of cereals, tubers, and fruits. Genes encoding granule-bound starch synthase (GBSS), which is responsible for amylose synthesis, have been extensively studied in cereals but little is known about them in fruits. Due to their low copy gene number, GBSS genes have been used to study plant phylogenetic and evolutionary relationships. In this study, GBSS genes have been isolated and characterized in three fruit trees, including apple, peach, and orange. Moreover, a comprehensive evolutionary study of GBSS genes has also been conducted between both monocots and eudicots. Results have revealed that genomic structures of GBSS genes in plants are conserved, suggesting they all have evolved from a common ancestor. In addition, the GBSS gene in an ancestral angiosperm must have undergone genome duplication ∼251 million years ago (MYA) to generate two families, GBSSI and GBSSII. Both GBSSI and GBSSII are found in monocots; however, GBSSI is absent in eudicots. The ancestral GBSSII must have undergone further divergence when monocots and eudicots split ∼165 MYA. This is consistent with expression profiles of GBSS genes, wherein these profiles are more similar to those of GBSSII in eudicots than to those of GBSSI genes in monocots. In dicots, GBSSII must have undergone further divergence when rosids and asterids split from each other ∼126 MYA. Taken together, these findings suggest that it is GBSSII rather than GBSSI of monocots that have orthologous relationships with GBSS genes of eudicots. Moreover, diversification of GBSS genes is mainly associated with genome-wide duplication events throughout the evolutionary course of history of monocots and eudicots. PMID:22291904

  10. Evolutionary history and functional divergence of the cytochrome P450 gene superfamily between Arabidopsis thaliana and Brassica species uncover effects of whole genome and tandem duplications.

    PubMed

    Yu, Jingyin; Tehrim, Sadia; Wang, Linhai; Dossa, Komivi; Zhang, Xiurong; Ke, Tao; Liao, Boshou

    2017-09-18

    The cytochrome P450 monooxygenase (P450) superfamily is involved in the biosynthesis of various primary and secondary metabolites. However, little is known about the effects of whole genome duplication (WGD) and tandem duplication (TD) events on the evolutionary history and functional divergence of P450s in Brassica after splitting from a common ancestor with Arabidopsis thaliana. Using Hidden Markov Model search and manual curation, we detected that Brassica species have nearly 1.4-fold as many P450 members as A. thaliana. Most P450s in A. thaliana and Brassica species were located on pseudo-chromosomes. The inferred phylogeny indicated that all P450s were clustered into two different subgroups. Analysis of WGD event revealed that different P450 gene families had appeared after evolutionary events of species. For the TD event analyses, the P450s from TD events in Brassica species can be divided into ancient and recent parts. Our comparison of influence of WGD and TD events on the P450 gene superfamily between A. thaliana and Brassica species indicated that the family-specific evolution in the Brassica lineage can be attributed to both WGD and TD, whereas WGD was recognized as the major mechanism for the recent evolution of the P450 super gene family. Expression analysis of P450s from A. thaliana and Brassica species indicated that WGD-type P450s showed the same expression pattern but completely different expression with TD-type P450s across different tissues in Brassica species. Selection force analysis suggested that P450 orthologous gene pairs between A. thaliana and Brassica species underwent negative selection, but no significant differences were found between P450 orthologous gene pairs in A. thaliana-B. rapa and A. thaliana-B. oleracea lineages, as well as in different subgenomes in B. rapa or B. oleracea compared with A. thaliana. This study is the first to investigate the effects of WGD and TD on the evolutionary history and functional divergence of P450 gene families in A. thaliana and Brassica species. This study provides a biology model to study the mechanism of gene family formation, particularly in the context of the evolutionary history of angiosperms, and offers novel insights for the study of angiosperm genomes.

  11. Genome-wide analysis of the potato Hsp20 gene family: identification, genomic organization and expression profiles in response to heat stress.

    PubMed

    Zhao, Peng; Wang, Dongdong; Wang, Ruoqiu; Kong, Nana; Zhang, Chao; Yang, Chenghui; Wu, Wentao; Ma, Haoli; Chen, Qin

    2018-01-18

    Heat shock proteins (Hsps) are essential components in plant tolerance mechanism under various abiotic stresses. Hsp20 is the major family of heat shock proteins, but little of Hsp20 family is known in potato (Solanum tuberosum), which is an important vegetable crop that is thermosensitive. To reveal the mechanisms of potato Hsp20s coping with abiotic stresses, analyses of the potato Hsp20 gene family were conducted using bioinformatics-based methods. In total, 48 putative potato Hsp20 genes (StHsp20s) were identified and named according to their chromosomal locations. A sequence analysis revealed that most StHsp20 genes (89.6%) possessed no, or only one, intron. A phylogenetic analysis indicated that all of the StHsp20 genes, except 10, were grouped into 12 subfamilies. The 48 StHsp20 genes were randomly distributed on 12 chromosomes. Nineteen tandem duplicated StHsp20s and one pair of segmental duplicated genes (StHsp20-15 and StHsp20-48) were identified. A cis-element analysis inferred that StHsp20s, except for StHsp20-41, possessed at least one stress response cis-element. A heatmap of the StHsp20 gene family showed that the genes, except for StHsp20-2 and StHsp20-45, were expressed in various tissues and organs. Real-time quantitative PCR was used to detect the expression level of StHsp20 genes and demonstrated that the genes responded to multiple abiotic stresses, such as heat, salt or drought stress. The relative expression levels of 14 StHsp20 genes (StHsp20-4, 6, 7, 9, 20, 21, 33, 34, 35, 37, 41, 43, 44 and 46) were significantly up-regulated (more than 100-fold) under heat stress. These results provide valuable information for clarifying the evolutionary relationship of the StHsp20 family and in aiding functional characterization of StHsp20 genes in further research.

  12. Trans-oligomerization of duplicated aminoacyl-tRNA synthetases maintains genetic code fidelity under stress.

    PubMed

    Rubio, Miguel Ángel; Napolitano, Mauro; Ochoa de Alda, Jesús A G; Santamaría-Gómez, Javier; Patterson, Carl J; Foster, Andrew W; Bru-Martínez, Roque; Robinson, Nigel J; Luque, Ignacio

    2015-11-16

    Aminoacyl-tRNA synthetases (aaRSs) play a key role in deciphering the genetic message by producing charged tRNAs and are equipped with proofreading mechanisms to ensure correct pairing of tRNAs with their cognate amino acid. Duplicated aaRSs are very frequent in Nature, with 25,913 cases observed in 26,837 genomes. The oligomeric nature of many aaRSs raises the question of how the functioning and oligomerization of duplicated enzymes is organized. We characterized this issue in a model prokaryotic organism that expresses two different threonyl-tRNA synthetases, responsible for Thr-tRNA(Thr) synthesis: one accurate and constitutively expressed (T1) and another (T2) with impaired proofreading activity that also generates mischarged Ser-tRNA(Thr). Low zinc promotes dissociation of dimeric T1 into monomers deprived of aminoacylation activity and simultaneous induction of T2, which is active for aminoacylation under low zinc. T2 either forms homodimers or heterodimerizes with T1 subunits that provide essential proofreading activity in trans. These findings evidence that in organisms with duplicated genes, cells can orchestrate the assemblage of aaRSs oligomers that meet the necessities of the cell in each situation. We propose that controlled oligomerization of duplicated aaRSs is an adaptive mechanism that can potentially be expanded to the plethora of organisms with duplicated oligomeric aaRSs. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Evolution and Variation of Renin Genes in Mice

    PubMed Central

    Dickinson, Douglas P.; Gross, Kenneth W.; Piccini, Nina; Wilson, Carol M.

    1984-01-01

    Inbred strains of mice carry Ren-1, a gene encoding the thermostable Renin-1 isozyme. Ren-1 is expressed at relatively low levels in mouse submandibular gland and kidney. Some strains also carry Ren-2, a gene encoding the thermolabile Renin-2 isozyme. Ren-2 is expressed at high levels in the mouse submandibular gland and at very low levels, if at all, in the kidney. Ren-1 and Ren-2 are closely linked on mouse chromosome 1, show extensive homology in coding and noncoding regions and provide a model for studying the regulation of gene expression. An investigation of renin genes and enzymatic activity in wild-derived mice identified several restriction site polymorphisms as well as putative variants in renin gene expression and protein structure. The number of renin genes carried by different subpopulations of wild-derived mice is consistent with the occurrence of a gene duplication event prior to the divergence of M. spretus (2.75–5.5 million yr ago). This conclusion is in agreement with a prior estimate based upon comparative sequence analysis of Ren-1 and Ren-2 from inbred laboratory mice. PMID:6389258

  14. Functional diversification of duplicated CYC2 clade genes in regulation of inflorescence development in Gerbera hybrida (Asteraceae).

    PubMed

    Juntheikki-Palovaara, Inka; Tähtiharju, Sari; Lan, Tianying; Broholm, Suvi K; Rijpkema, Anneke S; Ruonala, Raili; Kale, Liga; Albert, Victor A; Teeri, Teemu H; Elomaa, Paula

    2014-09-01

    The complex inflorescences (capitula) of Asteraceae consist of different types of flowers. In Gerbera hybrida (gerbera), the peripheral ray flowers are bilaterally symmetrical and lack functional stamens while the central disc flowers are more radially symmetrical and hermaphroditic. Proteins of the CYC2 subclade of the CYC/TB1-like TCP domain transcription factors have been recruited several times independently for parallel evolution of bilaterally symmetrical flowers in various angiosperm plant lineages, and have also been shown to regulate flower-type identity in Asteraceae. The CYC2 subclade genes in gerbera show largely overlapping gene expression patterns. At the level of single flowers, their expression domain in petals shows a spatial shift from the dorsal pattern known so far in species with bilaterally symmetrical flowers, suggesting that this change in expression may have evolved after the origin of Asteraceae. Functional analysis indicates that GhCYC2, GhCYC3 and GhCYC4 mediate positional information at the proximal-distal axis of the inflorescence, leading to differentiation of ray flowers, but that they also regulate ray flower petal growth by affecting cell proliferation until the final size and shape of the petals is reached. Moreover, our data show functional diversification for the GhCYC5 gene. Ectopic activation of GhCYC5 increases flower density in the inflorescence, suggesting that GhCYC5 may promote the flower initiation rate during expansion of the capitulum. Our data thus indicate that modification of the ancestral network of TCP factors has, through gene duplications, led to the establishment of new expression domains and to functional diversification. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  15. Comparative transcriptome analysis of obligately asexual and cyclically sexual rotifers reveals genes with putative functions in sexual reproduction, dormancy, and asexual egg production.

    PubMed

    Hanson, Sara J; Stelzer, Claus-Peter; Welch, David B Mark; Logsdon, John M

    2013-06-19

    Sexual reproduction is a widely studied biological process because it is critically important to the genetics, evolution, and ecology of eukaryotes. Despite decades of study on this topic, no comprehensive explanation has been accepted that explains the evolutionary forces underlying its prevalence and persistence in nature. Monogonont rotifers offer a useful system for experimental studies relating to the evolution of sexual reproduction due to their rapid reproductive rate and close relationship to the putatively ancient asexual bdelloid rotifers. However, little is known about the molecular underpinnings of sex in any rotifer species. We generated mRNA-seq libraries for obligate parthenogenetic (OP) and cyclical parthenogenetic (CP) strains of the monogonont rotifer, Brachionus calyciflorus, to identify genes specific to both modes of reproduction. Our differential expression analysis identified receptors with putative roles in signaling pathways responsible for the transition from asexual to sexual reproduction. Differential expression of a specific copy of the duplicated cell cycle regulatory gene CDC20 and specific copies of histone H2A suggest that such duplications may underlie the phenotypic plasticity required for reproductive mode switch in monogononts. We further identified differential expression of genes involved in the formation of resting eggs, a process linked exclusively to sex in this species. Finally, we identified transcripts from the bdelloid rotifer Adineta ricciae that have significant sequence similarity to genes with higher expression in CP strains of B. calyciflorus. Our analysis of global gene expression differences between facultatively sexual and exclusively asexual populations of B. calyciflorus provides insights into the molecular nature of sexual reproduction in rotifers. Furthermore, our results offer insight into the evolution of obligate asexuality in bdelloid rotifers and provide indicators important for the use of monogononts as a model system for investigating the evolution of sexual reproduction.

  16. Sucrose metabolism gene families and their biological functions

    PubMed Central

    Jiang, Shu-Ye; Chi, Yun-Hua; Wang, Ji-Zhou; Zhou, Jun-Xia; Cheng, Yan-Song; Zhang, Bao-Lan; Ma, Ali; Vanitha, Jeevanandam; Ramachandran, Srinivasan

    2015-01-01

    Sucrose, as the main product of photosynthesis, plays crucial roles in plant development. Although studies on general metabolism pathway were well documented, less information is available on the genome-wide identification of these genes, their expansion and evolutionary history as well as their biological functions. We focused on four sucrose metabolism related gene families including sucrose synthase, sucrose phosphate synthase, sucrose phosphate phosphatase and UDP-glucose pyrophosphorylase. These gene families exhibited different expansion and evolutionary history as their host genomes experienced differentiated rates of the whole genome duplication, tandem and segmental duplication, or mobile element mediated gene gain and loss. They were evolutionarily conserved under purifying selection among species and expression divergence played important roles for gene survival after expansion. However, we have detected recent positive selection during intra-species divergence. Overexpression of 15 sorghum genes in Arabidopsis revealed their roles in biomass accumulation, flowering time control, seed germination and response to high salinity and sugar stresses. Our studies uncovered the molecular mechanisms of gene expansion and evolution and also provided new insight into the role of positive selection in intra-species divergence. Overexpression data revealed novel biological functions of these genes in flowering time control and seed germination under normal and stress conditions. PMID:26616172

  17. Expansion Mechanisms and Evolutionary History on Genes Encoding DNA Glycosylases and Their Involvement in Stress and Hormone Signaling

    PubMed Central

    Jiang, Shu-Ye; Ramachandran, Srinivasan

    2016-01-01

    DNA glycosylases catalyze the release of methylated bases. They play vital roles in the base excision repair pathway and might also function in DNA demethylation. At least three families of DNA glycosylases have been identified, which included 3′-methyladenine DNA glycosylase (MDG) I, MDG II, and HhH-GPD (Helix–hairpin–Helix and Glycine/Proline/aspartate (D)). However, little is known on their genome-wide identification, expansion, and evolutionary history as well as their expression profiling and biological functions. In this study, we have genome-widely identified and evolutionarily characterized these family members. Generally, a genome encodes only one MDG II gene in most of organisms. No MDG I or MDG II gene was detected in green algae. However, HhH-GPD genes were detectable in all available organisms. The ancestor species contain small size of MDG I and HhH-GPD families. These two families were mainly expanded through the whole-genome duplication and segmental duplication. They were evolutionarily conserved and were generally under purifying selection. However, we have detected recent positive selection among the Oryza genus, which might play roles in species divergence. Further investigation showed that expression divergence played important roles in gene survival after expansion. All of these family genes were expressed in most of developmental stages and tissues in rice plants. High ratios of family genes were downregulated by drought and fungus pathogen as well as abscisic acid (ABA) and jasmonic acid (JA) treatments, suggesting a negative regulation in response to drought stress and pathogen infection through ABA- and/or JA-dependent hormone signaling pathway. PMID:27026054

  18. Selection Shapes Transcriptional Logic and Regulatory Specialization in Genetic Networks

    PubMed Central

    Fogelmark, Karl; Peterson, Carsten; Troein, Carl

    2016-01-01

    Background Living organisms need to regulate their gene expression in response to environmental signals and internal cues. This is a computational task where genes act as logic gates that connect to form transcriptional networks, which are shaped at all scales by evolution. Large-scale mutations such as gene duplications and deletions add and remove network components, whereas smaller mutations alter the connections between them. Selection determines what mutations are accepted, but its importance for shaping the resulting networks has been debated. Methodology To investigate the effects of selection in the shaping of transcriptional networks, we derive transcriptional logic from a combinatorially powerful yet tractable model of the binding between DNA and transcription factors. By evolving the resulting networks based on their ability to function as either a simple decision system or a circadian clock, we obtain information on the regulation and logic rules encoded in functional transcriptional networks. Comparisons are made between networks evolved for different functions, as well as with structurally equivalent but non-functional (neutrally evolved) networks, and predictions are validated against the transcriptional network of E. coli. Principal Findings We find that the logic rules governing gene expression depend on the function performed by the network. Unlike the decision systems, the circadian clocks show strong cooperative binding and negative regulation, which achieves tight temporal control of gene expression. Furthermore, we find that transcription factors act preferentially as either activators or repressors, both when binding multiple sites for a single target gene and globally in the transcriptional networks. This separation into positive and negative regulators requires gene duplications, which highlights the interplay between mutation and selection in shaping the transcriptional networks. PMID:26927540

  19. Gene duplication, population genomics, and species-level differentiation within a tropical mountain shrub.

    PubMed

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C

    2014-09-14

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Recurrent Deletions and Reciprocal Duplications of 10q11.21q11.23 Including CHAT and SLC18A3 are Likely Mediated by Complex Low-Copy Repeats

    PubMed Central

    Stankiewicz, Paweł; Kulkarni, Shashikant; Dharmadhikari, Avinash V.; Sampath, Srirangan; Bhatt, Samarth S.; Shaikh, Tamim H.; Xia, Zhilian; Pursley, Amber N.; Cooper, M. Lance; Shinawi, Marwan; Paciorkowski, Alex R.; Grange, Dorothy K.; Noetzel, Michael J.; Saunders, Scott; Simons, Paul; Summar, Marshall; Lee, Brendan; Scaglia, Fernando; Fellmann, Florence; Martinet, Danielle; Beckmann, Jacques S.; Asamoah, Alexander; Platky, Kathryn; Sparks, Susan; Martin, Ann S.; Madan-Khetarpal, Suneeta; Hoover, Jacqueline; Medne, Livija; Bonnemann, Carsten G.; Moeschler, John B.; Vallee, Stephanie E.; Parikh, Sumit; Irwin, Polly; Dalzell, Victoria P.; Smith, Wendy E.; Banks, Valerie C.; Flannery, David B.; Lovell, Carolyn M.; Bellus, Gary A.; Golden-Grant, Kathryn; Gorski, Jerome L.; Kussmann, Jennifer L.; McGregor, Tracy L.; Hamid, Rizwan; Pfotenhauer, Jean; Ballif, Blake C.; Shaw, Chad A.; Kang, Sung-Hae L.; Bacino, Carlos A.; Patel, Ankita; Rosenfeld, Jill A.; Cheung, Sau Wai; Shaffer, Lisa G.

    2013-01-01

    We report 24 unrelated individuals with deletions and 17 additional cases with duplications at 10q11.21q21.1 identified by chromosomal microarray analysis. The rearrangements range in size from 0.3 to 12 Mb. Nineteen of the deletions and eight duplications are flanked by large, directly oriented segmental duplications of >98% sequence identity, suggesting that nonallelic homologous recombination (NAHR) caused these genomic rearrangements. Nine individuals with deletions and five with duplications have additional copy number changes. Detailed clinical evaluation of 20 patients with deletions revealed variable clinical features, with developmental delay (DD) and/or intellectual disability (ID) as the only features common to a majority of individuals. We suggest that some of the other features present in more than one patient with deletion, including hypotonia, sleep apnea, chronic constipation, gastroesophageal and vesicoureteral refluxes, epilepsy, ataxia, dysphagia, nystagmus, and ptosis may result from deletion of the CHAT gene, encoding choline acetyltransferase, and the SLC18A3 gene, mapping in the first intron of CHAT and encoding vesicular acetylcholine transporter. The phenotypic diversity and presence of the deletion in apparently normal carrier parents suggest that subjects carrying 10q11.21q11.23 deletions may exhibit variable phenotypic expressivity and incomplete penetrance influenced by additional genetic and nongenetic modifiers. PMID:21948486

Top