The early stages of duplicate gene evolution
Moore, Richard C.; Purugganan, Michael D.
2003-01-01
Gene duplications are one of the primary driving forces in the evolution of genomes and genetic systems. Gene duplicates account for 8–20% of the genes in eukaryotic genomes, and the rates of gene duplication are estimated at between 0.2% and 2% per gene per million years. Duplicate genes are believed to be a major mechanism for the establishment of new gene functions and the generation of evolutionary novelty, yet very little is known about the early stages of the evolution of duplicated gene pairs. It is unclear, for example, to what extent selection, rather than neutral genetic drift, drives the fixation and early evolution of duplicate loci. Analysis of recently duplicated genes in the Arabidopsis thaliana genome reveals significantly reduced species-wide levels of nucleotide polymorphisms in the progenitor and/or duplicate gene copies, suggesting that selective sweeps accompany the initial stages of the evolution of these duplicated gene pairs. Our results support recent theoretical work that indicates that fates of duplicate gene pairs may be determined in the initial phases of duplicate gene evolution and that positive selection plays a prominent role in the evolutionary dynamics of the very early histories of duplicate nuclear genes. PMID:14671323
Evolution of Gene Duplication in Plants1[OPEN
2016-01-01
Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. PMID:27288366
Evolution of Gene Duplication in Plants.
Panchy, Nicholas; Lehti-Shiu, Melissa; Shiu, Shin-Han
2016-08-01
Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. © 2016 American Society of Plant Biologists. All Rights Reserved.
Lu, Jianguo; Peatman, Eric; Tang, Haibao; Lewis, Joshua; Liu, Zhanjiang
2012-06-15
Gene duplication has had a major impact on genome evolution. Localized (or tandem) duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks), and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting their origin of independent and continuous duplication. This is particularly true for the zebrafish genome. Further analysis of the duplicated gene sets indicated that a significant portion of duplicated genes in the zebrafish genome were of recent, lineage-specific duplication events. Most strikingly, a subset of duplicated genes is enriched among the recently duplicated genes involved in immune or sensory response pathways. Such findings demonstrated the significance of continuous gene duplication as well as that of whole genome duplication in the course of genome evolution.
Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.
Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J
2016-11-04
Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types, duplication ages and co-expression consequences.
Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms.
Li, Zhen; Defoort, Jonas; Tasdighian, Setareh; Maere, Steven; Van de Peer, Yves; De Smet, Riet
2016-02-01
Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of "gene duplicability" is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. © 2016 American Society of Plant Biologists. All rights reserved.
Wang, Yupeng; Ficklin, Stephen P; Wang, Xiyin; Feltus, F Alex; Paterson, Andrew H
2016-01-01
Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.
Wang, Yupeng; Ficklin, Stephen P.; Wang, Xiyin; Feltus, F. Alex; Paterson, Andrew H.
2016-01-01
Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots. PMID:27195960
Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms[OPEN
Li, Zhen; Van de Peer, Yves; De Smet, Riet
2016-01-01
Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of “gene duplicability” is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. PMID:26744215
Gene duplication and the evolution of phenotypic diversity in insect societies.
Chau, Linh M; Goodisman, Michael A D
2017-12-01
Gene duplication is an important evolutionary process thought to facilitate the evolution of phenotypic diversity. We investigated if gene duplication was associated with the evolution of phenotypic differences in a highly social insect, the honeybee Apis mellifera. We hypothesized that the genetic redundancy provided by gene duplication could promote the evolution of social and sexual phenotypes associated with advanced societies. We found a positive correlation between sociality and rate of gene duplications across the Apoidea, indicating that gene duplication may be associated with sociality. We also discovered that genes showing biased expression between A. mellifera alternative phenotypes tended to be found more frequently than expected among duplicated genes than singletons. Moreover, duplicated genes had higher levels of caste-, sex-, behavior-, and tissue-biased expression compared to singletons, as expected if gene duplication facilitated phenotypic differentiation. We also found that duplicated genes were maintained in the A. mellifera genome through the processes of conservation, neofunctionalization, and specialization, but not subfunctionalization. Overall, we conclude that gene duplication may have facilitated the evolution of social and sexual phenotypes, as well as tissue differentiation. Thus this study further supports the idea that gene duplication allows species to evolve an increased range of phenotypic diversity. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.
A limited role for gene duplications in the evolution of platypus venom.
Wong, Emily S W; Papenfuss, Anthony T; Whittington, Camilla M; Warren, Wesley C; Belov, Katherine
2012-01-01
Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the "venome" of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation.
A Limited Role for Gene Duplications in the Evolution of Platypus Venom
Wong, Emily S. W.; Papenfuss, Anthony T.; Whittington, Camilla M.; Warren, Wesley C.; Belov, Katherine
2012-01-01
Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the “venome” of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation. PMID:21816864
Both mechanism and age of duplications contribute to biased gene retention patterns in plants.
Rody, Hugo V S; Baute, Gregory J; Rieseberg, Loren H; Oliveira, Luiz O
2017-01-06
All extant seed plants are successful paleopolyploids, whose genomes carry duplicate genes that have survived repeated episodes of diploidization. However, the survival of gene duplicates is biased with respect to gene function and mechanism of duplication. Transcription factors, in particular, are reported to be preferentially retained following whole-genome duplications (WGDs), but disproportionately lost when duplicated by tandem events. An explanation for this pattern is provided by the Gene Balance Hypothesis (GBH), which posits that duplicates of highly connected genes are retained following WGDs to maintain optimal stoichiometry among gene products; but such connected gene duplicates are disfavored following tandem duplications. We used genomic data from 25 taxonomically diverse plant species to investigate the roles of duplication mechanism, gene function, and age of duplication in the retention of duplicate genes. Enrichment analyses were conducted to identify Gene Ontology (GO) functional categories that were overrepresented in either WGD or tandem duplications, or across ranges of divergence times. Tandem paralogs were much younger, on average, than WGD paralogs and the most frequently overrepresented GO categories were not shared between tandem and WGD paralogs. Transcription factors were overrepresented among ancient paralogs regardless of mechanism of origin or presence of a WGD. Also, in many cases, there was no bias toward transcription factor retention following recent WGDs. Both the fixation and the retention of duplicated genes in plant genomes are context-dependent events. The strong bias toward ancient transcription factor duplicates can be reconciled with the GBH if selection for optimal stoichiometry among gene products is strongest following the earliest polyploidization events and becomes increasingly relaxed as gene families expand.
Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.
Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M
2017-08-01
Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.
Paterson, Andrew H; Chapman, Brad A; Kissinger, Jessica C; Bowers, John E; Feltus, Frank A; Estill, James C
2006-11-01
Genome duplication is potentially a good source of new genes, but such genes take time to evolve. We have found a group of "duplication-resistant" genes, which have undergone convergent restoration to singleton status following several independent genome duplications. Restoration of duplication-resistant genes to singleton status could be important to long-term survival of a polyploid lineage. Angiosperms show more frequent polyploidization and a higher degree of duplicate gene preservation than other paleopolyploids, making them well-suited to further study of duplication-resistant genes.
Sanzol, Javier
2010-05-14
Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae), a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to gamma-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82% of duplicated genes, including 80% of young paralogs, showed uncorrelated expression profiles, suggesting extensive subfunctionalization and a role of gene duplication in the acquisition of novel patterns of gene expression. This study reports a genome-wide analysis of the mode of gene duplication in the apple, and provides evidence for its role in genome functional diversification by characterising three major processes: selective retention of paralogs, amplification of gene families, and changes in gene expression.
Pan, Deng; Zhang, Liqing
2007-01-01
Background The rate of gene duplication is an important parameter in the study of evolution, but the influence of gene conversion and technical problems have confounded previous attempts to provide a satisfying estimate. We propose a new strategy to estimate the rate that involves separate quantification of the rates of two different mechanisms of gene duplication and subsequent combination of the two rates, based on their respective contributions to the overall gene duplication rate. Results Previous estimates of gene duplication rates are based on small gene families. Therefore, to assess the applicability of this to families of all sizes, we looked at both two-copy gene families and the entire genome. We studied unequal crossover and retrotransposition, and found that these mechanisms of gene duplication are largely independent and account for a substantial amount of duplicated genes. Unequal crossover contributed more to duplications in the entire genome than retrotransposition did, but this contribution was significantly less in two-copy gene families, and duplicated genes arising from this mechanism are more likely to be retained. Combining rates of duplication using the two mechanisms, we estimated the overall rates to be from approximately 0.515 to 1.49 × 10-3 per gene per million years in human, and from approximately 1.23 to 4.23 × 10-3 in mouse. The rates estimated from two-copy gene families are always lower than those from the entire genome, and so it is not appropriate to use small families to estimate the rate for the entire genome. Conclusion We present a novel strategy for estimating gene duplication rates. Our results show that different mechanisms contribute differently to the evolution of small and large gene families. PMID:17683522
Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P; Feltus, F Alex; Paterson, Andrew H
2011-01-01
Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution.
Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P.; Feltus, F. Alex; Paterson, Andrew H.
2011-01-01
Background Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. Results In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Conclusion Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution. PMID:22164235
Chapman, Brad A; Bowers, John E; Feltus, Frank A; Paterson, Andrew H
2006-02-21
Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a "paleologous" locus than in "singleton" genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during "genomic turmoil" immediately after genome duplication but continues to act approximately 60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages.
Chapman, Brad A.; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.
2006-01-01
Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a “paleologous” locus than in “singleton” genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during “genomic turmoil” immediately after genome duplication but continues to act ≈60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages. PMID:16467140
Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution
Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.
2014-01-01
Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106
Jin, Jing; Jin, Xiaolei; Jiang, Haiyang; Yan, Hanwei; Cheng, Beijiu
2014-01-01
Whole-genome duplication events (polyploidy events) and gene loss events have played important roles in the evolution of legumes. Here we show that the vast majority of Hsf gene duplications resulted from whole genome duplication events rather than tandem duplication, and significant differences in gene retention exist between species. By searching for intraspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found that genome duplications accounted for 42 of 46 Hsf-containing segments in Glycine max, while paired segments were rarely identified in Lotus japonicas, Medicago truncatula and Cajanus cajan. However, by comparing interspecies microsynteny, we determined that the great majority of Hsf-containing segments in Lotus japonicas, Medicago truncatula and Cajanus cajan show extensive conservation with the duplicated regions of Glycine max. These segments formed 17 groups of orthologous segments. These results suggest that these regions shared ancient genome duplication with Hsf genes in Glycine max, but more than half of the copies of these genes were lost. On the other hand, the Glycine max Hsf gene family retained approximately 75% and 84% of duplicated genes produced from the ancient genome duplication and recent Glycine-specific genome duplication, respectively. Continuous purifying selection has played a key role in the maintenance of Hsf genes in Glycine max. Expression analysis of the Hsf genes in Lotus japonicus revealed their putative involvement in multiple tissue-/developmental stages and responses to various abiotic stimuli. This study traces the evolution of Hsf genes in legume species and demonstrates that the rates of gene gain and loss are far from equilibrium in different species. PMID:25047803
Fares, Mario A; Sabater-Muñoz, Beatriz; Toft, Christina
2017-05-01
Gene duplication generates new genetic material, which has been shown to lead to major innovations in unicellular and multicellular organisms. A whole-genome duplication occurred in the ancestor of Saccharomyces yeast species but 92% of duplicates returned to single-copy genes shortly after duplication. The persisting duplicated genes in Saccharomyces led to the origin of major metabolic innovations, which have been the source of the unique biotechnological capabilities in the Baker's yeast Saccharomyces cerevisiae. What factors have determined the fate of duplicated genes remains unknown. Here, we report the first demonstration that the local genome mutation and transcription rates determine the fate of duplicates. We show, for the first time, a preferential location of duplicated genes in the mutational and transcriptional hotspots of S. cerevisiae genome. The mechanism of duplication matters, with whole-genome duplicates exhibiting different preservation trends compared to small-scale duplicates. Genome mutational and transcriptional hotspots are rich in duplicates with large repetitive promoter elements. Saccharomyces cerevisiae shows more tolerance to deleterious mutations in duplicates with repetitive promoter elements, which in turn exhibit higher transcriptional plasticity against environmental perturbations. Our data demonstrate that the genome traps duplicates through the accelerated regulatory and functional divergence of their gene copies providing a source of novel adaptations in yeast. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc
2017-01-01
Abstract The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. PMID:28981708
Functional requirements driving the gene duplication in 12 Drosophila species.
Zhong, Yan; Jia, Yanxiao; Gao, Yang; Tian, Dacheng; Yang, Sihai; Zhang, Xiaohui
2013-08-15
Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila.
Gu, Xun; Wang, Yufeng; Gu, Jianying
2002-06-01
The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome.
Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc
2017-11-01
The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Detecting long tandem duplications in genomic sequences.
Audemard, Eric; Schiex, Thomas; Faraut, Thomas
2012-05-08
Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication. In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS < 1) and that it is also able to predict tandem duplications involving non coding elements such as pseudo-genes or RNA genes. ReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.
Christensen, Kris A; Davidson, William S
2017-01-01
Salmonids (e.g. Atlantic salmon, Pacific salmon, and trouts) have a long legacy of genome duplication. In addition to three ancient genome duplications that all teleosts are thought to share, salmonids have had one additional genome duplication. We explored a methodology for untangling these duplications from each other to better understand them in Atlantic salmon. In this methodology, homeologous regions (paralogous/duplicated genomic regions originating from a whole genome duplication) from the most recent genome duplication were assumed to have duplicated genes at greater density and have greater sequence similarity. This assumption was used to differentiate duplicated gene pairs in Atlantic salmon that are either from the most recent genome duplication or from earlier duplications. From a comparison with multiple vertebrate species, it is clear that Atlantic salmon have retained more duplicated genes from ancient genome duplications than other vertebrates--often at higher density in the genome and containing fewer synonymous mutations. It may be that polysomic inheritance is the mechanism responsible for maintaining ancient gene duplicates in salmonids. Polysomic inheritance (when multiple chromosomes pair during meiosis) is thought to be relatively common in salmonids compared to other vertebrate species. These findings illuminate how genome duplications may not only increase the number of duplicated genes, but may also be involved in the maintenance of them from previous genome duplications as well.
Baker, Richard H; Narechania, Apurva; Johns, Philip M; Wilkinson, Gerald S
2012-08-19
Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict.
Baker, Richard H.; Narechania, Apurva; Johns, Philip M.; Wilkinson, Gerald S.
2012-01-01
Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict. PMID:22777023
Identification of three duplicated Spin genes in medaka (Oryzias latipes).
Wang, Xiao-Lei; Mei, Jie; Sun, Min; Hong, Yun-Han; Gui, Jian-Fang
2005-05-09
Gene and genomic duplications are very important and frequent events in fish evolution, and the divergence of duplicated genes in sequences and functions is a focus of research on gene evolution. Here, we report the identification and characterization of three duplicated Spindlin (Spin) genes from medaka (Oryzias latipes): OlSpinA, OlSpinB, and OlSpinC. Molecular cloning, genomic DNA Blast analysis and phylogenetic relationship analysis demonstrated that the three duplicated OlSpin genes should belong to gene duplication. Furthermore, Western blot analysis revealed significant expression differences of the three OlSpins among different tissues and during embryogenesis in medaka, and suggested that sequence and functional divergence might have occurred in evolution among them.
New genes from old: asymmetric divergence of gene duplicates and the evolution of development.
Holland, Peter W H; Marlétaz, Ferdinand; Maeso, Ignacio; Dunwell, Thomas L; Paps, Jordi
2017-02-05
Gene duplications and gene losses have been frequent events in the evolution of animal genomes, with the balance between these two dynamic processes contributing to major differences in gene number between species. After gene duplication, it is common for both daughter genes to accumulate sequence change at approximately equal rates. In some cases, however, the accumulation of sequence change is highly uneven with one copy radically diverging from its paralogue. Such 'asymmetric evolution' seems commoner after tandem gene duplication than after whole-genome duplication, and can generate substantially novel genes. We describe examples of asymmetric evolution in duplicated homeobox genes of moths, molluscs and mammals, in each case generating new homeobox genes that were recruited to novel developmental roles. The prevalence of asymmetric divergence of gene duplicates has been underappreciated, in part, because the origin of highly divergent genes can be difficult to resolve using standard phylogenetic methods.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).
Liu, Gangbiao; Zou, Yangyun; Cheng, Qiqun; Zeng, Yanwu; Gu, Xun; Su, Zhixi
2014-04-01
The age distribution of gene duplication events within the human genome exhibits two waves of duplications along with an ancient component. However, because of functional constraint differences, genes in different functional categories might show dissimilar retention patterns after duplication. It is known that genes in some functional categories are highly duplicated in the early stage of vertebrate evolution. However, the correlations of the age distribution pattern of gene duplication between the different functional categories are still unknown. To investigate this issue, we developed a robust pipeline to date the gene duplication events in the human genome. We successfully estimated about three-quarters of the duplication events within the human genome, along with the age distribution pattern in each Gene Ontology (GO) slim category. We found that some GO slim categories show different distribution patterns when compared to the whole genome. Further hierarchical clustering of the GO slim functional categories enabled grouping into two main clusters. We found that human genes located in the duplicated copy number variant regions, whose duplicate genes have not been fixed in the human population, were mainly enriched in the groups with a high proportion of recently duplicated genes. Moreover, we used a phylogenetic tree-based method to date the age of duplications in three signaling-related gene superfamilies: transcription factors, protein kinases and G-protein coupled receptors. These superfamilies were expressed in different subcellular localizations. They showed a similar age distribution as the signaling-related GO slim categories. We also compared the differences between the age distributions of gene duplications in multiple subcellular localizations. We found that the distribution patterns of the major subcellular localizations were similar to that of the whole genome. This study revealed the whole picture of the evolution patterns of gene functional categories in the human genome.
Consensus properties and their large-scale applications for the gene duplication problem.
Moon, Jucheol; Lin, Harris T; Eulenstein, Oliver
2016-06-01
Solving the gene duplication problem is a classical approach for species tree inference from gene trees that are confounded by gene duplications. This problem takes a collection of gene trees and seeks a species tree that implies the minimum number of gene duplications. Wilkinson et al. posed the conjecture that the gene duplication problem satisfies the desirable Pareto property for clusters. That is, for every instance of the problem, all clusters that are commonly present in the input gene trees of this instance, called strict consensus, will also be found in every solution to this instance. We prove that this conjecture does not generally hold. Despite this negative result we show that the gene duplication problem satisfies a weaker version of the Pareto property where the strict consensus is found in at least one solution (rather than all solutions). This weaker property contributes to our design of an efficient scalable algorithm for the gene duplication problem. We demonstrate the performance of our algorithm in analyzing large-scale empirical datasets. Finally, we utilize the algorithm to evaluate the accuracy of standard heuristics for the gene duplication problem using simulated datasets.
PTGBase: an integrated database to study tandem duplicated genes in plants.
Yu, Jingyin; Ke, Tao; Tehrim, Sadia; Sun, Fengming; Liao, Boshou; Hua, Wei
2015-01-01
Tandem duplication is a wide-spread phenomenon in plant genomes and plays significant roles in evolution and adaptation to changing environments. Tandem duplicated genes related to certain functions will lead to the expansion of gene families and bring increase of gene dosage in the form of gene cluster arrays. Many tandem duplication events have been studied in plant genomes; yet, there is a surprising shortage of efforts to systematically present the integration of large amounts of information about publicly deposited tandem duplicated gene data across the plant kingdom. To address this shortcoming, we developed the first plant tandem duplicated genes database, PTGBase. It delivers the most comprehensive resource available to date, spanning 39 plant genomes, including model species and newly sequenced species alike. Across these genomes, 54 130 tandem duplicated gene clusters (129 652 genes) are presented in the database. Each tandem array, as well as its member genes, is characterized in complete detail. Tandem duplicated genes in PTGBase can be explored through browsing or searching by identifiers or keywords of functional annotation and sequence similarity. Users can download tandem duplicated gene arrays easily to any scale, up to the complete annotation data set for an entire plant genome. PTGBase will be updated regularly with newly sequenced plant species as they become available. © The Author(s) 2015. Published by Oxford University Press.
Chen, Yuan; Ding, Yun; Zhang, Zuming; Wang, Wen; Chen, Jun-Yuan; Ueno, Naoto; Mao, Bingyu
2011-12-20
The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes. Copyright © 2011. Published by Elsevier Ltd.
Furihata, Hazuka Y; Suenaga, Kazuya; Kawanabe, Takahiro; Yoshida, Takanori; Kawabe, Akira
2016-10-13
PRC2 genes were analyzed for their number of gene duplications, d N /d S ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquired functional roles in the endosperm.
Schnable, James C; Pedersen, Brent S; Subramaniam, Sabarinath; Freeling, Michael
2011-01-01
Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy.
Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael
2011-01-01
Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525
Acharya, Debarun; Ghosh, Tapash C
2016-01-22
Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution.
Comparative inference of duplicated genes produced by polyploidization in soybean genome.
Yang, Yanmei; Wang, Jinpeng; Di, Jianyong
2013-01-01
Soybean (Glycine max) is one of the most important crop plants for providing protein and oil. It is important to investigate soybean genome for its economic and scientific value. Polyploidy is a widespread and recursive phenomenon during plant evolution, and it could generate massive duplicated genes which is an important resource for genetic innovation. Improved sequence alignment criteria and statistical analysis are used to identify and characterize duplicated genes produced by polyploidization in soybean. Based on the collinearity method, duplicated genes by whole genome duplication account for 70.3% in soybean. From the statistical analysis of the molecular distances between duplicated genes, our study indicates that the whole genome duplication event occurred more than once in the genome evolution of soybean, which is often distributed near the ends of chromosomes.
Salaneck, Erik; Ardell, David H; Larson, Earl T; Larhammar, Dan
2003-08-01
It has been debated whether the increase in gene number during early vertebrate evolution was due to multiple independent gene duplications or synchronous duplications of many genes. We describe here the cloning of three neuropeptide Y (NPY) receptor genes belonging to the Y1 subfamily in the spiny dogfish, Squalus acanthias, a cartilaginous fish. The three genes are orthologs of the mammalian subtypes Y1, Y4, and Y6, which are located in paralogous gene regions on different chromosomes in mammals. Thus, these genes arose by duplications of a chromosome region before the radiation of gnathostomes (jawed vertebrates). Estimates of duplication times from linearized trees together with evidence from other gene families supports two rounds of chromosome duplications or tetraploidizations early in vertebrate evolution. The anatomical distribution of mRNA was determined by reverse-transcriptase PCR and was found to differ from mammals, suggesting differential functional diversification of the new gene copies during the radiation of the vertebrate classes.
Gene family size conservation is a good indicator of evolutionary rates.
Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen
2010-08-01
The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
Impact of gene gains, losses and duplication modes on the origin and diversification of vertebrates.
Cañestro, Cristian; Albalat, Ricard; Irimia, Manuel; Garcia-Fernàndez, Jordi
2013-02-01
The study of the evolutionary origin of vertebrates has been linked to the study of genome duplications since Susumo Ohno suggested that the successful diversification of vertebrate innovations was facilitated by two rounds of whole-genome duplication (2R-WGD) in the stem vertebrate. Since then, studies on the functional evolution of many genes duplicated in the vertebrate lineage have provided the grounds to support experimentally this link. This article reviews cases of gene duplications derived either from the 2R-WGD or from local gene duplication events in vertebrates, analyzing their impact on the evolution of developmental innovations. We analyze how gene regulatory networks can be rewired by the activity of transposable elements after genome duplications, discuss how different mechanisms of duplication might affect the fate of duplicated genes, and how the loss of gene duplicates might influence the fate of surviving paralogs. We also discuss the evolutionary relationships between gene duplication and alternative splicing, in particular in the vertebrate lineage. Finally, we discuss the role that the 2R-WGD might have played in the evolution of vertebrate developmental gene networks, paying special attention to those related to vertebrate key features such as neural crest cells, placodes, and the complex tripartite brain. In this context, we argue that current evidences points that the 2R-WGD may not be linked to the origin of vertebrate innovations, but to their subsequent diversification in a broad variety of complex structures and functions that facilitated the successful transition from peaceful filter-feeding non-vertebrate ancestors to voracious vertebrate predators. Copyright © 2013 Elsevier Ltd. All rights reserved.
Zeng, Jia; Hannenhalli, Sridhar
2013-01-01
Gene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. In turn, gene expression evolution is a critical component of overall functional evolution of paralogs. Inferring evolutionary history of gene expression among paralogs is therefore a problem of considerable interest. It also represents significant challenges. The standard approaches of evolutionary reconstruction assume that at an internal node of the duplication tree, the two duplicates evolve independently. However, because of various selection pressures functional evolution of the two paralogs may be coupled. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF). Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. These two interrelated problems of inferring gene expression and evolutionary fates of gene duplicates have not been studied together previously and motivate the present study. Here we propose a novel probabilistic framework and algorithm to simultaneously infer (i) ancestral gene expression and (ii) the likely fate (SF, NF, CF) at each duplication event during the evolution of gene family. Using tissue-specific gene expression data, we develop a nonparametric belief propagation (NBP) algorithm to predict the ancestral expression level as a proxy for function, and describe a novel probabilistic model that relates the predicted and known expression levels to the possible evolutionary fates. We validate our model using simulation and then apply it to a genome-wide set of gene duplicates in human. Our results suggest that SF tends to be more frequent at the earlier stage of gene family expansion, while NF occurs more frequently later on.
Complexity of Gene Expression Evolution after Duplication: Protein Dosage Rebalancing
Rogozin, Igor B.
2014-01-01
Ongoing debates about functional importance of gene duplications have been recently intensified by a heated discussion of the “ortholog conjecture” (OC). Under the OC, which is central to functional annotation of genomes, orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of gene ontology (GO) annotations and expression profiles, among within-species paralogs compared to orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. Subsequent studies suggested that the OC appears to be generally valid when applied to mammalian evolution but the complete picture of evolution of gene expression also has to incorporate lineage-specific aspects of paralogy. The observed complexity of gene expression evolution after duplication can be explained through selection for gene dosage effect combined with the duplication-degeneration-complementation model. This paper discusses expression divergence of recent duplications occurring before functional divergence of proteins encoded by duplicate genes. PMID:25197576
Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.
2014-01-01
Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767
Suvorov, Anton; Jensen, Nicholas O; Sharkey, Camilla R; Fujimoto, M Stanley; Bodily, Paul; Wightman, Haley M Cahill; Ogden, T Heath; Clement, Mark J; Bybee, Seth M
2017-03-01
Gene duplication plays a central role in adaptation to novel environments by providing new genetic material for functional divergence and evolution of biological complexity. Several evolutionary models have been proposed for gene duplication to explain how new gene copies are preserved by natural selection, but these models have rarely been tested using empirical data. Opsin proteins, when combined with a chromophore, form a photopigment that is responsible for the absorption of light, the first step in the phototransduction cascade. Adaptive gene duplications have occurred many times within the animal opsins' gene family, leading to novel wavelength sensitivities. Consequently, opsins are an attractive choice for the study of gene duplication evolutionary models. Odonata (dragonflies and damselflies) have the largest opsin repertoire of any insect currently known. Additionally, there is tremendous variation in opsin copy number between species, particularly in the long-wavelength-sensitive (LWS) class. Using comprehensive phylotranscriptomic and statistical approaches, we tested various evolutionary models of gene duplication. Our results suggest that both the blue-sensitive (BS) and LWS opsin classes were subjected to strong positive selection that greatly weakens after multiple duplication events, a pattern that is consistent with the permanent heterozygote model. Due to the immense interspecific variation and duplicability potential of opsin genes among odonates, they represent a unique model system to test hypotheses regarding opsin gene duplication and diversification at the molecular level. © 2016 John Wiley & Sons Ltd.
Bassham, Susan; Cañestro, Cristian; Postlethwait, John H
2008-08-22
Gene duplication provides opportunities for lineage diversification and evolution of developmental novelties. Duplicated genes generally either disappear by accumulation of mutations (nonfunctionalization), or are preserved either by the origin of positively selected functions in one or both duplicates (neofunctionalization), or by the partitioning of original gene subfunctions between the duplicates (subfunctionalization). The Pax2/5/8 family of important developmental regulators has undergone parallel expansion among chordate groups. After the divergence of urochordate and vertebrate lineages, two rounds of independent gene duplications resulted in the Pax2, Pax5, and Pax8 genes of most vertebrates (the sister group of the urochordates), and an additional duplication provided the pax2a and pax2b duplicates in teleost fish. Separate from the vertebrate genome expansions, a duplication also created two Pax2/5/8 genes in the common ancestor of ascidian and larvacean urochordates. To better understand mechanisms underlying the evolution of duplicated genes, we investigated, in the larvacean urochordate Oikopleura dioica, the embryonic gene expression patterns of Pax2/5/8 paralogs. We compared the larvacean and ascidian expression patterns to infer modular subfunctions present in the single pre-duplication Pax2/5/8 gene of stem urochordates, and we compared vertebrate and urochordate expression to infer the suite of Pax2/5/8 gene subfunctions in the common ancestor of olfactores (vertebrates + urochordates). Expression pattern differences of larvacean and ascidian Pax2/5/8 orthologs in the endostyle, pharynx and hindgut suggest that some ancestral gene functions have been partitioned differently to the duplicates in the two urochordate lineages. Novel expression in the larvacean heart may have resulted from the neofunctionalization of a Pax2/5/8 gene in the urochordates. Expression of larvacean Pax2/5/8 in the endostyle, in sites of epithelial remodeling, and in sensory tissues evokes like functions of Pax2, Pax5 and Pax8 in vertebrate embryos, and may indicate ancient origins for these functions in the chordate common ancestor. Comparative analysis of expression patterns of chordate Pax2/5/8 duplicates, rooted on the single-copy Pax2/5/8 gene of amphioxus, whose lineage diverged basally among chordates, provides new insights into the evolution and development of the heart, thyroid, pharynx, stomodeum and placodes in chordates; supports the controversial conclusion that the atrial siphon of ascidians and the otic placode in vertebrates are homologous; and backs the notion that Pax2/5/8 functioned in ancestral chordates to engineer epithelial fusions and perforations, including gill slit openings.
Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate
Dehal, Paramvir; Boore, Jeffrey L
2005-01-01
The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish–tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage. PMID:16128622
Models for loosely linked gene duplicates suggest lengthy persistence of both copies.
O'Hely, Martin; Wockner, Leesa
2007-06-21
Consider the appearance of a duplicate copy of a gene at a locus linked loosely, if at all, to the locus at which the gene is usually found. If all copies of the gene are subject to non-functionalizing mutations, then two fates are possible: loss of functional copies at the duplicate locus (loss of duplicate expression), or loss of functional copies at the original locus (map change). This paper proposes a simple model to address the probability of map change, the time taken for a map change and/or loss of duplicate expression, and considers where in the spectrum between loss of duplicate expression and map change such a duplicate complex is likely to be found. The findings are: the probability of map change is always half the reciprocal of the population size N, the time for a map change to occur is order NlogN generations, and that there is a marked tendency for duplicates to remain near equi-frequency with the gene at the original locus for a large portion of that time. This is in excellent agreement with simulations.
Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C
2014-09-14
Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Emms, David M; Covshoff, Sarah; Hibberd, Julian M; Kelly, Steven
2016-07-01
C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. C4 photosynthesis, gene duplication, gene families, parallel evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; Kelly, Steven
2016-01-01
C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. Key words: C4 photosynthesis, gene duplication, gene families, parallel evolution. PMID:27016024
Sorting by Cuts, Joins, and Whole Chromosome Duplications.
Zeira, Ron; Shamir, Ron
2017-02-01
Genome rearrangement problems have been extensively studied due to their importance in biology. Most studied models assumed a single copy per gene. However, in reality, duplicated genes are common, most notably in cancer. In this study, we make a step toward handling duplicated genes by considering a model that allows the atomic operations of cut, join, and whole chromosome duplication. Given two linear genomes, [Formula: see text] with one copy per gene and [Formula: see text] with two copies per gene, we give a linear time algorithm for computing a shortest sequence of operations transforming [Formula: see text] into [Formula: see text] such that all intermediate genomes are linear. We also show that computing an optimal sequence with fewest duplications is NP-hard.
Neutral and Non-Neutral Evolution of Duplicated Genes with Gene Conversion
Fawcett, Jeffrey A.; Innan, Hideki
2011-01-01
Gene conversion is one of the major mutational mechanisms involved in the DNA sequence evolution of duplicated genes. It contributes to create unique patters of DNA polymorphism within species and divergence between species. A typical pattern is so-called concerted evolution, in which the divergence between duplicates is maintained low for a long time because of frequent exchanges of DNA fragments. In addition, gene conversion affects the DNA evolution of duplicates in various ways especially when selection operates. Here, we review theoretical models to understand the evolution of duplicates in both neutral and non-neutral cases. We also explain how these theories contribute to interpreting real polymorphism and divergence data by using some intriguing examples. PMID:24710144
5p13 microduplication syndrome: a new case and better clinical definition of the syndrome.
Novara, Francesca; Alfei, Enrico; D'Arrigo, Stefano; Pantaleoni, Chiara; Beri, Silvana; Achille, Valentina; Sciacca, Francesca L; Giorda, Roberto; Zuffardi, Orsetta; Ciccone, Roberto
2013-01-01
Chromosome 5p13 duplication syndrome (OMIM #613174), a contiguous gene syndrome involving duplication of several genes on chromosome 5p13 including NIPBL (OMIM 608667), has been described in rare patients with developmental delay and learning disability, behavioral problems and peculiar facial dysmorphisms. 5p13 duplications described so far present with variable sizes, from 0.25 to 13.6 Mb, and contain a variable number of genes. Here we report another patient with 5p13 duplication syndrome including NIPBL gene only. Proband's phenotype overlapped that reported in patients with 5p13 microduplication syndrome and especially that of subjects with smaller duplications. Moreover, we better define genotype-phenotype relationship associated with this duplication and confirmed that NIPBL was likely the major dosage sensitive gene for the 5p13 microduplication phenotype. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Bassham, Susan; Cañestro, Cristian; Postlethwait, John H
2008-01-01
Background Gene duplication provides opportunities for lineage diversification and evolution of developmental novelties. Duplicated genes generally either disappear by accumulation of mutations (nonfunctionalization), or are preserved either by the origin of positively selected functions in one or both duplicates (neofunctionalization), or by the partitioning of original gene subfunctions between the duplicates (subfunctionalization). The Pax2/5/8 family of important developmental regulators has undergone parallel expansion among chordate groups. After the divergence of urochordate and vertebrate lineages, two rounds of independent gene duplications resulted in the Pax2, Pax5, and Pax8 genes of most vertebrates (the sister group of the urochordates), and an additional duplication provided the pax2a and pax2b duplicates in teleost fish. Separate from the vertebrate genome expansions, a duplication also created two Pax2/5/8 genes in the common ancestor of ascidian and larvacean urochordates. Results To better understand mechanisms underlying the evolution of duplicated genes, we investigated, in the larvacean urochordate Oikopleura dioica, the embryonic gene expression patterns of Pax2/5/8 paralogs. We compared the larvacean and ascidian expression patterns to infer modular subfunctions present in the single pre-duplication Pax2/5/8 gene of stem urochordates, and we compared vertebrate and urochordate expression to infer the suite of Pax2/5/8 gene subfunctions in the common ancestor of olfactores (vertebrates + urochordates). Expression pattern differences of larvacean and ascidian Pax2/5/8 orthologs in the endostyle, pharynx and hindgut suggest that some ancestral gene functions have been partitioned differently to the duplicates in the two urochordate lineages. Novel expression in the larvacean heart may have resulted from the neofunctionalization of a Pax2/5/8 gene in the urochordates. Expression of larvacean Pax2/5/8 in the endostyle, in sites of epithelial remodeling, and in sensory tissues evokes like functions of Pax2, Pax5 and Pax8 in vertebrate embryos, and may indicate ancient origins for these functions in the chordate common ancestor. Conclusion Comparative analysis of expression patterns of chordate Pax2/5/8 duplicates, rooted on the single-copy Pax2/5/8 gene of amphioxus, whose lineage diverged basally among chordates, provides new insights into the evolution and development of the heart, thyroid, pharynx, stomodeum and placodes in chordates; supports the controversial conclusion that the atrial siphon of ascidians and the otic placode in vertebrates are homologous; and backs the notion that Pax2/5/8 functioned in ancestral chordates to engineer epithelial fusions and perforations, including gill slit openings. PMID:18721460
Toloza-Villalobos, Jessica; Arroyo, José Ignacio; Opazo, Juan C
2015-01-01
The circadian clock is a central oscillator that coordinates endogenous rhythms. Members of six gene families underlie the metabolic machinery of this system. Although this machinery appears to correspond to a highly conserved genetic system in metazoans, it has been recognized that vertebrates possess a more diverse gene inventory than that of non-vertebrates. This difference could have originated in the two successive rounds of whole-genome duplications that took place in the common ancestor of the group. Teleost fish underwent an extra event of whole-genome duplication, which is thought to have provided an abundance of raw genetic material for the biological innovations that facilitated the radiation of the group. In this study, we assessed the relative contributions of whole-genome duplication and small-scale gene duplication to generate the repertoire of genes associated with the circadian clock of teleost fish. To achieve this goal, we annotated genes from six gene families associated with the circadian clock in eight teleost fish species, and we reconstructed their evolutionary history by inferring phylogenetic relationships. Our comparative analysis indicated that teleost species possess a variable repertoire of genes related to the circadian clock gene families and that the actual diversity of these genes has been shaped by a variety of phenomena, such as the complete deletion of ohnologs, the differential retention of genes, and lineage-specific gene duplications. From a functional perspective, the subfunctionalization of two ohnolog genes (PER1a and PER1b) in zebrafish highlights the power of whole-genome duplications to generate biological diversity.
Evolution of tuf genes: ancient duplication, differential loss and gene conversion.
Lathe, W C; Bork, P
2001-08-03
The tuf gene of eubacteria, encoding the EF-tu elongation factor, was duplicated early in the evolution of the taxon. Phylogenetic and genomic location analysis of 20 complete eubacterial genomes suggests that this ancient duplication has been differentially lost and maintained in eubacteria.
Marlétaz, Ferdinand; Maeso, Ignacio; Faas, Laura; Isaacs, Harry V; Holland, Peter W H
2015-08-01
The functional consequences of whole genome duplications in vertebrate evolution are not fully understood. It remains unclear, for instance, why paralogues were retained in some gene families but extensively lost in others. Cdx homeobox genes encode conserved transcription factors controlling posterior development across diverse bilaterians. These genes are part of the ParaHox gene cluster. Multiple Cdx copies were retained after genome duplication, raising questions about how functional divergence, overlap, and redundancy respectively contributed to their retention and evolutionary fate. We examined the degree of regulatory and functional overlap between the three vertebrate Cdx genes using single and triple morpholino knock-down in Xenopus tropicalis followed by RNA-seq. We found that one paralogue, Cdx4, has a much stronger effect on gene expression than the others, including a strong regulatory effect on FGF and Wnt genes. Functional annotation revealed distinct and overlapping roles and subtly different temporal windows of action for each gene. The data also reveal a colinear-like effect of Cdx genes on Hox genes, with repression of Hox paralogy groups 1 and 2, and activation increasing from Hox group 5 to 11. We also highlight cases in which duplicated genes regulate distinct paralogous targets revealing pathway elaboration after whole genome duplication. Despite shared core pathways, Cdx paralogues have acquired distinct regulatory roles during development. This implies that the degree of functional overlap between paralogues is relatively low and that gene expression pattern alone should be used with caution when investigating the functional evolution of duplicated genes. We therefore suggest that developmental programmes were extensively rewired after whole genome duplication in the early evolution of vertebrates.
Zhong, Jinshun; Kellogg, Elizabeth A
2015-01-01
Duplication, retention, and expression of CYCLOIDEA2 (CYC2)-like genes are thought to affect evolution of corolla symmetry. However, exactly what and how changes in CYC2-like genes correlate with the origin of corolla zygomorphy are poorly understood. We inferred and calibrated a densely sampled phylogeny of CYC2-like genes across the Lamiales and examined their expression in early diverging (EDL) and higher core clades (HCL). CYC2-like genes duplicated extensively in Lamiales, at least six times in core Lamiales (CL) around the Cretaceous-Paleogene (K-Pg) boundary, and seven more in EDL relatively more recently. Nested duplications and losses of CYC2-like paralogs are pervasive but may not correlate with transitions in corolla symmetry. We found evidence for dN/dS (ω) variation following gene duplications. CYC2-like paralogs in HCL show differential expression with higher expression in adaxial petals. Asymmetric expression but not recurrent duplication of CYC2-like genes correlates with the origin of corolla zygomorphy. Changes in both cis-regulatory and coding domains of CYC2-like genes are probably crucial for the evolution of corolla zygomorphy. Multiple selection regimes appear likely to play important roles in gene retention. The parallel duplications of CYC2-like genes are after the initial diversification of bumble bees and Euglossine bees. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
Parmar, Manoj B; Wright, Jonathan M
2013-11-01
A whole-genome duplication (WGD) early in the teleost fish lineage makes fish ideal organisms to study the fate of duplicated genes and underlying evolutionary trajectories that have led to the retention of ohnologous gene duplicates in fish genomes. Here, we compare the genomic organization and tissue-specific transcription of the ohnologous fabp7 and fabp10 genes in medaka, three-spined stickleback, and spotted green pufferfish to the well-studied duplicated fabp7 and fabp10 genes of zebrafish. Teleost fabp7 and fabp10 genes contain four exons interrupted by three introns. Polypeptide sequences of Fabp7 and Fabp10 show the highest sequence identity and similarity with their orthologs from vertebrates. Orthology was evident as the ohnologous Fabp7 and Fabp10 polypeptides of teleost fishes each formed distinct clades and clustered together with their orthologs from other vertebrates in a phylogenetic tree. Furthermore, ohnologous teleost fabp7 and fabp10 genes exhibit conserved gene synteny with human FABP7 and chicken FABP10, respectively, which provides compelling evidence that the duplicated fabp7 and fabp10 genes of teleost fishes most likely arose from the well-documented WGD. The tissue-specific distribution of fabp7a, fabp7b, fabp10a, and fabp10b transcripts provides evidence of diverged spatial transcriptional regulation between ohnologous gene duplicates of fabp7 and fabp10 in teleost fishes.
Duplicated growth hormone genes in a passerine bird, the jungle crow (Corvus macrorhynchos).
Arai, Natsumi; Iigo, Masayuki
2010-07-02
Molecular cloning, molecular phylogeny, gene structure and expression analyses of growth hormone (GH) were performed in a passerine bird, the jungle crow (Corvus macrorhynchos). Unexpectedly, duplicated GH cDNA and genes were identified and designated as GH1A and GH1B. In silico analyses identified the zebra finch orthologs. Both GH genes encode 217 amino acid residues and consist of five exons and four introns, spanning 5.2 kbp in GH1A and 4.2 kbp in GH1B. Predicted GH proteins of the jungle crow and zebra finch contain four conserved cysteine residues, suggesting duplicated GH genes are functional. Molecular phylogenetic analysis revealed that duplication of GH genes occur after divergence of the passerine lineage from the other avian orders as has been suggested from partial genomic DNA sequences of passerine GH genes. RT-PCR analyses confirmed expression of GH1A and GH1B in the pituitary gland. In addition, GH1A gene is expressed in all the tissues examined. However, expression of GH1B is confined to several brain areas and blood cells. These results indicate that the regulatory mechanisms of duplicated GH genes are different and that duplicated GH genes exert both endocrine and autocrine/paracrine functions. Copyright 2010 Elsevier Inc. All rights reserved.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-11-29
Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.
Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene
Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil
2007-01-01
Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains. PMID:18047649
Circular DNA Intermediate in the Duplication of Nile Tilapia vasa Genes
Fujimura, Koji; Conte, Matthew A.; Kocher, Thomas D.
2011-01-01
vasa is a highly conserved RNA helicase involved in animal germ cell development. Among vertebrate species, it is typically present as a single copy per genome. Here we report the isolation and sequencing of BAC clones for Nile tilapia vasa genes. Contrary to a previous report that Nile tilapia have a single copy of the vasa gene, we find evidence for at least three vasa gene loci. The vasa gene locus was duplicated from the original site and integrated into two distant novel sites. For one of these insertions we find evidence that the duplication was mediated by a circular DNA intermediate. This mechanism of gene duplication may explain the origin of isolated gene duplicates during the evolution of fish genomes. These data provide a foundation for studying the role of multiple vasa genes in the development of tilapia gonads, and will contribute to investigations of the molecular mechanisms of sex determination and evolution in cichlid fishes. PMID:22216289
Genome-wide identification and evolution of the PIN-FORMED (PIN) gene family in Glycine max.
Liu, Yuan; Wei, Haichao
2017-07-01
Soybean (Glycine max) is one of the most important crop plants. Wild and cultivated soybean varieties have significant differences worth further investigation, such as plant morphology, seed size, and seed coat development; these characters may be related to auxin biology. The PIN gene family encodes essential transport proteins in cell-to-cell auxin transport, but little research on soybean PIN genes (GmPIN genes) has been done, especially with respect to the evolution and differences between wild and cultivated soybean. In this study, we retrieved 23 GmPIN genes from the latest updated G. max genome database; six GmPIN protein sequences were changed compared with the previous database. Based on the Plant Genome Duplication Database, 18 GmPIN genes have been involved in segment duplication. Three pairs of GmPIN genes arose after the second soybean genome duplication, and six occurred after the first genome duplication. The duplicated GmPIN genes retained similar expression patterns. All the duplicated GmPIN genes experienced purifying selection (K a /K s < 1) to prevent accumulation of non-synonymous mutations and thus remained more similar. In addition, we also focused on the artificial selection of the soybean PIN genes. Five artificially selected GmPIN genes were identified by comparing the genome sequence of 17 wild and 14 cultivated soybean varieties. Our research provides useful and comprehensive basic information for understanding GmPIN genes.
Evolution and functional divergence of NLRP genes in mammalian reproductive systems
2009-01-01
Background NLRPs (Nucleotide-binding oligomerization domain, Leucine rich Repeat and Pyrin domain containing Proteins) are members of NLR (Nod-like receptors) protein family. Recent researches have shown that NLRP genes play important roles in both mammalian innate immune system and reproductive system. Several of NLRP genes were shown to be specifically expressed in the oocyte in mammals. The aim of the present work was to study how these genes evolved and diverged after their duplication, as well as whether natural selection played a role during their evolution. Results By using in silico methods, we have evaluated the evolution and functional divergence of NLRP genes, in particular of mouse reproduction-related Nlrp genes. We found that (1) major NLRP genes have been duplicated before the divergence of mammals, with certain lineage-specific duplications in primates (NLRP7 and 11) and in rodents (Nlrp1, 4 and 9 duplicates); (2) tandem duplication events gave rise to a mammalian reproduction-related NLRP cluster including NLRP2, 4, 5, 7, 8, 9, 11, 13 and 14 genes; (3) the function of mammalian oocyte-specific NLRP genes (NLRP4, 5, 9 and 14) might have diverged during gene evolution; (4) recent segmental duplications concerning Nlrp4 copies and vomeronasal 1 receptor encoding genes (V1r) have been undertaken in the mouse; and (5) duplicates of Nlrp4 and 9 in the mouse might have been subjected to adaptive evolution. Conclusion In conclusion, this study brings us novel information on the evolution of mammalian reproduction-related NLRPs. On the one hand, NLRP genes duplicated and functionally diversified in mammalian reproductive systems (such as NLRP4, 5, 9 and 14). On the other hand, during evolution, different lineages adapted to develop their own NLRP genes, particularly in reproductive function (such as the specific expansion of Nlrp4 and Nlrp9 in the mouse). PMID:19682372
Rensing, Stefan A; Ick, Julia; Fawcett, Jeffrey A; Lang, Daniel; Zimmer, Andreas; Van de Peer, Yves; Reski, Ralf
2007-01-01
Background: Analyses of complete genomes and large collections of gene transcripts have shown that most, if not all seed plants have undergone one or more genome duplications in their evolutionary past. Results: In this study, based on a large collection of EST sequences, we provide evidence that the haploid moss Physcomitrella patens is a paleopolyploid as well. Based on the construction of linearized phylogenetic trees we infer the genome duplication to have occurred between 30 and 60 million years ago. Gene Ontology and pathway association of the duplicated genes in P. patens reveal different biases of gene retention compared with seed plants. Conclusion: Metabolic genes seem to have been retained in excess following the genome duplication in P. patens. This might, at least partly, explain the versatility of metabolism, as described for P. patens and other mosses, in comparison to other land plants. PMID:17683536
Screening of duplicated loci reveals hidden divergence patterns in a complex salmonid genome
Limborg, Morten T.; Larson, Wesley; Seeb, Lisa W.; Seeb, James E.
2017-01-01
A whole-genome duplication (WGD) doubles the entire genomic content of a species and is thought to have catalysed adaptive radiation in some polyploid-origin lineages. However, little is known about general consequences of a WGD because gene duplicates (i.e., paralogs) are commonly filtered in genomic studies; such filtering may remove substantial portions of the genome in data sets from polyploid-origin species. We demonstrate a new method that enables genome-wide scans for signatures of selection at both nonduplicated and duplicated loci by taking locus-specific copy number into account. We apply this method to RAD sequence data from different ecotypes of a polyploid-origin salmonid (Oncorhynchus nerka) and reveal signatures of divergent selection that would have been missed if duplicated loci were filtered. We also find conserved signatures of elevated divergence at pairs of homeologous chromosomes with residual tetrasomic inheritance, suggesting that joint evolution of some nondiverged gene duplicates may affect the adaptive potential of these genes. These findings illustrate that including duplicated loci in genomic analyses enables novel insights into the evolutionary consequences of WGDs and local segmental gene duplications.
2010-01-01
Background Salmonids are one of the most intensely studied fish, in part due to their economic and environmental importance, and in part due to a recent whole genome duplication in the common ancestor of salmonids. This duplication greatly impacts species diversification, functional specialization, and adaptation. Extensive new genomic resources have recently become available for Atlantic salmon (Salmo salar), but documentation of allelic versus duplicate reference genes remains a major uncertainty in the complete characterization of its genome and its evolution. Results From existing expressed sequence tag (EST) resources and three new full-length cDNA libraries, 9,057 reference quality full-length gene insert clones were identified for Atlantic salmon. A further 1,365 reference full-length clones were annotated from 29,221 northern pike (Esox lucius) ESTs. Pairwise dN/dS comparisons within each of 408 sets of duplicated salmon genes using northern pike as a diploid out-group show asymmetric relaxation of selection on salmon duplicates. Conclusions 9,057 full-length reference genes were characterized in S. salar and can be used to identify alleles and gene family members. Comparisons of duplicated genes show that while purifying selection is the predominant force acting on both duplicates, consistent with retention of functionality in both copies, some relaxation of pressure on gene duplicates can be identified. In addition, there is evidence that evolution has acted asymmetrically on paralogs, allowing one of the pair to diverge at a faster rate. PMID:20433749
Workshop on Self-Determination in Developing and Evolving Systems
1994-02-18
processes of duplication (e.g. gene duplication, cell duplication, structural enlargement), responses to selfish DNA (e.g. suppression of outlaw...direct their development, then the genes would need some form of environmental feedback. Are there any plausible mechanisms for such feedback? 3. What is...evolutionary innovation, what is the contribution of random mutations, directed mutation, gene conversion, symbiogenesis, fusion, jumping genes or other
Marandel, Lucie; Panserat, Stéphane; Plagnes-Juan, Elisabeth; Arbenoits, Eva; Soengas, José Luis; Bobe, Julien
2017-05-02
Glucose-6-phosphate (G6pc) is a key enzyme involved in the regulation of the glucose homeostasis. The present study aims at revisiting and clarifying the evolutionary history of g6pc genes in vertebrates. g6pc duplications happened by successive rounds of whole genome duplication that occurred during vertebrate evolution. g6pc duplicated before or around Osteichthyes/Chondrichthyes radiation, giving rise to g6pca and g6pcb as a consequence of the second vertebrate whole genome duplication. g6pca was lost after this duplication in Sarcopterygii whereas both g6pca and g6pcb then duplicated as a consequence of the teleost-specific whole genome duplication. One g6pca duplicate was lost after this duplication in teleosts. Similarly one g6pcb2 duplicate was lost at least in the ancestor of percomorpha. The analysis of the evolution of spatial expression patterns of g6pc genes in vertebrates showed that all g6pc were mainly expressed in intestine and liver whereas teleost-specific g6pcb2 genes were mainly and surprisingly expressed in brain and heart. g6pcb2b, one gene previously hypothesised to be involved in the glucose intolerant phenotype in trout, was unexpectedly up-regulated (as it was in liver) by carbohydrates in trout telencephalon without showing significant changes in other brain regions. This up-regulation is in striking contrast with expected glucosensing mechanisms suggesting that its positive response to glucose relates to specific unknown processes in this brain area. Our results suggested that the fixation and the divergence of g6pc duplicated genes during vertebrates' evolution may lead to adaptive novelty and probably to the emergence of novel phenotypes related to glucose homeostasis.
Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.
2015-01-01
The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392
Maciejowski, John; Ahn, James Hyungsoo; Cipriani, Patricia Giselle; Killian, Darrell J.; Chaudhary, Aisha L.; Lee, Ji Inn; Voutev, Roumen; Johnsen, Robert C.; Baillie, David L.; Gunsalus, Kristin C.; Fitch, David H. A.; Hubbard, E. Jane Albert
2005-01-01
We report molecular genetic studies of three genes involved in early germ-line proliferation in Caenorhabditis elegans that lend unexpected insight into a germ-line/soma functional separation of autosomal/X-linked duplicated gene pairs. In a genetic screen for germ-line proliferation-defective mutants, we identified mutations in rpl-11.1 (L11 protein of the large ribosomal subunit), pab-1 [a poly(A)-binding protein], and glp-3/eft-3 (an elongation factor 1-α homolog). All three are members of autosome/X gene pairs. Consistent with a germ-line-restricted function of rpl-11.1 and pab-1, mutations in these genes extend life span and cause gigantism. We further examined the RNAi phenotypes of the three sets of rpl genes (rpl-11, rpl-24, and rpl-25) and found that for the two rpl genes with autosomal/X-linked pairs (rpl-11 and rpl-25), zygotic germ-line function is carried by the autosomal copy. Available RNAi results for highly conserved autosomal/X-linked gene pairs suggest that other duplicated genes may follow a similar trend. The three rpl and the pab-1/2 duplications predate the divergence between C. elegans and C. briggsae, while the eft-3/4 duplication appears to have occurred in the lineage to C. elegans after it diverged from C. briggsae. The duplicated C. briggsae orthologs of the three C. elegans autosomal/X-linked gene pairs also display functional differences between paralogs. We present hypotheses for evolutionary mechanisms that may underlie germ-line/soma subfunctionalization of duplicated genes, taking into account the role of X chromosome silencing in the germ line and analogous mammalian phenomena. PMID:15687263
Gene and domain duplication in the chordate Otx gene family: insights from amphioxus Otx.
Williams, N A; Holland, P W
1998-05-01
We report the genomic organization and deduced protein sequence of a cephalochordate member of the Otx homeobox gene family (AmphiOtx) and show its probable single-copy state in the genome. We also present molecular phylogenetic analysis indicating that there was single ancestral Otx gene in the first chordates which was duplicated in the vertebrate lineage after it had split from the lineage leading to the cephalochordates. Duplication of a C-terminal protein domain has occurred specifically in the vertebrate lineage, strengthening the case for a single Otx gene in an ancestral chordate whose gene structure has been retained in an extant cephalochordate. Comparative analysis of protein sequences and published gene expression patterns suggest that the ancestral chordate Otx gene had roles in patterning the anterior mesendoderm and central nervous system. These roles were elaborated following Otx gene duplication in vertebrates, accompanied by regulatory and structural divergence, particularly of Otx1 descendant genes.
Maroni, G.; Wise, J.; Young, J. E.; Otto, E.
1987-01-01
A search for duplications of the Drosophila melanogaster metallothionein gene (Mtn) yielded numerous examples of this type of chromosomal rearrangement. These duplications are distributed widely—we found them in samples from four continents, and they are functional—larvae carrying Mtn duplications produce more Mtn RNA and tolerate increased cadmium and copper concentrations. Six different duplication types were characterized by restriction-enzyme analyses using probes from the Mtn region. The restriction maps show that in four cases the sequences, ranging in size between 2.2 and 6.0 kb, are arranged as direct, tandem repeats; in two other cases, this basic pattern is modified by the insertion of a putative transposable element into one of the repeated units. Duplications of the D. melanogaster metallothionein gene such as those that we found in natural populations may represent early stages in the evolution of a gene family. PMID:2828157
Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi
2013-01-01
The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs. PMID:23396833
The HOPA Gene Dodecamer Duplication Is Not a Significant Etiological Factor in Autism.
ERIC Educational Resources Information Center
Michaelis, Ron C.; Copeland-Yates, Susan A.; Sossey-Alaoui, Khalid; Skinner, Cindy; Friez, Michael J.; Longshore, John W.; Simensen, Richard J.; Schroer, Richard J.; Stevenson, Roger E.
2000-01-01
A study of 202 patients with autism found the incidence of a dodecamer duplication in the HOPA gene was not significantly different between patients and controls. Three female patients inherited the duplication from nonautistic fathers. Also, there was no systematic skewing of X inactivation in female patients with the duplication. (Contains…
2013-01-01
Background Microsporidian Nosema bombycis has received much attention because the pébrine disease of domesticated silkworms results in great economic losses in the silkworm industry. So far, no effective treatment could be found for pébrine. Compared to other known Nosema parasites, N. bombycis can unusually parasitize a broad range of hosts. To gain some insights into the underlying genetic mechanism of pathological ability and host range expansion in this parasite, a comparative genomic approach is conducted. The genome of two Nosema parasites, N. bombycis and N. antheraeae (an obligatory parasite to undomesticated silkworms Antheraea pernyi), were sequenced and compared with their distantly related species, N. ceranae (an obligatory parasite to honey bees). Results Our comparative genomics analysis show that the N. bombycis genome has greatly expanded due to the following three molecular mechanisms: 1) the proliferation of host-derived transposable elements, 2) the acquisition of many horizontally transferred genes from bacteria, and 3) the production of abundnant gene duplications. To our knowledge, duplicated genes derived not only from small-scale events (e.g., tandem duplications) but also from large-scale events (e.g., segmental duplications) have never been seen so abundant in any reported microsporidia genomes. Our relative dating analysis further indicated that these duplication events have arisen recently over very short evolutionary time. Furthermore, several duplicated genes involving in the cytotoxic metabolic pathway were found to undergo positive selection, suggestive of the role of duplicated genes on the adaptive evolution of pathogenic ability. Conclusions Genome expansion is rarely considered as the evolutionary outcome acting on those highly reduced and compact parasitic microsporidian genomes. This study, for the first time, demonstrates that the parasitic genomes can expand, instead of shrink, through several common molecular mechanisms such as gene duplication, horizontal gene transfer, and transposable element expansion. We also showed that the duplicated genes can serve as raw materials for evolutionary innovations possibly contributing to the increase of pathologenic ability. Based on our research, we propose that duplicated genes of N. bombycis should be treated as primary targets for treatment designs against pébrine. PMID:23496955
Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.
Gout, Jean-Francois; Lynch, Michael
2015-08-01
Whole-genome duplications (WGDs) have contributed to gene-repertoire enrichment in many eukaryotic lineages. However, most duplicated genes are eventually lost and it is still unclear why some duplicated genes are evolutionary successful whereas others quickly turn to pseudogenes. Here, we show that dosage constraints are major factors opposing post-WGD gene loss in several Paramecium species that share a common ancestral WGD. We propose a model where a majority of WGD-derived duplicates preserve their ancestral function and are retained to produce enough of the proteins performing this same ancestral function. Under this model, the expression level of individual duplicated genes can evolve neutrally as long as they maintain a roughly constant summed expression, and this allows random genetic drift toward uneven contributions of the two copies to total expression. Our analysis suggests that once a high level of imbalance is reached, which can require substantial lengths of time, the copy with the lowest expression level contributes a small enough fraction of the total expression that selection no longer opposes its loss. Extension of our analysis to yeast species sharing a common ancestral WGD yields similar results, suggesting that duplicated-gene retention for dosage constraints followed by divergence in expression level and eventual deterministic gene loss might be a universal feature of post-WGD evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Djogbénou, Luc S.; Berthomieu, Arnaud; Makoundou, Patrick; Baba-Moussa, Lamine S.; Fiston-Lavier, Anna-Sophie; Belkhir, Khalid; Labbé, Pierrick; Weill, Mylène
2016-01-01
Gene copy-number variations are widespread in natural populations, but investigating their phenotypic consequences requires contemporary duplications under selection. Such duplications have been found at the ace-1 locus (encoding the organophosphate and carbamate insecticides’ target) in the mosquito Anopheles gambiae (the major malaria vector); recent studies have revealed their intriguing complexity, consistent with the involvement of various numbers and types (susceptible or resistant to insecticide) of copies. We used an integrative approach, from genome to phenotype level, to investigate the influence of duplication architecture and gene-dosage on mosquito fitness. We found that both heterogeneous (i.e., one susceptible and one resistant ace-1 copy) and homogeneous (i.e., identical resistant copies) duplications segregated in field populations. The number of copies in homogeneous duplications was variable and positively correlated with acetylcholinesterase activity and resistance level. Determining the genomic structure of the duplicated region revealed that, in both types of duplication, ace-1 and 11 other genes formed tandem 203kb amplicons. We developed a diagnostic test for duplications, which showed that ace-1 was amplified in all 173 resistant mosquitoes analyzed (field-collected in several African countries), in heterogeneous or homogeneous duplications. Each type was associated with different fitness trade-offs: heterogeneous duplications conferred an intermediate phenotype (lower resistance and fitness costs), whereas homogeneous duplications tended to increase both resistance and fitness cost, in a complex manner. The type of duplication selected seemed thus to depend on the intensity and distribution of selection pressures. This versatility of trade-offs available through gene duplication highlights the importance of large mutation events in adaptation to environmental variation. This impressive adaptability could have a major impact on vector control in Africa. PMID:27918584
Innes, Roger W; Ameline-Torregrosa, Carine; Ashfield, Tom; Cannon, Ethalinda; Cannon, Steven B; Chacko, Ben; Chen, Nicolas W G; Couloux, Arnaud; Dalwani, Anita; Denny, Roxanne; Deshpande, Shweta; Egan, Ashley N; Glover, Natasha; Hans, Christian S; Howell, Stacy; Ilut, Dan; Jackson, Scott; Lai, Hongshing; Mammadov, Jafar; Del Campo, Sara Martin; Metcalf, Michelle; Nguyen, Ashley; O'Bleness, Majesta; Pfeil, Bernard E; Podicheti, Ram; Ratnaparkhe, Milind B; Samain, Sylvie; Sanders, Iryna; Ségurens, Béatrice; Sévignac, Mireille; Sherman-Broyles, Sue; Thareau, Vincent; Tucker, Dominic M; Walling, Jason; Wawrzynski, Adam; Yi, Jing; Doyle, Jeff J; Geffroy, Valérie; Roe, Bruce A; Maroof, M A Saghai; Young, Nevin D
2008-12-01
The genomes of most, if not all, flowering plants have undergone whole genome duplication events during their evolution. The impact of such polyploidy events is poorly understood, as is the fate of most duplicated genes. We sequenced an approximately 1 million-bp region in soybean (Glycine max) centered on the Rpg1-b disease resistance gene and compared this region with a region duplicated 10 to 14 million years ago. These two regions were also compared with homologous regions in several related legume species (a second soybean genotype, Glycine tomentella, Phaseolus vulgaris, and Medicago truncatula), which enabled us to determine how each of the duplicated regions (homoeologues) in soybean has changed following polyploidy. The biggest change was in retroelement content, with homoeologue 2 having expanded to 3-fold the size of homoeologue 1. Despite this accumulation of retroelements, over 77% of the duplicated low-copy genes have been retained in the same order and appear to be functional. This finding contrasts with recent analyses of the maize (Zea mays) genome, in which only about one-third of duplicated genes appear to have been retained over a similar time period. Fluorescent in situ hybridization revealed that the homoeologue 2 region is located very near a centromere. Thus, pericentromeric localization, per se, does not result in a high rate of gene inactivation, despite greatly accelerated retrotransposon accumulation. In contrast to low-copy genes, nucleotide-binding-leucine-rich repeat disease resistance gene clusters have undergone dramatic species/homoeologue-specific duplications and losses, with some evidence for partitioning of subfamilies between homoeologues.
Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; ...
2016-03-24
C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes ismore » enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Moreover, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species.« less
Assessing duplication and loss of APETALA1/FRUITFULL homologs in Ranunculales
Pabón-Mora, Natalia; Hidalgo, Oriane; Gleissberg, Stefan; Litt, Amy
2013-01-01
Gene duplication and loss provide raw material for evolutionary change within organismal lineages as functional diversification of gene copies provide a mechanism for phenotypic variation. Here we focus on the APETALA1/FRUITFULL MADS-box gene lineage evolution. AP1/FUL genes are angiosperm-specific and have undergone several duplications. By far the most significant one is the core-eudicot duplication resulting in the euAP1 and euFUL clades. Functional characterization of several euAP1 and euFUL genes has shown that both function in proper floral meristem identity, and axillary meristem repression. Independently, euAP1 genes function in floral meristem and sepal identity, whereas euFUL genes control phase transition, cauline leaf growth, compound leaf morphogenesis and fruit development. Significant functional variation has been detected in the function of pre-duplication basal-eudicot FUL-like genes, but the underlying mechanisms for change have not been identified. FUL-like genes in the Papaveraceae encode all functions reported for euAP1 and euFUL genes, whereas FUL-like genes in Aquilegia (Ranunculaceae) function in inflorescence development and leaf complexity, but not in flower or fruit development. Here we isolated FUL-like genes across the Ranunculales and used phylogenetic approaches to analyze their evolutionary history. We identified an early duplication resulting in the RanFL1 and RanFL2 clades. RanFL1 genes were present in all the families sampled and are mostly under strong negative selection in the MADS, I and K domains. RanFL2 genes were only identified from Eupteleaceae, Papaveraceae s.l., Menispermaceae and Ranunculaceae and show relaxed purifying selection at the I and K domains. We discuss how asymmetric sequence diversification, new motifs, differences in codon substitutions and likely protein-protein interactions resulting from this Ranunculiid-specific duplication can help explain the functional differences among basal-eudicot FUL-like genes. PMID:24062757
Gene duplication in the major insecticide target site, Rdl, in Drosophila melanogaster
Remnant, Emily J.; Good, Robert T.; Schmidt, Joshua M.; Lumb, Christopher; Robin, Charles; Daborn, Phillip J.; Batterham, Philip
2013-01-01
The Resistance to Dieldrin gene, Rdl, encodes a GABA-gated chloride channel subunit that is targeted by cyclodiene and phenylpyrazole insecticides. The gene was first characterized in Drosophila melanogaster by genetic mapping of resistance to the cyclodiene dieldrin. The 4,000-fold resistance observed was due to a single amino acid replacement, Ala301 to Ser. The equivalent change was subsequently identified in Rdl orthologs of a large range of resistant insect species. Here, we report identification of a duplication at the Rdl locus in D. melanogaster. The 113-kb duplication contains one WT copy of Rdl and a second copy with two point mutations: an Ala301 to Ser resistance mutation and Met360 to Ile replacement. Individuals with this duplication exhibit intermediate dieldrin resistance compared with single copy Ser301 homozygotes, reduced temperature sensitivity, and altered RNA editing associated with the resistant allele. Ectopic recombination between Roo transposable elements is involved in generating this genomic rearrangement. The duplication phenotypes were confirmed by construction of a transgenic, artificial duplication integrating the 55.7-kb Rdl locus with a Ser301 change into an Ala301 background. Gene duplications can contribute significantly to the evolution of insecticide resistance, most commonly by increasing the amount of gene product produced. Here however, duplication of the Rdl target site creates permanent heterozygosity, providing unique potential for adaptive mutations to accrue in one copy, without abolishing the endogenous role of an essential gene. PMID:23959864
Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A
2015-06-08
The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Extensive Local Gene Duplication and Functional Divergence among Paralogs in Atlantic Salmon
Warren, Ian A.; Ciborowski, Kate L.; Casadei, Elisa; Hazlerigg, David G.; Martin, Sam; Jordan, William C.; Sumner, Seirian
2014-01-01
Many organisms can generate alternative phenotypes from the same genome, enabling individuals to exploit diverse and variable environments. A prevailing hypothesis is that such adaptation has been favored by gene duplication events, which generate redundant genomic material that may evolve divergent functions. Vertebrate examples of recent whole-genome duplications are sparse although one example is the salmonids, which have undergone a whole-genome duplication event within the last 100 Myr. The life-cycle of the Atlantic salmon, Salmo salar, depends on the ability to produce alternating phenotypes from the same genome, to facilitate migration and maintain its anadromous life history. Here, we investigate the hypothesis that genome-wide and local gene duplication events have contributed to the salmonid adaptation. We used high-throughput sequencing to characterize the transcriptomes of three key organs involved in regulating migration in S. salar: Brain, pituitary, and olfactory epithelium. We identified over 10,000 undescribed S. salar sequences and designed an analytic workflow to distinguish between paralogs originating from local gene duplication events or from whole-genome duplication events. These data reveal that substantial local gene duplications took place shortly after the whole-genome duplication event. Many of the identified paralog pairs have either diverged in function or become noncoding. Future functional genomics studies will reveal to what extent this rich source of divergence in genetic sequence is likely to have facilitated the evolution of extreme phenotypic plasticity required for an anadromous life-cycle. PMID:24951567
Guselnikov, S.V.; Grayfer, L.; De Jesús Andino, F.; Rogozin, I.B.; Robert, J.; Taranin, A.V.
2015-01-01
The ITAM-bearing transmembrane signaling subunits (TSS) are indispensable components of activating leukocyte receptor complexes. The TSS-encoding genes map to paralogous chromosomal regions, which are thought to arise from ancient genome tetraploidization(s). To assess a possible role of tetraploidization in the TSS evolution, we studied TSS and other functionally linked genes in the amphibian species Xenopus laevis whose genome was duplicated about 40 MYR ago. We found that X. laevis has retained a duplicated set of sixteen TSS genes, all except one being transcribed. Furthermore, duplicated TCRα loci and genes encoding TSS-coupling protein kinases have also been retained. No clear evidence for functional divergence of the TSS paralogs was obtained from gene expression and sequence analyses. We suggest that the main factor of maintenance of duplicated TSS genes in X. laevis was a protein dosage effect and that this effect might have facilitated the TSS set expansion in early vertebrates. PMID:26170006
The Sequence and Analysis of Duplication Rich Human Chromosome 16
DOE R&D Accomplishments Database
Martin, Joel; Han, Cliff; Gordon, Laurie A.; Terry, Astrid; Prabhakar, Shyam; She, Xinwei; Xie, Gary; Hellsten, Uffe; Man Chan, Yee; Altherr, Michael; Couronne, Olivier; Aerts, Andrea; Bajorek, Eva; Black, Stacey; Blumer, Heather; Branscomb, Elbert; Brown, Nancy C.; Bruno, William J.; Buckingham, Judith M.; Callen, David F.; Campbell, Connie S.; Campbell, Mary L.; Campbell, Evelyn W.; Caoile, Chenier; Challacombe, Jean F.; Chasteen, Leslie A.; Chertkov, Olga; Chi, Han C.; Christensen, Mari; Clark, Lynn M.; Cohn, Judith D.; Denys, Mirian; Detter, John C.; Dickson, Mark; Dimitrijevic-Bussod, Mira; Escobar, Julio; Fawcett, Joseph J.; Flowers, Dave; Fotopulos, Dea; Glavina, Tijana; Gomez, Maria; Gonzales, Eidelyn; Goodstein, David; Goodwin, Lynne A.; Grady, Deborah L.; Grigoriev, Igor; Groza, Matthew; Hammon, Nancy; Hawkins, Trevor; Haydu, Lauren; Hildebrand, Carl E.; Huang, Wayne; Israni, Sanjay; Jett, Jamie; Jewett, Phillip E.; Kadner, Kristen; Kimball, Heather; Kobayashi, Arthur; Krawczyk, Marie-Claude; Leyba, Tina; Longmire, Jonathan L.; Lopez, Frederick; Lou, Yunian; Lowry, Steve; Ludeman, Thom; Mark, Graham A.; Mcmurray, Kimberly L.; Meincke, Linda J.; Morgan, Jenna; Moyzis, Robert K.; Mundt, Mark O.; Munk, A. Christine; Nandkeshwar, Richard D.; Pitluck, Sam; Pollard, Martin; Predki, Paul; Parson-Quintana, Beverly; Ramirez, Lucia; Rash, Sam; Retterer, James; Ricke, Darryl O.; Robinson, Donna L.; Rodriguez, Alex; Salamov, Asaf; Saunders, Elizabeth H.; Scott, Duncan; Shough, Timothy; Stallings, Raymond L.; Stalvey, Malinda; Sutherland, Robert D.; Tapia, Roxanne; Tesmer, Judith G.; Thayer, Nina; Thompson, Linda S.; Tice, Hope; Torney, David C.; Tran-Gyamfi, Mary; Tsai, Ming; Ulanovsky, Levy E.; Ustaszewska, Anna; Vo, Nu; White, P. Scott; Williams, Albert L.; Wills, Patricia L.; Wu, Jung-Rung; Wu, Kevin; Yang, Joan; DeJong, Pieter; Bruce, David; Doggett, Norman; Deaven, Larry; Schmutz, Jeremy; Grimwood, Jane; Richardson, Paul; et al.
2004-01-01
We report here the 78,884,754 base pairs of finished human chromosome 16 sequence, representing over 99.9 percent of its euchromatin. Manual annotation revealed 880 protein coding genes confirmed by 1,637 aligned transcripts, 19 tRNA genes, 341 pseudogenes and 3 RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukemia. Several large-scale structural polymorphisms spanning hundreds of kilobasepairs were identified and result in gene content differences across humans. One of the unique features of chromosome 16 is its high level of segmental duplication, ranked among the highest of the human autosomes. While the segmental duplications are enriched in the relatively gene poor pericentromere of the p-arm, some are involved in recent gene duplication and conversion events which are likely to have had an impact on the evolution of primates and human disease susceptibility.
Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.
2016-01-01
Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576
The evolution of duplicate gene expression in mammalian organs
Guschanski, Katerina; Warnefors, Maria; Kaessmann, Henrik
2017-01-01
Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates. PMID:28743766
Xp22.33p22.12 Duplication in a Patient with Intellectual Disability and Dysmorphic Facial Features
Lintas, Carla; Picinelli, Chiara; Piras, Ignazio S.; Sacco, Roberto; Gabriele, Stefano; Verdecchia, Magda; Persico, Antonio M.
2016-01-01
A novel 19.98-Mb duplication in chromosome Xp22.33p22.12 was detected by array CGH in a 30-year-old man affected by intellectual disability, congenital hypotonia and dysmorphic features. The duplication encompasses more than 100 known genes. Many of these genes (such as neuroligin 4, cyclin-dependent kinase like 5, and others) have already correlated with X-linked intellectual disability and/or neurodevelopmental disorders. Due to the high number of potentially pathogenic genes involved in the reported duplication, we cannot correlate the clinical phenotype to a single gene. Indeed, we suggest that the resulting clinical phenotype may have arisen from the overexpression and consequent perturbation of fine gene dosage. PMID:26997944
Xp22.33p22.12 Duplication in a Patient with Intellectual Disability and Dysmorphic Facial Features.
Lintas, Carla; Picinelli, Chiara; Piras, Ignazio S; Sacco, Roberto; Gabriele, Stefano; Verdecchia, Magda; Persico, Antonio M
2016-02-01
A novel 19.98-Mb duplication in chromosome Xp22.33p22.12 was detected by array CGH in a 30-year-old man affected by intellectual disability, congenital hypotonia and dysmorphic features. The duplication encompasses more than 100 known genes. Many of these genes (such as neuroligin 4, cyclin-dependent kinase like 5, and others) have already correlated with X-linked intellectual disability and/or neurodevelopmental disorders. Due to the high number of potentially pathogenic genes involved in the reported duplication, we cannot correlate the clinical phenotype to a single gene. Indeed, we suggest that the resulting clinical phenotype may have arisen from the overexpression and consequent perturbation of fine gene dosage.
Pantzartzi, Chrysoula N.; Drosopoulou, Elena; Scouras, Zacharias G.
2013-01-01
Hsp90s, members of the Heat Shock Protein class, protect the structure and function of proteins and play a significant task in cellular homeostasis and signal transduction. In order to determine the number of hsp90 gene copies and encoded proteins in fungal and animal lineages and through that key duplication events that this family has undergone, we collected and evaluated Hsp90 protein sequences and corresponding Expressed Sequence Tags and analyzed available genomes from various taxa. We provide evidence for duplication events affecting either single species or wider taxonomic groups. With regard to Fungi, duplicated genes have been detected in several lineages. In invertebrates, we demonstrate key duplication events in certain clades of Arthropoda and Mollusca, and a possible gene loss event in a hymenopteran family. Finally, we infer that the duplication event responsible for the two (a and b) isoforms in vertebrates occurred probably shortly after the split of Hyperoartia and Gnathostomata. PMID:24066039
Barbaro, Michela; Oscarson, Mikael; Schoumans, Jacqueline; Staaf, Johan; Ivarsson, Sten A; Wedell, Anna
2007-08-01
Testis development is a tightly regulated process that requires an efficient and coordinated spatiotemporal action of many factors, and it has been shown that several genes involved in gonadal development exert a dosage effect. Chromosomal imbalances have been reported in several patients presenting with gonadal dysgenesis as part of severe dysmorphic phenotypes. We screened for submicroscopic DNA copy number variations in two sisters with an apparent normal 46,XY karyotype and female external genitalia due to gonadal dysgenesis, and in which mutations in known candidate genes had been excluded. By high-resolution tiling bacterial artificial chromosome array comparative genome hybridization, a submicroscopic duplication at Xp21.2 containing DAX1 (NR0B1) was identified. Using fluorescence in situ hybridization, multiple ligation probe amplification, and PCR, the rearrangement was further characterized. This revealed a 637-kb tandem duplication that in addition to DAX1 includes the four MAGEB genes, the hypothetical gene CXorf21, GK, and part of the MAP3K7IP3 gene. Sequencing and analysis of the breakpoint boundaries and duplication junction suggest that the duplication originated through a coupled homologous and nonhomologous recombination process. This represents the first duplication on Xp21.2 identified in patients with isolated gonadal dysgenesis because all previously described XY subjects with Xp21 duplications presented with gonadal dysgenesis as part of a more complex phenotype, including mental retardation and/or malformations. Thus, our data support DAX1 as a dosage sensitive gene responsible for gonadal dysgenesis and highlight the importance of considering DAX1 locus duplications in the evaluation of all cases of 46,XY gonadal dysgenesis.
Mielczarek, M; Frąszczak, M; Giannico, R; Minozzi, G; Williams, John L; Wojdak-Maksymiec, K; Szyda, J
2017-07-01
Thirty-two whole genome DNA sequences of cows were analyzed to evaluate inter-individual variability in the distribution and length of copy number variations (CNV) and to functionally annotate CNV breakpoints. The total number of deletions per individual varied between 9,731 and 15,051, whereas the number of duplications was between 1,694 and 5,187. Most of the deletions (81%) and duplications (86%) were unique to a single cow. No relation between the pattern of variant sharing and a family relationship or disease status was found. The animal-averaged length of deletions was from 5,234 to 9,145 bp and the average length of duplications was between 7,254 and 8,843 bp. Highly significant inter-individual variation in length and number of CNV was detected for both deletions and duplications. The majority of deletion and duplication breakpoints were located in intergenic regions and introns, whereas fewer were identified in noncoding transcripts and splice regions. Only 1.35 and 0.79% of the deletion and duplication breakpoints were observed within coding regions. A gene with the highest number of deletion breakpoints codes for protein kinase cGMP-dependent type I, whereas the T-cell receptor α constant gene had the most duplication breakpoints. The functional annotation of genes with the largest incidence of deletion/duplication breakpoints identified 87/112 Kyoto Encyclopedia of Genes and Genomes pathways, but none of the pathways were significantly enriched or depleted with breakpoints. The analysis of Gene Ontology (GO) terms revealed that a cluster with the highest enrichment score among genes with many deletion breakpoints was represented by GO terms related to ion transport, whereas the GO term cluster mostly enriched among the genes with many duplication breakpoints was related to binding of macromolecules. Furthermore, when considering the number of deletion breakpoints per gene functional category, no significant differences were observed between the "housekeeping" and "strong selection" categories, but genes representing the "low selection pressure" group showed a significantly higher number of breakpoints. Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
GENE-dosage effects on fitness in recent adaptive duplications: ace-1 in the mosquito Culex pipiens.
Labbé, Pierrick; Milesi, Pascal; Yébakima, André; Pasteur, Nicole; Weill, Mylène; Lenormand, Thomas
2014-07-01
Gene duplications have long been advocated to contribute to the evolution of new functions. The role of selection in their early spread is more controversial. Unless duplications are favored for a direct benefit of increased expression, they are likely detrimental. In this article, we investigated the case of duplications favored because they combine already functionally divergent alleles. Their gene-dosage/fitness relations are poorly known because selection may operate on both overall expression and duplicates relative dosage. Using the well-documented case of Culex pipiens resistance to insecticides, we compared strains with various ace-1 allele combinations, including two duplicated alleles carrying both susceptible and resistant copies. The overall protein activity was nearly additive, but, surprisingly, fitness correlated better with the relative proportion of susceptible and resistant copies rather than any absolute measure of activity. Gene dosage is thus crucial, duplications stabilizing a "heterozygote" phenotype. It corroborates the view that these were favored because they fix a permanent heterosis, thereby solving the irreducible trade-off between resistance and synaptic transmission. Moreover, we showed that the contrasted successes of the two duplicated alleles in natural populations depend on genetic changes unrelated to ace-1, confirming the probable implication of recessive sublethal mutations linked to structural rearrangements in some duplications. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.
Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.
Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H
2009-07-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.
Calcium-Activated Potassium (BK) Channels Are Encoded by Duplicate slo1 Genes in Teleost Fishes
Deitcher, David L.; Bass, Andrew H.
2009-01-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue. PMID:19321796
An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.
Shao, Mingfu; Lin, Yu; Moret, Bernard M E
2015-05-01
Computing the edit distance between two genomes is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be computed in linear time for genomes without duplicate genes, while the problem becomes NP-hard in the presence of duplicate genes. In this article, we propose an integer linear programming (ILP) formulation to compute the DCJ distance between two genomes with duplicate genes. We also provide an efficient preprocessing approach to simplify the ILP formulation while preserving optimality. Comparison on simulated genomes demonstrates that our method outperforms MSOAR in computing the edit distance, especially when the genomes contain long duplicated segments. We also apply our method to assign orthologous gene pairs among human, mouse, and rat genomes, where once again our method outperforms MSOAR.
Wentz, Elisabet; Vujic, Mihailo; Kärrstedt, Ewa-Lotta; Erlandsson, Anna; Gillberg, Christopher
2014-05-01
Autism spectrum disorder, severe behaviour problems and duplication of the Xq12 to Xq13 region have recently been described in three male relatives. To describe the psychiatric comorbidity and dysmorphic features, including craniosynostosis, of two male siblings with autism and duplication of the Xq13 to Xq21 region, and attempt to narrow down the number of duplicated genes proposed to be leading to global developmental delay and autism. We performed DNA sequencing of certain exons of the TWIST1 gene, the FGFR2 gene and the FGFR3 gene. We also performed microarray analysis of the DNA. In addition to autism, the two male siblings exhibited severe learning disability, self-injurious behaviour, temper tantrums and hyperactivity, and had no communicative language. Chromosomal analyses were normal. Neither of the two siblings showed mutations of the sequenced exons known to produce craniosynostosis. The microarray analysis detected an extra copy of a region on the long arm of chromosome X, chromosome band Xq13.1-q21.1. Comparison of our two cases with previously described patients allowed us to identify three genes predisposing for autism in the duplicated chromosomal region. Sagittal craniosynostosis is also a new finding linked to the duplication.
Abi Rached, L; McDermott, M F; Pontarotti, P
1999-02-01
The human Major Histocompatibility Complex (MHC) shares similarities with three other chromosome regions in human. This could be the vestige of ancestral large scale duplications. We discuss here the possibility i) that these duplications occurred during two rounds of tetraploidization supposed to have taken place during chordate evolution before the jawed vertebrate radiation, and ii) that one of the quadruplicate regions, relaxed of functional constraints, gave rise to the vertebrate MHC by a quick round of gene cis-duplication and cis-exon shuffling. These different rounds of cis-duplications and exon shufflings allowed the emergence of new genes participating in novel biological functions i.e. adaptive immune responses. Cis-duplications and cis-exon shufflings are ongoing processes in the evolution of some of these genes in this region as they have occurred and were fixed at different times and in different lineages during vertebrate evolution. In contrast, other genes within the MHC have remained stable since the emergence of jawed vertebrates.
Restriction and Recruitment—Gene Duplication and the Origin and Evolution of Snake Venom Toxins
Hargreaves, Adam D.; Swain, Martin T.; Hegarty, Matthew J.; Logan, Darren W.; Mulley, John F.
2014-01-01
Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel combinations of transcription factor binding sites in upstream regulatory regions. Therefore, although this hypothesis concerning the evolution of snake venom is very unlikely and should be regarded with caution, it is nonetheless often assumed to be established fact, hindering research into the true origins of snake venom toxins. To critically evaluate this hypothesis, we have generated transcriptomic data for body tissues and salivary and venom glands from five species of venomous and nonvenomous reptiles. Our comparative transcriptomic analysis of these data reveals that snake venom does not evolve through the hypothesized process of duplication and recruitment of genes encoding body proteins. Indeed, our results show that many proposed venom toxins are in fact expressed in a wide variety of body tissues, including the salivary gland of nonvenomous reptiles and that these genes have therefore been restricted to the venom gland following duplication, not recruited. Thus, snake venom evolves through the duplication and subfunctionalization of genes encoding existing salivary proteins. These results highlight the danger of the elegant and intuitive “just-so story” in evolutionary biology. PMID:25079342
Li, Qi; Zhang, Ning; Zhang, Liangsheng; Ma, Hong
2015-04-01
Rhomboid proteins are intramembrane serine proteases that are involved in a plethora of biological functions, but the evolutionary history of the rhomboid gene family is not clear. We performed a comprehensive molecular evolutionary analysis of the rhomboid gene family and also investigated the organization and sequence features of plant rhomboids in different subfamilies. Our results showed that eukaryotic rhomboids could be divided into five subfamilies (RhoA-RhoD and PARL). Most orthology groups appeared to be conserved only as single or low-copy genes in all lineages in RhoB-RhoD and PARL, whereas RhoA genes underwent several duplication events, resulting in multiple gene copies. These duplication events were due to whole genome duplications in plants and animals and the duplicates might have experienced functional divergence. We also identified a novel group of plant rhomboid (RhoB1) that might have lost their enzymatic activity; their existence suggests that they might have evolved new mechanisms. Plant and animal rhomboids have similar evolutionary patterns. In addition, there are mutations affecting key active sites in RBL8, RBL9 and one of the Brassicaceae PARL duplicates. This study delineates a possible evolutionary scheme for intramembrane proteins and illustrates distinct fates and a mechanism of evolution of gene duplicates. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
A diffusion model for the fate of tandem gene duplicates in diploids.
O'Hely, Martin
2007-06-01
Suppose one chromosome in one member of a population somehow acquires a duplicate copy of the gene, fully linked to the original gene's locus. Preservation is the event that eventually every chromosome in the population is a descendant of the one which initially carried the duplicate. For a haploid population in which the absence of all copies of the gene is lethal, the probability of preservation has recently been estimated via a diffusion approximation. That approximation is shown to carry over to the case of diploids and arbitrary strong selection against the absence of the gene. The techniques used lead to some new results. In the large population limit, it is shown that the relative probability that descendants of a small number of individuals carrying multiple copies of the gene fix in the population is proportional to the number of copies carried. The probability of preservation is approximated when chromosomes carrying two copies of the gene are subject to additional, fully non-functionalizing mutations, thereby modelling either an additional cost of replicating a longer genome, or a partial duplication of the gene. In the latter case the preservation probability depends only on the mutation rate to null for the duplicated portion of the gene.
Divergence and evolution of cotton bHLH proteins from diploid to allotetraploid.
Liu, Bingliang; Guan, Xueying; Liang, Wenhua; Chen, Jiedan; Fang, Lei; Hu, Yan; Guo, Wangzhen; Rong, Junkang; Xu, Guohua; Zhang, Tianzhen
2018-02-23
Polyploidy is considered a major driving force in genome expansion, yielding duplicated genes whose expression may be conserved or divergence as a consequence of polyploidization. We compared the genome sequences of tetraploid cotton (Gossypium hirsutum) and its two diploid progenitors, G. arboreum and G. raimondii, and found that the bHLH genes were conserved over the polyploidization. Oppositely, the expression of the homeolgous gene pairs was diversified. The biased homeologous proportion for bHLH family is significantly higher (64.6%) than the genome wide homeologous expression bias (40%). Compared with cacao (T. cacao), orthologous genes only accounted for a small proportion (41.7%) of whole cotton bHLHs family. The further Ks analysis indicated that bHLH genes underwent at least two distinct episodes of whole genome duplication: a recent duplication (1.0-60.0 million years ago, MYA, 0.005 < Ks < 0.312) and an old duplication (> 60.0 MYA, 0.312 < Ks < 3.0). The old duplication event might have played a key role in the expansion of the bHLH family. Both recent and old duplicated pairs (68.8%) showed a divergent expression profile, indicating specialized functions. The expression diversification of the duplicated genes suggested it might be a universal feature of the long-term evolution of cotton. Overview of cotton bHLH proteins indicated a conserved and divergent evolution from diploids to allotetraploid. Our results provided an excellent example for studying the long-term evolution of polyploidy.
Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan
2012-01-01
Background HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. Methods and Findings In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Conclusions Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication. PMID:22808219
Zhou, Mi; Yan, Jun; Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan
2012-01-01
HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.
C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes ismore » enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Moreover, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species.« less
A duplicated PLP gene causing Pelizaeus-Merzbacher disease detected by comparative multiplex PCR
DOE Office of Scientific and Technical Information (OSTI.GOV)
Inoue, K.; Sugiyama, N.; Kawanishi, C.
1996-07-01
Pelizaeus-Merzbacher disease (PMD) is an X-linked dysmyelinating disorder caused by abnormalities in the proteolipid protein (PLP) gene, which is essential for oligodendrocyte differentiation and CNS myelin formation. Although linkage analysis has shown the homogeneity at the PLP locus in patients with PMD, exonic mutations in the PLP gene have been identified in only 10% - 25% of all cases, which suggests the presence of other genetic aberrations, including gene duplication. In this study, we examined five families with PMD not carrying exonic mutations in PLP gene, using comparative multiplex PCR (CM-PCR) as a semiquantitative assay of gene dosage. PLP genemore » duplications were identified in four families by CM-PCR and confirmed in three families by densitometric RFLP analysis. Because a homologous myelin protein gene, PMP22, is duplicated in the majority of patients with Charcot-Marie-Tooth 1A, PLP gene overdosage may be an important genetic abnormality in PMD and affect myelin formation. 38 ref., 5 figs., 2 tabs.« less
The Genomic Basis of Evolutionary Innovation in Pseudomonas aeruginosa
Wagner, Andreas; MacLean, R. Craig
2016-01-01
Novel traits play a key role in evolution, but their origins remain poorly understood. Here we address this problem by using experimental evolution to study bacterial innovation in real time. We allowed 380 populations of Pseudomonas aeruginosa to adapt to 95 different carbon sources that challenged bacteria with either evolving novel metabolic traits or optimizing existing traits. Whole genome sequencing of more than 80 clones revealed profound differences in the genetic basis of innovation and optimization. Innovation was associated with the rapid acquisition of mutations in genes involved in transcription and metabolism. Mutations in pre-existing duplicate genes in the P. aeruginosa genome were common during innovation, but not optimization. These duplicate genes may have been acquired by P. aeruginosa due to either spontaneous gene amplification or horizontal gene transfer. High throughput phenotype assays revealed that novelty was associated with increased pleiotropic costs that are likely to constrain innovation. However, mutations in duplicate genes with close homologs in the P. aeruginosa genome were associated with low pleiotropic costs compared to mutations in duplicate genes with distant homologs in the P. aeruginosa genome, suggesting that functional redundancy between duplicates facilitates innovation by buffering pleiotropic costs. PMID:27149698
Polan, Michelle B; Pastore, Matthew T; Steingass, Katherine; Hashimoto, Sayaka; Thrush, Devon L; Pyatt, Robert; Reshmi, Shalini; Gastier-Foster, Julie M; Astbury, Caroline; McBride, Kim L
2014-01-01
Recent studies have shown that certain copy number variations (CNV) are associated with a wide range of neurodevelopmental disorders, including autism spectrum disorders (ASD), bipolar disorder and intellectual disabilities. Implicated regions and genes have comprised a variety of post synaptic complex proteins and neurotransmitter receptors, including gamma-amino butyric acid A (GABAA). Clusters of GABAA receptor subunit genes are found on chromosomes 4p12, 5q34, 6q15 and 15q11-13. Maternally inherited 15q11-13 duplications among individuals with neurodevelopmental disorders are well described, but few case reports exist for the other regions. We describe a family with a 2.42 Mb duplication at chromosome 4p13 to 4p12, identified in the index case and other family members by oligonucleotide array comparative genomic hybridization, that contains 13 genes including a cluster of four GABAA receptor subunit genes. Fluorescent in-situ hybridization was used to confirm the duplication. The duplication segregates with a variety of neurodevelopmental disorders in this family, including ASD (index case), developmental delay, dyspraxia and ADHD (brother), global developmental delays (brother), learning disabilities (mother) and bipolar disorder (maternal grandmother). In addition, we identified and describe another individual unrelated to this family, with a similar duplication, who was diagnosed with ASD, ADHD and borderline intellectual disability. The 4p13 to 4p12 duplication appears to confer a susceptibility to a variety of neurodevelopmental disorders in these two families. We hypothesize that the duplication acts through a dosage effect of GABAA receptor subunit genes, adding evidence for alterations in the GABAergic system in the etiology of neurodevelopmental disorders. PMID:23695283
Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.
Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich
2004-03-01
By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.
Gacesa, Ranko; Chung, Ray; Dunn, Simon R; Weston, Andrew J; Jaimes-Becerra, Adrian; Marques, Antonio C; Morandini, André C; Hranueli, Daslav; Starcevic, Antonio; Ward, Malcolm; Long, Paul F
2015-10-13
Gene duplication followed by adaptive selection is a well-accepted process leading to toxin diversification in venoms. However, emergent genomic, transcriptomic and proteomic evidence now challenges this role to be at best equivocal to other processess . Cnidaria are arguably the most ancient phylum of the extant metazoa that are venomous and such provide a definitive ancestral anchor to examine the evolution of this trait. Here we compare predicted toxins from the translated genome of the coral Acropora digitifera to putative toxins revealed by proteomic analysis of soluble proteins discharged from nematocysts, to determine the extent to which gene duplications contribute to venom innovation in this reef-building coral species. A new bioinformatics tool called HHCompare was developed to detect potential gene duplications in the genomic data, which is made freely available ( https://github.com/rgacesa/HHCompare ). A total of 55 potential toxin encoding genes could be predicted from the A. digitifera genome, of which 36 (65 %) had likely arisen by gene duplication as evinced using the HHCompare tool and verified using two standard phylogeny methods. Surprisingly, only 22 % (12/55) of the potential toxin repertoire could be detected following rigorous proteomic analysis, for which only half (6/12) of the toxin proteome could be accounted for as peptides encoded by the gene duplicates. Biological activities of these toxins are dominatedby putative phospholipases and toxic peptidases. Gene expansions in A. digitifera venom are the most extensive yet described in any venomous animal, and gene duplication plays a significant role leading to toxin diversification in this coral species. Since such low numbers of toxins were detected in the proteome, it is unlikely that the venom is evolving rapidly by prey-driven positive natural selection. Rather we contend that the venom has a defensive role deterring predation or harm from interspecific competition and overgrowth by fouling organisms. Factors influencing translation of toxin encoding genes perhaps warrants more profound experimental consideration.
Duplicated genes evolve independently in allopolyploid cotton.
Richard C. Cronn; Randall L. Small; Jonathan F. Wendel
1999-01-01
Of the many processes that generate gene duplications, polyploidy is unique in that entire genomes are duplicated. This process has been important in the evolution of many eukaryotic groups, and it occurs with high frequency in plants. Recent evidence suggests that polyploidization may be accompanied by rapid genomic changes, but the evolutionary fate of discrete loci...
On the Complexity of Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.
Kordi, Misagh; Bansal, Mukul S
2017-01-01
Duplication-Transfer-Loss (DTL) reconciliation has emerged as a powerful technique for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation takes as input a gene family phylogeny and the corresponding species phylogeny, and reconciles the two by postulating speciation, gene duplication, horizontal gene transfer, and gene loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. However, gene trees are frequently non-binary. With such non-binary gene trees, the reconciliation problem seeks to find a binary resolution of the gene tree that minimizes the reconciliation cost. Given the prevalence of non-binary gene trees, many efficient algorithms have been developed for this problem in the context of the simpler Duplication-Loss (DL) reconciliation model. Yet, no efficient algorithms exist for DTL reconciliation with non-binary gene trees and the complexity of the problem remains unknown. In this work, we resolve this open question by showing that the problem is, in fact, NP-hard. Our reduction applies to both the dated and undated formulations of DTL reconciliation. By resolving this long-standing open problem, this work will spur the development of both exact and heuristic algorithms for this important problem.
Kaltenegger, Elisabeth; Eich, Eckart; Ober, Dietrich
2013-01-01
Homospermidine synthase (HSS), the first pathway-specific enzyme of pyrrolizidine alkaloid biosynthesis, is known to have its origin in the duplication of a gene encoding deoxyhypusine synthase. To study the processes that followed this gene duplication event and gave rise to HSS, we identified sequences encoding HSS and deoxyhypusine synthase from various species of the Convolvulaceae. We show that HSS evolved only once in this lineage. This duplication event was followed by several losses of a functional gene copy attributable to gene loss or pseudogenization. Statistical analyses of sequence data suggest that, in those lineages in which the gene copy was successfully recruited as HSS, the gene duplication event was followed by phases of various selection pressures, including purifying selection, relaxed functional constraints, and possibly positive Darwinian selection. Site-specific mutagenesis experiments have confirmed that the substitution of sites predicted to be under positive Darwinian selection is sufficient to convert a deoxyhypusine synthase into a HSS. In addition, analyses of transcript levels have shown that HSS and deoxyhypusine synthase have also diverged with respect to their regulation. The impact of protein–protein interaction on the evolution of HSS is discussed with respect to current models of enzyme evolution. PMID:23572540
Evolution dynamics of a model for gene duplication under adaptive conflict
NASA Astrophysics Data System (ADS)
Ancliff, Mark; Park, Jeong-Man
2014-06-01
We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.
Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates.
Bao, Yongbo; Xu, Fei; Shimeld, Sebastian M
2017-04-01
The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix-loop-helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56-88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Jourda, Cyril; Cardi, Céline; Gibert, Olivier; Giraldo Toro, Andrès; Ricci, Julien; Mbéguié-A-Mbéguié, Didier; Yahiaoui, Nabila
2016-01-01
Starch is the most widespread and abundant storage carbohydrate in plants. It is also a major feature of cultivated bananas as it accumulates to large amounts during banana fruit development before almost complete conversion to soluble sugars during ripening. Little is known about the structure of major gene families involved in banana starch metabolism and their evolution compared to other species. To identify genes involved in banana starch metabolism and investigate their evolutionary history, we analyzed six gene families playing a crucial role in plant starch biosynthesis and degradation: the ADP-glucose pyrophosphorylases (AGPases), starch synthases (SS), starch branching enzymes (SBE), debranching enzymes (DBE), α-amylases (AMY) and β-amylases (BAM). Using comparative genomics and phylogenetic approaches, these genes were classified into families and sub-families and orthology relationships with functional genes in Eudicots and in grasses were identified. In addition to known ancestral duplications shaping starch metabolism gene families, independent evolution in banana and grasses also occurred through lineage-specific whole genome duplications for specific sub-families of AGPase, SS, SBE, and BAM genes; and through gene-scale duplications for AMY genes. In particular, banana lineage duplications yielded a set of AGPase, SBE and BAM genes that were highly or specifically expressed in banana fruits. Gene expression analysis highlighted a complex transcriptional reprogramming of starch metabolism genes during ripening of banana fruits. A differential regulation of expression between banana gene duplicates was identified for SBE and BAM genes, suggesting that part of starch metabolism regulation in the fruit evolved in the banana lineage. PMID:27994606
Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes
Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu
2014-01-01
It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342
Carrigan, Matthew A.; Uryasev, Oleg; Davis, Ross P.; Zhai, LanMin; Hurley, Thomas D.; Benner, Steven A.
2012-01-01
Background Gene duplication is a source of molecular innovation throughout evolution. However, even with massive amounts of genome sequence data, correlating gene duplication with speciation and other events in natural history can be difficult. This is especially true in its most interesting cases, where rapid and multiple duplications are likely to reflect adaptation to rapidly changing environments and life styles. This may be so for Class I of alcohol dehydrogenases (ADH1s), where multiple duplications occurred in primate lineages in Old and New World monkeys (OWMs and NWMs) and hominoids. Methodology/Principal Findings To build a preferred model for the natural history of ADH1s, we determined the sequences of nine new ADH1 genes, finding for the first time multiple paralogs in various prosimians (lemurs, strepsirhines). Database mining then identified novel ADH1 paralogs in both macaque (an OWM) and marmoset (a NWM). These were used with the previously identified human paralogs to resolve controversies relating to dates of duplication and gene conversion in the ADH1 family. Central to these controversies are differences in the topologies of trees generated from exonic (coding) sequences and intronic sequences. Conclusions/Significance We provide evidence that gene conversions are the primary source of difference, using molecular clock dating of duplications and analyses of microinsertions and deletions (micro-indels). The tree topology inferred from intron sequences appear to more correctly represent the natural history of ADH1s, with the ADH1 paralogs in platyrrhines (NWMs) and catarrhines (OWMs and hominoids) having arisen by duplications shortly predating the divergence of OWMs and NWMs. We also conclude that paralogs in lemurs arose independently. Finally, we identify errors in database interpretation as the source of controversies concerning gene conversion. These analyses provide a model for the natural history of ADH1s that posits four ADH1 paralogs in the ancestor of Catarrhine and Platyrrhine primates, followed by the loss of an ADH1 paralog in the human lineage. PMID:22859968
Zimmer, Christoph T; Garrood, William T; Singh, Kumar Saurabh; Randall, Emma; Lueke, Bettina; Gutbrod, Oliver; Matthiesen, Svend; Kohler, Maxie; Nauen, Ralf; Davies, T G Emyr; Bass, Chris
2018-01-22
Gene duplication is a major source of genetic variation that has been shown to underpin the evolution of a wide range of adaptive traits [1, 2]. For example, duplication or amplification of genes encoding detoxification enzymes has been shown to play an important role in the evolution of insecticide resistance [3-5]. In this context, gene duplication performs an adaptive function as a result of its effects on gene dosage and not as a source of functional novelty [3, 6-8]. Here, we show that duplication and neofunctionalization of a cytochrome P450, CYP6ER1, led to the evolution of insecticide resistance in the brown planthopper. Considerable genetic variation was observed in the coding sequence of CYP6ER1 in populations of brown planthopper collected from across Asia, but just two sequence variants are highly overexpressed in resistant strains and metabolize imidacloprid. Both variants are characterized by profound amino-acid alterations in substrate recognition sites, and the introduction of these mutations into a susceptible P450 sequence is sufficient to confer resistance. CYP6ER1 is duplicated in resistant strains with individuals carrying paralogs with and without the gain-of-function mutations. Despite numerical parity in the genome, the susceptible and mutant copies exhibit marked asymmetry in their expression with the resistant paralogs overexpressed. In the primary resistance-conferring CYP6ER1 variant, this results from an extended region of novel sequence upstream of the gene that provides enhanced expression. Our findings illustrate the versatility of gene duplication in providing opportunities for functional and regulatory innovation during the evolution of an adaptive trait. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions, indicating tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus.
The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.
Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M
2013-05-29
Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in the most recent duplication are the main contributions of our study.
The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family
2013-01-01
Background Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Results Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. Conclusions We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in the most recent duplication are the main contributions of our study. PMID:23718880
Evolutionary origins of a novel host plant detoxification gene in butterflies.
Fischer, Hanna M; Wheat, Christopher W; Heckel, David G; Vogel, Heiko
2008-05-01
Chemical interactions between plants and their insect herbivores provide an excellent opportunity to study the evolution of species interactions on a molecular level. Here, we investigate the molecular evolutionary events that gave rise to a novel detoxifying enzyme (nitrile-specifier protein [NSP]) in the butterfly family Pieridae, previously identified as a coevolutionary key innovation. By generating and sequencing expressed sequence tags, genomic libraries, and screening databases we found NSP to be a member of an insect-specific gene family, which we characterized and named the NSP-like gene family. Members consist of variable tandem repeats, are gut expressed, and are found across Insecta evolving in a dynamic, ongoing birth-death process. In the Lepidoptera, multiple copies of single-domain major allergen genes are present and originate via tandem duplications. Multiple domain genes are found solely within the brassicaceous-feeding Pieridae butterflies, one of them being NSP and another called major allergen (MA). Analyses suggest that NSP and its paralog MA have a unique single-domain evolutionary origin, being formed by intragenic domain duplication followed by tandem whole-gene duplication. Duplicates subsequently experienced a period of relaxed constraint followed by an increase in constraint, perhaps after neofunctionalization. NSP and its ortholog MA are still experiencing high rates of change, reflecting a dynamic evolution consistent with the known role of NSP in plant-insect interactions. Our results provide direct evidence to the hypothesis that gene duplication is one of the driving forces for speciation and adaptation, showing that both within- and whole-gene tandem duplications are a powerful force underlying evolutionary adaptation.
Levels of duplicate gene expression in armoured catfishes.
Dunham, R A; Philipp, D P; Whitt, G S
1980-01-01
Species of armoured catfishes differ significantly in their cellular DNA content and chromosome number. Starch gel electrophoresis of isozymes was used to determine whether each of 16 enzyme loci was expressed in a single or duplicate state. The percent of enzyme loci exhibiting duplicate locus expression in Corydoras aeneus, Corydoras julii, Corydoras melanistius, and Corydoras myersi was 37.5 percent, 18.75 percent, 12.5 percent, and 6.25 percent, respectively. The percentage of loci expressed in duplicate is higher in the species with higher haploid DNA contents, which are 4.4 pg, 3.0 pg, and 2.3 pg, respectively. These differences in DNA contents are also associated with differences in chromosome number. These data are consistent with the hypothesis that increases in DNA contents and enzyme loci occur both by tetraploidization and by regional gene duplication and that these increases are then followed by a partial loss of DNA and a reduction in the number of the duplicate isozyme loci expressed. Such analyses provide insight into the mechanisms of genome amplification and reduction as well as insights into the fats of duplicate genes.
Bright, Lydia J.; Gout, Jean-Francois; Lynch, Michael
2017-01-01
New gene functions arise within existing gene families as a result of gene duplication and subsequent diversification. To gain insight into the steps that led to the functional diversification of paralogues, we tracked duplicate retention patterns, expression-level divergence, and subcellular markers of functional diversification in the Rab GTPase gene family in three Paramecium aurelia species. After whole-genome duplication, Rab GTPase duplicates are more highly retained than other genes in the genome but appear to be diverging more rapidly in expression levels, consistent with early steps in functional diversification. However, by localizing specific Rab proteins in Paramecium cells, we found that paralogues from the two most recent whole-genome duplications had virtually identical localization patterns, and that less closely related paralogues showed evidence of both conservation and diversification. The functionally conserved paralogues appear to target to compartments associated with both endocytic and phagocytic recycling functions, confirming evolutionary and functional links between the two pathways in a divergent eukaryotic lineage. Because the functionally diversifying paralogues are still closely related to and derived from a clade of functionally conserved Rab11 genes, we were able to pinpoint three specific amino acid residues that may be driving the change in the localization and thus the function in these proteins. PMID:28251922
Law, Sheran Hiu Wan; Redelings, Benjamin David; Kullman, Seth William
2012-01-15
The availability of multiple teleost (bony fish) genomes is providing unprecedented opportunities to understand the diversity and function of gene duplication events using comparative genomics. Here we examine multiple paralogous genes of γ-glutamyl transferase (GGT) in several distantly related teleost species including medaka, stickleback, green spotted pufferfish, fugu, and zebrafish. Through mining genome databases, we have identified multiple GGT orthologs. Duplicate (paralogous) GGT sequences for GGT1 (GGT1 a and b), GGTL1 (GGTL1 a and b), and GGTL3 (GGTL3 a and b) were identified for each species. Phylogenetic analysis suggests that GGTs are ancient proteins conserved across most metazoan phyla and those paralogous GGTs in teleosts likely arose from the serial 3R genome duplication events. A third GGTL1 gene (GGTL1c) was found in green spotted pufferfish; however, this gene is not present in medaka, stickleback, or fugu. Similarly, one or both paralogs of GGTL3 appear to have been lost in green spotted pufferfish, fugu, and zebrafish. Syntenic relationships were highly maintained between duplicated teleost chromosomes, among teleosts and across ray-finned (Actinopterygii) and lobe-finned (Sarcopterygii) species. To assess subfunction partitioning, six medaka GGT genes were cloned and assessed for developmental and tissue-specific expression. On the basis of these data, we propose a modification of the "duplication-degeneration-complementation" model of subfunction partitioning where quantitative differences rather than absolute differences in gene expression are observed between gene paralogs. Our results demonstrate that multiple GGT genes have been retained within teleost genomes. Questions remain, however, regarding the functional roles of multiple GGTs in these species. Copyright © 2011 Wiley Periodicals, Inc., A Wiley Company.
Zheng, Deyou
2008-01-01
Background Sequencing and annotation of several mammalian genomes have revealed that segmental duplications are a common architectural feature of primate genomes; in fact, about 5% of the human genome is composed of large blocks of interspersed segmental duplications. These segmental duplications have been implicated in genomic copy-number variation, gene novelty, and various genomic disorders. However, the molecular processes involved in the evolution and regulation of duplicated sequences remain largely unexplored. Results In this study, the profile of about 20 histone modifications within human segmental duplications was characterized using high-resolution, genome-wide data derived from a ChIP-Seq study. The analysis demonstrates that derivative loci of segmental duplications often differ significantly from the original with respect to many histone methylations. Further investigation showed that genes are present three times more frequently in the original than in the derivative, whereas pseudogenes exhibit the opposite trend. These asymmetries tend to increase with the age of segmental duplications. The uneven distribution of genes and pseudogenes does not, however, fully account for the asymmetry in the profile of histone modifications. Conclusion The first systematic analysis of histone modifications between segmental duplications demonstrates that two seemingly 'identical' genomic copies are distinct in their epigenomic properties. Results here suggest that local chromatin environments may be implicated in the discrimination of derived copies of segmental duplications from their originals, leading to a biased pseudogenization of the new duplicates. The data also indicate that further exploration of the interactions between histone modification and sequence degeneration is necessary in order to understand the divergence of duplicated sequences. PMID:18598352
Clayton-Smith, Jill; Walters, Sarah; Hobson, Emma; Burkitt-Wright, Emma; Smith, Rupert; Toutain, Annick; Amiel, Jeanne; Lyonnet, Stanislas; Mansour, Sahar; Fitzpatrick, David; Ciccone, Roberto; Ricca, Ivana; Zuffardi, Orsetta; Donnai, Dian
2009-01-01
Xq28 duplications encompassing MECP2 have been described in male patients with a severe neurodevelopmental disorder associated with hypotonia and spasticity, severe learning disability and recurrent pneumonia. We identified an Xq28 duplication in three families where several male patients had presented with intestinal pseudo-obstruction or bladder distension. The affected boys had similar dysmorphic facial appearances. Subsequently, we ascertained seven further families where the proband presented with similar features. We demonstrated duplications of the Xq28 region in five of these additional families. In addition to MECP2, these duplications encompassed several other genes already known to be associated with diseases including SLC6A8, L1CAM and Filamin A (FLNA). The two remaining families were shown to have intragenic duplications of FLNA only. We discuss which elements of the Xq28 duplication phenotype may be associated with the various genes in the duplication. We propose that duplication of FLNA may contribute to the bowel and bladder phenotype seen in these seven families. PMID:18854860
Human-Specific Duplication and Mosaic Transcripts: The Recent Paralogous Structure of Chromosome 22
Bailey, Jeffrey A. ; Yavor, Amy M. ; Viggiano, Luigi ; Misceo, Doriana ; Horvath, Juliann E. ; Archidiacono, Nicoletta ; Schwartz, Stuart ; Rocchi, Mariano ; Eichler, Evan E.
2002-01-01
In recent decades, comparative chromosomal banding, chromosome painting, and gene-order studies have shown strong conservation of gross chromosome structure and gene order in mammals. However, findings from the human genome sequence suggest an unprecedented degree of recent (<35 million years ago) segmental duplication. This dynamism of segmental duplications has important implications in disease and evolution. Here we present a chromosome-wide view of the structure and evolution of the most highly homologous duplications (⩾1 kb and ⩾90%) on chromosome 22. Overall, 10.8% (3.7/33.8 Mb) of chromosome 22 is duplicated, with an average sequence identity of 95.4%. To organize the duplications into tractable units, intron-exon structure and well-defined duplication boundaries were used to define 78 duplicated modules (minimally shared evolutionary segments) with 157 copies on chromosome 22. Analysis of these modules provides evidence for the creation or modification of 11 novel transcripts. Comparative FISH analyses of human, chimpanzee, gorilla, orangutan, and macaque reveal qualitative and quantitative differences in the distribution of these duplications—consistent with their recent origin. Several duplications appear to be human specific, including a ∼400-kb duplication (99.4%–99.8% sequence identity) that transposed from chromosome 14 to the most proximal pericentromeric region of chromosome 22. Experimental and in silico data further support a pericentromeric gradient of duplications where the most recent duplications transpose adjacent to the centromere. Taken together, these data suggest that segmental duplications have been an ongoing process of primate genome evolution, contributing to recent gene innovation and the dynamic transformation of genome architecture within and among closely related species. PMID:11731936
2013-01-01
Background Mitochondrial genomic (mitogenomic) reorganizations are rarely found in closely-related animals, yet drastic reorganizations have been found in the Ranoides frogs. The phylogenetic relationships of the three major ranoid taxa (Natatanura, Microhylidae, and Afrobatrachia) have been problematic, and mitogenomic information for afrobatrachians has not been available. Several molecular models for mitochondrial (mt) gene rearrangements have been proposed, but observational evidence has been insufficient to evaluate them. Furthermore, evolutionary trends in rearranged mt genes have not been well understood. To gain molecular and phylogenetic insights into these issues, we analyzed the mt genomes of four afrobatrachian species (Breviceps adspersus, Hemisus marmoratus, Hyperolius marmoratus, and Trichobatrachus robustus) and performed molecular phylogenetic analyses. Furthermore we searched for two evolutionary patterns expected in the rearranged mt genes of ranoids. Results Extensively reorganized mt genomes having many duplicated and rearranged genes were found in three of the four afrobatrachians analyzed. In fact, Breviceps has the largest known mt genome among vertebrates. Although the kinds of duplicated and rearranged genes differed among these species, a remarkable gene rearrangement pattern of non-tandemly copied genes situated within tandemly-copied regions was commonly found. Furthermore, the existence of concerted evolution was observed between non-neighboring copies of triplicated 12S and 16S ribosomal RNA regions. Conclusions Phylogenetic analyses based on mitogenomic data support a close relationship between Afrobatrachia and Microhylidae, with their estimated divergence 100 million years ago consistent with present-day endemism of afrobatrachians on the African continent. The afrobatrachian mt data supported the first tandem and second non-tandem duplication model for mt gene rearrangements and the recombination-based model for concerted evolution of duplicated mt regions. We also showed that specific nucleotide substitution and compositional patterns expected in duplicated and rearranged mt genes did not occur, suggesting no disadvantage in employing these genes for phylogenetic inference. PMID:24053406
Vatansever, Recep; Koc, Ibrahim; Ozyigit, Ibrahim Ilker; Sen, Ugur; Uras, Mehmet Emin; Anjum, Naser A; Pereira, Eduarda; Filiz, Ertugrul
2016-12-01
Solanum tuberosum genome analysis revealed 12 StSULTR genes encoding 18 transcripts. Among genes annotated at group level ( StSULTR I-IV), group III members formed the largest SULTRs-cluster and were potentially involved in biotic/abiotic stress responses via various regulatory factors, and stress and signaling proteins. Employing bioinformatics tools, this study performed genome-wide identification and expression analysis of SULTR (StSULTR) genes in potato (Solanum tuberosum L.). Very strict homology search and subsequent domain verification with Hidden Markov Model revealed 12 StSULTR genes encoding 18 transcripts. StSULTR genes were mapped on seven S. tuberosum chromosomes. Annotation of StSULTR genes was also done as StSULTR I-IV at group level based mainly on the phylogenetic distribution with Arabidopsis SULTRs. Several tandem and segmental duplications were identified between StSULTR genes. Among these duplications, Ka/Ks ratios indicated neutral nature of mutations that might not be causing any selection. Two segmental and one-tandem duplications were calculated to occur around 147.69, 180.80 and 191.00 million years ago (MYA), approximately corresponding to the time of monocot/dicot divergence. Two other segmental duplications were found to occur around 61.23 and 67.83 MYA, which is very close to the origination of monocotyledons. Most cis-regulatory elements in StSULTRs were found associated with major hormones (such as abscisic acid and methyl jasmonate), and defense and stress responsiveness. The cis-element distribution in duplicated gene pairs indicated the contribution of duplication events in conferring the neofunctionalization/s in StSULTR genes. Notably, RNAseq data analyses unveiled expression profiles of StSULTR genes under different stress conditions. In particular, expression profiles of StSULTR III members suggested their involvement in plant stress responses. Additionally, gene co-expression networks of these group members included various regulatory factors, stress and signaling proteins, and housekeeping and some other proteins with unknown functions.
Evolutionary Diversification of Insect Innexins
Hughes, Austin L.
2014-01-01
Abstract Phylogenetic analysis of insect innexins supported the hypothesis that six major clades of insect innexins arose by gene duplication prior to the origin of the endopterygote insects. Within one of the six clades (the Zpg Clade), two independent gene duplication events were inferred to have occurred in the lineage of Drosophila , after the most recent common ancestor of the dipteran families Culicidae and Drosophilidae. The relationships among this clades were poorly resolved, except for a sister relationship between ShakB and Ogre. Gene expression data from FlyAtlas supported the hypothesis that the latter gene duplication events gave rise to functional differentiation, with Zpg showing a high level of expression in ovary, and Inx5 and Inx6 showing a high level of expression in testis. Because unduplicated members of this clade in Bombyx mori and Anopheles gambiae showed high levels of expression in both ovary and tests, the expression patterns of the Drosophila members of this clade provide evidence of subdivision of an ancestral gene function after gene duplication. PMID:25502029
Whole-Genome Duplication and the Functional Diversification of Teleost Fish Hemoglobins
Opazo, Juan C.; Butts, G. Tyler; Nery, Mariana F.; Storz, Jay F.; Hoffmann, Federico G.
2013-01-01
Subsequent to the two rounds of whole-genome duplication that occurred in the common ancestor of vertebrates, a third genome duplication occurred in the stem lineage of teleost fishes. This teleost-specific genome duplication (TGD) is thought to have provided genetic raw materials for the physiological, morphological, and behavioral diversification of this highly speciose group. The extreme physiological versatility of teleost fish is manifest in their diversity of blood–gas transport traits, which reflects the myriad solutions that have evolved to maintain tissue O2 delivery in the face of changing metabolic demands and environmental O2 availability during different ontogenetic stages. During the course of development, regulatory changes in blood–O2 transport are mediated by the expression of multiple, functionally distinct hemoglobin (Hb) isoforms that meet the particular O2-transport challenges encountered by the developing embryo or fetus (in viviparous or oviparous species) and in free-swimming larvae and adults. The main objective of the present study was to assess the relative contributions of whole-genome duplication, large-scale segmental duplication, and small-scale gene duplication in producing the extraordinary functional diversity of teleost Hbs. To accomplish this, we integrated phylogenetic reconstructions with analyses of conserved synteny to characterize the genomic organization and evolutionary history of the globin gene clusters of teleosts. These results were then integrated with available experimental data on functional properties and developmental patterns of stage-specific gene expression. Our results indicate that multiple α- and β-globin genes were present in the common ancestor of gars (order Lepisoteiformes) and teleosts. The comparative genomic analysis revealed that teleosts possess a dual set of TGD-derived globin gene clusters, each of which has undergone lineage-specific changes in gene content via repeated duplication and deletion events. Phylogenetic reconstructions revealed that paralogous genes convergently evolved similar functional properties in different teleost lineages. Consistent with other recent studies of globin gene family evolution in vertebrates, our results revealed evidence for repeated evolutionary transitions in the developmental regulation of Hb synthesis. PMID:22949522
Chromosome I duplications in Caenorhabditis elegans
DOE Office of Scientific and Technical Information (OSTI.GOV)
McKim, K.S.; Rose, A.M.
1990-01-01
We have isolated and characterized 76 duplications of chromosome I in the genome of Caenorhabditis elegans. The region studied is the 20 map unit left half of the chromosome. Sixty-two duplications were induced with gamma radiation and 14 arose spontaneously. The latter class was apparently the result of spontaneous breaks within the parental duplication. The majority of duplications behave as if they are free. Three duplications are attached to identifiable sequences from other chromosomes. The duplication breakpoints have been mapped by complementation analysis relative to genes on chromosome I. Nineteen duplication breakpoints and seven deficiency breakpoints divide the left halfmore » of the chromosome into 24 regions. We have studied the relationship between duplication size and segregational stability. While size is an important determinant of mitotic stability, it is not the only one. We observed clear exceptions to a size-stability correlation. In addition to size, duplication stability may be influenced by specific sequences or chromosome structure. The majority of the duplications were stable enough to be powerful tools for gene mapping. Therefore the duplications described here will be useful in the genetic characterization of chromosome I and the techniques we have developed can be adapted to other regions of the genome.« less
Guo, Yong; Qiu, Li-Juan
2013-01-01
The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
2008-01-01
Background The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) α, β and γ subunits. Further investigation of 14 α-like (Abpa) and 13 β- or γ-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Results Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. Conclusion We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification. PMID:18269759
Laukaitis, Christina M; Heger, Andreas; Blakley, Tyler D; Munclinger, Pavel; Ponting, Chris P; Karn, Robert C
2008-02-12
The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) alpha, beta and gamma subunits. Further investigation of 14 alpha-like (Abpa) and 13 beta- or gamma-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification.
Genomic analysis reveals extensive gene duplication within the bovine TRB locus
Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan
2009-01-01
Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes, which is substantially larger than that described for humans and mice. Conclusion The analyses completed in this study reveal that, although the gene content and organization of the bovine TRB locus are broadly similar to that of humans and mice, multiple duplication events have led to a marked expansion in the number of TRB genes. Similar expansions in other ruminant TR loci suggest strong evolutionary pressures in this lineage have selected for the development of enlarged sets of TR genes that can contribute to diverse TR repertoires. PMID:19393068
Hofberger, Johannes A.; Lyons, Eric; Edger, Patrick P.; Chris Pires, J.; Eric Schranz, M.
2013-01-01
Plants share a common history of successive whole-genome duplication (WGD) events retaining genomic patterns of duplicate gene copies (ohnologs) organized in conserved syntenic blocks. Duplication was often proposed to affect the origin of novel traits during evolution. However, genetic evidence linking WGD to pathway diversification is scarce. We show that WGD and tandem duplication (TD) accelerated genetic versatility of plant secondary metabolism, exemplified with the glucosinolate (GS) pathway in the mustard family. GS biosynthesis is a well-studied trait, employing at least 52 biosynthetic and regulatory genes in the model plant Arabidopsis. In a phylogenomics approach, we identified 67 GS loci in Aethionema arabicum of the tribe Aethionemae, sister group to all mustard family members. All but one of the Arabidopsis GS gene families evolved orthologs in Aethionema and all but one of the orthologous sequence pairs exhibit synteny. The 45% fraction of duplicates among all protein-coding genes in Arabidopsis was increased to 95% and 97% for Arabidopsis and Aethionema GS pathway inventory, respectively. Compared with the 22% average for all protein-coding genes in Arabidopsis, 52% and 56% of Aethionema and Arabidopsis GS loci align to ohnolog copies dating back to the last common WGD event. Although 15% of all Arabidopsis genes are organized in tandem arrays, 45% and 48% of GS loci in Arabidopsis and Aethionema descend from TD, respectively. We describe a sequential combination of TD and WGD events driving gene family extension, thereby expanding the evolutionary playground for functional diversification and thus potential novelty and success. PMID:24171911
Jiang, Hua; Liu, Sha; Zhang, Yong-Ling; Wan, Jun-Hui; Li, Ru; Li, Dong-Zhi
2015-01-01
We describe a new case of a β-thalassemia (β-thal) heterozygote with the mutation IVS-II-654 (C>T) presenting with a transfusion-dependent phenotype. Multiplex ligation-dependent probe amplification (MLPA) and array comparative genomic hybridization (CGH) analyses of the α-globin gene cluster revealed a full duplication of the α-globin genes including the upstream regulatory element. The duplicated allele and the normal allele in trans resulted in a total of six active α-globin genes. The severe clinical phenotype seemed to be related to the considerable excess of the α- and β-globin deficit caused by the presence of the β-thal. α-Globin cluster duplication should be considered in patients heterozygous for β-thal who show a more severe phenotype than β-thal trait.
Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes.
Studer, Romain A; Penel, Simon; Duret, Laurent; Robinson-Rechavi, Marc
2008-09-01
A stringent branch-site codon model was used to detect positive selection in vertebrate evolution. We show that the test is robust to the large evolutionary distances involved. Positive selection was detected in 77% of 884 genes studied. Most positive selection concerns a few sites on a single branch of the phylogenetic tree: Between 0.9% and 4.7% of sites are affected by positive selection depending on the branches. No functional category was overrepresented among genes under positive selection. Surprisingly, whole genome duplication had no effect on the prevalence of positive selection, whether the fish-specific genome duplication or the two rounds at the origin of vertebrates. Thus positive selection has not been limited to a few gene classes, or to specific evolutionary events such as duplication, but has been pervasive during vertebrate evolution.
USDA-ARS?s Scientific Manuscript database
During normal bacterial DNA replication, gene duplication and amplification (GDA) events occur randomly at a low frequency in the genome throughout a population. In the absence of selection, GDA events that increase the number of copies of a bacterial gene (or a set of genes) are lost. Antibiotic ...
Schwarte, Sandra; Tiedemann, Ralph
2011-06-01
Rubisco (ribulose-1,5-bisphosphate carboxylase/oxygenase; EC 4.1.1.39), the most abundant protein in nature, catalyzes the assimilation of CO(2) (worldwide about 10(11) t each year) by carboxylation of ribulose-1,5-bisphosphate. It is a hexadecamer consisting of eight large and eight small subunits. Although the Rubisco large subunit (rbcL) is encoded by a single gene on the multicopy chloroplast genome, the Rubisco small subunits (rbcS) are encoded by a family of nuclear genes. In Arabidopsis thaliana, the rbcS gene family comprises four members, that is, rbcS-1a, rbcS-1b, rbcS-2b, and rbcS-3b. We sequenced all Rubisco genes in 26 worldwide distributed A. thaliana accessions. In three of these accessions, we detected a gene duplication/loss event, where rbcS-1b was lost and substituted by a duplicate of rbcS-2b (called rbcS-2b*). By screening 74 additional accessions using a specific polymerase chain reaction assay, we detected five additional accessions with this duplication/loss event. In summary, we found the gene duplication/loss in 8 of 100 A. thaliana accessions, namely, Bch, Bu, Bur, Cvi, Fei, Lm, Sha, and Sorbo. We sequenced an about 1-kb promoter region for all Rubisco genes as well. This analysis revealed that the gene duplication/loss event was associated with promoter alterations (two insertions of 450 and 850 bp, one deletion of 730 bp) in rbcS-2b and a promoter deletion (2.3 kb) in rbcS-2b* in all eight affected accessions. The substitution of rbcS-1b by a duplicate of rbcS-2b (i.e., rbcS-2b*) might be caused by gene conversion. All four Rubisco genes evolve under purifying selection, as expected for central genes of the highly conserved photosystem of green plants. We inferred a single positive selected site, a tyrosine to aspartic acid substitution at position 72 in rbcS-1b. Exactly the same substitution compromises carboxylase activity in the cyanobacterium Anacystis nidulans. In A. thaliana, this substitution is associated with an inferred recombination. Functional implications of the substitution remain to be evaluated.
Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu
2013-07-01
NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.
Thomas, N Simon; Harvey, John F; Bunyan, David J; Rankin, Julia; Grigelioniene, Giedre; Bruno, Damien L; Tan, Tiong Y; Tomkins, Susan; Hastings, Robert
2009-07-01
Deletions of the SHOX gene are well documented and cause disproportionate short stature and variable skeletal abnormalities. In contrast interstitial SHOX duplications limited to PAR1 appear to be very rare and the clinical significance of the only case report in the literature is unclear. Mapping of this duplication has now shown that it includes the entire SHOX gene but little flanking sequence and so will not encompass any of the long-range enhancers required for SHOX transcription. We now describe the clinical and molecular characterization of three additional cases. The duplications all included the SHOX coding sequence but varied in the amount of flanking sequence involved. The probands were ascertained for a variety of reasons: hypotonia and features of Asperger syndrome, Leri-Weill dyschondrosteosis (LWD), and a family history of cleft palate. However, the presence of a duplication did not correlate with any of these features or with evidence of skeletal abnormality. Remarkably, the proband with LWD had inherited both a SHOX deletion and a duplication. The effect of the duplications on stature was variable: height appeared to be elevated in some carriers, particularly in those with the largest duplications, but was still within the normal range. SHOX duplications are likely to be under ascertained and more cases need to be identified and characterized in detail in order to accurately determine their phenotypic consequences.
Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo
2012-01-01
Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714
Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan
2016-07-01
The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks ) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Evolution and Expression Patterns of TCP Genes in Asparagales
Madrigal, Yesenia; Alzate, Juan F.; Pabón-Mora, Natalia
2017-01-01
CYCLOIDEA-like genes are involved in the symmetry gene network, limiting cell proliferation in the dorsal regions of bilateral flowers in core eudicots. CYC-like and closely related TCP genes (acronym for TEOSINTE BRANCHED1, CYCLOIDEA, and PROLIFERATION CELL FACTOR) have been poorly studied in Asparagales, the largest order of monocots that includes both bilateral flowers in Orchidaceae (ca. 25.000 spp) and radially symmetrical flowers in Hypoxidaceae (ca. 200 spp). With the aim of assessing TCP gene evolution in the Asparagales, we isolated TCP-like genes from publicly available databases and our own transcriptomes of Cattleya trianae (Orchidaceae) and Hypoxis decumbens (Hypoxidaceae). Our matrix contains 452 sequences representing the three major clades of TCP genes. Besides the previously identified CYC specific core eudicot duplications, our ML phylogenetic analyses recovered an early CIN-like duplication predating all angiosperms, two CIN-like Asparagales-specific duplications and a duplication prior to the diversification of Orchidoideae and Epidendroideae. In addition, we provide evidence of at least three duplications of PCF-like genes in Asparagales. While CIN-like and PCF-like genes have multiplied in Asparagales, likely enhancing the genetic network for cell proliferation, CYC-like genes remain as single, shorter copies with low expression. Homogeneous expression of CYC-like genes in the labellum as well as the lateral petals suggests little contribution to the bilateral perianth in C. trianae. CIN-like and PCF-like gene expression suggests conserved roles in cell proliferation in leaves, sepals and petals, carpels, ovules and fruits in Asparagales by comparison with previously reported functions in core eudicots and monocots. This is the first large scale analysis of TCP-like genes in Asparagales that will serve as a platform for in-depth functional studies in emerging model monocots. PMID:28144250
A salmonid EST genomic study: genes, duplications, phylogeny and microarrays
USDA-ARS?s Scientific Manuscript database
Background: Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most wide...
Extensive concerted evolution of rice paralogs and the road to regaining independence.
Wang, Xiyin; Tang, Haibao; Bowers, John E; Feltus, Frank A; Paterson, Andrew H
2007-11-01
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the approximately 0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, approximately 8% of japonica paralogs produced 5-7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while approximately 70-MY-old "paleologs" resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice-sorghum divergence approximately 41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity--that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5-7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization.
Hernández-Hernández, Tania; Martínez-Castilla, León Patricio; Alvarez-Buylla, Elena R
2007-02-01
B-class MADS-box genes have been shown to be the key regulators of petal and stamen specification in several eudicot model species such as Arabidopsis thaliana, Antirrhinum majus, and Petunia hybrida. Orthologs of these genes have been found across angiosperms and gymnosperms, and it is thought that the basic regulatory function of B proteins is conserved in seed plant lineages. The evolution of B genes is characterized by numerous duplications that might represent key elements fostering the functional diversification of duplicates with a deep impact on their role in the evolution of the floral developmental program. To evaluate this, we performed a rigorous statistical analysis with B gene sequences. Using maximum likelihood and Bayesian methods, we estimated molecular substitution rates and determined the selective regimes operating at each residue of B proteins. We implemented tests that rely on phylogenetic hypotheses and codon substitution models to detect significant differences in substitution rates (DSRs) and sites under positive adaptive selection (PS) in specific lineages before and after duplication events. With these methods, we identified several protein residues fixed by PS shortly after the origin of PISTILLATA-like and APETALA3-like lineages in angiosperms and shortly after the origin of the euAP3-like lineage in core eudicots, the 2 main B gene duplications. The residues inferred to have been fixed by positive selection lie mostly within the K domain of the protein, which is key to promote heterodimerization. Additionally, we used a likelihood method that accommodates DSRs among lineages to estimate duplication dates for AP3-PI and euAP3-TM6, calibrating with data from the fossil record. The dates obtained are consistent with angiosperm origins and diversification of core eudicots. Our results strongly suggest that novel multimer formation with other MADS proteins could have been crucial for the functional divergence of B MADS-box genes. We thus propose a mechanism of functional diversification and persistence of gene duplicates by the appearance of novel multimerization capabilities after duplications. Multimer formation in different combinations of regulatory proteins can be a mechanistic basis for the origin of novel regulatory functions and a gene regulatory mechanism for the appearance of morphological innovations.
Ajmal, Wajya; Khan, Hiba; Abbasi, Amir Ali
2014-12-01
Understanding the genetic mechanisms underlying the organismal complexity and origin of novelties during vertebrate history is one of the central goals of evolutionary biology. Ohno (1970) was the first to postulate that whole genome duplications (WGD) have played a vital role in the evolution of new gene functions: permitting an increase in morphological, physiological and anatomical complexity during early vertebrate history. Here, we analyze the evolutionary history of human FGFR-bearing paralogon (human autosome 4/5/8/10) by the phylogenetic analysis of multigene families with triplicate and quadruplicate distribution on these chromosomes. Our results categorized the histories of 21 families into discrete co-duplicated groups. Genes of a particular co-duplicated group exhibit identical evolutionary history and have duplicated in concert with each other, whereas genes belonging to different groups have dissimilar histories and have not duplicated concurrently. Taken together with our previously published data, we submit that there is sufficient empirical evidence to disprove the 1R/2R hypothesis and to support the general prediction that vertebrate genome evolved by relatively small-scale, regional duplication events that spread across the history of life. Copyright © 2014 Elsevier Inc. All rights reserved.
Indrasumunar, Arief; Wilde, Julia; Hayashi, Satomi; Li, Dongxue; Gresshoff, Peter M
2015-03-15
Association between legumes and rhizobia results in the formation of root nodules, where symbiotic nitrogen fixation occurs. The early stages of this association involve a complex of signalling events between the host and microsymbiont. Several genes dealing with early signal transduction have been cloned, and one of them encodes the leucine-rich repeat (LRR) receptor kinase (SymRK; also termed NORK). The Symbiosis Receptor Kinase gene is required by legumes to establish a root endosymbiosis with Rhizobium bacteria as well as mycorrhizal fungi. Using degenerate primer and BAC sequencing, we cloned duplicated SymRK homeologues in soybean called GmSymRKα and GmSymRKβ. These duplicated genes have high similarity of nucleotide (96%) and amino acid sequence (95%). Sequence analysis predicted a malectin-like domain within the extracellular domain of both genes. Several putative cis-acting elements were found in promoter regions of GmSymRKα and GmSymRKβ, suggesting a participation in lateral root development, cell division and peribacteroid membrane formation. The mutant of SymRK genes is not available in soybean; therefore, to know the functions of these genes, RNA interference (RNAi) of these duplicated genes was performed. For this purpose, RNAi construct of each gene was generated and introduced into the soybean genome by Agrobacterium rhizogenes-mediated hairy root transformation. RNAi of GmSymRKβ gene resulted in an increased reduction of nodulation and mycorrhizal infection than RNAi of GmSymRKα, suggesting it has the major activity of the duplicated gene pair. The results from the important crop legume soybean confirm the joint phenotypic action of GmSymRK genes in both mycorrhizal and rhizobial infection seen in model legumes. Copyright © 2015 Elsevier GmbH. All rights reserved.
A rare FANCA gene variation as a breast cancer susceptibility allele in an Iranian population
Abbasi, Sakineh; Rasouli, Mina
2017-01-01
Fanconi Anemia (FA) is an autosomal recessive syndrome characterized by congenital abnormalities, progressive bone marrow failure and Fanconi anemia complementation group A (FANCA) is also a potential breast and ovarian cancer susceptibility gene. A novel allele with tandem duplication of 13 base pair sequence in promoter region was identified. To investigate whether the 13 base pair sequence of tandem duplication in promoter region of the FANCA gene is of high penetrance in patients with breast cancer and to determine if the presence of the duplicated allele was associated with an altered risk of breast cancer, the present study screened DNA in blood samples from 304 breast cancer patients and 295 normal individuals as controls. The duplication allele had a frequency of 35.4 and 21.2% in patients with breast cancer and normal controls, respectively. There was a significant increase in the frequency of the duplication allele in patients with familial breast cancer compared with controls (45.1%, P=0.001). Furthermore, the estimated risk of breast cancer in individuals with a homozygote [odds ratio (OR), 4.093; 95% confidence intervals (CI), 1.957–8.561] or heterozygote duplicated genotype (OR, 3.315; 95% CI, 1.996–5.506) was higher compared with the corresponding normal homozygote genotype. In conclusion, the present study indicated that the higher the frequency of the duplicated allele, the higher the risk of breast cancer. To the best of our knowledge, the present study is the first to report FANCA gene duplication in patients with breast cancer. PMID:28440412
A rare FANCA gene variation as a breast cancer susceptibility allele in an Iranian population.
Abbasi, Sakineh; Rasouli, Mina
2017-06-01
Fanconi Anemia (FA) is an autosomal recessive syndrome characterized by congenital abnormalities, progressive bone marrow failure and Fanconi anemia complementation group A (FANCA) is also a potential breast and ovarian cancer susceptibility gene. A novel allele with tandem duplication of 13 base pair sequence in promoter region was identified. To investigate whether the 13 base pair sequence of tandem duplication in promoter region of the FANCA gene is of high penetrance in patients with breast cancer and to determine if the presence of the duplicated allele was associated with an altered risk of breast cancer, the present study screened DNA in blood samples from 304 breast cancer patients and 295 normal individuals as controls. The duplication allele had a frequency of 35.4 and 21.2% in patients with breast cancer and normal controls, respectively. There was a significant increase in the frequency of the duplication allele in patients with familial breast cancer compared with controls (45.1%, P=0.001). Furthermore, the estimated risk of breast cancer in individuals with a homozygote [odds ratio (OR), 4.093; 95% confidence intervals (CI), 1.957‑8.561] or heterozygote duplicated genotype (OR, 3.315; 95% CI, 1.996‑5.506) was higher compared with the corresponding normal homozygote genotype. In conclusion, the present study indicated that the higher the frequency of the duplicated allele, the higher the risk of breast cancer. To the best of our knowledge, the present study is the first to report FANCA gene duplication in patients with breast cancer.
Du, Jianchang; Tian, Zhixi; Sui, Yi; Zhao, Meixia; Song, Qijian; Cannon, Steven B.; Cregan, Perry; Ma, Jianxin
2012-01-01
The evolutionary forces that govern the divergence and retention of duplicated genes in polyploids are poorly understood. In this study, we first investigated the rates of nonsynonymous substitution (Ka) and the rates of synonymous substitution (Ks) for a nearly complete set of genes in the paleopolyploid soybean (Glycine max) by comparing the orthologs between soybean and its progenitor species Glycine soja and then compared the patterns of gene divergence and expression between pericentromeric regions and chromosomal arms in different gene categories. Our results reveal strong associations between duplication status and Ka and gene expression levels and overall low Ks and low levels of gene expression in pericentromeric regions. It is theorized that deleterious mutations can easily accumulate in recombination-suppressed regions, because of Hill-Robertson effects. Intriguingly, the genes in pericentromeric regions—the cold spots for meiotic recombination in soybean—showed significantly lower Ka and higher levels of expression than their homoeologs in chromosomal arms. This asymmetric evolution of two members of individual whole genome duplication (WGD)-derived gene pairs, echoing the biased accumulation of singletons in pericentromeric regions, suggests that distinct genomic features between the two distinct chromatin types are important determinants shaping the patterns of divergence and retention of WGD-derived genes. PMID:22227891
Arya, Preeti; Kumar, Gulshan; Acharya, Vishal; Singh, Anil K.
2014-01-01
Nucleotide binding site leucine-rich repeats (NBS-LRR) disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR) and coiled coil (CC) (1∶1) was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR) revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple. PMID:25232838
Wang, Guo-Ming; Yin, Hao; Qiao, Xin; Tan, Xu; Gu, Chao; Wang, Bao-Hua; Cheng, Rui; Wang, Ying-Zhen; Zhang, Shao-Ling
2016-12-01
F-box gene family, as one of the largest gene families in plants, plays crucial roles in regulating plant development, reproduction, cellular protein degradation and responses to biotic and abiotic stresses. However, comprehensive analysis of the F-box gene family in pear (Pyrus bretschneideri Rehd.) and other Rosaceae species has not been reported yet. Herein, we identified a total of 226 full-length F-box genes in pear for the first time. And these genes were further divided into various subgroups based on specific domains and phylogenetic analysis. Intriguingly, we observed that whole-genome duplication and dispersed duplication have a major contribution to F-box family expansion. Furthermore, the dynamic evolution for different modes of gene duplication was dissected. Interestingly, we found that dispersed and tandem duplicate have been evolving at a high rate. In addition, we found that F-box genes exhibited functional specificity based on GO analysis, and most of the F-box genes were significantly enriched in the protein binding (GO: 0005515) term, supporting that F-box genes might play a critical role for gene regulation in pear. Transcriptome and digital expression profiles revealed that F-box genes are involved in the development of multiple pear tissues. Overall, these results will set stage for elaborating the biological role of F-box genes in pear and other plants. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S
2010-10-07
PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out to dissect the PHB gene function. The conserved gene evolution indicated that the study in the model species can be translated to human and mammalian studies.
Moey, Ching; Hinze, Susan J; Brueton, Louise; Morton, Jenny; McMullan, Dominic J; Kamien, Benjamin; Barnett, Christopher P; Brunetti-Pierri, Nicola; Nicholl, Jillian; Gecz, Jozef; Shoubridge, Cheryl
2016-01-01
Copy number variations are a common cause of intellectual disability (ID). Determining the contribution of copy number variants (CNVs), particularly gains, to disease remains challenging. Here, we report four males with ID with sub-microscopic duplications at Xp11.2 and review the few cases with overlapping duplications reported to date. We established the extent of the duplicated regions in each case encompassing a minimum of three known disease genes TSPYL2, KDM5C and IQSEC2 with one case also duplicating the known disease gene HUWE1. Patients with a duplication encompassing TSPYL2, KDM5C and IQSEC2 without gains of nearby SMC1A and HUWE1 genes have not been reported thus far. All cases presented with ID and significant deficits of speech development. Some patients also manifested behavioral disturbances such as hyperactivity and attention-deficit/hyperactivity disorder. Lymphoblastic cell lines from patients show markedly elevated levels of TSPYL2, KDM5C and SMC1A, transcripts consistent with the extent of their CNVs. The duplicated region in our patients contains several genes known to escape X-inactivation, including KDM5C, IQSEC2 and SMC1A. In silico analysis of expression data in selected gene expression omnibus series indicates that dosage of these genes, especially IQSEC2, is similar in males and females despite the fact they escape from X-inactivation in females. Taken together, the data suggest that gains in Xp11.22 including IQSEC2 cause ID and are associated with hyperactivity and attention-deficit/hyperactivity disorder, and are likely to be dosage-sensitive in males. PMID:26059843
Moey, Ching; Hinze, Susan J; Brueton, Louise; Morton, Jenny; McMullan, Dominic J; Kamien, Benjamin; Barnett, Christopher P; Brunetti-Pierri, Nicola; Nicholl, Jillian; Gecz, Jozef; Shoubridge, Cheryl
2016-03-01
Copy number variations are a common cause of intellectual disability (ID). Determining the contribution of copy number variants (CNVs), particularly gains, to disease remains challenging. Here, we report four males with ID with sub-microscopic duplications at Xp11.2 and review the few cases with overlapping duplications reported to date. We established the extent of the duplicated regions in each case encompassing a minimum of three known disease genes TSPYL2, KDM5C and IQSEC2 with one case also duplicating the known disease gene HUWE1. Patients with a duplication encompassing TSPYL2, KDM5C and IQSEC2 without gains of nearby SMC1A and HUWE1 genes have not been reported thus far. All cases presented with ID and significant deficits of speech development. Some patients also manifested behavioral disturbances such as hyperactivity and attention-deficit/hyperactivity disorder. Lymphoblastic cell lines from patients show markedly elevated levels of TSPYL2, KDM5C and SMC1A, transcripts consistent with the extent of their CNVs. The duplicated region in our patients contains several genes known to escape X-inactivation, including KDM5C, IQSEC2 and SMC1A. In silico analysis of expression data in selected gene expression omnibus series indicates that dosage of these genes, especially IQSEC2, is similar in males and females despite the fact they escape from X-inactivation in females. Taken together, the data suggest that gains in Xp11.22 including IQSEC2 cause ID and are associated with hyperactivity and attention-deficit/hyperactivity disorder, and are likely to be dosage-sensitive in males.
A diffusion approach to approximating preservation probabilities for gene duplicates.
O'Hely, Martin
2006-08-01
Consider a haploid population and, within its genome, a gene whose presence is vital for the survival of any individual. Each copy of this gene is subject to mutations which destroy its function. Suppose one member of the population somehow acquires a duplicate copy of the gene, where the duplicate is fully linked to the original gene's locus. Preservation is said to occur if eventually the entire population consists of individuals descended from this one which initially carried the duplicate. The system is modelled by a finite state-space Markov process which in turn is approximated by a diffusion process, whence an explicit expression for the probability of preservation is derived. The event of preservation can be compared to the fixation of a selectively neutral gene variant initially present in a single individual, the probability of which is the reciprocal of the population size. For very weak mutation, this and the probability of preservation are equal, while as mutation becomes stronger, the preservation probability tends to double this reciprocal. This is in excellent agreement with simulation studies.
Citerne, Hélène L.; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine
2013-01-01
TCP ECE genes encode transcription factors which have received much attention for their repeated recruitment in the control of floral symmetry in core eudicots, and more recently in monocots. Major duplications of TCP ECE genes have been described in core eudicots, but the evolutionary history of this gene family is unknown in basal eudicots. Reconstructing the phylogeny of ECE genes in basal eudicots will help set a framework for understanding the functional evolution of these genes. TCP ECE genes were sequenced in all major lineages of basal eudicots and Gunnera which belongs to the sister clade to all other core eudicots. We show that in these lineages they have a complex evolutionary history with repeated duplications. We estimate the timing of the two major duplications already identified in the core eudicots within a timeframe before the divergence of Gunnera and after the divergence of Proteales. We also use a synteny-based approach to examine the extent to which the expansion of TCP ECE genes in diverse eudicot lineages may be due to genome-wide duplications. The three major core-eudicot specific clades share a number of collinear genes, and their common evolutionary history may have originated at the γ event. Genomic comparisons in Arabidopsis thaliana and Solanum lycopersicum highlight their separate polyploid origin, with syntenic fragments with and without TCP ECE genes showing differential gene loss and genomic rearrangements. Comparison between recently available genomes from two basal eudicots Aquilegia coerulea and Nelumbo nucifera suggests that the two TCP ECE paralogs in these species are also derived from large-scale duplications. TCP ECE loci from basal eudicots share many features with the three main core eudicot loci, and allow us to infer the makeup of the ancestral eudicot locus. PMID:24019982
Shen, Danyu; Liu, Tingli; Ye, Wenwu; Liu, Li; Liu, Peihan; Wu, Yuren; Wang, Yuanchao; Dou, Daolong
2013-01-01
Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN), or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG), and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD) and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.
Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estelle; Bertrand, Daniel; Leonard, Sherry
2011-01-01
The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca++, that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [125I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. PMID:21718690
Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estele; Bertrand, Daniel; Leonard, Sherry
2011-10-15
The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca(2+), that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [(125)I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. Copyright © 2011 Elsevier Inc. All rights reserved.
The Human CHRNA7 and CHRFAM7A Genes: A Review of the Genetics, Regulation, and Function
Sinkus, Melissa L.; Graw, Sharon; Freedman, Robert; Ross, Randal G.; Lester, Henry A.; Leonard, Sherry
2015-01-01
The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is ubiquitously expressed in both the central nervous system and in the periphery. CHRNA7 is genetically linked to multiple disorders with cognitive deficits, including schizophrenia, bipolar disorder, ADHD, epilepsy, Alzheimer’s disease, and Rett syndrome. The regulation of CHRNA7 is complex; more than a dozen mechanisms are known, one of which is a partial duplication of the parent gene. Exons 5-10 of CHRNA7 on chromosome 15 were duplicated and inserted 1.6 Mb upstream of CHRNA7, interrupting an earlier partial duplication of two other genes. The chimeric CHRFAM7A gene product, dupα7, assembles with α7 subunits, resulting in a dominant negative regulation of function. The duplication is human specific, occurring neither in primates nor in rodents. The duplicated α7 sequence in exons 5-10 of CHRFAM7A is almost identical to CHRNA7, and thus is not completely queried in high throughput genetic studies (GWAS). Further, pre-clinical animal models of the α7nAChR utilized in drug development research do not have CHRFAM7A (dupα7) and cannot fully model human drug responses. The wide expression of CHRNA7, its multiple functions and modes of regulation present challenges for study of this gene in disease. PMID:25701707
Drift diffusion model of reward and punishment learning in rare alpha-synuclein gene carriers.
Moustafa, Ahmed A; Kéri, Szabolcs; Polner, Bertalan; White, Corey
To understand the cognitive effects of alpha-synuclein polymorphism, we employed a drift diffusion model (DDM) to analyze reward- and punishment-guided probabilistic learning task data of participants with the rare alpha-synuclein gene duplication and age- and education-matched controls. Overall, the DDM analysis showed that, relative to controls, asymptomatic alpha-synuclein gene duplication carriers had significantly increased learning from negative feedback, while they tended to show impaired learning from positive feedback. No significant differences were found in response caution, response bias, or motor/encoding time. We here discuss the implications of these computational findings to the understanding of the neural mechanism of alpha-synuclein gene duplication.
Extensive Concerted Evolution of Rice Paralogs and the Road to Regaining Independence
Wang, Xiyin; Tang, Haibao; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.
2007-01-01
Many genes duplicated by whole-genome duplications (WGDs) are more similar to one another than expected. We investigated whether concerted evolution through conversion and crossing over, well-known to affect tandem gene clusters, also affects dispersed paralogs. Genome sequences for two Oryza subspecies reveal appreciable gene conversion in the ∼0.4 MY since their divergence, with a gradual progression toward independent evolution of older paralogs. Since divergence from subspecies indica, ∼8% of japonica paralogs produced 5–7 MYA on chromosomes 11 and 12 have been affected by gene conversion and several reciprocal exchanges of chromosomal segments, while ∼70-MY-old “paleologs” resulting from a genome duplication (GD) show much less conversion. Sequence similarity analysis in proximal gene clusters also suggests more conversion between younger paralogs. About 8% of paleologs may have been converted since rice–sorghum divergence ∼41 MYA. Domain-encoding sequences are more frequently converted than nondomain sequences, suggesting a sort of circularity—that sequences conserved by selection may be further conserved by relatively frequent conversion. The higher level of concerted evolution in the 5–7 MY-old segmental duplication may reflect the behavior of many genomes within the first few million years after duplication or polyploidization. PMID:18039882
Ren, Lin-Ling; Liu, Yan-Jing; Liu, Hai-Jing; Qian, Ting-Ting; Qi, Li-Wang; Wang, Xiao-Ru; Zeng, Qing-Yin
2014-01-01
Gene duplication is the primary source of new genes and novel functions. Over the course of evolution, many duplicate genes lose their function and are eventually removed by deletion. However, some duplicates have persisted and evolved diverse functions. A particular challenge is to understand how this diversity arises and whether positive selection plays a role. In this study, we reconstructed the evolutionary history of the class III peroxidase (PRX) genes from the Populus trichocarpa genome. PRXs are plant-specific enzymes that play important roles in cell wall metabolism and in response to biotic and abiotic stresses. We found that two large tandem-arrayed clusters of PRXs evolved from an ancestral cell wall type PRX to vacuole type, followed by tandem duplications and subsequent functional specification. Substitution models identified seven positively selected sites in the vacuole PRXs. These positively selected sites showed significant effects on the biochemical functions of the enzymes. We also found that positive selection acts more frequently on residues adjacent to, rather than directly at, a critical active site of the enzyme, and on flexible regions rather than on rigid structural elements of the protein. Our study provides new insights into the adaptive molecular evolution of plant enzyme families. PMID:24934172
Evolution of cholinesterases in the animal kingdom.
Pezzementi, Leo; Chatonnet, Arnaud
2010-09-06
Cholinesterases emerged from a family of enzymes and proteins with adhesion properties. This family is absent in plants and expanded in multicellular animals. True cholinesterases appeared in triploblastic animals together with the cholinergic system. Lineage specific duplications resulted in two acetylcholinesterases in most hexapods and in up to four genes in nematodes. In vertebrates the duplication leading to acetylcholinesterase (AChE) and butyrylcholinesterase (BChE) is now considered to be an ancient event which occurred before the split of osteichthyes. The product of one or the other of the paralogues is responsible for the physiological hydrolysis of acetylcholine, depending on the species lineage and tissue considered. The BChE gene seems to have been lost in some fish lineages. The complete genome of amphioxus (Branchiostoma floridae: cephalochordate) contains a large number of duplicated genes or pseudogenes of cholinesterases. Sequence comparison and tree constructions raise the question of considering the atypical ChE studied in this organism as a representative of ancient BChE. Thus nematodes, arthropods, annelids, molluscs, and vertebrates typically possess two paralogous genes coding for cholinesterases. The origin of the duplication(s) is discussed. The mode of attachment through alternative C-terminal coding exons seems to have evolved independently from the catalytic part of the gene. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.
Genome evolution and speciation genetics of clawed frogs (Xenopus and Silurana).
Evans, Ben J
2008-05-01
Speciation of clawed frogs occurred through bifurcation and reticulation of evolutionary lineages, and resulted in extant species with different ploidy levels. Duplicate gene evolution and expression in these animals provides a unique perspective into the earliest genomic transformations after vertebrate whole genome duplication (WGD) and suggests that functional constraints are relaxed compared to before duplication but still consistently strong for millions of years following WGD. Additionally, extensive quantitative expression divergence between duplicate genes occurred after WGD. Diversification of clawed frogs was potentially catalyzed by transposition and divergent resolution--processes that occur through different genetic mechanisms but that have analogous implications for genome structure. How sex determination is maintained after genome duplication is fundamental to our understanding of why allopolyploidization is so prevalent in this group, and why clawed frogs violate Haldane's Rule for hybrid sterility. Future studies of expression subfunctionalization in polyploids will shed light on the role and purviews of cis- and trans-regulatory elements in gene regulation.
Assogba, Benoît S; Djogbénou, Luc S; Milesi, Pascal; Berthomieu, Arnaud; Perez, Julie; Ayala, Diego; Chandre, Fabrice; Makoutodé, Michel; Labbé, Pierrick; Weill, Mylène
2015-10-05
Widespread resistance to pyrethroids threatens malaria control in Africa. Consequently, several countries switched to carbamates and organophophates insecticides for indoor residual spraying. However, a mutation in the ace-1 gene conferring resistance to these compounds (ace-1(R) allele), is already present. Furthermore, a duplicated allele (ace-1(D)) recently appeared; characterizing its selective advantage is mandatory to evaluate the threat. Our data revealed that a unique duplication event, pairing a susceptible and a resistant copy of the ace-1 gene spread through West Africa. Further investigations revealed that, while ace-1(D) confers less resistance than ace-1(R), the high fitness cost associated with ace-1(R) is almost completely suppressed by the duplication for all traits studied. ace-1 duplication thus represents a permanent heterozygote phenotype, selected, and thus spreading, due to the mosaic nature of mosquito control. It provides malaria mosquito with a new evolutionary path that could hamper resistance management.
Assogba, Benoît S.; Djogbénou, Luc S.; Milesi, Pascal; Berthomieu, Arnaud; Perez, Julie; Ayala, Diego; Chandre, Fabrice; Makoutodé, Michel; Labbé, Pierrick; Weill, Mylène
2015-01-01
Widespread resistance to pyrethroids threatens malaria control in Africa. Consequently, several countries switched to carbamates and organophophates insecticides for indoor residual spraying. However, a mutation in the ace-1 gene conferring resistance to these compounds (ace-1R allele), is already present. Furthermore, a duplicated allele (ace-1D) recently appeared; characterizing its selective advantage is mandatory to evaluate the threat. Our data revealed that a unique duplication event, pairing a susceptible and a resistant copy of the ace-1 gene spread through West Africa. Further investigations revealed that, while ace-1D confers less resistance than ace-1R, the high fitness cost associated with ace-1R is almost completely suppressed by the duplication for all traits studied. ace-1 duplication thus represents a permanent heterozygote phenotype, selected, and thus spreading, due to the mosaic nature of mosquito control. It provides malaria mosquito with a new evolutionary path that could hamper resistance management. PMID:26434951
Breakup of a homeobox cluster after genome duplication in teleosts
Mulley, John F.; Chiu, Chi-hua; Holland, Peter W. H.
2006-01-01
Several families of homeobox genes are arranged in genomic clusters in metazoan genomes, including the Hox, ParaHox, NK, Rhox, and Iroquois gene clusters. The selective pressures responsible for maintenance of these gene clusters are poorly understood. The ParaHox gene cluster is evolutionarily conserved between amphioxus and human but is fragmented in teleost fishes. We show that two basal ray-finned fish, Polypterus and Amia, each possess an intact ParaHox cluster; this implies that the selective pressure maintaining clustering was lost after whole-genome duplication in teleosts. Cluster breakup is because of gene loss, not transposition or inversion, and the total number of ParaHox genes is the same in teleosts, human, mouse, and frog. We propose that this homeobox gene cluster is held together in chordates by the existence of interdigitated control regions that could be separated after locus duplication in the teleost fish. PMID:16801555
Zimowski, Janusz G; Massalska, Diana; Holding, Mariola; Jadczak, Sylwia; Fidziańska, Elżbieta; Lusakowska, Anna; Kostera-Pruszczyk, Anna; Kamińska, Anna; Zaremba, Jacek
2014-01-01
Duchenne/Becker muscular dystrophy (DMD/BMD) is a recessive, X-linked disorder caused by a mutation in the dystrophin gene. Deletions account for approximately 60-65% of mutations, duplications for 5-10%. The remaining cases are mainly point mutations. According to Monaco theory clinical form of the disease depends on maintaining or disrupting the reading frame. The purpose of the study was to determine frequency and location of deletions and duplications in the dystrophin gene, to determine the compliance between maintaining/disrupting the reading frame and clinical form of the disease and to check the effectiveness of MLPA (multiplex ligation-dependent probe amplification) in the detection of these mutations in hemizygous patients and heterozygous female carriers. The material is composed of combined results of molecular diagnosis carried out in years 2009-2012 in 180 unrelated patients referred with the diagnosis of DMD/BMD tested by use of MLPA. We identified 110 deletions, 22 duplication (in one patient two different duplications were detected) and 2 point mutations. Deletions involved mainly exons 45-54 and 3-21, whereas most duplications involved exons 3-18. The compliance with Monaco theory was 95% for deletions and 76% for duplications. Most of mutations in the dystrophin gene were localized in the hot spots - different for deletions and duplications. MLPA enabled their quick identification, exact localization and determination whether or not they maintained or disrupted the reading frame. MLPA was also effective in detection of deletions and duplications in female carriers. Copyright © 2014 Polish Neurological Society. Published by Elsevier Urban & Partner Sp. z o.o. All rights reserved.
An adaptive radiation model for the origin of new genefunctions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Francino, M. Pilar
2004-10-18
The evolution of new gene functions is one of the keys to evolutionary innovation. Most novel functions result from gene duplication followed by divergence. However, the models hitherto proposed to account for this process are not fully satisfactory. The classic model of neofunctionalization holds that the two paralogous gene copies resulting from a duplication are functionally redundant, such that one of them can evolve under no functional constraints and occasionally acquire a new function. This model lacks a convincing mechanism for the new gene copies to increase in frequency in the population and survive the mutational load expected to accumulatemore » under neutrality, before the acquisition of the rare beneficial mutations that would confer new functionality. The subfunctionalization model has been proposed as an alternative way to generate genes with altered functions. This model also assumes that new paralogous gene copies are functionally redundant and therefore neutral, but it predicts that relaxed selection will affect both gene copies such that some of the capabilities of the parent gene will disappear in one of the copies and be retained in the other. Thus, the functions originally present in a single gene will be partitioned between the two descendant copies. However, although this model can explain increases in gene number, it does not really address the main evolutionary question, which is the development of new biochemical capabilities. Recently, a new concept has been introduced into the gene evolution literature which is most likely to help solve this dilemma. The key point is to allow for a period of natural selection for the duplication per se, before new function evolves, rather than considering gene duplication to be neutral as in the previous models. Here, I suggest a new model that draws on the advantage of postulating selection for gene duplication, and proposes that bursts of adaptive gene amplification in response to specific selection pressures provide the raw material for the evolution of new function.« less
2014-01-01
Background Starch is the main source of carbon storage in the Archaeplastida. The starch biosynthesis pathway (sbp) emerged from cytosolic glycogen metabolism shortly after plastid endosymbiosis and was redirected to the plastid stroma during the green lineage divergence. The SBP is a complex network of genes, most of which are members of large multigene families. While some gene duplications occurred in the Archaeplastida ancestor, most were generated during the sbp redirection process, and the remaining few paralogs were generated through compartmentalization or tissue specialization during the evolution of the land plants. In the present study, we tested models of duplicated gene evolution in order to understand the evolutionary forces that have led to the development of SBP in angiosperms. We combined phylogenetic analyses and tests on the rates of evolution along branches emerging from major duplication events in six gene families encoding sbp enzymes. Results We found evidence of positive selection along branches following cytosolic or plastidial specialization in two starch phosphorylases and identified numerous residues that exhibited changes in volume, polarity or charge. Starch synthases, branching and debranching enzymes functional specializations were also accompanied by accelerated evolution. However, none of the sites targeted by selection corresponded to known functional domains, catalytic or regulatory. Interestingly, among the 13 duplications tested, 7 exhibited evidence of positive selection in both branches emerging from the duplication, 2 in only one branch, and 4 in none of the branches. Conclusions The majority of duplications were followed by accelerated evolution targeting specific residues along both branches. This pattern was consistent with the optimization of the two sub-functions originally fulfilled by the ancestral gene before duplication. Our results thereby provide strong support to the so-called “Escape from Adaptive Conflict” (EAC) model. Because none of the residues targeted by selection occurred in characterized functional domains, we propose that enzyme specialization has occurred through subtle changes in affinity, activity or interaction with other enzymes in complex formation, while the basic function defined by the catalytic domain has been maintained. PMID:24884572
Song, Xiaowei; Wang, Yajun; Tang, Yezhong
2013-01-01
As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication.
Song, Xiaowei; Wang, Yajun; Tang, Yezhong
2013-01-01
As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication. PMID:24349554
Diversity of human copy number variation and multicopy genes.
Sudmant, Peter H; Kitzman, Jacob O; Antonacci, Francesca; Alkan, Can; Malig, Maika; Tsalenko, Anya; Sampas, Nick; Bruhn, Laurakay; Shendure, Jay; Eichler, Evan E
2010-10-29
Copy number variants affect both disease and normal phenotypic variation, but those lying within heavily duplicated, highly identical sequence have been difficult to assay. By analyzing short-read mapping depth for 159 human genomes, we demonstrated accurate estimation of absolute copy number for duplications as small as 1.9 kilobase pairs, ranging from 0 to 48 copies. We identified 4.1 million "singly unique nucleotide" positions informative in distinguishing specific copies and used them to genotype the copy and content of specific paralogs within highly duplicated gene families. These data identify human-specific expansions in genes associated with brain development, reveal extensive population genetic diversity, and detect signatures consistent with gene conversion in the human species. Our approach makes ~1000 genes accessible to genetic studies of disease association.
Extensive Copy-Number Variation of Young Genes across Stickleback Populations
Eizaguirre, Christophe; Samonte, Irene E.; Kalbe, Martin; Lenz, Tobias L.; Stoll, Monika; Bornberg-Bauer, Erich; Milinski, Manfred; Reusch, Thorsten B. H.
2014-01-01
Duplicate genes emerge as copy-number variations (CNVs) at the population level, and remain copy-number polymorphic until they are fixed or lost. The successful establishment of such structural polymorphisms in the genome plays an important role in evolution by promoting genetic diversity, complexity and innovation. To characterize the early evolutionary stages of duplicate genes and their potential adaptive benefits, we combine comparative genomics with population genomics analyses to evaluate the distribution and impact of CNVs across natural populations of an eco-genomic model, the three-spined stickleback. With whole genome sequences of 66 individuals from populations inhabiting three distinct habitats, we find that CNVs generally occur at low frequencies and are often only found in one of the 11 populations surveyed. A subset of CNVs, however, displays copy-number differentiation between populations, showing elevated within-population frequencies consistent with local adaptation. By comparing teleost genomes to identify lineage-specific genes and duplications in sticklebacks, we highlight rampant gene content differences among individuals in which over 30% of young duplicate genes are CNVs. These CNV genes are evolving rapidly at the molecular level and are enriched with functional categories associated with environmental interactions, depicting the dynamic early copy-number polymorphic stage of genes during population differentiation. PMID:25474574
Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D
2015-02-01
The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.
Preston, Jill C.; Kellogg, Elizabeth A.
2006-01-01
Gene duplication is an important mechanism for the generation of evolutionary novelty. Paralogous genes that are not silenced may evolve new functions (neofunctionalization) that will alter the developmental outcome of preexisting genetic pathways, partition ancestral functions (subfunctionalization) into divergent developmental modules, or function redundantly. Functional divergence can occur by changes in the spatio-temporal patterns of gene expression and/or by changes in the activities of their protein products. We reconstructed the evolutionary history of two paralogous monocot MADS-box transcription factors, FUL1 and FUL2, and determined the evolution of sequence and gene expression in grass AP1/FUL-like genes. Monocot AP1/FUL-like genes duplicated at the base of Poaceae and codon substitutions occurred under relaxed selection mostly along the branch leading to FUL2. Following the duplication, FUL1 was apparently lost from early diverging taxa, a pattern consistent with major changes in grass floral morphology. Overlapping gene expression patterns in leaves and spikelets indicate that FUL1 and FUL2 probably share some redundant functions, but that FUL2 may have become temporally restricted under partial subfunctionalization to particular stages of floret development. These data have allowed us to reconstruct the history of AP1/FUL-like genes in Poaceae and to hypothesize a role for this gene duplication in the evolution of the grass spikelet. PMID:16816429
Chai, Wenbo; Si, Weina; Ji, Wei; Qin, Qianqian; Zhao, Manli; Jiang, Haiyang
2018-01-01
HD-Zip proteins represent the major transcription factors in higher plants, playing essential roles in plant development and stress responses. Foxtail millet is a crop to investigate the systems biology of millet and biofuel grasses and the HD-Zip gene family has not been studied in foxtail millet. For further investigation of the expression profile of the HD-Zip gene family in foxtail millet, a comprehensive genome-wide expression analysis was conducted in this study. We found 47 protein-encoding genes in foxtail millet using BLAST search tools; the putative proteins were classified into four subfamilies, namely, subfamilies I, II, III, and IV. Gene structure and motif analysis indicate that the genes in one subfamily were conserved. Promotor analysis showed that HD-Zip gene was involved in abiotic stress. Duplication analysis revealed that 8 (~17%) hdz genes were tandemly duplicated and 28 (58%) were segmentally duplicated; purifying duplication plays important roles in gene expansion. Microsynteny analysis revealed the maximum relationship in foxtail millet-sorghum and foxtail millet-rice. Expression profiling upon the abiotic stresses of drought and high salinity and the biotic stress of ABA revealed that some genes regulated responses to drought and salinity stresses via an ABA-dependent process, especially sihdz29 and sihdz45. Our study provides new insight into evolutionary and functional analyses of HD-Zip genes involved in environmental stress responses in foxtail millet.
The Evolutionary Fates of a Large Segmental Duplication in Mouse
Morgan, Andrew P.; Holt, J. Matthew; McMullan, Rachel C.; Bell, Timothy A.; Clayshulte, Amelia M.-F.; Didion, John P.; Yadgary, Liran; Thybert, David; Odom, Duncan T.; Flicek, Paul; McMillan, Leonard; de Villena, Fernando Pardo-Manuel
2016-01-01
Gene duplication and loss are major sources of genetic polymorphism in populations, and are important forces shaping the evolution of genome content and organization. We have reconstructed the origin and history of a 127-kbp segmental duplication, R2d, in the house mouse (Mus musculus). R2d contains a single protein-coding gene, Cwc22. De novo assembly of both the ancestral (R2d1) and the derived (R2d2) copies reveals that they have been subject to nonallelic gene conversion events spanning tens of kilobases. R2d2 is also a hotspot for structural variation: its diploid copy number ranges from zero in the mouse reference genome to >80 in wild mice sampled from around the globe. Hemizygosity for high copy-number alleles of R2d2 is associated in cis with meiotic drive; suppression of meiotic crossovers; and copy-number instability, with a mutation rate in excess of 1 per 100 transmissions in some laboratory populations. Our results provide a striking example of allelic diversity generated by duplication and demonstrate the value of de novo assembly in a phylogenetic context for understanding the mutational processes affecting duplicate genes. PMID:27371833
Adomako-Ankomah, Yaw; English, Elizabeth D.; Danielson, Jeffrey J.; Pernas, Lena F.; Parker, Michelle L.; Boulanger, Martin J.; Dubey, Jitender P.; Boyle, Jon P.
2016-01-01
In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum. Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA+ paralogs. Additionally, we found that exogenous expression of an HMA+ paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. PMID:26920761
Genotype-phenotype characterization in 13 individuals with chromosome Xp11.22 duplications.
Grams, Sarah E; Argiropoulos, Bob; Lines, Matthew; Chakraborty, Pranesh; Mcgowan-Jordan, Jean; Geraghty, Michael T; Tsang, Marilyn; Eswara, Marthand; Tezcan, Kamer; Adams, Kelly L; Linck, Leesa; Himes, Patricia; Kostiner, Dana; Zand, Dina J; Stalker, Heather; Driscoll, Daniel J; Huang, Taosheng; Rosenfeld, Jill A; Li, Xu; Chen, Emily
2016-04-01
We report 13 new individuals with duplications in Xp11.22-p11.23. The index family has one male and two female members in three generations with mild-severe intellectual disability (ID), speech delay, dysmorphic features, early puberty, constipation, and/or hand and foot abnormalities. Affected individuals were found to have two small duplications in Xp11.22 at nucleotide position (hg19) 50,112,063-50,456,458 bp (distal) and 53,160,114-53,713,154 bp (proximal). Collectively, these two regions include 14 RefSeq genes, prompting collection of a larger cohort of patients, in an attempt to delineate critical genes associated with the observed phenotype. In total, we have collected data on nine individuals with duplications overlapping the distal duplication region containing SHROOM4 and DGKK and eight individuals overlapping the proximal region including HUWE1. Duplications of HUWE1 have been previously associated with non-syndromic ID. Our data, with previously published reports, suggest that duplications involving SHROOM4 and DGKK may represent a new syndromic X-linked ID critical region associated with mild to severe ID, speech delay +/- dysarthria, attention deficit disorder, precocious puberty, constipation, and motor delay. We frequently observed foot abnormalities, 5th finger clinodactyly, tapering fingers, constipation, and exercise intolerance in patients with duplications of these two genes. Regarding duplications including the proximal region, our observations agree with previous studies, which have found associations with intellectual disability. In addition, expressive language delay, failure to thrive, motor delay, and 5th finger clinodactyly were also frequently observed in patients with the proximal duplication. © 2015 Wiley Periodicals, Inc.
Molecular evolution of the HoxA cluster in the three major gnathostome lineages
Chiu, Chi-hua; Amemiya, Chris; Dewar, Ken; Kim, Chang-Bae; Ruddle, Frank H.; Wagner, Günter P.
2002-01-01
The duplication of Hox clusters and their maintenance in a lineage has a prominent but little understood role in chordate evolution. Here we examined how Hox cluster duplication may influence changes in cluster architecture and patterns of noncoding sequence evolution. We sequenced the entire duplicated HoxAa and HoxAb clusters of zebrafish (Danio rerio) and extended the 5′ (posterior) part of the HoxM (HoxA-like) cluster of horn shark (Heterodontus francisci) containing the hoxa11 and hoxa13 orthologs as well as intergenic and flanking noncoding sequences. The duplicated HoxA clusters in zebrafish each house considerably fewer genes and are dramatically shorter than the single HoxA clusters of human and horn shark. We compared the intergenic sequences of the HoxA clusters of human, horn shark, zebrafish (Aa, Ab), and striped bass and found extensive conservation of noncoding sequence motifs, i.e., phylogenetic footprints, between the human and horn shark, representing two of the three gnathostome lineages. These are putative cis-regulatory elements that may play a role in the regulation of the ancestral HoxA cluster. In contrast, homologous regions of the duplicated HoxAa and HoxAb clusters of zebrafish and the HoxA cluster of striped bass revealed a striking loss of conservation of these putative cis-regulatory sequences in the 3′ (anterior) segment of the cluster, where zebrafish only retains single representatives of group 1, 3, 4, and 5 (HoxAa) and group 2 (HoxAb) genes and in the 5′ part of the clusters, where zebrafish retains two copies of the group 13, 11, and 9 genes, i.e., AbdB-like genes. In analyzing patterns of cis-sequence evolution in the 5′ part of the clusters, we explicitly looked for evidence of complementary loss of conserved noncoding sequences, as predicted by the duplication-degeneration-complementation model in which genetic redundancy after gene duplication is resolved because of the fixation of complementary degenerative mutations. Our data did not yield evidence supporting this prediction. We conclude that changes in the pattern of cis-sequence conservation after Hox cluster duplication are more consistent with being the outcome of adaptive modification rather than passive mechanisms that erode redundancy created by the duplication event. These results support the view that genome duplications may provide a mechanism whereby master control genes undergo radical modifications conducive to major alterations in body plan. Such genomic revolutions may contribute significantly to the evolutionary process. PMID:11943847
Holland, Peter W H
2013-01-01
Many homeobox genes encode transcription factors with regulatory roles in animal and plant development. Homeobox genes are found in almost all eukaryotes, and have diversified into 11 gene classes and over 100 gene families in animal evolution, and 10 to 14 gene classes in plants. The largest group in animals is the ANTP class which includes the well-known Hox genes, plus other genes implicated in development including ParaHox (Cdx, Xlox, Gsx), Evx, Dlx, En, NK4, NK3, Msx, and Nanog. Genomic data suggest that the ANTP class diversified by extensive tandem duplication to generate a large array of genes, including an NK gene cluster and a hypothetical ProtoHox gene cluster that duplicated to generate Hox and ParaHox genes. Expression and functional data suggest that NK, Hox, and ParaHox gene clusters acquired distinct roles in patterning the mesoderm, nervous system, and gut. The PRD class is also diverse and includes Pax2/5/8, Pax3/7, Pax4/6, Gsc, Hesx, Otx, Otp, and Pitx genes. PRD genes are not generally arranged in ancient genomic clusters, although the Dux, Obox, and Rhox gene clusters arose in mammalian evolution as did several non-clustered PRD genes. Tandem duplication and genome duplication expanded the number of homeobox genes, possibly contributing to the evolution of developmental complexity, but homeobox gene loss must not be ignored. Evolutionary changes to homeobox gene expression have also been documented, including Hox gene expression patterns shifting in concert with segmental diversification in vertebrates and crustaceans, and deletion of a Pitx1 gene enhancer in pelvic-reduced sticklebacks. WIREs Dev Biol 2013, 2:31-45. doi: 10.1002/wdev.78 For further resources related to this article, please visit the WIREs website. The author declares that he has no conflicts of interest. Copyright © 2012 Wiley Periodicals, Inc.
Carmona-Antoñanzas, Greta; Zheng, Xiaozhong; Tocher, Douglas R; Leaver, Michael J
2016-10-10
Fatty acyl elongase 5 (elovl5) is a critical enzyme in the vertebrate biosynthetic pathway which produces the physiologically essential long-chain polyunsaturated fatty acids (LC-PUFA), docosahexenoic acid (DHA), and eicosapentenoic acid (EPA) from 18 carbon fatty acids precursors. In contrast to most other vertebrates, Atlantic salmon possess two copies of elovl5 (elovl5a and elovl5b) as a result of a whole-genome duplication (WGD) which occurred at the base of the salmonid lineage. WGDs have had a major influence on vertebrate evolution, providing extra genetic material, enabling neofunctionalization to accelerate adaptation and speciation. However, little is known about the mechanisms by which such duplicated homeologous genes diverge. Here we show that homeologous Atlantic salmon elovl5a and elovl5b genes have been asymmetrically colonised by transposon-like elements. Identical locations and identities of insertions are also present in the rainbow trout duplicate elovl5 genes, but not in the nearest extant representative preduplicated teleost, the northern pike. Both elovl5 salmon duplicates possessed conserved regulatory elements that promoted Srebp1- and Srebp2-dependent transcription, and differences in the magnitude of Srebp response between promoters could be attributed to a tandem duplication of SRE and NF-Y cofactor binding sites in elovl5b. Furthermore, an insertion in the promoter region of elovl5a confers responsiveness to Lxr/Rxr transcriptional activation. Our results indicate that most, but not all, transposon mobilisation into elovl5 genes occurred after the split from the common ancestor of pike and salmon, but before more recent salmonid speciations, and that divergence of elovl5 regulatory regions have enabled neofuntionalization by promoting differential expression of these homeologous genes. Copyright © 2016 Elsevier B.V. All rights reserved.
Hu, Catherine; Lin, Siou-ying; Chi, Wen-tzu; Charng, Yee-yung
2012-02-01
The duplication and divergence of heat stress (HS) response genes might help plants adapt to varied HS conditions, but little is known on the topic. Here, we examined the evolution and function of Arabidopsis (Arabidopsis thaliana) mitochondrial GrpE (Mge) proteins. GrpE acts as a nucleotide-exchange factor in the Hsp70/DnaK chaperone machinery. Genomic data show that AtMge1 and AtMge2 arose from a recent whole-genome duplication event. Phylogenetic analysis indicated that duplication and preservation of Mges occurred independently in many plant species, which suggests a common tendency in the evolution of the genes. Intron retention contributed to the divergence of the protein structure of Mge paralogs in higher plants. In both Arabidopsis and tomato (Solanum lycopersicum), Mge1 is induced by ultraviolet B light and Mge2 is induced by heat, which suggests regulatory divergence of the genes. Consistently, AtMge2 but not AtMge1 is under the control of HsfA1, the master regulator of the HS response. Heterologous expression of AtMge2 but not AtMge1 in the temperature-sensitive Escherichia coli grpE mutant restored its growth at 43°C. Arabidopsis T-DNA knockout lines under different HS regimes revealed that Mge2 is specifically required for tolerating prolonged exposure to moderately high temperature, as compared with the need of the heat shock protein 101 and the HS-associated 32-kD protein for short-term extreme heat. Therefore, with duplication and subfunctionalization, one copy of the Arabidopsis Mge genes became specialized in a distinct type of HS. We provide direct evidence supporting the connection between gene duplication and adaptation to environmental stress.
Farashi, Samaneh; Faramarzi Garous, Negin; Zeinali, Fatemeh; Vakili, Shadi; Ashki, Mehri; Imanian, Hashem; Najmabadi, Hossein; Azarkeivan, Azita; Tamaddoni, Ahmad
2015-01-01
α-Thalassemia (α-thal) is a common genetic disorder in Iran and many parts of the world. Genetic defects in the α-globin gene cluster can result in α-thal that may develop into a clinical phenotype varying from almost asymptomatic to a lethal hemolytic anemia. Loss of one functional α gene, indicated as heterozygous α(+)-thal, shows minor hematological abnormalities. Homozygosity for α(+)- or heterozygosity for α(0)-thal have more severe hematological abnormalities due to a markedly reduced α chain output. At the molecular level, the absence of three α-globin genes resulting from the compound heterozygous state for α(0)- and α(+)-thal, lead to Hb H disease. Here we present a 21 nucleotide (nt) duplication consisting of six amino acids and 3 bp of intronic sequence at the exon-intron boundary, in both the α-globin genes, detected by direct DNA sequencing. This duplication was identified in three patients originating from two different Iranian ethnic groups and one Arab during more than 12 years. The clinical presentation of these individuals varies widely from a mild asymptomatic anemia (heterozygote in α1-globin gene) to a severely anemic state, diagnosed as an Hb H individual requiring blood transfusion (duplication on the α2-globin gene in combination with the - -(MED) double α-globin gene deletion). The third individual, who was homozygous for this nt duplication on the α1-globin gene, showed severe hypochromic microcytic anemia and splenomegaly. In the last decade, numerous α-globin mutations have demonstrated the necessity of prenatal diagnosis (PND) for α-thal, and this study has contributed another mutation as important enough that needs to be considered.
Thompson, Ella; Dragovic, Rebecca L; Stephenson, Sally-Anne; Eccles, Diana M; Campbell, Ian G; Dobrovic, Alexander
2005-04-29
The FANCA gene is one of the genes in which mutations lead to Fanconi anaemia, a rare autosomal recessive disorder characterised by congenital abnormalities, bone marrow failure, and predisposition to malignancy. FANCA is also a potential breast and ovarian cancer susceptibility gene. A novel allele was identified which has a tandem duplication of a 13 base pair sequence in the promoter region. We screened germline DNA from 352 breast cancer patients, 390 ovarian cancer patients and 256 normal controls to determine if the presence of either of these two alleles was associated with an increased risk of breast or ovarian cancer. The duplication allele had a frequency of 0.34 in the normal controls. There was a non-significant decrease in the frequency of the duplication allele in breast cancer patients. The frequency of the duplication allele was significantly decreased in ovarian cancer patients. However, when malignant and benign tumours were considered separately, the decrease was only significant in benign tumours. The allele with the tandem duplication does not appear to modify breast cancer risk but may act as a low penetrance protective allele for ovarian cancer.
Thompson, Ella; Dragovic, Rebecca L; Stephenson, Sally-Anne; Eccles, Diana M; Campbell, Ian G; Dobrovic, Alexander
2005-01-01
The FANCA gene is one of the genes in which mutations lead to Fanconi anaemia, a rare autosomal recessive disorder characterised by congenital abnormalities, bone marrow failure, and predisposition to malignancy. FANCA is also a potential breast and ovarian cancer susceptibility gene. A novel allele was identified which has a tandem duplication of a 13 base pair sequence in the promoter region. Methods We screened germline DNA from 352 breast cancer patients, 390 ovarian cancer patients and 256 normal controls to determine if the presence of either of these two alleles was associated with an increased risk of breast or ovarian cancer. Results The duplication allele had a frequency of 0.34 in the normal controls. There was a non-significant decrease in the frequency of the duplication allele in breast cancer patients. The frequency of the duplication allele was significantly decreased in ovarian cancer patients. However, when malignant and benign tumours were considered separately, the decrease was only significant in benign tumours. Conclusion The allele with the tandem duplication does not appear to modify breast cancer risk but may act as a low penetrance protective allele for ovarian cancer. PMID:15860134
PGDD: a database of gene and genome duplication in plants
Lee, Tae-Ho; Tang, Haibao; Wang, Xiyin; Paterson, Andrew H.
2013-01-01
Genome duplication (GD) has permanently shaped the architecture and function of many higher eukaryotic genomes. The angiosperms (flowering plants) are outstanding models in which to elucidate consequences of GD for higher eukaryotes, owing to their propensity for chromosomal duplication or even triplication in a few cases. Duplicated genome structures often require both intra- and inter-genome alignments to unravel their evolutionary history, also providing the means to deduce both obvious and otherwise-cryptic orthology, paralogy and other relationships among genes. The burgeoning sets of angiosperm genome sequences provide the foundation for a host of investigations into the functional and evolutionary consequences of gene and GD. To provide genome alignments from a single resource based on uniform standards that have been validated by empirical studies, we built the Plant Genome Duplication Database (PGDD; freely available at http://chibba.agtec.uga.edu/duplication/), a web service providing synteny information in terms of colinearity between chromosomes. At present, PGDD contains data for 26 plants including bryophytes and chlorophyta, as well as angiosperms with draft genome sequences. In addition to the inclusion of new genomes as they become available, we are preparing new functions to enhance PGDD. PMID:23180799
Population Level Purifying Selection and Gene Expression Shape Subgenome Evolution in Maize.
Pophaly, Saurabh D; Tellier, Aurélien
2015-12-01
The maize ancestor experienced a recent whole-genome duplication (WGD) followed by gene erosion which generated two subgenomes, the dominant subgenome (maize1) experiencing fewer deletions than maize2. We take advantage of available extensive polymorphism and gene expression data in maize to study purifying selection and gene expression divergence between WGD retained paralog pairs. We first report a strong correlation in nucleotide diversity between duplicate pairs, except for upstream regions. We then show that maize1 genes are under stronger purifying selection than maize2. WGD retained genes have higher gene dosage and biased Gene Ontologies consistent with previous studies. The relative gene expression of paralogs across tissues demonstrates that 98% of duplicate pairs have either subfunctionalized in a tissuewise manner or have diverged consistently in their expression thereby preventing functional complementation. Tissuewise subfunctionalization seems to be a hallmark of transcription factors, whereas consistent repression occurs for macromolecular complexes. We show that dominant gene expression is a strong determinant of the strength of purifying selection, explaining the inferred stronger negative selection on maize1 genes. We propose a novel expression-based classification of duplicates which is more robust to explain observed polymorphism patterns than the subgenome location. Finally, upstream regions of repressed genes exhibit an enrichment in transposable elements which indicates a possible mechanism for expression divergence. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The fate of the duplicated androgen receptor in fishes: a late neofunctionalization event?
2008-01-01
Background Based on the observation of an increased number of paralogous genes in teleost fishes compared with other vertebrates and on the conserved synteny between duplicated copies, it has been shown that a whole genome duplication (WGD) occurred during the evolution of Actinopterygian fish. Comparative phylogenetic dating of this duplication event suggests that it occurred early on, specifically in teleosts. It has been proposed that this event might have facilitated the evolutionary radiation and the phenotypic diversification of the teleost fish, notably by allowing the sub- or neo-functionalization of many duplicated genes. Results In this paper, we studied in a wide range of Actinopterygians the duplication and fate of the androgen receptor (AR, NR3C4), a nuclear receptor known to play a key role in sex-determination in vertebrates. The pattern of AR gene duplication is consistent with an early WGD event: it has been duplicated into two genes AR-A and AR-B after the split of the Acipenseriformes from the lineage leading to teleost fish but before the divergence of Osteoglossiformes. Genomic and syntenic analyses in addition to lack of PCR amplification show that one of the duplicated copies, AR-B, was lost in several basal Clupeocephala such as Cypriniformes (including the model species zebrafish), Siluriformes, Characiformes and Salmoniformes. Interestingly, we also found that, in basal teleost fish (Osteoglossiformes and Anguilliformes), the two copies remain very similar, whereas, specifically in Percomorphs, one of the copies, AR-B, has accumulated substitutions in both the ligand binding domain (LBD) and the DNA binding domain (DBD). Conclusion The comparison of the mutations present in these divergent AR-B with those known in human to be implicated in complete, partial or mild androgen insensitivity syndrome suggests that the existence of two distinct AR duplicates may be correlated to specific functional differences that may be connected to the well-known plasticity of sex determination in fish. This suggests that three specific events have shaped the present diversity of ARs in Actinopterygians: (i) early WGD, (ii) parallel loss of one duplicate in several lineages and (iii) putative neofunctionalization of the same duplicate in percomorphs, which occurred a long time after the WGD. PMID:19094205
Siggs, Owen M; Javadiyan, Shari; Sharma, Shiwani; Souzeau, Emmanuelle; Lower, Karen M; Taranath, Deepa A; Black, Jo; Pater, John; Willoughby, John G; Burdon, Kathryn P; Craig, Jamie E
2017-01-01
Congenital cataract is a rare but severe paediatric visual impediment, often caused by variants in one of several crystallin genes that produce the bulk of structural proteins in the lens. Here we describe a pedigree with autosomal dominant isolated congenital cataract and linkage to the crystallin gene cluster on chromosome 22. No rare single nucleotide variants or short indels were identified by exome sequencing, yet copy number variant analysis revealed a duplication spanning both CRYBB1 and CRYBA4. While the CRYBA4 duplication was complete, the CRYBB1 duplication was not, with the duplicated CRYBB1 product predicted to create a gain of function allele. This association suggests a new genetic mechanism for the development of isolated congenital cataract. PMID:28272538
Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping
2018-04-23
The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .
Gene copy number evolution during tetraploid cotton radiation.
Rong, J; Feltus, F A; Liu, L; Lin, L; Paterson, A H
2010-11-01
After polyploid formation, retention or loss of duplicated genes is not random. Genes with some functional domains are convergently restored to 'singleton' state after many independent genome duplications, and have been referred to as 'duplication-resistant' (DR) genes. To further explore the timeframe for their restoration to the singleton state, 27 cotton homologs of genes found to be 'DR' in Arabidopsis were selected based on diagnostic Pfam domains. Their copy numbers were studied using southern hybridization and sequence analysis in five tetraploid species and their ancestral A and D genome diploids. DR genes had significantly lower copy number than gene families hybridizing to randomly selected cotton ESTs. Three DR genes showed complete loss of D genome-derived homoeologs in some or all tetraploid species. Prior analysis has shown gene loss in polyploid cotton to be rare, and herein only one randomly selected gene showed loss of a homoeolog in only one of the five tetraploid species (Gossypium mustelinum). BAC sequencing confirmed two cases of gene loss in tetraploid cotton. Divergence among 5' sequences of DR genes amplified from G. arboreum, G. raimondii, and Gossypioides kirkii was correlated with gene copy number. These results show that genes containing Pfam domains associated with duplication resistance in Arabidopsis have also been preferentially restored to low copy number after a more recent polyploidization event in cotton. In tetraploid cotton, genes from the progenitor D genome seem to experience more gene copy number divergence than genes from the A genome. Together with D subgenome-biased alterations in gene expression, perhaps gene loss may contribute to the relatively larger portion of quantitative trait variation attributable to D than A subgenome chromosomes of tetraploid cotton.
Duplication of 17(p11.2p11.2) in a male child with autism and severe language delay.
Nakamine, Alisa; Ouchanov, Leonid; Jiménez, Patricia; Manghi, Elina R; Esquivel, Marcela; Monge, Silvia; Fallas, Marietha; Burton, Barbara K; Szomju, Barbara; Elsea, Sarah H; Marshall, Christian R; Scherer, Stephen W; McInnes, L Alison
2008-03-01
Duplications of 17(p11.2p11.2) have been associated with various behavioral manifestations including attention deficits, obsessive-compulsive symptoms, autistic traits, and language delay. We are conducting a genetic study of autism and are screening all cases for submicroscopic chromosomal abnormalities, in addition to standard karyotyping, and fragile X testing. Using array-based comparative genomic hybridization analysis of data from the Affymetrix GeneChip(R) Human Mapping Array set, we detected a duplication of approximately 3.3 Mb on chromosome 17p11.2 in a male child with autism and severe expressive language delay. The duplication was confirmed by measuring the copy number of genomic DNA using quantitative polymerase chain reaction. Gene expression analyses revealed increased expression of three candidate genes for the Smith-Magenis neurobehavioral phenotype, RAI1, DRG2, and RASD1, in transformed lymphocytes from Case 81A, suggesting gene dosage effects. Our results add to a growing body of evidence suggesting that duplications of 17(p11.2p11.2) result in language delay as well as autism and related phenotypes. As Smith-Magenis syndrome is also associated with language delay, a gene involved in acquisition of language may lie within this interval. Whether a parent of origin effect, gender of the case, the presence of allelic variation, or changes in expression of genes outside the breakpoints influence the resultant phenotype remains to be determined. (c) 2007 Wiley-Liss, Inc.
Preston, Jill C; Jorgensen, Stacy A; Jha, Suryatapa G
2014-01-01
Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae), many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene Suppressor Of Overexpression of Constans 1 (SOC1) in the short-lived perennial Petunia hybrida (petunia, Solanaceae). Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes Unshaven (UNS) and Floral Binding Protein 21 (FBP21), but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods.
Preston, Jill C.; Jorgensen, Stacy A.; Jha, Suryatapa G.
2014-01-01
Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae), many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) in the short-lived perennial Petunia hybrida (petunia, Solanaceae). Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes UNSHAVEN (UNS) and FLORAL BINDING PROTEIN 21 (FBP21), but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods. PMID:24787903
Japanese neuropathy patients with peripheral myelin protein-22 gene aneuploidy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lebo, R.V.; Li, L.Y.; Flandermeyer, R.R.
1994-09-01
Peripheral myelin protein (PMP-22) gene aneuploidy results in Charcot-Marie-Tooth disease Type 1A (CMT1A) and the Hereditary Neuropathy with Liability to Pressure Palsy (HNPP) in Japanese patients as well as Caucasian Americans. Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, results when expression of one of at least seven genes is defective. CMT1A, about half of all CMT mutations, is usually associated with a duplication spanning the peripheral myelin protein-22 gene on distal chromosome band 17p11.2. Autosomal dominant HNPP (hereditary pressure and sensory neuropathy, HPSN) results from a deletion of the CMT1A gene region. Multicolor in situ hybridization with PMP-22 genemore » region probe characterized HNPP deletion reliably and detected all different size duplications reported previously. In summary, 72% of 28 Japanese CMT1 (HMSNI) patients tested had the CMT1A duplication, while none of the CMT2 (HMSNII) or CMT3 (HMSNIII) patients had a duplication. Three cases of HNPP were identified by deletion of the CMT1A gene region on chromosome 17p. HNPP and CMT1A have been reported to result simultaneously from the same unequal recombination event. The lower frequency of HNPP compared to CMT1A suggests that HNPP patients have a lower reproductive fitness than CMT1A patients. This result, along with a CMT1A duplication found in an Asian Indian family, demonstrates the broad geographic distribution and high frequency of PMP-22 gene aneuploidy.« less
Den Dunnen, J T; Grootscholten, P M; Bakker, E; Blonden, L A; Ginjaar, H B; Wapenaar, M C; van Paassen, H M; van Broeckhoven, C; Pearson, P L; van Ommen, G J
1989-01-01
We have studied 34 Becker and 160 Duchenne muscular dystrophy (DMD) patients with the dystrophin cDNA, using conventional blots and FIGE analysis. One hundred twenty-eight mutations (65%) were found, 115 deletions and 13 duplications, of which 106 deletions and 11 duplications could be precisely mapped in relation to both the mRNA and the major and minor mutation hot spots. Junction fragments, ideal markers for carrier detection, were found in 23 (17%) of the 128 cases. We identified eight new cDNA RFLPs within the DMD gene. With the use of cDNA probes we have completed the long-range map of the DMD gene, by the identification of a 680-kb SfiI fragment containing the gene's 3' end. The size of the DMD gene is now determined to be about 2.3 million basepairs. The combination of cDNA hybridizations with long-range analysis of deletion and duplication patients yields a global picture of the exon spacing within the dystrophin gene. The gene shows a large variability of intron size, ranging from only a few kilobases to 160-180 kb for the P20 intron. Images Figure 1 Figure 4 PMID:2573997
Evolutionary history of the enolase gene family.
Tracy, M R; Hedges, S B
2000-12-23
The enzyme enolase [EC 4.2.1.11] is found in all organisms, with vertebrates exhibiting tissue-specific isozymes encoded by three genes: alpha (alpha), beta (beta), and gamma (gamma) enolase. Limited taxonomic sampling of enolase has obscured the timing of gene duplication events. To help clarify the evolutionary history of the gene family, cDNAs were sequenced from six taxa representing major lineages of vertebrates: Chiloscyllium punctatum (shark), Amia calva (bowfin), Salmo trutta (trout), Latimeria chalumnae (coelacanth), Lepidosiren paradoxa (South American lungfish), and Neoceratodus forsteri (Australian lungfish). Phylogenetic analysis of all enolase and related gene sequences revealed an early gene duplication event prior to the last common ancestor of living organisms. Several distantly related archaebacterial sequences were designated as 'enolase-2', whereas all other enolase sequences were designated 'enolase-1'. Two of the three isozymes of enolase-1, alpha- and beta-enolase, were discovered in actinopterygian, sarcopterygian, and chondrichthian fishes. Phylogenetic analysis of vertebrate enolases revealed that the two gene duplications leading to the three isozymes of enolase-1 occurred subsequent to the divergence of living agnathans, near the Proterozoic/Phanerozoic boundary (approximately 550Mya). Two copies of enolase, designated alpha(1) and alpha(2), were found in the trout and are presumed to be the result of a genome duplication event.
Ye, Fei; Lan, Xu-E; Zhu, Wen-Bo; You, Ping
2016-05-09
Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects.
Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping
2016-01-01
Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299
Alternative splicing and the evolution of phenotypic novelty.
Bush, Stephen J; Chen, Lu; Tovar-Corona, Jaime M; Urrutia, Araxi O
2017-02-05
Alternative splicing, a mechanism of post-transcriptional RNA processing whereby a single gene can encode multiple distinct transcripts, has been proposed to underlie morphological innovations in multicellular organisms. Genes with developmental functions are enriched for alternative splicing events, suggestive of a contribution of alternative splicing to developmental programmes. The role of alternative splicing as a source of transcript diversification has previously been compared to that of gene duplication, with the relationship between the two extensively explored. Alternative splicing is reduced following gene duplication with the retention of duplicate copies higher for genes which were alternatively spliced prior to duplication. Furthermore, and unlike the case for overall gene number, the proportion of alternatively spliced genes has also increased in line with the evolutionary diversification of cell types, suggesting alternative splicing may contribute to the complexity of developmental programmes. Together these observations suggest a prominent role for alternative splicing as a source of functional innovation. However, it is unknown whether the proliferation of alternative splicing events indeed reflects a functional expansion of the transcriptome or instead results from weaker selection acting on larger species, which tend to have a higher number of cell types and lower population sizes.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).
Alternative splicing and the evolution of phenotypic novelty
Bush, Stephen J.; Chen, Lu; Tovar-Corona, Jaime M.
2017-01-01
Alternative splicing, a mechanism of post-transcriptional RNA processing whereby a single gene can encode multiple distinct transcripts, has been proposed to underlie morphological innovations in multicellular organisms. Genes with developmental functions are enriched for alternative splicing events, suggestive of a contribution of alternative splicing to developmental programmes. The role of alternative splicing as a source of transcript diversification has previously been compared to that of gene duplication, with the relationship between the two extensively explored. Alternative splicing is reduced following gene duplication with the retention of duplicate copies higher for genes which were alternatively spliced prior to duplication. Furthermore, and unlike the case for overall gene number, the proportion of alternatively spliced genes has also increased in line with the evolutionary diversification of cell types, suggesting alternative splicing may contribute to the complexity of developmental programmes. Together these observations suggest a prominent role for alternative splicing as a source of functional innovation. However, it is unknown whether the proliferation of alternative splicing events indeed reflects a functional expansion of the transcriptome or instead results from weaker selection acting on larger species, which tend to have a higher number of cell types and lower population sizes. This article is part of the themed issue ‘Evo-devo in the genomics era, and the origins of morphological diversity’. PMID:27994117
Case report of individual with cutaneous immunodeficiency and novel 1p36 duplication.
Hatter, Alyn D; Soler, David C; Curtis, Christine; Cooper, Kevin D; McCormick, Thomas S
2016-01-01
Crusted or Norwegian scabies is an infectious skin dermatopathology usually associated with an underlying immunodeficiency condition. It is caused when the mite Sarcoptes scabiei infects the skin, and the immune system is unable to control its spread, leading to a massive hyperinfestation with a simultaneous inflammatory and hyperkeratotic reaction. This is the first report of a novel 1p36 duplication associated with a recurrent infection of crusted scabies. We describe a 34-year-old patient with a cutaneous immunodeficiency characterized by recurrent crusted scabies infestation, diffuse tinea, and recurrent staphylococcal cellulitis, who we suspected had an undiagnosed syndrome. The patient also suffered from mental retardation, renal failure, and premature senescence. A cytogenetic fluorescence in situ hybridization analysis revealed a 9.34 Mb duplication within the short (p) arm of chromosome 1, precisely from 1p36.11 to 1p36.21, with an adjacent 193 kb copy gain entirely within 1p36.11. In addition, chromosome 4 had a 906 kb gain in 4p16.1 and chromosome 9 had a 81 kb copy gain in 9p24.3. Over 100 genes localized within these duplicated regions. Gene expression array revealed 82 genes whose expression changed >1.5-fold compared to a healthy age-matched skin control, but among them only the lipolytic enzyme arylacetamide deacetylase-like 3 was found within the duplicated 1p36 region of chromosome 1. Although genetic duplications in the 1p36 region have been previously described, our report describes a novel duplicative variant within the 1p36 region. The patient did not have a past history of immunosuppression but was afflicted by a recurrent case of crusted scabies, raising the possibility that the recurrent infection was associated with the 1p36 genetic duplication. To our knowledge, the specific duplicated sequence between 1p36.11 and p36.21 found in our patient has never been previously reported. We reviewed and compared the clinical, genotyping, and gene microarray results of our patient in order to characterize this novel 1p36 duplication syndrome, which might have contributed to the recurrent scabies infection in this patient.
Colón, Maritrini; Hernández, Fabiola; López, Karla; Quezada, Héctor; González, James; López, Geovani; Aranda, Cristina; González, Alicia
2011-01-01
Background Gene duplication is a key evolutionary mechanism providing material for the generation of genes with new or modified functions. The fate of duplicated gene copies has been amply discussed and several models have been put forward to account for duplicate conservation. The specialization model considers that duplication of a bifunctional ancestral gene could result in the preservation of both copies through subfunctionalization, resulting in the distribution of the two ancestral functions between the gene duplicates. Here we investigate whether the presumed bifunctional character displayed by the single branched chain amino acid aminotransferase present in K. lactis has been distributed in the two paralogous genes present in S. cerevisiae, and whether this conservation has impacted S. cerevisiae metabolism. Principal Findings Our results show that the KlBat1 orthologous BCAT is a bifunctional enzyme, which participates in the biosynthesis and catabolism of branched chain aminoacids (BCAAs). This dual role has been distributed in S. cerevisiae Bat1 and Bat2 paralogous proteins, supporting the specialization model posed to explain the evolution of gene duplications. BAT1 is highly expressed under biosynthetic conditions, while BAT2 expression is highest under catabolic conditions. Bat1 and Bat2 differential relocalization has favored their physiological function, since biosynthetic precursors are generated in the mitochondria (Bat1), while catabolic substrates are accumulated in the cytosol (Bat2). Under respiratory conditions, in the presence of ammonium and BCAAs the bat1Δ bat2Δ double mutant shows impaired growth, indicating that Bat1 and Bat2 could play redundant roles. In K. lactis wild type growth is independent of BCAA degradation, since a Klbat1Δ mutant grows under this condition. Conclusions Our study shows that BAT1 and BAT2 differential expression and subcellular relocalization has resulted in the distribution of the biosynthetic and catabolic roles of the ancestral BCAT in two isozymes improving BCAAs metabolism and constituting an adaptation to facultative metabolism. PMID:21267457
Root hairs, trichomes and the evolution of duplicate genes.
Kellogg, E A
2001-12-01
The MYB-class proteins WEREWOLF and GLABRA1 are functionally interchangeable, even though one is normally expressed solely in roots and the other only in shoots. This shows that their different functions are the result of the modification of cis-regulatory sequences over evolutionary time. The two genes thus provide an example of morphological diversification created by gene duplication and changes in regulation.
Bijlsma, E K; Collins, A; Papa, F T; Tejada, M I; Wheeler, P; Peeters, E A J; Gijsbers, A C J; van de Kamp, J M; Kriek, M; Losekoot, M; Broekma, A J; Crolla, J A; Pollazzon, M; Mucciolo, M; Katzaki, E; Disciglio, V; Ferreri, M I; Marozza, A; Mencarelli, M A; Castagnini, C; Dosa, L; Ariani, F; Mari, F; Canitano, R; Hayek, G; Botella, M P; Gener, B; Mínguez, M; Renieri, A; Ruivenkamp, C A L
2012-06-01
Duplications leading to functional disomy of chromosome Xq28, including MECP2 as the critical dosage-sensitive gene, are associated with a distinct clinical phenotype in males, characterized by severe mental retardation, infantile hypotonia, progressive neurologic impairment, recurrent infections, bladder dysfunction, and absent speech. Female patients with Xq duplications including MECP2 are rare. Only recently submicroscopic duplications of this region on Xq28 have been recognized in four females, and a triplication in a fifth, all in combination with random X-chromosome inactivation (XCI). Based on this small series, it was concluded that in females with MECP2 duplication and random XCI, the typical symptoms of affected boys are not present. We present clinical and molecular data on a series of five females with an Xq28 duplication including the MECP2 gene, both isolated and as the result of a translocation, and compare them with the previously reported cases of small duplications in females. The collected data indicate that the associated phenotype in females is distinct from males with similar duplications, but the clinical effects may be as severe as seen in males. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Bijlsma, E.K.; Collins, A.; Papa, F.T.; Tejada, M.I.; Wheeler, P.; Peeters, E.A.J.; Gijsbers, A.C.J.; van de Kamp, J.M.; Kriek, M.; Losekoot, M.; Broekma, A.J.; Crolla, J.A.; Pollazzon, M.; Mucciolo, M.; Katzaki, E.; Disciglio, V.; Ferreri, M.I.; Marozza, A.; Mencarelli, M.A.; Castagnini, C.; Dosa, L.; Ariani, F.; Mari, F.; Canitano, R.; Hayek, G.; Botella, M.P.; Gener, B.; Mínguez, M.; Renieri, A.; Ruivenkamp, C.A.L.
2012-01-01
Duplications leading to functional disomy of chromosome Xq28, including MECP2 as the critical dosage-sensitive gene, are associated with a distinct clinical phenotype in males, characterized by severe mental retardation, infantile hypotonia, progressive neurologic impairment, recurrent infections, bladder dysfunction, and absent speech. Female patients with Xq duplications including MECP2 are rare. Only recently submicroscopic duplications of this region on Xq28 have been recognized in four females, and a triplication in a fifth, all in combination with random X-chromosome inactivation (XCI). Based on this small series, it was concluded that in females with MECP2 duplication and random XCI, the typical symptoms of affected boys are not present. We present clinical and molecular data on a series of five females with an Xq28 duplication including the MECP2 gene, both isolated and as the result of a translocation, and compare them with the previously reported cases of small duplications in females. The collected data indicate that the associated phenotype in females is distinct from males with similar duplications, but the clinical effects may be as severe as seen in males. PMID:22522176
Expansion by whole genome duplication and evolution of the sox gene family in teleost fish
Naville, Magali; Volff, Jean-Nicolas
2017-01-01
It is now recognized that several rounds of whole genome duplication (WGD) have occurred during the evolution of vertebrates, but the link between WGDs and phenotypic diversification remains unsolved. We have investigated in this study the impact of the teleost-specific WGD on the evolution of the sox gene family in teleostean fishes. The sox gene family, which encodes for transcription factors, has essential role in morphology, physiology and behavior of vertebrates and teleosts, the current largest group of vertebrates. We have first redrawn the evolution of all sox genes identified in eleven teleost genomes using a comparative genomic approach including phylogenetic and synteny analyses. We noticed, compared to tetrapods, an important expansion of the sox family: 58% (11/19) of sox genes are duplicated in teleost genomes. Furthermore, all duplicated sox genes, except sox17 paralogs, are derived from the teleost-specific WGD. Then, focusing on five sox genes, analyzing the evolution of coding and non-coding sequences, as well as the expression patterns in fish embryos and adult tissues, we demonstrated that these paralogs followed lineage-specific evolutionary trajectories in teleost genomes. This work, based on whole genome data from multiple teleostean species, supports the contribution of WGDs to the expansion of gene families, as well as to the emergence of genomic differences between lineages that might promote genetic and phenotypic diversity in teleosts. PMID:28738066
MLL duplication in a pediatric patient with B-cell lymphoblastic lymphoma.
Mater, David Van; Goodman, Barbara K; Wang, Endi; Gaca, Ana M; Wechsler, Daniel S
2012-04-01
Lymphoblastic lymphoma is the second most common type of non-Hodgkin lymphoma seen in children. Approximately, 90% of lymphoblastic lymphomas arise from T cells, with the remaining 10% being B-cell-lineage derived. Although T-cell lymphoblastic lymphoma most frequently occurs in the anterior mediastinum (thymus), B-cell lymphoblastic lymphoma (B-LBL) predominates in extranodal sites such as skin and bone. Here, we describe a pediatric B-LBL patient who presented with extensive abdominal involvement and whose lymphoma cells displayed segmental duplication of the mixed lineage leukemia (MLL) gene. MLL duplication/amplification has been described primarily in acute myeloid leukemia and myelodysplastic syndrome with no published reports of discrete MLL duplication/amplification events in B-LBL. The MLL gene duplication noted in this case may represent a novel mechanism for tumorigenesis in B-LBL.
Duplication 16p13.3 and the CREBBP gene: confirmation of the phenotype.
Demeer, Bénédicte; Andrieux, Joris; Receveur, Aline; Morin, Gilles; Petit, Florence; Julia, Sophie; Plessis, Ghislaine; Martin-Coignard, Dominique; Delobel, Bruno; Firth, Helen V; Thuresson, Ann C; Lanco Dosen, Sandrine; Sjörs, Kerstin; Le Caignec, Cedric; Devriendt, Koenraad; Mathieu-Dramard, Michèle
2013-01-01
The introduction of molecular karyotyping technologies into the diagnostic work-up of patients with congenital disorders permitted the identification and delineation of novel microdeletion and microduplication syndromes. Interstitial 16p13.3 duplication, encompassing the CREBBP gene, which is mutated or deleted in the Rubinstein-Taybi syndrome, have been proposed to cause a recognisable syndrome with variable intellectual disability, normal growth, mild facial dysmorphism, mild anomalies of the extremities, and occasional findings such as developmental defects of the heart, genitalia, palate or the eyes. We here report the phenotypic and genotypic delineation of 9 patients carrying a submicroscopic 16p13.3 duplication, including the smallest 16p13.3 duplication reported so far. Careful clinical assessment confirms the distinctive clinical phenotype and also defines frequent associated features : marked speech problems, frequent ocular region involvement with upslanting of the eyes, narrow palpebral fissures, ptosis and strabismus, frequent proximal implantation of thumbs, cleft palate/bifid uvula and inguinal hernia. It also confirms that CREBBP is the critical gene involved in the duplication 16p13.3 syndrome. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Brain evolution by brain pathway duplication
Chakraborty, Mukta; Jarvis, Erich D.
2015-01-01
Understanding the mechanisms of evolution of brain pathways for complex behaviours is still in its infancy. Making further advances requires a deeper understanding of brain homologies, novelties and analogies. It also requires an understanding of how adaptive genetic modifications lead to restructuring of the brain. Recent advances in genomic and molecular biology techniques applied to brain research have provided exciting insights into how complex behaviours are shaped by selection of novel brain pathways and functions of the nervous system. Here, we review and further develop some insights to a new hypothesis on one mechanism that may contribute to nervous system evolution, in particular by brain pathway duplication. Like gene duplication, we propose that whole brain pathways can duplicate and the duplicated pathway diverge to take on new functions. We suggest that one mechanism of brain pathway duplication could be through gene duplication, although other mechanisms are possible. We focus on brain pathways for vocal learning and spoken language in song-learning birds and humans as example systems. This view presents a new framework for future research in our understanding of brain evolution and novel behavioural traits. PMID:26554045
Faundes, Víctor; Santa María, Lorena; Morales, Paulina; Curotto, Bianca; Parraguez, María M
2016-10-01
Chromosome 7q11.23 duplication syndrome is a well-recognised syndrome which involves the duplication of the same genes located in the Williams-Beuren critical region. However, in 2010, 4 patients were reported with a microduplication only in the HIP1 and YWHAG genes. We refer to this as a distal 7q11.23 duplication (dup7q11.23D). Here, we report the fifth de novo patient with dup7q11.23D, whose symptoms may be explained by YWHAG overexpression as was demonstrated recently in mice and obese patients. Finally, further studies will be necessary to delineate this emerging microduplication syndrome.
SHOX duplications found in some cases with type I Mayer-Rokitansky-Kuster-Hauser syndrome.
Gervasini, Cristina; Grati, Francesca Romana; Lalatta, Faustina; Tabano, Silvia; Gentilin, Barbara; Colapietro, Patrizia; De Toffol, Simona; Frontino, Giada; Motta, Francesca; Maitz, Silvia; Bernardini, Laura; Dallapiccola, Bruno; Fedele, Luigi; Larizza, Lidia; Miozzo, Monica
2010-10-01
The Mayer-Rokitansky-Küster-Hauser syndrome is defined as congenital aplasia of müllerian ducts derived structures in females with a normal female chromosomal and gonadal sex. Most cases with Mayer-Rokitansky-Küster-Hauser syndrome are sporadic, although familial cases have been reported. The genetic basis of Mayer-Rokitansky-Küster-Hauser syndrome is largely unknown and seems heterogeneous, and a small number of cases were found to have mutations in the WNT4 gene. The aim of this study was to identify possible recurrent submicroscopic imbalances in a cohort of familial and sporadic cases with Mayer-Rokitansky-Küster-Hauser syndrome. Multiplex ligation-dependent probe amplification was used to screen the subtelomeric sequences of all chromosomes in 30 patients with Mayer-Rokitansky-Küster-Hauser syndrome (sporadic, n = 27 and familial, n = 3). Segregation analysis and pyrosequencing were applied to validate the MLPA results in the informative family. Partial duplication of the Xpter pseudoautosomal region 1 containing the short stature homeobox (SHOX) gene was detected in five patients with Mayer-Rokitansky-Küster-Hauser syndrome (familial, n = 3 and sporadic, n = 2) and not in 53 healthy controls. The duplications were not overlapping, and SHOX was never entirely duplicated. Haplotyping in the informative family revealed that SHOX gene duplication was inherited from the unaffected father and was absent in two healthy sisters. Partial duplication of SHOX gene is found in some cases with both familial and sporadic Mayer-Rokitansky-Küster-Hauser type I syndrome.
The house spider genome reveals an ancient whole-genome duplication during arachnid evolution.
Schwager, Evelyn E; Sharma, Prashant P; Clarke, Thomas; Leite, Daniel J; Wierschin, Torsten; Pechmann, Matthias; Akiyama-Oda, Yasuko; Esposito, Lauren; Bechsgaard, Jesper; Bilde, Trine; Buffry, Alexandra D; Chao, Hsu; Dinh, Huyen; Doddapaneni, HarshaVardhan; Dugan, Shannon; Eibner, Cornelius; Extavour, Cassandra G; Funch, Peter; Garb, Jessica; Gonzalez, Luis B; Gonzalez, Vanessa L; Griffiths-Jones, Sam; Han, Yi; Hayashi, Cheryl; Hilbrant, Maarten; Hughes, Daniel S T; Janssen, Ralf; Lee, Sandra L; Maeso, Ignacio; Murali, Shwetha C; Muzny, Donna M; Nunes da Fonseca, Rodrigo; Paese, Christian L B; Qu, Jiaxin; Ronshaugen, Matthew; Schomburg, Christoph; Schönauer, Anna; Stollewerk, Angelika; Torres-Oliva, Montserrat; Turetzek, Natascha; Vanthournout, Bram; Werren, John H; Wolff, Carsten; Worley, Kim C; Bucher, Gregor; Gibbs, Richard A; Coddington, Jonathan; Oda, Hiroki; Stanke, Mario; Ayoub, Nadia A; Prpic, Nikola-Michael; Flot, Jean-François; Posnien, Nico; Richards, Stephen; McGregor, Alistair P
2017-07-31
The duplication of genes can occur through various mechanisms and is thought to make a major contribution to the evolutionary diversification of organisms. There is increasing evidence for a large-scale duplication of genes in some chelicerate lineages including two rounds of whole genome duplication (WGD) in horseshoe crabs. To investigate this further, we sequenced and analyzed the genome of the common house spider Parasteatoda tepidariorum. We found pervasive duplication of both coding and non-coding genes in this spider, including two clusters of Hox genes. Analysis of synteny conservation across the P. tepidariorum genome suggests that there has been an ancient WGD in spiders. Comparison with the genomes of other chelicerates, including that of the newly sequenced bark scorpion Centruroides sculpturatus, suggests that this event occurred in the common ancestor of spiders and scorpions, and is probably independent of the WGDs in horseshoe crabs. Furthermore, characterization of the sequence and expression of the Hox paralogs in P. tepidariorum suggests that many have been subject to neo-functionalization and/or sub-functionalization since their duplication. Our results reveal that spiders and scorpions are likely the descendants of a polyploid ancestor that lived more than 450 MYA. Given the extensive morphological diversity and ecological adaptations found among these animals, rivaling those of vertebrates, our study of the ancient WGD event in Arachnopulmonata provides a new comparative platform to explore common and divergent evolutionary outcomes of polyploidization events across eukaryotes.
Ancient and Recent Duplications Support Functional Diversity of Daphnia Opsins.
Brandon, Christopher S; Greenwold, Matthew J; Dudycha, Jeffry L
2017-01-01
Daphnia pulex has the largest known family of opsins, genes critical for photoreception and vision in animals. This diversity may be functionally redundant, arising from recent processes, or ancient duplications may have been preserved due to distinct functions and independent contributions to fitness. We analyzed opsins in D. pulex and its distant congener Daphnia magna. We identified 48 opsins in the D. pulex genome and 32 in D. magna. We inferred the complement of opsins in the last common ancestor of all Daphnia and evaluated the history of opsin duplication and loss. We further analyzed sequence variation to assess possible functional diversification among Daphnia opsins. Much of the opsin expansion occurred before the D. pulex-D. magna split more than 145 Mya, and both Daphnia lineages preserved most ancient opsins. More recent expansion occurred in pteropsins and long-wavelength visual opsins in both species, particularly D. pulex. Recent duplications were not random: the same ancestral genes duplicated independently in each modern species. Most ancient and some recent duplications involved differentiation at residues known to influence spectral tuning of visual opsins. Arthropsins show evidence of gene conversion between tandemly arrayed paralogs in functionally important domains. Intron-exon gene structure was generally conserved within clades inferred from sequences, although pteropsins showed substantial intron size variation. Overall, our analyses support the hypotheses that diverse opsins are maintained due to diverse functional roles in photoreception and vision, that functional diversification is both ancient and recent, and that multiple evolutionary processes have influenced different types of opsins.
Le, Thong Minh; Le, Quy Van Chanh; Truong, Dung Minh; Lee, Hye-Jeong; Choi, Min-Kyeung; Cho, Hyesun; Chung, Hak-Jae; Kim, Jin-Hoi; Do, Jeong-Tae; Song, Hyuk; Park, Chankyu
2017-01-01
Several β2-microglobulin (B2M) -bound protein complexes undertake key roles in various immune system pathways, including the neonatal Fc receptor (FcRn), cluster of differentiation 1 (CD1) protein, non-classical major histocompatibility complex (MHC), and well-known MHC class I molecules. Therefore, the duplication of B2M may lead to an increase in the biological competence of organisms to the environment. Based on the pig genome assembly SSC10.2, a segmental duplication of ~45.5 kb, encoding the entire B2M protein, was identified in pig chromosome 1. Through experimental validation, we confirmed the functional duplication of the B2M gene with a completely identical coding sequence between two copies in pigs. Considering the importance of B2M in the immune system, we performed the phylogenetic analysis of B2M duplication in ten mammalian species, confirming the presence of B2M duplication in cetartioldactyls, like cattle, sheep, goats, pigs and whales, but non-cetartiodactyl species, like mice, cats, dogs, horses, and humans. The density of long interspersed nuclear element (LINE) at the edges of duplicated blocks (39 to 66%) was found to be 2 to 3-fold higher than the average (20.12%) of the pig genome, suggesting its role in the duplication event. The B2M mRNA expression level in pigs was 12.71 and 7.57 times (2-ΔΔCt values) higher than humans and mice, respectively. However, we were unable to experimentally demonstrate the difference in the level of B2M protein because species specific anti-B2M antibodies are not available. We reported, for the first time, the functional duplication of the B2M gene in animals. The identification of partially remaining duplicated B2M sequences in the genomes of only cetartiodactyls indicates that the event was lineage specific. B2M duplication could be beneficial to the immune system of pigs by increasing the availability of MHC class I light chain protein, B2M, to complex with the proteins encoded by the relatively large number of MHC class I heavy chain genes in pigs. Further studies are necessary to address the biological meaning of increased expression of B2M.
High level of microsynteny and purifying selection affect the evolution of WRKY family in Gramineae.
Jin, Jing; Kong, Jingjing; Qiu, Jianle; Zhu, Huasheng; Peng, Yuancheng; Jiang, Haiyang
2016-01-01
The WRKY gene family, which encodes proteins in the regulation processes of diverse developmental stages, is one of the largest families of transcription factors in higher plants. In this study, by searching for interspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found 35 chromosomal segments of subgroup I genes of WRKY family (WRKY I) in four Gramineae species (Brachypodium, rice, sorghum, and maize) formed eight orthologous groups. After a stepwise gene-by-gene reciprocal comparison of all the protein sequences in the WRKY I gene flanking areas, highly conserved regions of microsynteny were found in the four Gramineae species. Most gene pairs showed conserved orientation within syntenic genome regions. Furthermore, tandem duplication events played the leading role in gene expansion. Eventually, environmental selection pressure analysis indicated strong purifying selection for the WRKY I genes in Gramineae, which may have been followed by gene loss and rearrangement. The results presented in this study provide basic information of Gramineae WRKY I genes and form the foundation for future functional studies of these genes. High level of microsynteny in the four grass species provides further evidence that a large-scale genome duplication event predated speciation.
Adomako-Ankomah, Yaw; English, Elizabeth D; Danielson, Jeffrey J; Pernas, Lena F; Parker, Michelle L; Boulanger, Martin J; Dubey, Jitender P; Boyle, Jon P
2016-05-01
In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA(+) paralogs. Additionally, we found that exogenous expression of an HMA(+) paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. Copyright © 2016 by the Genetics Society of America.
Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen
2012-11-01
Recent studies demonstrated that sequence divergence in both transcriptional regulatory region and coding region contributes to the subfunctionalization of duplicate gene. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may arise from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 expresses during blastula and gastrula stages and in adult retina but silences from segmentation stage to hatching stage, vsx1A1 starts expression from segmentation onward. Comparing to that zebrafish vsx1 expresses in all the developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are under going to share the functions of ancestral vsx1. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 remains but the 3'-UTR of vsx1A2 has lost the capability of mediating bipolar cell specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes. © 2012 WILEY PERIODICALS, INC.
Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species
Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping
2018-01-01
The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice (Oryza sativa), maize (Zea mays), and Arabidopsis (Arabidopsis thaliana). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis, respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis, respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis. A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis, respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus, and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis. PMID:29690608
Andersen, Erica F; Baldwin, Erin E; Ellingwood, Sara; Smith, Rosemarie; Lamb, Allen N
2014-07-01
Duplications involving terminal Xq28 are a known cause of intellectual disability (ID) in males and in females with unfavorable X-inactivation patterns. Within Xq28, functional disomy of MECP2 causes a severe ID syndrome, however the dosage sensitivity of other Xq28 duplicated genes is less certain. Duplications involving the int22h-1/int22h-2 LCR-flanked region in distal Xq28 have recently been linked to a novel ID-associated phenotype. While evidence for the dosage sensitivity of this region is emerging, the phenotypic contribution of individual genes within the int22h-1/int22h-2-flanked region has yet to be determined. We report a familial case of a novel 774 kb Xq28-qter duplication, detected by cytogenomic microarray analysis, that partially overlaps the int22h-1/int22h-2-flanked region. This duplication and a 570 kb Xpter-p22.33 loss within the pseudoautosomal region were identified in three siblings, one female and two males, who presented with developmental delays/intellectual disability, mild dysmorphic features and short stature. Although unconfirmed, these results are suggestive of maternal inheritance of a recombinant X. We compare our clinical findings to patients with int22h-1/int22h-2-mediated duplications and discuss the potential pathogenicity of genes within the duplicated region, including those within the shared region of overlap, RAB39B and CLIC2. © 2014 Wiley Periodicals, Inc.
Yuan, Jian; He, Zhuzi; Yuan, Xiangnan; Jiang, Xiayun; Sun, Xiaowen; Zou, Shuming
2010-09-15
Recent studies on comparative genomics have suggested that a round of fish-specific whole genome duplication (3R) in ray-finned fishes might have occurred around 226-316 Mya. Additional genome duplication, specifically in cyprinids, may have occurred more recently after the divergence of the teleosts. The timing of this event, however, is unknown. To address this question, we sequenced four Hox genes from taxa representing the polyploid Cyprinidae fish, common carp (Cyprinus carpio, 2n=100), crucian carp (Carassius auratus auratus, 2n=100), and silver crucian carp (C. auratus gibelio, 2n=156), and then compared them with known sequences from the diploid Cyprinidae fish, blunt snout bream (Megalobrama amblycephala, 2n=48). Our results showed the presence of two distinct Hox duplicates in the genomes of common and crucian carp. Three distinct Hox sequences, one of them orthologous to a Hox gene in common carp and the other two orthologous to a Hox gene in crucian carp, were isolated in silver crucian carp, indicating a possible hybrid origin of silver crucian carp from crucian and common carp. The gene duplication resulting in the origin of the common ancestor of common and crucian carp likely occurred around 10.9-13.2 Mya. The speciations of common vs. crucian carp and silver crucian vs. crucian carp likely occurred around 8.1-11.4 and 2.3-3.0 Mya, respectively. Finally, nonfunctionalization resulting from point mutations in the coding region is a probable fate for some Hox duplicates. Taken together, these results suggested an evolutionary model for polyploidization in speciation and diversification of polyploid fish. (c) 2010 Wiley-Liss, Inc.
A graph-theoretic approach for inparalog detection.
Tremblay-Savard, Olivier; Swenson, Krister M
2012-01-01
Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such as orthology (genes related through a speciation event) or paralogy (genes related through a duplication event) are usually correlated with these features. For example, recent work has shown that in human and mouse there is a strong connection between function and inparalogs, the paralogs that were created since the speciation event separating the human and mouse lineages. Methods exist for detecting inparalogs that either use information from only two species, or consider a set of species but rely on clustering methods. In this paper we present a graph-theoretic approach for finding lower bounds on the number of inparalogs for a given set of species; we pose an edge covering problem on the similarity graph and give an efficient 2/3-approximation as well as a faster heuristic. Since the physical position of inparalogs corresponding to recent speciations is not likely to have changed since the duplication, we also use our predictions to estimate the types of duplications that have occurred in some vertebrates and drosophila.
Chen, Y; Solursh, M
1995-10-01
The Msx-1 gene (formerly known as Hox-7) is a member of a discrete subclass of homeobox-containing genes. Examination of the expression pattern of Msx-1 in murine and avian embryos suggests that this gene may be involved in the regionalization of the medio-lateral axis during earlier development. We have examined the possible functions of Xenopus Msx-1 during early Xenopus embryonic development by overexpression of the Msx-1 gene. Overexpression of Msx-1 causes a left-right mirror-image duplication of primary axial structures, including notochord, neural tube, somites, suckers, and foregut. The embryonic developing heart is also mirror-image duplicated, including looping directions and polarity. These results indicate that Msx-1 may be involved in the mesoderm formation as well as left-right patterning in the early Xenopus embryonic development.
Ascorbate peroxidase-related (APx-R) is not a duplicable gene.
Dunand, Christophe; Mathé, Catherine; Lazzarotto, Fernanda; Margis, Rogério; Margis-Pinheiro, Marcia
2011-12-01
Phylogenetic, genomic and functional analyses have allowed the identification of a new class of putative heme peroxidases, so called APx-R (APx-Related). These new class, mainly present in the green lineage (including green algae and land plants), can also be detected in other unicellular chloroplastic organisms. Except for recent polyploid organisms, only single-copy of APx-R gene was detected in each genome, suggesting that the majority of the APx-R extra-copies were lost after chromosomal or segmental duplications. In a similar way, most APx-R co-expressed genes in Arabidopsis genome do not have conserved extra-copies after chromosomal duplications and are predicted to be localized in organelles, as are the APx-R. The member of this gene network can be considered as unique gene, well conserved through the evolution due to a strong negative selection pressure and a low evolution rate. © 2011 Landes Bioscience
EqualTDRL: illustrating equivalent tandem duplication random loss rearrangements.
Hartmann, Tom; Bernt, Matthias; Middendorf, Martin
2018-05-30
To study the differences between two unichromosomal circular genomes, e.g., mitochondrial genomes, under the tandem duplication random loss (TDRL) rearrangement it is important to consider the whole set of potential TDRL rearrangement events that could have taken place. The reason is that for two given circular gene orders there can exist different TDRL rearrangements that transform one of the gene orders into the other. Hence, a TDRL event cannot always be reconstructed only from the knowledge of the circular gene order before a TDRL event and the circular gene order after it. We present the program EqualTDRL that computes and illustrates the complete set of TDRLs for pairs of circular gene orders that differ by only one TDRL. EqualTDRL considers the circularity of the given genomes and certain restrictions on the TDRL rearrangements. Examples for the latter are sequences of genes that have to be conserved during a TDRL or pairs of genes that frame intergenic regions which might represent remnants of duplicated genes. Additionally, EqualTDRL allows to determine the set of TDRLs that are minimum with respect to the number of duplicated genes. EqualTDRL supports scientists to study the complete set of TDRLs that possibly could have taken place in the evolution of mitochondrial genomes. EqualTDRL is implemented in C++ using the ggplot2 package of the open source programming language R and is freely available from http://pacosy.informatik.uni-leipzig.de/equaltdrl .
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lodes, M.J.; Merlin, G.; DeVos, T.
1995-12-01
This report investigates the duplication of two LD1 genes into the rRNA locus and the resultant transcription by RNA polymerase I, which has a faster transcription rate than that of RNA polymerase II. This was conducted using a 2.2-Mb chromosome in Leishmania donovani. 55 refs., 6 figs.
Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke
2016-01-01
Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. PMID:26560065
Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke
2016-01-01
Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Chen, Xue; Chen, Zhu; Zhao, Hualin; Zhao, Yang; Cheng, Beijiu; Xiang, Yan
2014-01-01
Homeodomain-leucine zipper (HD-Zip) proteins, a group of homeobox transcription factors, participate in various aspects of normal plant growth and developmental processes as well as environmental responses. To date, no overall analysis or expression profiling of the HD-Zip gene family in soybean (Glycine max) has been reported. An investigation of the soybean genome revealed 88 putative HD-Zip genes. These genes were classified into four subfamilies, I to IV, based on phylogenetic analysis. In each subfamily, the constituent parts of gene structure and motif were relatively conserved. A total of 87 out of 88 genes were distributed unequally on 20 chromosomes with 36 segmental duplication events, indicating that segmental duplication is important for the expansion of the HD-Zip family. Analysis of the Ka/Ks ratios showed that the duplicated genes of the HD-Zip family basically underwent purifying selection with restrictive functional divergence after the duplication events. Analysis of expression profiles showed that 80 genes differentially expressed across 14 tissues, and 59 HD-Zip genes are differentially expressed under salinity and drought stress, with 20 paralogous pairs showing nearly identical expression patterns and three paralogous pairs diversifying significantly under drought stress. Quantitative real-time RT-PCR (qRT-PCR) analysis of six paralogous pairs of 12 selected soybean HD-Zip genes under both drought and salinity stress confirmed their stress-inducible expression patterns. This study presents a thorough overview of the soybean HD-Zip gene family and provides a new perspective on the evolution of this gene family. The results indicate that HD-Zip family genes may be involved in many plant responses to stress conditions. Additionally, this study provides a solid foundation for uncovering the biological roles of HD-Zip genes in soybean growth and development.
Nuclear hormone receptors in parasitic helminths
Wu, Wenjie; LoVerde, Philip T
2010-01-01
Nuclear receptors (NRs) belong to a large protein superfamily that are important transcriptional modulators in metazoans. Parasitic helminths include parasitic worms from the Lophotrochozoa (Platyhelminths) and Ecdysozoa (Nematoda). NRs in parasitic helminths diverged into two different evolutionary lineages. NRs in parasitic Platyhelminths have orthologues in Deuterostomes, in arthropods or both with a feature of extensive gene loss and gene duplication within different gene groups. NRs in parasitic Nematoda follow the nematode evolutionary lineage with a feature of multiple duplication of SupNRs and gene loss. PMID:20600585
Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D.
2016-01-01
Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella. We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes—and that the butterfly proboscis is involved in digestive enzyme production. PMID:27553646
Bright, Lydia J; Gout, Jean-Francois; Lynch, Michael
2017-04-15
New gene functions arise within existing gene families as a result of gene duplication and subsequent diversification. To gain insight into the steps that led to the functional diversification of paralogues, we tracked duplicate retention patterns, expression-level divergence, and subcellular markers of functional diversification in the Rab GTPase gene family in three Paramecium aurelia species. After whole-genome duplication, Rab GTPase duplicates are more highly retained than other genes in the genome but appear to be diverging more rapidly in expression levels, consistent with early steps in functional diversification. However, by localizing specific Rab proteins in Paramecium cells, we found that paralogues from the two most recent whole-genome duplications had virtually identical localization patterns, and that less closely related paralogues showed evidence of both conservation and diversification. The functionally conserved paralogues appear to target to compartments associated with both endocytic and phagocytic recycling functions, confirming evolutionary and functional links between the two pathways in a divergent eukaryotic lineage. Because the functionally diversifying paralogues are still closely related to and derived from a clade of functionally conserved Rab11 genes, we were able to pinpoint three specific amino acid residues that may be driving the change in the localization and thus the function in these proteins. © 2017 Bright et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Kjærner-Semb, Erik; Ayllon, Fernando; Furmanek, Tomasz; Wennevik, Vidar; Dahle, Geir; Niemelä, Eero; Ozerov, Mikhail; Vähä, Juha-Pekka; Glover, Kevin A; Rubin, Carl J; Wargelius, Anna; Edvardsen, Rolf B
2016-08-11
Populations of Atlantic salmon display highly significant genetic differences with unresolved molecular basis. These differences may result from separate postglacial colonization patterns, diversifying natural selection and adaptation, or a combination. Adaptation could be influenced or even facilitated by the recent whole genome duplication in the salmonid lineage which resulted in a partly tetraploid species with duplicated genes and regions. In order to elucidate the genes and genomic regions underlying the genetic differences, we conducted a genome wide association study using whole genome resequencing data from eight populations from Northern and Southern Norway. From a total of ~4.5 million sequencing-derived SNPs, more than 10 % showed significant differentiation between populations from these two regions and ten selective sweeps on chromosomes 5, 10, 11, 13-15, 21, 24 and 25 were identified. These comprised 59 genes, of which 15 had one or more differentiated missense mutation. Our analysis showed that most sweeps have paralogous regions in the partially tetraploid genome, each lacking the high number of significant SNPs found in the sweeps. The most significant sweep was found on Chr 25 and carried several missense mutations in the antiviral mx genes, suggesting that these populations have experienced differing viral pressures. Interestingly the second most significant sweep, found on Chr 5, contains two genes involved in the NF-KB pathway (nkap and nkrf), which is also a known pathogen target that controls a large number of processes in animals. Our results show that natural selection acting on immune related genes has contributed to genetic divergence between salmon populations in Norway. The differences between populations may have been facilitated by the plasticity of the salmon genome. The observed signatures of selection in duplicated genomic regions suggest that the recently duplicated genome has provided raw material for evolutionary adaptation.
Johnston, Jennifer J; Walker, Robert L; Davis, Sean; Facio, Flavia; Turner, Joyce T; Bick, David P; Daentl, Donna L; Ellison, Jay W; Meltzer, Paul S; Biesecker, Leslie G
2007-01-01
Contiguous gene syndromes cause disorders via haploinsufficiency for adjacent genes. Some contiguous gene syndromes (CGS) have stereotypical breakpoints, but others have variable breakpoints. In CGS that have variable breakpoints, the extent of the deletions may be correlated with severity. The Greig cephalopolysyndactyly contiguous gene syndrome (GCPS‐CGS) is a multiple malformation syndrome caused by haploinsufficiency of GLI3 and adjacent genes. In addition, non‐CGS GCPS can be caused by deletions or duplications in GLI3. Although fluorescence in situ hybridisation (FISH) can identify large deletion mutations in patients with GCPS or GCPS‐CGS, it is not practical for identification of small intragenic deletions or insertions, and it is difficult to accurately characterise the extent of the large deletions using this technique. We have designed a custom comparative genomic hybridisation (CGH) array that allows identification of deletions and duplications at kilobase resolution in the vicinity of GLI3. The array averages one probe every 730 bp for a total of about 14 000 probes over 10 Mb. We have analysed 16 individuals with known or suspected deletions or duplications. In 15 of 16 individuals (14 deletions and 1 duplication), the array confirmed the prior results. In the remaining patient, the normal CGH array result was correct, and the prior assessment was a false positive quantitative polymerase chain reaction result. We conclude that high‐density CGH array analysis is more sensitive than FISH analysis for detecting deletions and provides clinically useful results on the extent of the deletion. We suggest that high‐density CGH array analysis should replace FISH analysis for assessment of deletions and duplications in patients with contiguous gene syndromes caused by variable deletions. PMID:17098889
Venkatachalam, Ananda B; Fontenot, Quenton; Farrara, Allyse; Wright, Jonathan M
2018-03-01
With the advent of high-throughput DNA sequencing technology, the genomic sequence of many disparate species has led to the relatively new discipline of genomics, the study of genome structure, function and evolution. Much work has been focused on the role of whole genome duplications (WGD) in the architecture of extant vertebrate genomes, particularly those of teleost fishes which underwent a WGD early in the teleost radiation >230 million years ago (mya). Our past work has focused on the fate of duplicated copies of a multigene family coding for the intracellular lipid-binding protein (iLBP) genes in the teleost fishes. To define the evolutionary processes that determined the fate of duplicated genes and generated the structure of extant fish genomes, however, requires comparative genomic analysis with a fish lineage that diverged before the teleost WGD, such as the spotted gar (Lepisosteus oculatus), an ancient, air-breathing, ray-finned fish. Here, we describe the genomic organization, chromosomal location and tissue-specific expression of a subfamily of the iLBP genes that code for fatty acid-binding proteins (Fabps) in spotted gar. Based on this work, we have defined the minimum suite of fabp genes prior to their duplication in the teleost lineages ~230-400 mya. Spotted gar, therefore, serves as an appropriate outgroup, or ancestral/ancient fish, that did not undergo the teleost-specific WGD. As such, analyses of the spatio-temporal regulation of spotted gar genes provides a foundation to determine whether the duplicated fabp genes have been retained in teleost genomes owing to either sub- or neofunctionalization. Copyright © 2017 Elsevier Inc. All rights reserved.
The Evolutionary History of Sarco(endo)plasmic Calcium ATPase (SERCA)
Altshuler, Ianina; Vaillant, James J.; Xu, Sen; Cristescu, Melania E.
2012-01-01
Investigating the phylogenetic relationships within physiologically essential gene families across a broad range of taxa can reveal the key gene duplication events underlying their family expansion and is thus important to functional genomics studies. P-Type II ATPases represent a large family of ATP powered transporters that move ions across cellular membranes and includes Na+/K+ transporters, H+/K+ transporters, and plasma membrane Ca2+ pumps. Here, we examine the evolutionary history of one such transporter, the Sarco(endo)plasmic reticulum calcium ATPase (SERCA), which maintains calcium homeostasis in the cell by actively pumping Ca2+ into the sarco(endo)plasmic reticulum. Our protein-based phylogenetic analyses across Eukaryotes revealed two monophyletic clades of SERCA proteins, one containing animals, fungi, and plants, and the other consisting of plants and protists. Our analyses suggest that the three known SERCA proteins in vertebrates arose through two major gene duplication events after the divergence from tunicates, but before the separation of fishes and tetrapods. In plants, we recovered two SERCA clades, one being the sister group to Metazoa and the other to Apicomplexa clade, suggesting an ancient duplication in an early eukaryotic ancestor, followed by subsequent loss of one copy in Opisthokonta, the other in protists, and retention of both in plants. We also report relatively recent and independent gene duplication events within invertebrate taxa including tunicates and the leech Helobdella robusta. Thus, it appears that both ancient and recent gene duplication events have played an important role in the evolution of this ubiquitous gene family across the eukaryotic domain. PMID:23285113
The evolutionary history of sarco(endo)plasmic calcium ATPase (SERCA).
Altshuler, Ianina; Vaillant, James J; Xu, Sen; Cristescu, Melania E
2012-01-01
Investigating the phylogenetic relationships within physiologically essential gene families across a broad range of taxa can reveal the key gene duplication events underlying their family expansion and is thus important to functional genomics studies. P-Type II ATPases represent a large family of ATP powered transporters that move ions across cellular membranes and includes Na(+)/K(+) transporters, H(+)/K(+) transporters, and plasma membrane Ca(2+) pumps. Here, we examine the evolutionary history of one such transporter, the Sarco(endo)plasmic reticulum calcium ATPase (SERCA), which maintains calcium homeostasis in the cell by actively pumping Ca(2+) into the sarco(endo)plasmic reticulum. Our protein-based phylogenetic analyses across Eukaryotes revealed two monophyletic clades of SERCA proteins, one containing animals, fungi, and plants, and the other consisting of plants and protists. Our analyses suggest that the three known SERCA proteins in vertebrates arose through two major gene duplication events after the divergence from tunicates, but before the separation of fishes and tetrapods. In plants, we recovered two SERCA clades, one being the sister group to Metazoa and the other to Apicomplexa clade, suggesting an ancient duplication in an early eukaryotic ancestor, followed by subsequent loss of one copy in Opisthokonta, the other in protists, and retention of both in plants. We also report relatively recent and independent gene duplication events within invertebrate taxa including tunicates and the leech Helobdella robusta. Thus, it appears that both ancient and recent gene duplication events have played an important role in the evolution of this ubiquitous gene family across the eukaryotic domain.
Evolution of developmental regulation in the vertebrate FgfD subfamily.
Jovelin, Richard; Yan, Yi-Lin; He, Xinjun; Catchen, Julian; Amores, Angel; Canestro, Cristian; Yokoi, Hayato; Postlethwait, John H
2010-01-15
Fibroblast growth factors (Fgfs) encode small signaling proteins that help regulate embryo patterning. Fgfs fall into seven families, including FgfD. Nonvertebrate chordates have a single FgfD gene; mammals have three (Fgf8, Fgf17, and Fgf18); and teleosts have six (fgf8a, fgf8b, fgf17, fgf18a, fgf18b, and fgf24). What are the evolutionary processes that led to the structural duplication and functional diversification of FgfD genes during vertebrate phylogeny? To study this question, we investigated conserved syntenies, patterns of gene expression, and the distribution of conserved noncoding elements (CNEs) in FgfD genes of stickleback and zebrafish, and compared them with data from cephalochordates, urochordates, and mammals. Genomic analysis suggests that Fgf8, Fgf17, Fgf18, and Fgf24 arose in two rounds of whole genome duplication at the base of the vertebrate radiation; that fgf8 and fgf18 duplications occurred at the base of the teleost radiation; and that Fgf24 is an ohnolog that was lost in the mammalian lineage. Expression analysis suggests that ancestral subfunctions partitioned between gene duplicates and points to the evolution of novel expression domains. Analysis of CNEs, at least some of which are candidate regulatory elements, suggests that ancestral CNEs partitioned between gene duplicates. These results help explain the evolutionary pathways by which the developmentally important family of FgfD molecules arose and the deduced principles that guided FgfD evolution are likely applicable to the evolution of developmental regulation in many vertebrate multigene families. (c) 2009 Wiley-Liss, Inc.
Uchiumi, Fumiaki; Watanabe, Takeshi; Tanuma, Sei-ichi
2010-05-15
DNA helicases are important in the regulation of DNA transaction and thereby various cellular functions. In this study, we developed a cost-effective multiple DNA transfection assay with DEAE-dextran reagent and analyzed the promoter activities of the human DNA helicases. The 5'-flanking regions of the human DNA helicase-encoding genes were isolated and subcloned into luciferase (Luc) expression plasmids. They were coated onto 96-well plate and used for co-transfection with a renilla-Luc expression vector into various cells, and dual-Luc assays were performed. The profiles of promoter activities were dependent on cell lines used. Among these human DNA helicase genes, XPB, RecQL5, and RTEL promoters were activated during TPA-induced HL-60 cell differentiation. Interestingly, duplicated ets (GGAA) elements are commonly located around the transcription start sites of these genes. The duplicated GGAA motifs are also found in the promoters of DNA replication/repair synthesis factor genes including PARG, ATR, TERC, and Rb1. Mutation analyses suggested that the duplicated GGAA-motifs are necessary for the basal promoter activity in various cells and some of them positively respond to TPA in HL-60 cells. TPA-induced response of 44-bp in the RTEL promoter was attenuated by co-transfection of the PU.1 expression vector. These findings suggest that the duplicated ets motifs regulate DNA-repair associated gene expressions during macrophage-like differentiation of HL-60 cells. Copyright 2010 Elsevier Inc. All rights reserved.
Meyer, Thomas; Pankuweit, Sabine; Richter, Anette; Maisch, Bernhard; Ruppert, Volker
2013-09-15
Hypertrophic cardiomyopathy (HCM) is a cardiovascular disease with autosomal dominant inheritance caused by mutations in genes coding for sarcomeric and/or regulatory proteins expressed in cardiomyocytes. In a small cohort of HCM patients (n=8), we searched for mutations in the two most common genes responsible for HCM and found four missense mutations in the MYH7 gene encoding cardiac β-myosin heavy chain (R204H, M493V, R719W, and R870H) and three mutations in the myosin-binding protein C3 gene (MYBPC3) including one missense (A848V) and two frameshift mutations (c.3713delTG and c.702ins26bp). The c.702ins26bp insertion resulted from the duplication of a 26-bp fragment in a 54-year-old female HCM patient presenting with clinical signs of heart failure due to diastolic dysfunction. Although such large duplications (>10 bp) in the MYBPC3 gene are very rare and have been identified only in 4 families reported so far, the identical duplication mutation was found earlier in a Dutch patient, demonstrating that it may constitute a hitherto unknown founder mutation in central European populations. This observation underscores the significance of insertions into the coding sequence of the MYBPC3 gene for the development and pathogenesis of HCM. © 2013 Elsevier B.V. All rights reserved.
Itokawa, K; Komagata, O; Kasai, S; Kawada, H; Mwatele, C; Dida, G O; Njenga, S M; Mwandawiro, C; Tomita, T
2013-09-01
Insecticide resistance develops as a genetic factor (allele) conferring lower susceptibility to insecticides proliferates within a target insect population under strong positive selection. Intriguingly, a resistance allele pre-existing in a population often bears a series of further adaptive allelic variants through new mutations. This phenomenon occasionally results in replacement of the predominating resistance allele by fitter new derivatives, and consequently, development of greater resistance at the population level. The overexpression of the cytochrome P450 gene CYP9M10 is associated with pyrethroid resistance in the southern house mosquito Culex quinquefasciatus. Previously, we have found two genealogically related overexpressing CYP9M10 haplotypes, which differ in gene copy number (duplicated and non-duplicated). The duplicated haplotype was derived from the non-duplicated overproducer probably recently. In the present study, we investigated allelic series of CYP9M10 involved in three C. quinquefasciatus laboratory colonies recently collected from three different localities. Duplicated and non-duplicated overproducing haplotypes coexisted in African and Asian colonies indicating a global distribution of both haplotype lineages. The duplicated haplotypes both in the Asian and African colonies were associated with higher expression levels and stronger resistance than non-duplicated overproducing haplotypes. There were slight variation in expression level among the non-duplicated overproducing haplotypes. The nucleotide sequences in coding and upstream regions among members of this group also showed a little diversity. Non-duplicated overproducing haplotypes with relatively higher expression were genealogically closer to the duplicated haplotypes than the other non-duplicated overproducing haplotypes, suggesting multiple cis-acting mutations before duplication.
Yuan, Haiming; Meng, Zhe; Zhang, Lina; Luo, Xiangyang; Liu, Liping; Chen, Mengfan; Li, Xinwei; Zhao, Weiwei; Liang, Liyang
2016-01-01
Interstitial duplications distal to 15q13 are very rare. Here, we reported a 14-year-old boy with severe short stature, delayed bone age, hypogonadism, global developmental delay and intellectual disability. His had distinctive facial features including macrocephaly, broad forehead, deep-set and widely spaced eyes, broad nose bridge, shallow philtrum and thick lips. A de novo 6.4 Mb interstitial duplication of 15q15.3q21.2 was detected by chromosomal microarray analysis. We compared our patient's clinical phenotypes with those of several individuals with overlapping duplications and several candidate genes responsible for the phenotypes were identified as well. The results suggest a novel contiguous gene duplication syndrome characterized with shared features including short stature, hypogonadism, global developmental delay and other congenital anomalies.
Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.; Eberhard, Jessica R.; Miyaki, Cristina Y.; Sanchez, Juan J.; Hernandez, Alexis; Müeller, Heinrich; Graves, Gary R.; Fleischer, Robert C.; Wright, Timothy F.
2012-01-01
Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified with increasing frequency, particularly in birds (Class Aves). In this study, we investigate the evolutionary history of mitochondrial control region states within the avian order Psittaciformes (parrots and cockatoos). To this aim, we reconstructed a comprehensive multi-locus phylogeny of parrots, used PCR of three diagnostic fragments to classify the mitochondrial control region state as single or duplicated, and mapped these states onto the phylogeny. We further sequenced 44 selected species to validate these inferences of control region state. Ancestral state reconstruction using a range of weighting schemes identified six independent origins of mitochondrial control region duplications within Psittaciformes. Analysis of sequence data showed that varying levels of mitochondrial gene and tRNA homology and degradation were present within a given clade exhibiting duplications. Levels of divergence between control regions within an individual varied from 0–10.9% with the differences occurring mainly between 51 and 225 nucleotides 3′ of the goose hairpin in domain I. Further investigations into the fates of duplicated mitochondrial genes, the potential costs and benefits of having a second control region, and the complex relationship between evolutionary rates, selection, and time since duplication are needed to fully explain these patterns in the mitochondrial genome. PMID:22543055
Baker, Katie; Bayer, Micha; Cook, Nicola; Dreißig, Steven; Dhillon, Taniya; Russell, Joanne; Hedley, Pete E; Morris, Jenny; Ramsay, Luke; Colas, Isabelle; Waugh, Robbie; Steffenson, Brian; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J
2014-01-01
The low-recombining pericentromeric region of the barley genome contains roughly a quarter of the genes of the species, embedded in low-recombining DNA that is rich in repeats and repressive chromatin signatures. We have investigated the effects of pericentromeric region residency upon the expression, diversity and evolution of these genes. We observe no significant difference in average transcript level or developmental RNA specificity between the barley pericentromeric region and the rest of the genome. In contrast, all of the evolutionary parameters studied here show evidence of compromised gene evolution in this region. First, genes within the pericentromeric region of wild barley show reduced diversity and significantly weakened purifying selection compared with the rest of the genome. Second, gene duplicates (ohnolog pairs) derived from the cereal whole-genome duplication event ca. 60MYa have been completely eliminated from the barley pericentromeric region. Third, local gene duplication in the pericentromeric region is reduced by 29% relative to the rest of the genome. Thus, the pericentromeric region of barley is a permissive environment for gene expression but has restricted gene evolution in a sizeable fraction of barley's genes. PMID:24947331
Exonic duplication CNV of NDRG1 associated with autosomal-recessive HMSN-Lom/CMT4D.
Okamoto, Yuji; Goksungur, Meryem Tuba; Pehlivan, Davut; Beck, Christine R; Gonzaga-Jauregui, Claudia; Muzny, Donna M; Atik, Mehmed M; Carvalho, Claudia M B; Matur, Zeliha; Bayraktar, Serife; Boone, Philip M; Akyuz, Kaya; Gibbs, Richard A; Battaloglu, Esra; Parman, Yesim; Lupski, James R
2014-05-01
Copy-number variations as a mutational mechanism contribute significantly to human disease. Approximately one-half of the patients with Charcot-Marie-Tooth (CMT) disease have a 1.4 Mb duplication copy-number variation as the cause of their neuropathy. However, non-CMT1A neuropathy patients rarely have causative copy-number variations, and to date, autosomal-recessive disease has not been associated with copy-number variation as a mutational mechanism. We performed Agilent 8 × 60 K array comparative genomic hybridization on DNA from 12 recessive Turkish families with CMT disease. Additional molecular studies were conducted to detect breakpoint junctions and to evaluate gene expression levels in a family in which we detected an intragenic duplication copy-number variation. We detected an ~6.25 kb homozygous intragenic duplication in NDRG1, a gene known to be causative for recessive HMSNL/CMT4D, in three individuals from a Turkish family with CMT neuropathy. Further studies showed that this intragenic copy-number variation resulted in a homozygous duplication of exons 6-8 that caused decreased mRNA expression of NDRG1. Exon-focused high-resolution array comparative genomic hybridization enables the detection of copy-number variation carrier states in recessive genes, particularly small copy-number variations encompassing or disrupting single genes. In families for whom a molecular diagnosis has not been elucidated by conventional clinical assays, an assessment for copy-number variations in known CMT genes might be considered.
Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.
Kordi, Misagh; Bansal, Mukul S
2017-06-01
Duplication-Transfer-Loss (DTL) reconciliation is a powerful method for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation seeks to reconcile gene trees with species trees by postulating speciation, duplication, transfer, and loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. In practice, however, gene trees are often non-binary due to uncertainty in the gene tree topologies, and DTL reconciliation with non-binary gene trees is known to be NP-hard. In this paper, we present the first exact algorithms for DTL reconciliation with non-binary gene trees. Specifically, we (i) show that the DTL reconciliation problem for non-binary gene trees is fixed-parameter tractable in the maximum degree of the gene tree, (ii) present an exponential-time, but in-practice efficient, algorithm to track and enumerate all optimal binary resolutions of a non-binary input gene tree, and (iii) apply our algorithms to a large empirical data set of over 4700 gene trees from 100 species to study the impact of gene tree uncertainty on DTL-reconciliation and to demonstrate the applicability and utility of our algorithms. The new techniques and algorithms introduced in this paper will help biologists avoid incorrect evolutionary inferences caused by gene tree uncertainty.
Liu, Xiang; Li, Shangqi; Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A; Xu, Peng
2016-01-01
The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp.
Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A.
2016-01-01
The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp. PMID:27058731
The Sorghum bicolor genome and the diversification of grasses
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paterson, Andrew H.; Bowers, John E.; Bruggmann, Remy
2008-08-20
Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approx730-megabase Sorghum bicolor (L.) Moench genome, placing approx98percent of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approx75percent larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidizationmore » approx70 million years ago, most duplicated gene sets lost one member before the sorghum rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24percent of genes are grass-specific and 7percent are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.« less
The Sorghum bicolor genome and the diversification of grasses.
Paterson, Andrew H; Bowers, John E; Bruggmann, Rémy; Dubchak, Inna; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hellsten, Uffe; Mitros, Therese; Poliakov, Alexander; Schmutz, Jeremy; Spannagl, Manuel; Tang, Haibao; Wang, Xiyin; Wicker, Thomas; Bharti, Arvind K; Chapman, Jarrod; Feltus, F Alex; Gowik, Udo; Grigoriev, Igor V; Lyons, Eric; Maher, Christopher A; Martis, Mihaela; Narechania, Apurva; Otillar, Robert P; Penning, Bryan W; Salamov, Asaf A; Wang, Yu; Zhang, Lifang; Carpita, Nicholas C; Freeling, Michael; Gingle, Alan R; Hash, C Thomas; Keller, Beat; Klein, Patricia; Kresovich, Stephen; McCann, Maureen C; Ming, Ray; Peterson, Daniel G; Mehboob-ur-Rahman; Ware, Doreen; Westhoff, Peter; Mayer, Klaus F X; Messing, Joachim; Rokhsar, Daniel S
2009-01-29
Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.
Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C.; Zhang, Baohong
2016-01-01
TCP proteins are plant-specific transcription factors implicated to perform a variety of physiological functions during plant growth and development. In the current study, we performed for the first time the comprehensive analysis of TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributing across all of the thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development. PMID:26857372
Orthologs, paralogs and genome comparisons
NASA Technical Reports Server (NTRS)
Gogarten, J. P.; Olendzenski, L.
1999-01-01
During the past decade, ancient gene duplications were recognized as one of the main forces in the generation of diverse gene families and the creation of new functional capabilities. New tools developed to search data banks for homologous sequences, and an increased availability of reliable three-dimensional structural information led to the recognition that proteins with diverse functions can belong to the same superfamily. Analyses of the evolution of these superfamilies promises to provide insights into early evolution but are complicated by several important evolutionary processes. Horizontal transfer of genes can lead to a vertical spread of innovations among organisms, therefore finding a certain property in some descendants of an ancestor does not guarantee that it was present in that ancestor. Complete or partial gene conversion between duplicated genes can yield phylogenetic trees with several, apparently independent gene duplications, suggesting an often surprising parallelism in the evolution of independent lineages. Additionally, the breakup of domains within a protein and the fusion of domains into multifunctional proteins makes the delineation of superfamilies a task that remains difficult to automate.
Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C; Zhang, Baohong
2016-02-09
TCP proteins are plant-specific transcription factors implicated to perform a variety of physiological functions during plant growth and development. In the current study, we performed for the first time the comprehensive analysis of TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributing across all of the thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development.
The pineapple genome and the evolution of CAM photosynthesis.
Ming, Ray; VanBuren, Robert; Wai, Ching Man; Tang, Haibao; Schatz, Michael C; Bowers, John E; Lyons, Eric; Wang, Ming-Li; Chen, Jung; Biggers, Eric; Zhang, Jisen; Huang, Lixian; Zhang, Lingmao; Miao, Wenjing; Zhang, Jian; Ye, Zhangyao; Miao, Chenyong; Lin, Zhicong; Wang, Hao; Zhou, Hongye; Yim, Won C; Priest, Henry D; Zheng, Chunfang; Woodhouse, Margaret; Edger, Patrick P; Guyot, Romain; Guo, Hao-Bo; Guo, Hong; Zheng, Guangyong; Singh, Ratnesh; Sharma, Anupma; Min, Xiangjia; Zheng, Yun; Lee, Hayan; Gurtowski, James; Sedlazeck, Fritz J; Harkess, Alex; McKain, Michael R; Liao, Zhenyang; Fang, Jingping; Liu, Juan; Zhang, Xiaodan; Zhang, Qing; Hu, Weichang; Qin, Yuan; Wang, Kai; Chen, Li-Yu; Shirley, Neil; Lin, Yann-Rong; Liu, Li-Yu; Hernandez, Alvaro G; Wright, Chris L; Bulone, Vincent; Tuskan, Gerald A; Heath, Katy; Zee, Francis; Moore, Paul H; Sunkar, Ramanjulu; Leebens-Mack, James H; Mockler, Todd; Bennetzen, Jeffrey L; Freeling, Michael; Sankoff, David; Paterson, Andrew H; Zhu, Xinguang; Yang, Xiaohan; Smith, J Andrew C; Cushman, John C; Paull, Robert E; Yu, Qingyi
2015-12-01
Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues. CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.
Yegorov, Sergey; Bogerd, Jan; Good, Sara V
2014-12-01
Relaxin family peptide receptors (Rxfps) and their ligands, relaxin (Rln) and insulin-like (Insl) peptides, are broadly implicated in the regulation of reproductive and neuroendocrine processes in mammals. Most placental mammals harbour genes for four receptors, namely rxfp1, rxfp2, rxfp3 and rxfp4. The number and identity of rxfps in other vertebrates are immensely variable, which is probably attributable to intraspecific variation in reproductive and neuroendocrine regulation. Here, we highlight several interesting, but greatly overlooked, aspects of the rln/insl-rxfp evolutionary history: the ancient origin, recruitment of novel receptors, diverse roles of selection, differential retention and lineage-specific loss of genes over evolutionary time. The tremendous diversity of rln/insl and rxfp genes appears to have arisen from two divergent receptors and one ligand that were duplicated by whole genome duplications (WGD) in early vertebrate evolution, although several genes, notably relaxin in mammals, were also duplicated via small scale duplications. Duplication and loss of genes have varied across lineages: teleosts retained more WGD-derived genes, dominated by those thought to be involved in neuroendocrine regulation (rln3, insl5 and rxfp 3/4 genes), while eutherian mammals witnessed the diversification and rapid evolution of genes involved in reproduction (rln/insl3). Several genes that arose early in evolutionary history were lost in most mammals, but retained in teleosts and, to a lesser extent, in early diverging tetrapods. To elaborate on their evolutionary history, we provide updated phylogenies of the Rxfp1/2 and Rxfp3/4 receptors and their ligands, including new sequences from early diverging vertebrate taxa such as coelacanth, skate, spotted gar, and lamprey. We also summarize the recent progress made towards understanding the functional biology of Rxfps in non-mammalian taxa, providing a new conceptual framework for research on Rxfp signaling across vertebrates. Copyright © 2014 Elsevier Inc. All rights reserved.
Bekpen, Cemalettin; Künzel, Sven; Xie, Chen; Eaaswarkhanth, Muthukrishnan; Lin, Yen-Lung; Gokcumen, Omer; Akdis, Cezmi A; Tautz, Diethard
2017-03-06
Segmental duplications are an abundant source for novel gene functions and evolutionary adaptations. This mechanism of generating novelty was very active during the evolution of primates particularly in the human lineage. Here, we characterize the evolution and function of the SPATA31 gene family (former designation FAM75A), which was previously shown to be among the gene families with the strongest signal of positive selection in hominoids. The mouse homologue for this gene family is a single copy gene expressed during spermatogenesis. We show that in primates, the SPATA31 gene duplicated into SPATA31A and SPATA31C types and broadened the expression into many tissues. Each type became further segmentally duplicated in the line towards humans with the largest number of full-length copies found for SPATA31A in humans. Copy number estimates of SPATA31A based on digital PCR show an average of 7.5 with a range of 5-11 copies per diploid genome among human individuals. The primate SPATA31 genes also acquired new protein domains that suggest an involvement in UV response and DNA repair. We generated antibodies and show that the protein is re-localized from the nucleolus to the whole nucleus upon UV-irradiation suggesting a UV damage response. We used CRISPR/Cas mediated mutagenesis to knockout copies of the gene in human primary fibroblast cells. We find that cell lines with reduced functional copies as well as naturally occurring low copy number HFF cells show enhanced sensitivity towards UV-irradiation. The acquisition of new SPATA31 protein functions and its broadening of expression may be related to the evolution of the diurnal life style in primates that required a higher UV tolerance. The increased segmental duplications in hominoids as well as its fast evolution suggest the acquisition of further specific functions particularly in humans.
2012-01-01
Background Gene duplications play an important role in the evolution of functional protein diversity. Some models of duplicate gene evolution predict complex forms of paralog divergence; orthologous proteins may diverge as well, further complicating patterns of divergence among and within gene families. Consequently, studying the link between protein sequence evolution and duplication requires the use of flexible substitution models that can accommodate multiple shifts in selection across a phylogeny. Here, we employed a variety of codon substitution models, primarily Clade models, to explore how selective constraint evolved following the duplication of a green-sensitive (RH2a) visual pigment protein (opsin) in African cichlids. Past studies have linked opsin divergence to ecological and sexual divergence within the African cichlid adaptive radiation. Furthermore, biochemical and regulatory differences between the RH2aα and RH2aβ paralogs have been documented. It thus seems likely that selection varies in complex ways throughout this gene family. Results Clade model analysis of African cichlid RH2a opsins revealed a large increase in the nonsynonymous-to-synonymous substitution rate ratio (ω) following the duplication, as well as an even larger increase, one consistent with positive selection, for Lake Tanganyikan cichlid RH2aβ opsins. Analysis using the popular Branch-site models, by contrast, revealed no such alteration of constraint. Several amino acid sites known to influence spectral and non-spectral aspects of opsin biochemistry were found to be evolving divergently, suggesting that orthologous RH2a opsins may vary in terms of spectral sensitivity and response kinetics. Divergence appears to be occurring despite intronic gene conversion among the tandemly-arranged duplicates. Conclusions Our findings indicate that variation in selective constraint is associated with both gene duplication and divergence among orthologs in African cichlid RH2a opsins. At least some of this variation may reflect an adaptive response to differences in light environment. Interestingly, these patterns only became apparent through the use of Clade models, not through the use of the more widely employed Branch-site models; we suggest that this difference stems from the increased flexibility associated with Clade models. Our results thus bear both on studies of cichlid visual system evolution and on studies of gene family evolution in general. PMID:23078361
Barker, Elizabeth I; Ashton, Neil W
2016-03-01
The Physcomitrella pseudochromosomal genome assembly revealed previously invisible synteny enabling realisation of the full potential of shared synteny as a tool for probing evolution of this plant's MADS-box gene family. Assembly of the sequenced genome of Physcomitrella patens into 27 mega-scaffolds (pseudochromosomes) has confirmed the major predictions of our earlier model of expansion of the MADS-box gene family in the Physcomitrella lineage. Additionally, microsynteny has been conserved in the immediate vicinity of some recent duplicates of MADS-box genes. However, comparison of non-syntenic MIKC MADS-box genes and neighbouring genes indicates that chromosomal rearrangements and/or sequence degeneration have destroyed shared synteny over longer distances (macrosynteny) around MADS-box genes despite subsets comprising two or three MIKC genes having remained syntenic. In contrast, half of the type I MADS-box genes have been transposed creating new syntenic relations with MIKC genes. This implies that conservation of ancient ancestral synteny of MIKC genes and of more recently acquired synteny of type I and MIKC genes may be selectively advantageous. Our revised model predicts the birth rate of MIKC genes in Physcomitrella is higher than that of type I genes. However, this difference is attributable to an early tandem duplication and an early segmental duplication of MIKC genes prior to the two polyploidisations that account for most of the expansion of the MADS-box gene family in Physcomitrella. Furthermore, this early segmental duplication spawned two chromosomal lineages: one with a MIKC (C) gene, belonging to the PPM2 clade, in close proximity to one or a pair of MIKC* genes and another with a MIKC (C) gene, belonging to the PpMADS-S clade, characterised by greater separation from syntenic MIKC* genes. Our model has evolutionary implications for the Physcomitrella karyotype.
Gene Duplication and Evolutionary Innovations in Hemoglobin-Oxygen Transport
2016-01-01
During vertebrate evolution, duplicated hemoglobin (Hb) genes diverged with respect to functional properties as well as the developmental timing of expression. For example, the subfamilies of genes that encode the different subunit chains of Hb are ontogenetically regulated such that functionally distinct Hb isoforms are expressed during different developmental stages. In some vertebrate taxa, functional differentiation between co-expressed Hb isoforms may also contribute to physiologically important divisions of labor. PMID:27053736
Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D
2016-09-02
Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals.
Popova, Olga V; Mikhailov, Kirill V; Nikitin, Mikhail A; Logacheva, Maria D; Penin, Aleksey A; Muntyan, Maria S; Kedrova, Olga S; Petrov, Nikolai B; Panchin, Yuri V; Aleoshin, Vladimir V
2016-01-01
Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha-an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia.
Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals
Popova, Olga V.; Mikhailov, Kirill V.; Nikitin, Mikhail A.; Logacheva, Maria D.; Penin, Aleksey A.; Muntyan, Maria S.; Kedrova, Olga S.; Petrov, Nikolai B.; Panchin, Yuri V.
2016-01-01
Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha—an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia. PMID:27755612
Linhares, Natália D; Valadares, Eugênia R; da Costa, Silvia S; Arantes, Rodrigo R; de Oliveira, Luiz Roberto; Rosenberg, Carla; Vianna-Morgante, Angela M; Svartman, Marta
2016-09-01
We report on a 16-year-old boy with a maternally inherited ~ 18.3 Mb Xq13.2-q21.31 duplication delimited by aCGH. As previously described in patients with similar duplications, his clinical features included intellectual disability, developmental delay, speech delay, generalized hypotonia, infantile feeding difficulties, self-injurious behavior, short stature and endocrine problems. As additional findings, he presented recurrent seizures and pubertal gynecomastia. His mother was phenotypically normal and had completely skewed inactivation of the duplicated X chromosome, as most female carriers of such duplications. Five previously reported patients with partial Xq duplications presented duplication breakpoints similar to those of our patient. One of them, a fetus with multiple congenital abnormalities, had the same cytogenetic duplication breakpoint. Three of the reported patients shared many features with our proband but the other had some clinical features of the Prader-Willi syndrome. It was suggested that ATRX overexpression could be involved in the major clinical features of patients with partial Xq duplications. We propose that this gene could also be involved with the obesity of the patient with the Prader-Willi-like phenotype. Additionally, we suggest that the PCDH11X gene could be a candidate for our patient's recurrent seizures. In males, the Xq13-q21 duplication should be considered in the differential diagnosis of Prader-Willi syndrome, as previously suggested, and neuromuscular diseases, particularly mitochondriopathies.
Birth of a new gene on the Y chromosome of Drosophila melanogaster
Carvalho, Antonio Bernardo; Vicoso, Beatriz; Russo, Claudia A. M.; Swenor, Bonnielin; Clark, Andrew G.
2015-01-01
Contrary to the pattern seen in mammalian sex chromosomes, where most Y-linked genes have X-linked homologs, the Drosophila X and Y chromosomes appear to be unrelated. Most of the Y-linked genes have autosomal paralogs, so autosome-to-Y transposition must be the main source of Drosophila Y-linked genes. Here we show how these genes were acquired. We found a previously unidentified gene (flagrante delicto Y, FDY) that originated from a recent duplication of the autosomal gene vig2 to the Y chromosome of Drosophila melanogaster. Four contiguous genes were duplicated along with vig2, but they became pseudogenes through the accumulation of deletions and transposable element insertions, whereas FDY remained functional, acquired testis-specific expression, and now accounts for ∼20% of the vig2-like mRNA in testis. FDY is absent in the closest relatives of D. melanogaster, and DNA sequence divergence indicates that the duplication to the Y chromosome occurred ∼2 million years ago. Thus, FDY provides a snapshot of the early stages of the establishment of a Y-linked gene and demonstrates how the Drosophila Y has been accumulating autosomal genes. PMID:26385968
Case report of individual with cutaneous immunodeficiency and novel 1p36 duplication
Hatter, Alyn D; Soler, David C; Curtis, Christine; Cooper, Kevin D; McCormick, Thomas S
2016-01-01
Introduction Crusted or Norwegian scabies is an infectious skin dermatopathology usually associated with an underlying immunodeficiency condition. It is caused when the mite Sarcoptes scabiei infects the skin, and the immune system is unable to control its spread, leading to a massive hyperinfestation with a simultaneous inflammatory and hyperkeratotic reaction. This is the first report of a novel 1p36 duplication associated with a recurrent infection of crusted scabies. Case report We describe a 34-year-old patient with a cutaneous immunodeficiency characterized by recurrent crusted scabies infestation, diffuse tinea, and recurrent staphylococcal cellulitis, who we suspected had an undiagnosed syndrome. The patient also suffered from mental retardation, renal failure, and premature senescence. A cytogenetic fluorescence in situ hybridization analysis revealed a 9.34 Mb duplication within the short (p) arm of chromosome 1, precisely from 1p36.11 to 1p36.21, with an adjacent 193 kb copy gain entirely within 1p36.11. In addition, chromosome 4 had a 906 kb gain in 4p16.1 and chromosome 9 had a 81 kb copy gain in 9p24.3. Over 100 genes localized within these duplicated regions. Gene expression array revealed 82 genes whose expression changed >1.5-fold compared to a healthy age-matched skin control, but among them only the lipolytic enzyme arylacetamide deacetylase-like 3 was found within the duplicated 1p36 region of chromosome 1. Discussion Although genetic duplications in the 1p36 region have been previously described, our report describes a novel duplicative variant within the 1p36 region. The patient did not have a past history of immunosuppression but was afflicted by a recurrent case of crusted scabies, raising the possibility that the recurrent infection was associated with the 1p36 genetic duplication. Conclusion To our knowledge, the specific duplicated sequence between 1p36.11 and p36.21 found in our patient has never been previously reported. We reviewed and compared the clinical, genotyping, and gene microarray results of our patient in order to characterize this novel 1p36 duplication syndrome, which might have contributed to the recurrent scabies infection in this patient. PMID:26834495
Wild tobacco genomes reveal the evolution of nicotine biosynthesis.
Xu, Shuqing; Brockmöller, Thomas; Navarro-Quezada, Aura; Kuhl, Heiner; Gase, Klaus; Ling, Zhihao; Zhou, Wenwu; Kreitzer, Christoph; Stanke, Mario; Tang, Haibao; Lyons, Eric; Pandey, Priyanka; Pandey, Shree P; Timmermann, Bernd; Gaquerel, Emmanuel; Baldwin, Ian T
2017-06-06
Nicotine, the signature alkaloid of Nicotiana species responsible for the addictive properties of human tobacco smoking, functions as a defensive neurotoxin against attacking herbivores. However, the evolution of the genetic features that contributed to the assembly of the nicotine biosynthetic pathway remains unknown. We sequenced and assembled genomes of two wild tobaccos, Nicotiana attenuata (2.5 Gb) and Nicotiana obtusifolia (1.5 Gb), two ecological models for investigating adaptive traits in nature. We show that after the Solanaceae whole-genome triplication event, a repertoire of rapidly expanding transposable elements (TEs) bloated these Nicotiana genomes, promoted expression divergences among duplicated genes, and contributed to the evolution of herbivory-induced signaling and defenses, including nicotine biosynthesis. The biosynthetic machinery that allows for nicotine synthesis in the roots evolved from the stepwise duplications of two ancient primary metabolic pathways: the polyamine and nicotinamide adenine dinucleotide (NAD) pathways. In contrast to the duplication of the polyamine pathway that is shared among several solanaceous genera producing polyamine-derived tropane alkaloids, we found that lineage-specific duplications within the NAD pathway and the evolution of root-specific expression of the duplicated Solanaceae-specific ethylene response factor that activates the expression of all nicotine biosynthetic genes resulted in the innovative and efficient production of nicotine in the genus Nicotiana Transcription factor binding motifs derived from TEs may have contributed to the coexpression of nicotine biosynthetic pathway genes and coordinated the metabolic flux. Together, these results provide evidence that TEs and gene duplications facilitated the emergence of a key metabolic innovation relevant to plant fitness.
2012-01-01
Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280
Duplication of 20p12.3 associated with familial Wolff-Parkinson-White syndrome.
Mills, Kimberly I; Anderson, Jacqueline; Levy, Philip T; Cole, F Sessions; Silva, Jennifer N A; Kulkarni, Shashikant; Shinawi, Marwan
2013-01-01
Wolff-Parkinson-White (WPW) syndrome is caused by preexcitation of the ventricular myocardium via an accessory pathway which increases the risk for paroxysmal supraventricular tachycardia. The condition is often sporadic and of unknown etiology in the majority of cases. Autosomal dominant inheritance and association with congenital heart defects or ventricular hypertrophy were described. Microdeletions of 20p12.3 have been associated with WPW syndrome with either cognitive dysfunction or Alagille syndrome. Here, we describe the association of 20p12.3 duplication with WPW syndrome in a patient who presented with non-immune hydrops. Her paternal uncle carries the duplication and has attention-deficit hyperactivity disorder and electrocardiographic findings consistent with WPW. The 769 kb duplication was detected by the Affymetrix Whole Genome-Human SNP Array 6.0 and encompasses two genes and the first two exons of a third gene. We discuss the potential role of the genes in the duplicated region in the pathogenesis of WPW and possible neurobehavioral abnormalities. Our data provide additional support for a significant role of 20p12.3 chromosomal rearrangements in the etiology of WPW syndrome. Copyright © 2012 Wiley Periodicals, Inc.
Toxin gene determination and evolution in scorpaenoid fish.
Chuang, Po-Shun; Shiao, Jen-Chieh
2014-09-01
In this study, we determine the toxin genes from both cDNA and genomic DNA of four scorpaenoid fish and reconstruct their evolutionary relationship. The deduced protein sequences of the two toxin subunits in Sebastapistes strongia, Scorpaenopsis oxycephala, and Sebastiscus marmoratus are about 700 amino acid, similar to the sizes of the stonefish (Synanceia horrida, and Synanceia verrucosa) and lionfish (Pterois antennata and Pterois volitans) toxins previously published. The intron positions are highly conserved among these species, which indicate the applicability of gene finding by using genomic DNA template. The phylogenetic analysis shows that the two toxin subunits were duplicated prior to the speciation of Scorpaenoidei. The precedence of the gene duplication over speciation indicates that the toxin genes may be common to the whole family of Scorpaeniform. Furthermore, one additional toxin gene has been determined in the genomic DNA of Dendrochirus zebra. The phylogenetic analysis suggests that an additional gene duplication occurred before the speciation of the lionfish (Pteroinae) and a pseudogene may be generally present in the lineage of lionfish. Copyright © 2014 Elsevier Ltd. All rights reserved.
Gautier, Philippe; Loosli, Felix; Tay, Boon-Hui; Tay, Alice; Murdoch, Emma; Coutinho, Pedro; van Heyningen, Veronica; Brenner, Sydney; Venkatesh, Byrappa; Kleinjan, Dirk A.
2013-01-01
Pax6 is a developmental control gene essential for eye development throughout the animal kingdom. In addition, Pax6 plays key roles in other parts of the CNS, olfactory system, and pancreas. In mammals a single Pax6 gene encoding multiple isoforms delivers these pleiotropic functions. Here we provide evidence that the genomes of many other vertebrate species contain multiple Pax6 loci. We sequenced Pax6-containing BACs from the cartilaginous elephant shark (Callorhinchus milii) and found two distinct Pax6 loci. Pax6.1 is highly similar to mammalian Pax6, while Pax6.2 encodes a paired-less Pax6. Using synteny relationships, we identify homologs of this novel paired-less Pax6.2 gene in lizard and in frog, as well as in zebrafish and in other teleosts. In zebrafish two full-length Pax6 duplicates were known previously, originating from the fish-specific genome duplication (FSGD) and expressed in divergent patterns due to paralog-specific loss of cis-elements. We show that teleosts other than zebrafish also maintain duplicate full-length Pax6 loci, but differences in gene and regulatory domain structure suggest that these Pax6 paralogs originate from a more ancient duplication event and are hence renamed as Pax6.3. Sequence comparisons between mammalian and elephant shark Pax6.1 loci highlight the presence of short- and long-range conserved noncoding elements (CNEs). Functional analysis demonstrates the ancient role of long-range enhancers for Pax6 transcription. We show that the paired-less Pax6.2 ortholog in zebrafish is expressed specifically in the developing retina. Transgenic analysis of elephant shark and zebrafish Pax6.2 CNEs with homology to the mouse NRE/Pα internal promoter revealed highly specific retinal expression. Finally, morpholino depletion of zebrafish Pax6.2 resulted in a “small eye” phenotype, supporting a role in retinal development. In summary, our study reveals that the pleiotropic functions of Pax6 in vertebrates are served by a divergent family of Pax6 genes, forged by ancient duplication events and by independent, lineage-specific gene losses. PMID:23359656
Gouran, Hossein; Chakraborty, Sandeep; Rao, Basuthkar J; Asgeirsson, Bjarni; Dandekar, Abhaya
2014-01-01
Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction.
Marandel, Lucie; Seiliez, Iban; Véron, Vincent; Skiba-Cassy, Sandrine; Panserat, Stéphane
2015-07-01
The rainbow trout (Oncorhynchus mykiss) is considered to be a strictly carnivorous fish species that is metabolically adapted for high catabolism of proteins and low utilization of dietary carbohydrates. This species consequently has a "glucose-intolerant" phenotype manifested by persistent hyperglycemia when fed a high-carbohydrate diet. Gluconeogenesis in adult fish is also poorly, if ever, regulated by carbohydrates, suggesting that this metabolic pathway is involved in this specific phenotype. In this study, we hypothesized that the fate of duplicated genes after the salmonid-specific 4th whole genome duplication (Ss4R) may have led to adaptive innovation and that their study might provide new elements to enhance our understanding of gluconeogenesis and poor dietary carbohydrate use in this species. Our evolutionary analysis of gluconeogenic genes revealed that pck1, pck2, fbp1a, and g6pca were retained as singletons after Ss4r, while g6pcb1, g6pcb2, and fbp1b ohnolog pairs were maintained. For all genes, duplication may have led to sub- or neofunctionalization. Expression profiles suggest that the gluconeogenesis pathway remained active in trout fed a no-carbohydrate diet. When trout were fed a high-carbohydrate diet (30%), most of the gluconeogenic genes were non- or downregulated, except for g6pbc2 ohnologs, whose RNA levels were surprisingly increased. This study demonstrates that Ss4R in trout involved adaptive innovation via gene duplication and via the outcome of the resulting ohnologs. Indeed, maintenance of ohnologous g6pcb2 pair may contribute in a significant way to the glucose-intolerant phenotype of trout and may partially explain its poor use of dietary carbohydrates. Copyright © 2015 the American Physiological Society.
Rao, Basuthkar J.; Asgeirsson, Bjarni; Dandekar, Abhaya
2014-01-01
Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction. PMID:25717364
Hou, Zhaoqi; Jia, Bing; Li, Fei; Liu, Pu; Liu, Li; Ye, Zhenfeng; Zhu, Liwu; Wang, Qi; Heng, Wei
2018-01-01
The plant genes encoding ABCGs that have been identified to date play a role in suberin formation in response to abiotic and biotic stress. In the present study, 80 ABCG genes were identified in 'Dangshansuli' Chinese white pear and designated as PbABCGs. Based on the structural characteristics and phylogenetic analysis, the PbABCG family genes could be classified into seven main groups: classes A-G. Segmental and dispersed duplications were the primary forces underlying the PbABCG gene family expansion in 'Dangshansuli' pear. Most of the PbABCG duplicated gene pairs date to the recent whole-genome duplication that occurred 30~45 million years ago. Purifying selection has also played a critical role in the evolution of the ABCG genes. Ten PbABCG genes screened in the transcriptome of 'Dangshansuli' pear and its russet mutant 'Xiusu' were validated, and the expression levels of the PbABCG genes exhibited significant differences at different stages. The results presented here will undoubtedly be useful for better understanding of the complexity of the PbABCG gene family and will facilitate the functional characterization of suberin formation in the russet mutant.
2011-01-01
Background Missense mutations in three different genes encoding amyloid-β precursor protein, presenilin 1 and presenilin 2 are recognized to cause familial early-onset Alzheimer disease. Also duplications of the amyloid precursor protein gene have been shown to cause the disease. At the Dept. of Geriatric Medicine, Karolinska University Hospital, Sweden, patients are referred for mutation screening for the identification of nucleotide variations and for determining copy-number of the APP locus. Methods We combined the method of microsatellite marker genotyping with a quantitative real-time PCR analysis to detect duplications in patients with Alzheimer disease. Results In 22 DNA samples from individuals diagnosed with clinical Alzheimer disease, we identified one patient carrying a duplication on chromosome 21 which included the APP locus. Further mapping of the chromosomal region by array-comparative genome hybridization showed that the duplication spanned a maximal region of 1.09 Mb. Conclusions This is the first report of an APP duplication in a Swedish Alzheimer patient and describes the use of quantitative real-time PCR as a tool for determining copy-number of the APP locus. PMID:22044463
Thonberg, Håkan; Fallström, Marie; Björkström, Jenny; Schoumans, Jacqueline; Nennesmo, Inger; Graff, Caroline
2011-11-01
Missense mutations in three different genes encoding amyloid-β precursor protein, presenilin 1 and presenilin 2 are recognized to cause familial early-onset Alzheimer disease. Also duplications of the amyloid precursor protein gene have been shown to cause the disease. At the Dept. of Geriatric Medicine, Karolinska University Hospital, Sweden, patients are referred for mutation screening for the identification of nucleotide variations and for determining copy-number of the APP locus. We combined the method of microsatellite marker genotyping with a quantitative real-time PCR analysis to detect duplications in patients with Alzheimer disease. In 22 DNA samples from individuals diagnosed with clinical Alzheimer disease, we identified one patient carrying a duplication on chromosome 21 which included the APP locus. Further mapping of the chromosomal region by array-comparative genome hybridization showed that the duplication spanned a maximal region of 1.09 Mb. This is the first report of an APP duplication in a Swedish Alzheimer patient and describes the use of quantitative real-time PCR as a tool for determining copy-number of the APP locus.
Yin, Guangjun; Xu, Hongliang; Xiao, Shuyang; Qin, Yajuan; Li, Yaxuan; Yan, Yueming; Hu, Yingkao
2013-10-03
WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have different evolutionary rates. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean.
Evolution of the APETALA2 Gene Lineage in Seed Plants.
Zumajo-Cardona, Cecilia; Pabón-Mora, Natalia
2016-07-01
Gene duplication is a fundamental source of functional evolutionary change and has been associated with organismal diversification and the acquisition of novel features. The APETALA2/ETHYLENE RESPONSIVE ELEMENT-BINDING FACTOR (AP2/ERF) genes are exclusive to vascular plants and have been classified into the AP2-like and ERF-like clades. The AP2-like clade includes the AINTEGUMENTA (ANT) and the euAPETALA2 (euAP2) genes, both regulated by miR172 Arabidopsis has two paralogs in the euAP2 clade, namely APETALA2 (AP2) and TARGET OF EAT3 (TOE3) that control flowering time, meristem determinacy, sepal and petal identity and fruit development. euAP2 genes are likely functionally divergent outside Brassicaceae, as they control fruit development in tomato, and regulate inflorescence meristematic activity in maize. We studied the evolution and expression patterns of euAP2/TOE3 genes to assess large scale and local duplications and evaluate protein motifs likely related with functional changes across seed plants. We sampled euAP2/TOE3 genes from vascular plants and have found three major duplications and a few taxon-specific duplications. Here, we report conserved and new motifs across euAP2/TOE3 proteins and conclude that proteins predating the Brassicaceae duplication are more similar to AP2 than TOE3. Expression data show a shift from restricted expression in leaves, carpels, and fruits in non-core eudicots and asterids to a broader expression of euAP2 genes in leaves, all floral organs and fruits in rosids. Altogether, our data show a functional trend where the canonical A-function (sepal and petal identity) is exclusive to Brassicaceae and it is likely not maintained outside of rosids. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Evolution of the vertebrate Pax4/6 class of genes with focus on its novel member, the Pax10 gene.
Feiner, Nathalie; Meyer, Axel; Kuraku, Shigehiro
2014-06-19
The members of the paired box (Pax) family regulate key developmental pathways in many metazoans as tissue-specific transcription factors. Vertebrate genomes typically possess nine Pax genes (Pax1-9), which are derived from four proto-Pax genes in the vertebrate ancestor that were later expanded through the so-called two-round (2R) whole-genome duplication. A recent study proposed that pax6a genes of a subset of teleost fishes (namely, acanthopterygians) are remnants of a paralog generated in the 2R genome duplication, to be renamed pax6.3, and reported one more group of vertebrate Pax genes (Pax6.2), most closely related to the Pax4/6 class. We propose to designate this new member Pax10 instead and reconstruct the evolutionary history of the Pax4/6/10 class with solid phylogenetic evidence. Our synteny analysis showed that Pax4, -6, and -10 originated in the 2R genome duplications early in vertebrate evolution. The phylogenetic analyses of relationships between teleost pax6a and other Pax4, -6, and -10 genes, however, do not support the proposed hypothesis of an ancient origin of the acanthopterygian pax6a genes in the 2R genome duplication. Instead, we confirmed the traditional scenario that the acanthopterygian pax6a is derived from the more recent teleost-specific genome duplication. Notably, Pax6 is present in all vertebrates surveyed to date, whereas Pax4 and -10 were lost multiple times in independent vertebrate lineages, likely because of their restricted expression patterns: Among Pax6-positive domains, Pax10 has retained expression in the adult retina alone, which we documented through in situ hybridization and quantitative reverse transcription polymerase chain reaction experiments on zebrafish, Xenopus, and anole lizard. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Lagman, David; Ocampo Daza, Daniel; Widmark, Jenny; Abalo, Xesús M; Sundström, Görel; Larhammar, Dan
2013-11-02
Vertebrate color vision is dependent on four major color opsin subtypes: RH2 (green opsin), SWS1 (ultraviolet opsin), SWS2 (blue opsin), and LWS (red opsin). Together with the dim-light receptor rhodopsin (RH1), these form the family of vertebrate visual opsins. Vertebrate genomes contain many multi-membered gene families that can largely be explained by the two rounds of whole genome duplication (WGD) in the vertebrate ancestor (2R) followed by a third round in the teleost ancestor (3R). Related chromosome regions resulting from WGD or block duplications are said to form a paralogon. We describe here a paralogon containing the genes for visual opsins, the G-protein alpha subunit families for transducin (GNAT) and adenylyl cyclase inhibition (GNAI), the oxytocin and vasopressin receptors (OT/VP-R), and the L-type voltage-gated calcium channels (CACNA1-L). Sequence-based phylogenies and analyses of conserved synteny show that the above-mentioned gene families, and many neighboring gene families, expanded in the early vertebrate WGDs. This allows us to deduce the following evolutionary scenario: The vertebrate ancestor had a chromosome containing the genes for two visual opsins, one GNAT, one GNAI, two OT/VP-Rs and one CACNA1-L gene. This chromosome was quadrupled in 2R. Subsequent gene losses resulted in a set of five visual opsin genes, three GNAT and GNAI genes, six OT/VP-R genes and four CACNA1-L genes. These regions were duplicated again in 3R resulting in additional teleost genes for some of the families. Major chromosomal rearrangements have taken place in the teleost genomes. By comparison with the corresponding chromosomal regions in the spotted gar, which diverged prior to 3R, we could time these rearrangements to post-3R. We present an extensive analysis of the paralogon housing the visual opsin, GNAT and GNAI, OT/VP-R, and CACNA1-L gene families. The combined data imply that the early vertebrate WGD events contributed to the evolution of vision and the other neuronal and neuroendocrine functions exerted by the proteins encoded by these gene families. In pouched lamprey all five visual opsin genes have previously been identified, suggesting that lampreys diverged from the jawed vertebrates after 2R.
Myelodysplastic syndrome in an infant with constitutional pure duplication 1q41-qter.
Morokawa, Hirokazu; Kamiya, Motoko; Wakui, Keiko; Kobayashi, Mikiko; Kurata, Takashi; Matsuda, Kazuyuki; Kawamura, Rie; Kanno, Hiroyuki; Fukushima, Yoshimitsu; Nakazawa, Yozo; Kosho, Tomoki
2018-01-01
We report on a Japanese female infant as the fourth patient with the constitutional pure duplication 1q41-qter confirmed by chromosomal microarray and as the first who developed myelodysplastic syndrome (MDS) among those with the constitutional 1q duplication. Common clinical features of the constitutional pure duplication 1q41-qter include developmental delay, craniofacial characteristics, foot malformation, hypertrichosis, and respiratory insufficiency. The association between MDS and the duplication of the genes in the 1q41-qter region remains unknown.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hunsperger, Heather M.; Randhawa, Tejinder; Cattolico, Rose Ann
Two non-homologous, isofunctional enzymes catalyze the penultimate step of chlorophyll a synthesis in oxygenic photosynthetic organisms such as cyanobacteria, eukaryotic algae and land plants: the light independent (LIPOR) and light-dependent (POR) protochlorophyllide oxidoreductases. Whereas the distribution of these enzymes in cyanobacteria and land plants is well understood, the presence, loss, duplication, and replacement of these genes have not been surveyed in the polyphyletic and remarkably diverse eukaryotic algal lineages.
Hunsperger, Heather M.; Randhawa, Tejinder; Cattolico, Rose Ann
2015-02-10
Two non-homologous, isofunctional enzymes catalyze the penultimate step of chlorophyll a synthesis in oxygenic photosynthetic organisms such as cyanobacteria, eukaryotic algae and land plants: the light independent (LIPOR) and light-dependent (POR) protochlorophyllide oxidoreductases. Whereas the distribution of these enzymes in cyanobacteria and land plants is well understood, the presence, loss, duplication, and replacement of these genes have not been surveyed in the polyphyletic and remarkably diverse eukaryotic algal lineages.
Conserved noncoding sequences conserve biological networks and influence genome evolution.
Xie, Jianbo; Qian, Kecheng; Si, Jingna; Xiao, Liang; Ci, Dong; Zhang, Deqiang
2018-05-01
Comparative genomics approaches have identified numerous conserved cis-regulatory sequences near genes in plant genomes. Despite the identification of these conserved noncoding sequences (CNSs), our knowledge of their functional importance and selection remains limited. Here, we used a combination of DNA methylome analysis, microarray expression analyses, and functional annotation to study these sequences in the model tree Populus trichocarpa. Methylation in CG contexts and non-CG contexts was lower in CNSs, particularly CNSs in the 5'-upstream regions of genes, compared with other sites in the genome. We observed that CNSs are enriched in genes with transcription and binding functions, and this also associated with syntenic genes and those from whole-genome duplications, suggesting that cis-regulatory sequences play a key role in genome evolution. We detected a significant positive correlation between CNS number and protein interactions, suggesting that CNSs may have roles in the evolution and maintenance of biological networks. The divergence of CNSs indicates that duplication-degeneration-complementation drives the subfunctionalization of a proportion of duplicated genes from whole-genome duplication. Furthermore, population genomics confirmed that most CNSs are under strong purifying selection and only a small subset of CNSs shows evidence of adaptive evolution. These findings provide a foundation for future studies exploring these key genomic features in the maintenance of biological networks, local adaptation, and transcription.
Evolution of vertebrate visual pigments.
Bowmaker, James K
2008-09-01
The visual pigments of vertebrates evolved about 500 million years ago, before the major evolutionary step of the development of jaws. Four spectrally distinct classes of cone opsin evolved through gene duplication, followed by the rod opsin class that arose from the duplication of the middle-wave-sensitive cone opsin. All four cone classes are present in many extant teleost fish, reptiles and birds, but one or more classes have been lost in primitive fish, amphibians and mammals. Gene duplication within the cone classes, especially in teleosts, has resulted in multiple opsins being available, both temporally and spatially, during development.
Cai, Guiqing; Edelmann, Lisa; Goldsmith, Juliet E; Cohen, Ninette; Nakamine, Alisa; Reichert, Jennifer G; Hoffman, Ellen J; Zurawiecki, Danielle M; Silverman, Jeremy M; Hollander, Eric; Soorya, Latha; Anagnostou, Evdokia; Betancur, Catalina; Buxbaum, Joseph D
2008-01-01
Background It has previously been shown that specific microdeletions and microduplications, many of which also associated with cognitive impairment (CI), can present with autism spectrum disorders (ASDs). Multiplex ligation-dependent probe amplification (MLPA) represents an efficient method to screen for such recurrent microdeletions and microduplications. Methods In the current study, a total of 279 unrelated subjects ascertained for ASDs were screened for genomic disorders associated with CI using MLPA. Fluorescence in situ hybridization (FISH), quantitative polymerase chain reaction (Q-PCR) and/or direct DNA sequencing were used to validate potential microdeletions and microduplications. Methylation-sensitive MLPA was used to characterize individuals with duplications in the Prader-Willi/Angelman (PWA) region. Results MLPA showed two subjects with typical ASD-associated interstitial duplications of the 15q11-q13 PWA region of maternal origin. Two additional subjects showed smaller, de novo duplications of the PWA region that had not been previously characterized. Genes in these two novel duplications include GABRB3 and ATP10A in one case, and MKRN3, MAGEL2 and NDN in the other. In addition, two subjects showed duplications of the 22q11/DiGeorge syndrome region. One individual was found to carry a 12 kb deletion in one copy of the ASPA gene on 17p13, which when mutated in both alleles leads to Canavan disease. Two subjects showed partial duplication of the TM4SF2 gene on Xp11.4, previously implicated in X-linked non-specific mental retardation, but in our subsequent analyses such variants were also found in controls. A partial duplication in the ASMT gene, located in the pseudoautosomal region 1 (PAR1) of the sex chromosomes and previously suggested to be involved in ASD susceptibility, was observed in 6–7% of the cases but in only 2% of controls (P = 0.003). Conclusion MLPA proves to be an efficient method to screen for chromosomal abnormalities. We identified duplications in 15q11-q13 and in 22q11, including new de novo small duplications, as likely contributing to ASD in the current sample by increasing liability and/or exacerbating symptoms. Our data indicate that duplications in TM4SF2 are not associated with the phenotype given their presence in controls. The results in PAR1/PAR2 are the first large-scale studies of gene dosage in these regions, and the findings at the ASMT locus indicate that further studies of the duplication of the ASMT gene are needed in order to gain insight into its potential involvement in ASD. Our studies also identify some limitations of MLPA, where single base changes in probe binding sequences alter results. In summary, our studies indicate that MLPA, with a focus on accepted medical genetic conditions, may be an inexpensive method for detection of microdeletions and microduplications in ASD patients for purposes of genetic counselling if MLPA-identified deletions are validated by additional methods. PMID:18925931
Siegel, Nicol; Hoegg, Simone; Salzburger, Walter; Braasch, Ingo; Meyer, Axel
2007-01-01
Background The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. Results We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. Conclusion There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular – but possibly clusters of genes more generally – might be linked to the presence of promoter, enhancer or inhibitor motifs that serve to regulate more than just one gene. Therefore, deletions, inversions or relocations of individual genes could destroy the regulation of the clustered genes in this region. The existence of such a regulation network might explain the evolutionary conservation of gene order and orientation over the course of hundreds of millions of years of vertebrate evolution. Another possible explanation for the highly conserved gene order might be the existence of a regulator not located immediately next to its corresponding gene but further away since a relocation or inversion would possibly interrupt this interaction. Different ParaHox clusters were found to have experienced differential gene loss in teleosts. Yet the complete set of these homeobox genes was maintained, albeit distributed over almost twice the number of chromosomes. Selection due to dosage effects and/or stoichiometric disturbance might act more strongly to maintain a modal number of homeobox genes (and possibly transcription factors more generally) per genome, yet permit the accumulation of other (non regulatory) genes associated with these homeobox gene clusters. PMID:17822543
On the need for widespread horizontal gene transfers under genome size constraint.
Isambert, Hervé; Stein, Richard R
2009-08-25
While eukaryotes primarily evolve by duplication-divergence expansion (and reduction) of their own gene repertoire with only rare horizontal gene transfers, prokaryotes appear to evolve under both gene duplications and widespread horizontal gene transfers over long evolutionary time scales. But, the evolutionary origin of this striking difference in the importance of horizontal gene transfers remains by and large a mystery. We propose that the abundance of horizontal gene transfers in free-living prokaryotes is a simple but necessary consequence of two opposite effects: i) their apparent genome size constraint compared to typical eukaryote genomes and ii) their underlying genome expansion dynamics through gene duplication-divergence evolution, as demonstrated by the presence of many tandem and block repeated genes. In principle, this combination of genome size constraint and underlying duplication expansion should lead to a coalescent-like process with extensive turnover of functional genes. This would, however, imply the unlikely, systematic reinvention of functions from discarded genes within independent phylogenetic lineages. Instead, we propose that the long-term evolutionary adaptation of free-living prokaryotes must have resulted in the emergence of efficient non-phylogenetic pathways to circumvent gene loss. This need for widespread horizontal gene transfers due to genome size constraint implies, in particular, that prokaryotes must remain under strong selection pressure in order to maintain the long-term evolutionary adaptation of their "mutualized" gene pool, beyond the inevitable turnover of individual prokaryote species. By contrast, the absence of genome size constraint for typical eukaryotes has presumably relaxed their need for widespread horizontal gene transfers and strong selection pressure. Yet, the resulting loss of genetic functions, due to weak selection pressure and inefficient gene recovery mechanisms, must have ultimately favored the emergence of more complex life styles and ecological integration of many eukaryotes. This article was reviewed by Pierre Pontarotti, Eugene V Koonin and Sergei Maslov.
An epigenetic state associated with areas of gene duplication
Gimelbrant, Alexander A.; Chess, Andrew
2006-01-01
Asynchronous DNA replication is an epigenetically determined feature found in all cases of monoallelic expression, including genomic imprinting, X-inactivation, and random monoallelic expression of autosomal genes such as immunoglobulins and olfactory receptor genes. Most genes of the latter class were identified in experiments focused on genes functioning in the chemosensory and immune systems. We performed an unbiased survey of asynchronous replication in the mouse genome, excluding known asynchronously replicated genes. Fully 10% (eight of 80) of the genes tested exhibited asynchronous replication. A common feature of the newly identified asynchronously replicated areas is their proximity to areas of tandem gene duplication. Testing of other clustered areas supported the idea that such regions are enriched with asynchronously replicated genes. PMID:16687731
A Synergism between Adaptive Effects and Evolvability Drives Whole Genome Duplication to Fixation
Cuypers, Thomas D.; Hogeweg, Paulien
2014-01-01
Whole genome duplication has shaped eukaryotic evolutionary history and has been associated with drastic environmental change and species radiation. While the most common fate of WGD duplicates is a return to single copy, retained duplicates have been found enriched for highly interacting genes. This pattern has been explained by a neutral process of subfunctionalization and more recently, dosage balance selection. However, much about the relationship between environmental change, WGD and adaptation remains unknown. Here, we study the duplicate retention pattern postWGD, by letting virtual cells adapt to environmental changes. The virtual cells have structured genomes that encode a regulatory network and simple metabolism. Populations are under selection for homeostasis and evolve by point mutations, small indels and WGD. After populations had initially adapted fully to fluctuating resource conditions re-adaptation to a broad range of novel environments was studied by tracking mutations in the line of descent. WGD was established in a minority (≈30%) of lineages, yet, these were significantly more successful at re-adaptation. Unexpectedly, WGD lineages conserved more seemingly redundant genes, yet had higher per gene mutation rates. While WGD duplicates of all functional classes were significantly over-retained compared to a model of neutral losses, duplicate retention was clearly biased towards highly connected TFs. Importantly, no subfunctionalization occurred in conserved pairs, strongly suggesting that dosage balance shaped retention. Meanwhile, singles diverged significantly. WGD, therefore, is a powerful mechanism to cope with environmental change, allowing conservation of a core machinery, while adapting the peripheral network to accommodate change. PMID:24743268
A synergism between adaptive effects and evolvability drives whole genome duplication to fixation.
Cuypers, Thomas D; Hogeweg, Paulien
2014-04-01
Whole genome duplication has shaped eukaryotic evolutionary history and has been associated with drastic environmental change and species radiation. While the most common fate of WGD duplicates is a return to single copy, retained duplicates have been found enriched for highly interacting genes. This pattern has been explained by a neutral process of subfunctionalization and more recently, dosage balance selection. However, much about the relationship between environmental change, WGD and adaptation remains unknown. Here, we study the duplicate retention pattern postWGD, by letting virtual cells adapt to environmental changes. The virtual cells have structured genomes that encode a regulatory network and simple metabolism. Populations are under selection for homeostasis and evolve by point mutations, small indels and WGD. After populations had initially adapted fully to fluctuating resource conditions re-adaptation to a broad range of novel environments was studied by tracking mutations in the line of descent. WGD was established in a minority (≈30%) of lineages, yet, these were significantly more successful at re-adaptation. Unexpectedly, WGD lineages conserved more seemingly redundant genes, yet had higher per gene mutation rates. While WGD duplicates of all functional classes were significantly over-retained compared to a model of neutral losses, duplicate retention was clearly biased towards highly connected TFs. Importantly, no subfunctionalization occurred in conserved pairs, strongly suggesting that dosage balance shaped retention. Meanwhile, singles diverged significantly. WGD, therefore, is a powerful mechanism to cope with environmental change, allowing conservation of a core machinery, while adapting the peripheral network to accommodate change.
Recurrent duplications of the annexin A1 gene (ANXA1) in autism spectrum disorders.
Correia, Catarina T; Conceição, Inês C; Oliveira, Bárbara; Coelho, Joana; Sousa, Inês; Sequeira, Ana F; Almeida, Joana; Café, Cátia; Duque, Frederico; Mouga, Susana; Roberts, Wendy; Gao, Kun; Lowe, Jennifer K; Thiruvahindrapuram, Bhooma; Walker, Susan; Marshall, Christian R; Pinto, Dalila; Nurnberger, John I; Scherer, Stephen W; Geschwind, Daniel H; Oliveira, Guiomar; Vicente, Astrid M
2014-04-10
Validating the potential pathogenicity of copy number variants (CNVs) identified in genome-wide studies of autism spectrum disorders (ASD) requires detailed assessment of case/control frequencies, inheritance patterns, clinical correlations, and functional impact. Here, we characterize a small recurrent duplication in the annexin A1 (ANXA1) gene, identified by the Autism Genome Project (AGP) study. From the AGP CNV genomic screen in 2,147 ASD individuals, we selected for characterization an ANXA1 gene duplication that was absent in 4,964 population-based controls. We further screened the duplication in a follow-up sample including 1,496 patients and 410 controls, and evaluated clinical correlations and family segregation. Sequencing of exonic/downstream ANXA1 regions was performed in 490 ASD patients for identification of additional variants. The ANXA1 duplication, overlapping the last four exons and 3'UTR region, had an overall prevalence of 11/3,643 (0.30%) in unrelated ASD patients but was not identified in 5,374 controls. Duplication carriers presented no distinctive clinical phenotype. Family analysis showed neuropsychiatric deficits and ASD traits in multiple relatives carrying the duplication, suggestive of a complex genetic inheritance. Sequencing of exonic regions and the 3'UTR identified 11 novel changes, but no obvious variants with clinical significance. We provide multilevel evidence for a role of ANXA1 in ASD etiology. Given its important role as mediator of glucocorticoid function in a wide variety of brain processes, including neuroprotection, apoptosis, and control of the neuroendocrine system, the results add ANXA1 to the growing list of rare candidate genetic etiological factors for ASD.
Korablev, Alexei N; Serova, Irina A; Serov, Oleg L
2017-12-28
Copy Number Variation (CNV) of the human CNTN6 gene (encoding the contactin-6 protein), caused by deletions or duplications, is responsible for severe neurodevelopmental impairments, often in combination with facial dysmorphias. Conversely, deleterious point mutations of this gene do not show any clinical phenotypes. The aim of this study is to generate mice carrying large deletions, duplications and inversions involving the Cntn6 gene as a new experimental model to study CNV of the human CNTN6 locus. To generate large chromosomal rearrangements on mouse chromosome 6, we applied CRISPR/Cas9 technology in zygotes. Two guide RNAs (gRNAs) (flanking a DNA fragment of 1137 Mb) together with Cas9 mRNA and single-stranded DNA oligonucleotides (ssODN) were microinjected into the cytoplasm of 599 zygotes of F1 (C57BL x CBA) mice, and 256 of them were transplanted into oviducts of CD-1 females. As a result, we observed the birth of 41 viable F0 offspring. Genotyping of these mice was performed by PCR analysis and sequencing of PCR products. Among the 41 F0 offspring, we identified seven mice with deletions, two animals carrying duplications of the gene and four carrying inversions. Interestingly, two F0 offspring had both deletions and duplications. It is important to note that while three of seven deletion carriers showed expected sequences at the new joint sites, in another three, we identified an absence of 1-10 nucleotides at the CRISPR/Cas9 cut sites, and in one animal, 103 bp were missing, presumably due to error-prone non-homologous end joining. In addition, we detected the absence of 5 and 13 nucleotides at these sites in two F0 duplication carriers. Similar sequence changes at CRISPR/Cas9 cut sites were observed at the right and left boundaries of inversions. Thus, megabase-scale deletions, duplications and inversions were identified in 11 F0 offspring among 41 analyzed, i.e., approximately 25% efficiency. All genetically modified F0 offspring were viable and able to transmit these large chromosomal rearrangements to the next generation. Using CRISPR/Cas9 technology, we created mice carrying megabase-scale deletions, duplications, and inversions involving the full-sized Cntn6 gene. These mice became founders of new mouse lines, which may be more appropriate experimental models of CNV in the human 3p26.3 region than Сntn6 knockout mice.
Cridland, Julie M; Thornton, Kevin R
2010-01-13
Several recent studies have focused on the evolution of recently duplicated genes in Drosophila. Currently, however, little is known about the evolutionary forces acting upon duplications that are segregating in natural populations. We used a high-throughput, paired-end sequencing platform (Illumina) to identify structural variants in a population sample of African D. melanogaster. Polymerase chain reaction and sequencing confirmation of duplications detected by multiple, independent paired-ends showed that paired-end sequencing reliably uncovered the break points of structural rearrangements and allowed us to identify a number of tandem duplications segregating within a natural population. Our confirmation experiments show that rates of confirmation are very high, even at modest coverage. Our results also compare well with previous studies using microarrays (Emerson J, Cardoso-Moreira M, Borevitz JO, Long M. 2008. Natural selection shapes genome wide patterns of copy-number polymorphism in Drosophila melanogaster. Science. 320:1629-1631. and Dopman EB, Hartl DL. 2007. A portrait of copy-number polymorphism in Drosophila melanogaster. Proc Natl Acad Sci U S A. 104:19920-19925.), which both gives us confidence in the results of this study as well as confirms previous microarray results.We were also able to identify whole-gene duplications, such as a novel duplication of Or22a, an olfactory receptor, and identify copy-number differences in genes previously known to be under positive selection, like Cyp6g1, which confers resistance to dichlorodiphenyltrichloroethane. Several "hot spots" of duplications were detected in this study, which indicate that particular regions of the genome may be more prone to generating duplications. Finally, population frequency analysis of confirmed events also showed an excess of rare variants in our population, which indicates that duplications segregating in the population may be deleterious and ultimately destined to be lost from the population.
Evolutionary interplay between sister cytochrome P450 genes shapes plasticity in plant metabolism.
Liu, Zhenhua; Tavares, Raquel; Forsythe, Evan S; André, François; Lugan, Raphaël; Jonasson, Gabriella; Boutet-Mercey, Stéphanie; Tohge, Takayuki; Beilstein, Mark A; Werck-Reichhart, Danièle; Renault, Hugues
2016-10-07
Expansion of the cytochrome P450 gene family is often proposed to have a critical role in the evolution of metabolic complexity, in particular in microorganisms, insects and plants. However, the molecular mechanisms underlying the evolution of this complexity are poorly understood. Here we describe the evolutionary history of a plant P450 retrogene, which emerged and underwent fixation in the common ancestor of Brassicales, before undergoing tandem duplication in the ancestor of Brassicaceae. Duplication leads first to gain of dual functions in one of the copies. Both sister genes are retained through subsequent speciation but eventually return to a single copy in two of three diverging lineages. In the lineage in which both copies are maintained, the ancestral functions are split between paralogs and a novel function arises in the copy under relaxed selection. Our work illustrates how retrotransposition and gene duplication can favour the emergence of novel metabolic functions.
North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene
Sullivan, Lori S.; Wheaton, Dianna K.; Locke, Kirsten G.; Jones, Kaylie D.; Koboldt, Daniel C.; Fulton, Robert S.; Wilson, Richard K.; Blanton, Susan H.; Birch, David G.; Daiger, Stephen P.
2016-01-01
Purpose To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). Methods A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Results Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13. The duplication creates a partial copy of CCNC and a complete copy of PRDM13. The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. Conclusions The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1 hypersensitive site upstream of the CCNC and PRDM13 genes or a tandem duplication of the PRDM13 gene. The duplication found in the RFS355 family is distinct from the previously reported duplication and provides additional support that dysregulation of PRDM13, not CCNC, is the cause of NCMD mapped to the MCDR1 locus. PMID:27777503
North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene.
Bowne, Sara J; Sullivan, Lori S; Wheaton, Dianna K; Locke, Kirsten G; Jones, Kaylie D; Koboldt, Daniel C; Fulton, Robert S; Wilson, Richard K; Blanton, Susan H; Birch, David G; Daiger, Stephen P
2016-01-01
To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13 . The duplication creates a partial copy of CCNC and a complete copy of PRDM13 . The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1 hypersensitive site upstream of the CCNC and PRDM13 genes or a tandem duplication of the PRDM13 gene. The duplication found in the RFS355 family is distinct from the previously reported duplication and provides additional support that dysregulation of PRDM13 , not CCNC , is the cause of NCMD mapped to the MCDR1 locus.
Life-stage-associated remodelling of lipid metabolism regulation in Atlantic salmon.
Gillard, Gareth; Harvey, Thomas N; Gjuvsland, Arne; Jin, Yang; Thomassen, Magny; Lien, Sigbjørn; Leaver, Michael; Torgersen, Jacob S; Hvidsten, Torgeir R; Vik, Jon Olav; Sandve, Simen R
2018-03-01
Atlantic salmon migrates from rivers to sea to feed, grow and develop gonads before returning to spawn in freshwater. The transition to marine habitats is associated with dramatic changes in the environment, including water salinity, exposure to pathogens and shift in dietary lipid availability. Many changes in physiology and metabolism occur across this life-stage transition, but little is known about the molecular nature of these changes. Here, we use a long-term feeding experiment to study transcriptional regulation of lipid metabolism in Atlantic salmon gut and liver in both fresh- and saltwater. We find that lipid metabolism becomes significantly less plastic to differences in dietary lipid composition when salmon transitions to saltwater and experiences increased dietary lipid availability. Expression of genes in liver relating to lipogenesis and lipid transport decreases overall and becomes less responsive to diet, while genes for lipid uptake in gut become more highly expressed. Finally, analyses of evolutionary consequences of the salmonid-specific whole-genome duplication on lipid metabolism reveal several pathways with significantly different (p < .05) duplicate retention or duplicate regulatory conservation. We also find a limited number of cases where the whole-genome duplication has resulted in an increased gene dosage. In conclusion, we find variable and pathway-specific effects of the salmonid genome duplication on lipid metabolism genes. A clear life-stage-associated shift in lipid metabolism regulation is evident, and we hypothesize this to be, at least partly, driven by nondietary factors such as the preparatory remodelling of gene regulation and physiology prior to sea migration. © 2018 John Wiley & Sons Ltd.
Diverse Cis-Regulatory Mechanisms Contribute to Expression Evolution of Tandem Gene Duplicates
Baudouin-Gonzalez, Luís; Santos, Marília A; Tempesta, Camille; Sucena, Élio; Roch, Fernando; Tanaka, Kohtaro
2017-01-01
Abstract Pairs of duplicated genes generally display a combination of conserved expression patterns inherited from their unduplicated ancestor and newly acquired domains. However, how the cis-regulatory architecture of duplicated loci evolves to produce these expression patterns is poorly understood. We have directly examined the gene-regulatory evolution of two tandem duplicates, the Drosophila Ly6 genes CG9336 and CG9338, which arose at the base of the drosophilids between 40 and 60 Ma. Comparing the expression patterns of the two paralogs in four Drosophila species with that of the unduplicated ortholog in the tephritid Ceratitis capitata, we show that they diverged from each other as well as from the unduplicated ortholog. Moreover, the expression divergence appears to have occurred close to the duplication event and also more recently in a lineage-specific manner. The comparison of the tissue-specific cis-regulatory modules (CRMs) controlling the paralog expression in the four Drosophila species indicates that diverse cis-regulatory mechanisms, including the novel tissue-specific enhancers, differential inactivation, and enhancer sharing, contributed to the expression evolution. Our analysis also reveals a surprisingly variable cis-regulatory architecture, in which the CRMs driving conserved expression domains change in number, location, and specificity. Altogether, this study provides a detailed historical account that uncovers a highly dynamic picture of how the paralog expression patterns and their underlying cis-regulatory landscape evolve. We argue that our findings will encourage studying cis-regulatory evolution at the whole-locus level to understand how interactions between enhancers and other regulatory levels shape the evolution of gene expression. PMID:28961967
Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj
2015-01-01
The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R -genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyanth a genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species. PMID:25902056
Erixon, Per; Oxelman, Bengt
2008-01-01
Background Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. Methodology/Principle Findings We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family) and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family). Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying) selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. Conclusions/Significance We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the controversial issue of whether negative or positive selection is to be expected after gene duplications by providing evidence for the latter alternative. The observed increase in synonymous substitution rates in some of the lineages indicates that the detection of positive selection may be obscured under such circumstances. Future studies are required to explore the functional significance of the large inserted repeated amino acid motifs, as well as the possibility that synonymous substitution rates may be affected by positive selection. PMID:18167545
Harding, Tommy; Roger, Andrew J.; Simpson, Alastair G. B.
2017-01-01
The capacity of halophiles to thrive in extreme hypersaline habitats derives partly from the tight regulation of ion homeostasis, the salt-dependent adjustment of plasma membrane fluidity, and the increased capability to manage oxidative stress. Halophilic bacteria, and archaea have been intensively studied, and substantial research has been conducted on halophilic fungi, and the green alga Dunaliella. By contrast, there have been very few investigations of halophiles that are phagotrophic protists, i.e., protozoa. To gather fundamental knowledge about salt adaptation in these organisms, we studied the transcriptome-level response of Halocafeteria seosinensis (Stramenopiles) grown under contrasting salinities. We provided further evolutionary context to our analysis by identifying genes that underwent recent duplications. Genes that were highly responsive to salinity variations were involved in stress response (e.g., chaperones), ion homeostasis (e.g., Na+/H+ transporter), metabolism and transport of lipids (e.g., sterol biosynthetic genes), carbohydrate metabolism (e.g., glycosidases), and signal transduction pathways (e.g., transcription factors). A significantly high proportion (43%) of duplicated genes were also differentially expressed, accentuating the importance of gene expansion in adaptation by H. seosinensis to high salt environments. Furthermore, we found two genes that were lateral acquisitions from bacteria, and were also highly up-regulated and highly expressed at high salt, suggesting that this evolutionary mechanism could also have facilitated adaptation to high salt. We propose that a transition toward high-salt adaptation in the ancestors of H. seosinensis required the acquisition of new genes via duplication, and some lateral gene transfers (LGTs), as well as the alteration of transcriptional programs, leading to increased stress resistance, proper establishment of ion gradients, and modification of cell structure properties like membrane fluidity. PMID:28611746
USDA-ARS?s Scientific Manuscript database
The ADIPOQ gene of cattle, is located in the vicinity of the quantitative trait locus (QTL) wich effects marbling, the rib eye muscle area and fat thickness on BTA1. In our study, a novel variable duplication (NW_003103812.1:g.9232067_9232133 dup) in the bovine ADIPOQ promoter region was identified ...
Ohtani, Haruka; Morimoto, Takuya; Beppu, Kenji; Kataoka, Ikuo
2018-01-01
Dioecy, the presence of male and female flowers on distinct individuals, has evolved independently in multiple plant lineages, and the genes involved in this differential development are just starting to be uncovered in a few species. Here, we used genomic approaches to investigate this pathway in kiwifruits (genus Actinidia). Genome-wide cataloging of male-specific subsequences, combined with transcriptome analysis, led to the identification of a type-C cytokinin response regulator as a potential sex determinant gene in this genus. Functional transgenic analyses in two model systems, Arabidopsis thaliana and Nicotiana tabacum, indicated that this gene acts as a dominant suppressor of carpel development, prompting us to name it Shy Girl (SyGI). Evolutionary analyses in a panel of Actinidia species revealed that SyGI is located in the Y-specific region of the genome and probably arose from a lineage-specific gene duplication. Comparisons with the duplicated autosomal counterpart, and with orthologs from other angiosperms, suggest that the SyGI-specific duplication and subsequent evolution of cis-elements may have played a key role in the acquisition of separate sexes in this species. PMID:29626069
Expansion and stress responses of the AP2/EREBP superfamily in cotton.
Liu, Chunxiao; Zhang, Tianzhen
2017-01-31
The allotetraploid cotton originated from one hybridization event between an extant progenitor of Gosssypium herbaceum (A 1 ) or G. arboreum (A 2 ) and another progenitor, G. raimondii Ulbrich (D 5 ) 1-1.5 million years ago (Mya). The APETALA2/ethylene-responsive element binding protein (AP2/EREBP) transcription factors constitute one of the largest and most conserved gene families in plants. They are characterized by their AP2 domain, which comprises 60-70 amino acids, and are classified into four main subfamilies: the APETALA2 (AP2), Related to ABI3/VP1 (RAV), Dehydration-Responsive Element Binding protein (DREB) and Ethylene-Responsive Factor (ERF) subfamilies. The AP2/EREBP genes play crucial roles in plant growth, development and biotic and abiotic stress responses. Hence, understanding the molecular characteristics of cotton stress tolerance and gene family expansion would undoubtedly facilitate cotton resistance breeding and evolution research. A total of 269 AP2/EREBP genes were identified in the G. raimondii (D5) cotton genome. The protein domain architecture and intron/exon structure are simple and relatively conserved within each subfamily. They are distributed throughout all chromosomes but are clustered on various chromosomes due to genomic tandem duplication. We identified 73 tandem duplicated genes and 221 segmental duplicated gene pairs which contributed to the expansion of AP2/EREBP superfamily. Of them, tandem duplication was the most important force of the expansion of the B3 group. Transcriptome analysis showed that 504 AP2/EREBP genes were expressed in at least one tested G. hirsutum TM-1 tissues. In G. hirsutum, 151 non-repeated genes of the DREB and ERF subfamily genes were responsive to different stresses: 132 genes were induced by cold, 63 genes by drought and 94 genes by heat. qRT-PCR confirmed that 13 GhDREB and 15 GhERF genes were induced by cold and/or drought. No transcripts detected for 53 of the 111 tandem duplicated genes in TM-1. In addition, some homoeologous genes showed biased expression toward either A-or D-subgenome. The AP2/EREBP genes were obviously expanded in Gossypium. The GhDREB and GhERF genes play crucial roles in cotton stress responses. Our genome-wide analysis of AP2/EREBP genes in cotton provides valuable information for characterizing the molecular functions of AP2/EREBP genes and reveals insights into their evolution in polyploid plants.
Ahmed, Anwar; Haider, Shakir H; Parveen, Shama; Arshad, Mohammed; Alsenaidy, Hytham A; Baaboud, Alawi Omar; Mobaireek, Khalid Fahad; AlSaadi, Muslim Mohammed; Alsenaidy, Abdulrahman M; Sullender, Wayne
2016-01-01
Respiratory syncytial virus (RSV) is an important viral pathogen of acute respiratory tract infection (ARI). Limited data are available on molecular epidemiology of RSV from Saudi Arabia. A total of 130 nasopharyngeal aspirates were collected from children less than 5 years of age with ARI symptoms attending the Emergency Department at King Khalid University Hospital and King Fahad Medical City, Riyadh, Saudi Arabia between October and December, 2014. RSV was identified in the 26% of the hospitalized children by reverse transcriptase PCR. Group A RSV (77%) predominated during the study as compared to group B RSV (23%). The phylogenetic analysis of 28 study strains clustered group A RSV in NA1 and ON1 genotypes and group B viruses in BA (BA9) genotype. Interestingly, 26% of the positive samples clustered in genotypes with duplication in the G protein gene (ON1 for group A and BA for group B). Both the genotypes showed enhanced O-linked glycosylation in the duplicated region, with 10 and 2 additional sites in ON1 and BA respectively. Selection pressure analysis revealed purifying selection in both the ON1 and BA genotypes. One codon each in the ON1 (position 274) and BA genotypes (position 219) were positively selected and had high entropy values indicating variations at these amino acid positions. This is the first report describing the presence of ON1 genotype and the first report on co-circulation of two different genotypes of RSV with duplication in the G protein gene from Saudi Arabia. The clinical implications of the simultaneous occurrence of genotypes with duplication in G protein gene in a given population especially in the concurrent infections should be investigated in future. Further, the ongoing surveillance of RSV in this region will reveal the evolutionary trajectory of these two genotypes with duplication in G protein gene from largest country in the Middle East.
Ahmed, Anwar; Haider, Shakir H.; Parveen, Shama; Arshad, Mohammed; Alsenaidy, Hytham A.; Baaboud, Alawi Omar; Mobaireek, Khalid Fahad; AlSaadi, Muslim Mohammed; Alsenaidy, Abdulrahman M.; Sullender, Wayne
2016-01-01
Respiratory syncytial virus (RSV) is an important viral pathogen of acute respiratory tract infection (ARI). Limited data are available on molecular epidemiology of RSV from Saudi Arabia. A total of 130 nasopharyngeal aspirates were collected from children less than 5 years of age with ARI symptoms attending the Emergency Department at King Khalid University Hospital and King Fahad Medical City, Riyadh, Saudi Arabia between October and December, 2014. RSV was identified in the 26% of the hospitalized children by reverse transcriptase PCR. Group A RSV (77%) predominated during the study as compared to group B RSV (23%). The phylogenetic analysis of 28 study strains clustered group A RSV in NA1 and ON1 genotypes and group B viruses in BA (BA9) genotype. Interestingly, 26% of the positive samples clustered in genotypes with duplication in the G protein gene (ON1 for group A and BA for group B). Both the genotypes showed enhanced O-linked glycosylation in the duplicated region, with 10 and 2 additional sites in ON1 and BA respectively. Selection pressure analysis revealed purifying selection in both the ON1 and BA genotypes. One codon each in the ON1 (position 274) and BA genotypes (position 219) were positively selected and had high entropy values indicating variations at these amino acid positions. This is the first report describing the presence of ON1 genotype and the first report on co-circulation of two different genotypes of RSV with duplication in the G protein gene from Saudi Arabia. The clinical implications of the simultaneous occurrence of genotypes with duplication in G protein gene in a given population especially in the concurrent infections should be investigated in future. Further, the ongoing surveillance of RSV in this region will reveal the evolutionary trajectory of these two genotypes with duplication in G protein gene from largest country in the Middle East. PMID:27835664
2012-01-01
Background Gene duplication and the subsequent divergence in function of the resulting paralogs via subfunctionalization and/or neofunctionalization is hypothesized to have played a major role in the evolution of plant form. The LEAFY HULL STERILE1 (LHS1) SEPALLATA (SEP) genes have been linked with the origin and diversification of the grass spikelet, but it is uncertain 1) when the duplication event that produced the LHS1 clade and its paralogous lineage Oryza sativa MADS5 (OSM5) occurred, and 2) how changes in gene structure and/or expression might have contributed to subfunctionalization and/or neofunctionalization in the two lineages. Methods Phylogenetic relationships among 84 SEP genes were estimated using Bayesian methods. RNA expression patterns were inferred using in situ hybridization. The patterns of protein sequence and RNA expression evolution were reconstructed using maximum parsimony (MP) and maximum likelihood (ML) methods, respectively. Results Phylogenetic analyses mapped the LHS1/OSM5 duplication event to the base of the grass family. MP character reconstructions estimated a change from cytosine to thymine in the first codon position of the first amino acid after the Zea mays MADS3 (ZMM3) domain converted a glutamine to a stop codon in the OSM5 ancestor following the LHS1/OSM5 duplication event. RNA expression analyses of OSM5 co-orthologs in Avena sativa, Chasmanthium latifolium, Hordeum vulgare, Pennisetum glaucum, and Sorghum bicolor followed by ML reconstructions of these data and previously published analyses estimated a complex pattern of gain and loss of LHS1 and OSM5 expression in different floral organs and different flowers within the spikelet or inflorescence. Conclusions Previous authors have reported that rice OSM5 and LHS1 proteins have different interaction partners indicating that the truncation of OSM5 following the LHS1/OSM5 duplication event has resulted in both partitioned and potentially novel gene functions. The complex pattern of OSM5 and LHS1 expression evolution is not consistent with a simple subfunctionalization model following the gene duplication event, but there is evidence of recent partitioning of OSM5 and LHS1 expression within different floral organs of A. sativa, C. latifolium, P. glaucum and S. bicolor, and between the upper and lower florets of the two-flowered maize spikelet. PMID:22340849
Madureira, Tânia Vieira; Pinheiro, Ivone; de Paula Freire, Rafaelle; Rocha, Eduardo; Castro, Luis Filipe; Urbatzka, Ralph
2017-06-01
Peroxisome proliferator-activated receptors (PPARs) are key regulators of many processes in vertebrates, such as carbohydrate and lipid metabolism. PPARα, a member of the PPAR nuclear receptor gene subfamily (NR1C1), is involved in fatty acid metabolism, namely in peroxisomal β-oxidation. Two gene paralogues, pparαA and pparαB, were described in several teleost species with their origin dating back to the teleost-specific genome duplication (3R). Given the additional salmonid-specific genome duplication (4R), four genes could be theoretically anticipated for this gene subfamily. In this work, we examined the pparα gene repertoire in brown trout, Salmo trutta f. fario. Data disclosed two pparα-like sequences in brown trout. Phylogenetic analyses further revealed that the isolated genes are most likely genome pparαB duplicates, pparαBa and pparαBb, while pparαA is apparently absent in salmonids. Both genes showed a ubiquitous mRNA expression across a panel of 11 different organs. In vitro exposed primary brown trout hepatocytes strongly suggest that pparα gene paralogues are differently regulated by ethinylestradiol (EE2). PparαBb mRNA expression significantly decreased with dosage, reaching significance after exposure to 50μM EE2, while pparαBa mRNA increased, significant at 1μM EE2. The present data enhances the understanding of pparα function and evolution in teleost, and reinforces the evidence of a potential crosstalk between estrogenic and pparα signaling pathways. Copyright © 2017 Elsevier Inc. All rights reserved.
Hox gene duplications correlate with posterior heteronomy in scorpions
Sharma, Prashant P.; Schwager, Evelyn E.; Extavour, Cassandra G.; Wheeler, Ward C.
2014-01-01
The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions. PMID:25122224
Urraca, Nora; Cleary, Julie; Brewer, Victoria; Pivnick, Eniko K; McVicar, Kathryn; Thibert, Ronald L; Schanen, N Carolyn; Esmer, Carmen; Lamport, Dustin; Reiter, Lawrence T
2013-01-01
Chromosomal copy number variants (CNV) are the most common genetic lesion found in autism. Many autism-associated CNVs are duplications of chromosome 15q. Although most cases of interstitial (int) dup(15) that present clinically are de novo and maternally derived or inherited, both pathogenic and unaffected paternal duplications of 15q have been identified. We performed a phenotype/genotype analysis of individuals with interstitial 15q duplications to broaden our understanding of the 15q syndrome and investigate the contribution of 15q duplication to increased autism risk. All subjects were recruited solely on the basis of interstitial duplication 15q11.2-q13 status. Comparative array genome hybridization was used to determine the duplication size and boundaries while the methylation status of the maternally methylated small nuclear ribonucleoprotein polypeptide N gene was used to determine the parent of origin of the duplication. We determined the duplication size and parental origin for 14 int dup(15) subjects: 10 maternal and 4 paternal cases. The majority of int dup(15) cases recruited were maternal in origin, most likely due to our finding that maternal duplication was coincident with autism spectrum disorder. The size of the duplication did not correlate with the severity of the phenotype as established by Autism Diagnostic Observation Scale calibrated severity score. We identified phenotypes not comprehensively described before in this cohort including mild facial dysmorphism, sleep problems and an unusual electroencephalogram variant. Our results are consistent with the hypothesis that the maternally expressed ubiquitin protein ligase E3A gene is primarily responsible for the autism phenotype in int dup(15) since all maternal cases tested presented on the autism spectrum. PMID:23495136
Simulating evolution of protein complexes through gene duplication and co-option.
Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter
2016-06-21
We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis.
Spangler, Jacob B; Feltus, Frank Alex
2013-01-01
Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis
Spangler, Jacob B.; Feltus, Frank Alex
2013-01-01
Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression. PMID:23675377
Colot, V.; Rossignol, J. L.
1995-01-01
The ascomycete Ascobolus immersus has been extensively used as a model system for the genetic study of meiotic recombination. More recently, an epigenetic process, known as methylation induced premeiotically (MIP), that acts on duplicated sequences has been discovered in A. immersus and has raised a new interest in this fungus. To try and extend these studies, we have now cloned the A. immersus spore color gene b2, a well characterized recombination hot-spot. Isolation of the whole gene was verified by physical mapping of four large b2 alterations, followed by transformation and mutant rescue of a null b2 allele. Transformation was also used to duplicate b2 and subject it to MIP. As a result, we were able for the first time to observe gene silencing as early as just after meiosis and in single cells. Furthermore, we have found evidence for a modulating effect of MIP on b2 expression, depending on the region of the gene that is duplicated and hence subjected to MIP. PMID:8601475
Duplication polymorphisms in exon 4 of κ-casein gene in yak breeds/populations.
Pingcuo, S; Gao, J; Jiang, Z R; Jin, S Y; Fu, C Y; Liu, X; Huang, L; Zheng, Y C
2015-08-28
The objective of this study was to compare 12 bp-duplication polymorphisms in exon 4 of the κ-casein gene among 3 breeds/populations of yak (Bos grunniens). Genomic DNA was extracted from yak blood or muscle samples (N = 211) and a partial sequence of exon 4 of κ-casein gene was amplified by polymerase chain reaction. A polyacrylamide gel electrophoresis assay of the products (169 bp) revealed 2 variants. These variants differed in a 12-bp duplication of the nucleotide sequence corresponding to amino acids 147-150 (Glu-Ala-Ser-Pro) or 148-151 (Ala-Ser-Pro-Glu). The genotype frequency and gene frequency of the 2 κ-casein variants differed among the 3 yak breeds/populations. The long form of the κ-casein gene was the predominant allele, and the Jiulong yak showed the highest frequency of the short form variant of the κ-casein gene. In addition, 2 nucleotide differences resulting in amino acid substitutions were also identified in yaks. These results are significant for designing a breeding strategy to improve the genetic makeup of yak herds.
The pineapple genome and the evolution of CAM photosynthesis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ming, Ray; VanBuren, Robert; Wai, Ching Man
Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C 3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues.more » CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Lastly, we found pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C 3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.« less
The pineapple genome and the evolution of CAM photosynthesis
Ming, Ray; VanBuren, Robert; Wai, Ching Man; ...
2015-11-02
Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C 3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues.more » CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Lastly, we found pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C 3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.« less
Yuan, Jing; Tao, Wenjing; Cheng, Yunying; Huang, Baofeng; Wang, Deshou
2014-08-01
The fox genes play important roles in various biological processes, including sexual development. In the present study, we isolated 65 fox genes, belonging to 18 subfamilies named A-R, from Nile tilapia through genome-wide screening. Twenty-four of them have two or three (foxm1) copies. Furthermore, 16, 25, 68, and 45 fox members were isolated from nematodes, protochordates, teleosts, and tetrapods, respectively. Phylogenetic analyses indicated fox gene family had undergone three expansions parallel to the three rounds of genome duplication during evolution. We also analyzed the clustered fox genes and found that apparent linkage duplication existed in teleosts, which further supported fish-specific genome duplication hypothesis. In addition, species- and lineage-specific duplication is another reason for fox gene family expansion. Based on the four pairs of XX and XY gonadal transcriptome data from four critical developmental stages, we analyzed the expression profile of all fox genes and identified sexually dimorphic fox genes at each stage. All fox genes were detected in gonads, with 15 of them at the background expression level (total read per kb per million reads, RPKM < 10), 29 at moderate expression level (10 < total RPKM < 100), and 21 at high expression level (total RPKM > 100). There are 27, 24, 28, and 9 sexually dimorphic fox genes at 5, 30, 90, and 180 days after hatching (dah), respectively. foxq1a, foxf1, foxr1, and foxr1 were identified as the most differentially expressed genes at each stage. foxl2 was characterized as XX-dominant gene, while foxd5, foxi3, foxn3, foxj1a, foxj3b, and foxo6b were characterized as XY-dominant genes. qPCR and in situ hybridization of foxh1 and foxj1a were performed to confirm the expression profiles and to validate the transcriptome data. Our results suggest that fox genes might play important roles in sex determination and gonadal development in teleosts.
Chen, Xue; Chen, Zhu; Zhao, Hualin; Zhao, Yang; Cheng, Beijiu; Xiang, Yan
2014-01-01
Background Homeodomain-leucine zipper (HD-Zip) proteins, a group of homeobox transcription factors, participate in various aspects of normal plant growth and developmental processes as well as environmental responses. To date, no overall analysis or expression profiling of the HD-Zip gene family in soybean (Glycine max) has been reported. Methods and Findings An investigation of the soybean genome revealed 88 putative HD-Zip genes. These genes were classified into four subfamilies, I to IV, based on phylogenetic analysis. In each subfamily, the constituent parts of gene structure and motif were relatively conserved. A total of 87 out of 88 genes were distributed unequally on 20 chromosomes with 36 segmental duplication events, indicating that segmental duplication is important for the expansion of the HD-Zip family. Analysis of the Ka/Ks ratios showed that the duplicated genes of the HD-Zip family basically underwent purifying selection with restrictive functional divergence after the duplication events. Analysis of expression profiles showed that 80 genes differentially expressed across 14 tissues, and 59 HD-Zip genes are differentially expressed under salinity and drought stress, with 20 paralogous pairs showing nearly identical expression patterns and three paralogous pairs diversifying significantly under drought stress. Quantitative real-time RT-PCR (qRT-PCR) analysis of six paralogous pairs of 12 selected soybean HD-Zip genes under both drought and salinity stress confirmed their stress-inducible expression patterns. Conclusions This study presents a thorough overview of the soybean HD-Zip gene family and provides a new perspective on the evolution of this gene family. The results indicate that HD-Zip family genes may be involved in many plant responses to stress conditions. Additionally, this study provides a solid foundation for uncovering the biological roles of HD-Zip genes in soybean growth and development. PMID:24498296
Zhang, Hailing; Cao, Yingping; Shang, Chen; Li, Jikai; Wang, Jianli; Wu, Zhenying; Ma, Lichao; Qi, Tianxiong; Fu, Chunxiang; Hu, Baozhong
2017-01-01
The GRAS gene family is a large plant-specific family of transcription factors that are involved in diverse processes during plant development. Medicago truncatula is an ideal model plant for genetic research in legumes, and specifically for studying nodulation, which is crucial for nitrogen fixation. In this study, 59 MtGRAS genes were identified and classified into eight distinct subgroups based on phylogenetic relationships. Motifs located in the C-termini were conserved across the subgroups, while motifs in the N-termini were subfamily specific. Gene duplication was the main evolutionary force for MtGRAS expansion, especially proliferation of the LISCL subgroup. Seventeen duplicated genes showed strong effects of purifying selection and diverse expression patterns, highlighting their functional importance and diversification after duplication. Thirty MtGRAS genes, including NSP1 and NSP2, were preferentially expressed in nodules, indicating possible roles in the process of nodulation. A transcriptome study, combined with gene expression analysis under different stress conditions, suggested potential functions of MtGRAS genes in various biological pathways and stress responses. Taken together, these comprehensive analyses provide basic information for understanding the potential functions of GRAS genes, and will facilitate further discovery of MtGRAS gene functions. PMID:28945786
[Genome-wide identification and analysis of WRKY transcription factors in Medicago truncatula].
Song, Hui; Nan, Zhibiao
2014-02-01
WRKY gene family plays important roles in plant by involving in transcriptional regulations during various physiologically processes such as development, metabolism and responses to biotic and abiotic stresses. WRKY genes have been identified in various plants. However, only few WRKY genes in Medicago truncatula have been identified with systematic analysis and comparison. In this study, we identified 93 WRKY genes through analyses of M. truncatula genome. These genes include 19 type-I genes, 49 type II genes and 13 type-III genes, and 12 non-regular type genes. All of these genes were characterized through analyses of gene duplication, chromosomal locations, structural diversity, conserved protein motifs and phylogenetic relations. The results showed that 11 times of gene duplication event occurred in WRKY gene family involving 24 genes. WRKY genes, containing 6 gene clusters, are unevenly distributed into chromosome 1 to 6, and there is the purifying selection pressure in WRKY group III genes.
2012-01-01
Background Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. Results To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Conclusions Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants. PMID:23102090
Pavy, Nathalie; Pelgas, Betty; Laroche, Jérôme; Rigault, Philippe; Isabel, Nathalie; Bousquet, Jean
2012-10-26
Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants.
Yang, Zhenzhen; Wafula, Eric K.; Honaas, Loren A.; Zhang, Huiting; Das, Malay; Fernandez-Aparicio, Monica; Huang, Kan; Bandaranayake, Pradeepa C.G.; Wu, Biao; Der, Joshua P.; Clarke, Christopher R.; Ralph, Paula E.; Landherr, Lena; Altman, Naomi S.; Timko, Michael P.; Yoder, John I.; Westwood, James H.; dePamphilis, Claude W.
2015-01-01
The origin of novel traits is recognized as an important process underlying many major evolutionary radiations. We studied the genetic basis for the evolution of haustoria, the novel feeding organs of parasitic flowering plants, using comparative transcriptome sequencing in three species of Orobanchaceae. Around 180 genes are upregulated during haustorial development following host attachment in at least two species, and these are enriched in proteases, cell wall modifying enzymes, and extracellular secretion proteins. Additionally, about 100 shared genes are upregulated in response to haustorium inducing factors prior to host attachment. Collectively, we refer to these newly identified genes as putative “parasitism genes.” Most of these parasitism genes are derived from gene duplications in a common ancestor of Orobanchaceae and Mimulus guttatus, a related nonparasitic plant. Additionally, the signature of relaxed purifying selection and/or adaptive evolution at specific sites was detected in many haustorial genes, and may play an important role in parasite evolution. Comparative analysis of gene expression patterns in parasitic and nonparasitic angiosperms suggests that parasitism genes are derived primarily from root and floral tissues, but with some genes co-opted from other tissues. Gene duplication, often taking place in a nonparasitic ancestor of Orobanchaceae, followed by regulatory neofunctionalization, was an important process in the origin of parasitic haustoria. PMID:25534030
Yang, Ya; Moore, Michael J.; Brockington, Samuel F.; Soltis, Douglas E.; Wong, Gane Ka-Shu; Carpenter, Eric J.; Zhang, Yong; Chen, Li; Yan, Zhixiang; Xie, Yinlong; Sage, Rowan F.; Covshoff, Sarah; Hibberd, Julian M.; Nelson, Matthew N.; Smith, Stephen A.
2015-01-01
Many phylogenomic studies based on transcriptomes have been limited to “single-copy” genes due to methodological challenges in homology and orthology inferences. Only a relatively small number of studies have explored analyses beyond reconstructing species relationships. We sampled 69 transcriptomes in the hyperdiverse plant clade Caryophyllales and 27 outgroups from annotated genomes across eudicots. Using a combined similarity- and phylogenetic tree-based approach, we recovered 10,960 homolog groups, where each was represented by at least eight ingroup taxa. By decomposing these homolog trees, and taking gene duplications into account, we obtained 17,273 ortholog groups, where each was represented by at least ten ingroup taxa. We reconstructed the species phylogeny using a 1,122-gene data set with a gene occupancy of 92.1%. From the homolog trees, we found that both synonymous and nonsynonymous substitution rates in herbaceous lineages are up to three times as fast as in their woody relatives. This is the first time such a pattern has been shown across thousands of nuclear genes with dense taxon sampling. We also pinpointed regions of the Caryophyllales tree that were characterized by relatively high frequencies of gene duplication, including three previously unrecognized whole-genome duplications. By further combining information from homolog tree topology and synonymous distance between paralog pairs, phylogenetic locations for 13 putative genome duplication events were identified. Genes that experienced the greatest gene family expansion were concentrated among those involved in signal transduction and oxidoreduction, including a cytochrome P450 gene that encodes a key enzyme in the betalain synthesis pathway. Our approach demonstrates a new approach for functional phylogenomic analysis in nonmodel species that is based on homolog groups in addition to inferred ortholog groups. PMID:25837578
Wei, Menghan; Wang, Sanhong; Dong, Hui; Cai, Binhua; Tao, Jianmin
2016-01-01
As one of the Ca2+ sensors, calcium-dependent protein kinase (CPK) plays vital roles in immune and stress signaling, growth and development, and hormone responses, etc. Recently, the whole genome of apple (Malus × domestica), pear (Pyrus communis), peach (Prunus persica), plum (Prunus mume) and strawberry (Fragaria vesca) in Rosaceae family has been fully sequenced. However, little is known about the CPK gene family in these Rosaceae species. In this study, 123 CPK genes were identified from five Rosaceae species, including 37 apple CPKs, 37 pear CPKs, 17 peach CPKs, 16 strawberry CPKs, and 16 plum CPKs. Based on the phylogenetic tree topology and structural characteristics, we divided the CPK gene family into 4 distinct subfamilies: Group I, II, III, and IV. Whole-genome duplication (WGD) or segmental duplication played vital roles in the expansion of the CPK in these Rosaceae species. Most of segmental duplication pairs in peach and plum may have arisen from the γ triplication (~140 million years ago [MYA]), while in apple genome, many duplicated genes may have been derived from a recent WGD (30~45 MYA). Purifying selection also played a critical role in the function evolution of CPK family genes. Expression of apple CPK genes in response to apple pathotype of Alternaria alternata was verified by analysis of quantitative real-time RT-PCR (qPCR). Expression data demonstrated that CPK genes in apple might have evolved independently in different biological contexts. The analysis of evolution history and expression profile laid a foundation for further examining the function and complexity of the CPK gene family in Rosaceae.
Wei, Menghan; Wang, Sanhong; Dong, Hui; Cai, Binhua; Tao, Jianmin
2016-01-01
As one of the Ca2+ sensors, calcium-dependent protein kinase (CPK) plays vital roles in immune and stress signaling, growth and development, and hormone responses, etc. Recently, the whole genome of apple (Malus × domestica), pear (Pyrus communis), peach (Prunus persica), plum (Prunus mume) and strawberry (Fragaria vesca) in Rosaceae family has been fully sequenced. However, little is known about the CPK gene family in these Rosaceae species. In this study, 123 CPK genes were identified from five Rosaceae species, including 37 apple CPKs, 37 pear CPKs, 17 peach CPKs, 16 strawberry CPKs, and 16 plum CPKs. Based on the phylogenetic tree topology and structural characteristics, we divided the CPK gene family into 4 distinct subfamilies: Group I, II, III, and IV. Whole-genome duplication (WGD) or segmental duplication played vital roles in the expansion of the CPK in these Rosaceae species. Most of segmental duplication pairs in peach and plum may have arisen from the γ triplication (~140 million years ago [MYA]), while in apple genome, many duplicated genes may have been derived from a recent WGD (30~45 MYA). Purifying selection also played a critical role in the function evolution of CPK family genes. Expression of apple CPK genes in response to apple pathotype of Alternaria alternata was verified by analysis of quantitative real-time RT-PCR (qPCR). Expression data demonstrated that CPK genes in apple might have evolved independently in different biological contexts. The analysis of evolution history and expression profile laid a foundation for further examining the function and complexity of the CPK gene family in Rosaceae. PMID:27186637
Developmental expression of high molecular weight tropomyosin isoforms in Mesocestoides corti.
Koziol, Uriel; Costábile, Alicia; Domínguez, María Fernanda; Iriarte, Andrés; Alvite, Gabriela; Kun, Alejandra; Castillo, Estela
2011-02-01
Tropomyosins are a family of actin-binding proteins with diverse roles in actin filament function. One of the best characterized roles is the regulation of muscle contraction. Tropomyosin isoforms can be generated from different genes, and from alternative promoters and alternative splicing from the same gene. In this work, we have isolated sequences for tropomyosin isoforms from the cestode Mesocestoides corti, and searched for tropomyosin genes and isoforms in other flatworms. Two genes are conserved in the cestodes M. corti and Echinococcus multilocularis, and in the trematode Schistosoma mansoni. Both genes have the same structure, and each gene gives rise to at least two different isoforms, a high molecular weight (HMW) and a low molecular weight (LMW) one. Because most exons are duplicated and spliced in a mutually exclusive fashion, isoforms from one gene only share one exon and are highly divergent. The gene duplication preceded the divergence of neodermatans and the planarian Schmidtea mediterranea. Further duplications occurred in Schmidtea, coupled to the selective loss of duplicated exons, resulting in genes that only code for HMW or LMW isoforms. A polyclonal antibody raised against a HMW tropomyosin from Echinococcus granulosus was demonstrated to specifically recognize HMW tropomyosin isoforms of M. corti, and used to study their expression during segmentation. HMW tropomyosins are expressed in muscle layers, with very low or absent levels in other tissues. No expression of HMW tropomyosins is present in early or late genital primordia, and expression only begins once muscle fibers develop in the genital ducts. Therefore, HMW tropomyosins are markers for the development of muscles during the final differentiation of genital primordia. Copyright © 2010 Elsevier B.V. All rights reserved.
Heterogeneous conservation of Dlx paralog co-expression in jawed vertebrates.
Debiais-Thibaud, Mélanie; Metcalfe, Cushla J; Pollack, Jacob; Germon, Isabelle; Ekker, Marc; Depew, Michael; Laurenti, Patrick; Borday-Birraux, Véronique; Casane, Didier
2013-01-01
The Dlx gene family encodes transcription factors involved in the development of a wide variety of morphological innovations that first evolved at the origins of vertebrates or of the jawed vertebrates. This gene family expanded with the two rounds of genome duplications that occurred before jawed vertebrates diversified. It includes at least three bigene pairs sharing conserved regulatory sequences in tetrapods and teleost fish, but has been only partially characterized in chondrichthyans, the third major group of jawed vertebrates. Here we take advantage of developmental and molecular tools applied to the shark Scyliorhinus canicula to fill in the gap and provide an overview of the evolution of the Dlx family in the jawed vertebrates. These results are analyzed in the theoretical framework of the DDC (Duplication-Degeneration-Complementation) model. The genomic organisation of the catshark Dlx genes is similar to that previously described for tetrapods. Conserved non-coding elements identified in bony fish were also identified in catshark Dlx clusters and showed regulatory activity in transgenic zebrafish. Gene expression patterns in the catshark showed that there are some expression sites with high conservation of the expressed paralog(s) and other expression sites with events of paralog sub-functionalization during jawed vertebrate diversification, resulting in a wide variety of evolutionary scenarios within this gene family. Dlx gene expression patterns in the catshark show that there has been little neo-functionalization in Dlx genes over gnathostome evolution. In most cases, one tandem duplication and two rounds of vertebrate genome duplication have led to at least six Dlx coding sequences with redundant expression patterns followed by some instances of paralog sub-functionalization. Regulatory constraints such as shared enhancers, and functional constraints including gene pleiotropy, may have contributed to the evolutionary inertia leading to high redundancy between gene expression patterns.
Evolution of the vertebrate insulin receptor substrate (Irs) gene family.
Al-Salam, Ahmad; Irwin, David M
2017-06-23
Insulin receptor substrate (Irs) proteins are essential for insulin signaling as they allow downstream effectors to dock with, and be activated by, the insulin receptor. A family of four Irs proteins have been identified in mice, however the gene for one of these, IRS3, has been pseudogenized in humans. While it is known that the Irs gene family originated in vertebrates, it is not known when it originated and which members are most closely related to each other. A better understanding of the evolution of Irs genes and proteins should provide insight into the regulation of metabolism by insulin. Multiple genes for Irs proteins were identified in a wide variety of vertebrate species. Phylogenetic and genomic neighborhood analyses indicate that this gene family originated very early in vertebrae evolution. Most Irs genes were duplicated and retained in fish after the fish-specific genome duplication. Irs genes have been lost of various lineages, including Irs3 in primates and birds and Irs1 in most fish. Irs3 and Irs4 experienced an episode of more rapid protein sequence evolution on the ancestral mammalian lineage. Comparisons of the conservation of the proteins sequences among Irs paralogs show that domains involved in binding to the plasma membrane and insulin receptors are most strongly conserved, while divergence has occurred in sequences involved in interacting with downstream effector proteins. The Irs gene family originated very early in vertebrate evolution, likely through genome duplications, and in parallel with duplications of other components of the insulin signaling pathway, including insulin and the insulin receptor. While the N-terminal sequences of these proteins are conserved among the paralogs, changes in the C-terminal sequences likely allowed changes in biological function.
NASA Astrophysics Data System (ADS)
Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.
2016-04-01
The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.
Murat, Florent; Zhang, Rongzhi; Guizard, Sébastien; Flores, Raphael; Armero, Alix; Pont, Caroline; Steinbach, Delphine; Quesneville, Hadi; Cooke, Richard; Salse, Jerome
2013-01-01
Modern plant genomes are diploidized paleopolyploids. We revisited grass genome paleohistory in response to the diploidization process through a detailed investigation of the evolutionary fate of duplicated blocks. Ancestrally duplicated genes can be conserved, deleted, and shuffled, defining dominant (bias toward duplicate retention) and sensitive (bias toward duplicate erosion) chromosomal fragments. We propose a new grass genome paleohistory deriving from an ancestral karyotype structured in seven protochromosomes containing 16,464 protogenes and following evolutionary rules where 1) ancestral shared polyploidizations shaped conserved dominant (D) and sensitive (S) subgenomes, 2) subgenome dominance is revealed by both gene deletion and shuffling from the S blocks, 3) duplicate deletion/movement may have been mediated by single-/double-stranded illegitimate recombination mechanisms, 4) modern genomes arose through centromeric fusion of protochromosomes, leading to functional monocentric neochromosomes, 5) the fusion of two dominant blocks leads to supradominant neochromosomes (D + D = D) with higher ancestral gene retention compared with D + S = D (i.e., fusion of blocks with opposite sensitivity) or even S + S = S (i.e., fusion of two sensitive ancestral blocks). A new user-friendly online tool named “PlantSyntenyViewer,” available at http://urgi.versailles.inra.fr/synteny-cereal, presents the refined comparative genomics data. PMID:24317974
Legume genome evolution viewed through the Medicago truncatula and Lotus japonicus genomes
Cannon, Steven B.; Sterck, Lieven; Rombauts, Stephane; Sato, Shusei; Cheung, Foo; Gouzy, Jérôme; Wang, Xiaohong; Mudge, Joann; Vasdewani, Jayprakash; Schiex, Thomas; Spannagl, Manuel; Monaghan, Erin; Nicholson, Christine; Humphray, Sean J.; Schoof, Heiko; Mayer, Klaus F. X.; Rogers, Jane; Quétier, Francis; Oldroyd, Giles E.; Debellé, Frédéric; Cook, Douglas R.; Retzel, Ernest F.; Roe, Bruce A.; Town, Christopher D.; Tabata, Satoshi; Van de Peer, Yves; Young, Nevin D.
2006-01-01
Genome sequencing of the model legumes, Medicago truncatula and Lotus japonicus, provides an opportunity for large-scale sequence-based comparison of two genomes in the same plant family. Here we report synteny comparisons between these species, including details about chromosome relationships, large-scale synteny blocks, microsynteny within blocks, and genome regions lacking clear correspondence. The Lotus and Medicago genomes share a minimum of 10 large-scale synteny blocks, each with substantial collinearity and frequently extending the length of whole chromosome arms. The proportion of genes syntenic and collinear within each synteny block is relatively homogeneous. Medicago–Lotus comparisons also indicate similar and largely homogeneous gene densities, although gene-containing regions in Mt occupy 20–30% more space than Lj counterparts, primarily because of larger numbers of Mt retrotransposons. Because the interpretation of genome comparisons is complicated by large-scale genome duplications, we describe synteny, synonymous substitutions and phylogenetic analyses to identify and date a probable whole-genome duplication event. There is no direct evidence for any recent large-scale genome duplication in either Medicago or Lotus but instead a duplication predating speciation. Phylogenetic comparisons place this duplication within the Rosid I clade, clearly after the split between legumes and Salicaceae (poplar). PMID:17003129
Kitsiou-Tzeli, Sophia; Tzetis, Maria; Sofocleous, Christalena; Vrettou, Christina; Xaidara, Athena; Giannikou, Krinio; Pampanos, Andreas; Mavrou, Ariadne; Kanavakis, E
2010-08-01
The 15q11-q13 PWS/AS critical region involves genes that are characterized by genomic imprinting. Multiple repeat elements within the region mediate rearrangements, including interstitial duplications, interstitial triplications, and supernumerary isodicentric marker chromosomes, as well as the deletions that cause Prader-Willi syndrome (PWS) and Angelman syndrome (AS). Recently, duplications of maternal origin concerning the same critical region have been implicated in autism spectrum disorders (ASD). We present a 6-month-old girl carrying a de novo duplication of maternal origin of the 15q11.2-q14 PWS/AS region (17.73 Mb in size) [46,XX,dup(15)(q11.2-q14)] detected with a high-resolution microarray-based comparative genomic hybridization (array-CGH). The patient is characterized by severe hypotonia, obesity, microstomia, long eyelashes, hirsutism, microretrognathia, short nose, severe psychomotor retardation, and multiple episodes of drug-resistant epileptic seizures, while her brain magnetic resonance imaging (MRI) documented partial corpus callosum dysplasia. In our patient the duplicated region is quite large extending beyond the Prader-Willi-Angelman critical region (PWACR), containing a number of genes that have been shown to be involved in ASD, exhibiting a severe phenotype, beyond the typical PWS/AS clinical manifestations. Reporting of similar well-characterized clinical cases with clearly delineated breakpoints of the duplicated region will clarify the contribution of specific genes to the phenotype.
Orsomucoid: A new variant and additional duplicated ORM1 gene in Qatari population
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sebetan, I.M.; Alali, K.A.; Alzaman, A.
1994-09-01
A new genetically determined ORM2 variant and additional duplicated ORM1 gene were observed in Qatari population using isoelectric focusing in ultra thin layer polyacrylamide gels. The studied population samples indicate occurence of six ORM1 alleles and three ORM2 ones. A simple reliable method for separation of orsomucoid variations with comparison of different reported methods will be presented.
Kaliyappan, Raja; Viswanathan, Sriram; Suthanthiram, Backiyarani; Subbaraya, Uma; Marimuthu Somasundram, Saraswathi; Muthu, Mayilvaganan
2016-01-01
The WRKY family of transcription factors orchestrate the reprogrammed expression of the complex network of defense genes at various biotic and abiotic stresses. Within the last 96 million years, three rounds of Musa polyploidization events had occurred from selective pressure causing duplication of MusaWRKYs with new activities. Here, we identified a total of 153 WRKY transcription factors available from the DH Pahang genome. Based on their phylogenetic relationship, the MusaWRKYs available with complete gene sequence were classified into the seven common WRKY sub-groups. Synteny analyses data revealed paralogous relationships, with 17 MusaWRKY gene pairs originating from the duplication events that had occurred within the Musa lineage. We also found 15 other MusaWRKY gene pairs originating from much older duplication events that had occurred along Arecales and Poales lineage of commelinids. Based on the synonymous and nonsynonymous substitution rates, the fate of duplicated MusaWRKY genes was predicted to have undergone sub-functionalization in which the duplicated gene copies retain a subset of the ancestral gene function. Also, to understand the regulatory roles of MusaWRKY during a biotic stress, Illumina sequencing was performed on resistant and susceptible cultivars during the infection of root lesion nematode, Pratylenchus coffeae. The differential WRKY gene expression analysis in nematode resistant and susceptible cultivars during challenged and unchallenged conditions had distinguished: 1) MusaWRKYs participating in general banana defense mechanism against P.coffeae common to both susceptible and resistant cultivars, 2) MusaWRKYs that may aid in the pathogen survival as suppressors of plant triggered immunity, 3) MusaWRKYs that may aid in the host defense as activators of plant triggered immunity and 4) cultivar specific MusaWRKY regulation. Mainly, MusaWRKY52, -69 and -92 are found to be P.coffeae specific and can act as activators or repressors in a defense pathway. Overall, this preliminary study in Musa provides the basis for understanding the evolution and regulatory mechanism of MusaWRKY during nematode stress.
Suthanthiram, Backiyarani; Subbaraya, Uma; Marimuthu Somasundram, Saraswathi; Muthu, Mayilvaganan
2016-01-01
The WRKY family of transcription factors orchestrate the reprogrammed expression of the complex network of defense genes at various biotic and abiotic stresses. Within the last 96 million years, three rounds of Musa polyploidization events had occurred from selective pressure causing duplication of MusaWRKYs with new activities. Here, we identified a total of 153 WRKY transcription factors available from the DH Pahang genome. Based on their phylogenetic relationship, the MusaWRKYs available with complete gene sequence were classified into the seven common WRKY sub-groups. Synteny analyses data revealed paralogous relationships, with 17 MusaWRKY gene pairs originating from the duplication events that had occurred within the Musa lineage. We also found 15 other MusaWRKY gene pairs originating from much older duplication events that had occurred along Arecales and Poales lineage of commelinids. Based on the synonymous and nonsynonymous substitution rates, the fate of duplicated MusaWRKY genes was predicted to have undergone sub-functionalization in which the duplicated gene copies retain a subset of the ancestral gene function. Also, to understand the regulatory roles of MusaWRKY during a biotic stress, Illumina sequencing was performed on resistant and susceptible cultivars during the infection of root lesion nematode, Pratylenchus coffeae. The differential WRKY gene expression analysis in nematode resistant and susceptible cultivars during challenged and unchallenged conditions had distinguished: 1) MusaWRKYs participating in general banana defense mechanism against P.coffeae common to both susceptible and resistant cultivars, 2) MusaWRKYs that may aid in the pathogen survival as suppressors of plant triggered immunity, 3) MusaWRKYs that may aid in the host defense as activators of plant triggered immunity and 4) cultivar specific MusaWRKY regulation. Mainly, MusaWRKY52, -69 and -92 are found to be P.coffeae specific and can act as activators or repressors in a defense pathway. Overall, this preliminary study in Musa provides the basis for understanding the evolution and regulatory mechanism of MusaWRKY during nematode stress. PMID:27603787
Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.
2014-01-01
Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of large clade Ditrysia; Shx are not found in a caddisfly and a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822
Zhang, Hong-Li; Ye, Fei
2017-01-01
Praying mantises are a diverse group of predatory insects. Although some Mantodea mitogenomes have been reported, a comprehensive comparative and evolutionary genomic study is lacking for this group. In the present study, four new mitogenomes were sequenced, annotated, and compared to the previously published mitogenomes of other Mantodea species. Most Mantodea mitogenomes share a typical set of mitochondrial genes and a putative control region (CR). Additionally, and most intriguingly, another large non-coding region (LNC) was detected between trnM and ND2 in all six Paramantini mitogenomes examined. The main section in this common region of Paramantini may have initially originated from the corresponding control region for each species, whereas sequence differences between the LNCs and CRs and phylogenetic analyses indicate that LNC and CR are largely independently evolving. Namely, the LNC (the duplicated CR) may have subsequently degenerated during evolution. Furthermore, evidence suggests that special intergenic gaps have been introduced in some species through gene rearrangement and duplication. These gaps are actually the original abutting sequences of migrated or duplicated genes. Some gaps (G5 and G6) are homologous to the 5' and 3' surrounding regions of the duplicated gene in the original gene order, and another specific gap (G7) has tandem repeats. We analysed the phylogenetic relationships of fifteen Mantodea species using 37 concatenated mitochondrial genes and detected several synapomorphies unique to species in some clades. PMID:28367101
Recurrent duplications of the annexin A1 gene (ANXA1) in autism spectrum disorders
2014-01-01
Background Validating the potential pathogenicity of copy number variants (CNVs) identified in genome-wide studies of autism spectrum disorders (ASD) requires detailed assessment of case/control frequencies, inheritance patterns, clinical correlations, and functional impact. Here, we characterize a small recurrent duplication in the annexin A1 (ANXA1) gene, identified by the Autism Genome Project (AGP) study. Methods From the AGP CNV genomic screen in 2,147 ASD individuals, we selected for characterization an ANXA1 gene duplication that was absent in 4,964 population-based controls. We further screened the duplication in a follow-up sample including 1,496 patients and 410 controls, and evaluated clinical correlations and family segregation. Sequencing of exonic/downstream ANXA1 regions was performed in 490 ASD patients for identification of additional variants. Results The ANXA1 duplication, overlapping the last four exons and 3’UTR region, had an overall prevalence of 11/3,643 (0.30%) in unrelated ASD patients but was not identified in 5,374 controls. Duplication carriers presented no distinctive clinical phenotype. Family analysis showed neuropsychiatric deficits and ASD traits in multiple relatives carrying the duplication, suggestive of a complex genetic inheritance. Sequencing of exonic regions and the 3’UTR identified 11 novel changes, but no obvious variants with clinical significance. Conclusions We provide multilevel evidence for a role of ANXA1 in ASD etiology. Given its important role as mediator of glucocorticoid function in a wide variety of brain processes, including neuroprotection, apoptosis, and control of the neuroendocrine system, the results add ANXA1 to the growing list of rare candidate genetic etiological factors for ASD. PMID:24720851
DOE Office of Scientific and Technical Information (OSTI.GOV)
Venken, Koen J. T.; Popodi, Ellen; Holtzman, Stacy L.
We describe a molecularly defined duplication kit for the X chromosome of Drosophila melanogaster. A set of 408 overlapping P[acman] BAC clones was used to create small duplications (average length 88 kb) covering the 22-Mb sequenced portion of the chromosome. The BAC clones were inserted into an attP docking site on chromosome 3L using C31 integrase, allowing direct comparison of different transgenes. The insertions complement 92% of the essential and viable mutations and deletions tested, demonstrating that almost all Drosophila genes are compact and that the current annotations of the genome are reasonably accurate. Moreover, almost all genes are toleratedmore » at twice the normal dosage. Finally, we more precisely mapped two regions at which duplications cause diplo-lethality in males. This collection comprises the first molecularly defined duplication set to cover a whole chromosome in a multicellular organism. The work presented removes a long-standing barrier to genetic analysis of the Drosophila X chromosome, will greatly facilitate functional assays of X-linked genes in vivo, and provides a model for functional analyses of entire chromosomes in other species.« less
Cardarelli, Laura; Sparago, Angela; De Crescenzo, Agostina; Nalesso, Elisa; Zavan, Barbara; Cubellis, Maria Vittoria; Selicorni, Angelo; Cavicchioli, Paola; Pozzan, Giovanni Battista; Petrella, Marilena; Riccio, Andrea
2010-01-01
Genomic imprinting is an epigenetic phenomenon resulting in differential expression of maternal and paternal alleles of a subset of genes. In the mouse, mutation of imprinted genes often results in contrasting phenotypes, depending on parental origin. The overgrowth-associated Beckwith-Wiedemann syndrome (BWS) and the growth restriction-associated Silver-Russell syndrome (SRS) have been linked with a variety of epigenetic and genetic defects affecting a cluster of imprinted genes at chromosome 11p15.5. Paternally derived and maternally derived 11p15.5 duplications represent infrequent findings in BWS and SRS, respectively. Here, we report a case in which a 6.5 Mb duplication of 11p15.4-pter resulted in SRS and BWS phenotypes in a child and her mother, respectively. Molecular analyses demonstrated that the duplication involved the maternal chromosome 11p15 in the child and the paternal chromosome 11p15 in the mother. This observation provides a direct demonstration that SRS and BWS represent specular images, both at the clinical and molecular levels.
The detection of large deletions or duplications in genomic DNA.
Armour, J A L; Barton, D E; Cockburn, D J; Taylor, G R
2002-11-01
While methods for the detection of point mutations and small insertions or deletions in genomic DNA are well established, the detection of larger (>100 bp) genomic duplications or deletions can be more difficult. Most mutation scanning methods use PCR as a first step, but the subsequent analyses are usually qualitative rather than quantitative. Gene dosage methods based on PCR need to be quantitative (i.e., they should report molar quantities of starting material) or semi-quantitative (i.e., they should report gene dosage relative to an internal standard). Without some sort of quantitation, heterozygous deletions and duplications may be overlooked and therefore be under-ascertained. Gene dosage methods provide the additional benefit of reporting allele drop-out in the PCR. This could impact on SNP surveys, where large-scale genotyping may miss null alleles. Here we review recent developments in techniques for the detection of this type of mutation and compare their relative strengths and weaknesses. We emphasize that comprehensive mutation analysis should include scanning for large insertions and deletions and duplications. Copyright 2002 Wiley-Liss, Inc.
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse
Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.
2009-01-01
The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
Gibb, Gillian C; Kardailsky, Olga; Kimball, Rebecca T; Braun, Edward L; Penny, David
2007-01-01
We improve the taxon sampling for avian phylogeny by analyzing 7 new mitochondrial genomes (a toucan, woodpecker, osprey, forest falcon, American kestrel, heron, and a pelican). This improves inference of the avian tree, and it supports 3 major conclusions. The first is that some birds (including a parrot, a toucan, and an osprey) exhibit a complete duplication of the control region (CR) meaning that there are at least 4 distinct gene orders within birds. However, it appears that there are regions of continued gene conversion between the duplicate CRs, resulting in duplications that can be stable for long evolutionary periods. Because of this stable duplicated state, gene order can eventually either revert to the original order or change to the new gene order. The existence of this stable duplicate state explains how an apparently unlikely event (finding the same novel gene order) can arise multiple times. Although rare genomic changes have theoretical advantages for tree reconstruction, they can be compromised if these apparently rare events have a stable intermediate state. Secondly, the toucan and woodpecker improve the resolution of the 6-way split within Neoaves that has been called an "explosive radiation." An explosive radiation implies that normal microevolutionary events are insufficient to explain the observed macroevolution. By showing the avian tree is, in principle, resolvable, we demonstrate that the radiation of birds is amenable to standard evolutionary analysis. Thirdly, and as expected from theory, additional taxa breaking up long branches stabilize the position of some problematic taxa (like the falcon). In addition, we report that within the birds of prey and allies, we did not find evidence pairing New World vultures with storks or accipitrids (hawks, eagles, and osprey) with Falconids.
Lim, Byung Chan; Lee, Seungbok; Shin, Jong-Yeon; Kim, Jong-Il; Hwang, Hee; Kim, Ki Joong; Hwang, Yong Seung; Seo, Jeong-Sun; Chae, Jong Hee
2011-11-01
Duchenne muscular dystrophy or Becker muscular dystrophy might be a suitable candidate disease for application of next-generation sequencing in the genetic diagnosis because the complex mutational spectrum and the large size of the dystrophin gene require two or more analytical methods and have a high cost. The authors tested whether large deletions/duplications or small mutations, such as point mutations or short insertions/deletions of the dystrophin gene, could be predicted accurately in a single platform using next-generation sequencing technology. A custom solution-based target enrichment kit was designed to capture whole genomic regions of the dystrophin gene and other muscular-dystrophy-related genes. A multiplexing strategy, wherein four differently bar-coded samples were captured and sequenced together in a single lane of the Illumina Genome Analyser, was applied. The study subjects were 25 16 with deficient dystrophin expression without a large deletion/duplication and 9 with a known large deletion/duplication. Nearly 100% of the exonic region of the dystrophin gene was covered by at least eight reads with a mean read depth of 107. Pathogenic small mutations were identified in 15 of the 16 patients without a large deletion/duplication. Using these 16 patients as the standard, the authors' method accurately predicted the deleted or duplicated exons in the 9 patients with known mutations. Inclusion of non-coding regions and paired-end sequence analysis enabled accurate identification by increasing the read depth and providing information about the breakpoint junction. The current method has an advantage for the genetic diagnosis of Duchenne muscular dystrophy and Becker muscular dystrophy wherein a comprehensive mutational search may be feasible using a single platform.
Qin, Xiaoqiong; Coku, Ardian; Inoue, Kentaro; Tian, Li
2011-10-01
Carotenoids perform many critical functions in plants, animals, and humans. It is therefore important to understand carotenoid biosynthesis and its regulation in plants. Phytoene synthase (PSY) catalyzes the first committed and rate-limiting step in carotenoid biosynthesis. While PSY is present as a single copy gene in Arabidopsis, duplicated PSY genes have been identified in many economically important monocot and dicot crops. CmPSY1 was previously identified from melon (Cucumis melo L.), but was not functionally characterized. We isolated a second PSY gene, CmPSY2, from melon in this work. CmPSY2 possesses a unique intron/exon structure that has not been observed in other plant PSYs. Both CmPSY1 and CmPSY2 are functional in vitro, but exhibit distinct expression patterns in different melon tissues and during fruit development, suggesting differential regulation of the duplicated melon PSY genes. In vitro chloroplast import assays verified the plastidic localization of CmPSY1 and CmPSY2 despite the lack of an obvious plastid target peptide in CmPSY2. Promoter motif analysis of the duplicated melon and tomato PSY genes and the Arabidopsis PSY revealed distinctive cis-regulatory structures of melon PSYs and identified gibberellin-responsive motifs in all PSYs except for SlPSY1, which has not been reported previously. Overall, these data provide new insights into the evolutionary history of plant PSY genes and the regulation of PSY expression by developmental and environmental signals that may involve different regulatory networks.
2012-01-01
Background The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster. PMID:22577841
Female Behaviour Drives Expression and Evolution of Gustatory Receptors in Butterflies
Briscoe, Adriana D.; Macias-Muñoz, Aide; Kozak, Krzysztof M.; Walters, James R.; Yuan, Furong; Jamie, Gabriel A.; Martin, Simon H.; Dasmahapatra, Kanchon K.; Ferguson, Laura C.; Mallet, James; Jacquin-Joly, Emmanuelle; Jiggins, Chris D.
2013-01-01
Secondary plant compounds are strong deterrents of insect oviposition and feeding, but may also be attractants for specialist herbivores. These insect-plant interactions are mediated by insect gustatory receptors (Grs) and olfactory receptors (Ors). An analysis of the reference genome of the butterfly Heliconius melpomene, which feeds on passion-flower vines (Passiflora spp.), together with whole-genome sequencing within the species and across the Heliconius phylogeny has permitted an unprecedented opportunity to study the patterns of gene duplication and copy-number variation (CNV) among these key sensory genes. We report in silico gene predictions of 73 Gr genes in the H. melpomene reference genome, including putative CO2, sugar, sugar alcohol, fructose, and bitter receptors. The majority of these Grs are the result of gene duplications since Heliconius shared a common ancestor with the monarch butterfly or the silkmoth. Among Grs but not Ors, CNVs are more common within species in those gene lineages that have also duplicated over this evolutionary time-scale, suggesting ongoing rapid gene family evolution. Deep sequencing (∼1 billion reads) of transcriptomes from proboscis and labial palps, antennae, and legs of adult H. melpomene males and females indicates that 67 of the predicted 73 Gr genes and 67 of the 70 predicted Or genes are expressed in these three tissues. Intriguingly, we find that one-third of all Grs show female-biased gene expression (n = 26) and nearly all of these (n = 21) are Heliconius-specific Grs. In fact, a significant excess of Grs that are expressed in female legs but not male legs are the result of recent gene duplication. This difference in Gr gene expression diversity between the sexes is accompanied by a striking sexual dimorphism in the abundance of gustatory sensilla on the forelegs of H. melpomene, suggesting that female oviposition behaviour drives the evolution of new gustatory receptors in butterfly genomes. PMID:23950722
Papain-like cysteine proteases in Carica papaya: lineage-specific gene duplication and expansion.
Liu, Juan; Sharma, Anupma; Niewiara, Marie Jamille; Singh, Ratnesh; Ming, Ray; Yu, Qingyi
2018-01-06
Papain-like cysteine proteases (PLCPs), a large group of cysteine proteases structurally related to papain, play important roles in plant development, senescence, and defense responses. Papain, the first cysteine protease whose structure was determined by X-ray crystallography, plays a crucial role in protecting papaya from herbivorous insects. Except the four major PLCPs purified and characterized in papaya latex, the rest of the PLCPs in papaya genome are largely unknown. We identified 33 PLCP genes in papaya genome. Phylogenetic analysis clearly separated plant PLCP genes into nine subfamilies. PLCP genes are not equally distributed among the nine subfamilies and the number of PLCPs in each subfamily does not increase or decrease proportionally among the seven selected plant species. Papaya showed clear lineage-specific gene expansion in the subfamily III. Interestingly, all four major PLCPs purified from papaya latex, including papain, chymopapain, glycyl endopeptidase and caricain, were grouped into the lineage-specific expansion branch in the subfamily III. Mapping PLCP genes on chromosomes of five plant species revealed that lineage-specific expansions of PLCP genes were mostly derived from tandem duplications. We estimated divergence time of papaya PLCP genes of subfamily III. The major duplication events leading to lineage-specific expansion of papaya PLCP genes in subfamily III were estimated at 48 MYA, 34 MYA, and 16 MYA. The gene expression patterns of the papaya PLCP genes in different tissues were assessed by transcriptome sequencing and qRT-PCR. Most of the papaya PLCP genes of subfamily III expressed at high levels in leaf and green fruit tissues. Tandem duplications played the dominant role in affecting copy number of PLCPs in plants. Significant variations in size of the PLCP subfamilies among species may reflect genetic adaptation of plant species to different environments. The lineage-specific expansion of papaya PLCPs of subfamily III might have been promoted by the continuous reciprocal selective effects of herbivore attack and plant defense.
Patel, Vidushi S; Cooper, Steven J B; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer A M
2008-07-25
Vertebrate alpha (alpha)- and beta (beta)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the alpha- and beta-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil beta-globin gene (omega) in the marsupial alpha-cluster, however, suggested that duplication of the alpha-beta cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous alpha- and beta-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. The platypus alpha-globin cluster (chromosome 21) contains embryonic and adult alpha- globin genes, a beta-like omega-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-zeta-zeta'-alphaD-alpha3-alpha2-alpha1-omega-GBY-3'. The platypus beta-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-epsilon-beta-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate alpha-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal beta-globin clusters are embedded in olfactory genes. Thus, the mammalian alpha- and beta-globin clusters are orthologous to the bird alpha- and beta-globin clusters respectively. We propose that alpha- and beta-globin clusters evolved from an ancient MPG-C16orf35-alpha-beta-GBY-LUC7L arrangement 410 million years ago. A copy of the original beta (represented by omega in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of beta-globin genes with different expression profiles in different lineages.
Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René
2012-05-11
The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.
Książkiewicz, Michał; Rychel, Sandra; Nelson, Matthew N; Wyrwa, Katarzyna; Naganowska, Barbara; Wolko, Bogdan
2016-10-21
The Arabidopsis FLOWERING LOCUS T (FT) gene, a member of the phosphatidylethanolamine binding protein (PEBP) family, is a major controller of flowering in response to photoperiod, vernalization and light quality. In legumes, FT evolved into three, functionally diversified clades, FTa, FTb and FTc. A milestone achievement in narrow-leafed lupin (Lupinus angustifolius L.) domestication was the loss of vernalization responsiveness at the Ku locus. Recently, one of two existing L. angustifolius homologs of FTc, LanFTc1, was revealed to be the gene underlying Ku. It is the first recorded involvement of an FTc homologue in vernalization. The evolutionary basis of this phenomenon in lupin has not yet been deciphered. Bacterial artificial chromosome (BAC) clones carrying LanFTc1 and LanFTc2 genes were localized in different mitotic chromosomes and constituted sequence-specific landmarks for linkage groups NLL-10 and NLL-17. BAC-derived superscaffolds containing LanFTc genes revealed clear microsyntenic patterns to genome sequences of nine legume species. Superscaffold-1 carrying LanFTc1 aligned to regions encoding one or more FT-like genes whereas superscaffold-2 mapped to a region lacking such a homolog. Comparative mapping of the L. angustifolius genome assembly anchored to linkage map localized superscaffold-1 in the middle of a 15 cM conserved, collinear region. In contrast, superscaffold-2 was found at the edge of a 20 cM syntenic block containing highly disrupted collinearity at the LanFTc2 locus. 118 PEBP-family full-length homologs were identified in 10 legume genomes. Bayesian phylogenetic inference provided novel evidence supporting the hypothesis that whole-genome and tandem duplications contributed to expansion of PEBP-family genes in legumes. Duplicated genes were subjected to strong purifying selection. Promoter analysis of FT genes revealed no statistically significant sequence similarity between duplicated copies; only RE-alpha and CCAAT-box motifs were found at conserved positions and orientations. Numerous lineage-specific duplications occurred during the evolution of legume PEBP-family genes. Whole-genome duplications resulted in the origin of subclades FTa, FTb and FTc and in the multiplication of FTa and FTb copy number. LanFTc1 is located in the region conserved among all main lineages of Papilionoideae. LanFTc1 is a direct descendant of ancestral FTc, whereas LanFTc2 appeared by subsequent duplication.
The butterfly plant arms-race escalated by gene and genome duplications
Edger, Patrick P.; Heidel-Fischer, Hanna M.; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E.; Heckel, David G.; Der, Joshua P.; Wafula, Eric K.; Tang, Michelle; Hofberger, Johannes A.; Smithson, Ann; Hall, Jocelyn C.; Blanchette, Matthieu; Bureau, Thomas E.; Wright, Stephen I.; dePamphilis, Claude W.; Eric Schranz, M.; Barker, Michael S.; Conant, Gavin C.; Wahlberg, Niklas; Vogel, Heiko; Pires, J. Chris; Wheat, Christopher W.
2015-01-01
Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits. PMID:26100883
Linder, P; Dölz, R; Mossé, M O; Lazowska, J; Slonimski, P P
1993-01-01
The amount of nucleotide sequence data is increasing exponentially. We therefore made an effort to make a comprehensive database (LISTA) for the yeast Saccharomyces cerevisiae. Each sequence has been attributed a single genetic name and in the case of allelic duplicated sequences, synonyms are given, if necessary. For the nomenclature we have introduced a standard principle for naming gene sequences based on priority rules. We have also applied a simple method to distinguish duplicated sequences of one and the same gene from non-allelic sequences of duplicated genes. By using these principles we have sorted out a lot of confusion in the literature and databanks. Along with the genetic name, the mnemonic from the EMBL databank, the codon bias, reference of the publication of the sequence and the EMBL accession numbers are included in each entry. PMID:8332521
The butterfly plant arms-race escalated by gene and genome duplications.
Edger, Patrick P; Heidel-Fischer, Hanna M; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E; Heckel, David G; Der, Joshua P; Wafula, Eric K; Tang, Michelle; Hofberger, Johannes A; Smithson, Ann; Hall, Jocelyn C; Blanchette, Matthieu; Bureau, Thomas E; Wright, Stephen I; dePamphilis, Claude W; Eric Schranz, M; Barker, Michael S; Conant, Gavin C; Wahlberg, Niklas; Vogel, Heiko; Pires, J Chris; Wheat, Christopher W
2015-07-07
Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits.
De novo direct duplication of chromosome segment 22q11.2-q13.1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fujimoto, Atsuko; Lin, Ming S.
Lindsay et al. [1995] reported a case of de novo duplication of the segment 22q11-q12. Molecular cytogenetics studies showed that the segment includes the regions responsible for the {open_quotes}cat eye,{close_quotes} DiGeorge, and velo-cardio-facial syndrome, and extends distal to the breakpoint cluster region. The phenotype was milder than that of complete trisomy 22 and der(22)t(11;22) (q23;q11) syndrome and was similar in type and severity to that of {open_quotes}cat eye{close_quotes} syndrome (CES). They suggested that trisomy of gene(s) responsible for the CES might have a predominant phenotypic effect over other genes present in the region duplicated in their patient. 3 refs., 2more » figs.« less
van der Ley, P
1988-11-01
Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
MECP2 duplications in six patients with complex sex chromosome rearrangements
Breman, Amy M; Ramocki, Melissa B; Kang, Sung-Hae L; Williams, Misti; Freedenberg, Debra; Patel, Ankita; Bader, Patricia I; Cheung, Sau Wai
2011-01-01
Duplications of the Xq28 chromosome region resulting in functional disomy are associated with a distinct clinical phenotype characterized by infantile hypotonia, severe developmental delay, progressive neurological impairment, absent speech, and proneness to infections. Increased expression of the dosage-sensitive MECP2 gene is considered responsible for the severe neurological impairments observed in affected individuals. Although cytogenetically visible duplications of Xq28 are well documented in the published literature, recent advances using array comparative genomic hybridization (CGH) led to the detection of an increasing number of microduplications spanning MECP2. In rare cases, duplication results from intrachromosomal rearrangement between the X and Y chromosomes. We report six cases with sex chromosome rearrangements involving duplication of MECP2. Cases 1–4 are unbalanced rearrangements between X and Y, resulting in MECP2 duplication. The additional Xq material was translocated to Yp in three cases (cases 1–3), and to the heterochromatic region of Yq12 in one case (case 4). Cases 5 and 6 were identified by array CGH to have a loss in copy number at Xp and a gain in copy number at Xq28 involving the MECP2 gene. In both cases, fluorescent in situ hybridization (FISH) analysis revealed a recombinant X chromosome containing the duplicated material from Xq28 on Xp, resulting from a maternal pericentric inversion. These cases add to a growing number of MECP2 duplications that have been detected by array CGH, while demonstrating the value of confirmatory chromosome and FISH studies for the localization of the duplicated material and the identification of complex rearrangements. PMID:21119712
Cheetahs have 4 serum amyloid a genes evolved through repeated duplication events.
Chen, Lei; Une, Yumi; Higuchi, Keiichi; Mori, Masayuki
2012-01-01
Amyloid A (AA) amyloidosis is a leading cause of mortality in captive cheetahs (Acinonyx jubatus). We performed genome walking and PCR cloning and revealed that cheetahs have 4 SAA genes (provisionally named SAA1A, SAA1B, SAA3A, and SAA3B). In addition, we identified multiple nucleotide polymorphisms in the 4 SAA genes by screening 51 cheetahs. The polymorphisms defined 4, 7, 6, and 4 alleles for SAA1A, SAA3A, SAA1B, and SAA3B, respectively. Pedigree analysis of the inheritance of genotypes for the SAA genes revealed that specific combinations of alleles for the 4 SAA genes cosegregated as a unit (haplotype) in pedigrees, indicating that the 4 genes were linked on the same chromosome. Notably, cheetah SAA1A and SAA1B were highly homologous in their nucleotide sequences. Likewise, SAA3A and SAA3B genes were homologous. These observations suggested a model for the evolution of the 4 SAA genes in cheetahs in which duplication of an ancestral SAA gene first gave rise to SAA1 and SAA3. Subsequently, each gene duplicated one more time, uniquely making 4 genes in the cheetah genome. The monomorphism of the cheetah SAA1A protein might be one of the factors responsible for the high incidence of AA amyloidosis in this species.
Trento, Alfonsina; Viegas, Mariana; Galiano, Mónica; Videla, Cristina; Carballal, Guadalupe; Mistchenko, Alicia S.; Melero, José A.
2006-01-01
A total of 47 clinical samples were identified during an active surveillance program of respiratory infections in Buenos Aires (BA) (1999 to 2004) that contained sequences of human respiratory syncytial virus (HRSV) with a 60-nucleotide duplication in the attachment (G) protein gene. This duplication was analogous to that previously described for other three viruses also isolated in Buenos Aires in 1999 (A. Trento et al., J. Gen. Virol. 84:3115-3120, 2003). Phylogenetic analysis indicated that BA sequences with that duplication shared a common ancestor (dated about 1998) with other HRSV G sequences reported worldwide after 1999. The duplicated nucleotide sequence was an exact copy of the preceding 60 nucleotides in early viruses, but both copies of the duplicated segment accumulated nucleotide substitutions in more recent viruses at a rate apparently higher than in other regions of the G protein gene. The evolution of the viruses with the duplicated G segment apparently followed the overall evolutionary pattern previously described for HRSV, and this genotype has replaced other prevailing antigenic group B genotypes in Buenos Aires and other places. Thus, the duplicated segment represents a natural tag that can be used to track the dissemination and evolution of HRSV in an unprecedented setting. We have taken advantage of this situation to reexamine the molecular epidemiology of HRSV and to explore the natural history of this important human pathogen. PMID:16378999
Evidence of function for conserved noncoding sequences in Arabidopsis thaliana.
Spangler, Jacob B; Subramaniam, Sabarinath; Freeling, Michael; Feltus, F Alex
2012-01-01
• Whole genome duplication events provide a lineage with a large reservoir of genes that can be molded by evolutionary forces into phenotypes that fit alternative environments. A well-studied whole genome duplication, the α-event, occurred in an ancestor of the model plant Arabidopsis thaliana. Retained segments of the α-event have been defined in recent years in the form of duplicate protein coding sequences (α-pairs) and associated conserved noncoding DNA sequences (CNSs). Our aim was to identify any association between CNSs and α-pair co-functionality at the gene expression level. • Here, we tested for correlation between CNS counts and α-pair co-expression and expression intensity across nine expression datasets: aerial tissue, flowers, leaves, roots, rosettes, seedlings, seeds, shoots and whole plants. • We provide evidence for a putative regulatory role of the CNSs. The association of CNSs with α-pair co-expression and expression intensity varied by gene function, subgene position and the presence of transcription factor binding motifs. A range of possible CNS regulatory mechanisms, including intron-mediated enhancement, messenger RNA fold stability and transcriptional regulation, are discussed. • This study provides a framework to understand how CNS motifs are involved in the maintenance of gene expression after a whole genome duplication event. © 2011 The Authors. New Phytologist © 2011 New Phytologist Trust.
Duplication and concerted evolution in a master sex determiner under balancing selection.
Privman, Eyal; Wurm, Yannick; Keller, Laurent
2013-05-07
The transformer (tra) gene is a key regulator in the signalling hierarchy controlling all aspects of somatic sexual differentiation in Drosophila and other insects. Here, we show that six of the seven sequenced ants have two copies of tra. Surprisingly, the two paralogues are always more similar within species than among species. Comparative sequence analyses indicate that this pattern is owing to the ongoing concerted evolution after an ancestral duplication rather than independent duplications in each of the six species. In particular, there was strong support for inter-locus recombination between the paralogues of the ant Atta cephalotes. In the five species where the location of paralogues is known, they are adjacent to each other in four cases and separated by only few genes in the fifth case. Because there have been extensive genomic rearrangements in these lineages, this suggests selection acting to conserve their synteny. In three species, we also find a signature of positive selection in one of the paralogues. In three bee species where information is available, the tra gene is also duplicated, the copies are adjacent and in at least one species there was recombination between paralogues. These results suggest that concerted evolution plays an adaptive role in the evolution of this gene family.
Liu, Yan-Yan; Yang, Ke-Zhen; Wei, Xiao-Xin; Wang, Xiao-Quan
2016-11-01
Angiosperms and gymnosperms are two major groups of extant seed plants. It has been suggested that gymnosperms lack FLOWERING LOCUS T (FT), a key integrator at the core of flowering pathways in angiosperms. Taking advantage of newly released gymnosperm genomes, we revisited the evolutionary history of the plant phosphatidylethanolamine-binding protein (PEBP) gene family through phylogenetic reconstruction. Expression patterns in three gymnosperm taxa and heterologous expression in Arabidopsis were studied to investigate the functions of gymnosperm FT-like and TERMINAL FLOWER 1 (TFL1)-like genes. Phylogenetic reconstruction suggests that an ancient gene duplication predating the divergence of seed plants gave rise to the FT and TFL1 genes. Expression patterns indicate that gymnosperm TFL1-like genes play a role in the reproductive development process, while GymFT1 and GymFT2, the FT-like genes resulting from a duplication event in the common ancestor of gymnosperms, function in both growth rhythm and sexual development pathways. When expressed in Arabidopsis, both spruce FT-like and TFL1-like genes repressed flowering. Our study demonstrates that gymnosperms do have FT-like and TFL1-like genes. Frequent gene and genome duplications contributed significantly to the expansion of the plant PEBP gene family. The expression patterns of gymnosperm PEBP genes provide novel insight into the functional evolution of this gene family. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
2013-01-01
Background WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. Results We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. Conclusions In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have different evolutionary rates. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. PMID:24088323
Kakinuma, Hiroaki; Ozaki, Mamoru; Sato, Hitoshi; Takahashi, Hiroaki
2008-09-05
Autism has been associated with chromosomal aberrations, including duplications at chromosome 4, and the identification of genetic factors contributing to the etiology of this disease is the focus of much research. Here we report a Japanese girl with mosaic of chromosome 4p duplication, mos 46,XX,dup(4)(p12p16)[54]/46,XX[6], who was diagnosed with autism at 3 years of age. Fluorescence in situ hybridization (FISH) with probes covering the region spanning a cluster of the gamma aminobutyric acid A (GABA-A) receptor subunit genes in the proximal short arm of chromosome 4 demonstrated total three signals for the GABRG1, GABRA4, and GABRA2 genes, but only two signals for GABRB1. This suggests that aberrant copy number of the GABA-A receptor subunit genes may contribute to the etiology of autism in this patient. 2007 Wiley-Liss, Inc.
Paterson, Andrew D; Rommens, Johanna M; Bharaj, Bhupinder; Blavignac, Jessica; Wong, Isidro; Diamandis, Maria; Waye, John S; Rivard, Georges E; Hayward, Catherine P M
2010-02-11
Quebec platelet disorder (QPD) is an autosomal dominant bleeding disorder linked to a region on chromosome 10 that includes PLAU, the urokinase plasminogen activator gene. QPD increases urokinase plasminogen activator mRNA levels, particularly during megakaryocyte differentiation, without altering expression of flanking genes. Because PLAU sequence changes were excluded as the cause of this bleeding disorder, we investigated whether the QPD mutation involved PLAU copy number variation. All 38 subjects with QPD had a direct tandem duplication of a 78-kb genomic segment that includes PLAU. This mutation was specific to QPD as it was not present in any unaffected family members (n = 114), unrelated French Canadians (n = 221), or other persons tested (n = 90). This new information on the genetic mutation will facilitate diagnostic testing for QPD and studies of its pathogenesis and prevalence. QPD is the first bleeding disorder to be associated with a gene duplication event and a PLAU mutation.
Sandoval, Gloria Tatiana Vinasco; Jaimes, Giovanna Carola; Barrios, Mauricio Coll; Cespedes, Camila; Velasco, Harvy Mauricio
2014-03-01
SHOX gene mutations or haploinsufficiency cause a wide range of phenotypes such as Leri Weill dyschondrosteosis (LWD), Turner syndrome, and disproportionate short stature (DSS). However, this gene has also been found to be mutated in cases of idiopathic short stature (ISS) with a 3-15% frequency. In this study, the multiplex ligation-dependent probe amplification (MLPA) technique was employed to determine the frequency of SHOX gene mutations and their conserved noncoding elements (CNE) in Colombian patients with ISS. Patients were referred from different centers around the county. From a sample of 62 patients, 8.1% deletions and insertions in the intragenic regions and in the CNE were found. This result is similar to others published in other countries. Moreover, an isolated case of CNE 9 duplication and a new intron 6b deletion in another patient, associated with ISS, are described. This is one of the first studies of a Latin American population in which deletions/duplications of the SHOX gene and its CNE are examined in patients with ISS.
Sandoval, Gloria Tatiana Vinasco; Jaimes, Giovanna Carola; Barrios, Mauricio Coll; Cespedes, Camila; Velasco, Harvy Mauricio
2014-01-01
SHOX gene mutations or haploinsufficiency cause a wide range of phenotypes such as Leri Weill dyschondrosteosis (LWD), Turner syndrome, and disproportionate short stature (DSS). However, this gene has also been found to be mutated in cases of idiopathic short stature (ISS) with a 3–15% frequency. In this study, the multiplex ligation-dependent probe amplification (MLPA) technique was employed to determine the frequency of SHOX gene mutations and their conserved noncoding elements (CNE) in Colombian patients with ISS. Patients were referred from different centers around the county. From a sample of 62 patients, 8.1% deletions and insertions in the intragenic regions and in the CNE were found. This result is similar to others published in other countries. Moreover, an isolated case of CNE 9 duplication and a new intron 6b deletion in another patient, associated with ISS, are described. This is one of the first studies of a Latin American population in which deletions/duplications of the SHOX gene and its CNE are examined in patients with ISS. PMID:24689071
Genetics Home Reference: MECP2 duplication syndrome
... of autism spectrum disorders that affect communication and social interaction. Females with a MECP2 gene duplication tend to ... Accessibility FOIA Viewers & Players U.S. Department of Health & Human Services National Institutes of Health National Library of ...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bikker, H.; Hartog, M.T. den; Gons, M.H.
1994-07-01
In this study, the authors present the molecular basis of a total iodide organification defect causing severe congenital hypothyroidism. In the thyroid gland of the patient, thyroid peroxidase (TPO) activity and the iodination degree of thyroglobulin were below detection limits, and no TPO messenger ribonucleic acid was detectable by Northern blot analysis. Denaturing gradient gel electrophoretic analysis of the TPO gene of the patient revealed a homozygous mutation in exon 2. Sequence analysis showed the presence of a 20-basepair duplication, 47 basepairs down-stream of the ATG start codon. This duplication generates a frame shift, resulting in a termination signal inmore » exon 3, compatible with the complete absence of TPO. Both parents of the patient are heterozygous for the same duplication, confirming the recessive mode of inheritance of the mutation. 32 refs., 4 figs.« less
Kang, Sung-Hwan; Atallah, Osama O; Sun, Yong-Duo; Folimonova, Svetlana Y
2018-01-15
Viruses from the family Closteroviridae show an example of intra-genome duplications of more than one gene. In addition to the hallmark coat protein gene duplication, several members possess a tandem duplication of papain-like leader proteases. In this study, we demonstrate that domains encoding the L1 and L2 proteases in the Citrus tristeza virus genome underwent a significant functional divergence at the RNA and protein levels. We show that the L1 protease is crucial for viral accumulation and establishment of initial infection, whereas its coding region is vital for virus transport. On the other hand, the second protease is indispensable for virus infection of its natural citrus host, suggesting that L2 has evolved an important adaptive function that mediates virus interaction with the woody host. Copyright © 2017 Elsevier Inc. All rights reserved.
Comparative genomic analysis of the WRKY III gene family in populus, grape, arabidopsis and rice.
Wang, Yiyi; Feng, Lin; Zhu, Yuxin; Li, Yuan; Yan, Hanwei; Xiang, Yan
2015-09-08
WRKY III genes have significant functions in regulating plant development and resistance. In plant, WRKY gene family has been studied in many species, however, there still lack a comprehensive analysis of WRKY III genes in the woody plant species poplar, three representative lineages of flowering plant species are incorporated in most analyses: Arabidopsis (a model plant for annual herbaceous dicots), grape (one model plant for perennial dicots) and Oryza sativa (a model plant for monocots). In this study, we identified 10, 6, 13 and 28 WRKY III genes in the genomes of Populus trichocarpa, grape (Vitis vinifera), Arabidopsis thaliana and rice (Oryza sativa), respectively. Phylogenetic analysis revealed that the WRKY III proteins could be divided into four clades. By microsynteny analysis, we found that the duplicated regions were more conserved between poplar and grape than Arabidopsis or rice. We dated their duplications by Ks analysis of Populus WRKY III genes and demonstrated that all the blocks were formed after the divergence of monocots and dicots. Strong purifying selection has played a key role in the maintenance of WRKY III genes in Populus. Tissue expression analysis of the WRKY III genes in Populus revealed that five were most highly expressed in the xylem. We also performed quantitative real-time reverse transcription PCR analysis of WRKY III genes in Populus treated with salicylic acid, abscisic acid and polyethylene glycol to explore their stress-related expression patterns. This study highlighted the duplication and diversification of the WRKY III gene family in Populus and provided a comprehensive analysis of this gene family in the Populus genome. Our results indicated that the majority of WRKY III genes of Populus was expanded by large-scale gene duplication. The expression pattern of PtrWRKYIII gene identified that these genes play important roles in the xylem during poplar growth and development, and may play crucial role in defense to drought stress. Our results presented here may aid in the selection of appropriate candidate genes for further characterization of their biological functions in poplar.
Marková, S; Searle, J B; Kotlík, P
2014-01-01
Gene duplication plays an important role in the origin of evolutionary novelties, but the mechanisms responsible for the retention and functional divergence of the duplicated copy are not fully understood. The α-globin genes provide an example of a gene family with different numbers of gene duplicates among rodents. Whereas Rattus and Peromyscus each have three adult α-globin genes (HBA-T1, HBA-T2 and HBA-T3), Mus has only two copies. High rates of amino acid evolution in the independently derived HBA-T3 genes of Peromyscus and Rattus have been attributed to positive selection. Using RACE PCR, reverse transcription-PCR (RT–PCR) and RNA-seq, we show that another rodent, the bank vole Clethrionomys glareolus, possesses three transcriptionally active α-globin genes. The bank vole HBA-T3 gene is distinguished from each HBA-T1 and HBA-T2 by 20 amino acids and is transcribed 23- and 4-fold lower than HBA-T1 and HBA-T2, respectively. Polypeptides corresponding to all three genes are detected by electrophoresis, demonstrating that the translated products of HBA-T3 are present in adult erythrocytes. Patterns of codon substitution and the presence of low-frequency null alleles suggest a postduplication relaxation of purifying selection on bank vole HBA-T3. PMID:24595364
Isles, Anthony R.; Ingason, Andrés; Lowther, Chelsea; Gawlick, Micha; Stöber, Gerald; Potter, Harry; Georgieva, Lyudmila; Pizzo, Lucilla; Ozaki, Norio; Kushima, Itaru; Ikeda, Masashi; Iwata, Nakao; Levinson, Douglas F.; Gejman, Pablo V.; Shi, Jianxin; Sanders, Alan R.; Duan, Jubao; Sisodiya, Sanjay; Costain, Gregory; Degenhardt, Franziska; Giegling, Ina; Rujescu, Dan; Hreidarsson, Stefan J.; Saemundsen, Evald; Ahn, Joo Wook; Ogilvie, Caroline; Stefansson, Hreinn; Stefansson, Kari; O’Donovan, Michael C.; Owen, Michael J.; Bassett, Anne; Kirov, George
2016-01-01
Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally expressed imprinted genes in the contribution of Copy Number Variants (CNVs) at this interval to the incidence of psychotic illness. This work will have tangible benefits for patients with 15q11.2-q13.3 duplications by aiding genetic counseling. PMID:27153221
Isles, Anthony R; Ingason, Andrés; Lowther, Chelsea; Walters, James; Gawlick, Micha; Stöber, Gerald; Rees, Elliott; Martin, Joanna; Little, Rosie B; Potter, Harry; Georgieva, Lyudmila; Pizzo, Lucilla; Ozaki, Norio; Aleksic, Branko; Kushima, Itaru; Ikeda, Masashi; Iwata, Nakao; Levinson, Douglas F; Gejman, Pablo V; Shi, Jianxin; Sanders, Alan R; Duan, Jubao; Willis, Joseph; Sisodiya, Sanjay; Costain, Gregory; Werge, Thomas M; Degenhardt, Franziska; Giegling, Ina; Rujescu, Dan; Hreidarsson, Stefan J; Saemundsen, Evald; Ahn, Joo Wook; Ogilvie, Caroline; Girirajan, Santhosh D; Stefansson, Hreinn; Stefansson, Kari; O'Donovan, Michael C; Owen, Michael J; Bassett, Anne; Kirov, George
2016-05-01
Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally expressed imprinted genes in the contribution of Copy Number Variants (CNVs) at this interval to the incidence of psychotic illness. This work will have tangible benefits for patients with 15q11.2-q13.3 duplications by aiding genetic counseling.
Zou, Hong; Jakovlić, Ivan; Chen, Rong; Zhang, Dong; Zhang, Jin; Li, Wen-Xiang; Wang, Gui-Tang
2017-11-02
Complete mitochondrial genomes are much better suited for the taxonomic identification and phylogenetic studies of nematodes than morphology or traditionally-used molecular markers, but they remain unavailable for the entire Camallanidae family (Chromadorea). As the only published mitogenome in the Camallanina suborder (Dracunculoidea superfamily) exhibited a unique gene order, the other objective of this research was to study the evolution of mitochondrial architecture in the Spirurida order. Thus, we sequenced the complete mitogenome of the Camallanus cotti fish parasite and conducted structural and phylogenomic comparative analyses with all available Spirurida mitogenomes. The mitogenome is exceptionally large (17,901 bp) among the Chromadorea and, with 46 (pseudo-) genes, exhibits a unique architecture among nematodes. Six protein-coding genes (PCGs) and six tRNAs are duplicated. An additional (seventh) tRNA (Trp) was probably duplicated by the remolding of tRNA-Ser2 (missing). Two pairs of these duplicated PCGs might be functional; three were incomplete and one contained stop codons. Apart from Ala and Asp, all other duplicated tRNAs are conserved and probably functional. Only 19 unique tRNAs were found. Phylogenomic analysis included Gnathostomatidae (Spirurina) in the Camallanina suborder. Within the Nematoda, comparable PCG duplications were observed only in the enoplean Mermithidae family, but those result from mitochondrial recombination, whereas characteristics of the studied mitogenome suggest that likely rearrangement mechanisms are either a series of duplications, transpositions and random loss events, or duplication, fragmentation and subsequent reassembly of the mitogenome. We put forward a hypothesis that the evolution of mitogenomic architecture is extremely discontinuous, and that once a long period of stasis in gene order and content has been punctuated by a rearrangement event, such a destabilised mitogenome is much more likely to undergo subsequent rearrangement events, resulting in an exponentially accelerated evolutionary rate of mitogenomic rearrangements. Implications of this model are particularly important for the application of gene order similarity as an additive source of phylogenetic information. Chromadorean nematodes, and particularly Camallanina clade (with C. cotti as an example of extremely accelerated rate of rearrangements), might be a good model to further study this discontinuity in the dynamics of mitogenomic evolution.
The zebrafish genome: a review and msx gene case study.
Postlethwait, J H
2006-01-01
Zebrafish is one of several important teleost models for understanding principles of vertebrate developmental, molecular, organismal, genetic, evolutionary, and genomic biology. Efficient investigation of the molecular genetic basis of induced mutations depends on knowledge of the zebrafish genome. Principles of zebrafish genomic analysis, including gene mapping, ortholog identification, conservation of syntenies, genome duplication, and evolution of duplicate gene function are discussed here using as a case study the zebrafish msxa, msxb, msxc, msxd, and msxe genes, which together constitute zebrafish orthologs of tetrapod Msx1, Msx2, and Msx3. Genomic analysis suggests orthologs for this difficult to understand group of paralogs.
Molecular Evolution and Expansion Analysis of the NAC Transcription Factor in Zea mays
Fan, Kai; Wang, Ming; Miao, Ying; Ni, Mi; Bibi, Noreen; Yuan, Shuna; Li, Feng; Wang, Xuede
2014-01-01
NAC (NAM, ATAF1, 2 and CUC2) family is a plant-specific transcription factor and it controls various plant developmental processes. In the current study, 124 NAC members were identified in Zea mays and were phylogenetically clustered into 13 distinct subfamilies. The whole genome duplication (WGD), especially an additional WGD event, may lead to expanding ZmNAC members. Different subfamily has different expansion rate, and NAC subfamily preference was found during the expansion in maize. Moreover, the duplication events might occur after the divergence of the lineages of Z. mays and S. italica, and segmental duplication seemed to be the dominant pattern for the gene duplication in maize. Furthermore, the expansion of ZmNAC members may be also related to gain and loss of introns. Besides, the restriction of functional divergence was discovered after most of the gene duplication events. These results could provide novel insights into molecular evolution and expansion analysis of NAC family in maize, and advance the NAC researches in other plants, especially polyploid plants. PMID:25369196
Landsverk, Megan L.; Ruzzo, Elizabeth K.; Mefford, Heather C.; Buysse, Karen; Buchan, Jillian G.; Eichler, Evan E.; Petty, Elizabeth M.; Peterson, Esther A.; Knutzen, Dana M.; Barnett, Karen; Farlow, Martin R.; Caress, Judy; Parry, Gareth J.; Quan, Dianna; Gardner, Kathy L.; Hong, Ming; Simmons, Zachary; Bird, Thomas D.; Chance, Phillip F.; Hannibal, Mark C.
2009-01-01
Hereditary neuralgic amyotrophy (HNA) is an autosomal dominant disorder associated with recurrent episodes of focal neuropathy primarily affecting the brachial plexus. Point mutations in the SEPT9 gene have been previously identified as the molecular basis of HNA in some pedigrees. However in many families, including those from North America demonstrating a genetic founder haplotype, no sequence mutations have been detected. We report an intragenic 38 Kb SEPT9 duplication that is linked to HNA in 12 North American families that share the common founder haplotype. Analysis of the breakpoints showed that the duplication is identical in all pedigrees, and molecular analysis revealed that the duplication includes the 645 bp exon in which previous HNA mutations were found. The SEPT9 transcript variants that span this duplication contain two in-frame repeats of this exon, and immunoblotting demonstrates larger molecular weight SEPT9 protein isoforms. This exon also encodes for a majority of the SEPT9 N-terminal proline rich region suggesting that this region plays a role in the pathogenesis of HNA. PMID:19139049
Landsverk, Megan L; Ruzzo, Elizabeth K; Mefford, Heather C; Buysse, Karen; Buchan, Jillian G; Eichler, Evan E; Petty, Elizabeth M; Peterson, Esther A; Knutzen, Dana M; Barnett, Karen; Farlow, Martin R; Caress, Judy; Parry, Gareth J; Quan, Dianna; Gardner, Kathy L; Hong, Ming; Simmons, Zachary; Bird, Thomas D; Chance, Phillip F; Hannibal, Mark C
2009-04-01
Hereditary neuralgic amyotrophy (HNA) is an autosomal dominant disorder associated with recurrent episodes of focal neuropathy primarily affecting the brachial plexus. Point mutations in the SEPT9 gene have been previously identified as the molecular basis of HNA in some pedigrees. However in many families, including those from North America demonstrating a genetic founder haplotype, no sequence mutations have been detected. We report an intragenic 38 Kb SEPT9 duplication that is linked to HNA in 12 North American families that share the common founder haplotype. Analysis of the breakpoints showed that the duplication is identical in all pedigrees, and molecular analysis revealed that the duplication includes the 645 bp exon in which previous HNA mutations were found. The SEPT9 transcript variants that span this duplication contain two in-frame repeats of this exon, and immunoblotting demonstrates larger molecular weight SEPT9 protein isoforms. This exon also encodes for a majority of the SEPT9 N-terminal proline rich region suggesting that this region plays a role in the pathogenesis of HNA.
Liang, Liyang; Xie, Yingjun; Shen, Yiping; Yin, Qibin; Yuan, Haiming
2016-01-01
Proximal 4p deletion syndrome is a relatively rare genetic condition characterized by dysmorphic facial features, limb anomalies, minor congenital heart defects, hypogonadism, cafe-au-lait spots, developmental delay, tall and thin habitus, and intellectual disability. At present, over 20 cases of this syndrome have been published. However, duplication of the same region in proximal 4p has never been reported. Here, we describe a 2-year-5-month-old boy with severe congenital heart defects, limb anomalies, hypogonadism, distinctive facial features, pre- and postnatal developmental delay, and mild cognitive impairments. A de novo 4.5-Mb interstitial duplication at 4p15.2p15.1 was detected by chromosomal microarray analysis. Next-generation sequencing was employed and confirmed the duplication, but revealed no additional pathogenic variants. Several candidate genes in this interval responsible for the complex clinical phenotype were identified, such as RBPJ, STIM2, CCKAR, and LGI2. The results suggest a novel contiguous gene duplication syndrome. © 2016 S. Karger AG, Basel.
Barbaro, Michela; Cook, Jackie; Lagerstedt-Robinson, Kristina; Wedell, Anna
2012-01-01
A 160 kb minimal common region in Xp21 has been determined as the cause of XY gonadal dysgenesis, if duplicated. The region contains the MAGEB genes and the NR0B1 gene; this is the candidate for gonadal dysgenesis if overexpressed. Most patients present gonadal dysgenesis within a more complex phenotype. However, few independent cases have recently been described presenting with isolated XY gonadal dysgenesis caused by relatively small NR0B1 locus duplications. We have identified another NR0B1 duplication in two sisters with isolated XY gonadal dysgenesis with an X-linked inheritance pattern. We performed X-inactivation studies in three fertile female carriers of three different small NR0B1 locus duplications identified by our group. The carrier mothers did not show obvious skewing of X-chromosome inactivation, suggesting that NR0B1 overexpression does not impair ovarian function. We furthermore emphasize the importance to investigate the NR0B1 locus also in patients with isolated XY gonadal dysgenesis. PMID:22518125
Expansion of signal transduction pathways in fungi by extensive genome duplication
Corrochano, Luis M.; Kuo, Alan; Marcet-Houben, Marina; Polaino, Silvia; Salamov, Asaf; Villalobos-Escobedo, José M.; Grimwood, Jane; Álvarez, M. Isabel; Avalos, Javier; Bauer, Diane; Benito, Ernesto P.; Benoit, Isabelle; Burger, Gertraud; Camino, Lola P.; Cánovas, David; Cerdá-Olmedo, Enrique; Cheng, Jan-Fang; Domínguez, Angel; Eliáš, Marek; Eslava, Arturo P.; Glaser, Fabian; Gutiérrez, Gabriel; Heitman, Joseph; Henrissat, Bernard; Iturriaga, Enrique A.; Lang, B. Franz; Lavín, José L.; Lee, Soo Chan; Li, Wenjun; Lindquist, Erika; López-García, Sergio; Luque, Eva M.; Marcos, Ana T.; Martin, Joel; McCluskey, Kevin; Medina, Humberto R.; Miralles-Durán, Alejandro; Miyazaki, Atsushi; Muñoz-Torres, Elisa; Oguiza, José A.; Ohm, Robin A.; Orejas, Margarita; Ortiz-Castellanos, Lucila; Pisabarro, Antonio G.; Rodríguez-Romero, Julio; Ruiz-Herrera, José; Ruiz-Vázquez, Rosa; Sanz, Catalina; Schackwitz, Wendy; Shahriari, Mahdi; Shelest, Ekaterina; Silva-Franco, Fátima; Soanes, Darren; Syed, Khajamohiddin; Tagua, Víctor G.; Talbot, Nicholas J.; Thon, Michael R.; Tice, Hope; de Vries, Ronald P.; Wiebenga, Ad; Yadav, Jagjit S.; Braun, Edward L.; Baker, Scott E.; Garre, Victoriano; Schmutz, Jeremy; Horwitz, Benjamin A.; Torres-Martínez, Santiago; Idnurm, Alexander; Herrera-Estrella, Alfredo; Gabaldón, Toni; Grigoriev, Igor V.
2016-01-01
Summary Plants and fungi use light and other signals to regulate development, growth, and metabolism. The fruiting bodies of the fungus Phycomyces blakesleeanus are single cells that react to environmental cues, including light, but the mechanisms are largely unknown [1]. The related fungus Mucor circinelloides is an opportunistic human pathogen that changes its mode of growth upon receipt of signals from the environment to facilitate pathogenesis [2]. Understanding how these organisms respond to environmental cues should provide insights into the mechanisms of sensory perception and signal transduction by a single eukaryotic cell, and their role in pathogenesis. We sequenced the genomes of P. blakesleeanus and M. circinelloides, and show that they have been shaped by an extensive genome duplication or, most likely, a whole genome duplication (WGD), which is rarely observed in fungi [3-6]. We show that the genome duplication has expanded gene families, including those involved in signal transduction, and that duplicated genes have specialized, as evidenced by differences in their regulation by light. The transcriptional response to light varies with the developmental stage and is still observed in a photoreceptor mutant of P. blakesleeanus. A phototropic mutant of P. blakesleeanus with a heterozygous mutation in the photoreceptor gene madA demonstrates that photosensor dosage is important for the magnitude of signal transduction. We conclude that the genome duplication provided the means to improve signal transduction for enhanced perception of environmental signals. Our results will help to understand the role of genome dynamics in the evolution of sensory perception in eukaryotes. PMID:27238284
Long-read sequencing uncovers the adaptive topography of a carnivorous plant genome
Lan, Tianying; Renner, Tanya; Ibarra-Laclette, Enrique; Farr, Kimberly M.; Chang, Tien-Hao; Cervantes-Pérez, Sergio Alan; Zheng, Chunfang; Sankoff, David; Tang, Haibao; Purbojati, Rikky W.; Putra, Alexander; Drautz-Moses, Daniela I.; Schuster, Stephan C.; Herrera-Estrella, Luis; Albert, Victor A.
2017-01-01
Utricularia gibba, the humped bladderwort, is a carnivorous plant that retains a tiny nuclear genome despite at least two rounds of whole genome duplication (WGD) since common ancestry with grapevine and other species. We used a third-generation genome assembly with several complete chromosomes to reconstruct the two most recent lineage-specific ancestral genomes that led to the modern U. gibba genome structure. Patterns of subgenome dominance in the most recent WGD, both architectural and transcriptional, are suggestive of allopolyploidization, which may have generated genomic novelty and led to instantaneous speciation. Syntenic duplicates retained in polyploid blocks are enriched for transcription factor functions, whereas gene copies derived from ongoing tandem duplication events are enriched in metabolic functions potentially important for a carnivorous plant. Among these are tandem arrays of cysteine protease genes with trap-specific expression that evolved within a protein family known to be useful in the digestion of animal prey. Further enriched functions among tandem duplicates (also with trap-enhanced expression) include peptide transport (intercellular movement of broken-down prey proteins), ATPase activities (bladder-trap acidification and transmembrane nutrient transport), hydrolase and chitinase activities (breakdown of prey polysaccharides), and cell-wall dynamic components possibly associated with active bladder movements. Whereas independently polyploid Arabidopsis syntenic gene duplicates are similarly enriched for transcriptional regulatory activities, Arabidopsis tandems are distinct from those of U. gibba, while still metabolic and likely reflecting unique adaptations of that species. Taken together, these findings highlight the special importance of tandem duplications in the adaptive landscapes of a carnivorous plant genome. PMID:28507139
2010-01-01
Background Clock family genes encode transcription factors that regulate clock-controlled genes and thus regulate many physiological mechanisms/processes in a circadian fashion. Clock1 duplicates and copies of Clock3 and NPAS2-like genes were partially characterized (genomic sequencing) and mapped using family-based indels/SNPs in rainbow trout (RT)(Oncorhynchus mykiss), Arctic charr (AC)(Salvelinus alpinus), and Atlantic salmon (AS)(Salmo salar) mapping panels. Results Clock1 duplicates mapped to linkage groups RT-8/-24, AC-16/-13 and AS-2/-18. Clock3/NPAS2-like genes mapped to RT-9/-20, AC-20/-43, and AS-5. Most of these linkage group regions containing the Clock gene duplicates were derived from the most recent 4R whole genome duplication event specific to the salmonids. These linkage groups contain quantitative trait loci (QTL) for life history and growth traits (i.e., reproduction and cell cycling). Comparative synteny analyses with other model teleost species reveal a high degree of conservation for genes in these chromosomal regions suggesting that functionally related or co-regulated genes are clustered in syntenic blocks. For example, anti-müllerian hormone (amh), regulating sexual maturation, and ornithine decarboxylase antizymes (oaz1 and oaz2), regulating cell cycling, are contained within these syntenic blocks. Conclusions Synteny analyses indicate that regions homologous to major life-history QTL regions in salmonids contain many candidate genes that are likely to influence reproduction and cell cycling. The order of these genes is highly conserved across the vertebrate species examined, and as such, these genes may make up a functional cluster of genes that are likely co-regulated. CLOCK, as a transcription factor, is found within this block and therefore has the potential to cis-regulate the processes influenced by these genes. Additionally, clock-controlled genes (CCGs) are located in other life-history QTL regions within salmonids suggesting that at least in part, trans-regulation of these QTL regions may also occur via Clock expression. PMID:20670436
Hostetler, Jessica B.; Lo, Eugenia; Kanjee, Usheer; Amaratunga, Chanaki; Suon, Seila; Sreng, Sokunthea; Mao, Sivanna; Yewhalaw, Delenasaw; Mascarenhas, Anjali; Kwiatkowski, Dominic P.; Ferreira, Marcelo U.; Rathod, Pradipsinh K.; Yan, Guiyun; Fairhurst, Rick M.; Duraisingh, Manoj T.; Rayner, Julian C.
2016-01-01
Background Plasmodium vivax causes the majority of malaria episodes outside Africa, but remains a relatively understudied pathogen. The pathology of P. vivax infection depends critically on the parasite’s ability to recognize and invade human erythrocytes. This invasion process involves an interaction between P. vivax Duffy Binding Protein (PvDBP) in merozoites and the Duffy antigen receptor for chemokines (DARC) on the erythrocyte surface. Whole-genome sequencing of clinical isolates recently established that some P. vivax genomes contain two copies of the PvDBP gene. The frequency of this duplication is particularly high in Madagascar, where there is also evidence for P. vivax infection in DARC-negative individuals. The functional significance and global prevalence of this duplication, and whether there are other copy number variations at the PvDBP locus, is unknown. Methodology/Principal Findings Using whole-genome sequencing and PCR to study the PvDBP locus in P. vivax clinical isolates, we found that PvDBP duplication is widespread in Cambodia. The boundaries of the Cambodian PvDBP duplication differ from those previously identified in Madagascar, meaning that current molecular assays were unable to detect it. The Cambodian PvDBP duplication did not associate with parasite density or DARC genotype, and ranged in prevalence from 20% to 38% over four annual transmission seasons in Cambodia. This duplication was also present in P. vivax isolates from Brazil and Ethiopia, but not India. Conclusions/Significance PvDBP duplications are much more widespread and complex than previously thought, and at least two distinct duplications are circulating globally. The same duplication boundaries were identified in parasites from three continents, and were found at high prevalence in human populations where DARC-negativity is essentially absent. It is therefore unlikely that PvDBP duplication is associated with infection of DARC-negative individuals, but functional tests will be required to confirm this hypothesis. PMID:27798646
The Evolution of Pepsinogen C Genes in Vertebrates: Duplication, Loss and Functional Diversification
Gonçalves, Odete; Wilson, Jonathan Mark
2012-01-01
Background Aspartic proteases comprise a large group of enzymes involved in peptide proteolysis. This collection includes prominent enzymes globally categorized as pepsins, which are derived from pepsinogen precursors. Pepsins are involved in gastric digestion, a hallmark of vertebrate physiology. An important member among the pepsinogens is pepsinogen C (Pgc). A particular aspect of Pgc is its apparent single copy status, which contrasts with the numerous gene copies found for example in pepsinogen A (Pga). Although gene sequences with similarity to Pgc have been described in some vertebrate groups, no exhaustive evolutionary framework has been considered so far. Methodology/Principal Findings By combining phylogenetics and genomic analysis, we find an unexpected Pgc diversity in the vertebrate sub-phylum. We were able to reconstruct gene duplication timings relative to the divergence of major vertebrate clades. Before tetrapod divergence, a single Pgc gene tandemly expanded to produce two gene lineages (Pgbc and Pgc2). These have been differentially retained in various classes. Accordingly, we find Pgc2 in sauropsids, amphibians and marsupials, but not in eutherian mammals. Pgbc was retained in amphibians, but duplicated in the ancestor of amniotes giving rise to Pgb and Pgc1. The latter was retained in mammals and probably in reptiles and marsupials but not in birds. Pgb was kept in all of the amniote clade with independent episodes of loss in some mammalian species. Lineage specific expansions of Pgc2 and Pgbc have also occurred in marsupials and amphibians respectively. We find that teleost and tetrapod Pgc genes reside in distinct genomic regions hinting at a possible translocation. Conclusions We conclude that the repertoire of Pgc genes is larger than previously reported, and that tandem duplications have modelled the history of Pgc genes. We hypothesize that gene expansion lead to functional divergence in tetrapods, coincident with the invasion of terrestrial habitats. PMID:22427897
Takeuchi, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Kanda, Miyuki; Hisata, Kanako; Fujie, Manabu; Goto, Hiroki; Yamasaki, Shinichi; Nagai, Kiyohito; Morino, Yoshiaki; Miyamoto, Hiroshi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Noriyuki; Kawashima, Takeshi
2016-01-01
Bivalve molluscs have flourished in marine environments, and many species constitute important aquatic resources. Recently, whole genome sequences from two bivalves, the pearl oyster, Pinctada fucata, and the Pacific oyster, Crassostrea gigas, have been decoded, making it possible to compare genomic sequences among molluscs, and to explore general and lineage-specific genetic features and trends in bivalves. In order to improve the quality of sequence data for these purposes, we have updated the entire P. fucata genome assembly. We present a new genome assembly of the pearl oyster, Pinctada fucata (version 2.0). To update the assembly, we conducted additional sequencing, obtaining accumulated sequence data amounting to 193× the P. fucata genome. Sequence redundancy in contigs that was caused by heterozygosity was removed in silico, which significantly improved subsequent scaffolding. Gene model version 2.0 was generated with the aid of manual gene annotations supplied by the P. fucata research community. Comparison of mollusc and other bilaterian genomes shows that gene arrangements of Hox, ParaHox, and Wnt clusters in the P. fucata genome are similar to those of other molluscs. Like the Pacific oyster, P. fucata possesses many genes involved in environmental responses and in immune defense. Phylogenetic analyses of heat shock protein70 and C1q domain-containing protein families indicate that extensive expansion of genes occurred independently in each lineage. Several gene duplication events prior to the split between the pearl oyster and the Pacific oyster are also evident. In addition, a number of tandem duplications of genes that encode shell matrix proteins are also well characterized in the P. fucata genome. Both the Pinctada and Crassostrea lineages have expanded specific gene families in a lineage-specific manner. Frequent duplication of genes responsible for shell formation in the P. fucata genome explains the diversity of mollusc shell structures. These duplications reveal dynamic genome evolution to forge the complex physiology that enables bivalves to employ a sessile lifestyle in the intertidal zone.
Barber, John C K; Hall, Victoria; Maloney, Viv K; Huang, Shuwen; Roberts, Angharad M; Brady, Angela F; Foulds, Nicki; Bewes, Beverley; Volleth, Marianne; Liehr, Thomas; Mehnert, Karl; Bateman, Mark; White, Helen
2013-01-01
Chromosome 16 contains multiple copy number variations (CNVs) that predispose to genomic disorders. Here, we differentiate pathogenic duplications of 16p11.2–p12.2 from microscopically similar euchromatic variants of 16p11.2. Patient 1 was a girl of 18 with autism, moderate intellectual disability, behavioural difficulties, dysmorphic features and a 7.71-Mb (megabase pair) duplication (16:21 521 005–29 233 146). Patient 2 had a 7.81-Mb duplication (16:21 382 561–29 191 527), speech delay and obsessional behaviour as a boy and, as an adult, short stature, macrocephaly and mild dysmorphism. The duplications contain 65 coding genes of which Polo-like kinase 1 (PLK1) has the highest likelihood of being haploinsufficient and, by implication, a triplosensitive gene. An additional 1.11-Mb CNV of 10q11.21 in Patient 1 was a possible modifier containing the G-protein-regulated inducer of neurite growth 2 (GPRIN2) gene. In contrast, the euchromatic variants in Patients 3 and 4 were amplifications from a 945-kb region containing non-functional immunoglobulin heavy chain (IGHV), hect domain pseudogene (HERC2P4) and TP53-inducible target gene 3 (TP53TG3) loci in proximal 16p11.2 (16:31 953 353–32 898 635). Paralogous pyrosequencing gave a total copy number of 3–8 in controls and 8 to >10 in Patients 3 and 4. The 16p11.2–p12.2 duplication syndrome is a recurrent genomic disorder with a variable phenotype including developmental delay, dysmorphic features, mild to severe intellectual disability, autism, obsessive or stereotyped behaviour, short stature and anomalies of the hands and fingers. It is important to differentiate pathogenic 16p11.2–p12.2 duplications from harmless, microscopically similar euchromatic variants of proximal 16p11.2, especially at prenatal diagnosis. PMID:22828807
Origin and functional diversification of an amphibian defense peptide arsenal.
Roelants, Kim; Fry, Bryan G; Ye, Lumeng; Stijlemans, Benoit; Brys, Lea; Kok, Philippe; Clynen, Elke; Schoofs, Liliane; Cornelis, Pierre; Bossuyt, Franky
2013-01-01
The skin secretion of many amphibians contains an arsenal of bioactive molecules, including hormone-like peptides (HLPs) acting as defense toxins against predators, and antimicrobial peptides (AMPs) providing protection against infectious microorganisms. Several amphibian taxa seem to have independently acquired the genes to produce skin-secreted peptide arsenals, but it remains unknown how these originated from a non-defensive ancestral gene and evolved diverse defense functions against predators and pathogens. We conducted transcriptome, genome, peptidome and phylogenetic analyses to chart the full gene repertoire underlying the defense peptide arsenal of the frog Silurana tropicalis and reconstruct its evolutionary history. Our study uncovers a cluster of 13 transcriptionally active genes, together encoding up to 19 peptides, including diverse HLP homologues and AMPs. This gene cluster arose from a duplicated gastrointestinal hormone gene that attained a HLP-like defense function after major remodeling of its promoter region. Instead, new defense functions, including antimicrobial activity, arose by mutation of the precursor proteins, resulting in the proteolytic processing of secondary peptides alongside the original ones. Although gene duplication did not trigger functional innovation, it may have subsequently facilitated the convergent loss of the original function in multiple gene lineages (subfunctionalization), completing their transformation from HLP gene to AMP gene. The processing of multiple peptides from a single precursor entails a mechanism through which peptide-encoding genes may establish new functions without the need for gene duplication to avoid adaptive conflicts with older ones.
Origin and Functional Diversification of an Amphibian Defense Peptide Arsenal
Roelants, Kim; Fry, Bryan G.; Ye, Lumeng; Stijlemans, Benoit; Brys, Lea; Kok, Philippe; Clynen, Elke; Schoofs, Liliane; Cornelis, Pierre; Bossuyt, Franky
2013-01-01
The skin secretion of many amphibians contains an arsenal of bioactive molecules, including hormone-like peptides (HLPs) acting as defense toxins against predators, and antimicrobial peptides (AMPs) providing protection against infectious microorganisms. Several amphibian taxa seem to have independently acquired the genes to produce skin-secreted peptide arsenals, but it remains unknown how these originated from a non-defensive ancestral gene and evolved diverse defense functions against predators and pathogens. We conducted transcriptome, genome, peptidome and phylogenetic analyses to chart the full gene repertoire underlying the defense peptide arsenal of the frog Silurana tropicalis and reconstruct its evolutionary history. Our study uncovers a cluster of 13 transcriptionally active genes, together encoding up to 19 peptides, including diverse HLP homologues and AMPs. This gene cluster arose from a duplicated gastrointestinal hormone gene that attained a HLP-like defense function after major remodeling of its promoter region. Instead, new defense functions, including antimicrobial activity, arose by mutation of the precursor proteins, resulting in the proteolytic processing of secondary peptides alongside the original ones. Although gene duplication did not trigger functional innovation, it may have subsequently facilitated the convergent loss of the original function in multiple gene lineages (subfunctionalization), completing their transformation from HLP gene to AMP gene. The processing of multiple peptides from a single precursor entails a mechanism through which peptide-encoding genes may establish new functions without the need for gene duplication to avoid adaptive conflicts with older ones. PMID:23935531
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B., Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants. PMID:23691254
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B, Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.
Fingert, John H.; Robin, Alan L.; Scheetz, Todd E.; Kwon, Young H.; Liebmann, Jeffrey M.; Ritch, Robert; Alward, Wallace L.M.
2016-01-01
Purpose To investigate the role of TANK-binding kinase 1 (TBK1) gene copy-number variations (ie, gene duplications and triplications) in the pathophysiology of various open-angle glaucomas. Methods In previous studies, we discovered that copy-number variations in the TBK1 gene are associated with normal-tension glaucoma. Here, we investigated the prevalence of copy-number variations in cohorts of patients with other open-angle glaucomas—juvenile-onset open-angle glaucoma (n=30), pigmentary glaucoma (n=209), exfoliation glaucoma (n=225), and steroid-induced glaucoma (n=79)—using a quantitative polymerase chain reaction assay. Results No TBK1 gene copy-number variations were detected in patients with juvenile-onset open-angle glaucoma, pigmentary glaucoma, or steroid-induced glaucoma. A TBK1 gene duplication was detected in one (0.44%) of the 225 exfoliation glaucoma patients. Conclusions TBK1 gene copy-number variations (gene duplications and triplications) have been previously associated with normal-tension glaucoma. An exploration of other open-angle glaucomas detected a TBK1 copy-number variation in a patient with exfoliation glaucoma, which is the first example of a TBK1 mutation in a glaucoma patient with a diagnosis other than normal-tension glaucoma. A broader phenotypic range may be associated with TBK1 copy-number variations, although mutations in this gene are most often detected in patients with normal-tension glaucoma. PMID:27881886
Jourda, Cyril; Cardi, Céline; Mbéguié-A-Mbéguié, Didier; Bocs, Stéphanie; Garsmeur, Olivier; D'Hont, Angélique; Yahiaoui, Nabila
2014-05-01
Whole-genome duplications (WGDs) are widespread in plants, and three lineage-specific WGDs occurred in the banana (Musa acuminata) genome. Here, we analysed the impact of WGDs on the evolution of banana gene families involved in ethylene biosynthesis and signalling, a key pathway for banana fruit ripening. Banana ethylene pathway genes were identified using comparative genomics approaches and their duplication modes and expression profiles were analysed. Seven out of 10 banana ethylene gene families evolved through WGD and four of them (1-aminocyclopropane-1-carboxylate synthase (ACS), ethylene-insensitive 3-like (EIL), ethylene-insensitive 3-binding F-box (EBF) and ethylene response factor (ERF)) were preferentially retained. Banana orthologues of AtEIN3 and AtEIL1, two major genes for ethylene signalling in Arabidopsis, were particularly expanded. This expansion was paralleled by that of EBF genes which are responsible for control of EIL protein levels. Gene expression profiles in banana fruits suggested functional redundancy for several MaEBF and MaEIL genes derived from WGD and subfunctionalization for some of them. We propose that EIL and EBF genes were co-retained after WGD in banana to maintain balanced control of EIL protein levels and thus avoid detrimental effects of constitutive ethylene signalling. In the course of evolution, subfunctionalization was favoured to promote finer control of ethylene signalling. © 2014 CIRAD New Phytologist © 2014 New Phytologist Trust.
Fingert, John H; Robin, Alan L; Scheetz, Todd E; Kwon, Young H; Liebmann, Jeffrey M; Ritch, Robert; Alward, Wallace L M
2016-08-01
To investigate the role of TANK-binding kinase 1 ( TBK1 ) gene copy-number variations (ie, gene duplications and triplications) in the pathophysiology of various open-angle glaucomas. In previous studies, we discovered that copy-number variations in the TBK1 gene are associated with normal-tension glaucoma. Here, we investigated the prevalence of copy-number variations in cohorts of patients with other open-angle glaucomas-juvenile-onset open-angle glaucoma (n=30), pigmentary glaucoma (n=209), exfoliation glaucoma (n=225), and steroid-induced glaucoma (n=79)-using a quantitative polymerase chain reaction assay. No TBK1 gene copy-number variations were detected in patients with juvenile-onset open-angle glaucoma, pigmentary glaucoma, or steroid-induced glaucoma. A TBK1 gene duplication was detected in one (0.44%) of the 225 exfoliation glaucoma patients. TBK1 gene copy-number variations (gene duplications and triplications) have been previously associated with normal-tension glaucoma. An exploration of other open-angle glaucomas detected a TBK1 copy-number variation in a patient with exfoliation glaucoma, which is the first example of a TBK1 mutation in a glaucoma patient with a diagnosis other than normal-tension glaucoma. A broader phenotypic range may be associated with TBK1 copy-number variations, although mutations in this gene are most often detected in patients with normal-tension glaucoma.
Yang, Yaodong; Mason, Annaliese S.; Lei, Xintao; Ma, Zilong
2013-01-01
MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events. PMID:23951162
Xiao, Yong; Xia, Wei; Yang, Yaodong; Mason, Annaliese S; Lei, Xintao; Ma, Zilong
2013-01-01
MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events.
El Chehadeh, Salima; Faivre, Laurence; Mosca-Boidron, Anne-Laure; Malan, Valérie; Amiel, Jeanne; Nizon, Mathilde; Touraine, Renaud; Prieur, Fabienne; Pasquier, Laurent; Callier, Patrick; Lefebvre, Mathilde; Marle, Nathalie; Dubourg, Christèle; Julia, Sophie; Sarret, Catherine; Francannet, Christine; Laffargue, Fanny; Boespflug-Tanguy, Odile; David, Albert; Isidor, Bertrand; Le Caignec, Cédric; Vigneron, Jacqueline; Leheup, Bruno; Lambert, Laetitia; Philippe, Christophe; Cuisset, Jean-Marie; Andrieux, Joris; Plessis, Ghislaine; Toutain, Annick; Goldenberg, Alice; Cormier-Daire, Valérie; Rio, Marlène; Bonnefont, Jean-Paul; Thevenon, Julien; Echenne, Bernard; Journel, Hubert; Afenjar, Alexandra; Burglen, Lydie; Bienvenu, Thierry; Addor, Marie-Claude; Lebon, Sébastien; Martinet, Danièle; Baumann, Clarisse; Perrin, Laurence; Drunat, Séverine; Jouk, Pierre-Simon; Devillard, Françoise; Coutton, Charles; Lacombe, Didier; Delrue, Marie-Ange; Philip, Nicole; Moncla, Anne; Badens, Catherine; Perreton, Nathalie; Masurel, Alice; Thauvin-Robinet, Christel; Des Portes, Vincent; Guibaud, Laurent
2016-01-01
Xq28 duplications encompassing MECP2 have been described in male patients with a severe neurodevelopmental disorder associated with hypotonia and spasticity, severe learning disability, stereotyped movements, and recurrent pulmonary infections. We report on standardized brain magnetic resonance imaging (MRI) data of 30 affected patients carrying an Xq28 duplication involving MECP2 of various sizes (228 kb to 11.7 Mb). The aim of this study was to seek recurrent malformations and attempt to determine whether variations in imaging features could be explained by differences in the size of the duplications. We showed that 93% of patients had brain MRI abnormalities such as corpus callosum abnormalities (n = 20), reduced volume of the white matter (WM) (n = 12), ventricular dilatation (n = 9), abnormal increased hyperintensities on T2-weighted images involving posterior periventricular WM (n = 6), and vermis hypoplasia (n = 5). The occipitofrontal circumference varied considerably between >+2SD in five patients and <-2SD in four patients. Among the nine patients with dilatation of the lateral ventricles, six had a duplication involving L1CAM. The only patient harboring bilateral posterior subependymal nodular heterotopia also carried an FLNA gene duplication. We could not demonstrate a correlation between periventricular WM hyperintensities/delayed myelination and duplication of the IKBKG gene. We thus conclude that patients with an Xq28 duplication involving MECP2 share some similar but non-specific brain abnormalities. These imaging features, therefore, could not constitute a diagnostic clue. The genotype-phenotype correlation failed to demonstrate a relationship between the presence of nodular heterotopia, ventricular dilatation, WM abnormalities, and the presence of FLNA, L1CAM, or IKBKG, respectively, in the duplicated segment. © 2015 Wiley Periodicals, Inc.
Large Deletions of TSPAN12 Cause Familial Exudative Vitreoretinopathy (FEVR).
Seo, Soo Hyun; Kim, Man Jin; Park, Sung Wook; Kim, Jeong Hun; Yu, Young Suk; Song, Ji Yun; Cho, Sung Im; Ahn, Joo Hyun; Oh, Yeon Hee; Lee, Jee-Soo; Lee, Seungjun; Seong, Moon-Woo; Park, Sung Sup; Kim, Ji Yeon
2016-12-01
Familial exudative vitreoretinopathy (FEVR) is a rare, hereditary visual disorder. The gene TSPAN12 is associated with autosomal dominant inheritance of FEVR. The prevalence and impact of large deletions/duplications of TSPAN12 on FEVR patients is unknown. To glean better insight of TSPAN12 on FEVR pathology, herein, we describe three FEVR patients with TSPAN12 deletions. Thirty-three Korean FEVR patients, who previously screened negative for TSPAN12 mutations, mutations in other FEVR-associated genes such as NDP, FZD4, LRP5, and large deletions and duplications of NDP, FZD4, and LRP5, were selected for TSPAN12 large deletion and duplication analyses. Semiquantitative multiplex PCR for TSPAN12 gene dosage analyses were performed, followed by droplet digital PCR (ddPCR) for validation. Among the 33 patients, three patients were confirmed to carry large TSPAN12 deletions. Two of them had whole-gene deletions of TSPAN12, and the other patient possessed a deletion of TSPAN12 in exon 4. FEVR severity detected in these patients was not more severe than in a patient with TSPAN12 point mutation. Regarding previously reported proportions of FEVR-associated genes contributing to the disorder's autosomal dominant inheritance pattern in Korea, we determined that patients with TSPAN12 large deletions were more common than patients with single nucleotide variants in TSPAN12. Evaluating TSPAN12 large deletions and duplications should be considered in FEVR screening and diagnosis as well as in routine genetic workups for FEVR patients.
Picinelli, Chiara; Lintas, Carla; Piras, Ignazio Stefano; Gabriele, Stefano; Sacco, Roberto; Brogna, Claudia; Persico, Antonio Maria
2016-12-01
Rare and common CNVs can contribute to the etiology of neurodevelopmental disorders. One of the recurrent genomic aberrations associated with these phenotypes and proposed as a susceptibility locus is the 15q11.2 BP1-BP2 CNV encompassing TUBGCP5, CYFIP1, NIPA2, and NIPA1. Characterizing by array-CGH a cohort of 243 families with various neurodevelopmental disorders, we identified five patients carrying the 15q11.2 duplication and one carrying the deletion. All CNVs were confirmed by qPCR and were inherited, except for one duplication where parents were not available. The phenotypic spectrum of CNV carriers was broad but mainly neurodevelopmental, in line with all four genes being implicated in axonal growth and neural connectivity. Phenotypically normal and mildly affected carriers complicate the interpretation of this aberration. This variability may be due to reduced penetrance or altered gene dosage on a particular genetic background. We evaluated the expression levels of the four genes in peripheral blood RNA and found the expected reduction in the deleted case, while duplicated carriers displayed high interindividual variability. These data suggest that differential expression of these genes could partially account for differences in clinical phenotypes, especially among duplication carriers. Furthermore, urinary Mg 2+ levels appear negatively correlated with NIPA2 gene copy number, suggesting they could potentially represent a useful biomarker, whose reliability will need replication in larger samples. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Yan, Jun; Cai, Zhonghua
2010-12-10
The cytochrome P450 (CYP) superfamily is a multifunctional hemethiolate enzyme that is widely distributed from Bacteria to Eukarya. The CYP3 family contains mainly the four subfamilies CYP3A, CYP3B, CYP3C and CYP3D in vertebrates; however, only the Actinopterygii (ray-finned fish) have all four subfamilies and detailed understanding of the evolutionary relationship of Actinopterygii CYP3 family members would be valuable. Phylogenetic relationships were constructed to trace the evolutionary history of the Actinopterygii CYP3 family genes. Selection analysis, relative rate tests and functional divergence analysis were combined to interpret the relationship of the site-specific evolution and functional divergence in the Actinopterygii CYP3 family. The results showed that the four CYP3 subfamilies in Actinopterygii might be formed by gene duplication. The first gene duplication event was responsible for divergence of the CYP3B/C clusters from ancient CYP3 before the origin of the Actinopterygii, which corresponded to the fish-specific whole genome duplication (WGD). Tandem repeat duplication in each of the homologue clusters produced stable CYP3B, CYP3C, CYP3A and CYP3D subfamilies. Acceleration of asymmetric evolutionary rates and purifying selection together were the main force for the production of new subfamilies and functional divergence in the new subset after gene duplication, whereas positive selection was detected only in the retained CYP3A subfamily. Furthermore, nearly half of the functional divergence sites appear to be related to substrate recognition, which suggests that site-specific evolution is closely related with functional divergence in the Actinopterygii CYP3 family. The split of fish-specific CYP3 subfamilies was related to the fish-specific WGD, and site-specific acceleration of asymmetric evolutionary rates and purifying selection was the main force for the origin of the new subfamilies and functional divergence in the new subset after gene duplication. Site-specific evolution in substrate recognition was related to functional divergence in the Actinopterygii CYP3 family.
Kim, Seon-Hee; Bae, Young-An
2017-09-01
Tyrosinase provides an essential activity during egg production in diverse platyhelminths by mediating sclerotization of eggshells. In this study, we investigated the genomic and evolutionary features of tyrosinases in parasitic platyhelminths whose genomic information is available. A pair of paralogous tyrosinases was detected in most trematodes, whereas they were lost in cyclophyllidean cestodes. A pseudophyllidean cestode displaying egg biology similar to that of trematodes possessed an orthologous gene. Interestingly, one of the paralogous tyrosinases appeared to have been multiplied into three copies in Clonorchis sinensis and Opisthorchis viverrini. In addition, a fifth tyrosinase gene that was minimally transcribed through all developmental stages was further detected in these opisthorchiid genomes. Phylogenetic analyses demonstrated that the tyrosinase gene has undergone duplication at least three times in platyhelminths. The additional opisthorchiid gene arose from the first duplication. A paralogous copy generated from these gene duplications, except for the last one, seemed to be lost in the major neodermatans lineages. In C. sinensis, tyrosinase gene expressions were initiated following sexual maturation and the levels were significantly enhanced by the presence of O2 and bile. Taken together, our data suggest that tyrosinase has evolved lineage-specifically across platyhelminths related to its copy number and induction mechanism.
Bartlett, Madelaine E; Specht, Chelsea D
2010-07-01
*The MADS box transcription factor family has long been identified as an important contributor to the control of floral development. It is often hypothesized that the evolution of floral development across angiosperms and within specific lineages may occur as a result of duplication, functional diversification, and changes in regulation of MADS box genes. Here we examine the role of Globosa (GLO)-like genes, members of the B-class MADS box gene lineage, in the evolution of floral development within the monocot order Zingiberales. *We assessed changes in perianth and stamen whorl morphology in a phylogenetic framework. We identified GLO homologs (ZinGLO1-4) from 50 Zingiberales species and investigated the evolution of this gene lineage. Expression of two GLO homologs was assessed in Costus spicatus and Musa basjoo. *Based on the phylogenetic data and expression results, we propose several family-specific losses and gains of GLO homologs that appear to be associated with key morphological changes. The GLO-like gene lineage has diversified concomitant with the evolution of the dimorphic perianth and the staminodial labellum. *Duplications and expression divergence within the GLO-like gene lineage may have played a role in floral diversification in the Zingiberales.
A conserved segmental duplication within ELA.
Brinkmeyer-Langford, C L; Murphy, W J; Childers, C P; Skow, L C
2010-12-01
The assembled genomic sequence of the horse major histocompatibility complex (MHC) (equine lymphocyte antigen, ELA) is very similar to the homologous human HLA, with the notable exception of a large segmental duplication at the boundary of ELA class I and class III that is absent in HLA. The segmental duplication consists of a ∼ 710 kb region of at least 11 repeated blocks: 10 blocks each contain an MHC class I-like sequence and the helicase domain portion of a BAT1-like sequence, and the remaining unit contains the full-length BAT1 gene. Similar genomic features were found in other Perissodactyls, indicating an ancient origin, which is consistent with phylogenetic analyses. Reverse-transcriptase PCR (RT-PCR) of mRNA from peripheral white blood cells of healthy and chronically or acutely infected horses detected transcription from predicted open reading frames in several of the duplicated blocks. This duplication is not present in the sequenced MHCs of most other mammals, although a similar feature at the same relative position is present in the feline MHC (FLA). Striking sequence conservation throughout Perissodactyl evolution is consistent with a functional role for at least some of the genes included within this segmental duplication. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.
Virts, Elizabeth L; Jankowska, Anna; Mackay, Craig; Glaas, Marcel F; Wiek, Constanze; Kelich, Stephanie L; Lottmann, Nadine; Kennedy, Felicia M; Marchal, Christophe; Lehnert, Erik; Scharf, Rüdiger E; Dufour, Carlo; Lanciotti, Marina; Farruggia, Piero; Santoro, Alessandra; Savasan, Süreyya; Scheckenbach, Kathrin; Schipper, Jörg; Wagenmann, Martin; Lewis, Todd; Leffak, Michael; Farlow, Janice L; Foroud, Tatiana M; Honisch, Ellen; Niederacher, Dieter; Chakraborty, Sujata C; Vance, Gail H; Pruss, Dmitry; Timms, Kirsten M; Lanchbury, Jerry S; Alpi, Arno F; Hanenberg, Helmut
2015-09-15
Fanconi anemia (FA) is a rare inherited disorder clinically characterized by congenital malformations, progressive bone marrow failure and cancer susceptibility. At the cellular level, FA is associated with hypersensitivity to DNA-crosslinking genotoxins. Eight of 17 known FA genes assemble the FA E3 ligase complex, which catalyzes monoubiquitination of FANCD2 and is essential for replicative DNA crosslink repair. Here, we identify the first FA patient with biallelic germline mutations in the ubiquitin E2 conjugase UBE2T. Both mutations were aluY-mediated: a paternal deletion and maternal duplication of exons 2-6. These loss-of-function mutations in UBE2T induced a cellular phenotype similar to biallelic defects in early FA genes with the absence of FANCD2 monoubiquitination. The maternal duplication produced a mutant mRNA that could encode a functional protein but was degraded by nonsense-mediated mRNA decay. In the patient's hematopoietic stem cells, the maternal allele with the duplication of exons 2-6 spontaneously reverted to a wild-type allele by monoallelic recombination at the duplicated aluY repeat, thereby preventing bone marrow failure. Analysis of germline DNA of 814 normal individuals and 850 breast cancer patients for deletion or duplication of UBE2T exons 2-6 identified the deletion in only two controls, suggesting aluY-mediated recombinations within the UBE2T locus are rare and not associated with an increased breast cancer risk. Finally, a loss-of-function germline mutation in UBE2T was detected in a high-risk breast cancer patient with wild-type BRCA1/2. Cumulatively, we identified UBE2T as a bona fide FA gene (FANCT) that also may be a rare cancer susceptibility gene. © The Author 2015. Published by Oxford University Press.
2013-01-01
Background The sequenced genomes of cucumber, melon and watermelon have relatively few R-genes, with 70, 75 and 55 copies only, respectively. The mechanism for low copy number of R-genes in Cucurbitaceae genomes remains unknown. Results Manual annotation of R-genes in the sequenced genomes of Cucurbitaceae species showed that approximately half of them are pseudogenes. Comparative analysis of R-genes showed frequent loss of R-gene loci in different Cucurbitaceae species. Phylogenetic analysis, data mining and PCR cloning using degenerate primers indicated that Cucurbitaceae has limited number of R-gene lineages (subfamilies). Comparison between R-genes from Cucurbitaceae and those from poplar and soybean suggested frequent loss of R-gene lineages in Cucurbitaceae. Furthermore, the average number of R-genes per lineage in Cucurbitaceae species is approximately 1/3 that in soybean or poplar. Therefore, both loss of lineages and deficient duplications in extant lineages accounted for the low copy number of R-genes in Cucurbitaceae. No extensive chimeras of R-genes were found in any of the sequenced Cucurbitaceae genomes. Nevertheless, one lineage of R-genes from Trichosanthes kirilowii, a wild Cucurbitaceae species, exhibits chimeric structures caused by gene conversions, and may contain a large number of distinct R-genes in natural populations. Conclusions Cucurbitaceae species have limited number of R-gene lineages and each genome harbors relatively few R-genes. The scarcity of R-genes in Cucurbitaceae species was due to frequent loss of R-gene lineages and infrequent duplications in extant lineages. The evolutionary mechanisms for large variation of copy number of R-genes in different plant species were discussed. PMID:23682795
Yang, Yizi; Luo, Damin
2013-01-01
Inferring evolutionary history of parasitism genes is important to understand how evolutionary mechanisms affect the occurrences of parasitism genes. In this study, we constructed multiple domain trees for parasitism genes and genes under free-living conditions. Further analyses of horizontal gene transfer (HGT)-like phylogenetic incongruences, duplications, and speciations were performed based on these trees. By comparing these analyses, the contributions of pre-adaptations were found to be more important to the evolution of parasitism genes than those of duplications, and pre-adaptations are as crucial as previously reported HGTs to parasitism. Furthermore, speciation may also affect the evolution of parasitism genes. In addition, Pristionchus pacificus was suggested to be a common model organism for studies of parasitic nematodes, including root-knot species. These analyses provided information regarding mechanisms that may have contributed to the evolution of parasitism genes.
Tang, Yuehui; Qin, Shanshan; Guo, Yali; Chen, Yanbo; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang
2016-01-01
The AP2/ERF transcription factors play crucial roles in plant growth, development and responses to biotic and abiotic stresses. A total of 119 AP2/ERF genes (JcAP2/ERFs) have been identified in the physic nut genome; they include 16 AP2, 4 RAV, 1 Soloist, and 98 ERF genes. Phylogenetic analysis indicated that physic nut AP2 genes could be divided into 3 subgroups, while ERF genes could be classed into 11 groups or 43 subgroups. The AP2/ERF genes are non-randomly distributed across the 11 linkage groups of the physic nut genome and retain many duplicates which arose from ancient duplication events. The expression patterns of several JcAP2/ERF duplicates in the physic nut showed differences among four tissues (root, stem, leaf, and seed), and 38 JcAP2/ERF genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots according to analysis of digital gene expression tag data. The expression of JcERF011 was downregulated by salinity stress in physic nut roots. Overexpression of the JcERF011 gene in rice plants increased its sensitivity to salinity stress. The increased expression levels of several salt tolerance-related genes were impaired in the JcERF011-overexpressing plants under salinity stress. PMID:26943337
Tang, Yuehui; Qin, Shanshan; Guo, Yali; Chen, Yanbo; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang
2016-01-01
The AP2/ERF transcription factors play crucial roles in plant growth, development and responses to biotic and abiotic stresses. A total of 119 AP2/ERF genes (JcAP2/ERFs) have been identified in the physic nut genome; they include 16 AP2, 4 RAV, 1 Soloist, and 98 ERF genes. Phylogenetic analysis indicated that physic nut AP2 genes could be divided into 3 subgroups, while ERF genes could be classed into 11 groups or 43 subgroups. The AP2/ERF genes are non-randomly distributed across the 11 linkage groups of the physic nut genome and retain many duplicates which arose from ancient duplication events. The expression patterns of several JcAP2/ERF duplicates in the physic nut showed differences among four tissues (root, stem, leaf, and seed), and 38 JcAP2/ERF genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots according to analysis of digital gene expression tag data. The expression of JcERF011 was downregulated by salinity stress in physic nut roots. Overexpression of the JcERF011 gene in rice plants increased its sensitivity to salinity stress. The increased expression levels of several salt tolerance-related genes were impaired in the JcERF011-overexpressing plants under salinity stress.
Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D
2009-05-13
The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins,EF-1 and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3-35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7-13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5-26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84-113 million years for the divergence of all butterfly families. These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation.
Molecular evolution of GPCRs: GLP1/GLP1 receptors.
Hwang, Jong-Ik; Yun, Seongsik; Moon, Mi Jin; Park, Cho Rong; Seong, Jae Young
2014-06-01
Glucagon-like peptide 1 (GLP1) is an intestinal incretin that regulates glucose homeostasis through stimulation of insulin secretion from pancreatic β-cells and inhibits appetite by acting on the brain. Thus, it is a promising therapeutic agent for the treatment of type 2 diabetes mellitus and obesity. Studies using synteny and reconstructed ancestral chromosomes suggest that families for GLP1 and its receptor (GLP1R) have emerged through two rounds (2R) of whole genome duplication and local gene duplications before and after 2R. Exon duplications have also contributed to the expansion of the peptide family members. Specific changes in the amino acid sequence following exon/gene/genome duplications have established distinct yet related peptide and receptor families. These specific changes also confer selective interactions between GLP1 and GLP1R. In this review, we present a possible macro (genome level)- and micro (gene/exon level)-evolution mechanisms of GLP1 and GLP1R, which allows them to acquire selective interactions between this ligand-receptor pair. This information may provide critical insight for the development of potent therapeutic agents targeting GLP1R. © 2014 Society for Endocrinology.
Evolutionary Patterns of RNA-Based Duplication in Non-Mammalian Chordates
Li, Xin; Vibranovski, Maria D.; Gan, Xiaoni; Wang, Dengqiang; Wang, Wen; Long, Manyuan; He, Shunping
2011-01-01
The role of RNA-based duplication, or retroposition, in the evolution of new gene functions in mammals, plants, and Drosophila has been widely reported. However, little is known about RNA-based duplication in non-mammalian chordates. In this study, we screened ten non-mammalian chordate genomes for retrocopies and investigated their evolutionary patterns. We identified numerous retrocopies in these species. Examination of the age distribution of these retrocopies revealed no burst of young retrocopies in ancient chordate species. Upon comparing these non-mammalian chordate species to the mammalian species, we observed that a larger fraction of the non-mammalian retrocopies was under strong evolutionary constraints than mammalian retrocopies are, as evidenced by signals of purifying selection and expression profiles. For the Western clawed frog, Medaka, and Sea squirt, many retrogenes have evolved gonad and brain expression patterns, similar to what was observed in human. Testing of retrogene movement in the Medaka genome, where the nascent sex chrosomes have been well assembled, did not reveal any significant gene movement. Taken together, our analyses demonstrate that RNA-based duplication generates many functional genes and can make a significant contribution to the evolution of non-mammalian genomes. PMID:21779328
Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph
2016-01-01
Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.
Saqran, Lubna; Herrick, Scott P.; Frosch, Matthew P.; Hyman, Bradley T.
2017-01-01
Activity-dependent synaptic plasticity plays a critical role in the refinement of circuitry during postnatal development and may be disrupted in conditions that cause intellectual disability, such as Down syndrome (DS). To test this hypothesis, visual cortical plasticity was assessed in Ts65Dn mice that harbor a chromosomal duplication syntenic to human chromosome 21q. We find that Ts65Dn mice demonstrate a defect in ocular dominance plasticity (ODP) following monocular deprivation. This phenotype is similar to that of transgenic mice that express amyloid precursor protein (APP), which is duplicated in DS and in Ts65DN mice; however, normalizing APP gene copy number in Ts65Dn mice fails to rescue plasticity. Ts1Rhr mice harbor a duplication of the telomeric third of the Ts65Dn-duplicated sequence and demonstrate the same ODP defect, suggesting a gene or genes sufficient to drive the phenotype are located in that smaller duplication. In addition, we find that Ts65Dn mice demonstrate an abnormality in olfactory system connectivity, a defect in the refinement of connections to second-order neurons in the olfactory bulb. Ts1Rhr mice do not demonstrate a defect in glomerular refinement, suggesting that distinct genes or sets of genes underlie visual and olfactory system phenotypes. Importantly, these data suggest that developmental plasticity and connectivity are impaired in sensory systems in DS model mice, that such defects may contribute to functional impairment in DS, and that these phenotypes, present in male and female mice, provide novel means for examining the genetic and molecular bases for neurodevelopmental impairment in model mice in vivo. SIGNIFICANCE STATEMENT Our understanding of the basis for intellectual impairment in Down syndrome is hindered by the large number of genes duplicated in Trisomy 21 and a lack of understanding of the effect of disease pathology on the function of neural circuits in vivo. This work describes early postnatal developmental abnormalities in visual and olfactory sensory systems in Down syndrome model mice, which provide insight into defects in the function of neural circuits in vivo and provide an approach for exploring the genetic and molecular basis for impairment in the disease. In addition, these findings raise the possibility that basic dysfunction in primary sensory circuitry may illustrate mechanisms important for global learning and cognitive impairment in Down syndrome patients. PMID:28899917
The Goddard and Saturn Genes Are Essential for Drosophila Male Fertility and May Have Arisen De Novo
Gubala, Anna M.; Schmitz, Jonathan F.; Kearns, Michael J.; Vinh, Tery T.; Bornberg-Bauer, Erich; Wolfner, Mariana F.
2017-01-01
New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. PMID:28104747
Targeted tandem duplication of a large chromosomal segment in Aspergillus oryzae.
Takahashi, Tadashi; Sato, Atsushi; Ogawa, Masahiro; Hanya, Yoshiki; Oguma, Tetsuya
2014-08-01
We describe here the first successful construction of a targeted tandem duplication of a large chromosomal segment in Aspergillus oryzae. The targeted tandem chromosomal duplication was achieved by using strains that had a 5'-deleted pyrG upstream of the region targeted for tandem chromosomal duplication and a 3'-deleted pyrG downstream of the target region. Consequently,strains bearing a 210-kb targeted tandem chromosomal duplication near the centromeric region of chromosome 8 and strains bearing a targeted tandem chromosomal duplication of a 700-kb region of chromosome 2 were successfully constructed. The strains bearing the tandem chromosomal duplication were efficiently obtained from the regenerated protoplast of the parental strains. However, the generation of the chromosomal duplication did not depend on the introduction of double-stranded breaks(DSBs) by I-SceI. The chromosomal duplications of these strains were stably maintained after five generations of culture under nonselective conditions. The strains bearing the tandem chromosomal duplication in the 700-kb region of chromosome 2 showed highly increased protease activity in solid-state culture, indicating that the duplication of large chromosomal segments could be a useful new breeding technology and gene analysis method.
Gong, Pichang; Ao, Xiang; Liu, Gaixiu; Cheng, Fangyun; He, Chaoying
2017-03-01
Herbaceous peony (Paeonia lactiflora) is a globally important ornamental plant. Spontaneous floral mutations occur frequently during cultivation, and are selected as a way to release new cultivars, but the underlying evolutionary developmental genetics remain largely elusive. Here, we investigated a collection of spontaneous corolla mutational plants (SCMPs) whose other floral organs were virtually unaffected. Unlike the corolla in normal plants (NPs) that withered soon after fertilization, the transformed corolla (petals) in SCMPs was greenish and persistent similar to the calyx (sepals). Epidermal cellular morphology of the SCMP corolla was also similar to that of calyx cells, further suggesting a sepaloid corolla in SCMPs. Ten floral MADS-box genes from these Paeonia plants were comparatively characterized with respect to sequence and expression. Codogenic sequence variation of these MADS-box genes was not linked to corolla changes in SCMPs. However, we found that both APETALA3 (AP3) and PISTILLATA (PI) lineages of B-class MADS-box genes were duplicated, and subsequent selective expression alterations of these genes were closely associated with the origin of SCMPs. AP3-PI obligate heterodimerization, essential for organ identity of corolla and stamens, was robustly detected. However, selective down-regulation of these duplicated genes might result in a reduction of this obligate heterodimer concentration in a corolla-specific manner, leading to the sepaloid corolla in SCMPs, thus representing a new sepaloid corolla model taking advantage of gene duplication. Our work suggests that modifying floral MADS-box genes could facilitate the breeding of novel cultivars with distinct floral morphology in ornamental plants, and also provides new insights into the functional evolution of the MADS-box genes in plants. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Chen, Lin; Dong, Chuanju; Kong, Shengnan; Zhang, Jiangfan; Li, Xuejun; Xu, Peng
2017-09-05
Bone morphogenetic proteins (Bmps) are a group of signaling molecules known to play important roles during formation and maintenance of various organs, not only bone, but also muscle, blood and so on. Common carp (Cyprinus carpio) is one of the most intensively studied fish due to its economic and environmental importance. Besides, common carp has encountered an additional round of whole genome duplication (WGD) compared with many closely related diploid teleost, which make it one of the most important models for genome evolutionary studies in teleost. Comprehensive genome resources of common carp have been developed recently, which facilitate the thorough characterization of bmp gene family in the tetraploidized common carp genome. We identified a total of 44 bmps from the common carp genome, which are twice as many as that of zebrafish. Phylogenetic analysis revealed that most of bmps are highly conserved. Comparative analysis was performed across six typical vertebrate genomes. It appeared that all the bmp genes in common carp were duplicated. Obviously, the expansion of the bmp gene family in common carp was due to the latest additional round of whole genome duplication and made it more abundant than other diploid teleosts. Expression signatures were assessed in major tissues, including gill, intestine, liver, spleen, skin, heart, gonad, muscle, kidney, head kidney, brain and blood, which demonstrated the comprehensive expression profiles of bmp genes in the tetraploidized genome. Significant gene expression divergences were observed which revealed substantial functional divergences of those duplicated bmp genes post the latest WGD event. The conserved synteny blocks of bmp5s revealed the genome rearrangement of common carp post the 4R WGD. The whole set of bmp gene family in common carp provides insight into gene fate of tetraploidized common carp genome post recent WGD. Copyright © 2017. Published by Elsevier B.V.
Global Metabolic Reconstruction and Metabolic Gene Evolution in the Cattle Genome
Kim, Woonsu; Park, Hyesun; Seo, Seongwon
2016-01-01
The sequence of cattle genome provided a valuable opportunity to systematically link genetic and metabolic traits of cattle. The objectives of this study were 1) to reconstruct genome-scale cattle-specific metabolic pathways based on the most recent and updated cattle genome build and 2) to identify duplicated metabolic genes in the cattle genome for better understanding of metabolic adaptations in cattle. A bioinformatic pipeline of an organism for amalgamating genomic annotations from multiple sources was updated. Using this, an amalgamated cattle genome database based on UMD_3.1, was created. The amalgamated cattle genome database is composed of a total of 33,292 genes: 19,123 consensus genes between NCBI and Ensembl databases, 8,410 and 5,493 genes only found in NCBI or Ensembl, respectively, and 266 genes from NCBI scaffolds. A metabolic reconstruction of the cattle genome and cattle pathway genome database (PGDB) was also developed using Pathway Tools, followed by an intensive manual curation. The manual curation filled or revised 68 pathway holes, deleted 36 metabolic pathways, and added 23 metabolic pathways. Consequently, the curated cattle PGDB contains 304 metabolic pathways, 2,460 reactions including 2,371 enzymatic reactions, and 4,012 enzymes. Furthermore, this study identified eight duplicated genes in 12 metabolic pathways in the cattle genome compared to human and mouse. Some of these duplicated genes are related with specific hormone biosynthesis and detoxifications. The updated genome-scale metabolic reconstruction is a useful tool for understanding biology and metabolic characteristics in cattle. There has been significant improvements in the quality of cattle genome annotations and the MetaCyc database. The duplicated metabolic genes in the cattle genome compared to human and mouse implies evolutionary changes in the cattle genome and provides a useful information for further research on understanding metabolic adaptations of cattle. PMID:26992093
Goettel, Wolfgang; Ramirez, Martha; Upchurch, Robert G; An, Yong-Qiang Charles
2016-08-01
Identification and characterization of a 254-kb genomic deletion on a duplicated chromosome segment that resulted in a low level of palmitic acid in soybean seeds using transcriptome sequencing. A large number of soybean genotypes varying in seed oil composition and content have been identified. Understanding the molecular mechanisms underlying these variations is important for breeders to effectively utilize them as a genetic resource. Through design and application of a bioinformatics approach, we identified nine co-regulated gene clusters by comparing seed transcriptomes of nine soybean genotypes varying in oil composition and content. We demonstrated that four gene clusters in the genotypes M23, Jack and N0304-303-3 coincided with large-scale genome rearrangements. The co-regulated gene clusters in M23 and Jack mapped to a previously described 164-kb deletion and a copy number amplification of the Rhg1 locus, respectively. The coordinately down-regulated gene clusters in N0304-303-3 were caused by a 254-kb deletion containing 19 genes including a fatty acyl-ACP thioesterase B gene (FATB1a). This deletion was associated with reduced palmitic acid content in seeds and was the molecular cause of a previously reported nonfunctional FATB1a allele, fap nc . The M23 and N0304-304-3 deletions were located in duplicated genome segments retained from the Glycine-specific whole genome duplication that occurred 13 million years ago. The homoeologous genes in these duplicated regions shared a strong similarity in both their encoded protein sequences and transcript accumulation levels, suggesting that they may have conserved and important functions in seeds. The functional conservation of homoeologous genes may result in genetic redundancy and gene dosage effects for their associated seed traits, explaining why the large deletion did not cause lethal effects or completely eliminate palmitic acid in N0304-303-3.
Lorin, Thibault; Brunet, Frédéric G.; Laudet, Vincent; Volff, Jean-Nicolas
2018-01-01
Vertebrate pigmentation is a highly diverse trait mainly determined by neural crest cell derivatives. It has been suggested that two rounds (1R/2R) of whole-genome duplications (WGDs) at the basis of vertebrates allowed changes in gene regulation associated with neural crest evolution. Subsequently, the teleost fish lineage experienced other WGDs, including the teleost-specific Ts3R before teleost radiation and the more recent Ss4R at the basis of salmonids. As the teleost lineage harbors the highest number of pigment cell types and pigmentation diversity in vertebrates, WGDs might have contributed to the evolution and diversification of the pigmentation gene repertoire in teleosts. We have compared the impact of the basal vertebrate 1R/2R duplications with that of the teleost-specific Ts3R and salmonid-specific Ss4R WGDs on 181 gene families containing genes involved in pigmentation. We show that pigmentation genes (PGs) have been globally more frequently retained as duplicates than other genes after Ts3R and Ss4R but not after the early 1R/2R. This is also true for non-pigmentary paralogs of PGs, suggesting that the function in pigmentation is not the sole key driver of gene retention after WGDs. On the long-term, specific categories of PGs have been repeatedly preferentially retained after ancient 1R/2R and Ts3R WGDs, possibly linked to the molecular nature of their proteins (e.g., DNA binding transcriptional regulators) and their central position in protein-protein interaction networks. Taken together, our results support a major role of WGDs in the diversification of the pigmentation gene repertoire in the teleost lineage, with a possible link with the diversity of pigment cell lineages observed in these animals compared to other vertebrates. PMID:29599177
Du, Xinxin; Liu, Yuezhong; Liu, Jinxiang; Zhang, Quanqi
2016-01-01
Following the two rounds of whole-genome duplication (WGD) during deuterosome evolution, a third genome duplication occurred in the ray-fined fish lineage and is considered to be responsible for the teleost-specific lineage diversification and regulation mechanisms. As a receptor-regulated SMAD (R-SMAD), the function of SMAD3 was widely studied in mammals. However, limited information of its role or putative paralogs is available in ray-finned fishes. In this study, two SMAD3 paralogs were first identified in the transcriptome and genome of Japanese flounder (Paralichthys olivaceus). We also explored SMAD3 duplication in other selected species. Following identification, genomic structure, phylogenetic reconstruction, and synteny analyses performed by MrBayes and online bioinformatic tools confirmed that smad3a/3b most likely originated from the teleost-specific WGD. Additionally, selection pressure analysis and expression pattern of the two genes performed by PAML and quantitative real-time PCR (qRT-PCR) revealed evidence of subfunctionalization of the two SMAD3 paralogs in teleost. Our results indicate that two SMAD3 genes originate from teleost-specific WGD, remain transcriptionally active, and may have likely undergone subfunctionalization. This study provides novel insights to the evolution fates of smad3a/3b and draws attentions to future function analysis of SMAD3 gene family. PMID:27703851
Evolution and Expression of Tissue Globins in Ray-Finned Fishes.
Gallagher, Michael D; Macqueen, Daniel J
2017-01-01
The globin gene family encodes oxygen-binding hemeproteins conserved across the major branches of multicellular life. The origins and evolutionary histories of complete globin repertoires have been established for many vertebrates, but there remain major knowledge gaps for ray-finned fish. Therefore, we used phylogenetic, comparative genomic and gene expression analyses to discover and characterize canonical “non-blood” globin family members (i.e., myoglobin, cytoglobin, neuroglobin, globin-X, and globin-Y) across multiple ray-finned fish lineages, revealing novel gene duplicates (paralogs) conserved from whole genome duplication (WGD) and small-scale duplication events. Our key findings were that: (1) globin-X paralogs in teleosts have been retained from the teleost-specific WGD, (2) functional paralogs of cytoglobin, neuroglobin, and globin-X, but not myoglobin, have been conserved from the salmonid-specific WGD, (3) triplicate lineage-specific myoglobin paralogs are conserved in arowanas (Osteoglossiformes), which arose by tandem duplication and diverged under positive selection, (4) globin-Y is retained in multiple early branching fish lineages that diverged before teleosts, and (5) marked variation in tissue-specific expression of globin gene repertoires exists across ray-finned fish evolution, including several previously uncharacterized sites of expression. In this respect, our data provide an interesting link between myoglobin expression and the evolution of air breathing in teleosts. Together, our findings demonstrate great-unrecognized diversity in the repertoire and expression of nonblood globins that has arisen during ray-finned fish evolution.
History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis.
Gazave, Eve; Guillou, Aurélien; Balavoine, Guillaume
2014-01-01
The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway. We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models. Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages.
Auguste, Aurélie; Bessière, Laurianne; Todeschini, Anne-Laure; Caburet, Sandrine; Sarnacki, Sabine; Prat, Jaime; D'angelo, Emanuela; De La Grange, Pierre; Ariste, Olivier; Lemoine, Fréderic; Legois, Bérangère; Sultan, Charles; Zider, Alain; Galmiche, Louise; Kalfa, Nicolas; Veitia, Reiner A
2015-12-01
Juvenile granulosa cell tumors (JGCTs) of the ovary are pediatric neoplasms representing 5% of all granulosa cell tumors (GCTs). Most GCTs are of adult type (AGCTs) and bear a mutation in the FOXL2 gene. The molecular basis of JGCTs is poorly understood, although mutations in the GNAS gene have been reported. We have detected in-frame duplications within the oncogene AKT1 in >60% of the JGCTs studied. Here, to evaluate the functional impact of these duplications and the existence of potential co-driver alterations, we have sequenced the transcriptome of four JGCTs and compared them with control transcriptomes. A search for gene variants detected only private alterations probably unrelated with tumorigenesis, suggesting that tandem duplications are the best candidates to underlie tumor formation in the absence of GNAS alterations. We previously showed that the duplications were specific to JGCTs. However, the screening of eight AGCTs samples without FOXL2 mutation showed the existence of an AKT1 duplication in one case, also having a stromal luteoma. The analysis of RNA-Seq data pinpointed a series of differentially expressed genes, involved in cytokine and hormone signaling and cell division-related processes. Further analyses pointed to the existence of a possible dedifferentiation process and suggested that most of the transcriptomic dysregulation might be mediated by a limited set of transcription factors perturbed by AKT1 activation. Finally, we show that commercially available AKT inhibitors can modulate the in vitro activity of various mutated forms. These results shed light on the pathogenesis of JGCTs and provide therapeutic leads for a targeted treatment. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Matoso, Eunice; Melo, Joana B; Ferreira, Susana I; Jardim, Ana; Castelo, Teresa M; Weise, Anja; Carreira, Isabel M
2013-08-01
An insertional translocation (IT) can result in pure segmental aneusomy for the inserted genomic segment allowing to define a more accurate clinical phenotype. Here, we report on two siblings sharing an unbalanced IT inherited from the mother with a history of learning difficulty. An 8-year-old girl with developmental delay, speech disability, and attention-deficit hyperactivity disorder (ADHD), showed by GTG banding analysis a subtle interstitial alteration in 21q21. Oligonucleotide array comparative genomic hybridization (array-CGH) analysis showed a 4q13.1-q13.3 duplication spanning 8.6 Mb. Fluorescence in situ hybridization (FISH) with bacterial artificial chromosome (BAC) clones confirmed the rearrangement, a der(21)ins(21;4)(q21;q13.1q13.3). The duplication described involves 50 RefSeq genes including the EPHA5 gene that encodes for the EphA5 receptor involved in embryonic development of the brain and also in synaptic remodeling and plasticity thought to underlie learning and memory. The same rearrangement was observed in a younger brother with behavioral problems and also exhibiting ADHD. ADHD is among the most heritable of neuropsychiatric disorders. There are few reports of patients with duplications involving the proximal region of 4q and a mild phenotype. To the best of our knowledge this is the first report of a duplication restricted to band 4q13. This abnormality could be easily missed in children who have nonspecific cognitive impairment. The presence of this behavioral disorder in the two siblings reinforces the hypothesis that the region involved could include genes involved in ADHD. Copyright © 2013 Wiley Periodicals, Inc.
Ye, Jun-jie; Ma, Li; Yang, Li-juan; Wang, Jin-huan; Wang, Yue-li; Guo, Hai; Gong, Ning; Nie, Wen-hui; Zhao, Shu-hua
2013-09-01
There are many reports on associations between spermatogenesis and partial azoospermia factor c (AZFc) deletions as well as duplications; however, results are conflicting, possibly due to differences in methodology and ethnic background. The purpose of this study is to investigate the association of AZFc polymorphisms and male infertility in the Yi ethnic population, residents within Yunnan Province, China. A total of 224 infertile patients and 153 fertile subjects were selected in the Yi ethnic population. The study was performed by sequence-tagged site plus/minus (STS+/-) analysis followed by gene dosage and gene copy definition analysis. Y haplotypes of 215 cases and 115 controls were defined by 12 binary markers using single nucleotide polymorphism on Y chromosome (Y-SNP) multiplex assays based on single base primer extension technology. The distribution of Y haplotypes was not significantly different between the case and control groups. The frequencies of both gr/gr (7.6% vs. 8.5%) and b2/b3 (6.3% vs. 8.5%) deletions do not show significant differences. Similarly, single nucleotide variant (SNV) analysis shows no significant difference of gene copy definition between the cases and controls. However, the frequency of partial duplications in the infertile group (4.0%) is significantly higher than that in the control group (0.7%). Further, we found a case with sY1206 deletion which had two CDY1 copies but removed half of DAZ genes. Our results show that male infertility is associated with partial AZFc duplications, but neither gr/gr nor b2/b3 deletions, suggesting that partial AZFc duplications rather than deletions are risk factors for male infertility in Chinese-Yi population.
On the Origin and Evolutionary History of NANOG
Vivien, Céline; Kodjabachian, Laurent; Demeneix, Barbara; Coen, Laurent; Girardot, Fabrice
2014-01-01
Though pluripotency is well characterized in mammals, many questions remain to be resolved regarding its evolutionary history. A necessary prerequisite for addressing this issue is to determine the phylogenetic distributions and orthology relationships of the transcription factor families sustaining or modulating this property. In mammals, the NANOG homeodomain transcription factor is one of the core players in the pluripotency network. However, its evolutionary history has not been thoroughly studied, hindering the interpretation of comparative studies. To date, the NANOG family was thought to be monogenic, with numerous pseudogenes described in mammals, including a tandem duplicate in Hominidae. By examining a wide-array of craniate genomes, we provide evidence that the NANOG family arose at the latest in the most recent common ancestor of osteichthyans and that NANOG genes are frequently found as tandem duplicates in sarcopterygians and as a single gene in actinopterygians. Their phylogenetic distribution is thus reminiscent of that recently shown for Class V POU paralogues, another key family of pluripotency-controlling factors. However, while a single ancestral duplication has been reported for the Class V POU family, we suggest that multiple independent duplication events took place during evolution of the NANOG family. These multiple duplications could have contributed to create a layer of complexity in the control of cell competence and pluripotency, which could explain the discrepancies relative to the functional evolution of this important gene family. Further, our analysis does not support the hypothesis that loss of NANOG and emergence of the preformation mode of primordial germ cell specification are causally linked. Our study therefore argues for the need of further functional comparisons between NANOG paralogues, notably regarding the novel duplicates identified in sauropsids and non-eutherian mammals. PMID:24465486
Patel, Vidushi S; Cooper, Steven JB; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer AM
2008-01-01
Background Vertebrate alpha (α)- and beta (β)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the α- and β-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil β-globin gene (ω) in the marsupial α-cluster, however, suggested that duplication of the α-β cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous α- and β-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. Results The platypus α-globin cluster (chromosome 21) contains embryonic and adult α- globin genes, a β-like ω-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-ζ-ζ'-αD-α3-α2-α1-ω-GBY-3'. The platypus β-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-ε-β-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate α-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal β-globin clusters are embedded in olfactory genes. Thus, the mammalian α- and β-globin clusters are orthologous to the bird α- and β-globin clusters respectively. Conclusion We propose that α- and β-globin clusters evolved from an ancient MPG-C16orf35-α-β-GBY-LUC7L arrangement 410 million years ago. A copy of the original β (represented by ω in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of β-globin genes with different expression profiles in different lineages. PMID:18657265
Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility.
Nuttle, Xander; Giannuzzi, Giuliana; Duyzend, Michael H; Schraiber, Joshua G; Narvaiza, Iñigo; Sudmant, Peter H; Penn, Osnat; Chiatante, Giorgia; Malig, Maika; Huddleston, John; Benner, Chris; Camponeschi, Francesca; Ciofi-Baffoni, Simone; Stessman, Holly A F; Marchetto, Maria C N; Denman, Laura; Harshman, Lana; Baker, Carl; Raja, Archana; Penewit, Kelsi; Janke, Nicolette; Tang, W Joyce; Ventura, Mario; Banci, Lucia; Antonacci, Francesca; Akey, Joshua M; Amemiya, Chris T; Gage, Fred H; Reymond, Alexandre; Eichler, Evan E
2016-08-11
Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates, including more recently the genomes of archaic hominins. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for approximately 1% of cases of autism and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. Here we reconstruct the evolutionary history of the locus and identify bolA family member 2 (BOLA2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95-kilobase-pair segment containing BOLA2 duplicated across the critical region approximately 282 thousand years ago (ka), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carried one or more copies of the duplication, which nearly fixed early in the human lineage--a pattern unlikely to have arisen so rapidly in the absence of selection (P < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11. rearrangement show that more than 96% of breakpoints occur within the H. sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the H. sapiens lineage about 282 ka simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.
Perkins, David D.
1975-01-01
Heterokaryon (vegetative) incompatibility, governing the fusion of somatic hyphal filaments to form stable heterokaryons, is of interest because of its widespread occurrence in fungi and its bearing on cellular recognition. Conventional investigations of the genetic basis of heterokaryon incompatibility in N. crassa are difficult because in commonly used stocks differences are present at several het loci, all with similar incompatibility phenotypes. This difficulty is overcome by using duplications (partial diploids) that are unlikely to contain more than one het locus. A phenotypically expressed incompatibility reaction occurs when unlike het alleles are present within the same somatic nucleus, and this parallels the heterokaryon incompatibility reaction that occurs when unlike alleles in different haploid nuclei are introduced into the same somatic hypha by mycelial fusion.—Nontandem duplications were used to confirm that the incompatibility reactions in heterokaryons and in duplications are alternate expressions of the same genes. This was demonstrated for three loci which had previously been established by conventional heterokaryon tests—het-e, het-c and mt. These were each obtained in duplications as recombinant meiotic segregants from crosses heterozygous for duplication-generating chromosome rearrangements. The particular method of producing the duplications is irrelevant so long as the incompatibility alleles are heterozygous.—The duplication technique has made it possible to determine easily the het-e and het-c genotypes of numerous laboratory and wild strains of unknown constitution. In laboratory strains both loci are represented simply by two alleles. Analysis of het-c is more complicated in some wild strains, where differences have been demonstrated at one or more additional het loci within the duplication used and multiple allelism is also possible.—The results show that the duplication method can be used to identify and map additional vegetative incompatibility loci, without the necessity of heterokaryon tests. PMID:124288
Melin, Amanda D; Danosi, Christina F; McCracken, Gary F; Dominy, Nathaniel J
2014-12-01
A nocturnal bottleneck during mammalian evolution left a majority of species with two cone opsins, or dichromatic color vision. Primate trichromatic vision arose from the duplication and divergence of an X-linked opsin gene, and is long attributed to tandem shifts from nocturnality to diurnality and from insectivory to frugivory. Opsin gene variation and at least one duplication event exist in the order Chiroptera, suggesting that trichromatic vision could evolve under favorable ecological conditions. The natural history of the Samoan flying fox (Pteropus samoensis) meets these conditions--it is a large bat that consumes nectar and fruit and demonstrates strong diurnal proclivities. It also possesses a visual system that is strikingly similar to that of primates. To explore the potential for opsin gene duplication and divergence in this species, we sequenced the opsin genes of 11 individuals (19 X-chromosomes) from three South Pacific islands. Our results indicate the uniform presence of two opsins with predicted peak sensitivities of ca. 360 and 553 nm. This result fails to support a causal link between diurnal frugivory and trichromatic vision, although it remains plausible that the diurnal activities of P. samoensis have insufficient antiquity to favor opsin gene renovation.
Giorgio, Elisa; Rolyan, Harshvardhan; Kropp, Laura; Chakka, Anish Baswanth; Yatsenko, Svetlana; Gregorio, Eleonora Di; Lacerenza, Daniela; Vaula, Giovanna; Talarico, Flavia; Mandich, Paola; Toro, Camilo; Pierre, Eleonore Eymard; Labauge, Pierre; Capellari, Sabina; Cortelli, Pietro; Vairo, Filippo Pinto; Miguel, Diego; Stubbolo, Danielle; Marques, Lourenco Charles; Gahl, William; Boespflug-Tanguy, Odile; Melberg, Atle; Hassin-Baer, Sharon; Cohen, Oren S; Pjontek, Rastislav; Grau, Armin; Klopstock, Thomas; Fogel, Brent; Meijer, Inge; Rouleau, Guy; Bouchard, Jean-Pierre L; Ganapathiraju, Madhavi; Vanderver, Adeline; Dahl, Niklas; Hobson, Grace; Brusco, Alfredo; Brussino, Alessandro; Padiath, Quasar Saleem
2013-01-01
ABSTRACT Autosomal dominant leukodystrophy (ADLD) is an adult onset demyelinating disorder that is caused by duplications of the lamin B1 (LMNB1) gene. However, as only a few cases have been analyzed in detail, the mechanisms underlying LMNB1 duplications are unclear. We report the detailed molecular analysis of the largest collection of ADLD families studied, to date. We have identified the minimal duplicated region necessary for the disease, defined all the duplication junctions at the nucleotide level and identified the first inverted LMNB1 duplication. We have demonstrated that the duplications are not recurrent; patients with identical duplications share the same haplotype, likely inherited from a common founder and that the duplications originated from intrachromosomal events. The duplication junction sequences indicated that nonhomologous end joining or replication-based mechanisms such fork stalling and template switching or microhomology-mediated break induced repair are likely to be involved. LMNB1 expression was increased in patients’ fibroblasts both at mRNA and protein levels and the three LMNB1 alleles in ADLD patients show equal expression, suggesting that regulatory regions are maintained within the rearranged segment. These results have allowed us to elucidate duplication mechanisms and provide insights into allele-specific LMNB1 expression levels. PMID:23649844
STRIDE: Species Tree Root Inference from Gene Duplication Events.
Emms, David M; Kelly, Steven
2017-12-01
The correct interpretation of any phylogenetic tree is dependent on that tree being correctly rooted. We present STRIDE, a fast, effective, and outgroup-free method for identification of gene duplication events and species tree root inference in large-scale molecular phylogenetic analyses. STRIDE identifies sets of well-supported in-group gene duplication events from a set of unrooted gene trees, and analyses these events to infer a probability distribution over an unrooted species tree for the location of its root. We show that STRIDE correctly identifies the root of the species tree in multiple large-scale molecular phylogenetic data sets spanning a wide range of timescales and taxonomic groups. We demonstrate that the novel probability model implemented in STRIDE can accurately represent the ambiguity in species tree root assignment for data sets where information is limited. Furthermore, application of STRIDE to outgroup-free inference of the origin of the eukaryotic tree resulted in a root probability distribution that provides additional support for leading hypotheses for the origin of the eukaryotes. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Stø, Ida M.; Orr, Russell J. S.; Fooyontphanich, Kim; Jin, Xu; Knutsen, Jonfinn M. B.; Fischer, Urs; Tranbarger, Timothy J.; Nordal, Inger; Aalen, Reidunn B.
2015-01-01
The peptide INFLORESCENCE DEFICIENT IN ABSCISSION (IDA), which signals through the leucine-rich repeat receptor-like kinases HAESA (HAE) and HAESA-LIKE2 (HSL2), controls different cell separation events in Arabidopsis thaliana. We hypothesize the involvement of this signaling module in abscission processes in other plant species even though they may shed other organs than A. thaliana. As the first step toward testing this hypothesis from an evolutionarily perspective we have identified genes encoding putative orthologs of IDA and its receptors by BLAST searches of publically available protein, nucleotide and genome databases for angiosperms. Genes encoding IDA or IDA-LIKE (IDL) peptides and HSL proteins were found in all investigated species, which were selected as to represent each angiosperm order with available genomic sequences. The 12 amino acids representing the bioactive peptide in A. thaliana have virtually been unchanged throughout the evolution of the angiosperms; however, the number of IDL and HSL genes varies between different orders and species. The phylogenetic analyses suggest that IDA, HSL2, and the related HSL1 gene, were present in the species that gave rise to the angiosperms. HAE has arisen from HSL1 after a genome duplication that took place after the monocot—eudicots split. HSL1 has also independently been duplicated in the monocots, while HSL2 has been lost in gingers (Zingiberales) and grasses (Poales). IDA has been duplicated in eudicots to give rise to functionally divergent IDL peptides. We postulate that the high number of IDL homologs present in the core eudicots is a result of multiple whole genome duplications (WGD). We substantiate the involvement of IDA and HAE/HSL2 homologs in abscission by providing gene expression data of different organ separation events from various species. PMID:26579174
Teng, Linhong; Han, Wentao; Fan, Xiao; Xu, Dong; Zhang, Xiaowen; Dittami, Simon M.; Ye, Naihao
2017-01-01
Lipoxygenase (LOX) plays important roles in fatty acid oxidation and lipid mediator biosynthesis. In this study, we give first insights into brown algal LOX evolution. Whole genome searches revealed four, three, and eleven LOXs in Ectocarpus siliculosus, Cladosiphon okamuranus, and Saccharina japonica, respectively. In phylogenetic analyses, LOXs from brown algae form a robust clade with those from prokaryotes, suggesting an ancestral origin and slow evolution. Brown algal LOXs were divided into two clades, C1 and C2 in a phylogenetic tree. Compared to the two species of Ectocarpales, LOX gene expansion occurred in the kelp S. japonica through tandem duplication and segmental duplication. Selection pressure analysis showed that LOX genes in brown algae have undergone strong purifying selection, while the selective constraint in the C2 clade was more relaxed than that in the C1 clade. Furthermore, within each clade, LOXs of S. japonica evolved under more relaxed selection constraints than E. siliculosus and C. okamuranus. Structural modeling showed that unlike LOXs of plants and animals, which contain a β barrel in the N-terminal part of the protein, LOXs in brown algae fold into a single domain. Analysis of previously published transcriptomic data showed that LOXs in E. siliculosus are responsive to hyposaline, hypersaline, oxidative, and copper stresses. Moreover, clear divergence of expression patterns was observed among different life stages, as well as between duplicate gene pairs. In E. siliculosus, all four LOXs are male-biased in immature gametophytes, and mature gametophytes showed significantly higher LOX mRNA levels than immature gametophytes and sporophytes. In S. japonica, however, our RNA-Seq data showed that most LOXs are highly expressed in sporophytes. Even the most recently duplicated gene pairs showed divergent expression patterns, suggesting that functional divergence has likely occurred since LOX genes duplicated, which potentially contributes to the production of various oxylipins in brown algae. PMID:29234336
Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi
2015-02-15
WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.
Dworschak, G C; Crétolle, C; Hilger, A; Engels, H; Korsch, E; Reutter, H; Ludwig, M
2017-05-01
Partial duplications of the long arm of chromosome 3, dup(3q), are a rare but well-described condition, sharing features of Cornelia de Lange syndrome. Around two thirds of cases are derived from unbalanced translocations, whereas pure dup(3q) have rarely been reported. Here, we provide an extensive review of the literature on dup(3q). This search revealed several patients with caudal malformations and anomalies, suggesting that caudal malformations or anomalies represent an inherent phenotypic feature of dup(3q). In this context, we report a patient with a pure de novo duplication 3q26.32-q27.2. The patient had the clinical diagnosis of Currarino syndrome (CS) (characterized by the triad of sacral anomalies, anorectal malformations and a presacral mass) and additional features, frequently detected in patients with a dup(3q). Mutations within the MNX1 gene were found to be causative in CS but no MNX1 mutation could be detected in our patient. Our comprehensive search for candidate genes located in the critical region of the duplication 3q syndrome, 3q26.3-q27, revealed a so far neglected phenotypic overlap of dup(3q) and the Pierpont syndrome, associated with a mutation of the TBL1XR1 gene on 3q26.32. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Fawcett, Jeffrey A.; Maere, Steven; Van de Peer, Yves
2009-01-01
Most flowering plants have been shown to be ancient polyploids that have undergone one or more whole genome duplications early in their evolution. Furthermore, many different plant lineages seem to have experienced an additional, more recent genome duplication. Starting from paralogous genes lying in duplicated segments or identified in large expressed sequence tag collections, we dated these youngest duplication events through penalized likelihood phylogenetic tree inference. We show that a majority of these independent genome duplications are clustered in time and seem to coincide with the Cretaceous–Tertiary (KT) boundary. The KT extinction event is the most recent mass extinction caused by one or more catastrophic events such as a massive asteroid impact and/or increased volcanic activity. These events are believed to have generated global wildfires and dust clouds that cut off sunlight during long periods of time resulting in the extinction of ≈60% of plant species, as well as a majority of animals, including dinosaurs. Recent studies suggest that polyploid species can have a higher adaptability and increased tolerance to different environmental conditions. We propose that polyploidization may have contributed to the survival and propagation of several plant lineages during or following the KT extinction event. Due to advantages such as altered gene expression leading to hybrid vigor and an increased set of genes and alleles available for selection, polyploid plants might have been better able to adapt to the drastically changed environment 65 million years ago. PMID:19325131
Brinch-Pedersen, Henrik
2013-01-01
The phytase activity in food and feedstuffs is an important nutritional parameter. Members of the Triticeae tribe accumulate purple acid phosphatase phytases (PAPhy) during grain filling. This accumulation elevates mature grain phytase activities (MGPA) up to levels between ~650 FTU/kg for barley and 6000 FTU/kg for rye. This is notably more than other cereals. For instance, rice, maize, and oat have MGPAs below 100 FTU/kg. The cloning and characterization of the PAPhy gene complement from wheat, barley, rye, einkorn, and Aegilops tauschii is reported here. The Triticeae PAPhy genes generally consist of a set of paralogues, PAPhy_a and PAPhy_b, and have been mapped to Triticeae chromosomes 5 and 3, respectively. The promoters share a conserved core but the PAPhy_a promoter have acquired a novel cis-acting regulatory element for expression during grain filling while the PAPhy_b promoter has maintained the archaic function and drives expression during germination. Brachypodium is the only sequenced Poaceae sharing the PAPhy duplication. As for the Triticeae, the duplication is reflected in a high MGPA of ~4200 FTU/kg in Brachypodium. The sequence conservation of the paralogous loci on Brachypodium chromosomes 1 and 2 does not extend beyond the PAPhy gene. The results indicate that a single-gene segmental duplication may have enabled the evolution of high MGPA by creating functional redundancy of the parent PAPhy gene. This implies that similar MGPA levels may be out of reach in breeding programs for some Poaceae, e.g. maize and rice, whereas Triticeae breeders should focus on PAPhy_a. PMID:23918958
On Computing Breakpoint Distances for Genomes with Duplicate Genes.
Shao, Mingfu; Moret, Bernard M E
2017-06-01
A fundamental problem in comparative genomics is to compute the distance between two genomes in terms of its higher level organization (given by genes or syntenic blocks). For two genomes without duplicate genes, we can easily define (and almost always efficiently compute) a variety of distance measures, but the problem is NP-hard under most models when genomes contain duplicate genes. To tackle duplicate genes, three formulations (exemplar, maximum matching, and any matching) have been proposed, all of which aim to build a matching between homologous genes so as to minimize some distance measure. Of the many distance measures, the breakpoint distance (the number of nonconserved adjacencies) was the first one to be studied and remains of significant interest because of its simplicity and model-free property. The three breakpoint distance problems corresponding to the three formulations have been widely studied. Although we provided last year a solution for the exemplar problem that runs very fast on full genomes, computing optimal solutions for the other two problems has remained challenging. In this article, we describe very fast, exact algorithms for these two problems. Our algorithms rely on a compact integer-linear program that we further simplify by developing an algorithm to remove variables, based on new results on the structure of adjacencies and matchings. Through extensive experiments using both simulations and biological data sets, we show that our algorithms run very fast (in seconds) on mammalian genomes and scale well beyond. We also apply these algorithms (as well as the classic orthology tool MSOAR) to create orthology assignment, then compare their quality in terms of both accuracy and coverage. We find that our algorithm for the "any matching" formulation significantly outperforms other methods in terms of accuracy while achieving nearly maximum coverage.
Zou, Zhi; Huang, Qixing; Xie, Guishui; Yang, Lifu
2018-01-10
Papain-like cysteine proteases (PLCPs) are a class of proteolytic enzymes involved in many plant processes. Compared with the extensive research in Arabidopsis thaliana, little is known in castor bean (Ricinus communis) and physic nut (Jatropha curcas), two Euphorbiaceous plants without any recent whole-genome duplication. In this study, a total of 26 or 23 PLCP genes were identified from the genomes of castor bean and physic nut respectively, which can be divided into nine subfamilies based on the phylogenetic analysis: RD21, CEP, XCP, XBCP3, THI, SAG12, RD19, ALP and CTB. Although most of them harbor orthologs in Arabidopsis, several members in subfamilies RD21, CEP, XBCP3 and SAG12 form new groups or subgroups as observed in other species, suggesting specific gene loss occurred in Arabidopsis. Recent gene duplicates were also identified in these two species, but they are limited to the SAG12 subfamily and were all derived from local duplication. Expression profiling revealed diverse patterns of different family members over various tissues. Furthermore, the evolution characteristics of PLCP genes were also compared and discussed. Our findings provide a useful reference to characterize PLCP genes and investigate the family evolution in Euphorbiaceae and species beyond.
Small homologous blocks in phytophthora genomes do not point to an ancient whole-genome duplication.
van Hooff, Jolien J E; Snel, Berend; Seidl, Michael F
2014-05-01
Genomes of the plant-pathogenic genus Phytophthora are characterized by small duplicated blocks consisting of two consecutive genes (2HOM blocks) and by an elevated abundance of similarly aged gene duplicates. Both properties, in particular the presence of 2HOM blocks, have been attributed to a whole-genome duplication (WGD) at the last common ancestor of Phytophthora. However, large intraspecies synteny-compelling evidence for a WGD-has not been detected. Here, we revisited the WGD hypothesis by deducing the age of 2HOM blocks. Two independent timing methods reveal that the majority of 2HOM blocks arose after divergence of the Phytophthora lineages. In addition, a large proportion of the 2HOM block copies colocalize on the same scaffold. Therefore, the presence of 2HOM blocks does not support a WGD at the last common ancestor of Phytophthora. Thus, genome evolution of Phytophthora is likely driven by alternative mechanisms, such as bursts of transposon activity.
Sugimoto, Naohisa; Takakura, Yasuaki; Shiraki, Kentaro; Honda, Shinya; Takaya, Naoki; Hoshino, Takayuki; Nakamura, Akira
2013-01-01
To obtain a selection marker gene functional in a thermophilic bacterium, Thermus thermophilus, an in vivo-directed evolutionary strategy was conducted on a hygromycin B phosphotransferase gene (hyg) from Streptomyces hygroscopicus. The expression of wild-type hyg in T. thermophilus provided hygromycin B (HygB) resistance up to 60 °C. Through selection of mutants showing HygB resistance at higher temperatures, eight amino acid substitutions and the duplication of three amino acids were identified. A variant containing seven substitutions and the duplication (HYG10) showed HygB resistance at a highest temperature of 74 °C. Biochemical and biophysical analyses of recombinant HYG and HYG10 revealed that HYG10 was in fact thermostabilized. Modeling of the three-dimensional structure of HYG10 suggests the possible roles of the various substitutions and the duplication on thermostabilization, of which three substitutions and the duplication located at the enzyme surface suggested that these mutations made the enzyme more hydrophilic and provided increased stability in aqueous solution.
Wang, Dongxu; Wang, He; Irfan, Muhammad; Fan, Mingxia; Lin, Feng
2014-08-01
Receptor-like kinase (RLKs) is an important member in protein kinase family which is widely involved in plant growth, development and defense responses. It is significant to analyze the kinase structure and evolution of pollen RLKs in order to study their mechanisms. In our study, 64 and 73 putative pollen RLKs were chosen from maize and Arabidopsis. Phylogenetic analysis showed that the pollen RLKs were conservative and might had existed before divergence between monocot and dicot which were mainly concentrated in RLCK-VII and LRR-III two subfamilies. Chromosomal localization and gene duplication analysis showed the expansion of pollen RLKs were mainly caused by segmental duplication. By calculating Ka/Ks value of extracellular domain, intracellular domain and kinase domain in pollen RLKs, we found that the pollen RLKs duplicated genes had mainly experienced the purifying selection, while maize might have experienced weaker purifying selection. Meanwhile, extracellular domain might have experienced stronger diversifying selection than intracellular domain in both species. Estimation of duplication time showed that the duplication events of Arabidopsis have occurred approximately between 18 and 69 million years ago, compared to 0.67-170 million years ago of maize. Copyright © 2014 Elsevier Ltd. All rights reserved.
Yang, Xiping; Wang, Jianping
2016-01-01
The nucleotide-binding site (NBS)–leucine-rich repeat (LRR) gene family is crucially important for offering resistance to pathogens. To explore evolutionary conservation and variability of NBS-LRR genes across grass species, we identified 88, 107, 24, and 44 full-length NBS-LRR genes in sorghum, rice, maize, and Brachypodium, respectively. A comprehensive analysis was performed on classification, genome organization, evolution, expression, and regulation of these NBS-LRR genes using sorghum as a representative of grass species. In general, the full-length NBS-LRR genes are highly clustered and duplicated in sorghum genome mainly due to local duplications. NBS-LRR genes have basal expression levels and are highly potentially targeted by miRNA. The number of NBS-LRR genes in the four grass species is positively correlated with the gene clustering rate. The results provided a valuable genomic resource and insights for functional and evolutionary studies of NBS-LRR genes in grass species. PMID:26792976
Almeida, Ana Maria R; Yockteng, Roxana; Otoni, Wagner C; Specht, Chelsea D
2015-01-01
The ABC model of flower development describes the molecular basis for specification of floral organ identity in model eudicots such as Arabidopsis and Antirrhinum. According to this model, expression of C-class genes is linked to stamen and gynoecium organ identity. The Zingiberales is an order of tropical monocots in which the evolution of floral morphology is characterized by a marked increase in petaloidy in the androecium. Petaloidy is a derived characteristic of the ginger families and seems to have arisen in the common ancestor of the ginger clade. We hypothesize that duplication of the C-class AGAMOUS (AG) gene followed by divergence of the duplicated AG copies during the diversification of the ginger clade lineages explains the evolution of petaloidy in the androecium. In order to address this hypothesis, we carried out phylogenetic analyses of the AG gene family across the Zingiberales and investigated patterns of gene expression within the androecium. Phylogenetic analysis supports a scenario in which Zingiberales-specific AG genes have undergone at least one round of duplication. Gene duplication was immediately followed by divergence of the retained copies. In particular, we detect positive selection in the third alpha-helix of the K domain of Zingiberales AGAMOUS copy 1 (ZinAG-1). A single fixed amino acid change is observed in ZinAG-1 within the ginger clade when compared to the banana grade. Expression analyses of AG and APETALA1/FRUITFULL (AP1/FUL) in Musa basjoo is similar to A- and C-class gene expressions in the Arabidopsis thaliana model, while Costus spicatus exhibits simultaneous expression of AG and AP1/FUL in most floral organs. We propose that this novel expression pattern could be correlated with the evolution of androecial petaloidy within the Zingiberales. Our results present an intricate story in which duplication of the AG lineage has lead to the retention of at least two diverged Zingiberales-specific copies, ZinAG-1 and Zingiberales AGAMOUS copy 2 (ZinAG-2). Positive selection on ZinAG-1 residues suggests a mechanism by which AG gene divergence may explain observed morphological changes in Zingiberales flowers. Expression data provides preliminary support for the proposed mechanism, although further studies are required to fully test this hypothesis.
2013-01-01
Background Cytokinins (CKs) have significant roles in various aspects of plant growth and development, and they are also involved in plant stress adaptations. The fine-tuning of the controlled CK levels in individual tissues, cells, and organelles is properly maintained by isopentenyl transferases (IPTs) and cytokinin oxidase/dehydrogenases (CKXs). Chinese cabbage is one of the most economically important vegetable crops worldwide. The whole genome sequencing of Brassica rapa enables us to perform the genome-wide identification and functional analysis of the IPT and CKX gene families. Results In this study, a total of 13 BrIPT genes and 12 BrCKX genes were identified. The gene structures, conserved domains and phylogenetic relationships were analyzed. The isoelectric point, subcellular localization and glycosylation sites of the proteins were predicted. Segmental duplicates were found in both BrIPT and BrCKX gene families. We also analyzed evolutionary patterns and divergence of the IPT and CKX genes in the Cruciferae family. The transcription levels of BrIPT and BrCKX genes were analyzed to obtain an initial picture of the functions of these genes. Abiotic stress elements related to adverse environmental stimuli were found in the promoter regions of BrIPT and BrCKX genes and they were confirmed to respond to drought and high salinity conditions. The effects of 6-BA and ABA on the expressions of BrIPT and BrCKX genes were also investigated. Conclusions The expansion of BrIPT and BrCKX genes after speciation from Arabidopsis thaliana is mainly attributed to segmental duplication events during the whole genome triplication (WGT) and substantial duplicated genes are lost during the long evolutionary history. Genes produced by segmental duplication events have changed their expression patterns or may adopted new functions and thus are obtained. BrIPT and BrCKX genes respond well to drought and high salinity stresses, and their transcripts are affected by exogenous hormones, such as 6-BA and ABA, suggesting their potential roles in abiotic stress conditions and regulatory mechanisms of plant hormone homeostasis. The appropriate modulation of endogenous CKs levels by IPT and CKX genes is a promising approach for developing economically important high-yielding and high-quality stress-tolerant crops in agriculture. PMID:24001366
Axelsen, Jacob Bock; Yan, Koon-Kiu; Maslov, Sergei
2007-01-01
Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw") duplication and deletion rates rdup∗, rdel∗ which include gene copies that will be removed soon after the duplication event and their dramatically reduced long-term counterparts rdup, rdel. High deletion rate among recently duplicated proteins is consistent with a scenario in which they didn't have enough time to significantly change their functional roles and thus are to a large degree disposable. Systematic trends of each of the four duplication/deletion rates with the total number of genes in the genome were analyzed. All but the deletion rate of recent duplicates rdel∗ were shown to systematically increase with Ngenes. Abnormally flat shapes of sequence identity histograms observed for yeast and human are consistent with lineages leading to these organisms undergoing one or more whole-genome duplications. This interpretation is corroborated by our analysis of the genome of Paramecium tetraurelia where the p-4 profile of the histogram is gradually restored by the successive removal of paralogs generated in its four known whole-genome duplication events. PMID:18039386
Krsticevic, Flavia J; Arce, Débora P; Ezpeleta, Joaquín; Tapia, Elizabeth
2016-10-13
In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. Copyright © 2016 Krsticevic et al.
2014-01-01
Background Expansins are plant cell wall loosening proteins that are involved in cell enlargement and a variety of other developmental processes. The expansin superfamily contains four subfamilies; namely, α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB). Although the genome sequencing of soybeans is complete, our knowledge about the pattern of expansion and evolutionary history of soybean expansin genes remains limited. Results A total of 75 expansin genes were identified in the soybean genome, and grouped into four subfamilies based on their phylogenetic relationships. Structural analysis revealed that the expansin genes are conserved in each subfamily, but are divergent among subfamilies. Furthermore, in soybean and Arabidopsis, the expansin gene family has been mainly expanded through tandem and segmental duplications; however, in rice, segmental duplication appears to be the dominant process that generates this superfamily. The transcriptome atlas revealed notable differential expression in either transcript abundance or expression patterns under normal growth conditions. This finding was consistent with the differential distribution of the cis-elements in the promoter region, and indicated wide functional divergence in this superfamily. Moreover, some critical amino acids that contribute to functional divergence and positive selection were detected. Finally, site model and branch-site model analysis of positive selection indicated that the soybean expansin gene superfamily is under strong positive selection, and that divergent selection constraints might have influenced the evolution of the four subfamilies. Conclusion This study demonstrated that the soybean expansin gene superfamily has expanded through tandem and segmental duplication. Differential expression indicated wide functional divergence in this superfamily. Furthermore, positive selection analysis revealed that divergent selection constraints might have influenced the evolution of the four subfamilies. In conclusion, the results of this study contribute novel detailed information about the molecular evolution of the expansin gene superfamily in soybean. PMID:24720629
Krsticevic, Flavia J.; Arce, Débora P.; Ezpeleta, Joaquín; Tapia, Elizabeth
2016-01-01
In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. PMID:27565886
2011-01-01
Background The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. Results The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. Conclusions The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in trans-Atlantic cod populations. PMID:21612617
Andersen, Øivind; De Rosa, Maria Cristina; Pirolli, Davide; Tooming-Klunderud, Ave; Petersen, Petra E; André, Carl
2011-05-25
The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in trans-Atlantic cod populations.
Evolution of two Rh blood group-related genes of the amphioxus species Branchiostoma floridae.
Kitano, Takashi; Satou, Masahiro; Saitou, Naruya
2010-04-01
We determined cDNAs of two genes that belong to the Rhesus (Rh) blood group gene family in an amphioxus species (Branchiostoma floridae) and designated them Rh-related-1 (RhR-1) and Rh-related-2 (RhR-2). RhR-1 and RhR-2 consisted of 10 and 11 exons, respectively. 3' UTR sequences of RhR-1 were shorter (220-272 bp) than those of RhR-2 (1,505-1,650 bp). CDS lengths were 1,344 and 1,476 bp for RhR-1 and RhR-2, respectively, and the average nucleotide difference between their CDS regions was 0.33. The corresponding regions of Rh genes from exons 2 to 7 were relatively conserved among the chordate species examined in this study. Length difference numbers were in multiples of three, which implies that codon frames were conserved among them, and the same exon/intron boundary phases were observed in those regions. This region was used for the phylogenetic analyses. RhR-1 and RhR-2 formed a cluster on the phylogenetic tree of the Rh gene family. Gene duplication time of RhR-1 and RhR-2 was estimated to be ca. 500 million years ago. It is likely that the four Rh family genes in vertebrates emerged by gene duplications in the common ancestor of vertebrates, and functional differentiation has occurred after the first gene duplication.
Rai1 duplication causes physical and behavioral phenotypes in a mouse model of dup(17)(p11.2p11.2)
Walz, Katherina; Paylor, Richard; Yan, Jiong; Bi, Weimin; Lupski, James R.
2006-01-01
Genomic disorders are conditions that result from DNA rearrangements, such as deletions or duplications. The identification of the dosage-sensitive gene(s) within the rearranged genomic interval is important for the elucidation of genes responsible for complex neurobehavioral phenotypes. Smith-Magenis syndrome is associated with a 3.7-Mb deletion in 17p11.2, and its clinical presentation is caused by retinoic acid inducible 1 (RAI1) haploinsufficiency. The reciprocal microduplication syndrome, dup(17)(p11.2p11.2), manifests several neurobehavioral abnormalities, but the responsible dosage-sensitive gene(s) remain undefined. We previously generated a mouse model for dup(17)(p11.2p11.2), Dp(11)17/+, that recapitulated most of the phenotypes observed in human patients. We have now analyzed compound heterozygous mice carrying a duplication [Dp(11)17] in one chromosome 11 along with a null allele of Rai1 in the other chromosome 11 homologue [Dp(11)17/Rai1– mice] in order to study the relationship between Rai1 gene copy number and the Dp(11)17/+ phenotypes. Normal disomic Rai1 gene dosage was sufficient to rescue the complex physical and behavioral phenotypes observed in Dp(11)17/+ mice, despite altered trisomic copy number of the other 18 genes present in the rearranged genomic interval. These data provide a model for variation in copy number of single genes that could influence common traits such as obesity and behavior. PMID:17024248
Chi, Shan; Wu, Shuangxiu; Yu, Jun; Wang, Xumin; Tang, Xuexi; Liu, Tao
2014-01-01
Both Calvin-Benson-Bassham (C3) and Hatch-Slack (C4) cycles are most important autotrophic CO2 fixation pathways on today's Earth. C3 cycle is believed to be originated from cyanobacterial endosymbiosis. However, studies on evolution of different biochemical variants of C4 photosynthesis are limited to tracheophytes and origins of C4-cycle genes are not clear till now. Our comprehensive analyses on bioinformatics and phylogenetics of novel transcriptomic sequencing data of 21 rhodophytes and 19 Phaeophyceae marine species and public genomic data of more algae, tracheophytes, cyanobacteria, proteobacteria and archaea revealed the origin and evolution of C4 cycle-related genes. Almost all of C4-related genes were annotated in extensive algal lineages with proteobacterial or archaeal origins, except for phosphoenolpyruvate carboxykinase (PCK) and aspartate aminotransferase (AST) with both cyanobacterial and archaeal/proteobacterial origin. Notably, cyanobacteria may not possess complete C4 pathway because of the flawed annotation of pyruvate orthophosphate dikinase (PPDK) genes in public data. Most C4 cycle-related genes endured duplication and gave rise to functional differentiation and adaptation in different algal lineages. C4-related genes of NAD-ME (NAD-malic enzyme) and PCK subtypes exist in most algae and may be primitive ones, while NADP-ME (NADP-malic enzyme) subtype genes might evolve from NAD-ME subtype by gene duplication in chlorophytes and tracheophytes.
A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution across the Poales
McKain, Michael R.; Tang, Haibao; McNeal, Joel R.; Ayyampalayam, Saravanaraj; Davis, Jerrold I.; dePamphilis, Claude W.; Givnish, Thomas J.; Pires, J. Chris; Stevenson, Dennis Wm.; Leebens-Mack, James H.
2016-01-01
Comparisons of flowering plant genomes reveal multiple rounds of ancient polyploidy characterized by large intragenomic syntenic blocks. Three such whole-genome duplication (WGD) events, designated as rho (ρ), sigma (σ), and tau (τ), have been identified in the genomes of cereal grasses. Precise dating of these WGD events is necessary to investigate how they have influenced diversification rates, evolutionary innovations, and genomic characteristics such as the GC profile of protein-coding sequences. The timing of these events has remained uncertain due to the paucity of monocot genome sequence data outside the grass family (Poaceae). Phylogenomic analysis of protein-coding genes from sequenced genomes and transcriptome assemblies from 35 species, including representatives of all families within the Poales, has resolved the timing of rho and sigma relative to speciation events and placed tau prior to divergence of Asparagales and the commelinids but after divergence with eudicots. Examination of gene family phylogenies indicates that rho occurred just prior to the diversification of Poaceae and sigma occurred before early diversification of Poales lineages but after the Poales-commelinid split. Additional lineage-specific WGD events were identified on the basis of the transcriptome data. Gene families exhibiting high GC content are underrepresented among those with duplicate genes that persisted following these genome duplications. However, genome duplications had little overall influence on lineage-specific changes in the GC content of coding genes. Improved resolution of the timing of WGD events in monocot history provides evidence for the influence of polyploidization on functional evolution and species diversification. PMID:26988252
A novel X-linked disorder with developmental delay and autistic features.
Kaya, Namik; Colak, Dilek; Albakheet, Albandary; Al-Owain, Mohammad; Abu-Dheim, Nada; Al-Younes, Banan; Al-Zahrani, Jawaher; Mukaddes, Nahit M; Dervent, Aysin; Al-Dosari, Naji; Al-Odaib, Ali; Kayaalp, Inci V; Al-Sayed, Moeenaladin; Al-Hassnan, Zuhair; Nester, Michael J; Al-Dosari, Mohammad; Al-Dhalaan, Hesham; Chedrawi, Aziza; Gunoz, Hulya; Karakas, Bedri; Sakati, Nadia; Alkuraya, Fowzan S; Gascon, Generaso G; Ozand, Pinar T
2012-04-01
Genomic duplications that lead to autism and other human diseases are interesting pathological lesions since the underlying mechanism almost certainly involves dosage sensitive genes. We aim to understand a novel genomic disorder with profound phenotypic consequences, most notably global developmental delay, autism, psychosis, and anorexia nervosa. We evaluated the affected individuals, all maternally related, using childhood autism rating scale (CARS) and Vineland Adaptive scales, magnetic resonance imaging (MRI) and magnetic resonance spectroscopy (MRS) brain, electroencephalography (EEG), electromyography (EMG), muscle biopsy, high-resolution molecular karyotype arrays, Giemsa banding (G-banding) and fluorescent in situ hybridization (FISH) experiments, mitochondrial DNA (mtDNA) sequencing, X-chromosome inactivation study, global gene expression analysis on Epstein-Barr virus (EBV)-transformed lymphoblasts, and quantitative reverse-transcription polymerase chain reaction (qRT-PCR). We have identified a novel Xq12-q13.3 duplication in an extended family. Clinically normal mothers were completely skewed in favor of the normal chromosome X. Global transcriptional profiling of affected individuals and controls revealed significant alterations of genes and pathways in a pattern consistent with previous microarray studies of autism spectrum disorder patients. Moreover, expression analysis revealed copy number-dependent increased messenger RNA (mRNA) levels in affected patients compared to control individuals. A subset of differentially expressed genes was validated using qRT-PCR. Xq12-q13.3 duplication is a novel global developmental delay and autism-predisposing chromosomal aberration; pathogenesis of which may be mediated by increased dosage of genes contained in the duplication, including NLGN3, OPHN1, AR, EFNB1, TAF1, GJB1, and MED12. Copyright © 2011 American Neurological Association.
Solis-Escalante, Daniel; Kuijpers, Niels G. A.; Barrajon-Simancas, Nuria; van den Broek, Marcel; Pronk, Jack T.
2015-01-01
As a result of ancestral whole-genome and small-scale duplication events, the genomes of Saccharomyces cerevisiae and many eukaryotes still contain a substantial fraction of duplicated genes. In all investigated organisms, metabolic pathways, and more particularly glycolysis, are specifically enriched for functionally redundant paralogs. In ancestors of the Saccharomyces lineage, the duplication of glycolytic genes is purported to have played an important role leading to S. cerevisiae's current lifestyle favoring fermentative metabolism even in the presence of oxygen and characterized by a high glycolytic capacity. In modern S. cerevisiae strains, the 12 glycolytic reactions leading to the biochemical conversion from glucose to ethanol are encoded by 27 paralogs. In order to experimentally explore the physiological role of this genetic redundancy, a yeast strain with a minimal set of 14 paralogs was constructed (the “minimal glycolysis” [MG] strain). Remarkably, a combination of a quantitative systems approach and semiquantitative analysis in a wide array of growth environments revealed the absence of a phenotypic response to the cumulative deletion of 13 glycolytic paralogs. This observation indicates that duplication of glycolytic genes is not a prerequisite for achieving the high glycolytic fluxes and fermentative capacities that are characteristic of S. cerevisiae and essential for many of its industrial applications and argues against gene dosage effects as a means of fixing minor glycolytic paralogs in the yeast genome. The MG strain was carefully designed and constructed to provide a robust prototrophic platform for quantitative studies and has been made available to the scientific community. PMID:26071034
Furumizu, Chihiro; Alvarez, John Paul; Sakakibara, Keiko; Bowman, John L.
2015-01-01
Neofunctionalization following gene duplication is thought to be one of the key drivers in generating evolutionary novelty. A gene duplication in a common ancestor of land plants produced two classes of KNOTTED-like TALE homeobox genes, class I (KNOX1) and class II (KNOX2). KNOX1 genes are linked to tissue proliferation and maintenance of meristematic potentials of flowering plant and moss sporophytes, and modulation of KNOX1 activity is implicated in contributing to leaf shape diversity of flowering plants. While KNOX2 function has been shown to repress the gametophytic (haploid) developmental program during moss sporophyte (diploid) development, little is known about KNOX2 function in flowering plants, hindering syntheses regarding the relationship between two classes of KNOX genes in the context of land plant evolution. Arabidopsis plants harboring loss-of-function KNOX2 alleles exhibit impaired differentiation of all aerial organs and have highly complex leaves, phenocopying gain-of-function KNOX1 alleles. Conversely, gain-of-function KNOX2 alleles in conjunction with a presumptive heterodimeric BELL TALE homeobox partner suppressed SAM activity in Arabidopsis and reduced leaf complexity in the Arabidopsis relative Cardamine hirsuta, reminiscent of loss-of-function KNOX1 alleles. Little evidence was found indicative of epistasis or mutual repression between KNOX1 and KNOX2 genes. KNOX proteins heterodimerize with BELL TALE homeobox proteins to form functional complexes, and contrary to earlier reports based on in vitro and heterologous expression, we find high selectivity between KNOX and BELL partners in vivo. Thus, KNOX2 genes confer opposing activities rather than redundant roles with KNOX1 genes, and together they act to direct the development of all above-ground organs of the Arabidopsis sporophyte. We infer that following the KNOX1/KNOX2 gene duplication in an ancestor of land plants, neofunctionalization led to evolution of antagonistic biochemical activity thereby facilitating the evolution of more complex sporophyte transcriptional networks, providing plasticity for the morphological evolution of land plant body plans. PMID:25671434
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kubota, Akira; Bainy, Afonso C.D.; Departamento de Bioquímica, CCB, Universidade Federal de Santa Catarina, Florianopolis, SC 88040-900
2013-10-01
The cytochrome P450 (CYP) 2 gene family is the largest and most diverse CYP gene family in vertebrates. In zebrafish, we have identified 10 genes in a new subfamily, CYP2AA, which does not show orthology to any human or other mammalian CYP genes. Here we report evolutionary and structural relationships of the 10 CYP2AA genes and expression of the first two genes, CYP2AA1 and CYP2AA2. Parsimony reconstruction of the tandem duplication pattern for the CYP2AA cluster suggests that CYP2AA1, CYP2AA2 and CYP2AA3 likely arose in the earlier duplication events and thus are most diverged in function from the other CYP2AAs.more » On the other hand, CYP2AA8 and CYP2AA9 are genes that arose in the latest duplication event, implying functional similarity between these two CYPs. A molecular model of CYP2AA1 showing the sequence conservation across the CYP2AA cluster reveals that the regions with the highest variability within the cluster map onto CYP2AA1 near the substrate access channels, suggesting differing substrate specificities. Zebrafish CYP2AA1 transcript was expressed predominantly in the intestine, while CYP2AA2 was most highly expressed in the kidney, suggesting differing roles in physiology. In the liver CYP2AA2 expression but not that of CYP2AA1, was increased by 1,4-bis [2-(3,5-dichloropyridyloxy)] benzene (TCPOBOP) and, to a lesser extent, by phenobarbital (PB). In contrast, pregnenolone 16α-carbonitrile (PCN) increased CYP2AA1 expression, but not CYP2AA2 in the liver. The results identify a CYP2 subfamily in zebrafish that includes genes apparently induced by PB-type chemicals and PXR agonists, the first concrete in vivo evidence for a PB-type response in fish. - Highlights: • A tandemly duplicated cluster of ten CYP2AA genes was described in zebrafish. • Parsimony and duplication analyses suggest pathways to CYP2AA diversity. • Homology models reveal amino acid positions possibly related to functional diversity. • The CYP2AA locus does not share synteny with any CYP2 subfamily in mammals. • Induction of CYP2AA1 and CYP2AA2 indicates a phenobarbital-type response in fish.« less
Comparative genomics and evolution of the HSP90 family of genes across all kingdoms of organisms.
Chen, Bin; Zhong, Daibin; Monteiro, Antónia
2006-06-17
HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate the nuclear genomes of 32 species across all kingdoms of organisms, and all sequences available in GenBank, and address the diversity, evolution, gene structure, conservation and nomenclature of the HSP90 family of genes across all organisms. Twelve new genes and a new type HSP90C2 were identified. The chromosomal location, exon splicing, and prediction of whether they are functional copies were documented, as well as the amino acid length and molecular mass of their polypeptides. The conserved regions across all protein sequences, and signature sequences in each subfamily were determined, and a standardized nomenclature system for this gene family is presented. The proeukaryote HSP90 homologue, HTPG, exists in most Bacteria species but not in Archaea, and it evolved into three lineages (Groups A, B and C) via two gene duplication events. None of the organellar-localized HSP90s were derived from endosymbionts of early eukaryotes. Mitochondrial TRAP and endoplasmic reticulum HSP90B separately originated from the ancestors of HTPG Group A in Firmicutes-like organisms very early in the formation of the eukaryotic cell. TRAP is monophyletic and present in all Animalia and some Protista species, while HSP90B is paraphyletic and present in all eukaryotes with the exception of some Fungi species, which appear to have lost it. Both HSP90C (chloroplast HSP90C1 and location-undetermined SP90C2) and cytosolic HSP90A are monophyletic, and originated from HSP90B by independent gene duplications. HSP90C exists only in Plantae, and was duplicated into HSP90C1 and HSP90C2 isoforms in higher plants. HSP90A occurs across all eukaryotes, and duplicated into HSP90AA and HSP90AB in vertebrates. Diplomonadida was identified as the most basal organism in the eukaryote lineage. The present study presents the first comparative genomic study and evolutionary analysis of the HSP90 family of genes across all kingdoms of organisms. HSP90 family members underwent multiple duplications and also subsequent losses during their evolution. This study established an overall framework of information for the family of genes, which may facilitate and stimulate the study of this gene family across all organisms.
Comparative genomics and evolution of the HSP90 family of genes across all kingdoms of organisms
Chen, Bin; Zhong, Daibin; Monteiro, Antónia
2006-01-01
Background HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate the nuclear genomes of 32 species across all kingdoms of organisms, and all sequences available in GenBank, and address the diversity, evolution, gene structure, conservation and nomenclature of the HSP90 family of genes across all organisms. Results Twelve new genes and a new type HSP90C2 were identified. The chromosomal location, exon splicing, and prediction of whether they are functional copies were documented, as well as the amino acid length and molecular mass of their polypeptides. The conserved regions across all protein sequences, and signature sequences in each subfamily were determined, and a standardized nomenclature system for this gene family is presented. The proeukaryote HSP90 homologue, HTPG, exists in most Bacteria species but not in Archaea, and it evolved into three lineages (Groups A, B and C) via two gene duplication events. None of the organellar-localized HSP90s were derived from endosymbionts of early eukaryotes. Mitochondrial TRAP and endoplasmic reticulum HSP90B separately originated from the ancestors of HTPG Group A in Firmicutes-like organisms very early in the formation of the eukaryotic cell. TRAP is monophyletic and present in all Animalia and some Protista species, while HSP90B is paraphyletic and present in all eukaryotes with the exception of some Fungi species, which appear to have lost it. Both HSP90C (chloroplast HSP90C1 and location-undetermined SP90C2) and cytosolic HSP90A are monophyletic, and originated from HSP90B by independent gene duplications. HSP90C exists only in Plantae, and was duplicated into HSP90C1 and HSP90C2 isoforms in higher plants. HSP90A occurs across all eukaryotes, and duplicated into HSP90AA and HSP90AB in vertebrates. Diplomonadida was identified as the most basal organism in the eukaryote lineage. Conclusion The present study presents the first comparative genomic study and evolutionary analysis of the HSP90 family of genes across all kingdoms of organisms. HSP90 family members underwent multiple duplications and also subsequent losses during their evolution. This study established an overall framework of information for the family of genes, which may facilitate and stimulate the study of this gene family across all organisms. PMID:16780600
Evolution of the Rax family of developmental transcription factors in vertebrates.
Orquera, Daniela P; de Souza, Flávio S J
2017-04-01
Rax proteins comprise a small family of paired-type, homeodomain-containing transcription factors with essential functions in eye and forebrain development. While invertebrates possess only one Rax gene, vertebrates can have several Rax paralogue genes, but the evolutionary history of the members of the family has not been studied in detail. Here, we present a thorough analysis of the evolutionary relationships between vertebrate Rax genes and proteins available in diverse genomic databases. Phylogenetic and synteny analyses indicate that Rax genes went through a duplication in an ancestor of all jawed vertebrates (Gnathostomata), giving rise to the ancestral vertebrate Rax1 and Rax2 genes. This duplication event is likely related to the proposed polyploidisations that occurred during early vertebrate evolution. Subsequent genome-wide duplications in the lineage of ray-finned fish (Actinopterygii) originated new Rax2 paralogues in the genomes of teleosts. In the lobe-finned fish lineage (Sarcopterygii), the N-terminal octapeptide domain of Rax2 was lost in a common ancestor of tetrapods, giving rise to a shorter version of Rax2 in this lineage. Within placental mammals, the Rax2 gene was lost altogether in an ancestor of rodents and lagomorphs (Glires). Finally, we discuss the scientific literature in the light of Rax gene evolution and propose new avenues of research on the function of this important family of transcriptional regulators. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa
2006-01-01
Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets. PMID:16606702
Clues to evolution of the SERA multigene family in 18 Plasmodium species.
Arisue, Nobuko; Kawai, Satoru; Hirai, Makoto; Palacpac, Nirianne M Q; Jia, Mozhi; Kaneko, Akira; Tanabe, Kazuyuki; Horii, Toshihiro
2011-03-15
SERA gene sequences were newly determined from 11 primate Plasmodium species including two human parasites, P. ovale and P. malariae, and the evolutionary history of SERA genes was analyzed together with 7 known species. All have one each of Group I to III cysteine-type SERA genes and varying number of Group IV serine-type SERA genes in tandem cluster. Notably, Group IV SERA genes were ascertained in all mammalian parasite lineages; and in two primate parasite lineages gene events such as duplication, truncation, fragmentation and gene loss occurred at high frequency in a manner that mimics the birth-and-death evolution model. Transcription profile of individual SERA genes varied greatly among rodent and monkey parasites. Results support the lineage-specific evolution of the Plasmodium SERA gene family. These findings provide further impetus for studies that could clarify/provide proof-of-concept that duplications of SERA genes were associated with the parasites' expansion of host range and the evolutionary conundrums of multigene families in Plasmodium.
Gubala, Anna M; Schmitz, Jonathan F; Kearns, Michael J; Vinh, Tery T; Bornberg-Bauer, Erich; Wolfner, Mariana F; Findlay, Geoffrey D
2017-05-01
New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu
2011-01-01
Background Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). Methods and Findings In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Conclusions Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development. PMID:22164299
Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu
2011-01-01
Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development.
Huang, Xue-shuang; Zhu, Bao; Jiang, Hai-ou; Wu, Su-fan; Zhang, Zai-qi; Xiao, Lin; Yi, Li-lan; Zhang, Jian-xiang
2013-04-01
We reported a 2-year-old boy with developmental delay, mild mental retardation, and severe craniofacial malformation, including facial asymmetry with hypoplasia of the left zygoma, maxilla, and mandible, and left anophthalmia and anotia. A genome-wide screen revealed a 1.38 Mb duplication on chromosome 1q31.1, which was absent in his parents and 27 healthy controls. The duplication region contains two Refseq genes, PLA2G4A and C1orf99, which have not been reported to be implicated in craniofacial malformation. Functional studies of these genes and additional clinical analysis are necessary to elucidate the pathogenesis of craniofacial malformation. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Bulky Trichomonad Genomes: Encoding a Swiss Army Knife.
Barratt, Joel; Gough, Rory; Stark, Damien; Ellis, John
2016-10-01
The trichomonads are a remarkably successful lineage of ancient, predominantly parasitic protozoa. Recent molecular analyses have revealed extensive duplication of certain genetic loci in trichomonads. Consequently, their genomes are exceptionally large compared to other parasitic protozoa. Retention of these large gene expansions across different trichomonad families raises the question: do these duplications afford an advantage? Many duplicated genes are linked to the parasitic lifestyle and some are regulated differently to their paralogues, suggesting they have acquired new functions. It is proposed that these large genomes encode a Swiss army knife of sorts, packed with a multitude of tools for use in many different circumstances. This may have bestowed trichomonads with the extraordinary versatility that has undoubtedly contributed to their success. Copyright © 2016 Elsevier Ltd. All rights reserved.
The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.
Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H
2016-04-01
To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.
Salojärvi, Jarkko; Smolander, Olli-Pekka; Nieminen, Kaisa; Rajaraman, Sitaram; Safronov, Omid; Safdari, Pezhman; Lamminmäki, Airi; Immanen, Juha; Lan, Tianying; Tanskanen, Jaakko; Rastas, Pasi; Amiryousefi, Ali; Jayaprakash, Balamuralikrishna; Kammonen, Juhana I; Hagqvist, Risto; Eswaran, Gugan; Ahonen, Viivi Helena; Serra, Juan Alonso; Asiegbu, Fred O; de Dios Barajas-Lopez, Juan; Blande, Daniel; Blokhina, Olga; Blomster, Tiina; Broholm, Suvi; Brosché, Mikael; Cui, Fuqiang; Dardick, Chris; Ehonen, Sanna E; Elomaa, Paula; Escamez, Sacha; Fagerstedt, Kurt V; Fujii, Hiroaki; Gauthier, Adrien; Gollan, Peter J; Halimaa, Pauliina; Heino, Pekka I; Himanen, Kristiina; Hollender, Courtney; Kangasjärvi, Saijaliisa; Kauppinen, Leila; Kelleher, Colin T; Kontunen-Soppela, Sari; Koskinen, J Patrik; Kovalchuk, Andriy; Kärenlampi, Sirpa O; Kärkönen, Anna K; Lim, Kean-Jin; Leppälä, Johanna; Macpherson, Lee; Mikola, Juha; Mouhu, Katriina; Mähönen, Ari Pekka; Niinemets, Ülo; Oksanen, Elina; Overmyer, Kirk; Palva, E Tapio; Pazouki, Leila; Pennanen, Ville; Puhakainen, Tuula; Poczai, Péter; Possen, Boy J H M; Punkkinen, Matleena; Rahikainen, Moona M; Rousi, Matti; Ruonala, Raili; van der Schoot, Christiaan; Shapiguzov, Alexey; Sierla, Maija; Sipilä, Timo P; Sutela, Suvi; Teeri, Teemu H; Tervahauta, Arja I; Vaattovaara, Aleksia; Vahala, Jorma; Vetchinnikova, Lidia; Welling, Annikki; Wrzaczek, Michael; Xu, Enjun; Paulin, Lars G; Schulman, Alan H; Lascoux, Martin; Albert, Victor A; Auvinen, Petri; Helariutta, Ykä; Kangasjärvi, Jaakko
2017-06-01
Silver birch (Betula pendula) is a pioneer boreal tree that can be induced to flower within 1 year. Its rapid life cycle, small (440-Mb) genome, and advanced germplasm resources make birch an attractive model for forest biotechnology. We assembled and chromosomally anchored the nuclear genome of an inbred B. pendula individual. Gene duplicates from the paleohexaploid event were enriched for transcriptional regulation, whereas tandem duplicates were overrepresented by environmental responses. Population resequencing of 80 individuals showed effective population size crashes at major points of climatic upheaval. Selective sweeps were enriched among polyploid duplicates encoding key developmental and physiological triggering functions, suggesting that local adaptation has tuned the timing of and cross-talk between fundamental plant processes. Variation around the tightly-linked light response genes PHYC and FRS10 correlated with latitude and longitude and temperature, and with precipitation for PHYC. Similar associations characterized the growth-promoting cytokinin response regulator ARR1, and the wood development genes KAK and MED5A.
The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons
Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.
2016-01-01
To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095
Insights into three whole-genome duplications gleaned from the Paramecium caudatum genome sequence.
McGrath, Casey L; Gout, Jean-Francois; Doak, Thomas G; Yanagi, Akira; Lynch, Michael
2014-08-01
Paramecium has long been a model eukaryote. The sequence of the Paramecium tetraurelia genome reveals a history of three successive whole-genome duplications (WGDs), and the sequences of P. biaurelia and P. sexaurelia suggest that these WGDs are shared by all members of the aurelia species complex. Here, we present the genome sequence of P. caudatum, a species closely related to the P. aurelia species group. P. caudatum shares only the most ancient of the three WGDs with the aurelia complex. We found that P. caudatum maintains twice as many paralogs from this early event as the P. aurelia species, suggesting that post-WGD gene retention is influenced by subsequent WGDs and supporting the importance of selection for dosage in gene retention. The availability of P. caudatum as an outgroup allows an expanded analysis of the aurelia intermediate and recent WGD events. Both the Guanine+Cytosine (GC) content and the expression level of preduplication genes are significant predictors of duplicate retention. We find widespread asymmetrical evolution among aurelia paralogs, which is likely caused by gradual pseudogenization rather than by neofunctionalization. Finally, cases of divergent resolution of intermediate WGD duplicates between aurelia species implicate this process acts as an ongoing reinforcement mechanism of reproductive isolation long after a WGD event. Copyright © 2014 by the Genetics Society of America.
Chen, Chih-Ping; Lin, Shuan-Pei; Lin, Chyi-Chyang; Chen, Yann-Jang; Chern, Schu-Rern; Li, Yueh-Chun; Hsieh, Lie-Jiau; Lee, Chen-Chi; Pan, Chen-Wen; Wang, Wayseen
2006-07-15
An 11-year-old girl presented with the phenotype of microcephaly, moderate mental retardation, motor retardation, short stature, strabismus, brachydactyly, and facial dysmorphism. She had undergone surgery for inguinal hernias. Detailed examinations of the heart and other internal organs revealed normal findings. Her karyotype was 46,XX,dup(5)(q35.2q35.3) de novo. Molecular cytogenetic analysis showed a paternally derived 5q35.2 --> q35.3 direct duplication and led to a correlation between the particular genotype and phenotype. This is the first description of a direct duplication of 5q35.2 --> q35.3. Our case represents the smallest distal duplication of chromosome 5q that is not associated with congenital heart defects. Our case also represents the smallest distal duplication of chromosome 5q that is associated with short stature and microcephaly. Mutations or deletions of the NSD1 gene, mapped to 5q35.2 --> q35.3, has been known to cause Sotos syndrome with cerebral gigantism, macrocephaly, advanced bone age and overgrowth. Our case provides evidence that the gene dosage effect of the NSD1 gene causes a reversed phenotype of microcephaly and short stature. Copyright 2006 Wiley-Liss, Inc.
Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado
2007-01-01
Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839