Science.gov

Sample records for duplicated gene family

  1. Familial Lymphoproliferative Malignancies and Tandem Duplication of NF1 Gene.

    PubMed

    Fernandes, Gustavo; Souto, Mirela; Costa, Frederico; Oliveira, Edite; Garicochea, Bernardo

    2014-01-01

    Background. Neurofibromatosis type 1 is a genetic disorder caused by loss-of-function mutations in a tumor suppressor gene (NF1) which codifies the protein neurofibromin. The frequent genetic alterations that modify neurofibromin function are deletions and insertions. Duplications are rare and phenotype in patients bearing duplication of NF1 gene is thought to be restricted to developmental abnormalities, with no reference to cancer susceptibility in these patients. We evaluated a patient who presented with few clinical signs of neurofibromatosis type 1 and a conspicuous personal and familiar history of different types of cancer, especially lymphoproliferative malignancies. The coding region of the NF-1 gene was analyzed by real-time polymerase chain reaction and direct sequencing. Multiplex ligation-dependent probe amplification was performed to detect the number of mutant copies. The NF1 gene analysis showed the following alterations: mosaic duplication of NF1, TRAF4, and MYO1D. Fluorescence in situ hybridization using probes (RP5-1002G3 and RP5-92689) flanking NF1 gene in 17q11.2 and CEP17 for 17q11.11.1 was performed. There were three signals (RP5-1002G3conRP5-92689) in the interphases analyzed and two signals (RP5-1002G3conRP5-92689) in 93% of cells. These findings show a tandem duplication of 17q11.2. Conclusion. The case suggests the possibility that NF1 gene duplication may be associated with a phenotype characterized by lymphoproliferative disorders. PMID:25580325

  2. Evolution of the vertebrate paralemmin gene family: ancient origin of gene duplicates suggests distinct functions.

    PubMed

    Hultqvist, Greta; Ocampo Daza, Daniel; Larhammar, Dan; Kilimann, Manfred W

    2012-01-01

    Paralemmin-1 is a protein implicated in plasma membrane dynamics, the development of filopodia, neurites and dendritic spines, as well as the invasiveness and metastatic potential of cancer cells. However, little is known about its mode of action, or about the biological functions of the other paralemmin isoforms: paralemmin-2, paralemmin-3 and palmdelphin. We describe here evolutionary analyses of the paralemmin gene family in a broad range of vertebrate species. Our results suggest that the four paralemmin isoform genes (PALM1, PALM2, PALM3 and PALMD) arose by quadruplication of an ancestral gene in the two early vertebrate genome duplications. Paralemmin-1 and palmdelphin were further duplicated in the teleost fish specific genome duplication. We identified a unique sequence motif common to all paralemmins, consisting of 11 highly conserved residues of which four are invariant. A single full-length paralemmin homolog with this motif was identified in the genome of the sea lamprey Petromyzon marinus and an isolated putative paralemmin motif could be detected in the genome of the lancelet Branchiostoma floridae. This allows us to conclude that the paralemmin gene family arose early and has been maintained throughout vertebrate evolution, suggesting functional diversification and specific biological roles of the paralemmin isoforms. The paralemmin genes have also maintained specific features of gene organisation and sequence. This includes the occurrence of closely linked downstream genes, initially identified as a readthrough fusion protein with mammalian paralemmin-2 (Palm2-AKAP2). We have found evidence for such an arrangement for paralemmin-1 and -2 in several vertebrate genomes, as well as for palmdelphin and paralemmin-3 in teleost fish genomes, and suggest the name paralemmin downstream genes (PDG) for this new gene family. Thus, our findings point to ancient roles for paralemmins and distinct biological functions of the gene duplicates. PMID:22855693

  3. Duplication of OsHAP family genes and their association with heading date in rice

    PubMed Central

    Li, Qiuping; Yan, Wenhao; Chen, Huaxia; Tan, Cong; Han, Zhongmin; Yao, Wen; Li, Guangwei; Yuan, Mengqi; Xing, Yongzhong

    2016-01-01

    Heterotrimeric Heme Activator Protein (HAP) family genes are involved in the regulation of flowering in plants. It is not clear how many HAP genes regulate heading date in rice. In this study, we identified 35 HAP genes, including seven newly identified genes, and performed gene duplication and candidate gene-based association analyses. Analyses showed that segmental duplication and tandem duplication are the main mechanisms of HAP gene duplication. Expression profiling and functional identification indicated that duplication probably diversifies the functions of HAP genes. A nucleotide diversity analysis revealed that 13 HAP genes underwent selection. A candidate gene-based association analysis detected four HAP genes related to heading date. An investigation of transgenic plants or mutants of 23 HAP genes confirmed that overexpression of at least four genes delayed heading date under long-day conditions, including the previously cloned Ghd8/OsHAP3H. Our results indicate that the large number of HAP genes in rice was mainly produced by gene duplication, and a few HAP genes function to regulate heading date. Selection of HAP genes is probably caused by their diverse functions rather than regulation of heading. PMID:26798026

  4. Duplication, divergence and persistence in the Phytochrome photoreceptor gene family of cottons (Gossypium spp.)

    PubMed Central

    2010-01-01

    Background Phytochromes are a family of red/far-red photoreceptors that regulate a number of important developmental traits in cotton (Gossypium spp.), including plant architecture, fiber development, and photoperiodic flowering. Little is known about the composition and evolution of the phytochrome gene family in diploid (G. herbaceum, G. raimondii) or allotetraploid (G. hirsutum, G. barbadense) cotton species. The objective of this study was to obtain a preliminary inventory and molecular-evolutionary characterization of the phytochrome gene family in cotton. Results We used comparative sequence resources to design low-degeneracy PCR primers that amplify genomic sequence tags (GSTs) for members of the PHYA, PHYB/D, PHYC and PHYE gene sub-families from A- and D-genome diploid and AD-genome allotetraploid Gossypium species. We identified two paralogous PHYA genes (designated PHYA1 and PHYA2) in diploid cottons, the result of a Malvaceae-specific PHYA gene duplication that occurred approximately 14 million years ago (MYA), before the divergence of the A- and D-genome ancestors. We identified a single gene copy of PHYB, PHYC, and PHYE in diploid cottons. The allotetraploid genomes have largely retained the complete gene complements inherited from both of the diploid genome ancestors, with at least four PHYA genes and two genes encoding PHYB, PHYC and PHYE in the AD-genomes. We did not identify a PHYD gene in any cotton genomes examined. Conclusions Detailed sequence analysis suggests that phytochrome genes retained after duplication by segmental duplication and allopolyploidy appear to be evolving independently under a birth-and-death-process with strong purifying selection. Our study provides a preliminary phytochrome gene inventory that is necessary and sufficient for further characterization of the biological functions of each of the cotton phytochrome genes, and for the development of 'candidate gene' markers that are potentially useful for cotton improvement via

  5. Duplications and losses in gene families of rust pathogens highlight putative effectors

    PubMed Central

    Pendleton, Amanda L.; Smith, Katherine E.; Feau, Nicolas; Martin, Francis M.; Grigoriev, Igor V.; Hamelin, Richard; Nelson, C. Dana; Burleigh, J. Gordon; Davis, John M.

    2014-01-01

    Rust fungi are a group of fungal pathogens that cause some of the world's most destructive diseases of trees and crops. A shared characteristic among rust fungi is obligate biotrophy, the inability to complete a lifecycle without a host. This dependence on a host species likely affects patterns of gene expansion, contraction, and innovation within rust pathogen genomes. The establishment of disease by biotrophic pathogens is reliant upon effector proteins that are encoded in the fungal genome and secreted from the pathogen into the host's cell apoplast or within the cells. This study uses a comparative genomic approach to elucidate putative effectors and determine their evolutionary histories. We used OrthoMCL to identify nearly 20,000 gene families in proteomes of 16 diverse fungal species, which include 15 basidiomycetes and one ascomycete. We inferred patterns of duplication and loss for each gene family and identified families with distinctive patterns of expansion/contraction associated with the evolution of rust fungal genomes. To recognize potential contributors for the unique features of rust pathogens, we identified families harboring secreted proteins that: (i) arose or expanded in rust pathogens relative to other fungi, or (ii) contracted or were lost in rust fungal genomes. While the origin of rust fungi appears to be associated with considerable gene loss, there are many gene duplications associated with each sampled rust fungal genome. We also highlight two putative effector gene families that have expanded in Cqf that we hypothesize have roles in pathogenicity. PMID:25018762

  6. Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution.

    PubMed

    Gu, Xun; Wang, Yufeng; Gu, Jianying

    2002-06-01

    The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome. PMID:12032571

  7. Gains, Losses and Changes of Function after Gene Duplication: Study of the Metallothionein Family

    PubMed Central

    Moleirinho, Ana; Carneiro, João; Matthiesen, Rune; Silva, Raquel M.; Amorim, António; Azevedo, Luísa

    2011-01-01

    Metallothioneins (MT) are small proteins involved in heavy metal detoxification and protection against oxidative stress and cancer. The mammalian MT family originated through a series of duplication events which generated four major genes (MT1 to MT4). MT1 and MT2 encode for ubiquitous proteins, while MT3 and MT4 evolved to accomplish specific roles in brain and epithelium, respectively. Herein, phylogenetic, transcriptional and polymorphic analyses are carried out to expose gains, losses and diversification of functions that characterize the evolutionary history of the MT family. The phylogenetic analyses show that all four major genes originated through a single duplication event prior to the radiation of mammals. Further expansion of the MT1 gene has occurred in the primate lineage reaching in humans a total of 13 paralogs, five of which are pseudogenes. In humans, the reading frame of all five MT1 pseudogenes is reconstructed by sequence homology with a functional duplicate revealing that loss of invariant cysteines is the most frequent event accounting for pseudogeneisation. Expression analyses based on EST counts and RT-PCR experiments show that, as for MT1 and MT2, human MT3 is also ubiquitously expressed while MT4 transcripts are present in brain, testes, esophagus and mainly in thymus. Polymorphic variation reveals two deleterious mutations (Cys30Tyr and Arg31Trp) in MT4 with frequencies reaching about 30% in African and Asian populations suggesting the gene is inactive in some individuals and physiological compensation for its loss must arise from a functional equivalent. Altogether our findings provide novel data on the evolution and diversification of MT gene duplicates, a valuable resource for understanding the vast set of biological processes in which these proteins are involved. PMID:21541013

  8. Evolution of the KCS gene family in plants: the history of gene duplication, sub/neofunctionalization and redundancy.

    PubMed

    Guo, Hai-Song; Zhang, Yan-Mei; Sun, Xiao-Qin; Li, Mi-Mi; Hang, Yue-Yu; Xue, Jia-Yu

    2016-04-01

    Very long-chain fatty acids (VLCFAs) play an important role in the survival and development of plants, and VLCFA synthesis is regulated by β-ketoacyl-CoA synthases (KCSs), which catalyze the condensation of an acyl-CoA with malonyl-CoA. Here, we present a genome-wide survey of the genes encoding these enzymes, KCS genes, in 28 species (26 genomes and two transcriptomes), which represents a large phylogenetic scale, and also reconstruct the evolutionary history of this gene family. KCS genes were initially single-copy genes in the green plant lineage; duplication resulted in five ancestral copies in land plants, forming five fundamental monophyletic groups in the phylogenetic tree. Subsequently, KCS genes duplicated to generate 11 genes of angiosperm origin, expanding up to 20-30 members in further-diverged angiosperm species. During this process, tandem duplications had only a small contribution, whereas polyploidy events and large-scale segmental duplications appear to be the main driving force. Accompanying this expansion were variations that led to the sub- and neofunctionalization of different members, resulting in specificity that is likely determined by the 3-D protein structure. Novel functions involved in other physiological processes emerged as well, though redundancy is also observed, largely among recent duplications. Conserved sites and variable sites of KCS proteins are also identified by statistical analysis. The variable sites are likely to be involved in the emergence of product specificity and catalytic power, and conserved sites are possibly responsible for the preservation of fundamental function. PMID:26563433

  9. The polyphenol oxidase gene family in land plants: Lineage-specific duplication and expansion

    PubMed Central

    2012-01-01

    Background Plant polyphenol oxidases (PPOs) are enzymes that typically use molecular oxygen to oxidize ortho-diphenols to ortho-quinones. These commonly cause browning reactions following tissue damage, and may be important in plant defense. Some PPOs function as hydroxylases or in cross-linking reactions, but in most plants their physiological roles are not known. To better understand the importance of PPOs in the plant kingdom, we surveyed PPO gene families in 25 sequenced genomes from chlorophytes, bryophytes, lycophytes, and flowering plants. The PPO genes were then analyzed in silico for gene structure, phylogenetic relationships, and targeting signals. Results Many previously uncharacterized PPO genes were uncovered. The moss, Physcomitrella patens, contained 13 PPO genes and Selaginella moellendorffii (spike moss) and Glycine max (soybean) each had 11 genes. Populus trichocarpa (poplar) contained a highly diversified gene family with 11 PPO genes, but several flowering plants had only a single PPO gene. By contrast, no PPO-like sequences were identified in several chlorophyte (green algae) genomes or Arabidopsis (A. lyrata and A. thaliana). We found that many PPOs contained one or two introns often near the 3’ terminus. Furthermore, N-terminal amino acid sequence analysis using ChloroP and TargetP 1.1 predicted that several putative PPOs are synthesized via the secretory pathway, a unique finding as most PPOs are predicted to be chloroplast proteins. Phylogenetic reconstruction of these sequences revealed that large PPO gene repertoires in some species are mostly a consequence of independent bursts of gene duplication, while the lineage leading to Arabidopsis must have lost all PPO genes. Conclusion Our survey identified PPOs in gene families of varying sizes in all land plants except in the genus Arabidopsis. While we found variation in intron numbers and positions, overall PPO gene structure is congruent with the phylogenetic relationships based on

  10. Evolution of alternative splicing after gene duplication.

    PubMed

    Su, Zhixi; Wang, Jianmin; Yu, Jun; Huang, Xiaoqiu; Gu, Xun

    2006-02-01

    Alternative splicing and gene duplication are two major sources of proteomic function diversity. Here, we study the evolutionary trend of alternative splicing after gene duplication by analyzing the alternative splicing differences between duplicate genes. We observed that duplicate genes have fewer alternative splice (AS) forms than single-copy genes, and that a negative correlation exists between the mean number of AS forms and the gene family size. Interestingly, we found that the loss of alternative splicing in duplicate genes may occur shortly after the gene duplication. These results support the subfunctionization model of alternative splicing in the early stage after gene duplication. Further analysis of the alternative splicing distribution in human duplicate pairs showed the asymmetric evolution of alternative splicing after gene duplications; i.e., the AS forms between duplicates may differ dramatically. We therefore conclude that alternative splicing and gene duplication may not evolve independently. In the early stage after gene duplication, young duplicates may take over a certain amount of protein function diversity that previously was carried out by the alternative splicing mechanism. In the late stage, the gain and loss of alternative splicing seem to be independent between duplicates. PMID:16365379

  11. Duplication, Selection and Gene Conversion in a Drosophila mojavensis Female Reproductive Protein Family

    PubMed Central

    Kelleher, Erin S.; Markow, Therese A.

    2009-01-01

    Protein components of the Drosophila male ejaculate, several of which evolve rapidly, are critical modulators of reproductive success. Recent studies of female reproductive tract proteins indicate they also are extremely divergent between species, suggesting that reproductive molecules may coevolve between the sexes. Our current understanding of intersexual coevolution, however, is severely limited by the paucity of genetic and evolutionary studies on the female molecules involved. Physiological evidence of ejaculate–female coadaptation, paired with a promiscuous mating system, makes Drosophila mojavensis an exciting model system in which to study the evolution of reproductive proteins. Here we explore the evolutionary dynamics of a five-paralog gene family of female reproductive proteases within populations of D. mojavensis and throughout the repleta species group. We show that the proteins have experienced ongoing gene duplication and adaptive evolution and further exhibit dynamic patterns of pseudogenation, copy number variation, gene conversion, and selection within geographically isolated populations of D. mojavensis. The integration of these patterns in a single gene family has never before been documented in a reproductive protein. PMID:19204376

  12. Insights into the coupling of duplication events and macroevolution from an age profile of animal transmembrane gene families.

    PubMed

    Ding, Guohui; Kang, Jiuhong; Liu, Qi; Shi, Tieliu; Pei, Gang; Li, Yixue

    2006-08-11

    The evolution of new gene families subsequent to gene duplication may be coupled to the fluctuation of population and environment variables. Based upon that, we presented a systematic analysis of the animal transmembrane gene duplication events on a macroevolutionary scale by integrating the palaeontology repository. The age of duplication events was calculated by maximum likelihood method, and the age distribution was estimated by density histogram and normal kernel density estimation. We showed that the density of the duplicates displays a positive correlation with the estimates of maximum number of cell types of common ancestors, and the oxidation events played a key role in the major transitions of this density trace. Next, we focused on the Phanerozoic phase, during which more macroevolution data are available. The pulse mass extinction timepoints coincide with the local peaks of the age distribution, suggesting that the transmembrane gene duplicates fixed frequently when the environment changed dramatically. Moreover, a 61-million-year cycle is the most possible cycle in this phase by spectral analysis, which is consistent with the cycles recently detected in biodiversity. Our data thus elucidate a strong coupling of duplication events and macroevolution; furthermore, our method also provides a new way to address these questions. PMID:16895434

  13. Insights into the Coupling of Duplication Events and Macroevolution from an Age Profile of Animal Transmembrane Gene Families

    PubMed Central

    Ding, Guohui; Kang, Jiuhong; Liu, Qi; Shi, Tieliu; Pei, Gang; Li, Yixue

    2006-01-01

    The evolution of new gene families subsequent to gene duplication may be coupled to the fluctuation of population and environment variables. Based upon that, we presented a systematic analysis of the animal transmembrane gene duplication events on a macroevolutionary scale by integrating the palaeontology repository. The age of duplication events was calculated by maximum likelihood method, and the age distribution was estimated by density histogram and normal kernel density estimation. We showed that the density of the duplicates displays a positive correlation with the estimates of maximum number of cell types of common ancestors, and the oxidation events played a key role in the major transitions of this density trace. Next, we focused on the Phanerozoic phase, during which more macroevolution data are available. The pulse mass extinction timepoints coincide with the local peaks of the age distribution, suggesting that the transmembrane gene duplicates fixed frequently when the environment changed dramatically. Moreover, a 61-million-year cycle is the most possible cycle in this phase by spectral analysis, which is consistent with the cycles recently detected in biodiversity. Our data thus elucidate a strong coupling of duplication events and macroevolution; furthermore, our method also provides a new way to address these questions. PMID:16895434

  14. Gene Duplication, Lineage-Specific Expansion, and Subfunctionalization in the MADF-BESS Family Patterns the Drosophila Wing Hinge

    PubMed Central

    Shukla, Vallari; Habib, Farhat; Kulkarni, Apurv; Ratnaparkhi, Girish S.

    2014-01-01

    Gene duplication, expansion, and subsequent diversification are features of the evolutionary process. Duplicated genes can be lost, modified, or altered to generate novel functions over evolutionary timescales. These features make gene duplication a powerful engine of evolutionary change. In this study, we explore these features in the MADF-BESS family of transcriptional regulators. In Drosophila melanogaster, the family contains 16 similar members, each containing an N-terminal, DNA-binding MADF domain and a C-terminal, protein-interacting, BESS domain. Phylogenetic analysis shows that members of the MADF-BESS family are expanded in the Drosophila lineage. Three members, which we name hinge1, hinge2, and hinge3 are required for wing development, with a critical role in the wing hinge. hinge1 is a negative regulator of Winglesss expression and interacts with core wing-hinge patterning genes such as teashirt, homothorax, and jing. Double knockdowns along with heterologous rescue experiments are used to demonstrate that members of the MADF-BESS family retain function in the wing hinge, in spite of expansion and diversification for over 40 million years. The wing hinge connects the blade to the thorax and has critical roles in fluttering during flight. MADF-BESS family genes appear to retain redundant functions to shape and form elements of the wing hinge in a robust and fail-safe manner. PMID:24336749

  15. Diversification of the light-harvesting complex gene family via intra- and intergenic duplications in the coral symbiotic alga Symbiodinium.

    PubMed

    Maruyama, Shinichiro; Shoguchi, Eiichi; Satoh, Nori; Minagawa, Jun

    2015-01-01

    The light-harvesting complex (LHC) is an essential component in light energy capture and transduction to facilitate downstream photosynthetic reactions in plant and algal chloroplasts. The unicellular dinoflagellate alga Symbiodinium is an endosymbiont of cnidarian animals, including corals and sea anemones, and provides carbohydrates generated through photosynthesis to host animals. Although Symbiodinium possesses a unique LHC gene family, called chlorophyll a-chlorophyll c2-peridinin protein complex (acpPC), its genome-level diversity and evolutionary trajectories have not been investigated. Here, we describe a phylogenetic analysis revealing that many of the LHCs are encoded by highly duplicated genes with multi-subunit polyprotein structures in the nuclear genome of Symbiodinium minutum. This analysis provides an extended list of the LHC gene family in a single organism, including 80 loci encoding polyproteins composed of 145 LHC subunits recovered in the phylogenetic tree. In S. minutum, 5 phylogenetic groups of the Lhcf-type gene family, which is exclusively conserved in algae harboring secondary plastids of red algal origin, were identified. Moreover, 5 groups of the Lhcr-type gene family, of which members are known to be associated with PSI in red algal plastids and secondary plastids of red algal origin, were identified. Notably, members classified within a phylogenetic group of the Lhcf-type (group F1) are highly duplicated, which may explain the presence of an unusually large number of LHC genes in this species. Some gene units were homologous to other units within single loci of the polyprotein genes, whereas intergenic homologies between separate loci were conspicuous in other cases, implying that gene unit 'shuffling' by gene conversion and/or genome rearrangement might have been a driving force for diversification. These results suggest that vigorous intra- and intergenic gene duplication events have resulted in the genomic framework of

  16. Diversification of the Light-Harvesting Complex Gene Family via Intra- and Intergenic Duplications in the Coral Symbiotic Alga Symbiodinium

    PubMed Central

    Maruyama, Shinichiro; Shoguchi, Eiichi; Satoh, Nori; Minagawa, Jun

    2015-01-01

    The light-harvesting complex (LHC) is an essential component in light energy capture and transduction to facilitate downstream photosynthetic reactions in plant and algal chloroplasts. The unicellular dinoflagellate alga Symbiodinium is an endosymbiont of cnidarian animals, including corals and sea anemones, and provides carbohydrates generated through photosynthesis to host animals. Although Symbiodinium possesses a unique LHC gene family, called chlorophyll a-chlorophyll c2-peridinin protein complex (acpPC), its genome-level diversity and evolutionary trajectories have not been investigated. Here, we describe a phylogenetic analysis revealing that many of the LHCs are encoded by highly duplicated genes with multi-subunit polyprotein structures in the nuclear genome of Symbiodinium minutum. This analysis provides an extended list of the LHC gene family in a single organism, including 80 loci encoding polyproteins composed of 145 LHC subunits recovered in the phylogenetic tree. In S. minutum, 5 phylogenetic groups of the Lhcf-type gene family, which is exclusively conserved in algae harboring secondary plastids of red algal origin, were identified. Moreover, 5 groups of the Lhcr-type gene family, of which members are known to be associated with PSI in red algal plastids and secondary plastids of red algal origin, were identified. Notably, members classified within a phylogenetic group of the Lhcf-type (group F1) are highly duplicated, which may explain the presence of an unusually large number of LHC genes in this species. Some gene units were homologous to other units within single loci of the polyprotein genes, whereas intergenic homologies between separate loci were conspicuous in other cases, implying that gene unit ‘shuffling’ by gene conversion and/or genome rearrangement might have been a driving force for diversification. These results suggest that vigorous intra- and intergenic gene duplication events have resulted in the genomic framework of

  17. Subcellular Relocalization and Positive Selection Play Key Roles in the Retention of Duplicate Genes of Populus Class III Peroxidase Family.

    PubMed

    Ren, Lin-Ling; Liu, Yan-Jing; Liu, Hai-Jing; Qian, Ting-Ting; Qi, Li-Wang; Wang, Xiao-Ru; Zeng, Qing-Yin

    2014-06-16

    Gene duplication is the primary source of new genes and novel functions. Over the course of evolution, many duplicate genes lose their function and are eventually removed by deletion. However, some duplicates have persisted and evolved diverse functions. A particular challenge is to understand how this diversity arises and whether positive selection plays a role. In this study, we reconstructed the evolutionary history of the class III peroxidase (PRX) genes from the Populus trichocarpa genome. PRXs are plant-specific enzymes that play important roles in cell wall metabolism and in response to biotic and abiotic stresses. We found that two large tandem-arrayed clusters of PRXs evolved from an ancestral cell wall type PRX to vacuole type, followed by tandem duplications and subsequent functional specification. Substitution models identified seven positively selected sites in the vacuole PRXs. These positively selected sites showed significant effects on the biochemical functions of the enzymes. We also found that positive selection acts more frequently on residues adjacent to, rather than directly at, a critical active site of the enzyme, and on flexible regions rather than on rigid structural elements of the protein. Our study provides new insights into the adaptive molecular evolution of plant enzyme families. PMID:24934172

  18. A dynamic history of gene duplications and losses characterizes the evolution of the SPARC family in eumetazoans

    PubMed Central

    Bertrand, Stephanie; Fuentealba, Jaime; Aze, Antoine; Hudson, Clare; Yasuo, Hitoyoshi; Torrejon, Marcela; Escriva, Hector; Marcellini, Sylvain

    2013-01-01

    The vertebrates share the ability to produce a skeleton made of mineralized extracellular matrix. However, our understanding of the molecular changes that accompanied their emergence remains scarce. Here, we describe the evolutionary history of the SPARC (secreted protein acidic and rich in cysteine) family, because its vertebrate orthologues are expressed in cartilage, bones and teeth where they have been proposed to bind calcium and act as extracellular collagen chaperones, and because further duplications of specific SPARC members produced the small calcium-binding phosphoproteins (SCPP) family that is crucial for skeletal mineralization to occur. Both phylogeny and synteny conservation analyses reveal that, in the eumetazoan ancestor, a unique ancestral gene duplicated to give rise to SPARC and SPARCB described here for the first time. Independent losses have eliminated one of the two paralogues in cnidarians, protostomes and tetrapods. Hence, only non-tetrapod deuterostomes have conserved both genes. Remarkably, SPARC and SPARCB paralogues are still linked in the amphioxus genome. To shed light on the evolution of the SPARC family members in chordates, we performed a comprehensive analysis of their embryonic expression patterns in amphioxus, tunicates, teleosts, amphibians and mammals. Our results show that in the chordate lineage SPARC and SPARCB family members were recurrently recruited in a variety of unrelated tissues expressing collagen genes. We propose that one of the earliest steps of skeletal evolution involved the co-expression of SPARC paralogues with collagenous proteins. PMID:23446527

  19. Evolution of non-specific lipid transfer protein (nsLTP) genes in the Poaceae family: their duplication and diversity.

    PubMed

    Jang, Cheol Seong; Yim, Won Cheol; Moon, Jun-Cheol; Hung, Je Hyeong; Lee, Tong Geon; Lim, Sung Don; Cho, Seon Hae; Lee, Kwang Kook; Kim, Wook; Seo, Yong Weon; Lee, Byung-Moo

    2008-05-01

    Previously, the genes encoding non-specific lipid transfer proteins (nsLTPs) of the Poaceae family appear to evidence different genomic distribution and somewhat different shares of EST clones, which is suggestive of independent duplication(s) followed by functional diversity. To further evaluate the evolutionary fate of the Poaceae nsLTP genes, we have identified Ka/Ks values, conserved, mutated or lost cis-regulatory elements, responses to several elicitors, genome-wide expression profiles, and nsLTP gene-coexpression networks of both (or either) wheat and rice. The Ka/Ks values within each group and between groups appeared to be similar, but not identical, in both species. The conserved cis-regulatory elements, e.g. the RY repeat (CATGCA) element related to ABA regulation in group A, might be reflected in some degree of long-term conservation in transcriptional regulation post-dating speciation. In group A, wheat nsLTP genes, with the exception of TaLTP4, evidenced responses similar to those of plant elicitors; however, the rice nsLTP genes evidenced differences in expression profiles, even though the genes of both species have undergone purifying selection, thereby suggesting their independent functional diversity. The expression profiles of rice nsLTP genes with a microarray dataset of 155 gene expression omnibus sample (GSM) plates suggest that subfunctionalization is not the sole mechanism inherent to the evolutionary history of nsLTP genes but may, rather, function in concert with other mechanism(s). As inferred by the nsLTP gene-coexpression networks, the functional diversity of nsLTP genes appears not to be randomized, but rather to be specialized in the direction of specific biological processes over evolutionary time. PMID:18270740

  20. Identification of a rare 17p13.3 duplication including the BHLHA9 and YWHAE genes in a family with developmental delay and behavioural problems

    PubMed Central

    2012-01-01

    Background Deletions and duplications of the PAFAH1B1 and YWHAE genes in 17p13.3 are associated with different clinical phenotypes. In particular, deletion of PAFAH1B1 causes isolated lissencephaly while deletions involving both PAFAH1B1 and YWHAE cause Miller-Dieker syndrome. Isolated duplications of PAFAH1B1 have been associated with mild developmental delay and hypotonia, while isolated duplications of YWHAE have been associated with autism. In particular, different dysmorphic features associated with PAFAH1B1 or YWHAE duplication have suggested the need to classify the patient clinical features in two groups according to which gene is involved in the chromosomal duplication. Methods We analyze the proband and his family by classical cytogenetic and array-CGH analyses. The putative rearrangement was confirmed by fluorescence in situ hybridization. Results We have identified a family segregating a 17p13.3 duplication extending 329.5 kilobases by FISH and array-CGH involving the YWHAE gene, but not PAFAH1B1, affected by a mild dysmorphic phenotype with associated autism and mental retardation. We propose that BHLHA9, YWHAE, and CRK genes contribute to the phenotype of our patient. The small chromosomal duplication was inherited from his mother who was affected by a bipolar and borderline disorder and was alcohol addicted. Conclusions We report an additional familial case of small 17p13.3 chromosomal duplication including only BHLHA9, YWHAE, and CRK genes. Our observation and further cases with similar microduplications are expected to be diagnosed, and will help better characterise the clinical spectrum of phenotypes associated with 17p13.3 microduplications. PMID:23035971

  1. Familial interstitial Xq27.3q28 duplication encompassing the FMR1 gene but not the MECP2 gene causes a new syndromic mental retardation condition

    PubMed Central

    Rio, Marlène; Malan, Valérie; Boissel, Sarah; Toutain, Annick; Royer, Ghislaine; Gobin, Stéphanie; Morichon-Delvallez, Nicole; Turleau, Catherine; Bonnefont, Jean-Paul; Munnich, Arnold; Vekemans, Michel; Colleaux, Laurence

    2010-01-01

    X-linked mental retardation is a common disorder that accounts for 5–10% of cases of mental retardation in males. Fragile X syndrome is the most common form resulting from a loss of expression of the FMR1 gene. On the other hand, partial duplication of the long arm of the X chromosome is uncommon. It leads to functional disomy of the corresponding genes and has been reported in several cases of mental retardation in males. In this study, we report on the clinical and genetic characterization of a new X-linked mental retardation syndrome characterized by short stature, hypogonadism and facial dysmorphism, and show that this syndrome is caused by a small Xq27.3q28 interstitial duplication encompassing the FMR1 gene. This family broadens the phenotypic spectrum of FMR1 anomalies in an unexpected manner, and we suggest that this condition may represent the fragile X syndrome «contre-type». PMID:19844254

  2. Evolution of Gene Duplication in Plants.

    PubMed

    Panchy, Nicholas; Lehti-Shiu, Melissa; Shiu, Shin-Han

    2016-08-01

    Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. PMID:27288366

  3. Prenatal diagnosis by FISH in a family with Pelizaeus-Merzbacher disease caused by duplication of PLP gene.

    PubMed

    Woodward, K; Palmer, R; Rao, K; Malcolm, S

    1999-03-01

    A diagnosis of Pelizaeus-Merzbacher disease (MIM 312080) was made in a young boy. No mutation in the coding region of the proteolipid protein (PLP) gene had been found. The boy's maternal aunt came for prenatal diagnosis when 16+ weeks pregnant and carrying a male fetus. Samples were tested for duplication of the PLP gene, by interphase FISH, in lymphocyte preparations from the proband, his aunt and an amniotic fluid cell preparation from the fetus. The proband was found to carry the duplication, thus confirming the diagnosis of Pelizaeus Merzbacher disease, but neither the aunt nor the fetus carried a duplication. PMID:10210128

  4. Evolution and expansion of the Mycobacterium tuberculosis PE and PPE multigene families and their association with the duplication of the ESAT-6 (esx) gene cluster regions

    PubMed Central

    Gey van Pittius, Nicolaas C; Sampson, Samantha L; Lee, Hyeyoung; Kim, Yeun; van Helden, Paul D; Warren, Robin M

    2006-01-01

    Background The PE and PPE multigene families of Mycobacterium tuberculosis comprise about 10% of the coding potential of the genome. The function of the proteins encoded by these large gene families remains unknown, although they have been proposed to be involved in antigenic variation and disease pathogenesis. Interestingly, some members of the PE and PPE families are associated with the ESAT-6 (esx) gene cluster regions, which are regions of immunopathogenic importance, and encode a system dedicated to the secretion of members of the potent T-cell antigen ESAT-6 family. This study investigates the duplication characteristics of the PE and PPE gene families and their association with the ESAT-6 gene clusters, using a combination of phylogenetic analyses, DNA hybridization, and comparative genomics, in order to gain insight into their evolutionary history and distribution in the genus Mycobacterium. Results The results showed that the expansion of the PE and PPE gene families is linked to the duplications of the ESAT-6 gene clusters, and that members situated in and associated with the clusters represent the most ancestral copies of the two gene families. Furthermore, the emergence of the repeat protein PGRS and MPTR subfamilies is a recent evolutionary event, occurring at defined branching points in the evolution of the genus Mycobacterium. These gene subfamilies are thus present in multiple copies only in the members of the M. tuberculosis complex and close relatives. The study provides a complete analysis of all the PE and PPE genes found in the sequenced genomes of members of the genus Mycobacterium such as M. smegmatis, M. avium paratuberculosis, M. leprae, M. ulcerans, and M. tuberculosis. Conclusion This work provides insight into the evolutionary history for the PE and PPE gene families of the mycobacteria, linking the expansion of these families to the duplications of the ESAT-6 (esx) gene cluster regions, and showing that they are composed of subgroups

  5. Characterization of duplicated Dunaliella viridis SPT1 genes provides insights into early gene divergence after duplication.

    PubMed

    Guan, Zhenwei; Meng, Xiangzong; Sun, Zhenhua; Xu, Zhengkai; Song, Rentao

    2008-10-15

    The sodium-dependent phosphate transporter gene from unicellular green algae Dunaliella viridis, DvSPT1, shares similarity with members of Pi transporter family. Sequencing analysis of D. viridis BAC clone containing the DvSPT1 gene revealed two inverted duplicated copies of this gene (DvSPT1 and DvSPT1-2 respectively). The duplication covered most of both genes except for their 3' downstream region. The duplicated genomic sequences exhibited 97.9% identity with a synonymous divergence of Ks=0.0126 in the coding region. This data indicated very recent gene duplication in D. viridis genome, providing an excellent opportunity to investigate sequence and expression divergence of duplicated genes at an early stage. Scattered point mutations and length polymorphism of simple sequence repeats (SSRs) were predominant among the sequence divergence soon after gene duplication. Due to sequence divergence in the 5' regulatory regions and a swap of the entire 3' downstream regions (3'-UTR), DvSPT1 and DvSPT1-2 showed expression divergence in response to extra-cellular NaCl concentration changes. According to their expression patterns, the two diverged gene copies would provide better adaptation to a broader range of extra-cellular NaCl concentration. Furthermore, Southern blot analysis indicated that there might be a large phosphate transporter gene family in D. viridis. PMID:18662752

  6. Functional requirements driving the gene duplication in 12 Drosophila species

    PubMed Central

    2013-01-01

    Background Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. Results In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. Conclusions This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila. PMID:23945147

  7. Evolutionary analysis of multidrug resistance genes in fungi - impact of gene duplication and family conservation.

    PubMed

    Gossani, Cristiani; Bellieny-Rabelo, Daniel; Venancio, Thiago M

    2014-11-01

    Although the emergence of bacterial drug resistance is of great concern to the scientific community, few studies have evaluated this phenomenon systematically in fungi by using genome-wide datasets. In the present study, we assembled a large compendium of Saccharomyces cerevisiae chemical genetic data to study the evolution of multidrug resistance genes (MDRs) in the fungal lineage. We found that MDRs typically emerge in widely conserved families, most of which containing homologs from pathogenic fungi, such as Candida albicans and Coccidioides immitis, which could favor the evolution of drug resistance in those species. By integrating data from chemical genetics with protein family conservation, genetic and protein interactions, we found that gene families rarely have more than one MDR, indicating that paralogs evolve asymmetrically with regard to multidrug resistance roles. Furthermore, MDRs have more genetic and protein interaction partners than non-MDRs, supporting their participation in complex biochemical systems underlying the tolerance to multiple bioactive molecules. MDRs share more chemical genetic interactions with other MDRs than with non-MDRs, regardless of their evolutionary affinity. These results suggest the existence of an intricate system involved in the global drug tolerance phenotypes. Finally, MDRs are more likely to be hit repeatedly by mutations in laboratory evolution experiments, indicating that they have great adaptive potential. The results presented here not only reveal the main genomic features underlying the evolution of MDRs, but also shed light on the gene families from which drug resistance is more likely to emerge in fungi. PMID:25220072

  8. Independent gene duplications of the YidC/Oxa/Alb3 family enabled a specialized cotranslational function

    PubMed Central

    Funes, Soledad; Hasona, Adnan; Bauerschmitt, Heike; Grubbauer, Caroline; Kauff, Frank; Collins, Ryan; Crowley, Paula J.; Palmer, Sara R.; Brady, L. Jeannine; Herrmann, Johannes M.

    2009-01-01

    YidC/Oxa/Alb3 family proteins catalyze the insertion of integral membrane proteins in bacteria, mitochondria, and chloroplasts, respectively. Unlike gram-negative organisms, gram-positive bacteria express 2 paralogs of this family, YidC1/SpoIIIJ and YidC2/YgjG. In Streptococcus mutans, deletion of yidC2 results in a stress-sensitive phenotype similar to that of mutants lacking the signal recognition particle (SRP) protein translocation pathway, while deletion of yidC1 has a less severe phenotype. In contrast to eukaryotes and gram-negative bacteria, SRP-deficient mutants are viable in S. mutans; however, double SRP-yidC2 mutants are severely compromised. Thus, YidC2 may enable loss of the SRP by playing an independent but overlapping role in cotranslational protein insertion into the membrane. This is reminiscent of the situation in mitochondria that lack an SRP pathway and where Oxa1 facilitates cotranslational membrane protein insertion by binding directly to translation-active ribosomes. Here, we show that OXA1 complements a lack of yidC2 in S. mutans. YidC2 also functions reciprocally in oxa1-deficient Saccharomyces cerevisiae mutants and mediates the cotranslational insertion of mitochondrial translation products into the inner membrane. YidC2, like Oxa1, contains a positively charged C-terminal extension and associates with translating ribosomes. Our results are consistent with a gene-duplication event in gram-positive bacteria that enabled the specialization of a YidC isoform that mediates cotranslational activity independent of an SRP pathway. PMID:19366667

  9. Familial inverted duplication 7p

    SciTech Connect

    Schaefer, G.B.; Novak, K.; Steele, D.

    1995-03-27

    A 10-month-old female with developmental delay and failure to thrive was referred for genetic evaluation as part of an adoption assessment. Physical exam showed a mildly beaked nose and clinodactyly, but otherwise nothing remarkable. Chromosome analysis showed an inverted duplication of the p12.2{r_arrow}p13 portion of chromosome 7(46,XX,dup(7)(p13p12.2)). The proposita`s older brother, mother, and grandmother were cognitively delayed and had the same chromosome 7 duplication. A review of the literature showed no other cases involving this exact duplication. 5 refs., 3 figs., 1 tab.

  10. Subcellular Relocalization and Positive Selection Play Key Roles in the Retention of Duplicate Genes of Populus Class III Peroxidase Family[W][OPEN

    PubMed Central

    Ren, Lin-Ling; Liu, Yan-Jing; Liu, Hai-Jing; Qian, Ting-Ting; Qi, Li-Wang; Wang, Xiao-Ru; Zeng, Qing-Yin

    2014-01-01

    Gene duplication is the primary source of new genes and novel functions. Over the course of evolution, many duplicate genes lose their function and are eventually removed by deletion. However, some duplicates have persisted and evolved diverse functions. A particular challenge is to understand how this diversity arises and whether positive selection plays a role. In this study, we reconstructed the evolutionary history of the class III peroxidase (PRX) genes from the Populus trichocarpa genome. PRXs are plant-specific enzymes that play important roles in cell wall metabolism and in response to biotic and abiotic stresses. We found that two large tandem-arrayed clusters of PRXs evolved from an ancestral cell wall type PRX to vacuole type, followed by tandem duplications and subsequent functional specification. Substitution models identified seven positively selected sites in the vacuole PRXs. These positively selected sites showed significant effects on the biochemical functions of the enzymes. We also found that positive selection acts more frequently on residues adjacent to, rather than directly at, a critical active site of the enzyme, and on flexible regions rather than on rigid structural elements of the protein. Our study provides new insights into the adaptive molecular evolution of plant enzyme families. PMID:24934172

  11. Has gene duplication impacted the evolution of Eutherian longevity?

    PubMed

    Doherty, Aoife; de Magalhães, João Pedro

    2016-10-01

    One of the greatest unresolved questions in aging biology is determining the genetic basis of interspecies longevity variation. Gene duplication is often the key to understanding the origin and evolution of important Eutherian phenotypes. We systematically identified longevity-associated genes in model organisms that duplicated throughout Eutherian evolution. Longevity-associated gene families have a marginally significantly higher rate of duplication compared to non-longevity-associated gene families. Anti-longevity-associated gene families have significantly increased rate of duplication compared to pro-longevity gene families and are enriched in neurodegenerative disease categories. Conversely, duplicated pro-longevity-associated gene families are enriched in cell cycle genes. There is a cluster of longevity-associated gene families that expanded solely in long-lived species that is significantly enriched in pathways relating to 3-UTR-mediated translational regulation, metabolism of proteins and gene expression, pathways that have the potential to affect longevity. The identification of a gene cluster that duplicated solely in long-lived species involved in such fundamental processes provides a promising avenue for further exploration of Eutherian longevity evolution. PMID:27378378

  12. Expansion of banana (Musa acuminata) gene families involved in ethylene biosynthesis and signalling after lineage-specific whole-genome duplications.

    PubMed

    Jourda, Cyril; Cardi, Céline; Mbéguié-A-Mbéguié, Didier; Bocs, Stéphanie; Garsmeur, Olivier; D'Hont, Angélique; Yahiaoui, Nabila

    2014-05-01

    Whole-genome duplications (WGDs) are widespread in plants, and three lineage-specific WGDs occurred in the banana (Musa acuminata) genome. Here, we analysed the impact of WGDs on the evolution of banana gene families involved in ethylene biosynthesis and signalling, a key pathway for banana fruit ripening. Banana ethylene pathway genes were identified using comparative genomics approaches and their duplication modes and expression profiles were analysed. Seven out of 10 banana ethylene gene families evolved through WGD and four of them (1-aminocyclopropane-1-carboxylate synthase (ACS), ethylene-insensitive 3-like (EIL), ethylene-insensitive 3-binding F-box (EBF) and ethylene response factor (ERF)) were preferentially retained. Banana orthologues of AtEIN3 and AtEIL1, two major genes for ethylene signalling in Arabidopsis, were particularly expanded. This expansion was paralleled by that of EBF genes which are responsible for control of EIL protein levels. Gene expression profiles in banana fruits suggested functional redundancy for several MaEBF and MaEIL genes derived from WGD and subfunctionalization for some of them. We propose that EIL and EBF genes were co-retained after WGD in banana to maintain balanced control of EIL protein levels and thus avoid detrimental effects of constitutive ethylene signalling. In the course of evolution, subfunctionalization was favoured to promote finer control of ethylene signalling. PMID:24716518

  13. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms[OPEN

    PubMed Central

    Li, Zhen; Van de Peer, Yves; De Smet, Riet

    2016-01-01

    Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of “gene duplicability” is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. PMID:26744215

  14. Phylogeny of Na+/Ca2+ exchanger (NCX) genes from genomic data identifies new gene duplications and a new family member in fish species.

    PubMed

    Marshall, Christian R; Fox, Joanne A; Butland, Stefanie L; Ouellette, B F Francis; Brinkman, Fiona S L; Tibbits, Glen F

    2005-04-14

    The Na+/Ca2+ exchanger (NCX) is a member of the cation/Ca2+ antiporter (CaCA) family and plays a key role in maintaining cellular Ca2+ homeostasis in a variety of cell types. NCX is present in a diverse group of organisms and exhibits high overall identity across species. To date, three separate genes, i.e., NCX1, NCX2, and NCX3, have been identified in mammals. However, phylogenetic analysis of the exchanger has been hindered by the lack of nonmammalian NCX sequences. In this study, we expand and diversify the list of NCX sequences by identifying NCX homologs from whole-genome sequences accessible through the Ensembl Genome Browser. We identified and annotated 13 new NCX sequences, including 4 from zebrafish, 4 from Japanese pufferfish, 2 from chicken, and 1 each from honeybee, mosquito, and chimpanzee. Examination of NCX gene structure, together with construction of phylogenetic trees, provided novel insights into the molecular evolution of NCX and allowed us to more accurately annotate NCX gene names. For the first time, we report the existence of NCX2 and NCX3 in organisms other than mammals, yielding the hypothesis that two serial NCX gene duplications occurred around the time vertebrates and invertebrates diverged. In addition, we have found a putative new NCX protein, named NCX4, that is related to NCX1 but has been observed only in fish species genomes. These findings present a stronger foundation for our understanding of the molecular evolution of the NCX gene family and provide a framework for further NCX phylogenetic and molecular studies. PMID:15741504

  15. Altered patterns of gene duplication and differential gene gain and loss in fungal pathogens

    PubMed Central

    Powell, Amy J; Conant, Gavin C; Brown, Douglas E; Carbone, Ignazio; Dean, Ralph A

    2008-01-01

    Background Duplication, followed by fixation or random loss of novel genes, contributes to genome evolution. Particular outcomes of duplication events are possibly associated with pathogenic life histories in fungi. To date, differential gene gain and loss have not been studied at genomic scales in fungal pathogens, despite this phenomenon's known importance in virulence in bacteria and viruses. Results To determine if patterns of gene duplication differed between pathogens and non-pathogens, we identified gene families across nine euascomycete and two basidiomycete species. Gene family size distributions were fit to power laws to compare gene duplication trends in pathogens versus non-pathogens. Fungal phytopathogens showed globally altered patterns of gene duplication, as indicated by differences in gene family size distribution. We also identified sixteen examples of gene family expansion and five instances of gene family contraction in pathogenic lineages. Expanded gene families included those predicted to be important in melanin biosynthesis, host cell wall degradation and transport functions. Contracted families included those encoding genes involved in toxin production, genes with oxidoreductase activity, as well as subunits of the vacuolar ATPase complex. Surveys of the functional distribution of gene duplicates indicated that pathogens show enrichment for gene duplicates associated with receptor and hydrolase activities, while euascomycete pathogens appeared to have not only these differences, but also significantly more duplicates associated with regulatory and carbohydrate binding functions. Conclusion Differences in the overall levels of gene duplication in phytopathogenic species versus non-pathogenic relatives implicate gene inventory flux as an important virulence-associated process in fungi. We hypothesize that the observed patterns of gene duplicate enrichment, gene family expansion and contraction reflect adaptation within pathogenic life

  16. A Homozygous TPO Gene Duplication (c.1184_1187dup4) Causes Congenital Hypothyroidism in Three Siblings Born to a Consanguineous Family.

    PubMed

    Cangul, Hakan; Aydin, Banu K; Bas, Firdevs

    2015-12-01

    Congenital hypothyroidism (CH) is the most common neonatal endocrine disease, and germ-line mutations in the TPO gene cause the inherited form of the disease. Our aim in this study was to determine the genetic basis of congenital hypothyroidism in three affected children coming from a consanguineous Turkish family. Because CH is usually inherited in autosomal recessive manner in consanguineous/multicase families, we adopted a two-stage strategy of genetic linkage studies and targeted sequencing of the candidate genes. First, we investigated the potential genetic linkage of the family to any known CH locus, using microsatellite markers, and then screened for mutations in linked-gene by conventional sequencing. The family showed potential linkage to the TPO gene and we detected a homozygous duplication (c.1184_1187dup4) in all cases. The mutation segregated with disease status in the family. This study confirms the pathogenicity of the c.1184_1187dup4 mutation in the TPO gene and helps establish a genotype/phenotype correlation associated with this mutation. It also highlights the importance of molecular genetic studies in the definitive diagnosis and accurate classification of CH. PMID:27617131

  17. Simulating evolution by gene duplication.

    PubMed

    Ohta, T

    1987-01-01

    By considering the recent finding that unequal crossing over and other molecular interactions are contributing to the evolution of multigene families, a model of the origin of repetitive genes was studied by Monte Carlo simulations. Starting from a single gene copy, how genetic systems evolve was examined under unequal crossing over, random drift and natural selection. Both beneficial and deteriorating mutations were incorporated, and the latter were assumed to occur ten times more frequently than the former. Positive natural selection favors those chromosomes with more beneficial mutations in redundant copies than others in the population, but accumulation of deteriorating mutations (pseudogenes) have no effect on fitness so long as there remains a functional gene. The results imply the following: Positive natural selection is needed in order to acquire gene families with new functions. Without it, too many pseudogenes accumulate before attaining a functional gene family. There is a large fluctuation in the outcome even if parameters are the same. When unequal crossing over occurs more frequently, the system evolves more rapidly. It was also shown, under realistic values of parameters, that the genetic load for acquiring a new gene is not as large as J.B.S. Haldane suggested, but not so small as in a model in which a system for selection started from already redundant genes. PMID:3557113

  18. Clinical characterization and identification of duplication breakpoints in a Japanese family with Xq28 duplication syndrome including MECP2.

    PubMed

    Fukushi, Daisuke; Yamada, Kenichiro; Nomura, Noriko; Naiki, Misako; Kimura, Reiko; Yamada, Yasukazu; Kumagai, Toshiyuki; Yamaguchi, Kumiko; Miyake, Yoshishige; Wakamatsu, Nobuaki

    2014-04-01

    Xq28 duplication syndrome including MECP2 is a neurodevelopmental disorder characterized by axial hypotonia at infancy, severe intellectual disability, developmental delay, mild characteristic facial appearance, epilepsy, regression, and recurrent infections in males. We identified a Japanese family of Xq28 duplications, in which the patients presented with cerebellar ataxia, severe constipation, and small feet, in addition to the common clinical features. The 488-kb duplication spanned from L1CAM to EMD and contained 17 genes, two pseudo genes, and three microRNA-coding genes. FISH and nucleotide sequence analyses demonstrated that the duplication was tandem and in a forward orientation, and the duplication breakpoints were located in AluSc at the EMD side, with a 32-bp deletion, and LTR50 at the L1CAM side, with "tc" and "gc" microhomologies at the duplication breakpoints, respectively. The duplicated segment was completely segregated from the grandmother to the patients. These results suggest that the duplication was generated by fork-stalling and template-switching at the AluSc and LTR50 sites. This is the first report to determine the size and nucleotide sequences of the duplicated segments at Xq28 of three generations of a family and provides the genotype-phenotype correlation of the patients harboring the specific duplicated segment. PMID:24478188

  19. Duplication of the EFNB1 Gene in Familial Hypertelorism: Imbalance in Ephrin-B1 Expression and Abnormal Phenotypes in Humans and Mice

    PubMed Central

    Babbs, Christian; Stewart, Helen S; Williams, Louise J; Connell, Lyndsey; Goriely, Anne; Twigg, Stephen RF; Smith, Kim; Lester, Tracy; Wilkie, Andrew OM

    2011-01-01

    Familial hypertelorism, characterized by widely spaced eyes, classically shows autosomal dominant inheritance (Teebi type), but some pedigrees are compatible with X-linkage. No mechanism has been described previously, but clinical similarity has been noted to craniofrontonasal syndrome (CFNS), which is caused by mutations in the X-linked EFNB1 gene. Here we report a family in which females in three generations presented with hypertelorism, but lacked either craniosynostosis or a grooved nasal tip, excluding CFNS. DNA sequencing of EFNB1 was normal, but further analysis revealed a duplication of 937 kb including EFNB1 and two flanking genes: PJA1 and STARD8. We found that the X chromosome bearing the duplication produces ∼1.6-fold more EFNB1 transcript than the normal X chromosome and propose that, in the context of X-inactivation, this difference in expression level of EFNB1 results in abnormal cell sorting leading to hypertelorism. To support this hypothesis, we provide evidence from a mouse model carrying a targeted human EFNB1 cDNA, that abnormal cell sorting occurs in the cranial region. Hence, we propose that X-linked cases resembling Teebi hypertelorism may have a similar mechanism to CFNS, and that cellular mosaicism for different levels of ephrin-B1 (as well as simple presence/absence) leads to craniofacial abnormalities. Hum Mutat 32:1–9, 2011. © 2011 Wiley-Liss, Inc. PMID:21542058

  20. Duplication of floral regulatory genes in the Lamiales.

    PubMed

    Aagaard, Jan E; Olmstead, Richard G; Willis, John H; Phillips, Patrick C

    2005-08-01

    Duplication of some floral regulatory genes has occurred repeatedly in angiosperms, whereas others are thought to be single-copy in most lineages. We selected three genes that interact in a pathway regulating floral development conserved among higher tricolpates (LFY/FLO, UFO/FIM, and AP3/DEF) and screened for copy number among families of Lamiales that are closely related to the model species Antirrhinum majus. We show that two of three genes have duplicated at least twice in the Lamiales. Phylogenetic analyses of paralogs suggest that an ancient whole genome duplication shared among many families of Lamiales occurred after the ancestor of these families diverged from the lineage leading to Veronicaceae (including the single-copy species A. majus). Duplication is consistent with previous patterns among angiosperm lineages for AP3/DEF, but this is the first report of functional duplicate copies of LFY/FLO outside of tetraploid species. We propose Lamiales taxa will be good models for understanding mechanisms of duplicate gene preservation and how floral regulatory genes may contribute to morphological diversity. PMID:21646149

  1. Gene duplication, gene loss and evolution of expression domains in the vertebrate nuclear receptor NR5A (Ftz-F1) family

    PubMed Central

    Kuo, Ming-Wei; Postlethwait, John; Lee, Wen-Chih; Lou, Show-Wan; Chan, Woon-Khiong; Chung, Bon-chu

    2005-01-01

    Fushi tarazu factor 1 (Ftz-F1, NR5A) is a zinc-finger transcription factor that belongs to the nuclear receptor superfamily and regulates genes that are involved in sterol and steroid metabolism in gonads, adrenals, liver and other tissues. To understand the evolutionary origins and developmental genetic relationships of the Ftz-F1 genes, we have cloned four homologous Ftz-f1 genes in zebrafish, called ff1a, ff1b, ff1c and ff1d. These four genes have different temporal and spatial expression patterns during development, indicating that they have distinct mechanisms of genetic regulation. Among them, the ff1a expression pattern is similar to mammalian Nr5a2, while the ff1b pattern is similar to that of mammalian Nr5a1. Genetic mapping experiments show that these four ff1 genes are located on chromosome segments conserved between the zebrafish and human genomes, indicating a common ancestral origin. Phylogenetic and conserved synteny analysis show that ff1a is the orthologue of NR5A2, and that ff1b and ff1d genes are co-orthologues of NR5A1 that arose by a gene-duplication event, probably a whole-genome duplication, in the ray-fin lineage, and each gene is located next to an NR6A1 co-orthologue as in humans, showing that the tandem duplication occurred before the divergence of human and zebrafish lineages. ff1c does not have a mammalian counterpart. Thus we have characterized the phylogenetic relationships, expression patterns and chromosomal locations of these Ftz-F1 genes, and have demonstrated their identities as NR5A genes in relation to the orthologous genes in other species. PMID:15725073

  2. Divergence of the bZIP Gene Family in Strawberry, Peach, and Apple Suggests Multiple Modes of Gene Evolution after Duplication

    PubMed Central

    Wang, Xiao-Long; Zhong, Yan; Cheng, Zong-Ming; Xiong, Jin-Song

    2015-01-01

    The basic leucine zipper (bZIP) transcription factors are the most diverse members of dimerizing transcription factors. In the present study, 50, 116, and 47 bZIP genes were identified in Malus domestica (apple), Prunus persica (peach), and Fragaria vesca (strawberry), respectively. Species-specific duplication was the main contributor to the large number of bZIPs observed in apple. After WGD in apple genome, orthologous bZIP genes corresponding to strawberry on duplicated regions in apple genome were retained. However, in peach ancestor, these syntenic regions were quickly lost or deleted. Maybe the positive selection contributed to the expansion of clade S to adapt to the development and environment stresses. In addition, purifying selection was mainly responsible for bZIP sequence-specific DNA binding. The analysis of orthologous pairs between chromosomes indicates that these orthologs derived from one gene duplication located on one of the nine ancient chromosomes in the Rosaceae. The comparative analysis of bZIP genes in three species provides information on the evolutionary fate of bZIP genes in apple and peach after they diverged from strawberry. PMID:26770968

  3. Divergence of the bZIP Gene Family in Strawberry, Peach, and Apple Suggests Multiple Modes of Gene Evolution after Duplication.

    PubMed

    Wang, Xiao-Long; Zhong, Yan; Cheng, Zong-Ming; Xiong, Jin-Song

    2015-01-01

    The basic leucine zipper (bZIP) transcription factors are the most diverse members of dimerizing transcription factors. In the present study, 50, 116, and 47 bZIP genes were identified in Malus domestica (apple), Prunus persica (peach), and Fragaria vesca (strawberry), respectively. Species-specific duplication was the main contributor to the large number of bZIPs observed in apple. After WGD in apple genome, orthologous bZIP genes corresponding to strawberry on duplicated regions in apple genome were retained. However, in peach ancestor, these syntenic regions were quickly lost or deleted. Maybe the positive selection contributed to the expansion of clade S to adapt to the development and environment stresses. In addition, purifying selection was mainly responsible for bZIP sequence-specific DNA binding. The analysis of orthologous pairs between chromosomes indicates that these orthologs derived from one gene duplication located on one of the nine ancient chromosomes in the Rosaceae. The comparative analysis of bZIP genes in three species provides information on the evolutionary fate of bZIP genes in apple and peach after they diverged from strawberry. PMID:26770968

  4. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. PMID:25481634

  5. Functional divergence of gene duplicates through ectopic recombination

    PubMed Central

    Christiaens, Joaquin F; Van Mulders, Sebastiaan E; Duitama, Jorge; Brown, Chris A; Ghequire, Maarten G; De Meester, Luc; Michiels, Jan; Wenseleers, Tom; Voordeckers, Karin; Verstrepen, Kevin J

    2012-01-01

    Gene duplication stimulates evolutionary innovation as the resulting paralogs acquire mutations that lead to sub- or neofunctionalization. A comprehensive in silico analysis of paralogs in Saccharomyces cerevisiae reveals that duplicates of cell-surface and subtelomeric genes also undergo ectopic recombination, which leads to new chimaeric alleles. Mimicking such intergenic recombination events in the FLO (flocculation) family of cell-surface genes shows that chimaeric FLO alleles confer different adhesion phenotypes than the parental genes. Our results indicate that intergenic recombination between paralogs can generate a large set of new alleles, thereby providing the raw material for evolutionary adaptation and innovation. PMID:23070367

  6. Evolution of Gene Duplication in Plants1[OPEN

    PubMed Central

    2016-01-01

    Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. PMID:27288366

  7. Duplication and maintenance of the Myb genes of vertebrate animals

    PubMed Central

    Davidson, Colin J.; Guthrie, Erin E.; Lipsick, Joseph S.

    2013-01-01

    Summary Gene duplication is an important means of generating new genes. The major mechanisms by which duplicated genes are preserved in the face of purifying selection are thought to be neofunctionalization, subfunctionalization, and increased gene dosage. However, very few duplicated gene families in vertebrate species have been analyzed by functional tests in vivo. We have therefore examined the three vertebrate Myb genes (c-Myb, A-Myb, and B-Myb) by cytogenetic map analysis, by sequence analysis, and by ectopic expression in Drosophila. We provide evidence that the vertebrate Myb genes arose by two rounds of regional genomic duplication. We found that ubiquitous expression of c-Myb and A-Myb, but not of B-Myb or Drosophila Myb, was lethal in Drosophila. Expression of any of these genes during early larval eye development was well tolerated. However, expression of c-Myb and A-Myb, but not of B-Myb or Drosophila Myb, during late larval eye development caused drastic alterations in adult eye morphology. Mosaic analysis implied that this eye phenotype was cell-autonomous. Interestingly, some of the eye phenotypes caused by the retroviral v-Myb oncogene and the normal c-Myb proto-oncogene from which v-Myb arose were quite distinct. Finally, we found that post-translational modifications of c-Myb by the GSK-3 protein kinase and by the Ubc9 SUMO-conjugating enzyme that normally occur in vertebrate cells can modify the eye phenotype caused by c-Myb in Drosophila. These results support a model in which the three Myb genes of vertebrates arose by two sequential duplications. The first duplication was followed by a subfunctionalization of gene expression, then neofunctionalization of protein function to yield a c/A-Myb progenitor. The duplication of this progenitor was followed by subfunctionalization of gene expression to give rise to tissue-specific c-Myb and A-Myb genes. PMID:23431116

  8. Pervasive and Persistent Redundancy among Duplicated Genes in Yeast

    PubMed Central

    Dean, E. Jedediah; Davis, Jerel C.; Davis, Ronald W.; Petrov, Dmitri A.

    2008-01-01

    The loss of functional redundancy is the key process in the evolution of duplicated genes. Here we systematically assess the extent of functional redundancy among a large set of duplicated genes in Saccharomyces cerevisiae. We quantify growth rate in rich medium for a large number of S. cerevisiae strains that carry single and double deletions of duplicated and singleton genes. We demonstrate that duplicated genes can maintain substantial redundancy for extensive periods of time following duplication (∼100 million years). We find high levels of redundancy among genes duplicated both via the whole genome duplication and via smaller scale duplications. Further, we see no evidence that two duplicated genes together contribute to fitness in rich medium substantially beyond that of their ancestral progenitor gene. We argue that duplicate genes do not often evolve to behave like singleton genes even after very long periods of time. PMID:18604285

  9. Drosophila melanogaster metallothionein genes: Selection for duplications

    SciTech Connect

    Lange, B.W.

    1989-01-01

    The metallothionein genes of Drosophila melanogaster, Mtn and Mto, may play an important role in heavy-metal detoxification. In order to investigate the possibility of increased selection for duplications of these genes in natural populations exposed to high levels of heavy metals, I compared the frequencies of such duplications among flies collected from metal-contaminated and non-contaminated orchards in Pennsylvania, Tennessee, and Georgia. Contaminated of collection sites and of local flies was confirmed by atomic absorption spectrosphotometry. Six-nucleotide-recognizing restriction enzyme analysis was used to screen 1666 wild third chromosomes for Mtn duplications. A subset (327) of these lines was screened for Mto duplications: none were found. Cadmium tolerance test performed on F{sub 2} progeny of wild females failed to detect a difference in tolerance levels between flies from contaminated orchards and flies from control orchards. Estimates of sequence diversity among a subsample (92) of the chromosomes used in the duplication survey, including all 27 Mtn duplication chromosomes, were obtained using four-nucleotide-recognizing restriction enzyme analysis.

  10. Wanda: a database of duplicated fish genes

    PubMed Central

    Van de Peer, Yves; Taylor, John S.; Joseph, Jayabalan; Meyer, Axel

    2002-01-01

    Comparative genomics has shown that ray-finned fish (Actinopterygii) contain more copies of many genes than other vertebrates. A large number of these additional genes appear to have been produced during a genome duplication event that occurred early during the evolution of Actinopterygii (i.e. before the teleost radiation). In addition to this ancient genome duplication event, many lineages within Actinopterygii have experienced more recent genome duplications. Here we introduce a curated database named Wanda that lists groups of orthologous genes with one copy from man, mouse and chicken, one or two from tetraploid Xenopus and two or more ancient copies (i.e. paralogs) from ray-finned fish. The database also contains the sequence alignments and phylogenetic trees that were necessary for determining the correct orthologous and paralogous relationships among genes. Where available, map positions and functional data are also reported. The Wanda database should be of particular use to evolutionary and developmental biologists who are interested in the evolutionary and functional divergence of genes after duplication. Wanda is available at http://www.evolutionsbiologie.uni-konstanz.de/Wanda/. PMID:11752268

  11. Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution

    PubMed Central

    Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.

    2014-01-01

    Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106

  12. Evolutionary Analysis of Sequence Divergence and Diversity of Duplicate Genes in Aspergillus fumigatus

    PubMed Central

    Yang, Ence; Hulse, Amanda M.; Cai, James J.

    2012-01-01

    Gene duplication as a major source of novel genetic material plays an important role in evolution. In this study, we focus on duplicate genes in Aspergillus fumigatus, a ubiquitous filamentous fungus causing life-threatening human infections. We characterize the extent and evolutionary patterns of the duplicate genes in the genome of A. fumigatus. Our results show that A. fumigatus contains a large amount of duplicate genes with pronounced sequence divergence between two copies, and approximately 10% of them diverge asymmetrically, i.e. two copies of a duplicate gene pair diverge at significantly different rates. We use a Bayesian approach of the McDonald-Kreitman test to infer distributions of selective coefficients γ(=2Nes) and find that (1) the values of γ for two copies of duplicate genes co-vary positively and (2) the average γ for the two copies differs between genes from different gene families. This analysis highlights the usefulness of combining divergence and diversity data in studying the evolution of duplicate genes. Taken together, our results provide further support and refinement to the theories of gene duplication. Through characterizing the duplicate genes in the genome of A. fumigatus, we establish a computational framework, including parameter settings and methods, for comparative study of genetic redundancy and gene duplication between different fungal species. PMID:23225993

  13. Phosphorylation network rewiring by gene duplication

    PubMed Central

    Freschi, Luca; Courcelles, Mathieu; Thibault, Pierre; Michnick, Stephen W; Landry, Christian R

    2011-01-01

    Elucidating how complex regulatory networks have assembled during evolution requires a detailed understanding of the evolutionary dynamics that follow gene duplication events, including changes in post-translational modifications. We compared the phosphorylation profiles of paralogous proteins in the budding yeast Saccharomyces cerevisiae to that of a species that diverged from the budding yeast before the duplication of those genes. We found that 100 million years of post-duplication divergence are sufficient for the majority of phosphorylation sites to be lost or gained in one paralog or the other, with a strong bias toward losses. However, some losses may be partly compensated for by the evolution of other phosphosites, as paralogous proteins tend to preserve similar numbers of phosphosites over time. We also found that up to 50% of kinase–substrate relationships may have been rewired during this period. Our results suggest that after gene duplication, proteins tend to subfunctionalize at the level of post-translational regulation and that even when phosphosites are preserved, there is a turnover of the kinases that phosphorylate them. PMID:21734643

  14. Growth of Novel Epistatic Interactions by Gene Duplication

    PubMed Central

    Jiang, Huifeng; Xu, Lin; Gu, Zhenglong

    2011-01-01

    Epistasis has long been recognized as fundamentally important in understanding the structure, function, and evolutionary dynamics of biological systems. Gene duplication is a major mechanism of evolution for genetic novelties. Here, we demonstrate that genes evolved significantly more epistatic interactions after duplication. The connectivity of duplicate gene pairs in epistatic networks is positively correlated with the extent of their sequence divergence. Furthermore, duplicate gene pairs tend to epistatically interact with genes that occupy more functional spaces than do single-copy genes. These results show that gene duplication plays an important role in the evolution of epistasis. PMID:21402864

  15. Signals of historical interlocus gene conversion in human segmental duplications.

    PubMed

    Dumont, Beth L; Eichler, Evan E

    2013-01-01

    Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC). Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i) a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii) the alignment-based method implemented in the GENECONV program. One-quarter (25.4%) of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans. PMID:24124524

  16. Phylogenetic detection of numerous gene duplications shared by animals, fungi and plants

    PubMed Central

    2010-01-01

    Background Gene duplication is considered a major driving force for evolution of genetic novelty, thereby facilitating functional divergence and organismal diversity, including the process of speciation. Animals, fungi and plants are major eukaryotic kingdoms and the divergences between them are some of the most significant evolutionary events. Although gene duplications in each lineage have been studied extensively in various contexts, the extent of gene duplication prior to the split of plants and animals/fungi is not clear. Results Here, we have studied gene duplications in early eukaryotes by phylogenetic relative dating. We have reconstructed gene families (with one or more orthogroups) with members from both animals/fungi and plants by using two different clustering strategies. Extensive phylogenetic analyses of the gene families show that, among nearly 2,600 orthogroups identified, at least 300 of them still retain duplication that occurred before the divergence of the three kingdoms. We further found evidence that such duplications were also detected in some highly divergent protists, suggesting that these duplication events occurred in the ancestors of most major extant eukaryotic groups. Conclusions Our phylogenetic analyses show that numerous gene duplications happened at the early stage of eukaryotic evolution, probably before the separation of known major eukaryotic lineages. We discuss the implication of our results in the contexts of different models of eukaryotic phylogeny. One possible explanation for the large number of gene duplication events is one or more large-scale duplications, possibly whole genome or segmental duplication(s), which provides a genomic basis for the successful radiation of early eukaryotes. PMID:20370904

  17. Gene duplication and transfer events in plant mitochondria genome

    SciTech Connect

    Xiong Aisheng Peng Rihe; Zhuang Jing; Gao Feng; Zhu Bo; Fu Xiaoyan; Xue Yong; Jin Xiaofen; Tian Yongsheng; Zhao Wei; Yao Quanhong

    2008-11-07

    Gene or genome duplication events increase the amount of genetic material available to increase the genomic, and thereby phenotypic, complexity of organisms during evolution. Gene duplication and transfer events have been important to molecular evolution in all three domains of life, and may be the first step in the emergence of new gene functions. Gene transfer events have been proposed as another accelerator of evolution. The duplicated gene or genome, mainly nuclear, has been the subject of several recent reviews. In addition to the nuclear genome, organisms have organelle genomes, including mitochondrial genome. In this review, we briefly summarize gene duplication and transfer events in the plant mitochondrial genome.

  18. Molecular trajectories leading to the alternative fates of duplicate genes.

    PubMed

    Marotta, Michael; Piontkivska, Helen; Tanaka, Hisashi

    2012-01-01

    Gene duplication generates extra gene copies in which mutations can accumulate without risking the function of pre-existing genes. Such mutations modify duplicates and contribute to evolutionary novelties. However, the vast majority of duplicates appear to be short-lived and experience duplicate silencing within a few million years. Little is known about the molecular mechanisms leading to these alternative fates. Here we delineate differing molecular trajectories of a relatively recent duplication event between humans and chimpanzees by investigating molecular properties of a single duplicate: DNA sequences, gene expression and promoter activities. The inverted duplication of the Glutathione S-transferase Theta 2 (GSTT2) gene had occurred at least 7 million years ago in the common ancestor of African great apes and is preserved in chimpanzees (Pan troglodytes), whereas a deletion polymorphism is prevalent in humans. The alternative fates are associated with expression divergence between these species, and reduced expression in humans is regulated by silencing mutations that have been propagated between duplicates by gene conversion. In contrast, selective constraint preserved duplicate divergence in chimpanzees. The difference in evolutionary processes left a unique DNA footprint in which dying duplicates are significantly more similar to each other (99.4%) than preserved ones. Such molecular trajectories could provide insights for the mechanisms underlying duplicate life and death in extant genomes. PMID:22720000

  19. Preservation of duplicate genes by complementary, degenerative mutations.

    PubMed Central

    Force, A; Lynch, M; Pickett, F B; Amores, A; Yan, Y L; Postlethwait, J

    1999-01-01

    The origin of organismal complexity is generally thought to be tightly coupled to the evolution of new gene functions arising subsequent to gene duplication. Under the classical model for the evolution of duplicate genes, one member of the duplicated pair usually degenerates within a few million years by accumulating deleterious mutations, while the other duplicate retains the original function. This model further predicts that on rare occasions, one duplicate may acquire a new adaptive function, resulting in the preservation of both members of the pair, one with the new function and the other retaining the old. However, empirical data suggest that a much greater proportion of gene duplicates is preserved than predicted by the classical model. Here we present a new conceptual framework for understanding the evolution of duplicate genes that may help explain this conundrum. Focusing on the regulatory complexity of eukaryotic genes, we show how complementary degenerative mutations in different regulatory elements of duplicated genes can facilitate the preservation of both duplicates, thereby increasing long-term opportunities for the evolution of new gene functions. The duplication-degeneration-complementation (DDC) model predicts that (1) degenerative mutations in regulatory elements can increase rather than reduce the probability of duplicate gene preservation and (2) the usual mechanism of duplicate gene preservation is the partitioning of ancestral functions rather than the evolution of new functions. We present several examples (including analysis of a new engrailed gene in zebrafish) that appear to be consistent with the DDC model, and we suggest several analytical and experimental approaches for determining whether the complementary loss of gene subfunctions or the acquisition of novel functions are likely to be the primary mechanisms for the preservation of gene duplicates. For a newly duplicated paralog, survival depends on the outcome of the race between

  20. Recurrent Gene Duplication Diversifies Genome Defense Repertoire in Drosophila.

    PubMed

    Levine, Mia T; Vander Wende, Helen M; Hsieh, Emily; Baker, EmilyClare P; Malik, Harmit S

    2016-07-01

    Transposable elements (TEs) comprise large fractions of many eukaryotic genomes and imperil host genome integrity. The host genome combats these challenges by encoding proteins that silence TE activity. Both the introduction of new TEs via horizontal transfer and TE sequence evolution requires constant innovation of host-encoded TE silencing machinery to keep pace with TEs. One form of host innovation is the adaptation of existing, single-copy host genes. Indeed, host suppressors of TE replication often harbor signatures of positive selection. Such signatures are especially evident in genes encoding the piwi-interacting-RNA pathway of gene silencing, for example, the female germline-restricted TE silencer, HP1D/Rhino Host genomes can also innovate via gene duplication and divergence. However, the importance of gene family expansions, contractions, and gene turnover to host genome defense has been largely unexplored. Here, we functionally characterize Oxpecker, a young, tandem duplicate gene of HP1D/rhino We demonstrate that Oxpecker supports female fertility in Drosophila melanogaster and silences several TE families that are incompletely silenced by HP1D/Rhino in the female germline. We further show that, like Oxpecker, at least ten additional, structurally diverse, HP1D/rhino-derived daughter and "granddaughter" genes emerged during a short 15-million year period of Drosophila evolution. These young paralogs are transcribed primarily in germline tissues, where the genetic conflict between host genomes and TEs plays out. Our findings suggest that gene family expansion is an underappreciated yet potent evolutionary mechanism of genome defense diversification. PMID:26979388

  1. Impact of recurrent gene duplication on adaptation of plant genomes

    PubMed Central

    2014-01-01

    Background Recurrent gene duplication and retention played an important role in angiosperm genome evolution. It has been hypothesized that these processes contribute significantly to plant adaptation but so far this hypothesis has not been tested at the genome scale. Results We studied available sequenced angiosperm genomes to assess the frequency of positive selection footprints in lineage specific expanded (LSE) gene families compared to single-copy genes using a dN/dS-based test in a phylogenetic framework. We found 5.38% of alignments in LSE genes with codons under positive selection. In contrast, we found no evidence for codons under positive selection in the single-copy reference set. An analysis at the branch level shows that purifying selection acted more strongly on single-copy genes than on LSE gene clusters. Moreover we detect significantly more branches indicating evolution under positive selection and/or relaxed constraint in LSE genes than in single-copy genes. Conclusions In this – to our knowledge –first genome-scale study we provide strong empirical support for the hypothesis that LSE genes fuel adaptation in angiosperms. Our conservative approach for detecting selection footprints as well as our results can be of interest for further studies on (plant) gene family evolution. PMID:24884640

  2. Duplicate gene divergence by changes in microRNA binding sites in Arabidopsis and Brassica.

    PubMed

    Wang, Sishuo; Adams, Keith L

    2015-03-01

    Gene duplication provides large numbers of new genes that can lead to the evolution of new functions. Duplicated genes can diverge by changes in sequences, expression patterns, and functions. MicroRNAs play an important role in the regulation of gene expression in many eukaryotes. After duplication, two paralogs may diverge in their microRNA binding sites, which might impact their expression and function. Little is known about conservation and divergence of microRNA binding sites in duplicated genes in plants. We analyzed microRNA binding sites in duplicated genes in Arabidopsis thaliana and Brassica rapa. We found that duplicates are more often targeted by microRNAs than singletons. The vast majority of duplicated genes in A. thaliana with microRNA binding sites show divergence in those sites between paralogs. Analysis of microRNA binding sites in genes derived from the ancient whole-genome triplication in B. rapa also revealed extensive divergence. Paralog pairs with divergent microRNA binding sites show more divergence in expression patterns compared with paralog pairs with the same microRNA binding sites in Arabidopsis. Close to half of the cases of binding site divergence are caused by microRNAs that are specific to the Arabidopsis genus, indicating evolutionarily recent gain of binding sites after target gene duplication. We also show rapid evolution of microRNA binding sites in a jacalin gene family. Our analyses reveal a dynamic process of changes in microRNA binding sites after gene duplication in Arabidopsis and highlight the role of microRNA regulation in the divergence and contrasting evolutionary fates of duplicated genes. PMID:25644246

  3. Expression of tandem gene duplicates is often greater than twofold

    PubMed Central

    Loehlin, David W.; Carroll, Sean B.

    2016-01-01

    Tandem gene duplication is an important mutational process in evolutionary adaptation and human disease. Hypothetically, two tandem gene copies should produce twice the output of a single gene, but this expectation has not been rigorously investigated. Here, we show that tandem duplication often results in more than double the gene activity. A naturally occurring tandem duplication of the Alcohol dehydrogenase (Adh) gene exhibits 2.6-fold greater expression than the single-copy gene in transgenic Drosophila. This tandem duplication also exhibits greater activity than two copies of the gene in trans, demonstrating that it is the tandem arrangement and not copy number that is the cause of overactivity. We also show that tandem duplication of an unrelated synthetic reporter gene is overactive (2.3- to 5.1-fold) at all sites in the genome that we tested, suggesting that overactivity could be a general property of tandem gene duplicates. Overactivity occurs at the level of RNA transcription, and therefore tandem duplicate overactivity appears to be a previously unidentified form of position effect. The increment of surplus gene expression observed is comparable to many regulatory mutations fixed in nature and, if typical of other genomes, would shape the fate of tandem duplicates in evolution. PMID:27162370

  4. TBK1 gene duplication and normal-tension glaucoma

    PubMed Central

    Ritch, Robert; Darbro, Ben; Menon, Geeta; Khanna, Cheryl L.; Solivan-Timpe, Frances; Roos, Ben R.; Sarfarzi, Mansoor; Kawase, Kazuhide; Yamamoto, Tetsuya; Robin, Alan L.; Lotery, Andrew J.; Fingert, John H.

    2015-01-01

    IMPORTANCE Normal-tension glaucoma (NTG) is a common cause of vision loss. OBJECTIVE To investigate the role of TANK binding kinase1(TBK1) gene duplications in NTG to gain insights into the causes of glaucoma that occurs at low intraocular pressure (IOP). DESIGN, SETTING, AND PARTICIPANTS In this multicenter case-control study, we investigated patients who met the criteria for NTG, including glaucomatous optic neuropathy, visual field defects, and maximum recorded untreated IOP of 21 mm Hg or less, and matched controls. Participants (N = 755) were recruited from Southampton, United Kingdom (180 patients and 178 controls), Rochester, Minnesota (65 patients and 12 controls), New York, New York (96 patients and 16 controls), and Iowa City, Iowa (208 controls). MAIN OUTCOMES AND MEASURES Detection of TBK1 gene duplications and comparison of the extent of the identified DNA that is duplicated with prior TBK1 copy number variations associated with NTG. RESULTS A TBK1 gene duplication was detected in 1 of 96 patients (1.0%) from New York and none of the controls. Analysis of duplication borders with comparative genome hybridization demonstrated that this patient has a novel duplication that has not been previously reported. No gene duplications were detected in any of the other cohorts of patients or controls. CONCLUSIONS AND RELEVANCE Duplication of the TBK1 gene is a rare cause of NTG. The identification of another case of NTG attributed to TBK1 gene duplication strengthens the case that this mutation causes glaucoma. PMID:24699864

  5. Recurrent Tandem Gene Duplication Gave Rise to Functionally Divergent Genes in Drosophila

    PubMed Central

    Chen, Ying; Long, Manyuan

    2008-01-01

    Tandem gene duplication is one of the major gene duplication mechanisms in eukaryotes, as illustrated by the prevalence of gene family clusters. Tandem duplicated paralogs usually share the same regulatory element, and as a consequence, they are likely to perform similar biological functions. Here, we provide an example of a newly evolved tandem duplicate acquiring novel functions, which were driven by positive selection. CG32708, CG32706, and CG6999 are 3 clustered genes residing in the X chromosome of Drosophila melanogaster. CG6999 and CG32708 have been examined for their molecular population genetic properties (Thornton and Long 2005). We further investigated the evolutionary forces acting on these genes with greater sample sizes and a broader approach that incorporate between-species divergence, using more variety of statistical methods. We explored the possible functional implications by characterizing the tissue-specific and developmental expression patterns of these genes. Sequence comparison of species within D. melanogaster subgroup reveals that this 3-gene cluster was created by 2 rounds of tandem gene duplication in the last 5 Myr. Based on phylogenetic analysis, CG32708 is clearly the parental copy that is shared by all species. CG32706 appears to have originated in the ancestor of Drosophila simulans and D. melanogaster about 5 Mya, and CG6999 is the newest duplicate that is unique to D. melanogaster. All 3 genes have different expression profiles, and CG6999 has in addition acquired a novel transcript. Biased polymorphism frequency spectrum, linkage disequilibrium, nucleotide substitution, and McDonald–Kreitman analyses suggested that the evolution of CG6999 and CG32706 were driven by positive Darwinian selection. PMID:18408233

  6. Multiple Routes to Subfunctionalization and Gene Duplicate Specialization

    PubMed Central

    Proulx, Stephen R.

    2012-01-01

    Gene duplication is arguably the most significant source of new functional genetic material. A better understanding of the processes that lead to the stable incorporation of gene duplications into the genome is important both because it relates to interspecific differences in genome composition and because it can shed light on why some classes of gene are more prone to duplication than others. Typically, models of gene duplication consider the periods before duplication, during the spread and fixation of a new duplicate, and following duplication as distinct phases without a common underlying selective environment. I consider a scenario where a gene that is initially expressed in multiple contexts can undergo mutations that alter its expression profile or its functional coding sequence. The selective regime that acts on the functional output of the allele copies carried by an individual is constant. If there is a potential selective benefit to having different coding sequences expressed in each context, then, regardless of the constraints on functional variation at the single-locus gene, the waiting time until a gene duplication is incorporated goes down as population size increases. PMID:22143920

  7. Protein Subcellular Relocalization Increases the Retention of Eukaryotic Duplicate Genes

    PubMed Central

    Byun, S. Ashley; Singh, Sarabdeep

    2013-01-01

    Gene duplication is widely accepted as a key evolutionary process, leading to new genes and novel protein functions. By providing the raw genetic material necessary for functional expansion, the mechanisms that involve the retention and functional diversification of duplicate genes are one of the central topics in evolutionary and comparative genomics. One proposed source of retention and functional diversification is protein subcellular relocalization (PSR). PSR postulates that changes in the subcellular location of eukaryotic duplicate proteins can positively modify function and therefore be beneficial to the organism. As such, PSR would promote retention of those relocalized duplicates and result in significantly lower death rates compared with death rates of nonrelocalized duplicate pairs. We surveyed both relocalized and nonrelocalized duplicate proteins from the available genomes and proteomes of 59 eukaryotic species and compared their relative death rates over a Ks range between 0 and 1. Using the Cox proportional hazard model, we observed that the death rates of relocalized duplicate pairs were significantly lower than the death rates of the duplicates without relocalization in most eukaryotic species examined in this study. These observations suggest that PSR significantly increases retention of duplicate genes and that it plays an important, but currently underappreciated, role in the evolution of eukaryotic genomes. PMID:24265504

  8. Germline duplication of ATG2B and GSKIP predisposes to familial myeloid malignancies.

    PubMed

    Saliba, Joseph; Saint-Martin, Cécile; Di Stefano, Antonio; Lenglet, Gaëlle; Marty, Caroline; Keren, Boris; Pasquier, Florence; Valle, Véronique Della; Secardin, Lise; Leroy, Gwendoline; Mahfoudhi, Emna; Grosjean, Sarah; Droin, Nathalie; Diop, M'boyba; Dessen, Philippe; Charrier, Sabine; Palazzo, Alberta; Merlevede, Jane; Meniane, Jean-Côme; Delaunay-Darivon, Christine; Fuseau, Pascal; Isnard, Françoise; Casadevall, Nicole; Solary, Eric; Debili, Najet; Bernard, Olivier A; Raslova, Hana; Najman, Albert; Vainchenker, William; Bellanné-Chantelot, Christine; Plo, Isabelle

    2015-10-01

    No major predisposition gene for familial myeloproliferative neoplasms (MPN) has been identified. Here we demonstrate that the autosomal dominant transmission of a 700-kb duplication in four genetically related families predisposes to myeloid malignancies, including MPN, frequently progressing to leukemia. Using induced pluripotent stem cells and primary cells, we demonstrate that overexpression of ATG2B and GSKIP enhances hematopoietic progenitor differentiation, including of megakaryocytes, by increasing progenitor sensitivity to thrombopoietin (TPO). ATG2B and GSKIP cooperate with acquired JAK2, MPL and CALR mutations during MPN development. Thus, the germline duplication may change the fitness of cells harboring signaling pathway mutations and increases the probability of disease development. PMID:26280900

  9. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

    PubMed

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

    2015-07-01

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392

  10. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution

    PubMed Central

    Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.

    2015-01-01

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392

  11. Histone modification pattern evolution after yeast gene duplication

    PubMed Central

    2012-01-01

    Background Gene duplication and subsequent functional divergence especially expression divergence have been widely considered as main sources for evolutionary innovations. Many studies evidenced that genetic regulatory network evolved rapidly shortly after gene duplication, thus leading to accelerated expression divergence and diversification. However, little is known whether epigenetic factors have mediated the evolution of expression regulation since gene duplication. In this study, we conducted detailed analyses on yeast histone modification (HM), the major epigenetics type in this organism, as well as other available functional genomics data to address this issue. Results Duplicate genes, on average, share more common HM-code patterns than random singleton pairs in their promoters and open reading frames (ORF). Though HM-code divergence between duplicates in both promoter and ORF regions increase with their sequence divergence, the HM-code in ORF region evolves slower than that in promoter region, probably owing to the functional constraints imposed on protein sequences. After excluding the confounding effect of sequence divergence (or evolutionary time), we found the evidence supporting the notion that in yeast, the HM-code may co-evolve with cis- and trans-regulatory factors. Moreover, we observed that deletion of some yeast HM-related enzymes increases the expression divergence between duplicate genes, yet the effect is lower than the case of transcription factor (TF) deletion or environmental stresses. Conclusions Our analyses demonstrate that after gene duplication, yeast histone modification profile between duplicates diverged with evolutionary time, similar to genetic regulatory elements. Moreover, we found the evidence of the co-evolution between genetic and epigenetic elements since gene duplication, together contributing to the expression divergence between duplicate genes. PMID:22776110

  12. Gene duplication and the evolution of moonlighting proteins.

    PubMed

    Espinosa-Cantú, Adriana; Ascencio, Diana; Barona-Gómez, Francisco; DeLuna, Alexander

    2015-01-01

    Gene duplication is a recurring phenomenon in genome evolution and a major driving force in the gain of biological functions. Here, we examine the role of gene duplication in the origin and maintenance of moonlighting proteins, with special focus on functional redundancy and innovation, molecular tradeoffs, and genetic robustness. An overview of specific examples-mainly from yeast-suggests a widespread conservation of moonlighting behavior in duplicate genes after long evolutionary times. Dosage amplification and incomplete subfunctionalization appear to be prevalent in the maintenance of multifunctionality. We discuss the role of gene-expression divergence and paralog responsiveness in moonlighting proteins with overlapping biochemical properties. Future studies analyzing multifunctional genes in a more systematic and comprehensive manner will not only enable a better understanding of how this emerging class of protein behavior originates and is maintained, but also provide new insights on the mechanisms of evolution by gene duplication. PMID:26217376

  13. A familial 7q36.3 duplication associated with agenesis of the corpus callosum.

    PubMed

    Wong, Keith; Moldrich, Randal; Hunter, Matthew; Edwards, Matthew; Finlay, David; O'Donnell, Sheridan; MacDougall, Tom; Bain, Nicole; Kamien, Benjamin

    2015-09-01

    Small chromosomal duplications involving 7q36.3 have rarely been reported. This clinical report describes four individuals from a three-generation family with agenesis of the corpus callosum (ACC) and a 0.73 Mb duplication of 7q36.3 detected by array CGH. The 7q36.3 duplication involves two genes: RNA Binding Motif Protein 33 (RBM33) and Sonic Hedgehog (SHH). Most affected family members had mild intellectual disability or borderline intellectual functioning, macrocephaly, a broad forehead, and widely spaced eyes. Two individuals had a Chiari type I malformation. This is the first family reported with ACC associated with a small duplication of these genes. While we cannot establish causation for the relationship between any single gene and the ACC in this family, there is a role for SHH in the formation of the corpus callosum through correct patterning and assembly of the commissural plate, and these data concur with vertebrate studies showing that a gain of SHH expands the facial primordium. PMID:25944787

  14. Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor gene regions

    PubMed Central

    2008-01-01

    Background One of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY) receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes) and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10. Results Chromosome regions with conserved synteny were identified and confirmed by phylogenetic analyses in H. sapiens, M. musculus, D. rerio, T. rubripes and T. nigroviridis. 26 gene families, including the NPY receptor genes, (plus 3 described recently by other labs) showed a tree topology consistent with duplications in early vertebrate evolution and in the actinopterygian lineage, thereby supporting expansion through block duplications. Eight gene families had complications that precluded analysis (such as short sequence length or variable number of repeated domains) and another eight families did not support block duplications (because the paralogs in these families seem to have originated in another time window than the proposed genome duplication events). RT-PCR carried out with several tissues in T. rubripes revealed that all five NPY receptors were expressed in the brain and subtypes Y2, Y4 and Y8 were also expressed in peripheral organs. Conclusion We conclude that the phylogenetic analyses and chromosomal locations of these gene families support duplications of large blocks of genes or even entire chromosomes. Thus, these results are consistent with two early vertebrate tetraploidizations forming a

  15. Gene duplication, genome duplication, and the functional diversification of vertebrate globins

    PubMed Central

    Storz, Jay F.; Opazo, Juan C.; Hoffmann, Federico G.

    2015-01-01

    The functional diversification of the vertebrate globin gene superfamily provides an especially vivid illustration of the role of gene duplication and whole-genome duplication in promoting evolutionary innovation. For example, key globin proteins that evolved specialized functions in various aspects of oxidative metabolism and oxygen signaling pathways (hemoglobin [Hb], myoglobin [Mb], and cytoglobin [Cygb]) trace their origins to two whole-genome duplication events in the stem lineage of vertebrates. The retention of the proto-Hb and Mb genes in the ancestor of jawed vertebrates permitted a physiological division of labor between the oxygen-carrier function of Hb and the oxygen-storage function of Mb. In the Hb gene lineage, a subsequent tandem gene duplication gave rise to the proto α- and β-globin genes, which permitted the formation of multimeric Hbs composed of unlike subunits (α2β2). The evolution of this heteromeric quaternary structure was central to the emergence of Hb as a specialized oxygen-transport protein because it provided a mechanism for cooperative oxygen-binding and allosteric regulatory control. Subsequent rounds of duplication and divergence have produced diverse repertoires of α- and β-like globin genes that are ontogenetically regulated such that functionally distinct Hb isoforms are expressed during different stages of prenatal development and postnatal life. In the ancestor of jawless fishes, the proto Mb and Hb genes appear to have been secondarily lost, and the Cygb homolog evolved a specialized respiratory function in blood-oxygen transport. Phylogenetic and comparative genomic analyses of the vertebrate globin gene superfamily have revealed numerous instances in which paralogous globins have convergently evolved similar expression patterns and/or similar functional specializations in different organismal lineages. PMID:22846683

  16. Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.

    PubMed

    Gout, Jean-Francois; Lynch, Michael

    2015-08-01

    Whole-genome duplications (WGDs) have contributed to gene-repertoire enrichment in many eukaryotic lineages. However, most duplicated genes are eventually lost and it is still unclear why some duplicated genes are evolutionary successful whereas others quickly turn to pseudogenes. Here, we show that dosage constraints are major factors opposing post-WGD gene loss in several Paramecium species that share a common ancestral WGD. We propose a model where a majority of WGD-derived duplicates preserve their ancestral function and are retained to produce enough of the proteins performing this same ancestral function. Under this model, the expression level of individual duplicated genes can evolve neutrally as long as they maintain a roughly constant summed expression, and this allows random genetic drift toward uneven contributions of the two copies to total expression. Our analysis suggests that once a high level of imbalance is reached, which can require substantial lengths of time, the copy with the lowest expression level contributes a small enough fraction of the total expression that selection no longer opposes its loss. Extension of our analysis to yeast species sharing a common ancestral WGD yields similar results, suggesting that duplicated-gene retention for dosage constraints followed by divergence in expression level and eventual deterministic gene loss might be a universal feature of post-WGD evolution. PMID:25908670

  17. Divergence of recently duplicated M{gamma}-type MADS-box genes in Petunia.

    PubMed

    Bemer, Marian; Gordon, Jonathan; Weterings, Koen; Angenent, Gerco C

    2010-02-01

    The MADS-box transcription factor family has expanded considerably in plants via gene and genome duplications and can be subdivided into type I and MIKC-type genes. The two gene classes show a different evolutionary history. Whereas the MIKC-type genes originated during ancient genome duplications, as well as during more recent events, the type I loci appear to experience high turnover with many recent duplications. This different mode of origin also suggests a different fate for the type I duplicates, which are thought to have a higher chance to become silenced or lost from the genome. To get more insight into the evolution of the type I MADS-box genes, we isolated nine type I genes from Petunia, which belong to the Mgamma subclass, and investigated the divergence of their coding and regulatory regions. The isolated genes could be subdivided into two categories: two genes were highly similar to Arabidopsis Mgamma-type genes, whereas the other seven genes showed less similarity to Arabidopsis genes and originated more recently. Two of the recently duplicated genes were found to contain deleterious mutations in their coding regions, and expression analysis revealed that a third paralog was silenced by mutations in its regulatory region. However, in addition to the three genes that were subjected to nonfunctionalization, we also found evidence for neofunctionalization of one of the Petunia Mgamma-type genes. Our study shows a rapid divergence of recently duplicated Mgamma-type MADS-box genes and suggests that redundancy among type I paralogs may be less common than expected. PMID:19933156

  18. Comparative Evolution of Duplicated Ddx3 Genes in Teleosts: Insights from Japanese Flounder, Paralichthys olivaceus

    PubMed Central

    Wang, Zhongkai; Liu, Wei; Song, Huayu; Wang, Huizhen; Liu, Jinxiang; Zhao, Haitao; Du, Xinxin; Zhang, Quanqi

    2015-01-01

    Following the two rounds of whole-genome duplication that occurred during deuterostome evolution, a third genome duplication event occurred in the stem lineage of ray-finned fishes. This teleost-specific genome duplication is thought to be responsible for the biological diversification of ray-finned fishes. DEAD-box polypeptide 3 (DDX3) belongs to the DEAD-box RNA helicase family. Although their functions in humans have been well studied, limited information is available regarding their function in teleosts. In this study, two teleost Ddx3 genes were first identified in the transcriptome of Japanese flounder (Paralichthys olivaceus). We confirmed that the two genes originated from teleost-specific genome duplication through synteny and phylogenetic analysis. Additionally, comparative analysis of genome structure, molecular evolution rate, and expression pattern of the two genes in Japanese flounder revealed evidence of subfunctionalization of the duplicated Ddx3 genes in teleosts. Thus, the results of this study reveal novel insights into the evolution of the teleost Ddx3 genes and constitute important groundwork for further research on this gene family. PMID:26109358

  19. The duplication of the Hox gene clusters in teleost fishes.

    PubMed

    Prohaska, Sonja J; Stadler, Peter F

    2004-06-01

    Higher teleost fishes, including zebrafish and fugu, have duplicated their Hox genes relative to the gene inventory of other gnathostome lineages. The most widely accepted theory contends that the duplicate Hox clusters orginated synchronously during a single genome duplication event in the early history of ray-finned fishes. In this contribution we collect and re-evaluate all publicly available sequence information. In particular, we show that the short Hox gene fragments from published PCR surveys of the killifish Fundulus heteroclitus, the medaka Oryzias latipes and the goldfish Carassius auratus can be used to determine with little ambiguity not only their paralog group but also their membership in a particular cluster.Together with a survey of the genomic sequence data from the pufferfish Tetraodon nigroviridis we show that at least percomorpha, and possibly all eutelosts, share a system of 7 or 8 orthologous Hox gene clusters. There is little doubt about the orthology of the two teleost duplicates of the HoxA and HoxB clusters. A careful analysis of both the coding sequence of Hox genes and of conserved non-coding sequences provides additional support for the "duplication early" hypothesis that the Hox clusters in teleosts are derived from eight ancestral clusters by means of subsequent gene loss; the data remain ambiguous, however, in particular for the HoxC clusters.Assuming the "duplication early" hypothesis we use the new evidence on the Hox gene complements to determine the phylogenetic positions of gene-loss events in the wake of the cluster duplication. Surprisingly, we find that the resolution of redundancy seems to be a slow process that is still ongoing. A few suggestions on which additional sequence data would be most informative for resolving the history of the teleostean Hox genes are discussed. PMID:18202881

  20. Explosive tandem and segmental duplications of multigenic families in Eucalyptus grandis.

    PubMed

    Li, Qiang; Yu, Hong; Cao, Phi Bang; Fawal, Nizar; Mathé, Catherine; Azar, Sahar; Cassan-Wang, Hua; Myburg, Alexander A; Grima-Pettenati, Jacqueline; Marque, Christiane; Teulières, Chantal; Dunand, Christophe

    2015-04-01

    Plant organisms contain a large number of genes belonging to numerous multigenic families whose evolution size reflects some functional constraints. Sequences from eight multigenic families, involved in biotic and abiotic responses, have been analyzed in Eucalyptus grandis and compared with Arabidopsis thaliana. Two transcription factor families APETALA 2 (AP2)/ethylene responsive factor and GRAS, two auxin transporter families PIN-FORMED and AUX/LAX, two oxidoreductase families (ascorbate peroxidases [APx] and Class III peroxidases [CIII Prx]), and two families of protective molecules late embryogenesis abundant (LEA) and DNAj were annotated in expert and exhaustive manner. Many recent tandem duplications leading to the emergence of species-specific gene clusters and the explosion of the gene numbers have been observed for the AP2, GRAS, LEA, PIN, and CIII Prx in E. grandis, while the APx, the AUX/LAX and DNAj are conserved between species. Although no direct evidence has yet demonstrated the roles of these recent duplicated genes observed in E. grandis, this could indicate their putative implications in the morphological and physiological characteristics of E. grandis, and be the key factor for the survival of this nondormant species. Global analysis of key families would be a good criterion to evaluate the capabilities of some organisms to adapt to environmental variations. PMID:25769696

  1. Explosive Tandem and Segmental Duplications of Multigenic Families in Eucalyptus grandis

    PubMed Central

    Li, Qiang; Yu, Hong; Cao, Phi Bang; Fawal, Nizar; Mathé, Catherine; Azar, Sahar; Cassan-Wang, Hua; Myburg, Alexander A.; Grima-Pettenati, Jacqueline; Marque, Christiane; Teulières, Chantal; Dunand, Christophe

    2015-01-01

    Plant organisms contain a large number of genes belonging to numerous multigenic families whose evolution size reflects some functional constraints. Sequences from eight multigenic families, involved in biotic and abiotic responses, have been analyzed in Eucalyptus grandis and compared with Arabidopsis thaliana. Two transcription factor families APETALA 2 (AP2)/ethylene responsive factor and GRAS, two auxin transporter families PIN-FORMED and AUX/LAX, two oxidoreductase families (ascorbate peroxidases [APx] and Class III peroxidases [CIII Prx]), and two families of protective molecules late embryogenesis abundant (LEA) and DNAj were annotated in expert and exhaustive manner. Many recent tandem duplications leading to the emergence of species-specific gene clusters and the explosion of the gene numbers have been observed for the AP2, GRAS, LEA, PIN, and CIII Prx in E. grandis, while the APx, the AUX/LAX and DNAj are conserved between species. Although no direct evidence has yet demonstrated the roles of these recent duplicated genes observed in E. grandis, this could indicate their putative implications in the morphological and physiological characteristics of E. grandis, and be the key factor for the survival of this nondormant species. Global analysis of key families would be a good criterion to evaluate the capabilities of some organisms to adapt to environmental variations. PMID:25769696

  2. Assessment and Reconstruction of Novel HSP90 Genes: Duplications, Gains and Losses in Fungal and Animal Lineages

    PubMed Central

    Pantzartzi, Chrysoula N.; Drosopoulou, Elena; Scouras, Zacharias G.

    2013-01-01

    Hsp90s, members of the Heat Shock Protein class, protect the structure and function of proteins and play a significant task in cellular homeostasis and signal transduction. In order to determine the number of hsp90 gene copies and encoded proteins in fungal and animal lineages and through that key duplication events that this family has undergone, we collected and evaluated Hsp90 protein sequences and corresponding Expressed Sequence Tags and analyzed available genomes from various taxa. We provide evidence for duplication events affecting either single species or wider taxonomic groups. With regard to Fungi, duplicated genes have been detected in several lineages. In invertebrates, we demonstrate key duplication events in certain clades of Arthropoda and Mollusca, and a possible gene loss event in a hymenopteran family. Finally, we infer that the duplication event responsible for the two (a and b) isoforms in vertebrates occurred probably shortly after the split of Hyperoartia and Gnathostomata. PMID:24066039

  3. Prevertebrate Local Gene Duplication Facilitated Expansion of the Neuropeptide GPCR Superfamily.

    PubMed

    Yun, Seongsik; Furlong, Michael; Sim, Mikang; Cho, Minah; Park, Sumi; Cho, Eun Bee; Reyes-Alcaraz, Arfaxad; Hwang, Jong-Ik; Kim, Jaebum; Seong, Jae Young

    2015-11-01

    In humans, numerous genes encode neuropeptides that comprise a superfamily of more than 70 genes in approximately 30 families and act mainly through rhodopsin-like G protein-coupled receptors (GPCRs). Two rounds of whole-genome duplication (2R WGD) during early vertebrate evolution greatly contributed to proliferation within gene families; however, the mechanisms underlying the initial emergence and diversification of these gene families before 2R WGD are largely unknown. In this study, we analyzed 25 vertebrate rhodopsin-like neuropeptide GPCR families and their cognate peptides using phylogeny, synteny, and localization of these genes on reconstructed vertebrate ancestral chromosomes (VACs). Based on phylogeny, these GPCR families can be divided into five distinct clades, and members of each clade tend to be located on the same VACs. Similarly, their neuropeptide gene families also tend to reside on distinct VACs. Comparison of these GPCR genes with those of invertebrates including Drosophila melanogaster, Caenorhabditis elegans, Branchiostoma floridae, and Ciona intestinalis indicates that these GPCR families emerged through tandem local duplication during metazoan evolution prior to 2R WGD. Our study describes a presumptive evolutionary mechanism and development pathway of the vertebrate rhodopsin-like GPCR and cognate neuropeptide families from the urbilaterian ancestor to modern vertebrates. PMID:26337547

  4. A salmonid EST genomic study: genes, duplications, phylogeny and microarrays

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most wide...

  5. The Zebrafish Annexin Gene Family

    PubMed Central

    Farber, Steven A.; De Rose, Robert A.; Olson, Eric S.; Halpern, Marnie E.

    2003-01-01

    The Annexins (ANXs) are a family of calcium- and phospholipid-binding proteins that have been implicated in many cellular processes, including channel formation, membrane fusion, vesicle transport, and regulation of phospholipase A2 activity. As a first step toward understanding in vivo function, we have cloned 11 zebrafish anx genes. Four genes (anx1a, anx2a, anx5,and anx11a) were identified by screening a zebrafish cDNA library with a Xenopus anx2 fragment. For these genes, full-length cDNA sequences were used to cluster 212 EST sequences generated by the Zebrafish Genome Resources Project. The EST analysis revealed seven additional anx genes that were subsequently cloned. The genetic map positions of all 11 genes were determined by using a zebrafish radiation hybrid panel. Sequence and syntenic relationships between zebrafish and human genes indicate that the 11 genes represent orthologs of human anx1,2,4,5,6,11,13,and suggest that several zebrafish anx genes resulted from duplications that arose after divergence of the zebrafish and mammalian genomes. Zebrafish anx genes are expressed in a wide range of tissues during embryonic and larval stages. Analysis of the expression patterns of duplicated genes revealed both redundancy and divergence, with the most similar genes having almost identical tissue-specific patterns of expression and with less similar duplicates showing no overlap. The differences in gene expression of recently duplicated anx genes could explain why highly related paralogs were maintained in the genome and did not rapidly become pseudogenes. PMID:12799347

  6. Multiple bursts of pancreatic ribonuclease gene duplication in insect-eating bats.

    PubMed

    Xu, Huihui; Liu, Yang; Meng, Fanxing; He, Beibei; Han, Naijian; Li, Gang; Rossiter, Stephen J; Zhang, Shuyi

    2013-09-10

    Pancreatic ribonuclease gene (RNASE1) was previously shown to have undergone duplication and adaptive evolution related to digestive efficiency in several mammalian groups that have evolved foregut fermentation, including ruminants and some primates. RNASE1 gene duplications thought to be linked to diet have also been recorded in some carnivores. Of all mammals, bats have evolved the most diverse dietary specializations, mainly including frugivory and insectivory. Here we cloned, sequenced and analyzed RNASE1 gene sequences from a range of bat species to determine whether their dietary adaptation is mirrored by molecular adaptation. We found that seven insect-eating members of the families Vespertilionidae and Molossidae possessed two or more duplicates, and we also detected three pseudogenes. Reconstructed RNASE1 gene trees based on both Bayesian and maximum likelihood methods supported independent duplication events in these two families. Selection tests revealed that RNASE1 gene duplicates have undergone episodes of positive selection indicative of functional modification, and lineage-specific tests revealed strong adaptive evolution in the Tadarida β clade. However, unlike the RNASE1 duplicates that function in digestion in some mammals, the bat RNASE1 sequences were found to be characterized by relatively high isoelectric points, a feature previously suggested to promote defense against viruses via the breakdown of double-stranded RNA. Taken together, our findings point to an adaptive diversification of RNASE1 in these two bat families, although we find no clear evidence that this was driven by diet. Future experimental assays are needed to resolve the functions of these enzymes in bats. PMID:23644026

  7. Alternative Transposition Generates New Chimeric Genes and Segmental Duplications at the Maize p1 Locus.

    PubMed

    Wang, Dafang; Yu, Chuanhe; Zuo, Tao; Zhang, Jianbo; Weber, David F; Peterson, Thomas

    2015-11-01

    The maize Ac/Ds transposon family was the first transposable element system identified and characterized by Barbara McClintock. Ac/Ds transposons belong to the hAT family of class II DNA transposons. We and others have shown that Ac/Ds elements can undergo a process of alternative transposition in which the Ac/Ds transposase acts on the termini of two separate, nearby transposons. Because these termini are present in different elements, alternative transposition can generate a variety of genome alterations such as inversions, duplications, deletions, and translocations. Moreover, Ac/Ds elements transpose preferentially into genic regions, suggesting that structural changes arising from alternative transposition may potentially generate chimeric genes at the rearrangement breakpoints. Here we identified and characterized 11 independent cases of gene fusion induced by Ac alternative transposition. In each case, a functional chimeric gene was created by fusion of two linked, paralogous genes; moreover, each event was associated with duplication of the ∼70-kb segment located between the two paralogs. An extant gene in the maize B73 genome that contains an internal duplication apparently generated by an alternative transposition event was also identified. Our study demonstrates that alternative transposition-induced duplications may be a source for spontaneous creation of diverse genome structures and novel genes in maize. PMID:26434719

  8. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders.

    PubMed

    Haney, Robert A; Clarke, Thomas H; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y; Ayoub, Nadia A; Garb, Jessica E

    2016-01-01

    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland-specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576

  9. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders

    PubMed Central

    Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.

    2016-01-01

    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576

  10. Impact of gene family evolutionary histories on phylogenetic species tree inference by gene tree parsimony.

    PubMed

    Shi, Tao

    2016-03-01

    Complicated history of gene duplication and loss brings challenge to molecular phylogenetic inference, especially in deep phylogenies. However, phylogenomic approaches, such as gene tree parsimony (GTP), show advantage over some other approaches in its ability to use gene families with duplications. GTP searches the 'optimal' species tree by minimizing the total cost of biological events such as duplications, but accuracy of GTP and phylogenetic signal in the context of different gene families with distinct histories of duplication and loss are unclear. To evaluate how different evolutionary properties of different gene families can impact on species tree inference, 3900 gene families from seven angiosperms encompassing a wide range of gene content, lineage-specific expansions and contractions were analyzed. It was found that the gene content and total duplication number in a gene family strongly influence species tree inference accuracy, with the highest accuracy achieved at either very low or very high gene content (or duplication number) and lowest accuracy centered in intermediate gene content (or duplication number), as the relationship can fit a binomial regression. Besides, for gene families of similar level of average gene content, those with relatively higher lineage-specific expansion or duplication rates tend to show lower accuracy. Additional correlation tests support that high accuracy for those gene families with large gene content may rely on abundant ancestral copies to provide many subtrees to resolve conflicts, whereas high accuracy for single or low copy gene families are just subject to sequence substitution per se. Very low accuracy reached by gene families of intermediate gene content or duplication number can be due to insufficient subtrees to resolve the conflicts from loss of alternative copies. As these evolutionary properties can significantly influence species tree accuracy, I discussed the potential weighting of the duplication cost by

  11. PGDD: a database of gene and genome duplication in plants

    PubMed Central

    Lee, Tae-Ho; Tang, Haibao; Wang, Xiyin; Paterson, Andrew H.

    2013-01-01

    Genome duplication (GD) has permanently shaped the architecture and function of many higher eukaryotic genomes. The angiosperms (flowering plants) are outstanding models in which to elucidate consequences of GD for higher eukaryotes, owing to their propensity for chromosomal duplication or even triplication in a few cases. Duplicated genome structures often require both intra- and inter-genome alignments to unravel their evolutionary history, also providing the means to deduce both obvious and otherwise-cryptic orthology, paralogy and other relationships among genes. The burgeoning sets of angiosperm genome sequences provide the foundation for a host of investigations into the functional and evolutionary consequences of gene and GD. To provide genome alignments from a single resource based on uniform standards that have been validated by empirical studies, we built the Plant Genome Duplication Database (PGDD; freely available at http://chibba.agtec.uga.edu/duplication/), a web service providing synteny information in terms of colinearity between chromosomes. At present, PGDD contains data for 26 plants including bryophytes and chlorophyta, as well as angiosperms with draft genome sequences. In addition to the inclusion of new genomes as they become available, we are preparing new functions to enhance PGDD. PMID:23180799

  12. Evaluating and Characterizing Ancient Whole-Genome Duplications in Plants with Gene Count Data

    PubMed Central

    Tiley, George P.; Ané, Cécile; Burleigh, J. Gordon

    2016-01-01

    Whole-genome duplications (WGDs) have helped shape the genomes of land plants, and recent evidence suggests that the genomes of all angiosperms have experienced at least two ancient WGDs. In plants, WGDs often are followed by rapid fractionation, in which many homeologous gene copies are lost. Thus, it can be extremely difficult to identify, let alone characterize, ancient WGDs. In this study, we use a new maximum likelihood estimator to test for evidence of ancient WGDs in land plants and estimate the fraction of new genes copies that are retained following a WGD using gene count data, the number of gene copies in gene families. We identified evidence of many putative ancient WGDs in land plants and found that the genome fractionation rates vary tremendously among ancient WGDs. Analyses of WGDs within Brassicales also indicate that background gene duplication and loss rates vary across land plants, and different gene families have different probabilities of being retained following a WGD. Although our analyses are largely robust to errors in duplication and loss rates and the choice of priors, simulations indicate that this method can have trouble detecting multiple WGDs that occur on the same branch, especially when the gene retention rates for ancient WGDs are very low. They also suggest that we should carefully evaluate evidence for some ancient plant WGD hypotheses. PMID:26988251

  13. Evaluating and Characterizing Ancient Whole-Genome Duplications in Plants with Gene Count Data.

    PubMed

    Tiley, George P; Ané, Cécile; Burleigh, J Gordon

    2016-01-01

    Whole-genome duplications (WGDs) have helped shape the genomes of land plants, and recent evidence suggests that the genomes of all angiosperms have experienced at least two ancient WGDs. In plants, WGDs often are followed by rapid fractionation, in which many homeologous gene copies are lost. Thus, it can be extremely difficult to identify, let alone characterize, ancient WGDs. In this study, we use a new maximum likelihood estimator to test for evidence of ancient WGDs in land plants and estimate the fraction of new genes copies that are retained following a WGD using gene count data, the number of gene copies in gene families. We identified evidence of many putative ancient WGDs in land plants and found that the genome fractionation rates vary tremendously among ancient WGDs. Analyses of WGDs within Brassicales also indicate that background gene duplication and loss rates vary across land plants, and different gene families have different probabilities of being retained following a WGD. Although our analyses are largely robust to errors in duplication and loss rates and the choice of priors, simulations indicate that this method can have trouble detecting multiple WGDs that occur on the same branch, especially when the gene retention rates for ancient WGDs are very low. They also suggest that we should carefully evaluate evidence for some ancient plant WGD hypotheses. PMID:26988251

  14. Inferring optimal species trees under gene duplication and loss.

    PubMed

    Bayzid, M S; Mirarab, S; Warnow, T

    2013-01-01

    Species tree estimation from multiple markers is complicated by the fact that gene trees can differ from each other (and from the true species tree) due to several biological processes, one of which is gene duplication and loss. Local search heuristics for two NP-hard optimization problems - minimize gene duplications (MGD) and minimize gene duplications and losses (MGDL) - are popular techniques for estimating species trees in the presence of gene duplication and loss. In this paper, we present an alternative approach to solving MGD and MGDL from rooted gene trees. First, we characterize each tree in terms of its "subtree-bipartitions" (a concept we introduce). Then we show that the MGD species tree is defined by a maximum weight clique in a vertex-weighted graph that can be computed from the subtree-bipartitions of the input gene trees, and the MGDL species tree is defined by a minimum weight clique in a similarly constructed graph. We also show that these optimal cliques can be found in polynomial time in the number of vertices of the graph using a dynamic programming algorithm (similar to that of Hallett and Lagergren(1)), because of the special structure of the graphs. Finally, we show that a constrained version of these problems, where the subtree-bipartitions of the species tree are drawn from the subtree-bipartitions of the input gene trees, can be solved in time that is polynomial in the number of gene trees and taxa. We have implemented our dynamic programming algorithm in a publicly available software tool, available at http://www.cs.utexas.edu/users/phylo/software/dynadup/. PMID:23424130

  15. Ancient intron insertion sites and palindromic genomic duplication evolutionally shapes an elementally functioning membrane protein family

    PubMed Central

    Tanaka-Kunishima, Motoko; Ishida, Yoshihiro; Takahashi, Kunitaro; Honda, Motoo; Oonuma, Takashi

    2007-01-01

    Background In spite of the recent accumulation of genomic data, the evolutionary pathway in the individual genes of present-day living taxa is still elusive for most genes. Among ion channels, inward K+ rectifier (IRK) channels are the fundamental and well-defined protein group. We analyzed the genomic structures of this group and compared them among a phylogenetically wide range with our sequenced Halocynthia roretzi, a tunicate, IRK genomic genes. Results A total of 131 IRK genomic genes were analyzed. The phylogenic trees of amino acid sequences revealed a clear diversification of deuterostomic IRKs from protostomic IRKs and suggested that the tunicate IRKs are possibly representatives of the descendants of ancestor forms of three major groups of IRKs in the vertebrate. However, the exon-intron structures of the tunicate IRK genomes showed considerable similarities to those of Caenorhabditis. In the vertebrate clade, the members in each major group increased at least four times those in the tunicate by various types of global gene duplication. The generation of some major groups was inferred to be due to anti-tandem (palindromic) duplication in early history. The intron insertion points greatly decreased during the evolution of the vertebrates, remaining as a unique conservation of an intron insertion site in the portion of protein-protein interaction within the coding regions of all vertebrate G-protein-activated IRK genes. Conclusion From the genomic survey of a family of IRK genes, it was suggested that the ancient intron insertion sites and the unique palindromic genomic duplication evolutionally shaped this membrane protein family. PMID:17708769

  16. Evolution of Arabidopsis microRNA families through duplication events

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recently there has been a great interest in the identification of microRNAs and their targets as well as understanding the spatial and temporal regulation of microRNA genes. To understand how microRNA genes evolve, we looked at several rapidly evolving families in Arabidopsis thaliana, and found th...

  17. Functional divergence in tandemly duplicated Arabidopsis thaliana trypsin inhibitor genes.

    PubMed Central

    Clauss, M J; Mitchell-Olds, T

    2004-01-01

    In multigene families, variation among loci and alleles can contribute to trait evolution. We explored patterns of functional and genetic variation in six duplicated Arabidopsis thaliana trypsin inhibitor (ATTI) loci. We demonstrate significant variation in constitutive and herbivore-induced transcription among ATTI loci that show, on average, 65% sequence divergence. Significant variation in ATTI expression was also found between two molecularly defined haplotype classes. Population genetic analyses for 17 accessions of A. thaliana showed that six ATTI loci arranged in tandem within 10 kb varied 10-fold in nucleotide diversity, from 0.0009 to 0.0110, and identified a minimum of six recombination events throughout the tandem array. We observed a significant peak in nucleotide and indel polymorphism spanning ATTI loci in the interior of the array, due primarily to divergence between the two haplotype classes. Significant deviation from the neutral equilibrium model for individual genes was interpreted within the context of intergene linkage disequilibrium and correlated patterns of functional differentiation. In contrast to the outcrosser Arabidopsis lyrata for which recombination is observed even within ATTI loci, our data suggest that response to selection was slowed in the inbreeding, annual A. thaliana because of interference among functionally divergent ATTI loci. PMID:15082560

  18. Gene duplication of the human peptide YY gene (PYY) generated the pancreatic polypeptide gene (PPY) on chromosome 17q21.1

    SciTech Connect

    Hort, Y.; Shine, J.; Herzog, H.

    1995-03-01

    Neuropeptide Y (NPY), peptide YY (PYY), and pancreatic polypeptide (PP) are structurally related but functionally diverse peptides, encoded by separate genes and expressed in different tissues. Although the human NPY gene has been mapped to chromosome 7, the authors demonstrate here that the genes for human PYY and PP (PPY) are localized only 10 kb apart from each another on chromosome 17q21.1. The high degree of homology between the members of this gene family, both in primary sequence and exon/intron structure, suggests that the NYP and the PYY genes arose from an initial gene duplication event, with a subsequent tandem duplication of the PYY gene being responsible for the creation of the PPY gene. A second weaker hybridization signal also found on chromosome 17q11 and results obtained by Southern blot analysis suggest that the entire PYY-PPY region has undergone a further duplication event. 27 refs., 5 figs.

  19. Recombination facilitates neofunctionalization of duplicate genes via originalization

    PubMed Central

    2010-01-01

    Background Recently originalization was proposed to be an effective way of duplicate-gene preservation, in which recombination provokes the high frequency of original (or wild-type) allele on both duplicated loci. Because the high frequency of wild-type allele might drive the arising and accumulating of advantageous mutation, it is hypothesized that recombination might enlarge the probability of neofunctionalization (Pneo) of duplicate genes. In this article this hypothesis has been tested theoretically. Results Results show that through originalization recombination might not only shorten mean time to neofunctionalizaiton, but also enlarge Pneo. Conclusions Therefore, recombination might facilitate neofunctionalization via originalization. Several extensive applications of these results on genomic evolution have been discussed: 1. Time to nonfunctionalization can be much longer than a few million generations expected before; 2. Homogenization on duplicated loci results from not only gene conversion, but also originalization; 3. Although the rate of advantageous mutation is much small compared with that of degenerative mutation, Pneo cannot be expected to be small. PMID:20534125

  20. Species-specific duplications of NBS-encoding genes in Chinese chestnut (Castanea mollissima)

    PubMed Central

    Zhong, Yan; Li, Yingjun; Huang, Kaihui; Cheng, Zong-Ming

    2015-01-01

    The disease resistance (R) genes play an important role in protecting plants from infection by diverse pathogens in the environment. The nucleotide-binding site (NBS)-leucine-rich repeat (LRR) class of genes is one of the largest R gene families. Chinese chestnut (Castanea mollissima) is resistant to Chestnut Blight Disease, but relatively little is known about the resistance mechanism. We identified 519 NBS-encoding genes, including 374 NBS-LRR genes and 145 NBS-only genes. The majority of Ka/Ks were less than 1, suggesting the purifying selection operated during the evolutionary history of NBS-encoding genes. A minority (4/34) of Ka/Ks in non-TIR gene families were greater than 1, showing that some genes were under positive selection pressure. Furthermore, Ks peaked at a range of 0.4 to 0.5, indicating that ancient duplications arose during the evolution. The relationship between Ka/Ks and Ks indicated greater selective pressure on the newer and older genes with the critical value of Ks = 0.4–0.5. Notably, species-specific duplications were detected in NBS-encoding genes. In addition, the group of RPW8-NBS-encoding genes clustered together as an independent clade located at a relatively basal position in the phylogenetic tree. Many cis-acting elements related to plant defense responses were detected in promoters of NBS-encoding genes. PMID:26559332

  1. Species-specific duplications of NBS-encoding genes in Chinese chestnut (Castanea mollissima).

    PubMed

    Zhong, Yan; Li, Yingjun; Huang, Kaihui; Cheng, Zong-Ming

    2015-01-01

    The disease resistance (R) genes play an important role in protecting plants from infection by diverse pathogens in the environment. The nucleotide-binding site (NBS)-leucine-rich repeat (LRR) class of genes is one of the largest R gene families. Chinese chestnut (Castanea mollissima) is resistant to Chestnut Blight Disease, but relatively little is known about the resistance mechanism. We identified 519 NBS-encoding genes, including 374 NBS-LRR genes and 145 NBS-only genes. The majority of Ka/Ks were less than 1, suggesting the purifying selection operated during the evolutionary history of NBS-encoding genes. A minority (4/34) of Ka/Ks in non-TIR gene families were greater than 1, showing that some genes were under positive selection pressure. Furthermore, Ks peaked at a range of 0.4 to 0.5, indicating that ancient duplications arose during the evolution. The relationship between Ka/Ks and Ks indicated greater selective pressure on the newer and older genes with the critical value of Ks = 0.4-0.5. Notably, species-specific duplications were detected in NBS-encoding genes. In addition, the group of RPW8-NBS-encoding genes clustered together as an independent clade located at a relatively basal position in the phylogenetic tree. Many cis-acting elements related to plant defense responses were detected in promoters of NBS-encoding genes. PMID:26559332

  2. Functional resolution of duplicated hoxb5 genes in teleosts.

    PubMed

    Jarinova, Olga; Hatch, Gary; Poitras, Luc; Prudhomme, Christelle; Grzyb, Magdalena; Aubin, Josée; Bérubé-Simard, Félix-Antoine; Jeannotte, Lucie; Ekker, Marc

    2008-11-01

    The duplication-degeneration-complementation (DDC) model predicts that subfunctionalization of duplicated genes is a common mechanism for their preservation. The additional Hox complexes of teleost fish constitute a good system in which to test this hypothesis. Zebrafish have two hoxb complexes, with two hoxb5 genes, hoxb5a and hoxb5b, the expression patterns of which suggest subfunctionalization of an ancestral hoxb5 gene. We characterized conserved non-coding elements (CNEs) near the zebrafish hoxb5 genes. One CNE, J3, is only retained in the hoxb5a locus, whereas the others, J1 and J2, are present in both hoxb5 loci. When tested individually, the enhancer activity of individual CNEs, including J3, extensively overlapped and did not support a role in subfunctionalization. By contrast, reporter transgene constructs encompassing multiple CNEs were able to target reporter gene expression to unique domains of hoxb5a and hoxb5b expression. The deletion of J3 from the hoxb5a locus resulted in expression that approached that of hoxb5b, whereas its insertion in the hoxb5b locus increased reporter expression and rendered it more similar to that of hoxb5a. Our results highlight the importance of interactions between CNEs in the execution of complementary subfunctions of duplicated genes. PMID:18832391

  3. Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae)

    PubMed Central

    Baker, Richard H.; Narechania, Apurva; Johns, Philip M.; Wilkinson, Gerald S.

    2012-01-01

    Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict. PMID:22777023

  4. Role of Duplicate Genes in Robustness against Deleterious Human Mutations

    PubMed Central

    Hsiao, Tzu-Lin; Vitkup, Dennis

    2008-01-01

    It is now widely recognized that robustness is an inherent property of biological systems [1],[2],[3]. The contribution of close sequence homologs to genetic robustness against null mutations has been previously demonstrated in simple organisms [4],[5]. In this paper we investigate in detail the contribution of gene duplicates to back-up against deleterious human mutations. Our analysis demonstrates that the functional compensation by close homologs may play an important role in human genetic disease. Genes with a 90% sequence identity homolog are about 3 times less likely to harbor known disease mutations compared to genes with remote homologs. Moreover, close duplicates affect the phenotypic consequences of deleterious mutations by making a decrease in life expectancy significantly less likely. We also demonstrate that similarity of expression profiles across tissues significantly increases the likelihood of functional compensation by homologs. PMID:18369440

  5. Divergence in Enzymatic Activities in the Soybean GST Supergene Family Provides New Insight into the Evolutionary Dynamics of Whole-Genome Duplicates

    PubMed Central

    Liu, Hai-Jing; Tang, Zhen-Xin; Han, Xue-Min; Yang, Zhi-Ling; Zhang, Fu-Min; Yang, Hai-Ling; Liu, Yan-Jing; Zeng, Qing-Yin

    2015-01-01

    Whole-genome duplication (WGD), or polyploidy, is a major force in plant genome evolution. A duplicate of all genes is present in the genome immediately following a WGD event. However, the evolutionary mechanisms responsible for the loss of, or retention and subsequent functional divergence of polyploidy-derived duplicates remain largely unknown. In this study we reconstructed the evolutionary history of the glutathione S-transferase (GST) gene family from the soybean genome, and identified 72 GST duplicated gene pairs formed by a recent Glycine-specific WGD event occurring approximately 13 Ma. We found that 72% of duplicated GST gene pairs experienced gene losses or pseudogenization, whereas 28% of GST gene pairs have been retained in the soybean genome. The GST pseudogenes were under relaxed selective constraints, whereas functional GSTs were subject to strong purifying selection. Plant GST genes play important roles in stress tolerance and detoxification metabolism. By examining the gene expression responses to abiotic stresses and enzymatic properties of the ancestral and current proteins, we found that polyploidy-derived GST duplicates show the divergence in enzymatic activities. Through site-directed mutagenesis of ancestral proteins, this study revealed that nonsynonymous substitutions of key amino acid sites play an important role in the divergence of enzymatic functions of polyploidy-derived GST duplicates. These findings provide new insights into the evolutionary and functional dynamics of polyploidy-derived duplicate genes. PMID:26219583

  6. Duplications of hox gene clusters and the emergence of vertebrates.

    PubMed

    Soshnikova, Natalia; Dewaele, Romain; Janvier, Philippe; Krumlauf, Robb; Duboule, Denis

    2013-06-15

    The vertebrate body plan is characterized by an increased complexity relative to that of all other chordates and large-scale gene amplifications have been associated with key morphological innovations leading to their remarkable evolutionary success. Here, we use compound full Hox clusters deletions to investigate how Hox genes duplications may have contributed to the emergence of vertebrate-specific innovations. We show that the combined deletion of HoxA and HoxB leads to an atavistic heart phenotype, suggesting that the ancestral HoxA/B cluster was co-opted to help in diversifying the complex organ in vertebrates. Other phenotypic effects observed seem to illustrate the resurgence of ancestral (plesiomorphic) features. This indicates that the duplications of Hox clusters were associated with the recruitment or formation of novel cis-regulatory controls, which were key to the evolution of many vertebrate features and hence to the evolutionary radiation of this group. PMID:23501471

  7. Six Subgroups and Extensive Recent Duplications Characterize the Evolution of the Eukaryotic Tubulin Protein Family

    PubMed Central

    Findeisen, Peggy; Mühlhausen, Stefanie; Dempewolf, Silke; Hertzog, Jonny; Zietlow, Alexander; Carlomagno, Teresa; Kollmar, Martin

    2014-01-01

    Tubulins belong to the most abundant proteins in eukaryotes providing the backbone for many cellular substructures like the mitotic and meiotic spindles, the intracellular cytoskeletal network, and the axonemes of cilia and flagella. Homologs have even been reported for archaea and bacteria. However, a taxonomically broad and whole-genome-based analysis of the tubulin protein family has never been performed, and thus, the number of subfamilies, their taxonomic distribution, and the exact grouping of the supposed archaeal and bacterial homologs are unknown. Here, we present the analysis of 3,524 tubulins from 504 species. The tubulins formed six major subfamilies, α to ζ. Species of all major kingdoms of the eukaryotes encode members of these subfamilies implying that they must have already been present in the last common eukaryotic ancestor. The proposed archaeal homologs grouped together with the bacterial TubZ proteins as sister clade to the FtsZ proteins indicating that tubulins are unique to eukaryotes. Most species contained α- and/or β-tubulin gene duplicates resulting from recent branch- and species-specific duplication events. This shows that tubulins cannot be used for constructing species phylogenies without resolving their ortholog–paralog relationships. The many gene duplicates and also the independent loss of the δ-, ε-, or ζ-tubulins, which have been shown to be part of the triplet microtubules in basal bodies, suggest that tubulins can functionally substitute each other. PMID:25169981

  8. Recurrent duplications of the annexin A1 gene (ANXA1) in autism spectrum disorders

    PubMed Central

    2014-01-01

    Background Validating the potential pathogenicity of copy number variants (CNVs) identified in genome-wide studies of autism spectrum disorders (ASD) requires detailed assessment of case/control frequencies, inheritance patterns, clinical correlations, and functional impact. Here, we characterize a small recurrent duplication in the annexin A1 (ANXA1) gene, identified by the Autism Genome Project (AGP) study. Methods From the AGP CNV genomic screen in 2,147 ASD individuals, we selected for characterization an ANXA1 gene duplication that was absent in 4,964 population-based controls. We further screened the duplication in a follow-up sample including 1,496 patients and 410 controls, and evaluated clinical correlations and family segregation. Sequencing of exonic/downstream ANXA1 regions was performed in 490 ASD patients for identification of additional variants. Results The ANXA1 duplication, overlapping the last four exons and 3’UTR region, had an overall prevalence of 11/3,643 (0.30%) in unrelated ASD patients but was not identified in 5,374 controls. Duplication carriers presented no distinctive clinical phenotype. Family analysis showed neuropsychiatric deficits and ASD traits in multiple relatives carrying the duplication, suggestive of a complex genetic inheritance. Sequencing of exonic regions and the 3’UTR identified 11 novel changes, but no obvious variants with clinical significance. Conclusions We provide multilevel evidence for a role of ANXA1 in ASD etiology. Given its important role as mediator of glucocorticoid function in a wide variety of brain processes, including neuroprotection, apoptosis, and control of the neuroendocrine system, the results add ANXA1 to the growing list of rare candidate genetic etiological factors for ASD. PMID:24720851

  9. Evidence for the fixation of gene duplications by positive selection in Drosophila.

    PubMed

    Cardoso-Moreira, Margarida; Arguello, J Roman; Gottipati, Srikanth; Harshman, L G; Grenier, Jennifer K; Clark, Andrew G

    2016-06-01

    Gene duplications play a key role in the emergence of novel traits and in adaptation. But despite their centrality to evolutionary processes, it is still largely unknown how new gene duplicates are initially fixed within populations and later maintained in genomes. Long-standing debates on the evolution of gene duplications could be settled by determining the relative importance of genetic drift vs. positive selection in the fixation of new gene duplicates. Using the Drosophila Global Diversity Lines (GDL), we have combined genome-wide SNP polymorphism data with a novel set of copy number variant calls and gene expression profiles to characterize the polymorphic phase of new genes. We found that approximately half of the roughly 500 new complete gene duplications segregating in the GDL lead to significant increases in the expression levels of the duplicated genes and that these duplications are more likely to be found at lower frequencies, suggesting a negative impact on fitness. However, we also found that six of the nine gene duplications that are fixed or close to fixation in at least one of the five populations in our study show signs of being under positive selection, and that these duplications are likely beneficial because of dosage effects, with a possible role for additional mutations in two duplications. Our work suggests that in Drosophila, theoretical models that posit that gene duplications are immediately beneficial and fixed by positive selection are most relevant to explain the long-term evolution of gene duplications in this species. PMID:27197209

  10. Hox gene duplications correlate with posterior heteronomy in scorpions

    PubMed Central

    Sharma, Prashant P.; Schwager, Evelyn E.; Extavour, Cassandra G.; Wheeler, Ward C.

    2014-01-01

    The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions. PMID:25122224

  11. The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups

    PubMed Central

    2013-01-01

    Background WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. Results We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. Conclusions In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have

  12. Identification of duplication downstream of BMP2 in a Chinese family with brachydactyly type A2 (BDA2).

    PubMed

    Liu, Xudong; Gao, Linghan; Zhao, Aman; Zhang, Rui; Ji, Baohu; Wang, Lei; Zheng, Yonglan; Zeng, Bingfang; Valenzuela, Robert K; He, Lin; Ma, Jie

    2014-01-01

    Brachydactyly type A2 (BDA2, MIM 112600) is characterized by the deviation and shortening of the middle phalange of the index finger and the second toe. Using genome-wide linkage analysis in a Chinese BDA2 family, we mapped the maximum candidate interval of BDA2 to a ∼1.5 Mb region between D20S194 and D20S115 within chromosome 20p12.3 and found that the pairwise logarithm of the odds score was highest for marker D20S156 (Zmax = 6.09 at θ = 0). Based on functional and positional perspectives, the bone morphogenetic protein 2 (BMP2) gene was identified as the causal gene for BDA2 in this region, even though no point mutation was detected in BMP2. Through further investigation, we identified a 4,671 bp (Chr20: 6,809,218-6,813,888) genomic duplication downstream of BMP2. This duplication was located within the linked region, co-segregated with the BDA2 phenotype in this family, and was not found in the unaffected family members and the unrelated control individuals. Compared with the previously reported duplications, the duplication in this family has a different breakpoint flanked by the microhomologous sequence GATCA and a slightly different length. Some other microhomologous nucleotides were also found in the duplicated region. In summary, our findings support the conclusions that BMP2 is the causing gene for BDA2, that the genomic location corresponding to the duplication region is prone to structural changes associated with malformation of the digits, and that this tendency is probably caused by the abundance of microhomologous sequences in the region. PMID:24710560

  13. Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate

    PubMed Central

    Adomako-Ankomah, Yaw; English, Elizabeth D.; Danielson, Jeffrey J.; Pernas, Lena F.; Parker, Michelle L.; Boulanger, Martin J.; Dubey, Jitender P.; Boyle, Jon P.

    2016-01-01

    In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum. Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA+ paralogs. Additionally, we found that exogenous expression of an HMA+ paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. PMID:26920761

  14. Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate.

    PubMed

    Adomako-Ankomah, Yaw; English, Elizabeth D; Danielson, Jeffrey J; Pernas, Lena F; Parker, Michelle L; Boulanger, Martin J; Dubey, Jitender P; Boyle, Jon P

    2016-05-01

    In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA(+) paralogs. Additionally, we found that exogenous expression of an HMA(+) paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. PMID:26920761

  15. Molecular evolution accompanying functional divergence of duplicated genes along the plant starch biosynthesis pathway

    PubMed Central

    2014-01-01

    Background Starch is the main source of carbon storage in the Archaeplastida. The starch biosynthesis pathway (sbp) emerged from cytosolic glycogen metabolism shortly after plastid endosymbiosis and was redirected to the plastid stroma during the green lineage divergence. The SBP is a complex network of genes, most of which are members of large multigene families. While some gene duplications occurred in the Archaeplastida ancestor, most were generated during the sbp redirection process, and the remaining few paralogs were generated through compartmentalization or tissue specialization during the evolution of the land plants. In the present study, we tested models of duplicated gene evolution in order to understand the evolutionary forces that have led to the development of SBP in angiosperms. We combined phylogenetic analyses and tests on the rates of evolution along branches emerging from major duplication events in six gene families encoding sbp enzymes. Results We found evidence of positive selection along branches following cytosolic or plastidial specialization in two starch phosphorylases and identified numerous residues that exhibited changes in volume, polarity or charge. Starch synthases, branching and debranching enzymes functional specializations were also accompanied by accelerated evolution. However, none of the sites targeted by selection corresponded to known functional domains, catalytic or regulatory. Interestingly, among the 13 duplications tested, 7 exhibited evidence of positive selection in both branches emerging from the duplication, 2 in only one branch, and 4 in none of the branches. Conclusions The majority of duplications were followed by accelerated evolution targeting specific residues along both branches. This pattern was consistent with the optimization of the two sub-functions originally fulfilled by the ancestral gene before duplication. Our results thereby provide strong support to the so-called “Escape from Adaptive Conflict

  16. Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes.

    PubMed

    Ye, Fei; Lan, Xu-E; Zhu, Wen-Bo; You, Ping

    2016-01-01

    Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299

  17. Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes

    PubMed Central

    Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping

    2016-01-01

    Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299

  18. Inferring gene duplications, transfers and losses can be done in a discrete framework.

    PubMed

    Ranwez, Vincent; Scornavacca, Celine; Doyon, Jean-Philippe; Berry, Vincent

    2016-06-01

    In the field of phylogenetics, the evolutionary history of a set of organisms is commonly depicted by a species tree-whose internal nodes represent speciation events-while the evolutionary history of a gene family is depicted by a gene tree-whose internal nodes can also represent macro-evolutionary events such as gene duplications and transfers. As speciation events are only part of the events shaping a gene history, the topology of a gene tree can show incongruences with that of the corresponding species tree. These incongruences can be used to infer the macro-evolutionary events undergone by the gene family. This is done by embedding the gene tree inside the species tree and hence providing a reconciliation of those trees. In the past decade, several parsimony-based methods have been developed to infer such reconciliations, accounting for gene duplications ([Formula: see text]), transfers ([Formula: see text]) and losses ([Formula: see text]). The main contribution of this paper is to formally prove an important assumption implicitly made by previous works on these reconciliations, namely that solving the (maximum) parsimony [Formula: see text] reconciliation problem in the discrete framework is equivalent to finding a most parsimonious [Formula: see text] scenario in the continuous framework. In the process, we also prove several intermediate results that are useful on their own and constitute a theoretical toolbox that will likely facilitate future theoretical contributions in the field. PMID:26337177

  19. Gene duplications in prokaryotes can be associated with environmental adaptation

    PubMed Central

    2010-01-01

    Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism

  20. Evolutionary Diversification of Plant Shikimate Kinase Gene Duplicates

    PubMed Central

    Fucile, Geoffrey; Falconer, Shannon; Christendat, Dinesh

    2008-01-01

    Shikimate kinase (SK; EC 2.7.1.71) catalyzes the fifth reaction of the shikimate pathway, which directs carbon from the central metabolism pool to a broad range of secondary metabolites involved in plant development, growth, and stress responses. In this study, we demonstrate the role of plant SK gene duplicate evolution in the diversification of metabolic regulation and the acquisition of novel and physiologically essential function. Phylogenetic analysis of plant SK homologs resolves an orthologous cluster of plant SKs and two functionally distinct orthologous clusters. These previously undescribed genes, shikimate kinase-like 1 (SKL1) and -2 (SKL2), do not encode SK activity, are present in all major plant lineages, and apparently evolved under positive selection following SK gene duplication over 400 MYA. This is supported by functional assays using recombinant SK, SKL1, and SKL2 from Arabidopsis thaliana (At) and evolutionary analyses of the diversification of SK-catalytic and -substrate binding sites based on theoretical structure models. AtSKL1 mutants yield albino and novel variegated phenotypes, which indicate SKL1 is required for chloroplast biogenesis. Extant SKL2 sequences show a strong genetic signature of positive selection, which is enriched in a protein–protein interaction module not found in other SK homologs. We also report the first kinetic characterization of plant SKs and show that gene expression diversification among the AtSK inparalogs is correlated with developmental processes and stress responses. This study examines the functional diversification of ancient and recent plant SK gene duplicates and highlights the utility of SKs as scaffolds for functional innovation. PMID:19057671

  1. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes.

    PubMed

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C; Fan, Chuanzhu

    2016-09-01

    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. PMID:27485883

  2. A Limited Role for Gene Duplications in the Evolution of Platypus Venom

    PubMed Central

    Wong, Emily S. W.; Papenfuss, Anthony T.; Whittington, Camilla M.; Warren, Wesley C.; Belov, Katherine

    2012-01-01

    Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the “venome” of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation. PMID:21816864

  3. Patterns of Gene Conversion in Duplicated Yeast Histones Suggest Strong Selection on a Coadapted Macromolecular Complex

    PubMed Central

    Scienski, Kathy; Fay, Justin C.; Conant, Gavin C.

    2015-01-01

    We find evidence for interlocus gene conversion in five duplicated histone genes from six yeast species. The sequences of these duplicated genes, surviving from the ancient genome duplication, show phylogenetic patterns inconsistent with the well-resolved orthology relationships inferred from a likelihood model of gene loss after the genome duplication. Instead, these paralogous genes are more closely related to each other than any is to its nearest ortholog. In addition to simulations supporting gene conversion, we also present evidence for elevated rates of radical amino acid substitutions along the branches implicated in the conversion events. As these patterns are similar to those seen in ribosomal proteins that have undergone gene conversion, we speculate that in cases where duplicated genes code for proteins that are a part of tightly interacting complexes, selection may favor the fixation of gene conversion events in order to maintain high protein identities between duplicated copies. PMID:26560339

  4. Familial partial duplication (1)(p21p31)

    SciTech Connect

    Hoechstetter, L.; Soukup, S.; Schorry, E.K.

    1995-11-20

    A partial duplication (1)(p21p31), resulting from a maternal direct insertion (13,1) (q22p21p31), was found in a 30-year-old woman with mental retardation, cleft palate, and multiple minor anomalies. Two other affected and deceased relatives were presumed to have the same chromosome imbalance. Duplication 1p cases are reviewed. 8 refs., 5 figs., 1 tab.

  5. Simulating evolution of protein complexes through gene duplication and co-option.

    PubMed

    Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter

    2016-06-21

    We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. PMID:27038665

  6. Expression Divergence of Duplicate Genes in the Protein Kinase Superfamily in Pacific Oyster.

    PubMed

    Gao, Dahai; Ko, Dennis C; Tian, Xinmin; Yang, Guang; Wang, Liuyang

    2015-01-01

    Gene duplication has been proposed to serve as the engine of evolutionary innovation. It is well recognized that eukaryotic genomes contain a large number of duplicated genes that evolve new functions or expression patterns. However, in mollusks, the evolutionary mechanisms underlying the divergence and the functional maintenance of duplicate genes remain little understood. In the present study, we performed a comprehensive analysis of duplicate genes in the protein kinase superfamily using whole genome and transcriptome data for the Pacific oyster. A total of 64 duplicated gene pairs were identified based on a phylogenetic approach and the reciprocal best BLAST method. By analyzing gene expression from RNA-seq data from 69 different developmental and stimuli-induced conditions (nine tissues, 38 developmental stages, eight dry treatments, seven heat treatments, and seven salty treatments), we found that expression patterns were significantly correlated for a number of duplicate gene pairs, suggesting the conservation of regulatory mechanisms following divergence. Our analysis also identified a subset of duplicate gene pairs with very high expression divergence, indicating that these gene pairs may have been subjected to transcriptional subfunctionalization or neofunctionalization after the initial duplication events. Further analysis revealed a significant correlation between expression and sequence divergence (as revealed by synonymous or nonsynonymous substitution rates) under certain conditions. Taken together, these results provide evidence for duplicate gene sequence and expression divergence in the Pacific oyster, accompanying its adaptation to harsh environments. Our results provide new insights into the evolution of duplicate genes and their expression levels in the Pacific oyster. PMID:26417197

  7. Gene organization and transcription of duplicated MBP genes of myelin deficient (shi(mld)) mutant mouse.

    PubMed Central

    Okano, H; Tamura, T; Miura, M; Aoyama, A; Ikenaka, K; Oshimura, M; Mikoshiba, K

    1988-01-01

    A hereditary dysmyelinating mutation, named myelin deficient (shi(mld)), is characterized by reduced expression of myelin basic protein (MBP). In shi(mld), the MBP gene is duplicated and its reduced expression is mainly determined by the level of mRNA. We have characterized the structure and function of the promoter regions of the duplicated MBP genes in shi(mld). Among the lambda clones containing promoter regions of the duplicated MBP genes in shi(mld), one (gene 1) had the same restriction enzyme pattern as that in control mice, but another (gene 2) had a rearrangement on a distal part of the promoter. A 712-bp nucleotide sequence upstream of the first exons of both of the duplicated MBP genes of shi(mld) was completely consistent with that of the control. Promoter activities of 1.3-kb 5'-flanking regions from respective genes of shi(mld) measured by in vitro run-off assay using HeLa whole-cell extracts were indistinguishable from that of the control MPB gene. Chromosomal mapping by in situ hybridization suggested that the duplicated MBP genes were located closely to each other at the distal part of chromosome 18. A recombinational event including the inversion seemed to have occurred within gene 1 and its possible relationship to the reduced expression of MBP is discussed. Images PMID:2452084

  8. Coregulation of tandem duplicate genes slows evolution of subfunctionalization in mammals.

    PubMed

    Lan, Xun; Pritchard, Jonathan K

    2016-05-20

    Gene duplication is a fundamental process in genome evolution. However, most young duplicates are degraded by loss-of-function mutations, and the factors that allow some duplicate pairs to survive long-term remain controversial. One class of models to explain duplicate retention invokes sub- or neofunctionalization, whereas others focus on sharing of gene dosage. RNA-sequencing data from 46 human and 26 mouse tissues indicate that subfunctionalization of expression evolves slowly and is rare among duplicates that arose within the placental mammals, possibly because tandem duplicates are coregulated by shared genomic elements. Instead, consistent with the dosage-sharing hypothesis, most young duplicates are down-regulated to match expression levels of single-copy genes. Thus, dosage sharing of expression allows for the initial survival of mammalian duplicates, followed by slower functional adaptation enabling long-term preservation. PMID:27199432

  9. Clinical and genetic study of a family with a paternally inherited 15q11-q13 duplication.

    PubMed

    Marini, Carla; Cecconi, Antonella; Contini, Elisa; Pantaleo, Marilena; Metitieri, Tiziana; Guarducci, Silvia; Giglio, Sabrina; Guerrini, Renzo; Genuardi, Maurizio

    2013-06-01

    Interstitial chromosome 15q11-q13 duplications are associated with developmental delay, behavioral problems and additional manifestations, including epilepsy. In most affected individuals the duplicated chromosome is maternally derived, whereas paternal inheritance is more often associated with a normal phenotype. Seizures have not been described in patients with paternal dup 15q11-q13. We describe a family with five individuals in three generations with a paternally-inherited 15q11-q13 duplication, four of whom exhibited abnormal phenotypic characteristics, including seizures. The 18-year-old female proband presented with moderate intellectual disability, obesity, and epilepsy. Her brother manifested learning disability and behavioral problems. They both inherited the 15q11-q13 dup from their father who had a normal phenotype. Their paternal uncle and grandfather also had the duplication and were reported to have had seizures. Array-CGH and MLPA analyses showed that the duplication included the TUBGCP5, CYFIP1, MKRN3, MAGEL2, NDN, SNRPN, UBE3A, ATP10A, GABRB3, GABRA5, GABRG3, and OCA2 genes. This report provides evidence for intrafamilial phenotypic variability of paternal dup 15q11-q13, ranging from normal to intellectual disability and seizures, and potentially expanding the phenotype of paternal 15q11-q13 interstitial duplications. PMID:23633446

  10. Gene Duplication, Gene Conversion and the Evolution of the Y Chromosome

    PubMed Central

    Connallon, Tim; Clark, Andrew G.

    2010-01-01

    Nonrecombining chromosomes, such as the Y, are expected to degenerate over time due to reduced efficacy of natural selection compared to chromosomes that recombine. However, gene duplication, coupled with gene conversion between duplicate pairs, can potentially counteract forces of evolutionary decay that accompany asexual reproduction. Using a combination of analytical and computer simulation methods, we explicitly show that, although gene conversion has little impact on the probability that duplicates become fixed within a population, conversion can be effective at maintaining the functionality of Y-linked duplicates that have already become fixed. The coupling of Y-linked gene duplication and gene conversion between paralogs can also prove costly by increasing the rate of nonhomologous crossovers between duplicate pairs. Such crossovers can generate an abnormal Y chromosome, as was recently shown to reduce male fertility in humans. The results represent a step toward explaining some of the more peculiar attributes of the human Y as well as preliminary Y-linked sequence data from other mammals and Drosophila. The results may also be applicable to the recently observed pattern of tetraploidy and gene conversion in asexual, bdelloid rotifers. PMID:20551442

  11. A family with an inverted tandem duplication 5q22.1q23.2.

    PubMed

    Schmidt, T; Bartels, I; Liehr, T; Burfeind, P; Zoll, B; Shoukier, M

    2013-01-01

    Here, we report a 3-year-old boy with short stature, developmental delay and mild facial dysmorphic signs. Karyotype analysis and array-CGH revealed a pure duplication 5q22.1q23.2 with a length of 14.25 Mb. As demonstrated by multicolor-fluorescence in situ hybridization, the duplicated segment was orientated in an inverted tandem manner. One of the 2 older half-brothers of the index patient was intellectually disabled and showed short stature as well. The mother of the siblings was only 149 cm in height. The affected half-brother as well as the mother of the siblings were tested positive for the same duplication. Duplications of the long arm of chromosome 5 are rare. There are 16 reported cases of different 5q segments with a pure duplication and no additional chromosomal imbalance. In order to refine the 5q-duplication phenotype, reported cases were recently classified in 3 groups on the basis of clinical findings and the involved chromosome segments. However, our case does not fit in any of these groups but is placed in the interjacent chromosomal area between 2 of these groups. Overall, this is the second reported family with a duplication of 5q22.1q23.2 and both families share phenotypic features like short stature, facial dysmorphic signs and speech delay. The reported family provides further information for delineating phenotype-genotype correlations of pure duplications of the 5q region. PMID:23051634

  12. Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

    PubMed Central

    Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

    2014-01-01

    It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342

  13. Evolution by gene duplication of Medicago truncatula PISTILLATA-like transcription factors.

    PubMed

    Roque, Edelín; Fares, Mario A; Yenush, Lynne; Rochina, Mari Cruz; Wen, Jiangqi; Mysore, Kirankumar S; Gómez-Mena, Concepción; Beltrán, José Pío; Cañas, Luis A

    2016-04-01

    PISTILLATA (PI) is a member of the B-function MADS-box gene family, which controls the identity of both petals and stamens in Arabidopsis thaliana. In Medicago truncatula (Mt), there are two PI-like paralogs, known as MtPI and MtNGL9. These genes differ in their expression patterns, but it is not known whether their functions have also diverged. Describing the evolution of certain duplicated genes, such as transcription factors, remains a challenge owing to the complex expression patterns and functional divergence between the gene copies. Here, we report a number of functional studies, including analyses of gene expression, protein-protein interactions, and reverse genetic approaches designed to demonstrate the respective contributions of each M. truncatula PI-like paralog to the B-function in this species. Also, we have integrated molecular evolution approaches to determine the mode of evolution of Mt PI-like genes after duplication. Our results demonstrate that MtPI functions as a master regulator of B-function in M. truncatula, maintaining the overall ancestral function, while MtNGL9 does not seem to have a role in this regard, suggesting that the pseudogenization could be the functional evolutionary fate for this gene. However, we provide evidence that purifying selection is the primary evolutionary force acting on this paralog, pinpointing the conservation of its biochemical function and, alternatively, the acquisition of a new role for this gene. PMID:26773809

  14. Evolution by gene duplication of Medicago truncatula PISTILLATA-like transcription factors

    PubMed Central

    Roque, Edelín; Fares, Mario A.; Yenush, Lynne; Rochina, Mari Cruz; Wen, Jiangqi; Mysore, Kirankumar S.; Gómez-Mena, Concepción; Beltrán, José Pío; Cañas, Luis A.

    2016-01-01

    PISTILLATA (PI) is a member of the B-function MADS-box gene family, which controls the identity of both petals and stamens in Arabidopsis thaliana. In Medicago truncatula (Mt), there are two PI-like paralogs, known as MtPI and MtNGL9. These genes differ in their expression patterns, but it is not known whether their functions have also diverged. Describing the evolution of certain duplicated genes, such as transcription factors, remains a challenge owing to the complex expression patterns and functional divergence between the gene copies. Here, we report a number of functional studies, including analyses of gene expression, protein–protein interactions, and reverse genetic approaches designed to demonstrate the respective contributions of each M. truncatula PI-like paralog to the B-function in this species. Also, we have integrated molecular evolution approaches to determine the mode of evolution of Mt PI-like genes after duplication. Our results demonstrate that MtPI functions as a master regulator of B-function in M. truncatula, maintaining the overall ancestral function, while MtNGL9 does not seem to have a role in this regard, suggesting that the pseudogenization could be the functional evolutionary fate for this gene. However, we provide evidence that purifying selection is the primary evolutionary force acting on this paralog, pinpointing the conservation of its biochemical function and, alternatively, the acquisition of a new role for this gene. PMID:26773809

  15. Duplication and Diversification of Dipteran Argonaute Genes, and the Evolutionary Divergence of Piwi and Aubergine

    PubMed Central

    Lewis, Samuel H.; Salmela, Heli; Obbard, Darren J.

    2016-01-01

    Genetic studies of Drosophila melanogaster have provided a paradigm for RNA interference (RNAi) in arthropods, in which the microRNA and antiviral pathways are each mediated by a single Argonaute (Ago1 and Ago2) and germline suppression of transposable elements is mediated by a trio of Piwi-subfamily Argonaute proteins (Ago3, Aub, and Piwi). Without a suitable evolutionary context, deviations from this can be interpreted as derived or idiosyncratic. Here we analyze the evolution of Argonaute genes across the genomes and transcriptomes of 86 Dipteran species, showing that variation in copy number can occur rapidly, and that there is constant flux in some RNAi mechanisms. The lability of the RNAi pathways is illustrated by the divergence of Aub and Piwi (182–156 Ma), independent origins of multiple Piwi-family genes in Aedes mosquitoes (less than 25Ma), and the recent duplications of Ago2 and Ago3 in the tsetse fly Glossina morsitans. In each case the tissue specificity of these genes has altered, suggesting functional divergence or innovation, and consistent with the action of dynamic selection pressures across the Argonaute gene family. We find there are large differences in evolutionary rates and gene turnover between pathways, and that paralogs of Ago2, Ago3, and Piwi/Aub show contrasting rates of evolution after duplication. This suggests that Argonautes undergo frequent evolutionary expansions that facilitate functional divergence. PMID:26868596

  16. Did homeobox gene duplications contribute to the Cambrian explosion?

    PubMed

    Holland, Peter W H

    2015-01-01

    The Cambrian explosion describes an apparently rapid increase in the diversity of bilaterian animals around 540-515 million years ago. Bilaterian animals explore the world in three-dimensions deploying forward-facing sense organs, a brain, and an anterior mouth; they possess muscle blocks enabling efficient crawling and burrowing in sediments, and they typically have an efficient 'through-gut' with separate mouth and anus to process bulk food and eject waste, even when burrowing in sediment. A variety of ecological, environmental, genetic, and developmental factors have been proposed as possible triggers and correlates of the Cambrian explosion, and it is likely that a combination of factors were involved. Here, I focus on a set of developmental genetic changes and propose these are part of the mix of permissive factors. I describe how ANTP-class homeobox genes, which encode transcription factors involved in body patterning, increased in number in the bilaterian stem lineage and earlier. These gene duplications generated a large array of ANTP class genes, including three distinct gene clusters called NK, Hox, and ParaHox. Comparative data supports the idea that NK genes were deployed primarily to pattern the bilaterian mesoderm, Hox genes coded position along the central nervous system, and ParaHox genes most likely originally specified the mouth, midgut, and anus of the newly evolved through-gut. It is proposed that diversification of ANTP class genes played a role in the Cambrian explosion by contributing to the patterning systems used to build animal bodies capable of high-energy directed locomotion, including active burrowing. PMID:26605046

  17. Soybean Cyst Nematode Resistance Emerged via Artificial Selection of Duplicated Serine Hydroxymethyltransferase Genes.

    PubMed

    Wu, Xiao-Yi; Zhou, Guang-Can; Chen, Yun-Xia; Wu, Ping; Liu, Li-Wei; Ma, Fang-Fang; Wu, Mian; Liu, Cheng-Chen; Zeng, Ying-Jie; Chu, Alexander E; Hang, Yue-Yu; Chen, Jian-Qun; Wang, Bin

    2016-01-01

    A major soybean (Forrest cultivar) quantitative trait locus (QTL) gene, Rhg4, which controls resistance to soybean cyst nematodes (SCN), encodes the enzyme serine hydroxylmethyltransferase (SHMT). The resistant allele possesses two critical missense mutations (P130R and N358Y) compared to that of the sensitive allele, rhg4. To understand the evolutionary history of this gene, sequences of 117 SHMT family members from 18 representative plant species were used to reconstruct their phylogeny. According to this phylogeny, the plant SHMT gene family can be divided into two groups and four subgroups (Ia, Ib, IIa, and IIb). Belonging to the Subgroup Ia lineage, the rhg4 gene evolved from a recent duplication event in Glycine sp.. To further explore how the SCN-resistant allele emerged, both the rhg4 gene and its closest homolog, the rhg4h gene, were isolated from 33 cultivated and 68 wild soybean varieties. The results suggested that after gene duplication, the soybean rhg4 gene accumulated a higher number of non-synonymous mutations than rhg4h. Although a higher number of segregating sites and gene haplotypes were detected in wild soybeans than in cultivars, the SCN-resistant Rhg4 allele (represented by haplotype 4) was not found in wild varieties. Instead, a very similar allele, haplotype 3, was observed in wild soybeans at a frequency of 7.4%, although it lacked the two critical non-synonymous substitutions. Taken together, these findings support that the SCN-resistant Rhg4 allele likely emerged via artificial selection during the soybean domestication process, based on a SCN-sensitive allele inherited from wild soybeans. PMID:27458476

  18. Soybean Cyst Nematode Resistance Emerged via Artificial Selection of Duplicated Serine Hydroxymethyltransferase Genes

    PubMed Central

    Wu, Xiao-Yi; Zhou, Guang-Can; Chen, Yun-Xia; Wu, Ping; Liu, Li-Wei; Ma, Fang-Fang; Wu, Mian; Liu, Cheng-Chen; Zeng, Ying-Jie; Chu, Alexander E.; Hang, Yue-Yu; Chen, Jian-Qun; Wang, Bin

    2016-01-01

    A major soybean (Forrest cultivar) quantitative trait locus (QTL) gene, Rhg4, which controls resistance to soybean cyst nematodes (SCN), encodes the enzyme serine hydroxylmethyltransferase (SHMT). The resistant allele possesses two critical missense mutations (P130R and N358Y) compared to that of the sensitive allele, rhg4. To understand the evolutionary history of this gene, sequences of 117 SHMT family members from 18 representative plant species were used to reconstruct their phylogeny. According to this phylogeny, the plant SHMT gene family can be divided into two groups and four subgroups (Ia, Ib, IIa, and IIb). Belonging to the Subgroup Ia lineage, the rhg4 gene evolved from a recent duplication event in Glycine sp.. To further explore how the SCN-resistant allele emerged, both the rhg4 gene and its closest homolog, the rhg4h gene, were isolated from 33 cultivated and 68 wild soybean varieties. The results suggested that after gene duplication, the soybean rhg4 gene accumulated a higher number of non-synonymous mutations than rhg4h. Although a higher number of segregating sites and gene haplotypes were detected in wild soybeans than in cultivars, the SCN-resistant Rhg4 allele (represented by haplotype 4) was not found in wild varieties. Instead, a very similar allele, haplotype 3, was observed in wild soybeans at a frequency of 7.4%, although it lacked the two critical non-synonymous substitutions. Taken together, these findings support that the SCN-resistant Rhg4 allele likely emerged via artificial selection during the soybean domestication process, based on a SCN-sensitive allele inherited from wild soybeans. PMID:27458476

  19. Structure and origin of a tandem duplication of a Drosophila metallothionein gene

    SciTech Connect

    Otto, E.; Maroni, G.

    1987-01-01

    A strain of cadmium-resistant Drosophila was isolated that contained a chromosomal duplication of the metallothionein gene, Mtn. This duplication was a direct, tandem repeat of 2.2 kilobases of DNA: 228 bases of 5' flanking DNA, the entire transcription unit, and 1.4 kilobases of 3' flanking DNA. The entire duplication was cloned and DNA sequences of the regions relevant to the duplication process were determined. Comparison of the sequences of the 5' and 3' boundaries revealed no extensive regions of similarity, thus indicating that this duplication was formed by nonhomologous breakage and reunion. Recently, results of similar analyses by other investigators have suggested that this process was involved in the origin of three other eukaryotic duplications. The authors have observed a chi-like sequence near one of the boundaries of each duplication, and therefore suggest that this sequence may be important in generating one of the breaks required for duplication formation.

  20. Identification and functional analysis of essential, conserved, housekeeping and duplicated genes.

    PubMed

    Arun, P V Parvati Sai; Miryala, Sravan Kumar; Chattopadhyay, Subhayan; Thiyyagura, Kranthi; Bawa, Payal; Bhattacharjee, Madhuchhanda; Yellaboina, Sailu

    2016-05-01

    Gene conservation, duplication and constitutive expression are intricately linked and strong predictors of essentiality. Here, we introduce metrics based on diversity indices to measure gene conservation, duplication and constitutive expression and validate them by measuring their performance in prediction of essential genes. Conservation and duplication were measured using the diversity indices on the bit score profile of Escherichia coli K12 orthologues, across the genomes, and paralogues, within the genome respectively. Constitutive expression was measured using expression diversity of E. coli K12 genes across different conditions. In addition, we developed a systematic method for enrichment analysis of gene-sets in a given ranked list of genes. The method was used to identify genome-wide functions of essential, conserved, constitutively expressed and duplicated genes. Furthermore, we also ranked various operons, complexes and pathways according to their essentiality, conservation, constitutive expression and duplication. PMID:27129600

  1. Adaptive Evolution of Genes Duplicated from the Drosophila pseudoobscura neo-X Chromosome

    PubMed Central

    Meisel, Richard P.; Hilldorfer, Benedict B.; Koch, Jessica L.; Lockton, Steven; Schaeffer, Stephen W.

    2010-01-01

    Drosophila X chromosomes are disproportionate sources of duplicated genes, and these duplications are usually the result of retrotransposition of X-linked genes to the autosomes. The excess duplication is thought to be driven by natural selection for two reasons: X chromosomes are inactivated during spermatogenesis, and the derived copies of retroposed duplications tend to be testis expressed. Therefore, autosomal derived copies of retroposed genes provide a mechanism for their X-linked paralogs to “escape” X inactivation. Once these duplications have fixed, they may then be selected for male-specific functions. Throughout the evolution of the Drosophila genus, autosomes have fused with X chromosomes along multiple lineages giving rise to neo-X chromosomes. There has also been excess duplication from the two independent neo-X chromosomes that have been examined—one that occurred prior to the common ancestor of the willistoni species group and another that occurred along the lineage leading to Drosophila pseudoobscura. To determine what role natural selection plays in the evolution of genes duplicated from the D. pseudoobscura neo-X chromosome, we analyzed DNA sequence divergence between paralogs, polymorphism within each copy, and the expression profiles of these duplicated genes. We found that the derived copies of all duplicated genes have elevated nonsynonymous polymorphism, suggesting that they are under relaxed selective constraints. The derived copies also tend to have testis- or male-biased expression profiles regardless of their chromosome of origin. Genes duplicated from the neo-X chromosome appear to be under less constraints than those duplicated from other chromosome arms. We also find more evidence for historical adaptive evolution in genes duplicated from the neo-X chromosome, suggesting that they are under a unique selection regime in which elevated nonsynonymous polymorphism provides a large reservoir of functional variants, some of which are

  2. Evolution of the Vertebrate Resistin Gene Family

    PubMed Central

    Hu, Qingda; Tan, Huanran; Irwin, David M.

    2015-01-01

    Resistin (encoded by Retn) was previously identified in rodents as a hormone associated with diabetes; however human resistin is instead linked to inflammation. Resistin is a member of a small gene family that includes the resistin-like peptides (encoded by Retnl genes) in mammals. Genomic searches of available genome sequences of diverse vertebrates and phylogenetic analyses were conducted to determine the size and origin of the resistin-like gene family. Genes encoding peptides similar to resistin were found in Mammalia, Sauria, Amphibia, and Actinistia (coelacanth, a lobe-finned fish), but not in Aves or fish from Actinopterygii, Chondrichthyes, or Agnatha. Retnl originated by duplication and transposition from Retn on the early mammalian lineage after divergence of the platypus, but before the placental and marsupial mammal divergence. The resistin-like gene family illustrates an instance where the locus of origin of duplicated genes can be identified, with Retn continuing to reside at this location. Mammalian species typically have a single copy Retn gene, but are much more variable in their numbers of Retnl genes, ranging from 0 to 9. Since Retn is located at the locus of origin, thus likely retained the ancestral expression pattern, largely maintained its copy number, and did not display accelerated evolution, we suggest that it is more likely to have maintained an ancestral function, while Retnl, which transposed to a new location, displays accelerated evolution, and shows greater variability in gene number, including gene loss, likely evolved new, but potentially lineage-specific, functions. PMID:26076481

  3. Evolution of the Vertebrate Resistin Gene Family.

    PubMed

    Hu, Qingda; Tan, Huanran; Irwin, David M

    2015-01-01

    Resistin (encoded by Retn) was previously identified in rodents as a hormone associated with diabetes; however human resistin is instead linked to inflammation. Resistin is a member of a small gene family that includes the resistin-like peptides (encoded by Retnl genes) in mammals. Genomic searches of available genome sequences of diverse vertebrates and phylogenetic analyses were conducted to determine the size and origin of the resistin-like gene family. Genes encoding peptides similar to resistin were found in Mammalia, Sauria, Amphibia, and Actinistia (coelacanth, a lobe-finned fish), but not in Aves or fish from Actinopterygii, Chondrichthyes, or Agnatha. Retnl originated by duplication and transposition from Retn on the early mammalian lineage after divergence of the platypus, but before the placental and marsupial mammal divergence. The resistin-like gene family illustrates an instance where the locus of origin of duplicated genes can be identified, with Retn continuing to reside at this location. Mammalian species typically have a single copy Retn gene, but are much more variable in their numbers of Retnl genes, ranging from 0 to 9. Since Retn is located at the locus of origin, thus likely retained the ancestral expression pattern, largely maintained its copy number, and did not display accelerated evolution, we suggest that it is more likely to have maintained an ancestral function, while Retnl, which transposed to a new location, displays accelerated evolution, and shows greater variability in gene number, including gene loss, likely evolved new, but potentially lineage-specific, functions. PMID:26076481

  4. Gene Duplicability-Connectivity-Complexity across Organisms and a Neutral Evolutionary Explanation

    PubMed Central

    Zhu, Yun; Du, Peng; Nakhleh, Luay

    2012-01-01

    Gene duplication has long been acknowledged by biologists as a major evolutionary force shaping genomic architectures and characteristics across the Tree of Life. Major research has been conducting on elucidating the fate of duplicated genes in a variety of organisms, as well as factors that affect a gene’s duplicability–that is, the tendency of certain genes to retain more duplicates than others. In particular, two studies have looked at the correlation between gene duplicability and its degree in a protein-protein interaction network in yeast, mouse, and human, and another has looked at the correlation between gene duplicability and its complexity (length, number of domains, etc.) in yeast. In this paper, we extend these studies to six species, and two trends emerge. There is an increase in the duplicability-connectivity correlation that agrees with the increase in the genome size as well as the phylogenetic relationship of the species. Further, the duplicability-complexity correlation seems to be constant across the species. We argue that the observed correlations can be explained by neutral evolutionary forces acting on the genomic regions containing the genes. For the duplicability-connectivity correlation, we show through simulations that an increasing trend can be obtained by adjusting parameters to approximate genomic characteristics of the respective species. Our results call for more research into factors, adaptive and non-adaptive alike, that determine a gene’s duplicability. PMID:22984517

  5. Neofunctionalization of a Duplicate dachshund Gene Underlies the Evolution of a Novel Leg Segment in Arachnids.

    PubMed

    Turetzek, Natascha; Pechmann, Matthias; Schomburg, Christoph; Schneider, Julia; Prpic, Nikola-Michael

    2016-01-01

    The acquisition of a novel function, or neofunctionalization, protects duplicated genes from redundancy and subsequent loss, and is a major force that drives adaptive evolution. Neofunctionalization has been inferred for many duplicated genes based on differences in regulation between the parental gene and its duplicate. However, only few studies actually link the new function of a duplicated gene to a novel morphological or physiological character of the organism. Here we show that the duplication of dachshund (dac) in arachnids (spiders and allies) is linked with the evolution of a novel leg segment, the patella. We have studied dac genes in two distantly related spider species, the entelegyne spider Parasteatoda tepidariorum and the haplogyne spider Pholcus phalangioides. Both species possess two paralogous dac genes that duplicated before the split between entelegyne and haplogyne spiders. In contrast to the evolutionarily highly conserved dac1, its duplicate dac2 is strongly expressed in the patella leg segment during embryogenesis in both species. Using parental RNA interference in P. tepidariorum we show that dac2 is required for the development of the patella segment. If dac2 function is impaired, then the patella is fused with the tibia into a single leg segment. Thus, removing the function of dac2 experimentally reverts P. tepidariorum leg morphology into a stage before the duplication of dac and the evolution of the patella segment. Our results indicate that the origin of the patella is the result of the duplication and subsequent neofunctionalization of dac in the arachnid lineage. PMID:26443673

  6. Lineage-specific expansion of IFIT gene family: an insight into coevolution with IFN gene family.

    PubMed

    Liu, Ying; Zhang, Yi-Bing; Liu, Ting-Kai; Gui, Jian-Fang

    2013-01-01

    In mammals, IFIT (Interferon [IFN]-induced proteins with Tetratricopeptide Repeat [TPR] motifs) family genes are involved in many cellular and viral processes, which are tightly related to mammalian IFN response. However, little is known about non-mammalian IFIT genes. In the present study, IFIT genes are identified in the genome databases from the jawed vertebrates including the cartilaginous elephant shark but not from non-vertebrates such as lancelet, sea squirt and acorn worm, suggesting that IFIT gene family originates from a vertebrate ancestor about 450 million years ago. IFIT family genes show conserved gene structure and gene arrangements. Phylogenetic analyses reveal that this gene family has expanded through lineage-specific and species-specific gene duplication. Interestingly, IFN gene family seem to share a common ancestor and a similar evolutionary mechanism; the function link of IFIT genes to IFN response is present early since the origin of both gene families, as evidenced by the finding that zebrafish IFIT genes are upregulated by fish IFNs, poly(I:C) and two transcription factors IRF3/IRF7, likely via the IFN-stimulated response elements (ISRE) within the promoters of vertebrate IFIT family genes. These coevolution features creates functional association of both family genes to fulfill a common biological process, which is likely selected by viral infection during evolution of vertebrates. Our results are helpful for understanding of evolution of vertebrate IFN system. PMID:23818968

  7. Lineage-Specific Expansion of IFIT Gene Family: An Insight into Coevolution with IFN Gene Family

    PubMed Central

    Liu, Ying; Zhang, Yi-Bing; Liu, Ting-Kai; Gui, Jian-Fang

    2013-01-01

    In mammals, IFIT (Interferon [IFN]-induced proteins with Tetratricopeptide Repeat [TPR] motifs) family genes are involved in many cellular and viral processes, which are tightly related to mammalian IFN response. However, little is known about non-mammalian IFIT genes. In the present study, IFIT genes are identified in the genome databases from the jawed vertebrates including the cartilaginous elephant shark but not from non-vertebrates such as lancelet, sea squirt and acorn worm, suggesting that IFIT gene family originates from a vertebrate ancestor about 450 million years ago. IFIT family genes show conserved gene structure and gene arrangements. Phylogenetic analyses reveal that this gene family has expanded through lineage-specific and species-specific gene duplication. Interestingly, IFN gene family seem to share a common ancestor and a similar evolutionary mechanism; the function link of IFIT genes to IFN response is present early since the origin of both gene families, as evidenced by the finding that zebrafish IFIT genes are upregulated by fish IFNs, poly(I:C) and two transcription factors IRF3/IRF7, likely via the IFN-stimulated response elements (ISRE) within the promoters of vertebrate IFIT family genes. These coevolution features creates functional association of both family genes to fulfill a common biological process, which is likely selected by viral infection during evolution of vertebrates. Our results are helpful for understanding of evolution of vertebrate IFN system. PMID:23818968

  8. CG gene body DNA methylation changes and evolution of duplicated genes in cassava.

    PubMed

    Wang, Haifeng; Beyene, Getu; Zhai, Jixian; Feng, Suhua; Fahlgren, Noah; Taylor, Nigel J; Bart, Rebecca; Carrington, James C; Jacobsen, Steven E; Ausin, Israel

    2015-11-01

    DNA methylation is important for the regulation of gene expression and the silencing of transposons in plants. Here we present genome-wide methylation patterns at single-base pair resolution for cassava (Manihot esculenta, cultivar TME 7), a crop with a substantial impact in the agriculture of subtropical and tropical regions. On average, DNA methylation levels were higher in all three DNA sequence contexts (CG, CHG, and CHH, where H equals A, T, or C) than those of the most well-studied model plant Arabidopsis thaliana. As in other plants, DNA methylation was found both on transposons and in the transcribed regions (bodies) of many genes. Consistent with these patterns, at least one cassava gene copy of all of the known components of Arabidopsis DNA methylation pathways was identified. Methylation of LTR transposons (GYPSY and COPIA) was found to be unusually high compared with other types of transposons, suggesting that the control of the activity of these two types of transposons may be especially important. Analysis of duplicated gene pairs resulting from whole-genome duplication showed that gene body DNA methylation and gene expression levels have coevolved over short evolutionary time scales, reinforcing the positive relationship between gene body methylation and high levels of gene expression. Duplicated genes with the most divergent gene body methylation and expression patterns were found to have distinct biological functions and may have been under natural or human selection for cassava traits. PMID:26483493

  9. CG gene body DNA methylation changes and evolution of duplicated genes in cassava

    PubMed Central

    Wang, Haifeng; Beyene, Getu; Zhai, Jixian; Feng, Suhua; Fahlgren, Noah; Taylor, Nigel J.; Bart, Rebecca; Carrington, James C.; Jacobsen, Steven E.; Ausin, Israel

    2015-01-01

    DNA methylation is important for the regulation of gene expression and the silencing of transposons in plants. Here we present genome-wide methylation patterns at single-base pair resolution for cassava (Manihot esculenta, cultivar TME 7), a crop with a substantial impact in the agriculture of subtropical and tropical regions. On average, DNA methylation levels were higher in all three DNA sequence contexts (CG, CHG, and CHH, where H equals A, T, or C) than those of the most well-studied model plant Arabidopsis thaliana. As in other plants, DNA methylation was found both on transposons and in the transcribed regions (bodies) of many genes. Consistent with these patterns, at least one cassava gene copy of all of the known components of Arabidopsis DNA methylation pathways was identified. Methylation of LTR transposons (GYPSY and COPIA) was found to be unusually high compared with other types of transposons, suggesting that the control of the activity of these two types of transposons may be especially important. Analysis of duplicated gene pairs resulting from whole-genome duplication showed that gene body DNA methylation and gene expression levels have coevolved over short evolutionary time scales, reinforcing the positive relationship between gene body methylation and high levels of gene expression. Duplicated genes with the most divergent gene body methylation and expression patterns were found to have distinct biological functions and may have been under natural or human selection for cassava traits. PMID:26483493

  10. The Dca gene involved in cold adaptation in Drosophila melanogaster arose by duplication of the ancestral regucalcin gene.

    PubMed

    Arboleda-Bustos, Carlos E; Segarra, Carmen

    2011-08-01

    The Drosophila cold acclimation gene (Dca) is involved in the adaptive response to low temperatures. This gene is upregulated at the transcriptional level when D. melanogaster flies are exposed 1 day to 15 °C. Dca (or smp-30) is a member of the SMP-30/Gluconolactonase/LRE-like family. In the current study, we characterized the members of this gene family in the 12 Drosophila species with available complete genomes sequences. Two paralogous genes, Dca and regucalcin, were identified in all the Sophophora subgenus species (9 of the 12 species), and their presence was further confirmed in three other species of the subgenus (D. subobscura, D. madeirensis, and D. guanche). However, only regucalcin was present in the species of the Drosophila subgenus (D. grimshawi, D. virilis, and D. mojavensis). The phylogenetic analysis and the molecular organization of Dca that is a nested intronic gene support that Dca arose by a duplication event from the ancestral regucalcin gene after the split of the Sophophora and Drosophila subgenera but before the Sophophora radiation. After the duplication event, the nonsynonymous fixation rate increased in the branch leading to Dca (but not to regucalcin), suggesting the neofunctionalization of the former duplicate. Thus, regucalcin would have maintained the ancestral gene function, and Dca would have acquired a new function likely related to Ca²⁺ homeostasis and cold acclimation. Molecular evolution of Dca has been affected by its implication in the adaptive response to cold temperatures. Indeed, the gene has evolved under stronger purifying selection in the temperate than in the tropical Sophophora species, as reflected by the ratio of nonsynonymous to synonymous substitutions. This result is consistent with functional constraints acting on the DCA protein to keep species adaptation to temperate climates. Dca and regucalcin also differ in their expression patterns. The expression profile of regucalcin is similar to that of the

  11. Erroneous prenatal diagnosis of congenital adrenal hyperplasia owing to a duplication of the CYP21A2 gene.

    PubMed

    Lekarev, O; Tafuri, K; Lane, A H; Zhu, G; Nakamoto, J M; Buller-Burckle, A M; Wilson, T A; New, M I

    2013-01-01

    Congenital adrenal hyperplasia (CAH) is an autosomal recessive disorder where steroidogenesis in the adrenal cortex is impaired. The most common form is caused by 21-hydroxylase deficiency (21OHD). Classical 21OHD is characterized by glucocorticoid and mineralocorticoid deficiency and by overproduction of adrenal androgens. The diagnosis rests on biochemical and genetic analyses. In families with history of CAH, prenatal genetic diagnosis is offered. We herein present a case of an infant whose parents were identified to carry mutations on the CYP21A2 gene. The fetal DNA analysis demonstrated that the fetus carried a paternal exon 8 (Q318X) mutation and a maternal exon 8 (R356X) mutation. The fetus was presumed to be affected with CAH, yet his clinical presentation at birth was not consistent with the diagnosis. Repeated genetic analysis identified a paternal CYP21A2 gene duplication with Q318X mutation on one copy of CYP21A2. We conclude that a duplication of the CYP21A2 gene should be suspected when clinical and hormonal findings do not support the genetic diagnosis. Furthermore, because individuals with Q318X mutation frequently have a duplication of the CYP21A2 gene, when Q318X is detected, it is important to distinguish the severe point mutation in single gene copy alleles from the non-deficient variant in gene-duplicated alleles. PMID:23269230

  12. NADP-Malate Dehydrogenase Gene Evolution in Andropogoneae (Poaceae): Gene Duplication Followed by Sub-functionalization

    PubMed Central

    RONDEAU, P.; ROUCH, C.; BESNARD, G.

    2005-01-01

    • Background and Aims Plastid NADP-dependent malate dehydrogenase (MDH) catalyses the conversion of oxaloacetate to malate. In C4 plants, it is involved in photosynthetic carbon assimilation. In Poaceae, one NADP-MDH gene has been identified in rice (C3; Erhartoideae) and maize (C4; Panicoideae), whereas two tandemly repeated genes have been identified in Sorghum (C4; Panicoideae). In the present study, the molecular evolution of the NADP-MDH multigene family was investigated in order to analyse how the C4 isoform has evolved over a broader range of panicoid grasses. • Methods Polymerase chain reaction (PCR)-based cloning was used to isolate cDNAs encoding NADP-MDHs from 15 species of Panicoideae. A gene phylogeny was reconstructed based on cDNA sequences using distance and maximum parsimony methods. Episodic selection along some branches of the phylogenetic tree was tested by analysing non-synonymous and synonymous rate ratios.Transcription of NADP-MDH genes was compared in green leaves of five accessions of Saccharum, Sorghum and Vetiveria using a semi-quantitative PCR approach. • Key Results Phylogenetic analyses of these data support the existence of two NADP-MDH gene lineages (NMDH-I and NMDH-II) in several Andropogoneae (i.e. Saccharum, Sorghum and Vetiveria). Episodic positive selection was shown along the basal branch of the NMDH-II clade. Three amino acid modifications allow the two gene lineages to be distinguished, suggesting a positive selection at these sites. In green leaves, we showed that the transcript accumulation was higher for NMDH-I than for NMDH-II. • Conclusions It is hypothesized that the maintenance of both NADP-MDH genes in some Andropogoneae is due to a partition of the original functions across both copies. NMDH-I probably corresponds to the C4 isoform as previously suggested. Nevertheless, some C4 species (e.g. maize) only have one gene which should be selected for its high expression level in leaves. This study confirms that

  13. Different patterns of evolution for duplicated DNA repair genes in bacteria of the Xanthomonadales group

    PubMed Central

    Martins-Pinheiro, Marinalva; Galhardo, Rodrigo S; Lage, Claudia; Lima-Bessa, Keronninn M; Aires, Karina A; Menck, Carlos FM

    2004-01-01

    Background DNA repair genes encode proteins that protect organisms against genetic damage generated by environmental agents and by-products of cell metabolism. The importance of these genes in life maintenance is supported by their high conservation, and the presence of duplications of such genes may be easily traced, especially in prokaryotic genomes. Results The genome sequences of two Xanthomonas species were used as the basis for phylogenetic analyses of genes related to DNA repair that were found duplicated. Although 16S rRNA phylogenetic analyses confirm their classification at the basis of the gamma proteobacteria subdivision, differences were found in the origin of the various genes investigated. Except for lexA, detected as a recent duplication, most of the genes in more than one copy are represented by two highly divergent orthologs. Basically, one of such duplications is frequently positioned close to other gamma proteobacteria, but the second is often positioned close to unrelated bacteria. These orthologs may have occurred from old duplication events, followed by extensive gene loss, or were originated from lateral gene transfer (LGT), as is the case of the uvrD homolog. Conclusions Duplications of DNA repair related genes may result in redundancy and also improve the organisms' responses to environmental challenges. Most of such duplications, in Xanthomonas, seem to have arisen from old events and possibly enlarge both functional and evolutionary genome potentiality. PMID:15333143

  14. First evidence of a large CHEK2 duplication involved in cancer predisposition in an Italian family with hereditary breast cancer

    PubMed Central

    2014-01-01

    Background CHEK2 is a multi-cancer susceptibility gene whose common germline mutations are known to contribute to the risk of developing breast and prostate cancer. Case presentation Here, we describe an Italian family with a high number of cases of breast cancer and other types of tumour subjected to the MLPA test to verify the presence of BRCA1, BRCA2 and CHEK2 deletions and duplications. We identified a new 23-kb duplication in the CHEK2 gene extending from intron 5 to 13 that was associated with breast cancer in the family. The presence and localisation of the alteration was confirmed by a second analysis by Next-Generation Sequencing. Conclusions This finding suggests that CHEK2 mutations are heterogeneous and that techniques other than sequencing, such as MLPA, are needed to identify CHEK2 mutations. It also indicates that CHEK2 rare variants, such as duplications, can confer a high susceptibility to cancer development and should thus be studied in depth as most of our knowledge of CHEK2 concerns common mutations. PMID:24986639

  15. The Evolutionary Fate of Alternatively Spliced Homologous Exons after Gene Duplication

    PubMed Central

    Abascal, Federico; Tress, Michael L.; Valencia, Alfonso

    2015-01-01

    Alternative splicing and gene duplication are the two main processes responsible for expanding protein functional diversity. Although gene duplication can generate new genes and alternative splicing can introduce variation through alternative gene products, the interplay between the two processes is complex and poorly understood. Here, we have carried out a study of the evolution of alternatively spliced exons after gene duplication to better understand the interaction between the two processes. We created a manually curated set of 97 human genes with mutually exclusively spliced homologous exons and analyzed the evolution of these exons across five distantly related vertebrates (lamprey, spotted gar, zebrafish, fugu, and coelacanth). Most of these exons had an ancient origin (more than 400 Ma). We found examples supporting two extreme evolutionary models for the behaviour of homologous axons after gene duplication. We observed 11 events in which gene duplication was accompanied by splice isoform separation, that is, each paralog specifically conserved just one distinct ancestral homologous exon. At other extreme, we identified genes in which the homologous exons were always conserved within paralogs, suggesting that the alternative splicing event cannot easily be separated from the function in these genes. That many homologous exons fall in between these two extremes highlights the diversity of biological systems and suggests that the subtle balance between alternative splicing and gene duplication is adjusted to the specific cellular context of each gene. PMID:25931610

  16. The evolutionary fate of alternatively spliced homologous exons after gene duplication.

    PubMed

    Abascal, Federico; Tress, Michael L; Valencia, Alfonso

    2015-06-01

    Alternative splicing and gene duplication are the two main processes responsible for expanding protein functional diversity. Although gene duplication can generate new genes and alternative splicing can introduce variation through alternative gene products, the interplay between the two processes is complex and poorly understood. Here, we have carried out a study of the evolution of alternatively spliced exons after gene duplication to better understand the interaction between the two processes. We created a manually curated set of 97 human genes with mutually exclusively spliced homologous exons and analyzed the evolution of these exons across five distantly related vertebrates (lamprey, spotted gar, zebrafish, fugu, and coelacanth). Most of these exons had an ancient origin (more than 400 Ma). We found examples supporting two extreme evolutionary models for the behaviour of homologous axons after gene duplication. We observed 11 events in which gene duplication was accompanied by splice isoform separation, that is, each paralog specifically conserved just one distinct ancestral homologous exon. At other extreme, we identified genes in which the homologous exons were always conserved within paralogs, suggesting that the alternative splicing event cannot easily be separated from the function in these genes. That many homologous exons fall in between these two extremes highlights the diversity of biological systems and suggests that the subtle balance between alternative splicing and gene duplication is adjusted to the specific cellular context of each gene. PMID:25931610

  17. The major resistance gene cluster in lettuce is highly duplicated and spans several megabases.

    PubMed Central

    Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W

    1998-01-01

    At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over. PMID:9811791

  18. Gene duplication and divergence affecting drug content in Cannabis sativa.

    PubMed

    Weiblen, George D; Wenger, Jonathan P; Craft, Kathleen J; ElSohly, Mahmoud A; Mehmedic, Zlatko; Treiber, Erin L; Marks, M David

    2015-12-01

    Cannabis sativa is an economically important source of durable fibers, nutritious seeds, and psychoactive drugs but few economic plants are so poorly understood genetically. Marijuana and hemp were crossed to evaluate competing models of cannabinoid inheritance and to explain the predominance of tetrahydrocannabinolic acid (THCA) in marijuana compared with cannabidiolic acid (CBDA) in hemp. Individuals in the resulting F2 population were assessed for differential expression of cannabinoid synthase genes and were used in linkage mapping. Genetic markers associated with divergent cannabinoid phenotypes were identified. Although phenotypic segregation and a major quantitative trait locus (QTL) for the THCA/CBDA ratio were consistent with a simple model of codominant alleles at a single locus, the diversity of THCA and CBDA synthase sequences observed in the mapping population, the position of enzyme coding loci on the map, and patterns of expression suggest multiple linked loci. Phylogenetic analysis further suggests a history of duplication and divergence affecting drug content. Marijuana is distinguished from hemp by a nonfunctional CBDA synthase that appears to have been positively selected to enhance psychoactivity. An unlinked QTL for cannabinoid quantity may also have played a role in the recent escalation of drug potency. PMID:26189495

  19. Tandem gene arrays in Trypanosoma brucei: Comparative phylogenomic analysis of duplicate sequence variation

    PubMed Central

    Jackson, Andrew P

    2007-01-01

    Background The genome sequence of the protistan parasite Trypanosoma brucei contains many tandem gene arrays. Gene duplicates are created through tandem duplication and are expressed through polycistronic transcription, suggesting that the primary purpose of long, tandem arrays is to increase gene dosage in an environment where individual gene promoters are absent. This report presents the first account of the tandem gene arrays in the T. brucei genome, employing several related genome sequences to establish how variation is created and removed. Results A systematic survey of tandem gene arrays showed that substantial sequence variation existed across the genome; variation from different regions of an array often produced inconsistent phylogenetic affinities. Phylogenetic relationships of gene duplicates were consistent with concerted evolution being a widespread homogenising force. However, tandem duplicates were not usually identical; therefore, any homogenising effect was coincident with divergence among duplicates. Allelic gene conversion was detected using various criteria and was apparently able to both remove and introduce sequence variation. Tandem arrays containing structural heterogeneity demonstrated how sequence homogenisation and differentiation can occur within a single locus. Conclusion The use of multiple genome sequences in a comparative analysis of tandem gene arrays identified substantial sequence variation among gene duplicates. The distribution of sequence variation is determined by a dynamic balance of conservative and innovative evolutionary forces. Gene trees from various species showed that intraspecific duplicates evolve in concert, perhaps through frequent gene conversion, although this does not prevent sequence divergence, especially where structural heterogeneity physically separates a duplicate from its neighbours. In describing dynamics of sequence variation that have consequences beyond gene dosage, this survey provides a basis for

  20. The roles of whole-genome and small-scale duplications in the functional specialization of Saccharomyces cerevisiae genes.

    PubMed

    Fares, Mario A; Keane, Orla M; Toft, Christina; Carretero-Paulet, Lorenzo; Jones, Gary W

    2013-01-01

    Researchers have long been enthralled with the idea that gene duplication can generate novel functions, crediting this process with great evolutionary importance. Empirical data shows that whole-genome duplications (WGDs) are more likely to be retained than small-scale duplications (SSDs), though their relative contribution to the functional fate of duplicates remains unexplored. Using the map of genetic interactions and the re-sequencing of 27 Saccharomyces cerevisiae genomes evolving for 2,200 generations we show that SSD-duplicates lead to neo-functionalization while WGD-duplicates partition ancestral functions. This conclusion is supported by: (a) SSD-duplicates establish more genetic interactions than singletons and WGD-duplicates; (b) SSD-duplicates copies share more interaction-partners than WGD-duplicates copies; (c) WGD-duplicates interaction partners are more functionally related than SSD-duplicates partners; (d) SSD-duplicates gene copies are more functionally divergent from one another, while keeping more overlapping functions, and diverge in their sub-cellular locations more than WGD-duplicates copies; and (e) SSD-duplicates complement their functions to a greater extent than WGD-duplicates. We propose a novel model that uncovers the complexity of evolution after gene duplication. PMID:23300483

  1. The power-law distribution of gene family size is driven by the pseudogenisation rate's heterogeneity between gene families.

    PubMed

    Hughes, Timothy; Liberles, David A

    2008-05-15

    Genome sequencing has shown that the number of homologous gene families of a given size declines rapidly with family size. A power-law has been shown to provide the best mathematical description of this relationship. However, it remains unclear what evolutionary forces drive this observation. We use models of gene duplication, pseudogenisation and accumulation of replacement substitutions, which have been validated and parameterised using genomic data, to build a model of homologous gene evolution. We use this model to simulate the evolution of the distribution of gene family size and show that the power-law distribution is driven by the pseudogenisation rate's heterogeneity across gene families and its correlation within families. Moreover, we show that gene duplication and pseudogenisation are necessary and sufficient for the emergence of the power-law. PMID:18378100

  2. Gene duplication of type-B ARR transcription factors systematically extends transcriptional regulatory structures in Arabidopsis

    PubMed Central

    Choi, Seung Hee; Hyeon, Do Young; Lee, ll Hwan; Park, Su Jin; Han, Seungmin; Lee, In Chul; Hwang, Daehee; Nam, Hong Gil

    2014-01-01

    Many of duplicated genes are enriched in signaling pathways. Recently, gene duplication of kinases has been shown to provide genetic buffering and functional diversification in cellular signaling. Transcription factors (TFs) are also often duplicated. However, how duplication of TFs affects their regulatory structures and functions of target genes has not been explored at the systems level. Here, we examined regulatory and functional roles of duplication of three major ARR TFs (ARR1, 10, and 12) in Arabidopsis cytokinin signaling using wild-type and single, double, and triple deletion mutants of the TFs. Comparative analysis of gene expression profiles obtained from Arabidopsis roots in wild-type and these mutants showed that duplication of ARR TFs systematically extended their transcriptional regulatory structures, leading to enhanced robustness and diversification in functions of target genes, as well as in regulation of cellular networks of target genes. Therefore, our results suggest that duplication of TFs contributes to robustness and diversification in functions of target genes by extending transcriptional regulatory structures. PMID:25425016

  3. Evolution of Cis-Regulatory Elements and Regulatory Networks in Duplicated Genes of Arabidopsis1[OPEN

    PubMed Central

    Guo, Xu Qiu; Adams, Keith L.

    2015-01-01

    Plant genomes contain large numbers of duplicated genes that contribute to the evolution of new functions. Following duplication, genes can exhibit divergence in their coding sequence and their expression patterns. Changes in the cis-regulatory element landscape can result in changes in gene expression patterns. High-throughput methods developed recently can identify potential cis-regulatory elements on a genome-wide scale. Here, we use a recent comprehensive data set of DNase I sequencing-identified cis-regulatory binding sites (footprints) at single-base-pair resolution to compare binding sites and network connectivity in duplicated gene pairs in Arabidopsis (Arabidopsis thaliana). We found that duplicated gene pairs vary greatly in their cis-regulatory element architecture, resulting in changes in regulatory network connectivity. Whole-genome duplicates (WGDs) have approximately twice as many footprints in their promoters left by potential regulatory proteins than do tandem duplicates (TDs). The WGDs have a greater average number of footprint differences between paralogs than TDs. The footprints, in turn, result in more regulatory network connections between WGDs and other genes, forming denser, more complex regulatory networks than shown by TDs. When comparing regulatory connections between duplicates, WGDs had more pairs in which the two genes are either partially or fully diverged in their network connections, but fewer genes with no network connections than the TDs. There is evidence of younger TDs and WGDs having fewer unique connections compared with older duplicates. This study provides insights into cis-regulatory element evolution and network divergence in duplicated genes. PMID:26474639

  4. Extensive Local Gene Duplication and Functional Divergence among Paralogs in Atlantic Salmon

    PubMed Central

    Warren, Ian A.; Ciborowski, Kate L.; Casadei, Elisa; Hazlerigg, David G.; Martin, Sam; Jordan, William C.; Sumner, Seirian

    2014-01-01

    Many organisms can generate alternative phenotypes from the same genome, enabling individuals to exploit diverse and variable environments. A prevailing hypothesis is that such adaptation has been favored by gene duplication events, which generate redundant genomic material that may evolve divergent functions. Vertebrate examples of recent whole-genome duplications are sparse although one example is the salmonids, which have undergone a whole-genome duplication event within the last 100 Myr. The life-cycle of the Atlantic salmon, Salmo salar, depends on the ability to produce alternating phenotypes from the same genome, to facilitate migration and maintain its anadromous life history. Here, we investigate the hypothesis that genome-wide and local gene duplication events have contributed to the salmonid adaptation. We used high-throughput sequencing to characterize the transcriptomes of three key organs involved in regulating migration in S. salar: Brain, pituitary, and olfactory epithelium. We identified over 10,000 undescribed S. salar sequences and designed an analytic workflow to distinguish between paralogs originating from local gene duplication events or from whole-genome duplication events. These data reveal that substantial local gene duplications took place shortly after the whole-genome duplication event. Many of the identified paralog pairs have either diverged in function or become noncoding. Future functional genomics studies will reveal to what extent this rich source of divergence in genetic sequence is likely to have facilitated the evolution of extreme phenotypic plasticity required for an anadromous life-cycle. PMID:24951567

  5. The duplication 17p13.3 phenotype: analysis of 21 families delineates developmental, behavioral and brain abnormalities, and rare variant phenotypes.

    PubMed

    Curry, Cynthia J; Rosenfeld, Jill A; Grant, Erica; Gripp, Karen W; Anderson, Carol; Aylsworth, Arthur S; Saad, Taha Ben; Chizhikov, Victor V; Dybose, Giedre; Fagerberg, Christina; Falco, Michelle; Fels, Christina; Fichera, Marco; Graakjaer, Jesper; Greco, Donatella; Hair, Jennifer; Hopkins, Elizabeth; Huggins, Marlene; Ladda, Roger; Li, Chumei; Moeschler, John; Nowaczyk, Malgorzata J M; Ozmore, Jillian R; Reitano, Santina; Romano, Corrado; Roos, Laura; Schnur, Rhonda E; Sell, Susan; Suwannarat, Pim; Svaneby, Dea; Szybowska, Marta; Tarnopolsky, Mark; Tervo, Raymond; Tsai, Anne Chun-Hui; Tucker, Megan; Vallee, Stephanie; Wheeler, Ferrin C; Zand, Dina J; Barkovich, A James; Aradhya, Swaroop; Shaffer, Lisa G; Dobyns, William B

    2013-08-01

    Chromosome 17p13.3 is a gene rich region that when deleted is associated with the well-known Miller-Dieker syndrome. A recently described duplication syndrome involving this region has been associated with intellectual impairment, autism and occasional brain MRI abnormalities. We report 34 additional patients from 21 families to further delineate the clinical, neurological, behavioral, and brain imaging findings. We found a highly diverse phenotype with inter- and intrafamilial variability, especially in cognitive development. The most specific phenotype occurred in individuals with large duplications that include both the YWHAE and LIS1 genes. These patients had a relatively distinct facial phenotype and frequent structural brain abnormalities involving the corpus callosum, cerebellar vermis, and cranial base. Autism spectrum disorders were seen in a third of duplication probands, most commonly in those with duplications of YWHAE and flanking genes such as CRK. The typical neurobehavioral phenotype was usually seen in those with the larger duplications. We did not confirm the association of early overgrowth with involvement of YWHAE and CRK, or growth failure with duplications of LIS1. Older patients were often overweight. Three variant phenotypes included cleft lip/palate (CLP), split hand/foot with long bone deficiency (SHFLD), and a connective tissue phenotype resembling Marfan syndrome. The duplications in patients with clefts appear to disrupt ABR, while the SHFLD phenotype was associated with duplication of BHLHA9 as noted in two recent reports. The connective tissue phenotype did not have a convincing critical region. Our experience with this large cohort expands knowledge of this diverse duplication syndrome. PMID:23813913

  6. The evolution of vertebrate tetraspanins: gene loss, retention, and massive positive selection after whole genome duplications

    PubMed Central

    2010-01-01

    Background The vertebrate tetraspanin family has many features which make it suitable for preserving the imprint of ancient sequence evolution and amenable for phylogenomic analysis. So we believe that an in-depth analysis of the tetraspanin evolution not only provides more complete understanding of tetraspanin biology, but offers new insights into the influence of the two rounds of whole genome duplication (2R-WGD) at the origin of vertebrates. Results A detailed phylogeny of vertebrate tetraspanins was constructed by using multiple lines of information, including sequence-based phylogenetics, key structural features, intron configuration and genomic synteny. In particular, a total of 38 modern tetraspanin ortholog lineages in bony vertebrates have been identified and subsequently classified into 17 ancestral lineages existing before 2R-WGD. Based on this phylogeny, we found that the ohnolog retention rate of tetraspanins after 2R-WGD was three times as the average (a rate similar to those of transcription factors and protein kinases). This high rate didn't increase the tetrapanin family size, but changed the family composition, possibly by displacing vertebrate-specific gene lineages with the lineages conserved across deuterostomes. We also found that the period from 2R-WGD to recent time is controlled by gene losses. Meanwhile, positive selection has been detected on 80% of the branches right after 2R-WGDs, which declines significantly on both magnitude and extensity on the following speciation branches. Notably, the loss of mammalian RDS2 is accompanied by strong positive selection on mammalian ROM1, possibly due to gene loss-induced compensatory evolution. Conclusions First, different from transcription factors and kinases, high duplicate retention rate after 2R-WGD didn't increase the tetraspanin family size but just reshaped the family composition. Second, the evolution of tetraspanins right after 2R-WGD had been impacted by a massive wave of gene loss and

  7. Preservation of Gene Duplication Increases the Regulatory Spectrum of Ribosomal Protein Genes and Enhances Growth under Stress.

    PubMed

    Parenteau, Julie; Lavoie, Mathieu; Catala, Mathieu; Malik-Ghulam, Mustafa; Gagnon, Jules; Abou Elela, Sherif

    2015-12-22

    In baker's yeast, the majority of ribosomal protein genes (RPGs) are duplicated, and it was recently proposed that such duplications are preserved via the functional specialization of the duplicated genes. However, the origin and nature of duplicated RPGs' (dRPGs) functional specificity remain unclear. In this study, we show that differences in dRPG functions are generated by variations in the modality of gene expression and, to a lesser extent, by protein sequence. Analysis of the sequence and expression patterns of non-intron-containing RPGs indicates that each dRPG is controlled by specific regulatory sequences modulating its expression levels in response to changing growth conditions. Homogenization of dRPG sequences reduces cell tolerance to growth under stress without changing the number of expressed genes. Together, the data reveal a model where duplicated genes provide a means for modulating the expression of ribosomal proteins in response to stress. PMID:26686636

  8. New Organelles by Gene Duplication in a Biophysical Model of Eukaryote Endomembrane Evolution

    PubMed Central

    Ramadas, Rohini; Thattai, Mukund

    2013-01-01

    Extant eukaryotic cells have a dynamic traffic network that consists of diverse membrane-bound organelles exchanging matter via vesicles. This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont by a prokaryotic host cell >1.8 billion years ago. Here we investigate the mechanistic link between gene duplication and the emergence of new nonendosymbiotic organelles, using a minimal biophysical model of traffic. Our model incorporates membrane-bound compartments, coat proteins and adaptors that drive vesicles to bud and segregate cargo from source compartments, and SNARE proteins and associated factors that cause vesicles to fuse into specific destination compartments. In simulations, arbitrary numbers of compartments with heterogeneous initial compositions segregate into a few compositionally distinct subsets that we term organelles. The global structure of the traffic system (i.e., the number, composition, and connectivity of organelles) is determined completely by local molecular interactions. On evolutionary timescales, duplication of the budding and fusion machinery followed by loss of cross-interactions leads to the emergence of new organelles, with increased molecular specificity being necessary to maintain larger organellar repertoires. These results clarify potential modes of early eukaryotic evolution as well as more recent eukaryotic diversification. PMID:23746528

  9. Genesis of the vertebrate FoxP subfamily member genes occurred during two ancestral whole genome duplication events.

    PubMed

    Song, Xiaowei; Tang, Yezhong; Wang, Yajun

    2016-08-22

    The vertebrate FoxP subfamily genes play important roles in the construction of essential functional modules involved in physiological and developmental processes. To explore the adaptive evolution of functional modules associated with the FoxP subfamily member genes, it is necessary to study the gene duplication process. We detected four member genes of the FoxP subfamily in sea lampreys (a representative species of jawless vertebrates) through genome screenings and phylogenetic analyses. Reliable paralogons (i.e. paralogous chromosome segments) have rarely been detected in scaffolds of FoxP subfamily member genes in sea lampreys due to the considerable existence of HTH_Tnp_Tc3_2 transposases. However, these transposases did not alter gene numbers of the FoxP subfamily in sea lampreys. The coincidence between the "1-4" gene duplication pattern of FoxP subfamily genes from invertebrates to vertebrates and two rounds of ancestral whole genome duplication (1R- and 2R-WGD) events reveal that the FoxP subfamily of vertebrates was quadruplicated in the 1R- and 2R-WGD events. Furthermore, we deduced that a synchronous gene duplication process occurred for the FoxP subfamily and for three linked gene families/subfamilies (i.e. MIT family, mGluR group III and PLXNA subfamily) in the 1R- and 2R-WGD events using phylogenetic analyses and mirror-dendrogram methods (i.e. algorithms to test protein-protein interactions). Specifically, the ancestor of FoxP1 and FoxP3 and the ancestor of FoxP2 and FoxP4 were generated in 1R-WGD event. In the subsequent 2R-WGD event, these two ancestral genes were changed into FoxP1, FoxP2, FoxP3 and FoxP4. The elucidation of these gene duplication processes shed light on the phylogenetic relationships between functional modules of the FoxP subfamily member genes. PMID:27188254

  10. Gene duplication in the major insecticide target site, Rdl, in Drosophila melanogaster.

    PubMed

    Remnant, Emily J; Good, Robert T; Schmidt, Joshua M; Lumb, Christopher; Robin, Charles; Daborn, Phillip J; Batterham, Philip

    2013-09-01

    The Resistance to Dieldrin gene, Rdl, encodes a GABA-gated chloride channel subunit that is targeted by cyclodiene and phenylpyrazole insecticides. The gene was first characterized in Drosophila melanogaster by genetic mapping of resistance to the cyclodiene dieldrin. The 4,000-fold resistance observed was due to a single amino acid replacement, Ala(301) to Ser. The equivalent change was subsequently identified in Rdl orthologs of a large range of resistant insect species. Here, we report identification of a duplication at the Rdl locus in D. melanogaster. The 113-kb duplication contains one WT copy of Rdl and a second copy with two point mutations: an Ala(301) to Ser resistance mutation and Met(360) to Ile replacement. Individuals with this duplication exhibit intermediate dieldrin resistance compared with single copy Ser(301) homozygotes, reduced temperature sensitivity, and altered RNA editing associated with the resistant allele. Ectopic recombination between Roo transposable elements is involved in generating this genomic rearrangement. The duplication phenotypes were confirmed by construction of a transgenic, artificial duplication integrating the 55.7-kb Rdl locus with a Ser(301) change into an Ala(301) background. Gene duplications can contribute significantly to the evolution of insecticide resistance, most commonly by increasing the amount of gene product produced. Here however, duplication of the Rdl target site creates permanent heterozygosity, providing unique potential for adaptive mutations to accrue in one copy, without abolishing the endogenous role of an essential gene. PMID:23959864

  11. Duplication of partial spinosyn biosynthetic gene cluster in Saccharopolyspora spinosa enhances spinosyn production.

    PubMed

    Tang, Ying; Xia, Liqiu; Ding, Xuezhi; Luo, Yushuang; Huang, Fan; Jiang, Yuanwei

    2011-12-01

    Spinosyns, the secondary metabolites produced by Saccharopolyspora spinosa, are the active ingredients in a family of insect control agents. Most of the S. spinosa genes involved in spinosyn biosynthesis are found in a contiguous c. 74-kb cluster. To increase the spinosyn production through overexpression of their biosynthetic genes, part of its gene cluster (c. 18 kb) participating in the conversion of the cyclized polyketide to spinosyn was obtained by direct cloning via Red/ET recombination rather than by constructing and screening the genomic library. The resultant plasmid pUCAmT-spn was introduced into S. spinosa CCTCC M206084 from Escherichia coli S17-1 by conjugal transfer. The subsequent single-crossover homologous recombination caused a duplication of the partial gene cluster. Integration of this plasmid enhanced production of spinosyns with a total of 388 (± 25.0) mg L(-1) for spinosyns A and D in the exconjugant S. spinosa trans1 compared with 100 (± 7.7) mg L(-1) in the parental strain. Quantitative real time polymerase chain reaction analysis of three selected genes (spnH, spnI, and spnK) confirmed the positive effect of the overexpression of these genes on the spinosyn production. This study provides a simple avenue for enhancing spinosyn production. The strategies could also be used to improve the yield of other secondary metabolites. PMID:22092858

  12. Evolution of human-specific neural SRGAP2 genes by incomplete segmental duplication.

    PubMed

    Dennis, Megan Y; Nuttle, Xander; Sudmant, Peter H; Antonacci, Francesca; Graves, Tina A; Nefedov, Mikhail; Rosenfeld, Jill A; Sajjadian, Saba; Malig, Maika; Kotkiewicz, Holland; Curry, Cynthia J; Shafer, Susan; Shaffer, Lisa G; de Jong, Pieter J; Wilson, Richard K; Eichler, Evan E

    2012-05-11

    Gene duplication is an important source of phenotypic change and adaptive evolution. We leverage a haploid hydatidiform mole to identify highly identical sequences missing from the reference genome, confirming that the cortical development gene Slit-Robo Rho GTPase-activating protein 2 (SRGAP2) duplicated three times exclusively in humans. We show that the promoter and first nine exons of SRGAP2 duplicated from 1q32.1 (SRGAP2A) to 1q21.1 (SRGAP2B) ∼3.4 million years ago (mya). Two larger duplications later copied SRGAP2B to chromosome 1p12 (SRGAP2C) and to proximal 1q21.1 (SRGAP2D) ∼2.4 and ∼1 mya, respectively. Sequence and expression analyses show that SRGAP2C is the most likely duplicate to encode a functional protein and is among the most fixed human-specific duplicate genes. Our data suggest a mechanism where incomplete duplication created a novel gene function-antagonizing parental SRGAP2 function-immediately "at birth" 2-3 mya, which is a time corresponding to the transition from Australopithecus to Homo and the beginning of neocortex expansion. PMID:22559943

  13. beta. amyloid gene duplication in Alzheimer's disease and karyotypically normal Down syndrome

    SciTech Connect

    Delabar, J.; Goldgaber, D.; Lamour, Y.; Nicole, A.; Huret, J.; De Groucy, J.; Brown, P.; Gajdusek, D.C.; Sinet, P.

    1987-03-13

    With the recently cloned complementary DNA probe, lambdaAm4 for the chromosome 21 gene encoding brain amyloid polypeptide (..beta.. amyloid protein) of Alzheimer's disease, leukocyte DNA from three patients with sporadic Alzheimer's disease and two patients with karyotypically normal Down syndrome was found to contain three copies of this bene. Because a small region of chromosome 21 containing the ets-2 gene is duplicated in patients with Alzheimer's disease, as well as in karyotypically normal Down syndrome, duplication of a subsection of the critical segment of chromosome 21 that is duplicated in Down syndrome may be the genetic defect in Alzeimer's disease.

  14. Extensive horizontal gene transfer, duplication, and loss of chlorophyll synthesis genes in the algae

    DOE PAGESBeta

    Hunsperger, Heather M.; Randhawa, Tejinder; Cattolico, Rose Ann

    2015-02-10

    Two non-homologous, isofunctional enzymes catalyze the penultimate step of chlorophyll a synthesis in oxygenic photosynthetic organisms such as cyanobacteria, eukaryotic algae and land plants: the light independent (LIPOR) and light-dependent (POR) protochlorophyllide oxidoreductases. Whereas the distribution of these enzymes in cyanobacteria and land plants is well understood, the presence, loss, duplication, and replacement of these genes have not been surveyed in the polyphyletic and remarkably diverse eukaryotic algal lineages.

  15. Duplication and divergence: the evolution of new genes and old ideas.

    PubMed

    Taylor, John S; Raes, Jeroen

    2004-01-01

    Over 35 years ago, Susumu Ohno stated that gene duplication was the single most important factor in evolution. He reiterated this point a few years later in proposing that without duplicated genes the creation of metazoans, vertebrates, and mammals from unicellular organisms would have been impossible. Such big leaps in evolution, he argued, required the creation of new gene loci with previously nonexistent functions. Bold statements such as these, combined with his proposal that at least one whole-genome duplication event facilitated the evolution of vertebrates, have made Ohno an icon in the literature on genome evolution. However, discussion on the occurrence and consequences of gene and genome duplication events has a much longer, and often neglected, history. Here we review literature dealing with the occurrence and consequences of gene duplication, beginning in 1911. We document conceptual and technological advances in gene duplication research from this early research in comparative cytology up to recent research on whole genomes, "transcriptomes," and "interactomes." PMID:15568988

  16. Restriction and Recruitment—Gene Duplication and the Origin and Evolution of Snake Venom Toxins

    PubMed Central

    Hargreaves, Adam D.; Swain, Martin T.; Hegarty, Matthew J.; Logan, Darren W.; Mulley, John F.

    2014-01-01

    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel combinations of transcription factor binding sites in upstream regulatory regions. Therefore, although this hypothesis concerning the evolution of snake venom is very unlikely and should be regarded with caution, it is nonetheless often assumed to be established fact, hindering research into the true origins of snake venom toxins. To critically evaluate this hypothesis, we have generated transcriptomic data for body tissues and salivary and venom glands from five species of venomous and nonvenomous reptiles. Our comparative transcriptomic analysis of these data reveals that snake venom does not evolve through the hypothesized process of duplication and recruitment of genes encoding body proteins. Indeed, our results show that many proposed venom toxins are in fact expressed in a wide variety of body tissues, including the salivary gland of nonvenomous reptiles and that these genes have therefore been restricted to the venom gland following duplication, not recruited. Thus, snake venom evolves through the duplication and subfunctionalization of genes encoding existing salivary proteins. These results highlight the danger of the elegant and intuitive “just-so story” in evolutionary biology. PMID:25079342

  17. Evolution of Vertebrate Adam Genes; Duplication of Testicular Adams from Ancient Adam9/9-like Loci

    PubMed Central

    Wei, Shuo

    2015-01-01

    Members of the disintegrin metalloproteinase (ADAM) family have important functions in regulating cell-cell and cell-matrix interactions as well as cell signaling. There are two major types of ADAMs: the somatic ADAMs (sADAMs) that have a significant presence in somatic tissues, and the testicular ADAMs (tADAMs) that are expressed predominantly in the testis. Genes encoding tADAMs can be further divided into two groups: group I (intronless) and group II (intron-containing). To date, tAdams have only been reported in placental mammals, and their evolutionary origin and relationship to sAdams remain largely unknown. Using phylogenetic and syntenic tools, we analyzed the Adam genes in various vertebrates ranging from fishes to placental mammals. Our analyses reveal duplication and loss of some sAdams in certain vertebrate species. In particular, there exists an Adam9-like gene in non-mammalian vertebrates but not mammals. We also identified putative group I and group II tAdams in all amniote species that have been examined. These tAdam homologues are more closely related to Adams 9 and 9-like than to other sAdams. In all amniote species examined, group II tAdams lie in close vicinity to Adam9 and hence likely arose from tandem duplication, whereas group I tAdams likely originated through retroposition because of their lack of introns. Clusters of multiple group I tAdams are also common, suggesting tandem duplication after retroposition. Therefore, Adam9/9-like and some of the derived tAdam loci are likely preferred targets for tandem duplication and/or retroposition. Consistent with this hypothesis, we identified a young retroposed gene that duplicated recently from Adam9 in the opossum. As a result of gene duplication, some tAdams were pseudogenized in certain species, whereas others acquired new expression patterns and functions. The rapid duplication of Adam genes has a major contribution to the diversity of ADAMs in various vertebrate species. PMID:26308360

  18. Human-specific evolution of novel SRGAP2 genes by incomplete segmental duplication

    PubMed Central

    Dennis, Megan Y.; Nuttle, Xander; Sudmant, Peter H.; Antonacci, Francesca; Graves, Tina A.; Nefedov, Mikhail; Rosenfeld, Jill A.; Sajjadian, Saba; Malig, Maika; Kotkiewicz, Holland; Curry, Cynthia J.; Shafer, Susan; Shaffer, Lisa G.; de Jong, Pieter J.; Wilson, Richard K.; Eichler, Evan E.

    2012-01-01

    SUMMARY Gene duplication is an important source of phenotypic change and adaptive evolution. We use a novel genomic approach to identify highly identical sequence missing from the reference genome, confirming the cortical development gene Slit-Robo Rho GTPase activating protein 2 (SRGAP2) duplicated three times in humans. We show that the promoter and first nine exons of SRGAP2 duplicated from 1q32.1 (SRGAP2A) to 1q21.1 (SRGAP2B) ~3.4 million years ago (mya). Two larger duplications later copied SRGAP2B to chromosome 1p12 (SRGAP2C) and to proximal 1q21.1 (SRGAP2D), ~2.4 and ~1 mya, respectively. Sequence and expression analysis shows SRGAP2C is the most likely duplicate to encode a functional protein and among the most fixed human-specific duplicate genes. Our data suggest a mechanism where incomplete duplication created a novel function —at birth, antagonizing parental SRGAP2 function 2–3 mya a time corresponding to the transition from Australopithecus to Homo and the beginning of neocortex expansion. PMID:22559943

  19. The MDM2 gene family.

    PubMed

    Mendoza, Michael; Mandani, Garni; Momand, Jamil

    2014-03-01

    MDM2 is an oncoprotein that blocks p53 tumor suppressor-mediated transcriptional transactivation, escorts p53 from the cell nucleus to the cytoplasm, and polyubiquitylates p53. Polyubiquitylated p53 is rapidly degraded in the cytoplasm by the 26S proteasome. MDM2 is abnormally upregulated in several types of cancers, especially those of mesenchymal origin. MDM4 is a homolog of MDM2 that also inhibits p53 by blocking p53-mediated transactivation. MDM4 is required for MDM2-mediated polyubiquitylated of p53 and is abnormally upregulated in several cancer types. MDM2 and MDM4 genes have been detected in all vertebrates to date and only a single gene homolog, named MDM, has been detected in some invertebrates. MDM2, MDM4, and MDM have similar gene structures, suggesting that MDM2 and MDM4 arose through a duplication event more than 440 million years ago. All members of this small MDM2 gene family contain a single really interesting new gene (RING) domain (with the possible exception of lancelet MDM) which places them in the RING-domain superfamily. Similar to MDM2, the vast majority of proteins with RING domains are E3 ubiquitin ligases. Other RING domain E3 ubiquitin ligases that target p53 are COP1, Pirh2, and MSL2. In this report, we present evidence that COP1, Pirh2, and MSL2 evolved independently of MDM2 and MDM4. We also show, through structure homology models of invertebrate MDM RING domains, that MDM2 is more evolutionarily conserved than MDM4. PMID:25372739

  20. A novel KCNQ4 mutation and a private IMMP2L-DOCK4 duplication segregating with nonsyndromic hearing loss in a Brazilian family

    PubMed Central

    Uehara, Daniela T; Freitas, Érika L; Alves, Leandro U; Mazzeu, Juliana F; Auricchio, Maria TBM; Tabith, Alfredo; Monteiro, Mário LR; Rosenberg, Carla; Mingroni-Netto, Regina C

    2015-01-01

    Here we describe a novel missense variant in the KCNQ4 gene and a private duplication at 7q31.1 partially involving two genes (IMMP2L and DOCK4). Both mutations segregated with nonsyndromic hearing loss in a family with three affected individuals. Initially, we identified the duplication in a screening of 132 unrelated cases of hearing loss with a multiplex ligation-dependent probe amplification panel of genes that are candidates to have a role in hearing, including IMMP2L. Mapping of the duplication by array-CGH revealed that the duplication also encompassed the 3′-end of DOCK4. Subsequently, whole-exome sequencing identified the breakpoint of the rearrangement, thereby confirming the existence of a fusion IMMP2L-DOCK4 gene. Transcription products of the fusion gene were identified, indicating that they escaped nonsense-mediated messenger RNA decay. A missense substitution (c.701A>T) in KCNQ4 (a gene at the DFNA2A locus) was also identified by whole-exome sequencing. Because the substitution is predicted to be probably damaging and KCNQ4 has been implicated in hearing loss, this mutation might explain the deafness in the affected individuals, although a hypothetical effect of the product of the fusion gene on hearing cannot be completely ruled out. PMID:27081546

  1. Detecting Functional Divergence after Gene Duplication through Evolutionary Changes in Posttranslational Regulatory Sequences

    PubMed Central

    Nguyen Ba, Alex N.; Strome, Bob; Hua, Jun Jie; Desmond, Jonathan; Gagnon-Arsenault, Isabelle; Weiss, Eric L.; Landry, Christian R.; Moses, Alan M.

    2014-01-01

    Gene duplication is an important evolutionary mechanism that can result in functional divergence in paralogs due to neo-functionalization or sub-functionalization. Consistent with functional divergence after gene duplication, recent studies have shown accelerated evolution in retained paralogs. However, little is known in general about the impact of this accelerated evolution on the molecular functions of retained paralogs. For example, do new functions typically involve changes in enzymatic activities, or changes in protein regulation? Here we study the evolution of posttranslational regulation by examining the evolution of important regulatory sequences (short linear motifs) in retained duplicates created by the whole-genome duplication in budding yeast. To do so, we identified short linear motifs whose evolutionary constraint has relaxed after gene duplication with a likelihood-ratio test that can account for heterogeneity in the evolutionary process by using a non-central chi-squared null distribution. We find that short linear motifs are more likely to show changes in evolutionary constraints in retained duplicates compared to single-copy genes. We examine changes in constraints on known regulatory sequences and show that for the Rck1/Rck2, Fkh1/Fkh2, Ace2/Swi5 paralogs, they are associated with previously characterized differences in posttranslational regulation. Finally, we experimentally confirm our prediction that for the Ace2/Swi5 paralogs, Cbk1 regulated localization was lost along the lineage leading to SWI5 after gene duplication. Our analysis suggests that changes in posttranslational regulation mediated by short regulatory motifs systematically contribute to functional divergence after gene duplication. PMID:25474245

  2. Contribution of nonohnologous duplicated genes to high habitat variability in mammals.

    PubMed

    Tamate, Satoshi C; Kawata, Masakado; Makino, Takashi

    2014-07-01

    The mechanism by which genetic systems affect environmental adaptation is a focus of considerable attention in the fields of ecology, evolution, and conservation. However, the genomic characteristics that constrain adaptive evolution have remained unknown. A recent study showed that the proportion of duplicated genes in whole Drosophila genomes correlated with environmental variability within habitat, but it remains unclear whether the correlation is observed even in vertebrates whose genomes including a large number of duplicated genes generated by whole-genome duplication (WGD). Here, we focus on fully sequenced mammalian genomes that experienced WGD in early vertebrate lineages and show that the proportion of small-scale duplication (SSD) genes in the genome, but not that of WGD genes, is significantly correlated with habitat variability. Moreover, species with low habitat variability have a higher proportion of lost duplicated genes, particularly SSD genes, than those with high habitat variability. These results indicate that species that inhabit variable environments may maintain more SSD genes in their genomes and suggest that SSD genes are important for adapting to novel environments and surviving environmental changes. These insights may be applied to predicting invasive and endangered species. PMID:24714078

  3. Evidence for gene duplication in the voltage-gated sodium channel gene of Aedes aegypti

    PubMed Central

    Martins, Ademir Jesus; Brito, Luiz Paulo; Linss, Jutta Gerlinde Birggitt; Rivas, Gustavo Bueno da Silva; Machado, Ricardo; Bruno, Rafaela Vieira; Lima, José Bento Pereira; Valle, Denise; Peixoto, Alexandre Afranio

    2013-01-01

    Background and objectives: Mutations in the voltage-gated sodium channel gene (NaV), known as kdr mutations, are associated with pyrethroid and DDT insecticide resistance in a number of species. In the mosquito dengue vector Aedes aegypti, besides kdr, other polymorphisms allowed grouping AaNaV sequences as type ‘A’ or ‘B’. Here, we point a series of evidences that these polymorphisms are actually involved in a gene duplication event. Methodology: Four series of methods were employed: (i) genotypying, with allele-specific PCR (AS-PCR), of two AaNaV sites that can harbor kdr mutations (Ile1011Met and Val1016Ile), (ii) cloning and sequencing of part of the AaNaV gene, (iii) crosses with specific lineages and analysis of the offspring genotypes and (iv) copy number variation assays, with TaqMan quantitative real-time PCR. Results: kdr mutations in 1011 and 1016 sites were present only in type ‘A’ sequences, but never in the same haplotype. In addition, although the 1011Met-mutant allele is widely disseminated, no homozygous (1011Met/Met) was detected. Sequencing revealed three distinct haplotypes in some individuals, raising the hypothesis of gene duplication, which was supported by the genotype frequencies in the offspring of specific crosses. Furthermore, it was estimated that a laboratory strain selected for insecticide resistance had 5-fold more copies of the sodium channel gene compared with a susceptible reference strain. Conclusions and implications: The AaNaV duplication here found might be a recent adaptive response to the intense use of insecticides, maintaining together wild-type and mutant alleles in the same organism, conferring resistance and reducing some of its deleterious effects. PMID:24481195

  4. Evolution of CONSTANS Regulation and Function after Gene Duplication Produced a Photoperiodic Flowering Switch in the Brassicaceae.

    PubMed

    Simon, Samson; Rühl, Mark; de Montaigu, Amaury; Wötzel, Stefan; Coupland, George

    2015-09-01

    Environmental control of flowering allows plant reproduction to occur under optimal conditions and facilitates adaptation to different locations. At high latitude, flowering of many plants is controlled by seasonal changes in day length. The photoperiodic flowering pathway confers this response in the Brassicaceae, which colonized temperate latitudes after divergence from the Cleomaceae, their subtropical sister family. The CONSTANS (CO) transcription factor of Arabidopsis thaliana, a member of the Brassicaceae, is central to the photoperiodic flowering response and shows characteristic patterns of transcription required for day-length sensing. CO is believed to be widely conserved among flowering plants; however, we show that it arose after gene duplication at the root of the Brassicaceae followed by divergence of transcriptional regulation and protein function. CO has two close homologs, CONSTANS-LIKE1 (COL1) and COL2, which are related to CO by tandem duplication and whole-genome duplication, respectively. The single CO homolog present in the Cleomaceae shows transcriptional and functional features similar to those of COL1 and COL2, suggesting that these were ancestral. We detect cis-regulatory and codon changes characteristic of CO and use transgenic assays to demonstrate their significance in the day-length-dependent activation of the CO target gene FLOWERING LOCUS T. Thus, the function of CO as a potent photoperiodic flowering switch evolved in the Brassicaceae after gene duplication. The origin of CO may have contributed to the range expansion of the Brassicaceae and suggests that in other families CO genes involved in photoperiodic flowering arose by convergent evolution. PMID:25972346

  5. Evolution of CONSTANS Regulation and Function after Gene Duplication Produced a Photoperiodic Flowering Switch in the Brassicaceae

    PubMed Central

    Simon, Samson; Rühl, Mark; de Montaigu, Amaury; Wötzel, Stefan; Coupland, George

    2015-01-01

    Environmental control of flowering allows plant reproduction to occur under optimal conditions and facilitates adaptation to different locations. At high latitude, flowering of many plants is controlled by seasonal changes in day length. The photoperiodic flowering pathway confers this response in the Brassicaceae, which colonized temperate latitudes after divergence from the Cleomaceae, their subtropical sister family. The CONSTANS (CO) transcription factor of Arabidopsis thaliana, a member of the Brassicaceae, is central to the photoperiodic flowering response and shows characteristic patterns of transcription required for day-length sensing. CO is believed to be widely conserved among flowering plants; however, we show that it arose after gene duplication at the root of the Brassicaceae followed by divergence of transcriptional regulation and protein function. CO has two close homologs, CONSTANS-LIKE1 (COL1) and COL2, which are related to CO by tandem duplication and whole-genome duplication, respectively. The single CO homolog present in the Cleomaceae shows transcriptional and functional features similar to those of COL1 and COL2, suggesting that these were ancestral. We detect cis-regulatory and codon changes characteristic of CO and use transgenic assays to demonstrate their significance in the day-length-dependent activation of the CO target gene FLOWERING LOCUS T. Thus, the function of CO as a potent photoperiodic flowering switch evolved in the Brassicaceae after gene duplication. The origin of CO may have contributed to the range expansion of the Brassicaceae and suggests that in other families CO genes involved in photoperiodic flowering arose by convergent evolution. PMID:25972346

  6. Gene Duplication, Population Genomics, and Species-Level Differentiation within a Tropical Mountain Shrub

    PubMed Central

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.

    2014-01-01

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  7. Gene duplication, population genomics, and species-level differentiation within a tropical mountain shrub.

    PubMed

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C

    2014-10-01

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  8. Evolution dynamics of a model for gene duplication under adaptive conflict

    NASA Astrophysics Data System (ADS)

    Ancliff, Mark; Park, Jeong-Man

    2014-06-01

    We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.

  9. A Sensitive Method for Detecting Variation in Copy Numbers of Duplicated Genes

    PubMed Central

    Pielberg, Gerli; Day, Andy E.; Plastow, Graham S.; Andersson, Leif

    2003-01-01

    Gene duplications are common in the vertebrate genome, and duplicated loci often show a variation in copy number that may have important phenotypic effects. Here we describe a powerful method for quantification of duplicated copies based on pyrosequencing. A reliable quantification was obtained by amplification of the duplication break-point and a corresponding nonduplicated sequence in a competitive PCR assay. A comparison with an independent method for quantification based on the Invader technology revealed an excellent correlation between the two methods. The pyrosequencing-based method was evaluated by analyzing variation in copy number at the duplicated KIT/Dominant white locus in pigs. We were able to distinguish haplotypes at this locus by combining the duplication breakpoint test with a diagnostic test for a functionally important splice mutation in the duplicated gene. An extensive allelic variation, including the presence of a new allele carrying a single KIT copy expected to encode a truncated KIT receptor, was revealed when analyzing white pigs from commercial lines. PMID:12952884

  10. Evolutionary analyses of non-family genes in plants

    SciTech Connect

    Ye, Chuyu; Li, Ting; Yin, Hengfu; Weston, David; Tuskan, Gerald A; Tschaplinski, Timothy J; Yang, Xiaohan

    2013-03-01

    There are a large number of non-family (NF) genes that do not cluster into families with three or more members per genome. While gene families have been extensively studied, a systematic analysis of NF genes has not been reported. We performed comparative studies on NF genes in 14 plant species. Based on the clustering of protein sequences, we identified ~94,000 NF genes across these species that were divided into five evolutionary groups: Viridiplantae-wide, angiosperm-specific, monocot-specific, dicot-specific, and those that were species-specific. Our analysis revealed that the NF genes resulted largely from less frequent gene duplications and/or a higher rate of gene loss after segmental duplication relative to genes in both low-copy-number families (LF; 3 10 copies per genome) and high-copy-number families (HF; >10 copies). Furthermore, we identified functions enriched in the NF gene set as compared with the HF genes. We found that NF genes were involved in essential biological processes shared by all plant lineages (e.g., photosynthesis and translation), as well as gene regulation and stress responses associated with phylogenetic diversification. In particular, our analysis of an Arabidopsis protein-protein interaction network revealed that hub proteins with the top 10% most connections were over-represented in the NF set relative to the HF set. This research highlights the roles that NF genes may play in evolutionary and functional genomics research.

  11. Evolutionary analyses of non-family genes in plants

    SciTech Connect

    Ye, Chuyu; Li, Ting; Yin, Hengfu; Weston, David; Tuskan, Gerald A; Tschaplinski, Timothy J; Yang, Xiaohan

    2013-01-01

    There are a large number of non-family (NF) genes that do not cluster into families with three or more members per genome. While gene families have been extensively studied, a systematic analysis of NF genes has not been reported. We performed comparative studies on NF genes in 14 plant species. Based on the clustering of protein sequences, we identified ~94 000 NF genes across these species that were divided into five evolutionary groups: Viridiplantae wide, angiosperm specific, monocot specific, dicot specific, and those that were species specific. Our analysis revealed that the NF genes resulted largely from less frequent gene duplications and/or a higher rate of gene loss after segmental duplication relative to genes in both lowcopy- number families (LF; 3 10 copies per genome) and high-copy-number families (HF; >10 copies). Furthermore, we identified functions enriched in the NF gene set as compared with the HF genes. We found that NF genes were involved in essential biological processes shared by all plant lineages (e.g. photosynthesis and translation), as well as gene regulation and stress responses associated with phylogenetic diversification. In particular, our analysis of an Arabidopsis protein protein interaction network revealed that hub proteins with the top 10% most connections were over-represented in the NF set relative to the HF set. This research highlights the roles that NF genes may play in evolutionary and functional genomics research.

  12. Molecular evolution of the duplicated TFIIAγ genes in Oryzeae and its relatives

    PubMed Central

    2010-01-01

    Background Gene duplication provides raw genetic materials for evolutionary novelty and adaptation. The evolutionary fate of duplicated transcription factor genes is less studied although transcription factor gene plays important roles in many biological processes. TFIIAγ is a small subunit of TFIIA that is one of general transcription factors required by RNA polymerase II. Previous studies identified two TFIIAγ-like genes in rice genome and found that these genes either conferred resistance to rice bacterial blight or could be induced by pathogen invasion, raising the question as to their functional divergence and evolutionary fates after gene duplication. Results We reconstructed the evolutionary history of the TFIIAγ genes from main lineages of angiosperms and demonstrated that two TFIIAγ genes (TFIIAγ1 and TFIIAγ5) arose from a whole genome duplication that happened in the common ancestor of grasses. Likelihood-based analyses with branch, codon, and branch-site models showed no evidence of positive selection but a signature of relaxed selective constraint after the TFIIAγ duplication. In particular, we found that the nonsynonymous/synonymous rate ratio (ω = dN/dS) of the TFIIAγ1 sequences was two times higher than that of TFIIAγ5 sequences, indicating highly asymmetric rates of protein evolution in rice tribe and its relatives, with an accelerated rate of TFIIAγ1 gene. Our expression data and EST database search further indicated that after whole genome duplication, the expression of TFIIAγ1 gene was significantly reduced while TFIIAγ5 remained constitutively expressed and maintained the ancestral role as a subunit of the TFIIA complex. Conclusion The evolutionary fate of TFIIAγ duplicates is not consistent with the neofunctionalization model that predicts that one of the duplicated genes acquires a new function because of positive Darwinian selection. Instead, we suggest that subfunctionalization might be involved in TFIIAγ evolution in grasses

  13. Mosaic gene conversion after a tandem duplication of mtDNA sequence in Diomedeidae (albatrosses).

    PubMed

    Eda, Masaki; Kuro-o, Masaki; Higuchi, Hiroyoshi; Hasegawa, Hiroshi; Koike, Hiroko

    2010-04-01

    Although the tandem duplication of mitochondrial (mt) sequences, especially those of the control region (CR), has been detected in metazoan species, few studies have focused on the features of the duplicated sequence itself, such as the gene conversion rate, distribution patterns of the variation, and relative rates of evolution between the copies. To investigate the features of duplicated mt sequences, we partially sequenced the mt genome of 16 Phoebastria albatrosses belonging to three species (P. albatrus, P. nigripes, and P. immutabilis). More than 2,300 base pairs of tandemly-duplicated sequence were shared by all three species. The observed gene arrangement was shared in the three Phoebastria albatrosses and suggests that the duplication event occurred in the common ancestor of the three species. Most of the copies in each individual were identical or nearly identical, and were maintained through frequent gene conversions. By contrast, portions of CR domains I and III had different phylogenetic signals, suggesting that gene conversion had not occurred in those sections after the speciation of the three species. Several lines of data, including the heterogeneity of the rate of molecular evolution, nucleotide differences, and putative secondary structures, suggests that the two sequences in CR domain I are maintained through selection; however, additional studies into the mechanisms of gene conversion and mtDNA synthesis are required to confirm this hypothesis. PMID:20558899

  14. Heterogeneous expression pattern of tandem duplicated sHsps genes during fruit ripening in two tomato species

    NASA Astrophysics Data System (ADS)

    Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.

    2016-04-01

    The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.

  15. Genomic analysis reveals extensive gene duplication within the bovine TRB locus

    PubMed Central

    Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan

    2009-01-01

    Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes

  16. Familial 1.3-Mb 11p15.5p15.4 Duplication in Three Generations Causing Silver-Russell and Beckwith-Wiedemann Syndromes

    PubMed Central

    Vals, Mari-Anne; Kahre, Tiina; Mee, Pille; Muru, Kai; Kallas, Eha; Žilina, Olga; Tillmann, Vallo; Õunap, Katrin

    2015-01-01

    Silver-Russell syndrome (SRS) and Beckwith-Wiedemann syndrome (BWS) are 2 opposite growth-affecting disorders. The common molecular cause for both syndromes is an abnormal regulation of genes in chromosomal region 11p15, where 2 imprinting control regions (ICR) control fetal and postnatal growth. Also, many submicroscopic chromosomal disturbances like duplications in 11p15 have been described among SRS and BWS patients. Duplications involving both ICRs cause SRS or BWS, depending on which parent the aberration is inherited from. We describe to our knowledge the smallest familial pure 1.3-Mb duplication in chromosomal region 11p15.5p15.4 that involves both ICRs and is present in 3 generations causing an SRS or BWS phenotype. PMID:26732610

  17. Familial 1.3-Mb 11p15.5p15.4 Duplication in Three Generations Causing Silver-Russell and Beckwith-Wiedemann Syndromes.

    PubMed

    Vals, Mari-Anne; Kahre, Tiina; Mee, Pille; Muru, Kai; Kallas, Eha; Žilina, Olga; Tillmann, Vallo; Õunap, Katrin

    2015-09-01

    Silver-Russell syndrome (SRS) and Beckwith-Wiedemann syndrome (BWS) are 2 opposite growth-affecting disorders. The common molecular cause for both syndromes is an abnormal regulation of genes in chromosomal region 11p15, where 2 imprinting control regions (ICR) control fetal and postnatal growth. Also, many submicroscopic chromosomal disturbances like duplications in 11p15 have been described among SRS and BWS patients. Duplications involving both ICRs cause SRS or BWS, depending on which parent the aberration is inherited from. We describe to our knowledge the smallest familial pure 1.3-Mb duplication in chromosomal region 11p15.5p15.4 that involves both ICRs and is present in 3 generations causing an SRS or BWS phenotype. PMID:26732610

  18. Introns regulate the production of ribosomal proteins by modulating splicing of duplicated ribosomal protein genes.

    PubMed

    Petibon, Cyrielle; Parenteau, Julie; Catala, Mathieu; Elela, Sherif Abou

    2016-05-01

    Most budding yeast introns exist in the many duplicated ribosomal protein genes (RPGs) and it has been posited that they remain there to modulate the expression of RPGs and cell growth in response to stress. However, the mechanism by which introns regulate the expression of RPGs and their impact on the synthesis of ribosomal proteins remain unclear. In this study, we show that introns determine the ratio of ribosomal protein isoforms through asymmetric paralog-specific regulation of splicing. Exchanging the introns and 3' untranslated regions of the duplicated RPS9 genes altered the splicing efficiency and changed the ratio of the ribosomal protein isoforms. Mutational analysis of the RPS9 genes indicated that splicing is regulated by variations in the intron structure and the 3' untranslated region. Together these data suggest that preferential splicing of duplicated RPGs provides a means for adjusting the ratio of different ribosomal protein isoforms, while maintaining the overall expression level of each ribosomal protein. PMID:26945043

  19. Opossum carboxylesterases: sequences, phylogeny and evidence for CES gene duplication events predating the marsupial-eutherian common ancestor

    PubMed Central

    2008-01-01

    Background Carboxylesterases (CES) perform diverse metabolic roles in mammalian organisms in the detoxification of a broad range of drugs and xenobiotics and may also serve in specific roles in lipid, cholesterol, pheromone and lung surfactant metabolism. Five CES families have been reported in mammals with human CES1 and CES2 the most extensively studied. Here we describe the genetics, expression and phylogeny of CES isozymes in the opossum and report on the sequences and locations of CES1, CES2 and CES6 'like' genes within two gene clusters on chromosome one. We also discuss the likely sequence of gene duplication events generating multiple CES genes during vertebrate evolution. Results We report a cDNA sequence for an opossum CES and present evidence for CES1 and CES2 like genes expressed in opossum liver and intestine and for distinct gene locations of five opossum CES genes,CES1, CES2.1, CES2.2, CES2.3 and CES6, on chromosome 1. Phylogenetic and sequence alignment studies compared the predicted amino acid sequences for opossum CES with those for human, mouse, chicken, frog, salmon and Drosophila CES gene products. Phylogenetic analyses produced congruent phylogenetic trees depicting a rapid early diversification into at least five distinct CES gene family clusters: CES2, CES1, CES7, CES3, and CES6. Molecular divergence estimates based on a Bayesian relaxed clock approach revealed an origin for the five mammalian CES gene families between 328–378 MYA. Conclusion The deduced amino acid sequence for an opossum cDNA was consistent with its identity as a mammalian CES2 gene product (designated CES2.1). Distinct gene locations for opossum CES1 (1: 446,222,550–446,274,850), three CES2 genes (1: 677,773,395–677,927,030) and a CES6 gene (1: 677,585,520–677,730,419) were observed on chromosome 1. Opossum CES1 and multiple CES2 genes were expressed in liver and intestine. Amino acid sequences for opossum CES1 and three CES2 gene products revealed conserved

  20. The evolution of the vertebrate Dlx gene family.

    PubMed Central

    Stock, D W; Ellies, D L; Zhao, Z; Ekker, M; Ruddle, F H; Weiss, K M

    1996-01-01

    The vertebrate Dlx gene family consists of homeobox-containing transcription factors distributed in pairs on the same chromosomes as the Hox genes. To investigate the evolutionary history of Dlx genes, we have cloned five new zebrafish family members and have provided additional sequence information for two mouse genes. Phylogenetic analyses of Dlx gene sequences considered in the context of their chromosomal arrangements suggest that an initial tandem duplication produced a linked pair of Dlx genes after the divergence of chordates and arthropods but prior to the divergence of tunicates and vertebrates. This pair of Dlx genes was then duplicated in the chromosomal events that led to the four clusters of Hox genes characteristic of bony fish and tetrapods. It is possible that a pair of Dlx genes linked to the Hoxc cluster has been lost from mammals. We were unable to distinguish between independent duplication and retention of the ancestral state of bony vertebrates to explain the presence of a greater number of Dlx genes in zebrafish than mammals. Determination of the linkage relationship of these additional zebrafish Dlx genes to Hox clusters should help resolve this issue. PMID:8855272

  1. A gene duplication affecting expression of the ovine ASIP gene is responsible for white and black sheep

    PubMed Central

    Norris, Belinda J.; Whan, Vicki A.

    2008-01-01

    Agouti signaling protein (ASIP) functions to regulate pigmentation in mice, while its role in many other animals and in humans has not been fully determined. In this study, we identify a 190-kb tandem duplication encompassing the ovine ASIP and AHCY coding regions and the ITCH promoter region as the genetic cause of white coat color of dominant white/tan (AWt) agouti sheep. The duplication 5′ breakpoint is located upstream of the ASIP coding sequence. Ubiquitous expression of a second copy of the ASIP coding sequence regulated by a duplicated copy of the nearby ITCH promoter causes the white sheep phenotype. A single copy ASIP gene with a silenced ASIP promoter occurs in recessive black sheep. In contrast, a single copy functional wild-type (A+) ASIP is responsible for the ancient Barbary sheep coat color phenotype. The gene duplication was facilitated by homologous recombination between two non-LTR SINE sequences flanking the duplicated segment. This is the first sheep trait attributable to gene duplication and shows nonallelic homologous recombination and gene conversion events at the ovine ASIP locus could have an important role in the evolution of sheep pigmentation. PMID:18493018

  2. Gene duplication as a mechanism of genomic adaptation to a changing environment

    PubMed Central

    Kondrashov, Fyodor A.

    2012-01-01

    A subject of extensive study in evolutionary theory has been the issue of how neutral, redundant copies can be maintained in the genome for long periods of time. Concurrently, examples of adaptive gene duplications to various environmental conditions in different species have been described. At this point, it is too early to tell whether or not a substantial fraction of gene copies have initially achieved fixation by positive selection for increased dosage. Nevertheless, enough examples have accumulated in the literature that such a possibility should be considered. Here, I review the recent examples of adaptive gene duplications and make an attempt to draw generalizations on what types of genes may be particularly prone to be selected for under certain environmental conditions. The identification of copy-number variation in ecological field studies of species adapting to stressful or novel environmental conditions may improve our understanding of gene duplications as a mechanism of adaptation and its relevance to the long-term persistence of gene duplications. PMID:22977152

  3. Multispecies Analysis of Expression Pattern Diversification in the Recently Expanded Insect Ly6 Gene Family

    PubMed Central

    Tanaka, Kohtaro; Hazbun, Alexis; Hijazi, Assia; Vreede, Barbara; Sucena, Élio

    2015-01-01

    Gene families often consist of members with diverse expression domains reflecting their functions in a wide variety of tissues. However, how the expression of individual members, and thus their tissue-specific functions, diversified during the course of gene family expansion is not well understood. In this study, we approached this question through the analysis of the duplication history and transcriptional evolution of a rapidly expanding subfamily of insect Ly6 genes. We analyzed different insect genomes and identified seven Ly6 genes that have originated from a single ancestor through sequential duplication within the higher Diptera. We then determined how the original embryonic expression pattern of the founding gene diversified by characterizing its tissue-specific expression in the beetle Tribolium castaneum, the butterfly Bicyclus anynana, and the mosquito Anopheles stephensi and those of its duplicates in three higher dipteran species, representing various stages of the duplication history (Megaselia abdita, Ceratitis capitata, and Drosophila melanogaster). Our results revealed that frequent neofunctionalization episodes contributed to the increased expression breadth of this subfamily and that these events occurred after duplication and speciation events at comparable frequencies. In addition, at each duplication node, we consistently found asymmetric expression divergence. One paralog inherited most of the tissue-specificities of the founder gene, whereas the other paralog evolved drastically reduced expression domains. Our approach attests to the power of combining a well-established duplication history with a comprehensive coverage of representative species in acquiring unequivocal information about the dynamics of gene expression evolution in gene families. PMID:25743545

  4. Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species

    PubMed Central

    Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; Kelly, Steven

    2016-01-01

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. Key words: C4 photosynthesis, gene duplication, gene families, parallel evolution. PMID:27016024

  5. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies.

    PubMed

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D

    2016-01-01

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production. PMID:27553646

  6. Dose-sensitivity, conserved non-coding sequences, and duplicate gene retention through multiple tetraploidies in the grasses.

    PubMed

    Schnable, James C; Pedersen, Brent S; Subramaniam, Sabarinath; Freeling, Michael

    2011-01-01

    Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525

  7. Dose–Sensitivity, Conserved Non-Coding Sequences, and Duplicate Gene Retention Through Multiple Tetraploidies in the Grasses

    PubMed Central

    Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael

    2011-01-01

    Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525

  8. Inferring the Recent Duplication History of a Gene Cluster

    NASA Astrophysics Data System (ADS)

    Song, Giltae; Zhang, Louxin; Vinař, Tomáš; Miller, Webb

    Much important evolutionary activity occurs in gene clusters, where a copy of a gene may be free to evolve new functions. Computational methods to extract evolutionary information from sequence data for such clusters are currently imperfect, in part because accurate sequence data are often lacking in these genomic regions, making the existing methods difficult to apply. We describe a new method for reconstructing the recent evolutionary history of gene clusters. The method’s performance is evaluated on simulated data and on actual human gene clusters.

  9. The DMRT gene family in amphioxus.

    PubMed

    Wang, Fei; Yu, Yang; Ji, Dongrui; Li, Hongyan

    2012-01-01

    Doublesex and Mab-3-related transcription factor (DMRT) gene family is widely known for its involvement in sex determination and/or differentiation among different phyla. In this study, we identify eight DMRT genes in the cephalochordate amphioxus, a protochordate holding a key phylogenetic position. The eight DMRTs can be divided into two groups based on the conserved domain: BfDM044, BfDM045, BfDM55.1, BfDM115.1, and BfDM17.1 belong to the first group which have both DM and DMA domains, while BfDM246.1, BfDM084, and BfDM175 belong to the second group which have only DM domain. Most of the first group members have same genomic structure except BfDM17.1, while no regular pattern exists in the second group. Phylogenetic analysis of the DM domain sequences shows that DMRT genes in vertebrates form seven different independent clusters, and some even contain genes from invertebrates with high bootstrap. Notably, the first group members of amphioxus cluster with vertebrate DMRTs; while the second group members cluster into a single branch, which diverge from the vertebrate classes. The results suggest that several DMRT genes in vertebrates may evolve from homologous genes in invertebrates. As in nematode, drosophila, fish, and vertebrates, DMRT genes cluster is also found in amphioxus, which may be the result of gene duplication. Interspecific differences in the amphioxus DMRTs and sea squirt DMRTs may suggest post-speciation duplication of some DMRT genes. PMID:22702730

  10. Becker Muscular Dystrophy (BMD) caused by duplication of exons 3-6 of the dystrophin gene presenting as dilated cardiomyopathy

    SciTech Connect

    Tsai, A.C.; Allingham-Hawkins, D.J.; Becker, L.

    1994-09-01

    X-linked dilated cardiomyopathy (XLCM) is a progressive myocardial disease presenting with congestive heart failure in teenage males without clinical signs of skeletal myopathy. Tight linkage of XLCM to the DMD locus has been demonstrated; it has been suggested that, at least in some families, XLCM is a {open_quotes}dystrophinopathy.{close_quotes} We report a 14-year-old boy who presented with acute heart failure due to dilated cardiomyopathy. He had no history of muscle weakness, but physical examination revealed pseudohypertrophy of the calf muscles. He subsequently received a heart transplantation. Family history was negative. Serum CK level at the time of diagnosis was 10,416. Myocardial biopsy showed no evidence of carditis. Dystrophin staining of cardiac and skeletal muscle with anti-sera to COOH and NH{sub 2}termini showed a patchy distribution of positivity suggestive of Becker muscular dystrophy. Analysis of 18 of the 79 dystrophin exons detected a duplication that included exons 3-6. The proband`s mother has an elevated serum CK and was confirmed to be a carrier of the same duplication. A mutation in the muscle promotor region of the dystrophin gene has been implicated in the etiology of SLCM. However, Towbin et al. (1991) argued that other 5{prime} mutations in the dystrophin gene could cause selective cardiomyopathy. The findings in our patient support the latter hypothesis. This suggests that there are multiple regions in the dystrophin gene which, when disrupted, can cause isolated dilated cardiomyopathy.

  11. The Exon 13 Duplication in the BRCA1 Gene Is a Founder Mutation Present in Geographically Diverse Populations

    PubMed Central

    2000-01-01

    Recently, a 6-kb duplication of exon 13, which creates a frameshift in the coding sequence of the BRCA1 gene, has been described in three unrelated U.S. families of European ancestry and in one Portuguese family. Here, our goal was to estimate the frequency and geographic diversity of carriers of this duplication. To do this, a collaborative screening study was set up that involved 39 institutions from 19 countries and included 3,580 unrelated individuals with a family history of the disease and 934 early-onset breast and/or ovarian cancer cases. A total of 11 additional families carrying this mutation were identified in Australia (1), Belgium (1), Canada (1), Great Britain (6), and the United States (2). Haplotyping showed that they are likely to derive from a common ancestor, possibly of northern British origin. Our results demonstrate that it is strongly advisable, for laboratories carrying out screening either in English-speaking countries or in countries with historical links with Britain, to include within their BRCA1 screening protocols the polymerase chain reaction–based assay described in this report. PMID:10827109

  12. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies

    PubMed Central

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D.

    2016-01-01

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella. We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes—and that the butterfly proboscis is involved in digestive enzyme production. PMID:27553646

  13. Molecular diagnosis of PMP22 gene duplications and deletions: comparison of different methods.

    PubMed

    Stangler Herodez, Spela; Zagradisnik, B; Erjavec Skerget, A; Zagorac, A; Kokalj Vokac, N

    2009-01-01

    Several techniques can be used to diagnose Charcot-Marie-Tooth disease type 1A (CMT1A) and hereditary neuro pathy with liability to pressure palsies (HNPP), but no technique combines simplicity with high sensitivity. Multiplex ligation-dependent probe amplification (MLPA) was applied to develop an efficient and sensitive test for the detection of duplication/deletion of the peripheral myelin protein 22 (PMP22) gene. The study sample included 70 probands that had each been previously analysed by fluorescence in situ hibridization (FISH) and the restriction fragment length polymorphism-polymerase chain reaction (RFLP-PCR) assay, both of which detect a unique recombination fragment uniquely present in most patients with the duplication. A total of nine duplications and 19 deletions were detected in the 70 probands using MLPA, and there was 100% concordance between MPLA and FISH. A single duplication was missed by the RFLP-PCR assay, which accords with the lower sensitivity of this method. It is concluded that the MLPA allows accurate detection of PMP22 gene duplications/deletions and could be used for the molecular diagnosis of these two neuropathies. PMID:19930872

  14. A possible role of DNA methylation in functional divergence of a fast evolving duplicate gene encoding odorant binding protein 11 in the honeybee.

    PubMed

    Kucharski, R; Maleszka, J; Maleszka, R

    2016-06-29

    Although gene duplication is seen as the main path to evolution of new functions, molecular mechanisms by which selection favours the gain versus loss of newly duplicated genes and minimizes the fixation of pseudo-genes are not well understood. Here, we investigate in detail a duplicate honeybee gene obp11 belonging to a fast evolving insect gene family encoding odorant binding proteins (OBPs). We report that obp11 is expressed only in female bees in rare antennal sensilla basiconica in contrast to its tandem partner obp10 that is expressed in the brain in both females and males (drones). Unlike all other obp genes in the honeybee, obp11 is methylated suggesting that functional diversification of obp11 and obp10 may have been driven by an epigenetic mechanism. We also show that increased methylation in drones near one donor splice site that correlates with higher abundance of a transcript variant encoding a truncated OBP11 protein is one way of controlling its contrasting expression. Our data suggest that like in mammals and plants, DNA methylation in insects may contribute to functional diversification of proteins produced from duplicated genes, in particular to their subfunctionalization by generating complementary patterns of expression. PMID:27358363

  15. Whole-Gene Positive Selection, Elevated Synonymous Substitution Rates, Duplication, and Indel Evolution of the Chloroplast clpP1 Gene

    PubMed Central

    Erixon, Per; Oxelman, Bengt

    2008-01-01

    Background Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. Methodology/Principle Findings We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family) and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family). Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying) selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. Conclusions/Significance We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the controversial issue

  16. The polycystic kidney disease 1 gene lies in a duplicated genomic region

    SciTech Connect

    Ward, C.J.; Hughes, J.; Peral, B. |

    1994-09-01

    The polycystic kidney disease 1 (PKD1) gene is situated in chromosomal band 16p13.3 and encodes a 14 kb transcript. The 5{prime} region of the PKD1 gene is located within a 40-50 kb stretch of genomic DNA which is duplicated several times in the more proximal region, 16p13.1. This proximal area gives rise to at least three transcripts designated homologous gene A (HG-A; 21 kb), HG-B (17 kb) and HG-C (8.5 kb). These three transcripts share substantial homology with each other and the PKD1 transcript. However, the 3{prime} 3.8 kb section of the PKD1 transcript is unique because it is encoded by a region of the gene that lies outside the duplicated area. The presence of the duplicate transcripts in all tissues analyzed has hampered attempts to clone and sequence the bone fide PKD1 gene. Comparison of cDNAs known to arise from the PKD1 transcript to those from the HG transcripts reveals that divergence of 2-3% has occurred between these sequences. To overcome the problem of the duplication, a large 15 kb section of genomic DNA has been sequenced together with several large HG cDNAs. Utilizing a radiation hybrid which contains only the 16p13.3 region and expresses low levels of the PKD1 transcript, we are now attempting to clone the duplicated part of the PKD1 gene by exon linking.

  17. Pericentromeric effects shape the patterns of divergence, retention, and expression of duplicated genes in the Paleopolyploid Soybean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Sequence divergence and fractionation of duplicated genes following whole genome duplication (WGD) are important processes in the course of polyploid genome evolution. However, the evolutionary forces that govern the divergence and retention of WGD-derived genes are poorly understood. In this study,...

  18. Expansion of transducin subunit gene families in early vertebrate tetraploidizations.

    PubMed

    Lagman, David; Sundström, Görel; Ocampo Daza, Daniel; Abalo, Xesús M; Larhammar, Dan

    2012-10-01

    Hundreds of gene families expanded in the early vertebrate tetraploidizations including many gene families in the phototransduction cascade. We have investigated the evolution of the heterotrimeric G-proteins of photoreceptors, the transducins, in relation to these events using both phylogenetic analyses and synteny comparisons. Three alpha subunit genes were identified in amniotes and the coelacanth, GNAT1-3; two of these were identified in amphibians and teleost fish, GNAT1 and GNAT2. Most tetrapods have four beta genes, GNB1-4, and teleosts have additional duplicates. Finally, three gamma genes were identified in mammals, GNGT1, GNG11 and GNGT2. Of these, GNGT1 and GNGT2 were found in the other vertebrates. In frog and zebrafish additional duplicates of GNGT2 were identified. Our analyses show all three transducin families expanded during the early vertebrate tetraploidizations and the beta and gamma families gained additional copies in the teleost-specific genome duplication. This suggests that the tetraploidizations contributed to visual specialisations. PMID:22814267

  19. Ascorbate peroxidase-related (APx-R) is not a duplicable gene

    PubMed Central

    Dunand, Christophe; Mathé, Catherine; Lazzarotto, Fernanda; Margis, Rogério; Margis-Pinheiro, Marcia

    2011-01-01

    Phylogenetic, genomic and functional analyses have allowed the identification of a new class of putative heme peroxidases, so called APx-R (APx-Related). These new class, mainly present in the green lineage (including green algae and land plants), can also be detected in other unicellular chloroplastic organisms. Except for recent polyploid organisms, only single-copy of APx-R gene was detected in each genome, suggesting that the majority of the APx-R extra-copies were lost after chromosomal or segmental duplications. In a similar way, most APx-R co-expressed genes in Arabidopsis genome do not have conserved extra-copies after chromosomal duplications and are predicted to be localized in organelles, as are the APx-R. The member of this gene network can be considered as unique gene, well conserved through the evolution due to a strong negative selection pressure and a low evolution rate. PMID:22231200

  20. Subfunctionalization of Duplicated Zebrafish pax6 Genes by cis-Regulatory Divergence

    PubMed Central

    Gautier, Philippe; Dahm, Ralf; Schonthaler, Helia B; Damante, Giuseppe; Seawright, Anne; Hever, Ann M; Yeyati, Patricia L; van Heyningen, Veronica; Coutinho, Pedro

    2008-01-01

    Gene duplication is a major driver of evolutionary divergence. In most vertebrates a single PAX6 gene encodes a transcription factor required for eye, brain, olfactory system, and pancreas development. In zebrafish, following a postulated whole-genome duplication event in an ancestral teleost, duplicates pax6a and pax6b jointly fulfill these roles. Mapping of the homozygously viable eye mutant sunrise identified a homeodomain missense change in pax6b, leading to loss of target binding. The mild phenotype emphasizes role-sharing between the co-orthologues. Meticulous mapping of isolated BACs identified perturbed synteny relationships around the duplicates. This highlights the functional conservation of pax6 downstream (3′) control sequences, which in most vertebrates reside within the introns of a ubiquitously expressed neighbour gene, ELP4, whose pax6a-linked exons have been lost in zebrafish. Reporter transgenic studies in both mouse and zebrafish, combined with analysis of vertebrate sequence conservation, reveal loss and retention of specific cis-regulatory elements, correlating strongly with the diverged expression of co-orthologues, and providing clear evidence for evolution by subfunctionalization. PMID:18282108

  1. Subfunctionalization of duplicated zebrafish pax6 genes by cis-regulatory divergence.

    PubMed

    Kleinjan, Dirk A; Bancewicz, Ruth M; Gautier, Philippe; Dahm, Ralf; Schonthaler, Helia B; Damante, Giuseppe; Seawright, Anne; Hever, Ann M; Yeyati, Patricia L; van Heyningen, Veronica; Coutinho, Pedro

    2008-02-01

    Gene duplication is a major driver of evolutionary divergence. In most vertebrates a single PAX6 gene encodes a transcription factor required for eye, brain, olfactory system, and pancreas development. In zebrafish, following a postulated whole-genome duplication event in an ancestral teleost, duplicates pax6a and pax6b jointly fulfill these roles. Mapping of the homozygously viable eye mutant sunrise identified a homeodomain missense change in pax6b, leading to loss of target binding. The mild phenotype emphasizes role-sharing between the co-orthologues. Meticulous mapping of isolated BACs identified perturbed synteny relationships around the duplicates. This highlights the functional conservation of pax6 downstream (3') control sequences, which in most vertebrates reside within the introns of a ubiquitously expressed neighbour gene, ELP4, whose pax6a-linked exons have been lost in zebrafish. Reporter transgenic studies in both mouse and zebrafish, combined with analysis of vertebrate sequence conservation, reveal loss and retention of specific cis-regulatory elements, correlating strongly with the diverged expression of co-orthologues, and providing clear evidence for evolution by subfunctionalization. PMID:18282108

  2. Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database.

    PubMed

    Pasquier, Jeremy; Cabau, Cédric; Nguyen, Thaovi; Jouanno, Elodie; Severac, Dany; Braasch, Ingo; Journot, Laurent; Pontarotti, Pierre; Klopp, Christophe; Postlethwait, John H; Guiguen, Yann; Bobe, Julien

    2016-01-01

    With more than 30,000 species, ray-finned fish represent approximately half of vertebrates. The evolution of ray-finned fish was impacted by several whole genome duplication (WGD) events including a teleost-specific WGD event (TGD) that occurred at the root of the teleost lineage about 350 million years ago (Mya) and more recent WGD events in salmonids, carps, suckers and others. In plants and animals, WGD events are associated with adaptive radiations and evolutionary innovations. WGD-spurred innovation may be especially relevant in the case of teleost fish, which colonized a wide diversity of habitats on earth, including many extreme environments. Fish biodiversity, the use of fish models for human medicine and ecological studies, and the importance of fish in human nutrition, fuel an important need for the characterization of gene expression repertoires and corresponding evolutionary histories of ray-finned fish genes. To this aim, we performed transcriptome analyses and developed the PhyloFish database to provide (i) de novo assembled gene repertoires in 23 different ray-finned fish species including two holosteans (i.e. a group that diverged from teleosts before TGD) and 21 teleosts (including six salmonids), and (ii) gene expression levels in ten different tissues and organs (and embryos for many) in the same species. This resource was generated using a common deep RNA sequencing protocol to obtain the most exhaustive gene repertoire possible in each species that allows between-species comparisons to study the evolution of gene expression in different lineages. The PhyloFish database described here can be accessed and searched using RNAbrowse, a simple and efficient solution to give access to RNA-seq de novo assembled transcripts. PMID:27189481

  3. Haloperidol plasma concentration in Japanese psychiatric subjects with gene duplication of CYP2D6

    PubMed Central

    Ohnuma, Tohru; Shibata, Nobuto; Matsubara, Yoichiro; Arai, Heii

    2003-01-01

    Aims The cytochrome P-450 2D6 (CYP2D6) gene duplication/multiduplication producing an increase in enzyme activity, and the common Japanese mutation, CYP2D6*10A producing a decrease of enzyme activity were screened in a large number of Japanese psychiatric subjects (n = 111) in order to investigate whether these mutated alleles affected the plasma concentration of haloperidol. Methods Polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method was performed to identify the CYP2D6*10A and CYP2D6*2 genotypes in subjects who had been taking haloperidol. For the screening of duplicated active CYP2D6 gene, allele-specific long PCR was performed. Plasma concentration of haloperidol was measured by the enzyme immunoassay, and expressed as ‘plasma concentration dose ratio’ to normalize individual differences. Results The plasma concentration–dose ratio showed large interindividual differences of approximately 18-fold. PCR-RFLP methods revealed that 29 (26.1%), 10 (9.0%), 39 (35.1%), 0 (0%), seven (6.3%) and 26 (23.4%) cases possessed the CYP2D6 genotypes *1/*1, *1/*2, *1/*10A, *2/*2, *2/*10A and *10 A/*10A, respectively. Six cases (5.4%) had duplicated CYP2D6 genes. There were no significant differences of plasma concentration–dose ratio between the groups classified by CYP2D6*10A and *2 genotypes (Kruskal–Wallis test; P = 0.37), even in those cases whose daily doses were lower than 20 mg (n = 90, P = 0.91). Subjects having duplicated genes (n = 6) did not show significant differences of plasma concentration–dose ratio by comparison with subjects who had no duplicated genes (Mann–Whitney U-test; P = 0.80). Conclusions Gene duplication, and the common Japanese mutation CYP2D6*10A on CYP2D6 gene are not likely to be the main modulatory factors of plasma concentration of haloperidol in Japanese psychiatric subjects. PMID:12919180

  4. Regulatory circuit rewiring and functional divergence of the duplicate admp genes in dorsoventral axial patterning.

    PubMed

    Chang, Yi-Cheng; Pai, Chih-Yu; Chen, Yi-Chih; Ting, Hsiu-Chi; Martinez, Pedro; Telford, Maximilian J; Yu, Jr-Kai; Su, Yi-Hsien

    2016-02-01

    The spatially opposed expression of Antidorsalizing morphogenetic protein (Admp) and BMP signals controls dorsoventral (DV) polarity across Bilateria and hence represents an ancient regulatory circuit. Here, we show that in addition to the conserved admp1 that constitutes the ancient circuit, a second admp gene (admp2) is present in Ambulacraria (Echinodermata+Hemichordata) and two marine worms belonging to Xenoturbellida and Acoelomorpha. The phylogenetic distribution implies that the two admp genes were duplicated in the Bilaterian common ancestor and admp2 was subsequently lost in chordates and protostomes. We show that the ambulacrarian admp1 and admp2 are under opposite transcriptional control by BMP signals and knockdown of Admps in sea urchins impaired their DV polarity. Over-expression of either Admps reinforced BMP signaling but resulted in different phenotypes in the sea urchin embryo. Our study provides an excellent example of signaling circuit rewiring and protein functional changes after gene duplications. PMID:26719126

  5. Cheetahs have 4 serum amyloid a genes evolved through repeated duplication events.

    PubMed

    Chen, Lei; Une, Yumi; Higuchi, Keiichi; Mori, Masayuki

    2012-01-01

    Amyloid A (AA) amyloidosis is a leading cause of mortality in captive cheetahs (Acinonyx jubatus). We performed genome walking and PCR cloning and revealed that cheetahs have 4 SAA genes (provisionally named SAA1A, SAA1B, SAA3A, and SAA3B). In addition, we identified multiple nucleotide polymorphisms in the 4 SAA genes by screening 51 cheetahs. The polymorphisms defined 4, 7, 6, and 4 alleles for SAA1A, SAA3A, SAA1B, and SAA3B, respectively. Pedigree analysis of the inheritance of genotypes for the SAA genes revealed that specific combinations of alleles for the 4 SAA genes cosegregated as a unit (haplotype) in pedigrees, indicating that the 4 genes were linked on the same chromosome. Notably, cheetah SAA1A and SAA1B were highly homologous in their nucleotide sequences. Likewise, SAA3A and SAA3B genes were homologous. These observations suggested a model for the evolution of the 4 SAA genes in cheetahs in which duplication of an ancestral SAA gene first gave rise to SAA1 and SAA3. Subsequently, each gene duplicated one more time, uniquely making 4 genes in the cheetah genome. The monomorphism of the cheetah SAA1A protein might be one of the factors responsible for the high incidence of AA amyloidosis in this species. PMID:21987659

  6. Microevolution of Duplications and Deletions and Their Impact on Gene Expression in the Nematode Pristionchus pacificus

    PubMed Central

    2015-01-01

    The evolution of diversity across the animal kingdom has been accompanied by tremendous gene loss and gain. While comparative genomics has been fruitful to characterize differences in gene content across highly diverged species, little is known about the microevolution of structural variations that cause these differences in the first place. In order to investigate the genomic impact of structural variations, we made use of genomic and transcriptomic data from the nematode Pristionchus pacificus, which has been established as a satellite model to Caenorhabditis elegans for comparative biology. We exploit the fact that P. pacificus is a highly diverse species for which various genomic data including the draft genome of a sister species P. exspectatus is available. Based on resequencing coverage data for two natural isolates we identified large (> 2kb) deletions and duplications relative to the reference strain. By restriction to completely syntenic regions between P. pacificus and P. exspectatus, we were able to polarize the comparison and to assess the impact of structural variations on expression levels. We found that while loss of genes correlates with lack of expression, duplication of genes has virtually no effect on gene expression. Further investigating expression of individual copies at sites that segregate between the duplicates, we found in the majority of cases only one of the copies to be expressed. Nevertheless, we still find that certain gene classes are strongly depleted in deletions as well as duplications, suggesting evolutionary constraint acting on synteny. In summary, our results are consistent with a model, where most structural variations are either deleterious or neutral and provide first insights into the microevolution of structural variations in the P. pacificus genome. PMID:26125626

  7. Duplication 16p13.3 and the CREBBP gene: confirmation of the phenotype.

    PubMed

    Demeer, Bénédicte; Andrieux, Joris; Receveur, Aline; Morin, Gilles; Petit, Florence; Julia, Sophie; Plessis, Ghislaine; Martin-Coignard, Dominique; Delobel, Bruno; Firth, Helen V; Thuresson, Ann C; Lanco Dosen, Sandrine; Sjörs, Kerstin; Le Caignec, Cedric; Devriendt, Koenraad; Mathieu-Dramard, Michèle

    2013-01-01

    The introduction of molecular karyotyping technologies into the diagnostic work-up of patients with congenital disorders permitted the identification and delineation of novel microdeletion and microduplication syndromes. Interstitial 16p13.3 duplication, encompassing the CREBBP gene, which is mutated or deleted in the Rubinstein-Taybi syndrome, have been proposed to cause a recognisable syndrome with variable intellectual disability, normal growth, mild facial dysmorphism, mild anomalies of the extremities, and occasional findings such as developmental defects of the heart, genitalia, palate or the eyes. We here report the phenotypic and genotypic delineation of 9 patients carrying a submicroscopic 16p13.3 duplication, including the smallest 16p13.3 duplication reported so far. Careful clinical assessment confirms the distinctive clinical phenotype and also defines frequent associated features : marked speech problems, frequent ocular region involvement with upslanting of the eyes, narrow palpebral fissures, ptosis and strabismus, frequent proximal implantation of thumbs, cleft palate/bifid uvula and inguinal hernia. It also confirms that CREBBP is the critical gene involved in the duplication 16p13.3 syndrome. PMID:23063576

  8. Divergence of genes encoding non-specific lipid transfer proteins in the poaceae family.

    PubMed

    Jang, Cheol Seong; Jung, Jae Hyeong; Yim, Won Cheol; Lee, Byung-Moo; Seo, Yong Weon; Kim, Wook

    2007-10-31

    The genes encoding non-specific lipid transfer proteins (nsLTPs), members of a small multigene family, show a complex pattern of expressional regulation, suggesting that some diversification may have resulted from changes in their expression after duplication. In this study, the evolution of nsLTP genes within the Poaceae family was characterized via a survey of the pseudogenes and unigenes encoding the nsLTP in rice pseudomolecules and the NCBI unigene database. nsLTP-rich regions were detected in the distal portions of rice chromosomes 11 and 12; these may have resulted from the most recent large segmental duplication in the rice genome. Two independent tandem duplications were shown to occur within the nsLTP-rich regions of rice. The genomic distribution of the nsLTP genes in the rice genome differs from that in wheat. This may be attributed to gene migration, chromosomal rearrangement, and/or differential gene loss. The genomic distribution pattern of nsLTP genes in the Poaceae family points to the existence of some differences among cereal nsLTP genes, all of which diverged from an ancient gene. The unigenes encoding nsLTPs in each cereal species are clustered into five groups. The somewhat different distribution of nsLTP-encoding EST clones between the groups across cereal species imply that independent duplication(s) followed by subfunctionalization (and/or neofunctionalization) of the nsLTP gene family in each species occurred during speciation. PMID:17978574

  9. How segmental duplications shape our genome: recent evolution of ABCC6 and PKD1 Mendelian disease genes.

    PubMed

    Symmons, Orsolya; Váradi, András; Arányi, Tamás

    2008-12-01

    The completion of the Human Genome Project has brought the understanding that our genome contains an unexpectedly large proportion of segmental duplications. This poses the challenge of elucidating the consequences of recent duplications on physiology. We have conducted an in-depth study of a subset of segmental duplications on chromosome 16. We focused on PKD1 and ABCC6 duplications because mutations affecting these genes are responsible for the Mendelian disorders autosomal dominant polycystic kidney disease and pseudoxanthoma elasticum, respectively. We establish that duplications of PKD1 and ABCC6 are associated to low-copy repeat 16a and show that such duplications have occurred several times independently in different primate species. We demonstrate that partial duplication of PKD1 and ABCC6 has numerous consequences: the pseudogenes give rise to new transcripts and mediate gene conversion, which not only results in disease-causing mutations but also serves as a reservoir for sequence variation. The duplicated segments are also involved in submicroscopic and microscopic genomic rearrangements, contributing to structural variation in human and chromosomal break points in the gibbon. In conclusion, our data shed light on the recent and ongoing evolution of chromosome 16 mediated by segmental duplication and deepen our understanding of the history of two Mendelian disorder genes. PMID:18791038

  10. Root of the universal tree of life based on ancient aminoacyl-tRNA synthetase gene duplications.

    PubMed

    Brown, J R; Doolittle, W F

    1995-03-28

    Universal trees based on sequences of single gene homologs cannot be rooted. Iwabe et al. [Iwabe, N., Kuma, K.-I., Hasegawa, M., Osawa, S. & Miyata, T. (1989) Proc. Natl. Acad. Sci. USA 86, 9355-9359] circumvented this problem by using ancient gene duplications that predated the last common ancestor of all living things. Their separate, reciprocally rooted gene trees for elongation factors and ATPase subunits showed Bacteria (eubacteria) as branching first from the universal tree with Archaea (archaebacteria) and Eucarya (eukaryotes) as sister groups. Given its topical importance to evolutionary biology and concerns about the appropriateness of the ATPase data set, an evaluation of the universal tree root using other ancient gene duplications is essential. In this study, we derive a rooting for the universal tree using aminoacyl-tRNA synthetase genes, an extensive multigene family whose divergence likely preceded that of prokaryotes and eukaryotes. An approximately 1600-bp conserved region was sequenced from the isoleucyl-tRNA synthetases of several species representing deep evolutionary branches of eukaryotes (Nosema locustae), Bacteria (Aquifex pyrophilus and Thermotoga maritima) and Archaea (Pyrococcus furiosus and Sulfolobus acidocaldarius). In addition, a new valyl-tRNA synthetase was characterized from the protist Trichomonas vaginalis. Different phylogenetic methods were used to generate trees of isoleucyl-tRNA synthetases rooted by valyl- and leucyl-tRNA synthetases. All isoleucyl-tRNA synthetase trees showed Archaea and Eucarya as sister groups, providing strong confirmation for the universal tree rooting reported by Iwabe et al. As well, there was strong support for the monophyly (sensu Hennig) of Archaea. The valyl-tRNA synthetase gene from Tr. vaginalis clustered with other eukaryotic ValRS genes, which may have been transferred from the mitochondrial genome to the nuclear genome, suggesting that this amitochondrial trichomonad once harbored an

  11. Neuropeptide evolution: Chelicerate neurohormone and neuropeptide genes may reflect one or more whole genome duplications.

    PubMed

    Veenstra, Jan A

    2016-04-01

    Four genomes and two transcriptomes from six Chelicerate species were analyzed for the presence of neuropeptide and neurohormone precursors and their GPCRs. The genome from the spider Stegodyphus mimosarum yielded 87 neuropeptide precursors and 120 neuropeptide GPCRs. Many neuropeptide transcripts were also found in the transcriptomes of three other spiders, Latrodectus hesperus, Parasteatoda tepidariorum and Acanthoscurria geniculata. For the scorpion Mesobuthus martensii the numbers are 79 and 93 respectively. The very small genome of the house dust mite, Dermatophagoides farinae, on the other hand contains a much smaller number of such genes. A few new putative Arthropod neuropeptide genes were discovered. Thus, both spiders and the scorpion have an achatin gene and in spiders there are two different genes encoding myosuppressin-like peptides while spiders also have two genes encoding novel LGamides. Another finding is the presence of trissin in spiders and scorpions, while neuropeptide genes that seem to be orthologs of Lottia LFRYamide and Platynereis CCRFamide were also found. Such genes were also found in various insect species, but seem to be lacking from the Holometabola. The Chelicerate neuropeptide and neuropeptide GPCR genes often have paralogs. As the large majority of these are probably not due to local gene duplications, is plausible that they reflect the effects of one or more ancient whole genome duplications. PMID:26928473

  12. Expression Pattern Similarities Support the Prediction of Orthologs Retaining Common Functions after Gene Duplication Events.

    PubMed

    Das, Malay; Haberer, Georg; Panda, Arup; Das Laha, Shayani; Ghosh, Tapas Chandra; Schäffner, Anton R

    2016-08-01

    The identification of functionally equivalent, orthologous genes (functional orthologs) across genomes is necessary for accurate transfer of experimental knowledge from well-characterized organisms to others. This frequently relies on automated, coding sequence-based approaches such as OrthoMCL, Inparanoid, and KOG, which usually work well for one-to-one homologous states. However, this strategy does not reliably work for plants due to the occurrence of extensive gene/genome duplication. Frequently, for one query gene, multiple orthologous genes are predicted in the other genome, and it is not clear a priori from sequence comparison and similarity which one preserves the ancestral function. We have studied 11 organ-dependent and stress-induced gene expression patterns of 286 Arabidopsis lyrata duplicated gene groups and compared them with the respective Arabidopsis (Arabidopsis thaliana) genes to predict putative expressologs and nonexpressologs based on gene expression similarity. Promoter sequence divergence as an additional tool to substantiate functional orthology only partially overlapped with expressolog classification. By cloning eight A. lyrata homologs and complementing them in the respective four Arabidopsis loss-of-function mutants, we experimentally proved that predicted expressologs are indeed functional orthologs, while nonexpressologs or nonfunctionalized orthologs are not. Our study demonstrates that even a small set of gene expression data in addition to sequence homologies are instrumental in the assignment of functional orthologs in the presence of multiple orthologs. PMID:27303025

  13. Genome-wide identification and characterization of the Dof gene family in Medicago truncatula.

    PubMed

    Shu, Y J; Song, L L; Zhang, J; Liu, Y; Guo, C H

    2015-01-01

    The DNA-binding one zinc finger (Dof) family is a classic plant-specific zinc-finger transcription factor family, which is involved in many important processes, including seed maturation and germination, plant growth and development, and light responses. Investigation of the Medicago truncatula genome revealed 42 putative Dof genes, each of which holds one Dof domain. These genes were classified into four groups based on phylogenetic analysis, which are similar to the groups reported for Arabidopsis and rice. Based on genome duplication analysis, it was found that the MtDof genes were distributed on all chromosomes and had expanded through tandem gene duplication and segmental duplication events. Two main duplication regions were identified, one from tandem duplication and another from segmental duplication. By analyzing high-throughput sequencing data from M. truncatula, we found that most of the MtDof genes showed specific expression patterns in different tissues. According to cis-regulatory element analysis, these MtDof genes are regulated by different cis-acting motifs, which are important for the functional divergence of the MtDof genes in different processes. Thus, using genome-wide identification, evolution, and expression pattern analysis of the Dof genes in M. truncatula, our study provides valuable information for understanding the potential function of the Dof genes in regulating the growth and development of M. truncatula. PMID:26400295

  14. Convergent evolution of human and bovine haptoglobin: partial duplication of the genes.

    PubMed

    Wicher, Krzysztof B; Fries, Erik

    2007-10-01

    Haptoglobin (Hp) is a hemoglobin-binding plasma protein consisting of two types of chains, called alpha and beta, which originate from a common polypeptide. In humans, but not in other mammals, Hp has been shown to occur in two allelic forms, Hp1 and Hp2, which differ in the length of the alpha-chain. The longer alpha-chain (in Hp2) seems to have arisen by an internal duplication of a gene segment coding for almost the entire alpha-chain of Hp1. In this article we show that Hp of cow (Bos taurus) contains an alpha-chain, the structure of which is similar to that of the human Hp2 alpha-chain. Furthermore, comparison of the structure of bovine Hp and human Hp2 suggests that the bovine gene arose by a duplication of the gene segment homologous to that duplicated in human Hp2. However, a phylogenetic analysis indicates that the two genes were formed independently. The evolutionary pressure that has led to the fixation of the Hps with a longer alpha-chain is not known. PMID:17922076

  15. The effect of functional compensation among duplicate genes can constrain their evolutionary divergence.

    PubMed

    Bozorgmehr, Joseph Esfandiar Hannon

    2012-01-01

    Gene duplicates have the inherent property of initially being functionally redundant. This means that they can compensate for the effect of deleterious variation occurring at one or more sister sites. Here, I present data bearing on evolutionary theory that illustrates the manner in which any functional adaptation in duplicate genes is markedly constrained because of the compensatory utility provided by a sustained genetic redundancy. Specifically, a two-locus epistatic model of paralogous genes was simulated to investigate the degree of purifying selection imposed, and whether this would serve to impede any possible biochemical innovation. Three population sizes were considered to see if, as expected, there was a significant difference in any selection for robustness. Interestingly, physical linkage between tandem duplicates was actually found to increase the probability of any neofunctionalization and the efficacy of selection, contrary to what is expected in the case of singleton genes. The results indicate that an evolutionary trade-off often exists between any functional change under either positive or relaxed selection and the need to compensate for failures due to degenerative mutations, thereby guaranteeing the reliability of protein production. PMID:22546821

  16. The butterfly plant arms-race escalated by gene and genome duplications

    PubMed Central

    Edger, Patrick P.; Heidel-Fischer, Hanna M.; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E.; Heckel, David G.; Der, Joshua P.; Wafula, Eric K.; Tang, Michelle; Hofberger, Johannes A.; Smithson, Ann; Hall, Jocelyn C.; Blanchette, Matthieu; Bureau, Thomas E.; Wright, Stephen I.; dePamphilis, Claude W.; Eric Schranz, M.; Barker, Michael S.; Conant, Gavin C.; Wahlberg, Niklas; Vogel, Heiko; Pires, J. Chris; Wheat, Christopher W.

    2015-01-01

    Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits. PMID:26100883

  17. Adaptations to Endosymbiosis in a Cnidarian-Dinoflagellate Association: Differential Gene Expression and Specific Gene Duplications

    PubMed Central

    Magnone, Virginie; Allemand, Denis; Furla, Paola; Sabourault, Cécile

    2011-01-01

    Trophic endosymbiosis between anthozoans and photosynthetic dinoflagellates forms the key foundation of reef ecosystems. Dysfunction and collapse of symbiosis lead to bleaching (symbiont expulsion), which is responsible for the severe worldwide decline of coral reefs. Molecular signals are central to the stability of this partnership and are therefore closely related to coral health. To decipher inter-partner signaling, we developed genomic resources (cDNA library and microarrays) from the symbiotic sea anemone Anemonia viridis. Here we describe differential expression between symbiotic (also called zooxanthellate anemones) or aposymbiotic (also called bleached) A. viridis specimens, using microarray hybridizations and qPCR experiments. We mapped, for the first time, transcript abundance separately in the epidermal cell layer and the gastrodermal cells that host photosynthetic symbionts. Transcriptomic profiles showed large inter-individual variability, indicating that aposymbiosis could be induced by different pathways. We defined a restricted subset of 39 common genes that are characteristic of the symbiotic or aposymbiotic states. We demonstrated that transcription of many genes belonging to this set is specifically enhanced in the symbiotic cells (gastroderm). A model is proposed where the aposymbiotic and therefore heterotrophic state triggers vesicular trafficking, whereas the symbiotic and therefore autotrophic state favors metabolic exchanges between host and symbiont. Several genetic pathways were investigated in more detail: i) a key vitamin K–dependant process involved in the dinoflagellate-cnidarian recognition; ii) two cnidarian tissue-specific carbonic anhydrases involved in the carbon transfer from the environment to the intracellular symbionts; iii) host collagen synthesis, mostly supported by the symbiotic tissue. Further, we identified specific gene duplications and showed that the cnidarian-specific isoform was also up-regulated both in the

  18. The Discoidin I Gene Family of Dictyostelium Discoideum Is Linked to Genes Regulating Its Expression

    PubMed Central

    Welker, D. L.

    1988-01-01

    The discoidin I protein has been studied extensively as a marker of early development in the cellular slime mold Dictyostelium discoideum. However, like most other developmentally regulated proteins in this system, no reliable information was available on the linkage of the discoidin genes to other known genes. Analysis of the linkage of the discoidin I genes by use of restriction fragment length polymorphisms revealed that all three discoidin I genes as well as a pseudogene are located on linkage group II. This evidence is consistent with the discoidin I genes forming a gene cluster that may be under the control of a single regulatory element. The discoidin I genes are linked to three genetic loci (disA, motA, daxA) that affect the expression of the discoidin I protein. Linkage of the gene family members to regulatory loci may be important in the coordinate maintenance of the gene family and regulatory loci. A duplication affecting the entire discoidin gene family is also linked to group II; this appears to be a small tandem duplication. This duplication was mapped using a DNA polymorphism generated by insertion of the Tdd-3 mobile genetic element into a Tdd-2 element flanking the γ gene. A probe for Tdd-2 identified a restriction fragment length polymorphism in strain AX3K that was consistent with generation by a previously proposed Tdd-3 insertion event. A putative duplication or rearrangement of a second Tdd-2 element on linkage group IV of strain AX3K was also identified. This is the first linkage information available for mobile genetic elements in D. discoideum. PMID:3402731

  19. Mutation-Selection Balance in Multi-Locus Systems. I. Duplicate Gene Action

    PubMed Central

    Pritchett-Ewing, Evelyn

    1981-01-01

    A theoretical model is presented that extends the case of selection against homozygous recessives counterbalanced by mutation to a system of n loci. This extension allows analysis of the role of gene duplication in the evolution of new function. The aspect of retention of function for sufficiently long periods of time to allow for divergence vs. silencing of nonfunctional loci is discussed in relation to examples in salmonid and catastomid fishes and in the globin-like clusters. PMID:17249091

  20. Duplication and expression of CYC2-like genes in the origin and maintenance of corolla zygomorphy in Lamiales.

    PubMed

    Zhong, Jinshun; Kellogg, Elizabeth A

    2015-01-01

    Duplication, retention, and expression of CYCLOIDEA2 (CYC2)-like genes are thought to affect evolution of corolla symmetry. However, exactly what and how changes in CYC2-like genes correlate with the origin of corolla zygomorphy are poorly understood. We inferred and calibrated a densely sampled phylogeny of CYC2-like genes across the Lamiales and examined their expression in early diverging (EDL) and higher core clades (HCL). CYC2-like genes duplicated extensively in Lamiales, at least six times in core Lamiales (CL) around the Cretaceous-Paleogene (K-Pg) boundary, and seven more in EDL relatively more recently. Nested duplications and losses of CYC2-like paralogs are pervasive but may not correlate with transitions in corolla symmetry. We found evidence for dN/dS (ω) variation following gene duplications. CYC2-like paralogs in HCL show differential expression with higher expression in adaxial petals. Asymmetric expression but not recurrent duplication of CYC2-like genes correlates with the origin of corolla zygomorphy. Changes in both cis-regulatory and coding domains of CYC2-like genes are probably crucial for the evolution of corolla zygomorphy. Multiple selection regimes appear likely to play important roles in gene retention. The parallel duplications of CYC2-like genes are after the initial diversification of bumble bees and Euglossine bees. PMID:25329857

  1. Gene duplication in Mimulus underlies parallel floral evolution via independent trans-regulatory changes.

    PubMed

    Cooley, Arielle M; Modliszewski, Jennifer L; Rommel, Megan L; Willis, John H

    2011-04-26

    Identifying the genetic basis of parallelism reveals the means by which evolution repeats itself and shows what aspects-if any-may be predictable. The recently tetraploid luteus group of Mimulus contains five species native to central Chile, three of which have evolved extensive red floral pigmentation using at least two distinct loci . Here we show that the parallel evolution of petal lobe anthocyanin (PLA) pigmentation in M. cupreus and M. luteus var. variegatus occurred via separate yet strikingly similar mechanisms. In each case, a dominant, single-locus gain of pigmentation maps to a genomic region (pla1 and pla2, respectively) containing adjacent, apparently recently duplicated paralogs of MYB anthocyanin-regulating transcription factors. Interestingly, candidate genes in pla1 and pla2 are themselves related by an older duplication. In both cases, pla genotype cosegregates with expression of multiple genes in the anthocyanin biosynthetic pathway, revealing a mechanism of coordinated trans-regulatory expression changes across functionally related enzyme-encoding genes. We conclude that in this instance, evolution has repeated itself with marked consistency. Duplication has enabled that repetition to occur using two physically independent but functionally similar loci, highlighting the importance of genomic complexity to the evolutionary process. PMID:21474312

  2. Introns regulate the production of ribosomal proteins by modulating splicing of duplicated ribosomal protein genes

    PubMed Central

    Petibon, Cyrielle; Parenteau, Julie; Catala, Mathieu; Elela, Sherif Abou

    2016-01-01

    Most budding yeast introns exist in the many duplicated ribosomal protein genes (RPGs) and it has been posited that they remain there to modulate the expression of RPGs and cell growth in response to stress. However, the mechanism by which introns regulate the expression of RPGs and their impact on the synthesis of ribosomal proteins remain unclear. In this study, we show that introns determine the ratio of ribosomal protein isoforms through asymmetric paralog-specific regulation of splicing. Exchanging the introns and 3′ untranslated regions of the duplicated RPS9 genes altered the splicing efficiency and changed the ratio of the ribosomal protein isoforms. Mutational analysis of the RPS9 genes indicated that splicing is regulated by variations in the intron structure and the 3′ untranslated region. Together these data suggest that preferential splicing of duplicated RPGs provides a means for adjusting the ratio of different ribosomal protein isoforms, while maintaining the overall expression level of each ribosomal protein. PMID:26945043

  3. Gene Duplication as a Mechanism of Genetic Adaptation in SACCHAROMYCES CEREVISIAE

    PubMed Central

    Hansche, Paul E.

    1975-01-01

    It has been shown that specific mutations of the gene that codes for the general acid monophosphatase (Aphtase) of S. cerevisiae can increase the affinity of this enzyme for β-glycerophosphate (BGP) and thereby provide this organism with the capacity to exploit extremely low concentrations of this organic phosphate (Francis and Hansche 1973). In this report two additional avenues are demonstrated to be available to this organism for increasing its capacity to exploit low concentrations of organic phosphates. One avenue is through mutations that increase the amount of Aphtase that associates with the cell wall, where it catalizes the hydrolysis of exogenous organic phosphates. The other avenue is through duplication of the gene that codes for Aphtase, doubling the amount of Aphtase synthesized. — The spontaneous duplication of the structural gene of Aphtase and the incorporation of the duplicate into this experimental population as a means of exploiting low concentrations of exogenous organic phosphates provides direct support for the first step of the mechanism through which new metabolic functions are postulated to evolve. PMID:236976

  4. GENE-dosage effects on fitness in recent adaptive duplications: ace-1 in the mosquito Culex pipiens.

    PubMed

    Labbé, Pierrick; Milesi, Pascal; Yébakima, André; Pasteur, Nicole; Weill, Mylène; Lenormand, Thomas

    2014-07-01

    Gene duplications have long been advocated to contribute to the evolution of new functions. The role of selection in their early spread is more controversial. Unless duplications are favored for a direct benefit of increased expression, they are likely detrimental. In this article, we investigated the case of duplications favored because they combine already functionally divergent alleles. Their gene-dosage/fitness relations are poorly known because selection may operate on both overall expression and duplicates relative dosage. Using the well-documented case of Culex pipiens resistance to insecticides, we compared strains with various ace-1 allele combinations, including two duplicated alleles carrying both susceptible and resistant copies. The overall protein activity was nearly additive, but, surprisingly, fitness correlated better with the relative proportion of susceptible and resistant copies rather than any absolute measure of activity. Gene dosage is thus crucial, duplications stabilizing a "heterozygote" phenotype. It corroborates the view that these were favored because they fix a permanent heterosis, thereby solving the irreducible trade-off between resistance and synaptic transmission. Moreover, we showed that the contrasted successes of the two duplicated alleles in natural populations depend on genetic changes unrelated to ace-1, confirming the probable implication of recessive sublethal mutations linked to structural rearrangements in some duplications. PMID:24494966

  5. Comparative Transcriptome Analyses Reveal Core Parasitism Genes and Suggest Gene Duplication and Repurposing as Sources of Structural Novelty

    PubMed Central

    Yang, Zhenzhen; Wafula, Eric K.; Honaas, Loren A.; Zhang, Huiting; Das, Malay; Fernandez-Aparicio, Monica; Huang, Kan; Bandaranayake, Pradeepa C.G.; Wu, Biao; Der, Joshua P.; Clarke, Christopher R.; Ralph, Paula E.; Landherr, Lena; Altman, Naomi S.; Timko, Michael P.; Yoder, John I.; Westwood, James H.; dePamphilis, Claude W.

    2015-01-01

    The origin of novel traits is recognized as an important process underlying many major evolutionary radiations. We studied the genetic basis for the evolution of haustoria, the novel feeding organs of parasitic flowering plants, using comparative transcriptome sequencing in three species of Orobanchaceae. Around 180 genes are upregulated during haustorial development following host attachment in at least two species, and these are enriched in proteases, cell wall modifying enzymes, and extracellular secretion proteins. Additionally, about 100 shared genes are upregulated in response to haustorium inducing factors prior to host attachment. Collectively, we refer to these newly identified genes as putative “parasitism genes.” Most of these parasitism genes are derived from gene duplications in a common ancestor of Orobanchaceae and Mimulus guttatus, a related nonparasitic plant. Additionally, the signature of relaxed purifying selection and/or adaptive evolution at specific sites was detected in many haustorial genes, and may play an important role in parasite evolution. Comparative analysis of gene expression patterns in parasitic and nonparasitic angiosperms suggests that parasitism genes are derived primarily from root and floral tissues, but with some genes co-opted from other tissues. Gene duplication, often taking place in a nonparasitic ancestor of Orobanchaceae, followed by regulatory neofunctionalization, was an important process in the origin of parasitic haustoria. PMID:25534030

  6. An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.

    PubMed

    Shao, Mingfu; Lin, Yu; Moret, Bernard M E

    2015-05-01

    Computing the edit distance between two genomes is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be computed in linear time for genomes without duplicate genes, while the problem becomes NP-hard in the presence of duplicate genes. In this article, we propose an integer linear programming (ILP) formulation to compute the DCJ distance between two genomes with duplicate genes. We also provide an efficient preprocessing approach to simplify the ILP formulation while preserving optimality. Comparison on simulated genomes demonstrates that our method outperforms MSOAR in computing the edit distance, especially when the genomes contain long duplicated segments. We also apply our method to assign orthologous gene pairs among human, mouse, and rat genomes, where once again our method outperforms MSOAR. PMID:25517208

  7. Gene structure variation in segmental duplication block C of human chromosome 7q 11.23 during primate evolution.

    PubMed

    Kim, Yun-Ji; Ahn, Kung; Gim, Jeong-An; Oh, Man Hwan; Han, Kyudong; Kim, Heui-Soo

    2015-12-01

    Segmental duplication, or low-copy repeat (LCR) event, occurs during primate evolution and is an important source of genomic diversity, including gain or loss of gene function. The human chromosome 7q 11.23 is related to the William-Beuren syndrome and contains large region-specific LCRs composed of blocks A, B, and C that have different copy numbers in humans and different primates. We analyzed the structure of POM121, NSUN5, FKBP6, and TRIM50 genes in the LCRs of block C. Based on computational analysis, POM121B created by a segmental duplication acquired a new exonic region, whereas NSUN5B (NSUN5C) showed structural variation by integration of HERV-K LTR after duplication from the original NSUN5 gene. The TRIM50 gene originally consists of seven exons, whereas the duplicated TRIM73 and TRIM74 genes present five exons because of homologous recombination-mediated deletion. In addition, independent duplication events of the FKBP6 gene generated two pseudogenes at different genomic locations. In summary, these clustered genes are created by segmental duplication, indicating that they show dynamic evolutionary events, leading to structure variation in the primate genome. PMID:26196062

  8. Patterns of gene duplication and functional evolution during the diversification of the AGAMOUS subfamily of MADS box genes in angiosperms.

    PubMed Central

    Kramer, Elena M; Jaramillo, M Alejandra; Di Stilio, Verónica S

    2004-01-01

    Members of the AGAMOUS (AG) subfamily of MIKC-type MADS-box genes appear to control the development of reproductive organs in both gymnosperms and angiosperms. To understand the evolution of this subfamily in the flowering plants, we have identified 26 new AG-like genes from 15 diverse angiosperm species. Phylogenetic analyses of these genes within a large data set of AG-like sequences show that ancient gene duplications were critical in shaping the evolution of the subfamily. Before the radiation of extant angiosperms, one event produced the ovule-specific D lineage and the well-characterized C lineage, whose members typically promote stamen and carpel identity as well as floral meristem determinacy. Subsequent duplications in the C lineage resulted in independent instances of paralog subfunctionalization and maintained functional redundancy. Most notably, the functional homologs AG from Arabidopsis and PLENA (PLE) from Antirrhinum are shown to be representatives of separate paralogous lineages rather than simple genetic orthologs. The multiple subfunctionalization events that have occurred in this subfamily highlight the potential for gene duplication to lead to dissociation among genetic modules, thereby allowing an increase in morphological diversity. PMID:15020484

  9. The role of human-specific gene duplications during brain development and evolution.

    PubMed

    Sassa, Takayuki

    2013-09-01

    One of the most fascinating questions in evolutionary biology is how traits unique to humans, such as their high cognitive abilities, erect bipedalism, and hairless skin, are encoded in the genome. Recent advances in genomics have begun to reveal differences between the genomes of the great apes. It has become evident that one of the many mutation types, segmental duplication, has drastically increased in the primate genomes, and most remarkably in the human genome. Genes contained in these segmental duplications have a tremendous potential to cause genetic innovation, probably accounting for the acquisition of human-specific traits. In this review, I begin with an overview of the genes, which have increased their copy number specifically in the human lineage, following its separation from the common ancestor with our closest living relative, the chimpanzee. Then, I introduce the recent experimental approaches, focusing on SRGAP2, which has been partially duplicated, to elucidate the role of SRGAP2 protein and its human-specific paralogs in human brain development and evolution. PMID:23782070

  10. Digital gene expression analysis with sample multiplexing and PCR duplicate detection: A straightforward protocol.

    PubMed

    Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph

    2016-01-01

    Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans. PMID:27401671