Sample records for tandemly duplicated genes

  1. PTGBase: an integrated database to study tandem duplicated genes in plants.

    PubMed

    Yu, Jingyin; Ke, Tao; Tehrim, Sadia; Sun, Fengming; Liao, Boshou; Hua, Wei

    2015-01-01

    Tandem duplication is a wide-spread phenomenon in plant genomes and plays significant roles in evolution and adaptation to changing environments. Tandem duplicated genes related to certain functions will lead to the expansion of gene families and bring increase of gene dosage in the form of gene cluster arrays. Many tandem duplication events have been studied in plant genomes; yet, there is a surprising shortage of efforts to systematically present the integration of large amounts of information about publicly deposited tandem duplicated gene data across the plant kingdom. To address this shortcoming, we developed the first plant tandem duplicated genes database, PTGBase. It delivers the most comprehensive resource available to date, spanning 39 plant genomes, including model species and newly sequenced species alike. Across these genomes, 54 130 tandem duplicated gene clusters (129 652 genes) are presented in the database. Each tandem array, as well as its member genes, is characterized in complete detail. Tandem duplicated genes in PTGBase can be explored through browsing or searching by identifiers or keywords of functional annotation and sequence similarity. Users can download tandem duplicated gene arrays easily to any scale, up to the complete annotation data set for an entire plant genome. PTGBase will be updated regularly with newly sequenced plant species as they become available. © The Author(s) 2015. Published by Oxford University Press.

  2. Detecting long tandem duplications in genomic sequences.

    PubMed

    Audemard, Eric; Schiex, Thomas; Faraut, Thomas

    2012-05-08

    Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication. In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS  <  1) and that it is also able to predict tandem duplications involving non coding elements such as pseudo-genes or RNA genes. ReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.

  3. Profiling of gene duplication patterns of sequenced teleost genomes: evidence for rapid lineage-specific genome expansion mediated by recent tandem duplications.

    PubMed

    Lu, Jianguo; Peatman, Eric; Tang, Haibao; Lewis, Joshua; Liu, Zhanjiang

    2012-06-15

    Gene duplication has had a major impact on genome evolution. Localized (or tandem) duplication resulting from unequal crossing over and whole genome duplication are believed to be the two dominant mechanisms contributing to vertebrate genome evolution. While much scrutiny has been directed toward discerning patterns indicative of whole-genome duplication events in teleost species, less attention has been paid to the continuous nature of gene duplications and their impact on the size, gene content, functional diversity, and overall architecture of teleost genomes. Here, using a Markov clustering algorithm directed approach we catalogue and analyze patterns of gene duplication in the four model teleost species with chromosomal coordinates: zebrafish, medaka, stickleback, and Tetraodon. Our analyses based on set size, duplication type, synonymous substitution rate (Ks), and gene ontology emphasize shared and lineage-specific patterns of genome evolution via gene duplication. Most strikingly, our analyses highlight the extraordinary duplication and retention rate of recent duplicates in zebrafish and their likely role in the structural and functional expansion of the zebrafish genome. We find that the zebrafish genome is remarkable in its large number of duplicated genes, small duplicate set size, biased Ks distribution toward minimal mutational divergence, and proportion of tandem and intra-chromosomal duplicates when compared with the other teleost model genomes. The observed gene duplication patterns have played significant roles in shaping the architecture of teleost genomes and appear to have contributed to the recent functional diversification and divergence of important physiological processes in zebrafish. We have analyzed gene duplication patterns and duplication types among the available teleost genomes and found that a large number of genes were tandemly and intrachromosomally duplicated, suggesting their origin of independent and continuous duplication

  4. Targeted tandem duplication of a large chromosomal segment in Aspergillus oryzae.

    PubMed

    Takahashi, Tadashi; Sato, Atsushi; Ogawa, Masahiro; Hanya, Yoshiki; Oguma, Tetsuya

    2014-08-01

    We describe here the first successful construction of a targeted tandem duplication of a large chromosomal segment in Aspergillus oryzae. The targeted tandem chromosomal duplication was achieved by using strains that had a 5'-deleted pyrG upstream of the region targeted for tandem chromosomal duplication and a 3'-deleted pyrG downstream of the target region. Consequently,strains bearing a 210-kb targeted tandem chromosomal duplication near the centromeric region of chromosome 8 and strains bearing a targeted tandem chromosomal duplication of a 700-kb region of chromosome 2 were successfully constructed. The strains bearing the tandem chromosomal duplication were efficiently obtained from the regenerated protoplast of the parental strains. However, the generation of the chromosomal duplication did not depend on the introduction of double-stranded breaks(DSBs) by I-SceI. The chromosomal duplications of these strains were stably maintained after five generations of culture under nonselective conditions. The strains bearing the tandem chromosomal duplication in the 700-kb region of chromosome 2 showed highly increased protease activity in solid-state culture, indicating that the duplication of large chromosomal segments could be a useful new breeding technology and gene analysis method.

  5. Heterogeneous expression pattern of tandem duplicated sHsps genes during fruit ripening in two tomato species

    NASA Astrophysics Data System (ADS)

    Arce, DP; Krsticevic, FJ; Ezpeleta, J.; Ponce, SD; Pratta, GR; Tapia, E.

    2016-04-01

    The small heat shock proteins (sHSPs) have been found to play a critical role in physiological stress conditions in protecting proteins from irreversible aggregation. To characterize the gene expression profile of four sHsps with a tandem gene structure arrangement in the domesticated Solanum lycopersicum (Heinz 1706) genome and its wild close relative Solanum pimpinellifolium (LA1589), differential gene expression analysis using RNA-Seq was conducted in three ripening stages in both cultivars fruits. Gene promoter analysis was performed to explain the heterogeneous pattern of gene expression found for these tandem duplicated sHsps. In silico analysis results contribute to refocus wet experiment analysis in tomato sHsp family proteins.

  6. Diverse Cis-Regulatory Mechanisms Contribute to Expression Evolution of Tandem Gene Duplicates

    PubMed Central

    Baudouin-Gonzalez, Luís; Santos, Marília A; Tempesta, Camille; Sucena, Élio; Roch, Fernando; Tanaka, Kohtaro

    2017-01-01

    Abstract Pairs of duplicated genes generally display a combination of conserved expression patterns inherited from their unduplicated ancestor and newly acquired domains. However, how the cis-regulatory architecture of duplicated loci evolves to produce these expression patterns is poorly understood. We have directly examined the gene-regulatory evolution of two tandem duplicates, the Drosophila Ly6 genes CG9336 and CG9338, which arose at the base of the drosophilids between 40 and 60 Ma. Comparing the expression patterns of the two paralogs in four Drosophila species with that of the unduplicated ortholog in the tephritid Ceratitis capitata, we show that they diverged from each other as well as from the unduplicated ortholog. Moreover, the expression divergence appears to have occurred close to the duplication event and also more recently in a lineage-specific manner. The comparison of the tissue-specific cis-regulatory modules (CRMs) controlling the paralog expression in the four Drosophila species indicates that diverse cis-regulatory mechanisms, including the novel tissue-specific enhancers, differential inactivation, and enhancer sharing, contributed to the expression evolution. Our analysis also reveals a surprisingly variable cis-regulatory architecture, in which the CRMs driving conserved expression domains change in number, location, and specificity. Altogether, this study provides a detailed historical account that uncovers a highly dynamic picture of how the paralog expression patterns and their underlying cis-regulatory landscape evolve. We argue that our findings will encourage studying cis-regulatory evolution at the whole-locus level to understand how interactions between enhancers and other regulatory levels shape the evolution of gene expression. PMID:28961967

  7. Both mechanism and age of duplications contribute to biased gene retention patterns in plants.

    PubMed

    Rody, Hugo V S; Baute, Gregory J; Rieseberg, Loren H; Oliveira, Luiz O

    2017-01-06

    All extant seed plants are successful paleopolyploids, whose genomes carry duplicate genes that have survived repeated episodes of diploidization. However, the survival of gene duplicates is biased with respect to gene function and mechanism of duplication. Transcription factors, in particular, are reported to be preferentially retained following whole-genome duplications (WGDs), but disproportionately lost when duplicated by tandem events. An explanation for this pattern is provided by the Gene Balance Hypothesis (GBH), which posits that duplicates of highly connected genes are retained following WGDs to maintain optimal stoichiometry among gene products; but such connected gene duplicates are disfavored following tandem duplications. We used genomic data from 25 taxonomically diverse plant species to investigate the roles of duplication mechanism, gene function, and age of duplication in the retention of duplicate genes. Enrichment analyses were conducted to identify Gene Ontology (GO) functional categories that were overrepresented in either WGD or tandem duplications, or across ranges of divergence times. Tandem paralogs were much younger, on average, than WGD paralogs and the most frequently overrepresented GO categories were not shared between tandem and WGD paralogs. Transcription factors were overrepresented among ancient paralogs regardless of mechanism of origin or presence of a WGD. Also, in many cases, there was no bias toward transcription factor retention following recent WGDs. Both the fixation and the retention of duplicated genes in plant genomes are context-dependent events. The strong bias toward ancient transcription factor duplicates can be reconciled with the GBH if selection for optimal stoichiometry among gene products is strongest following the earliest polyploidization events and becomes increasingly relaxed as gene families expand.

  8. North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene

    PubMed Central

    Sullivan, Lori S.; Wheaton, Dianna K.; Locke, Kirsten G.; Jones, Kaylie D.; Koboldt, Daniel C.; Fulton, Robert S.; Wilson, Richard K.; Blanton, Susan H.; Birch, David G.; Daiger, Stephen P.

    2016-01-01

    Purpose To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). Methods A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Results Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13. The duplication creates a partial copy of CCNC and a complete copy of PRDM13. The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. Conclusions The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1

  9. North Carolina macular dystrophy (MCDR1) caused by a novel tandem duplication of the PRDM13 gene.

    PubMed

    Bowne, Sara J; Sullivan, Lori S; Wheaton, Dianna K; Locke, Kirsten G; Jones, Kaylie D; Koboldt, Daniel C; Fulton, Robert S; Wilson, Richard K; Blanton, Susan H; Birch, David G; Daiger, Stephen P

    2016-01-01

    To identify the underlying cause of disease in a large family with North Carolina macular dystrophy (NCMD). A large four-generation family (RFS355) with an autosomal dominant form of NCMD was ascertained. Family members underwent comprehensive visual function evaluations. Blood or saliva from six affected family members and three unaffected spouses was collected and DNA tested for linkage to the MCDR1 locus on chromosome 6q12. Three affected family members and two unaffected spouses underwent whole exome sequencing (WES) and subsequently, custom capture of the linkage region followed by next-generation sequencing (NGS). Standard PCR and dideoxy sequencing were used to further characterize the mutation. Of the 12 eyes examined in six affected individuals, all but two had Gass grade 3 macular degeneration features. Large central excavation of the retinal and choroid layers, referred to as a macular caldera, was seen in an age-independent manner in the grade 3 eyes. The calderas are unique to affected individuals with MCDR1. Genome-wide linkage mapping and haplotype analysis of markers from the chromosome 6q region were consistent with linkage to the MCDR1 locus. Whole exome sequencing and custom-capture NGS failed to reveal any rare coding variants segregating with the phenotype. Analysis of the custom-capture NGS sequencing data for copy number variants uncovered a tandem duplication of approximately 60 kb on chromosome 6q. This region contains two genes, CCNC and PRDM13 . The duplication creates a partial copy of CCNC and a complete copy of PRDM13 . The duplication was found in all affected members of the family and is not present in any unaffected members. The duplication was not seen in 200 ethnically matched normal chromosomes. The cause of disease in the original family with MCDR1 and several others has been recently reported to be dysregulation of the PRDM13 gene, caused by either single base substitutions in a DNase 1 hypersensitive site upstream of the CCNC

  10. Persons with Quebec platelet disorder have a tandem duplication of PLAU, the urokinase plasminogen activator gene.

    PubMed

    Paterson, Andrew D; Rommens, Johanna M; Bharaj, Bhupinder; Blavignac, Jessica; Wong, Isidro; Diamandis, Maria; Waye, John S; Rivard, Georges E; Hayward, Catherine P M

    2010-02-11

    Quebec platelet disorder (QPD) is an autosomal dominant bleeding disorder linked to a region on chromosome 10 that includes PLAU, the urokinase plasminogen activator gene. QPD increases urokinase plasminogen activator mRNA levels, particularly during megakaryocyte differentiation, without altering expression of flanking genes. Because PLAU sequence changes were excluded as the cause of this bleeding disorder, we investigated whether the QPD mutation involved PLAU copy number variation. All 38 subjects with QPD had a direct tandem duplication of a 78-kb genomic segment that includes PLAU. This mutation was specific to QPD as it was not present in any unaffected family members (n = 114), unrelated French Canadians (n = 221), or other persons tested (n = 90). This new information on the genetic mutation will facilitate diagnostic testing for QPD and studies of its pathogenesis and prevalence. QPD is the first bleeding disorder to be associated with a gene duplication event and a PLAU mutation.

  11. Soybean (Glycine max) expansin gene superfamily origins: segmental and tandem duplication events followed by divergent selection among subfamilies

    PubMed Central

    2014-01-01

    Background Expansins are plant cell wall loosening proteins that are involved in cell enlargement and a variety of other developmental processes. The expansin superfamily contains four subfamilies; namely, α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB). Although the genome sequencing of soybeans is complete, our knowledge about the pattern of expansion and evolutionary history of soybean expansin genes remains limited. Results A total of 75 expansin genes were identified in the soybean genome, and grouped into four subfamilies based on their phylogenetic relationships. Structural analysis revealed that the expansin genes are conserved in each subfamily, but are divergent among subfamilies. Furthermore, in soybean and Arabidopsis, the expansin gene family has been mainly expanded through tandem and segmental duplications; however, in rice, segmental duplication appears to be the dominant process that generates this superfamily. The transcriptome atlas revealed notable differential expression in either transcript abundance or expression patterns under normal growth conditions. This finding was consistent with the differential distribution of the cis-elements in the promoter region, and indicated wide functional divergence in this superfamily. Moreover, some critical amino acids that contribute to functional divergence and positive selection were detected. Finally, site model and branch-site model analysis of positive selection indicated that the soybean expansin gene superfamily is under strong positive selection, and that divergent selection constraints might have influenced the evolution of the four subfamilies. Conclusion This study demonstrated that the soybean expansin gene superfamily has expanded through tandem and segmental duplication. Differential expression indicated wide functional divergence in this superfamily. Furthermore, positive selection analysis revealed that divergent selection constraints might have

  12. Afrobatrachian mitochondrial genomes: genome reorganization, gene rearrangement mechanisms, and evolutionary trends of duplicated and rearranged genes

    PubMed Central

    2013-01-01

    Background Mitochondrial genomic (mitogenomic) reorganizations are rarely found in closely-related animals, yet drastic reorganizations have been found in the Ranoides frogs. The phylogenetic relationships of the three major ranoid taxa (Natatanura, Microhylidae, and Afrobatrachia) have been problematic, and mitogenomic information for afrobatrachians has not been available. Several molecular models for mitochondrial (mt) gene rearrangements have been proposed, but observational evidence has been insufficient to evaluate them. Furthermore, evolutionary trends in rearranged mt genes have not been well understood. To gain molecular and phylogenetic insights into these issues, we analyzed the mt genomes of four afrobatrachian species (Breviceps adspersus, Hemisus marmoratus, Hyperolius marmoratus, and Trichobatrachus robustus) and performed molecular phylogenetic analyses. Furthermore we searched for two evolutionary patterns expected in the rearranged mt genes of ranoids. Results Extensively reorganized mt genomes having many duplicated and rearranged genes were found in three of the four afrobatrachians analyzed. In fact, Breviceps has the largest known mt genome among vertebrates. Although the kinds of duplicated and rearranged genes differed among these species, a remarkable gene rearrangement pattern of non-tandemly copied genes situated within tandemly-copied regions was commonly found. Furthermore, the existence of concerted evolution was observed between non-neighboring copies of triplicated 12S and 16S ribosomal RNA regions. Conclusions Phylogenetic analyses based on mitogenomic data support a close relationship between Afrobatrachia and Microhylidae, with their estimated divergence 100 million years ago consistent with present-day endemism of afrobatrachians on the African continent. The afrobatrachian mt data supported the first tandem and second non-tandem duplication model for mt gene rearrangements and the recombination-based model for concerted

  13. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    PubMed

    Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

    2016-11-04

    Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types

  14. Phylogenetics of Lophotrochozoan bHLH Genes and the Evolution of Lineage-Specific Gene Duplicates.

    PubMed

    Bao, Yongbo; Xu, Fei; Shimeld, Sebastian M

    2017-04-01

    The gain and loss of genes encoding transcription factors is of importance to understanding the evolution of gene regulatory complexity. The basic helix-loop-helix (bHLH) genes encode a large superfamily of transcription factors. We systematically classify the bHLH genes from five mollusc, two annelid and one brachiopod genomes, tracing the pattern of bHLH gene evolution across these poorly studied Phyla. In total, 56-88 bHLH genes were identified in each genome, with most identifiable as members of previously described bilaterian families, or of new families we define. Of such families only one, Mesp, appears lost by all these species. Additional duplications have also played a role in the evolution of the bHLH gene repertoire, with many new lophotrochozoan-, mollusc-, bivalve-, or gastropod-specific genes defined. Using a combination of transcriptome mining, RT-PCR, and in situ hybridization we compared the expression of several of these novel genes in tissues and embryos of the molluscs Crassostrea gigas and Patella vulgata, finding both conserved expression and evidence for neofunctionalization. We also map the positions of the genes across these genomes, identifying numerous gene linkages. Some reflect recent paralog divergence by tandem duplication, others are remnants of ancient tandem duplications dating to the lophotrochozoan or bilaterian common ancestors. These data are built into a model of the evolution of bHLH genes in molluscs, showing formidable evolutionary stasis at the family level but considerable within-family diversification by tandem gene duplication. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. Whole Genome and Tandem Duplicate Retention Facilitated Glucosinolate Pathway Diversification in the Mustard Family

    PubMed Central

    Hofberger, Johannes A.; Lyons, Eric; Edger, Patrick P.; Chris Pires, J.; Eric Schranz, M.

    2013-01-01

    Plants share a common history of successive whole-genome duplication (WGD) events retaining genomic patterns of duplicate gene copies (ohnologs) organized in conserved syntenic blocks. Duplication was often proposed to affect the origin of novel traits during evolution. However, genetic evidence linking WGD to pathway diversification is scarce. We show that WGD and tandem duplication (TD) accelerated genetic versatility of plant secondary metabolism, exemplified with the glucosinolate (GS) pathway in the mustard family. GS biosynthesis is a well-studied trait, employing at least 52 biosynthetic and regulatory genes in the model plant Arabidopsis. In a phylogenomics approach, we identified 67 GS loci in Aethionema arabicum of the tribe Aethionemae, sister group to all mustard family members. All but one of the Arabidopsis GS gene families evolved orthologs in Aethionema and all but one of the orthologous sequence pairs exhibit synteny. The 45% fraction of duplicates among all protein-coding genes in Arabidopsis was increased to 95% and 97% for Arabidopsis and Aethionema GS pathway inventory, respectively. Compared with the 22% average for all protein-coding genes in Arabidopsis, 52% and 56% of Aethionema and Arabidopsis GS loci align to ohnolog copies dating back to the last common WGD event. Although 15% of all Arabidopsis genes are organized in tandem arrays, 45% and 48% of GS loci in Arabidopsis and Aethionema descend from TD, respectively. We describe a sequential combination of TD and WGD events driving gene family extension, thereby expanding the evolutionary playground for functional diversification and thus potential novelty and success. PMID:24171911

  16. EqualTDRL: illustrating equivalent tandem duplication random loss rearrangements.

    PubMed

    Hartmann, Tom; Bernt, Matthias; Middendorf, Martin

    2018-05-30

    To study the differences between two unichromosomal circular genomes, e.g., mitochondrial genomes, under the tandem duplication random loss (TDRL) rearrangement it is important to consider the whole set of potential TDRL rearrangement events that could have taken place. The reason is that for two given circular gene orders there can exist different TDRL rearrangements that transform one of the gene orders into the other. Hence, a TDRL event cannot always be reconstructed only from the knowledge of the circular gene order before a TDRL event and the circular gene order after it. We present the program EqualTDRL that computes and illustrates the complete set of TDRLs for pairs of circular gene orders that differ by only one TDRL. EqualTDRL considers the circularity of the given genomes and certain restrictions on the TDRL rearrangements. Examples for the latter are sequences of genes that have to be conserved during a TDRL or pairs of genes that frame intergenic regions which might represent remnants of duplicated genes. Additionally, EqualTDRL allows to determine the set of TDRLs that are minimum with respect to the number of duplicated genes. EqualTDRL supports scientists to study the complete set of TDRLs that possibly could have taken place in the evolution of mitochondrial genomes. EqualTDRL is implemented in C++ using the ggplot2 package of the open source programming language R and is freely available from http://pacosy.informatik.uni-leipzig.de/equaltdrl .

  17. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706)

    PubMed Central

    Krsticevic, Flavia J.; Arce, Débora P.; Ezpeleta, Joaquín; Tapia, Elizabeth

    2016-01-01

    In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. PMID:27565886

  18. New genes from old: asymmetric divergence of gene duplicates and the evolution of development.

    PubMed

    Holland, Peter W H; Marlétaz, Ferdinand; Maeso, Ignacio; Dunwell, Thomas L; Paps, Jordi

    2017-02-05

    Gene duplications and gene losses have been frequent events in the evolution of animal genomes, with the balance between these two dynamic processes contributing to major differences in gene number between species. After gene duplication, it is common for both daughter genes to accumulate sequence change at approximately equal rates. In some cases, however, the accumulation of sequence change is highly uneven with one copy radically diverging from its paralogue. Such 'asymmetric evolution' seems commoner after tandem gene duplication than after whole-genome duplication, and can generate substantially novel genes. We describe examples of asymmetric evolution in duplicated homeobox genes of moths, molluscs and mammals, in each case generating new homeobox genes that were recruited to novel developmental roles. The prevalence of asymmetric divergence of gene duplicates has been underappreciated, in part, because the origin of highly divergent genes can be difficult to resolve using standard phylogenetic methods.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'. © 2016 The Author(s).

  19. DB2: a probabilistic approach for accurate detection of tandem duplication breakpoints using paired-end reads.

    PubMed

    Yavaş, Gökhan; Koyutürk, Mehmet; Gould, Meetha P; McMahon, Sarah; LaFramboise, Thomas

    2014-03-05

    With the advent of paired-end high throughput sequencing, it is now possible to identify various types of structural variation on a genome-wide scale. Although many methods have been proposed for structural variation detection, most do not provide precise boundaries for identified variants. In this paper, we propose a new method, Distribution Based detection of Duplication Boundaries (DB2), for accurate detection of tandem duplication breakpoints, an important class of structural variation, with high precision and recall. Our computational experiments on simulated data show that DB2 outperforms state-of-the-art methods in terms of finding breakpoints of tandem duplications, with a higher positive predictive value (precision) in calling the duplications' presence. In particular, DB2's prediction of tandem duplications is correct 99% of the time even for very noisy data, while narrowing down the space of possible breakpoints within a margin of 15 to 20 bps on the average. Most of the existing methods provide boundaries in ranges that extend to hundreds of bases with lower precision values. Our method is also highly robust to varying properties of the sequencing library and to the sizes of the tandem duplications, as shown by its stable precision, recall and mean boundary mismatch performance. We demonstrate our method's efficacy using both simulated paired-end reads, and those generated from a melanoma sample and two ovarian cancer samples. Newly discovered tandem duplications are validated using PCR and Sanger sequencing. Our method, DB2, uses discordantly aligned reads, taking into account the distribution of fragment length to predict tandem duplications along with their breakpoints on a donor genome. The proposed method fine tunes the breakpoint calls by applying a novel probabilistic framework that incorporates the empirical fragment length distribution to score each feasible breakpoint. DB2 is implemented in Java programming language and is freely available

  20. Origin of a function by tandem gene duplication limits the evolutionary capability of its sister copy.

    PubMed

    Hasselmann, Martin; Lechner, Sarah; Schulte, Christina; Beye, Martin

    2010-07-27

    The most remarkable outcome of a gene duplication event is the evolution of a novel function. Little information exists on how the rise of a novel function affects the evolution of its paralogous sister gene copy, however. We studied the evolution of the feminizer (fem) gene from which the gene complementary sex determiner (csd) recently derived by tandem duplication within the honey bee (Apis) lineage. Previous studies showed that fem retained its sex determination function, whereas the rise of csd established a new primary signal of sex determination. We observed a specific reduction of nonsynonymous to synonymous substitution ratios in Apis to non-Apis fem. We found a contrasting pattern at two other genetically linked genes, suggesting that hitchhiking effects to csd, the locus under balancing selection, is not the cause of this evolutionary pattern. We also excluded higher synonymous substitution rates by relative rate testing. These results imply that stronger purifying selection is operating at the fem gene in the presence of csd. We propose that csd's new function interferes with the function of Fem protein, resulting in molecular constraints and limited evolvability of fem in the Apis lineage. Elevated silent nucleotide polymorphism in fem relative to the genome-wide average suggests that genetic linkage to the csd gene maintained more nucleotide variation in today's population. Our findings provide evidence that csd functionally and genetically interferes with fem, suggesting that a newly evolved gene and its functions can limit the evolutionary capability of other genes in the genome.

  1. A diffusion model for the fate of tandem gene duplicates in diploids.

    PubMed

    O'Hely, Martin

    2007-06-01

    Suppose one chromosome in one member of a population somehow acquires a duplicate copy of the gene, fully linked to the original gene's locus. Preservation is the event that eventually every chromosome in the population is a descendant of the one which initially carried the duplicate. For a haploid population in which the absence of all copies of the gene is lethal, the probability of preservation has recently been estimated via a diffusion approximation. That approximation is shown to carry over to the case of diploids and arbitrary strong selection against the absence of the gene. The techniques used lead to some new results. In the large population limit, it is shown that the relative probability that descendants of a small number of individuals carrying multiple copies of the gene fix in the population is proportional to the number of copies carried. The probability of preservation is approximated when chromosomes carrying two copies of the gene are subject to additional, fully non-functionalizing mutations, thereby modelling either an additional cost of replicating a longer genome, or a partial duplication of the gene. In the latter case the preservation probability depends only on the mutation rate to null for the duplicated portion of the gene.

  2. Tandem Duplication Events in the Expansion of the Small Heat Shock Protein Gene Family in Solanum lycopersicum (cv. Heinz 1706).

    PubMed

    Krsticevic, Flavia J; Arce, Débora P; Ezpeleta, Joaquín; Tapia, Elizabeth

    2016-10-13

    In plants, fruit maturation and oxidative stress can induce small heat shock protein (sHSP) synthesis to maintain cellular homeostasis. Although the tomato reference genome was published in 2012, the actual number and functionality of sHSP genes remain unknown. Using a transcriptomic (RNA-seq) and evolutionary genomic approach, putative sHSP genes in the Solanum lycopersicum (cv. Heinz 1706) genome were investigated. A sHSP gene family of 33 members was established. Remarkably, roughly half of the members of this family can be explained by nine independent tandem duplication events that determined, evolutionarily, their functional fates. Within a mitochondrial class subfamily, only one duplicated member, Solyc08g078700, retained its ancestral chaperone function, while the others, Solyc08g078710 and Solyc08g078720, likely degenerated under neutrality and lack ancestral chaperone function. Functional conservation occurred within a cytosolic class I subfamily, whose four members, Solyc06g076570, Solyc06g076560, Solyc06g076540, and Solyc06g076520, support ∼57% of the total sHSP RNAm in the red ripe fruit. Subfunctionalization occurred within a new subfamily, whose two members, Solyc04g082720 and Solyc04g082740, show heterogeneous differential expression profiles during fruit ripening. These findings, involving the birth/death of some genes or the preferential/plastic expression of some others during fruit ripening, highlight the importance of tandem duplication events in the expansion of the sHSP gene family in the tomato genome. Despite its evolutionary diversity, the sHSP gene family in the tomato genome seems to be endowed with a core set of four homeostasis genes: Solyc05g014280, Solyc03g082420, Solyc11g020330, and Solyc06g076560, which appear to provide a baseline protection during both fruit ripening and heat shock stress in different tomato tissues. Copyright © 2016 Krsticevic et al.

  3. Generation of Tandem Direct Duplications by Reversed-Ends Transposition of Maize Ac Elements

    PubMed Central

    Peterson, Thomas

    2013-01-01

    Tandem direct duplications are a common feature of the genomes of eukaryotes ranging from yeast to human, where they comprise a significant fraction of copy number variations. The prevailing model for the formation of tandem direct duplications is non-allelic homologous recombination (NAHR). Here we report the isolation of a series of duplications and reciprocal deletions isolated de novo from a maize allele containing two Class II Ac/Ds transposons. The duplication/deletion structures suggest that they were generated by alternative transposition reactions involving the termini of two nearby transposable elements. The deletion/duplication breakpoint junctions contain 8 bp target site duplications characteristic of Ac/Ds transposition events, confirming their formation directly by an alternative transposition mechanism. Tandem direct duplications and reciprocal deletions were generated at a relatively high frequency (∼0.5 to 1%) in the materials examined here in which transposons are positioned nearby each other in appropriate orientation; frequencies would likely be much lower in other genotypes. To test whether this mechanism may have contributed to maize genome evolution, we analyzed sequences flanking Ac/Ds and other hAT family transposons and identified three small tandem direct duplications with the structural features predicted by the alternative transposition mechanism. Together these results show that some class II transposons are capable of directly inducing tandem sequence duplications, and that this activity has contributed to the evolution of the maize genome. PMID:23966872

  4. Modes of gene duplication contribute differently to genetic novelty and redundancy, but show parallels across divergent angiosperms.

    PubMed

    Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P; Feltus, F Alex; Paterson, Andrew H

    2011-01-01

    Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution.

  5. Modes of Gene Duplication Contribute Differently to Genetic Novelty and Redundancy, but Show Parallels across Divergent Angiosperms

    PubMed Central

    Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P.; Feltus, F. Alex; Paterson, Andrew H.

    2011-01-01

    Background Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. Results In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Conclusion Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution. PMID:22164235

  6. Polymorphism, selection and tandem duplication of transferrin genes in Atlantic cod (Gadus morhua) - Conserved synteny between fish monolobal and tetrapod bilobal transferrin loci

    PubMed Central

    2011-01-01

    Background The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. Results The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. Conclusions The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive

  7. Polymorphism, selection and tandem duplication of transferrin genes in Atlantic cod (Gadus morhua)--conserved synteny between fish monolobal and tetrapod bilobal transferrin loci.

    PubMed

    Andersen, Øivind; De Rosa, Maria Cristina; Pirolli, Davide; Tooming-Klunderud, Ave; Petersen, Petra E; André, Carl

    2011-05-25

    The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in

  8. Genome Duplication and Gene Loss Affect the Evolution of Heat Shock Transcription Factor Genes in Legumes

    PubMed Central

    Jin, Jing; Jin, Xiaolei; Jiang, Haiyang; Yan, Hanwei; Cheng, Beijiu

    2014-01-01

    Whole-genome duplication events (polyploidy events) and gene loss events have played important roles in the evolution of legumes. Here we show that the vast majority of Hsf gene duplications resulted from whole genome duplication events rather than tandem duplication, and significant differences in gene retention exist between species. By searching for intraspecies gene colinearity (microsynteny) and dating the age distributions of duplicated genes, we found that genome duplications accounted for 42 of 46 Hsf-containing segments in Glycine max, while paired segments were rarely identified in Lotus japonicas, Medicago truncatula and Cajanus cajan. However, by comparing interspecies microsynteny, we determined that the great majority of Hsf-containing segments in Lotus japonicas, Medicago truncatula and Cajanus cajan show extensive conservation with the duplicated regions of Glycine max. These segments formed 17 groups of orthologous segments. These results suggest that these regions shared ancient genome duplication with Hsf genes in Glycine max, but more than half of the copies of these genes were lost. On the other hand, the Glycine max Hsf gene family retained approximately 75% and 84% of duplicated genes produced from the ancient genome duplication and recent Glycine-specific genome duplication, respectively. Continuous purifying selection has played a key role in the maintenance of Hsf genes in Glycine max. Expression analysis of the Hsf genes in Lotus japonicus revealed their putative involvement in multiple tissue-/developmental stages and responses to various abiotic stimuli. This study traces the evolution of Hsf genes in legume species and demonstrates that the rates of gene gain and loss are far from equilibrium in different species. PMID:25047803

  9. Metallothionein Gene Duplications and Metal Tolerance in Natural Populations of Drosophila melanogaster

    PubMed Central

    Maroni, G.; Wise, J.; Young, J. E.; Otto, E.

    1987-01-01

    A search for duplications of the Drosophila melanogaster metallothionein gene (Mtn) yielded numerous examples of this type of chromosomal rearrangement. These duplications are distributed widely—we found them in samples from four continents, and they are functional—larvae carrying Mtn duplications produce more Mtn RNA and tolerate increased cadmium and copper concentrations. Six different duplication types were characterized by restriction-enzyme analyses using probes from the Mtn region. The restriction maps show that in four cases the sequences, ranging in size between 2.2 and 6.0 kb, are arranged as direct, tandem repeats; in two other cases, this basic pattern is modified by the insertion of a putative transposable element into one of the repeated units. Duplications of the D. melanogaster metallothionein gene such as those that we found in natural populations may represent early stages in the evolution of a gene family. PMID:2828157

  10. Explosive Tandem and Segmental Duplications of Multigenic Families in Eucalyptus grandis

    PubMed Central

    Li, Qiang; Yu, Hong; Cao, Phi Bang; Fawal, Nizar; Mathé, Catherine; Azar, Sahar; Cassan-Wang, Hua; Myburg, Alexander A.; Grima-Pettenati, Jacqueline; Marque, Christiane; Teulières, Chantal; Dunand, Christophe

    2015-01-01

    Plant organisms contain a large number of genes belonging to numerous multigenic families whose evolution size reflects some functional constraints. Sequences from eight multigenic families, involved in biotic and abiotic responses, have been analyzed in Eucalyptus grandis and compared with Arabidopsis thaliana. Two transcription factor families APETALA 2 (AP2)/ethylene responsive factor and GRAS, two auxin transporter families PIN-FORMED and AUX/LAX, two oxidoreductase families (ascorbate peroxidases [APx] and Class III peroxidases [CIII Prx]), and two families of protective molecules late embryogenesis abundant (LEA) and DNAj were annotated in expert and exhaustive manner. Many recent tandem duplications leading to the emergence of species-specific gene clusters and the explosion of the gene numbers have been observed for the AP2, GRAS, LEA, PIN, and CIII Prx in E. grandis, while the APx, the AUX/LAX and DNAj are conserved between species. Although no direct evidence has yet demonstrated the roles of these recent duplicated genes observed in E. grandis, this could indicate their putative implications in the morphological and physiological characteristics of E. grandis, and be the key factor for the survival of this nondormant species. Global analysis of key families would be a good criterion to evaluate the capabilities of some organisms to adapt to environmental variations. PMID:25769696

  11. Evolution of Gene Duplication in Plants.

    PubMed

    Panchy, Nicholas; Lehti-Shiu, Melissa; Shiu, Shin-Han

    2016-08-01

    Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. © 2016 American Society of Plant Biologists. All Rights Reserved.

  12. An epigenetic state associated with areas of gene duplication

    PubMed Central

    Gimelbrant, Alexander A.; Chess, Andrew

    2006-01-01

    Asynchronous DNA replication is an epigenetically determined feature found in all cases of monoallelic expression, including genomic imprinting, X-inactivation, and random monoallelic expression of autosomal genes such as immunoglobulins and olfactory receptor genes. Most genes of the latter class were identified in experiments focused on genes functioning in the chemosensory and immune systems. We performed an unbiased survey of asynchronous replication in the mouse genome, excluding known asynchronously replicated genes. Fully 10% (eight of 80) of the genes tested exhibited asynchronous replication. A common feature of the newly identified asynchronously replicated areas is their proximity to areas of tandem gene duplication. Testing of other clustered areas supported the idea that such regions are enriched with asynchronously replicated genes. PMID:16687731

  13. Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution.

    PubMed

    Gu, Xun; Wang, Yufeng; Gu, Jianying

    2002-06-01

    The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome.

  14. The early stages of duplicate gene evolution

    PubMed Central

    Moore, Richard C.; Purugganan, Michael D.

    2003-01-01

    Gene duplications are one of the primary driving forces in the evolution of genomes and genetic systems. Gene duplicates account for 8–20% of the genes in eukaryotic genomes, and the rates of gene duplication are estimated at between 0.2% and 2% per gene per million years. Duplicate genes are believed to be a major mechanism for the establishment of new gene functions and the generation of evolutionary novelty, yet very little is known about the early stages of the evolution of duplicated gene pairs. It is unclear, for example, to what extent selection, rather than neutral genetic drift, drives the fixation and early evolution of duplicate loci. Analysis of recently duplicated genes in the Arabidopsis thaliana genome reveals significantly reduced species-wide levels of nucleotide polymorphisms in the progenitor and/or duplicate gene copies, suggesting that selective sweeps accompany the initial stages of the evolution of these duplicated gene pairs. Our results support recent theoretical work that indicates that fates of duplicate gene pairs may be determined in the initial phases of duplicate gene evolution and that positive selection plays a prominent role in the evolutionary dynamics of the very early histories of duplicate nuclear genes. PMID:14671323

  15. The Evolution of Pepsinogen C Genes in Vertebrates: Duplication, Loss and Functional Diversification

    PubMed Central

    Gonçalves, Odete; Wilson, Jonathan Mark

    2012-01-01

    Background Aspartic proteases comprise a large group of enzymes involved in peptide proteolysis. This collection includes prominent enzymes globally categorized as pepsins, which are derived from pepsinogen precursors. Pepsins are involved in gastric digestion, a hallmark of vertebrate physiology. An important member among the pepsinogens is pepsinogen C (Pgc). A particular aspect of Pgc is its apparent single copy status, which contrasts with the numerous gene copies found for example in pepsinogen A (Pga). Although gene sequences with similarity to Pgc have been described in some vertebrate groups, no exhaustive evolutionary framework has been considered so far. Methodology/Principal Findings By combining phylogenetics and genomic analysis, we find an unexpected Pgc diversity in the vertebrate sub-phylum. We were able to reconstruct gene duplication timings relative to the divergence of major vertebrate clades. Before tetrapod divergence, a single Pgc gene tandemly expanded to produce two gene lineages (Pgbc and Pgc2). These have been differentially retained in various classes. Accordingly, we find Pgc2 in sauropsids, amphibians and marsupials, but not in eutherian mammals. Pgbc was retained in amphibians, but duplicated in the ancestor of amniotes giving rise to Pgb and Pgc1. The latter was retained in mammals and probably in reptiles and marsupials but not in birds. Pgb was kept in all of the amniote clade with independent episodes of loss in some mammalian species. Lineage specific expansions of Pgc2 and Pgbc have also occurred in marsupials and amphibians respectively. We find that teleost and tetrapod Pgc genes reside in distinct genomic regions hinting at a possible translocation. Conclusions We conclude that the repertoire of Pgc genes is larger than previously reported, and that tandem duplications have modelled the history of Pgc genes. We hypothesize that gene expansion lead to functional divergence in tetrapods, coincident with the invasion of

  16. Tandem duplication within a Neurofibromatosis type I (NFI) gene exon in a family with features of Watson syndrome and Noonan syndrome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tassabehji, M.; Strachan, T.; Colley, A.

    Type 1 neurofibromatosis (NF1), Watson syndrome (WS), and Noonan syndrome (NS) show some overlap in clinical manifestations. In addition, WS has been shown to be linked to markers flanking the NF1 locus and a deletion at the NF1 locus demonstrated in a WS patient. This suggests either that WS and NF1 are allelic or the phenotypes arise from mutations in very closely linked genes. Here the authors provide evidence for the former by demonstrating a mutation in the NF1 gene in a family with features of both WS and NS. The mutation is an almost perfect in-frame tandem duplication ofmore » 42 bases in exon 28 of the NF1 gene. Unlike the mutations previously described in classical NF1, which show a preponderance of null alleles, the mutation in this family would be expected to result in a mutant neurofibromin product. 31 refs., 2 figs.« less

  17. De novo tandem duplication of chromosome segement 22q11-q12: Clinical, cytogenetic, and molecular characterization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lindsay, E.A.; Shaffer, L.G.; Carrozzo, R.

    We report on a case of duplication of the segment 22q11-q12 due to a de novo duplication. Molecular cytogenetics studies demonstrated this to be a tandem duplication, flanked proximally by the marker D22Z4, a centromeric alpha satellite DNA repeat, and distally by D22S260, an anonymous DNA marker proximal to the Ewing sarcoma breakpoint. The segment includes the regions responsible for the {open_quotes}cat-eye{close_quotes}, Di George, and velo-cardio-facial syndromes and extends distal to the breakpoint cluster region (BCR). The clinical picture is dominated by the cardiac defects and includes findings reminiscent of {open_quotes}cat-eye{close_quotes} syndrome. These findings reinforce the hypothesis that the proximalmore » 22q region contains dosage-sensitive genes involved in development. 20 refs., 3 figs.« less

  18. Evolution of Gene Duplication in Plants1[OPEN

    PubMed Central

    2016-01-01

    Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication. PMID:27288366

  19. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms.

    PubMed

    Li, Zhen; Defoort, Jonas; Tasdighian, Setareh; Maere, Steven; Van de Peer, Yves; De Smet, Riet

    2016-02-01

    Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of "gene duplicability" is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. © 2016 American Society of Plant Biologists. All rights reserved.

  20. Papain-like cysteine proteases in Carica papaya: lineage-specific gene duplication and expansion.

    PubMed

    Liu, Juan; Sharma, Anupma; Niewiara, Marie Jamille; Singh, Ratnesh; Ming, Ray; Yu, Qingyi

    2018-01-06

    Papain-like cysteine proteases (PLCPs), a large group of cysteine proteases structurally related to papain, play important roles in plant development, senescence, and defense responses. Papain, the first cysteine protease whose structure was determined by X-ray crystallography, plays a crucial role in protecting papaya from herbivorous insects. Except the four major PLCPs purified and characterized in papaya latex, the rest of the PLCPs in papaya genome are largely unknown. We identified 33 PLCP genes in papaya genome. Phylogenetic analysis clearly separated plant PLCP genes into nine subfamilies. PLCP genes are not equally distributed among the nine subfamilies and the number of PLCPs in each subfamily does not increase or decrease proportionally among the seven selected plant species. Papaya showed clear lineage-specific gene expansion in the subfamily III. Interestingly, all four major PLCPs purified from papaya latex, including papain, chymopapain, glycyl endopeptidase and caricain, were grouped into the lineage-specific expansion branch in the subfamily III. Mapping PLCP genes on chromosomes of five plant species revealed that lineage-specific expansions of PLCP genes were mostly derived from tandem duplications. We estimated divergence time of papaya PLCP genes of subfamily III. The major duplication events leading to lineage-specific expansion of papaya PLCP genes in subfamily III were estimated at 48 MYA, 34 MYA, and 16 MYA. The gene expression patterns of the papaya PLCP genes in different tissues were assessed by transcriptome sequencing and qRT-PCR. Most of the papaya PLCP genes of subfamily III expressed at high levels in leaf and green fruit tissues. Tandem duplications played the dominant role in affecting copy number of PLCPs in plants. Significant variations in size of the PLCP subfamilies among species may reflect genetic adaptation of plant species to different environments. The lineage-specific expansion of papaya PLCPs of subfamily III might

  1. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms[OPEN

    PubMed Central

    Li, Zhen; Van de Peer, Yves; De Smet, Riet

    2016-01-01

    Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of “gene duplicability” is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes. PMID:26744215

  2. Subcellular Relocalization and Positive Selection Play Key Roles in the Retention of Duplicate Genes of Populus Class III Peroxidase Family[W][OPEN

    PubMed Central

    Ren, Lin-Ling; Liu, Yan-Jing; Liu, Hai-Jing; Qian, Ting-Ting; Qi, Li-Wang; Wang, Xiao-Ru; Zeng, Qing-Yin

    2014-01-01

    Gene duplication is the primary source of new genes and novel functions. Over the course of evolution, many duplicate genes lose their function and are eventually removed by deletion. However, some duplicates have persisted and evolved diverse functions. A particular challenge is to understand how this diversity arises and whether positive selection plays a role. In this study, we reconstructed the evolutionary history of the class III peroxidase (PRX) genes from the Populus trichocarpa genome. PRXs are plant-specific enzymes that play important roles in cell wall metabolism and in response to biotic and abiotic stresses. We found that two large tandem-arrayed clusters of PRXs evolved from an ancestral cell wall type PRX to vacuole type, followed by tandem duplications and subsequent functional specification. Substitution models identified seven positively selected sites in the vacuole PRXs. These positively selected sites showed significant effects on the biochemical functions of the enzymes. We also found that positive selection acts more frequently on residues adjacent to, rather than directly at, a critical active site of the enzyme, and on flexible regions rather than on rigid structural elements of the protein. Our study provides new insights into the adaptive molecular evolution of plant enzyme families. PMID:24934172

  3. Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication.

    PubMed

    Chapman, Brad A; Bowers, John E; Feltus, Frank A; Paterson, Andrew H

    2006-02-21

    Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a "paleologous" locus than in "singleton" genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during "genomic turmoil" immediately after genome duplication but continues to act approximately 60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages.

  4. Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication

    PubMed Central

    Chapman, Brad A.; Bowers, John E.; Feltus, Frank A.; Paterson, Andrew H.

    2006-01-01

    Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a “paleologous” locus than in “singleton” genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during “genomic turmoil” immediately after genome duplication but continues to act ≈60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages. PMID:16467140

  5. Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes.

    PubMed

    Ye, Fei; Lan, Xu-E; Zhu, Wen-Bo; You, Ping

    2016-05-09

    Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects.

  6. Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes

    PubMed Central

    Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping

    2016-01-01

    Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299

  7. Quantifying the major mechanisms of recent gene duplications in the human and mouse genomes: a novel strategy to estimate gene duplication rates

    PubMed Central

    Pan, Deng; Zhang, Liqing

    2007-01-01

    Background The rate of gene duplication is an important parameter in the study of evolution, but the influence of gene conversion and technical problems have confounded previous attempts to provide a satisfying estimate. We propose a new strategy to estimate the rate that involves separate quantification of the rates of two different mechanisms of gene duplication and subsequent combination of the two rates, based on their respective contributions to the overall gene duplication rate. Results Previous estimates of gene duplication rates are based on small gene families. Therefore, to assess the applicability of this to families of all sizes, we looked at both two-copy gene families and the entire genome. We studied unequal crossover and retrotransposition, and found that these mechanisms of gene duplication are largely independent and account for a substantial amount of duplicated genes. Unequal crossover contributed more to duplications in the entire genome than retrotransposition did, but this contribution was significantly less in two-copy gene families, and duplicated genes arising from this mechanism are more likely to be retained. Combining rates of duplication using the two mechanisms, we estimated the overall rates to be from approximately 0.515 to 1.49 × 10-3 per gene per million years in human, and from approximately 1.23 to 4.23 × 10-3 in mouse. The rates estimated from two-copy gene families are always lower than those from the entire genome, and so it is not appropriate to use small families to estimate the rate for the entire genome. Conclusion We present a novel strategy for estimating gene duplication rates. Our results show that different mechanisms contribute differently to the evolution of small and large gene families. PMID:17683522

  8. Expression of HOXB genes is significantly different in acute myeloid leukemia with a partial tandem duplication of MLL vs. a MLL translocation: a cross-laboratory study.

    PubMed

    Liu, Hsi-Che; Shih, Lee-Yung; May Chen, Mei-Ju; Wang, Chien-Chih; Yeh, Ting-Chi; Lin, Tung-Huei; Chen, Chien-Yu; Lin, Chih-Jen; Liang, Der-Cherng

    2011-05-01

    In acute myeloid leukemia (AML), the mixed lineage leukemia (MLL) gene may be rearranged to generate a partial tandem duplication (PTD), or fused to partner genes through a chromosomal translocation (tMLL). In this study, we first explored the differentially expressed genes between MLL-PTD and tMLL using gene expression profiling of our cohort (15 MLL-PTD and 10 tMLL) and one published data set. The top 250 probes were chosen from each set, resulting in 29 common probes (21 unique genes) to both sets. The selected genes include four HOXB genes, HOXB2, B3, B5, and B6. The expression values of these HOXB genes significantly differ between MLL-PTD and tMLL cases. Clustering and classification analyses were thoroughly conducted to support our gene selection results. Second, as MLL-PTD, FLT3-ITD, and NPM1 mutations are identified in AML with normal karyotypes, we briefly studied their impact on the HOXB genes. Another contribution of this study is to demonstrate that using public data from other studies enriches samples for analysis and yields more conclusive results. 2011 Elsevier Inc. All rights reserved.

  9. Segmental Duplication, Microinversion, and Gene Loss Associated with a Complex Inversion Breakpoint Region in Drosophila

    PubMed Central

    Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo

    2012-01-01

    Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714

  10. Functional requirements driving the gene duplication in 12 Drosophila species.

    PubMed

    Zhong, Yan; Jia, Yanxiao; Gao, Yang; Tian, Dacheng; Yang, Sihai; Zhang, Xiaohui

    2013-08-15

    Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila.

  11. Gene duplication and the evolution of phenotypic diversity in insect societies.

    PubMed

    Chau, Linh M; Goodisman, Michael A D

    2017-12-01

    Gene duplication is an important evolutionary process thought to facilitate the evolution of phenotypic diversity. We investigated if gene duplication was associated with the evolution of phenotypic differences in a highly social insect, the honeybee Apis mellifera. We hypothesized that the genetic redundancy provided by gene duplication could promote the evolution of social and sexual phenotypes associated with advanced societies. We found a positive correlation between sociality and rate of gene duplications across the Apoidea, indicating that gene duplication may be associated with sociality. We also discovered that genes showing biased expression between A. mellifera alternative phenotypes tended to be found more frequently than expected among duplicated genes than singletons. Moreover, duplicated genes had higher levels of caste-, sex-, behavior-, and tissue-biased expression compared to singletons, as expected if gene duplication facilitated phenotypic differentiation. We also found that duplicated genes were maintained in the A. mellifera genome through the processes of conservation, neofunctionalization, and specialization, but not subfunctionalization. Overall, we conclude that gene duplication may have facilitated the evolution of social and sexual phenotypes, as well as tissue differentiation. Thus this study further supports the idea that gene duplication allows species to evolve an increased range of phenotypic diversity. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.

  12. Neutral and Non-Neutral Evolution of Duplicated Genes with Gene Conversion

    PubMed Central

    Fawcett, Jeffrey A.; Innan, Hideki

    2011-01-01

    Gene conversion is one of the major mutational mechanisms involved in the DNA sequence evolution of duplicated genes. It contributes to create unique patters of DNA polymorphism within species and divergence between species. A typical pattern is so-called concerted evolution, in which the divergence between duplicates is maintained low for a long time because of frequent exchanges of DNA fragments. In addition, gene conversion affects the DNA evolution of duplicates in various ways especially when selection operates. Here, we review theoretical models to understand the evolution of duplicates in both neutral and non-neutral cases. We also explain how these theories contribute to interpreting real polymorphism and divergence data by using some intriguing examples. PMID:24710144

  13. Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae).

    PubMed

    Baker, Richard H; Narechania, Apurva; Johns, Philip M; Wilkinson, Gerald S

    2012-08-19

    Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict.

  14. Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae)

    PubMed Central

    Baker, Richard H.; Narechania, Apurva; Johns, Philip M.; Wilkinson, Gerald S.

    2012-01-01

    Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict. PMID:22777023

  15. Identification of three duplicated Spin genes in medaka (Oryzias latipes).

    PubMed

    Wang, Xiao-Lei; Mei, Jie; Sun, Min; Hong, Yun-Han; Gui, Jian-Fang

    2005-05-09

    Gene and genomic duplications are very important and frequent events in fish evolution, and the divergence of duplicated genes in sequences and functions is a focus of research on gene evolution. Here, we report the identification and characterization of three duplicated Spindlin (Spin) genes from medaka (Oryzias latipes): OlSpinA, OlSpinB, and OlSpinC. Molecular cloning, genomic DNA Blast analysis and phylogenetic relationship analysis demonstrated that the three duplicated OlSpin genes should belong to gene duplication. Furthermore, Western blot analysis revealed significant expression differences of the three OlSpins among different tissues and during embryogenesis in medaka, and suggested that sequence and functional divergence might have occurred in evolution among them.

  16. Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution

    PubMed Central

    Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.

    2014-01-01

    Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106

  17. Regulatory divergence of homeologous Atlantic salmon elovl5 genes following the salmonid-specific whole-genome duplication.

    PubMed

    Carmona-Antoñanzas, Greta; Zheng, Xiaozhong; Tocher, Douglas R; Leaver, Michael J

    2016-10-10

    Fatty acyl elongase 5 (elovl5) is a critical enzyme in the vertebrate biosynthetic pathway which produces the physiologically essential long-chain polyunsaturated fatty acids (LC-PUFA), docosahexenoic acid (DHA), and eicosapentenoic acid (EPA) from 18 carbon fatty acids precursors. In contrast to most other vertebrates, Atlantic salmon possess two copies of elovl5 (elovl5a and elovl5b) as a result of a whole-genome duplication (WGD) which occurred at the base of the salmonid lineage. WGDs have had a major influence on vertebrate evolution, providing extra genetic material, enabling neofunctionalization to accelerate adaptation and speciation. However, little is known about the mechanisms by which such duplicated homeologous genes diverge. Here we show that homeologous Atlantic salmon elovl5a and elovl5b genes have been asymmetrically colonised by transposon-like elements. Identical locations and identities of insertions are also present in the rainbow trout duplicate elovl5 genes, but not in the nearest extant representative preduplicated teleost, the northern pike. Both elovl5 salmon duplicates possessed conserved regulatory elements that promoted Srebp1- and Srebp2-dependent transcription, and differences in the magnitude of Srebp response between promoters could be attributed to a tandem duplication of SRE and NF-Y cofactor binding sites in elovl5b. Furthermore, an insertion in the promoter region of elovl5a confers responsiveness to Lxr/Rxr transcriptional activation. Our results indicate that most, but not all, transposon mobilisation into elovl5 genes occurred after the split from the common ancestor of pike and salmon, but before more recent salmonid speciations, and that divergence of elovl5 regulatory regions have enabled neofuntionalization by promoting differential expression of these homeologous genes. Copyright © 2016 Elsevier B.V. All rights reserved.

  18. A limited role for gene duplications in the evolution of platypus venom.

    PubMed

    Wong, Emily S W; Papenfuss, Anthony T; Whittington, Camilla M; Warren, Wesley C; Belov, Katherine

    2012-01-01

    Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the "venome" of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation.

  19. A Limited Role for Gene Duplications in the Evolution of Platypus Venom

    PubMed Central

    Wong, Emily S. W.; Papenfuss, Anthony T.; Whittington, Camilla M.; Warren, Wesley C.; Belov, Katherine

    2012-01-01

    Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the “venome” of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation. PMID:21816864

  20. Isolated 46,XY gonadal dysgenesis in two sisters caused by a Xp21.2 interstitial duplication containing the DAX1 gene.

    PubMed

    Barbaro, Michela; Oscarson, Mikael; Schoumans, Jacqueline; Staaf, Johan; Ivarsson, Sten A; Wedell, Anna

    2007-08-01

    Testis development is a tightly regulated process that requires an efficient and coordinated spatiotemporal action of many factors, and it has been shown that several genes involved in gonadal development exert a dosage effect. Chromosomal imbalances have been reported in several patients presenting with gonadal dysgenesis as part of severe dysmorphic phenotypes. We screened for submicroscopic DNA copy number variations in two sisters with an apparent normal 46,XY karyotype and female external genitalia due to gonadal dysgenesis, and in which mutations in known candidate genes had been excluded. By high-resolution tiling bacterial artificial chromosome array comparative genome hybridization, a submicroscopic duplication at Xp21.2 containing DAX1 (NR0B1) was identified. Using fluorescence in situ hybridization, multiple ligation probe amplification, and PCR, the rearrangement was further characterized. This revealed a 637-kb tandem duplication that in addition to DAX1 includes the four MAGEB genes, the hypothetical gene CXorf21, GK, and part of the MAP3K7IP3 gene. Sequencing and analysis of the breakpoint boundaries and duplication junction suggest that the duplication originated through a coupled homologous and nonhomologous recombination process. This represents the first duplication on Xp21.2 identified in patients with isolated gonadal dysgenesis because all previously described XY subjects with Xp21 duplications presented with gonadal dysgenesis as part of a more complex phenotype, including mental retardation and/or malformations. Thus, our data support DAX1 as a dosage sensitive gene responsible for gonadal dysgenesis and highlight the importance of considering DAX1 locus duplications in the evaluation of all cases of 46,XY gonadal dysgenesis.

  1. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.

    PubMed

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M

    2017-08-01

    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  2. The role of tandem duplicator phenotype in tumour evolution in high-grade serous ovarian cancer.

    PubMed

    Ng, Charlotte K Y; Cooke, Susanna L; Howe, Kevin; Newman, Scott; Xian, Jian; Temple, Jillian; Batty, Elizabeth M; Pole, Jessica C M; Langdon, Simon P; Edwards, Paul A W; Brenton, James D

    2012-04-01

    High-grade serous ovarian carcinoma (HGSOC) is characterized by genomic instability, ubiquitous TP53 loss, and frequent development of platinum resistance. Loss of homologous recombination (HR) is a mutator phenotype present in 50% of HGSOCs and confers hypersensitivity to platinum treatment. We asked which other mutator phenotypes are present in HGSOC and how they drive the emergence of platinum resistance. We performed whole-genome paired-end sequencing on a model of two HGSOC cases, each consisting of a pair of cell lines established before and after clinical resistance emerged, to describe their structural variants (SVs) and to infer their ancestral genomes as the SVs present within each pair. The first case (PEO1/PEO4), with HR deficiency, acquired translocations and small deletions through its early evolution, but a revertant BRCA2 mutation restoring HR function in the resistant lineage re-stabilized its genome and reduced platinum sensitivity. The second case (PEO14/PEO23) had 216 tandem duplications and did not show evidence of HR or mismatch repair deficiency. By comparing the cell lines to the tissues from which they originated, we showed that the tandem duplicator mutator phenotype arose early in progression in vivo and persisted throughout evolution in vivo and in vitro, which may have enabled continual evolution. From the analysis of SNP array data from 454 HGSOC cases in The Cancer Genome Atlas series, we estimate that 12.8% of cases show patterns of aberrations similar to the tandem duplicator, and this phenotype is mutually exclusive with BRCA1/2 carrier mutations. Copyright © 2012 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.

  3. Gene Duplication and the Evolution of Hemoglobin Isoform Differentiation in Birds*

    PubMed Central

    Grispo, Michael T.; Natarajan, Chandrasekhar; Projecto-Garcia, Joana; Moriyama, Hideaki; Weber, Roy E.; Storz, Jay F.

    2012-01-01

    The majority of bird species co-express two functionally distinct hemoglobin (Hb) isoforms in definitive erythrocytes as follows: HbA (the major adult Hb isoform, with α-chain subunits encoded by the αA-globin gene) and HbD (the minor adult Hb isoform, with α-chain subunits encoded by the αD-globin gene). The αD-globin gene originated via tandem duplication of an embryonic α-like globin gene in the stem lineage of tetrapod vertebrates, which suggests the possibility that functional differentiation between the HbA and HbD isoforms may be attributable to a retained ancestral character state in HbD that harkens back to a primordial, embryonic function. To investigate this possibility, we conducted a combined analysis of protein biochemistry and sequence evolution to characterize the structural and functional basis of Hb isoform differentiation in birds. Functional experiments involving purified HbA and HbD isoforms from 11 different bird species revealed that HbD is characterized by a consistently higher O2 affinity in the presence of allosteric effectors such as organic phosphates and Cl− ions. In the case of both HbA and HbD, analyses of oxygenation properties under the two-state Monod-Wyman-Changeux allosteric model revealed that the pH dependence of Hb-O2 affinity stems primarily from changes in the O2 association constant of deoxy (T-state)-Hb. Ancestral sequence reconstructions revealed that the amino acid substitutions that distinguish the adult-expressed Hb isoforms are not attributable to the retention of an ancestral (pre-duplication) character state in the αD-globin gene that is shared with the embryonic α-like globin gene. PMID:22962007

  4. Gene duplication and the evolution of hemoglobin isoform differentiation in birds.

    PubMed

    Grispo, Michael T; Natarajan, Chandrasekhar; Projecto-Garcia, Joana; Moriyama, Hideaki; Weber, Roy E; Storz, Jay F

    2012-11-02

    The majority of bird species co-express two functionally distinct hemoglobin (Hb) isoforms in definitive erythrocytes as follows: HbA (the major adult Hb isoform, with α-chain subunits encoded by the α(A)-globin gene) and HbD (the minor adult Hb isoform, with α-chain subunits encoded by the α(D)-globin gene). The α(D)-globin gene originated via tandem duplication of an embryonic α-like globin gene in the stem lineage of tetrapod vertebrates, which suggests the possibility that functional differentiation between the HbA and HbD isoforms may be attributable to a retained ancestral character state in HbD that harkens back to a primordial, embryonic function. To investigate this possibility, we conducted a combined analysis of protein biochemistry and sequence evolution to characterize the structural and functional basis of Hb isoform differentiation in birds. Functional experiments involving purified HbA and HbD isoforms from 11 different bird species revealed that HbD is characterized by a consistently higher O(2) affinity in the presence of allosteric effectors such as organic phosphates and Cl(-) ions. In the case of both HbA and HbD, analyses of oxygenation properties under the two-state Monod-Wyman-Changeux allosteric model revealed that the pH dependence of Hb-O(2) affinity stems primarily from changes in the O(2) association constant of deoxy (T-state)-Hb. Ancestral sequence reconstructions revealed that the amino acid substitutions that distinguish the adult-expressed Hb isoforms are not attributable to the retention of an ancestral (pre-duplication) character state in the α(D)-globin gene that is shared with the embryonic α-like globin gene.

  5. Hypertelorism in Charcot-Marie-Tooth disease 1A from the common PMP22 duplication: A Case Report

    PubMed Central

    Finsterer, Josef

    2012-01-01

    The 1.4Mb tandem-duplication in the PMP22 gene at 17p11.2 usually manifests as hereditary sensorimotor polyneuropathy with foot deformity, sensorineural hearing-loss, moderate developmental delay, and gait disturbance. Hypertelorism and marked phenotypic variability within a single family has not been reported. In a single family, the PMP22 tandem-duplication manifested as short stature, sensorimotor polyneuropathy, tremor, ataxia, sensorineural hearing-loss, and hypothyroidism in the 27 years-old index case, as mild facial dysmorphism, muscle cramps, tinnitus, intention tremor, bradydiadochokinesia, and sensorimotor polyneuropathy in the 31 year-old half-brother of the index-patient, and as sensorimotor polyneuropathy and foot-deformity in the father of the two. The half-brother additionally presented with hypertelorism, not previously reported in PMP22 tandem-duplication carriers. The presented cases show that the tandem-duplication 17p11.2 may present with marked intra-familial phenotype variability and that mild facial dysmorphism with stuck-out ears and hypertelorism may be a rare phenotypic feature of this mutation. The causal relation between facial dysmorphism and the PMP22 tandem-duplication, however, remains speculative. PMID:22496945

  6. Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.

    PubMed

    Gout, Jean-Francois; Lynch, Michael

    2015-08-01

    Whole-genome duplications (WGDs) have contributed to gene-repertoire enrichment in many eukaryotic lineages. However, most duplicated genes are eventually lost and it is still unclear why some duplicated genes are evolutionary successful whereas others quickly turn to pseudogenes. Here, we show that dosage constraints are major factors opposing post-WGD gene loss in several Paramecium species that share a common ancestral WGD. We propose a model where a majority of WGD-derived duplicates preserve their ancestral function and are retained to produce enough of the proteins performing this same ancestral function. Under this model, the expression level of individual duplicated genes can evolve neutrally as long as they maintain a roughly constant summed expression, and this allows random genetic drift toward uneven contributions of the two copies to total expression. Our analysis suggests that once a high level of imbalance is reached, which can require substantial lengths of time, the copy with the lowest expression level contributes a small enough fraction of the total expression that selection no longer opposes its loss. Extension of our analysis to yeast species sharing a common ancestral WGD yields similar results, suggesting that duplicated-gene retention for dosage constraints followed by divergence in expression level and eventual deterministic gene loss might be a universal feature of post-WGD evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Startling Mosaicism of the Y-Chromosome and Tandem Duplication of the SRY and DAZ Genes in Patients with Turner Syndrome

    PubMed Central

    Premi, Sanjay; Srivastava, Jyoti; Panneer, Ganesan; Ali, Sher

    2008-01-01

    Presence of the human Y-chromosome in females with Turner Syndrome (TS) enhances the risk of development of gonadoblastoma besides causing several other phenotypic abnormalities. In the present study, we have analyzed the Y chromosome in 15 clinically diagnosed Turner Syndrome (TS) patients and detected high level of mosaicisms ranging from 45,XO:46,XY = 100:0% in 4; 45,XO:46,XY:46XX = 4:94:2 in 8; and 45,XO:46,XY:46XX = 50:30:20 cells in 3 TS patients, unlike previous reports showing 5–8% cells with Y- material. Also, no ring, marker or di-centric Y was observed in any of the cases. Of the two TS patients having intact Y chromosome in >85% cells, one was exceptionally tall. Both the patients were positive for SRY, DAZ, CDY1, DBY, UTY and AZFa, b and c specific STSs. Real Time PCR and FISH demonstrated tandem duplication/multiplication of the SRY and DAZ genes. At sequence level, the SRY was normal in 8 TS patients while the remaining 7 showed either absence of this gene or known and novel mutations within and outside of the HMG box. SNV/SFV analysis showed normal four copies of the DAZ genes in these 8 patients. All the TS patients showed aplastic uterus with no ovaries and no symptom of gonadoblastoma. Present study demonstrates new types of polymorphisms indicating that no two TS patients have identical genotype-phenotype. Thus, a comprehensive analysis of more number of samples is warranted to uncover consensus on the loci affected, to be able to use them as potential diagnostic markers. PMID:19030103

  8. Complexity of Gene Expression Evolution after Duplication: Protein Dosage Rebalancing

    PubMed Central

    Rogozin, Igor B.

    2014-01-01

    Ongoing debates about functional importance of gene duplications have been recently intensified by a heated discussion of the “ortholog conjecture” (OC). Under the OC, which is central to functional annotation of genomes, orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of gene ontology (GO) annotations and expression profiles, among within-species paralogs compared to orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. Subsequent studies suggested that the OC appears to be generally valid when applied to mammalian evolution but the complete picture of evolution of gene expression also has to incorporate lineage-specific aspects of paralogy. The observed complexity of gene expression evolution after duplication can be explained through selection for gene dosage effect combined with the duplication-degeneration-complementation model. This paper discusses expression divergence of recent duplications occurring before functional divergence of proteins encoded by duplicate genes. PMID:25197576

  9. Prevalent Role of Gene Features in Determining Evolutionary Fates of Whole-Genome Duplication Duplicated Genes in Flowering Plants1[W][OA

    PubMed Central

    Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi

    2013-01-01

    The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs. PMID:23396833

  10. A novel regimen for relapsed/refractory adult acute myeloid leukemia using a KMT2A partial tandem duplication targeted therapy: results of phase 1 study NCI 8485.

    PubMed

    Mims, Alice S; Mishra, Anjali; Orwick, Shelley; Blachly, James; Klisovic, Rebecca B; Garzon, Ramiro; Walker, Alison R; Devine, Steven M; Walsh, Katherine J; Vasu, Sumithira; Whitman, Susan; Marcucci, Guido; Jones, Daniel; Heerema, Nyla A; Lozanski, Gerard; Caligiuri, Michael A; Bloomfield, Clara D; Byrd, John C; Piekarz, Richard; Grever, Michael R; Blum, William

    2018-06-01

    KMT2A partial tandem duplication occurs in approximately 5-10% of patients with acute myeloid leukemia and is associated with adverse prognosis. KMT2A wild type is epigenetically silenced in KMT2A partial tandem duplication; re-expression can be induced with DNA methyltransferase and/or histone deacetylase inhibitors in vitro , sensitizing myeloid blasts to chemotherapy. We hypothesized that epigenetic silencing of KMT2A wildtype contributes to KMT2A partial tandem duplication-associated leukemogenesis and pharmacologic re-expression activates apoptotic mechanisms important for chemoresponse. We developed a regimen for this unique molecular subset, but due to relatively low frequency of KMT2A partial tandem duplication, this dose finding study was conducted in relapsed/refractory disease regardless of molecular subtype. Seventeen adults (< age 60) with relapsed/refractory acute myeloid leukemia were treated on study. Patients received decitabine 20 milligrams/meter 2 daily on days 1-10 and vorinostat 400 milligrams daily on days 5-10. Cytarabine was dose-escalated from 1.5 grams/meter 2 every 12 hours to 3 grams/meter 2 every 12 hours on days 12, 14 and 16. Two patients experienced dose limiting toxicities at dose level 1 due to prolonged myelosuppression. However, as both patients achieved complete remission after Day 42, the protocol was amended to adjust the definition of hematologic dose limiting toxicity. No further dose limiting toxicities were found. Six of 17 patients achieved complete remission including 2 of 4 patients with KMT2A partial tandem duplication. Combination therapy with decitabine, vorinostat and cytarabine was tolerated in younger relapsed/refractory acute myeloid leukemia and should be explored further focusing on the KMT2A partial tandem duplication subset. ( clinicaltrials.gov identifier 01130506 ). Copyright © 2018 Ferrata Storti Foundation.

  11. The evolution of duplicate gene expression in mammalian organs

    PubMed Central

    Guschanski, Katerina; Warnefors, Maria; Kaessmann, Henrik

    2017-01-01

    Gene duplications generate genomic raw material that allows the emergence of novel functions, likely facilitating adaptive evolutionary innovations. However, global assessments of the functional and evolutionary relevance of duplicate genes in mammals were until recently limited by the lack of appropriate comparative data. Here, we report a large-scale study of the expression evolution of DNA-based functional gene duplicates in three major mammalian lineages (placental mammals, marsupials, egg-laying monotremes) and birds, on the basis of RNA sequencing (RNA-seq) data from nine species and eight organs. We observe dynamic changes in tissue expression preference of paralogs with different duplication ages, suggesting differential contribution of paralogs to specific organ functions during vertebrate evolution. Specifically, we show that paralogs that emerged in the common ancestor of bony vertebrates are enriched for genes with brain-specific expression and provide evidence for differential forces underlying the preferential emergence of young testis- and liver-specific expressed genes. Further analyses uncovered that the overall spatial expression profiles of gene families tend to be conserved, with several exceptions of pronounced tissue specificity shifts among lineage-specific gene family expansions. Finally, we trace new lineage-specific genes that may have contributed to the specific biology of mammalian organs, including the little-studied placenta. Overall, our study provides novel and taxonomically broad evidence for the differential contribution of duplicate genes to tissue-specific transcriptomes and for their importance for the phenotypic evolution of vertebrates. PMID:28743766

  12. A novel duplication polymorphism in the FANCA promoter and its association with breast and ovarian cancer.

    PubMed

    Thompson, Ella; Dragovic, Rebecca L; Stephenson, Sally-Anne; Eccles, Diana M; Campbell, Ian G; Dobrovic, Alexander

    2005-04-29

    The FANCA gene is one of the genes in which mutations lead to Fanconi anaemia, a rare autosomal recessive disorder characterised by congenital abnormalities, bone marrow failure, and predisposition to malignancy. FANCA is also a potential breast and ovarian cancer susceptibility gene. A novel allele was identified which has a tandem duplication of a 13 base pair sequence in the promoter region. We screened germline DNA from 352 breast cancer patients, 390 ovarian cancer patients and 256 normal controls to determine if the presence of either of these two alleles was associated with an increased risk of breast or ovarian cancer. The duplication allele had a frequency of 0.34 in the normal controls. There was a non-significant decrease in the frequency of the duplication allele in breast cancer patients. The frequency of the duplication allele was significantly decreased in ovarian cancer patients. However, when malignant and benign tumours were considered separately, the decrease was only significant in benign tumours. The allele with the tandem duplication does not appear to modify breast cancer risk but may act as a low penetrance protective allele for ovarian cancer.

  13. A novel duplication polymorphism in the FANCA promoter and its association with breast and ovarian cancer

    PubMed Central

    Thompson, Ella; Dragovic, Rebecca L; Stephenson, Sally-Anne; Eccles, Diana M; Campbell, Ian G; Dobrovic, Alexander

    2005-01-01

    The FANCA gene is one of the genes in which mutations lead to Fanconi anaemia, a rare autosomal recessive disorder characterised by congenital abnormalities, bone marrow failure, and predisposition to malignancy. FANCA is also a potential breast and ovarian cancer susceptibility gene. A novel allele was identified which has a tandem duplication of a 13 base pair sequence in the promoter region. Methods We screened germline DNA from 352 breast cancer patients, 390 ovarian cancer patients and 256 normal controls to determine if the presence of either of these two alleles was associated with an increased risk of breast or ovarian cancer. Results The duplication allele had a frequency of 0.34 in the normal controls. There was a non-significant decrease in the frequency of the duplication allele in breast cancer patients. The frequency of the duplication allele was significantly decreased in ovarian cancer patients. However, when malignant and benign tumours were considered separately, the decrease was only significant in benign tumours. Conclusion The allele with the tandem duplication does not appear to modify breast cancer risk but may act as a low penetrance protective allele for ovarian cancer. PMID:15860134

  14. Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution.

    PubMed

    Acharya, Debarun; Ghosh, Tapash C

    2016-01-22

    Gene duplication is a genetic mutation that creates functionally redundant gene copies that are initially relieved from selective pressures and may adapt themselves to new functions with time. The levels of gene duplication may vary from small-scale duplication (SSD) to whole genome duplication (WGD). Studies with yeast revealed ample differences between these duplicates: Yeast WGD pairs were functionally more similar, less divergent in subcellular localization and contained a lesser proportion of essential genes. In this study, we explored the differences in evolutionary genomic properties of human SSD and WGD genes, with the identifiable human duplicates coming from the two rounds of whole genome duplication occurred early in vertebrate evolution. We observed that these two groups of duplicates were also dissimilar in terms of their evolutionary and genomic properties. But interestingly, this is not like the same observed in yeast. The human WGDs were found to be functionally less similar, diverge more in subcellular level and contain a higher proportion of essential genes than the SSDs, all of which are opposite from yeast. Additionally, we explored that human WGDs were more divergent in their gene expression profile, have higher multifunctionality and are more often associated with disease, and are evolutionarily more conserved than human SSDs. Our study suggests that human WGD duplicates are more divergent and entails the adaptation of WGDs to novel and important functions that consequently lead to their evolutionary conservation in the course of evolution.

  15. Cdx ParaHox genes acquired distinct developmental roles after gene duplication in vertebrate evolution.

    PubMed

    Marlétaz, Ferdinand; Maeso, Ignacio; Faas, Laura; Isaacs, Harry V; Holland, Peter W H

    2015-08-01

    The functional consequences of whole genome duplications in vertebrate evolution are not fully understood. It remains unclear, for instance, why paralogues were retained in some gene families but extensively lost in others. Cdx homeobox genes encode conserved transcription factors controlling posterior development across diverse bilaterians. These genes are part of the ParaHox gene cluster. Multiple Cdx copies were retained after genome duplication, raising questions about how functional divergence, overlap, and redundancy respectively contributed to their retention and evolutionary fate. We examined the degree of regulatory and functional overlap between the three vertebrate Cdx genes using single and triple morpholino knock-down in Xenopus tropicalis followed by RNA-seq. We found that one paralogue, Cdx4, has a much stronger effect on gene expression than the others, including a strong regulatory effect on FGF and Wnt genes. Functional annotation revealed distinct and overlapping roles and subtly different temporal windows of action for each gene. The data also reveal a colinear-like effect of Cdx genes on Hox genes, with repression of Hox paralogy groups 1 and 2, and activation increasing from Hox group 5 to 11. We also highlight cases in which duplicated genes regulate distinct paralogous targets revealing pathway elaboration after whole genome duplication. Despite shared core pathways, Cdx paralogues have acquired distinct regulatory roles during development. This implies that the degree of functional overlap between paralogues is relatively low and that gene expression pattern alone should be used with caution when investigating the functional evolution of duplicated genes. We therefore suggest that developmental programmes were extensively rewired after whole genome duplication in the early evolution of vertebrates.

  16. Gene duplication, silencing and expression alteration govern the molecular evolution of PRC2 genes in plants.

    PubMed

    Furihata, Hazuka Y; Suenaga, Kazuya; Kawanabe, Takahiro; Yoshida, Takanori; Kawabe, Akira

    2016-10-13

    PRC2 genes were analyzed for their number of gene duplications, d N /d S ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquired functional roles in the endosperm.

  17. Comparative inference of duplicated genes produced by polyploidization in soybean genome.

    PubMed

    Yang, Yanmei; Wang, Jinpeng; Di, Jianyong

    2013-01-01

    Soybean (Glycine max) is one of the most important crop plants for providing protein and oil. It is important to investigate soybean genome for its economic and scientific value. Polyploidy is a widespread and recursive phenomenon during plant evolution, and it could generate massive duplicated genes which is an important resource for genetic innovation. Improved sequence alignment criteria and statistical analysis are used to identify and characterize duplicated genes produced by polyploidization in soybean. Based on the collinearity method, duplicated genes by whole genome duplication account for 70.3% in soybean. From the statistical analysis of the molecular distances between duplicated genes, our study indicates that the whole genome duplication event occurred more than once in the genome evolution of soybean, which is often distributed near the ends of chromosomes.

  18. Consensus properties and their large-scale applications for the gene duplication problem.

    PubMed

    Moon, Jucheol; Lin, Harris T; Eulenstein, Oliver

    2016-06-01

    Solving the gene duplication problem is a classical approach for species tree inference from gene trees that are confounded by gene duplications. This problem takes a collection of gene trees and seeks a species tree that implies the minimum number of gene duplications. Wilkinson et al. posed the conjecture that the gene duplication problem satisfies the desirable Pareto property for clusters. That is, for every instance of the problem, all clusters that are commonly present in the input gene trees of this instance, called strict consensus, will also be found in every solution to this instance. We prove that this conjecture does not generally hold. Despite this negative result we show that the gene duplication problem satisfies a weaker version of the Pareto property where the strict consensus is found in at least one solution (rather than all solutions). This weaker property contributes to our design of an efficient scalable algorithm for the gene duplication problem. We demonstrate the performance of our algorithm in analyzing large-scale empirical datasets. Finally, we utilize the algorithm to evaluate the accuracy of standard heuristics for the gene duplication problem using simulated datasets.

  19. Divergence of Gene Body DNA Methylation and Evolution of Plant Duplicate Genes

    PubMed Central

    Wang, Jun; Marowsky, Nicholas C.; Fan, Chuanzhu

    2014-01-01

    It has been shown that gene body DNA methylation is associated with gene expression. However, whether and how deviation of gene body DNA methylation between duplicate genes can influence their divergence remains largely unexplored. Here, we aim to elucidate the potential role of gene body DNA methylation in the fate of duplicate genes. We identified paralogous gene pairs from Arabidopsis and rice (Oryza sativa ssp. japonica) genomes and reprocessed their single-base resolution methylome data. We show that methylation in paralogous genes nonlinearly correlates with several gene properties including exon number/gene length, expression level and mutation rate. Further, we demonstrated that divergence of methylation level and pattern in paralogs indeed positively correlate with their sequence and expression divergences. This result held even after controlling for other confounding factors known to influence the divergence of paralogs. We observed that methylation level divergence might be more relevant to the expression divergence of paralogs than methylation pattern divergence. Finally, we explored the mechanisms that might give rise to the divergence of gene body methylation in paralogs. We found that exonic methylation divergence more closely correlates with expression divergence than intronic methylation divergence. We show that genomic environments (e.g., flanked by transposable elements and repetitive sequences) of paralogs generated by various duplication mechanisms are associated with the methylation divergence of paralogs. Overall, our results suggest that the changes in gene body DNA methylation could provide another avenue for duplicate genes to develop differential expression patterns and undergo different evolutionary fates in plant genomes. PMID:25310342

  20. Altitudinal Variation at Duplicated β-Globin Genes in Deer Mice: Effects of Selection, Recombination, and Gene Conversion

    PubMed Central

    Storz, Jay F.; Natarajan, Chandrasekhar; Cheviron, Zachary A.; Hoffmann, Federico G.; Kelly, John K.

    2012-01-01

    Spatially varying selection on a given polymorphism is expected to produce a localized peak in the between-population component of nucleotide diversity, and theory suggests that the chromosomal extent of elevated differentiation may be enhanced in cases where tandemly linked genes contribute to fitness variation. An intriguing example is provided by the tandemly duplicated β-globin genes of deer mice (Peromyscus maniculatus), which contribute to adaptive differentiation in blood–oxygen affinity between high- and low-altitude populations. Remarkably, the two β-globin genes segregate the same pair of functionally distinct alleles due to a history of interparalog gene conversion and alleles of the same functional type are in perfect coupling-phase linkage disequilibrium (LD). Here we report a multilocus analysis of nucleotide polymorphism and LD in highland and lowland mice with different genetic backgrounds at the β-globin genes. The analysis of haplotype structure revealed a paradoxical pattern whereby perfect LD between the two β-globin paralogs (which are separated by 16.2 kb) is maintained in spite of the fact that LD within both paralogs decays to background levels over physical distances of less than 1 kb. The survey of nucleotide polymorphism revealed that elevated levels of altitudinal differentiation at each of the β-globin genes drop away quite rapidly in the external flanking regions (upstream of the 5′ paralog and downstream of the 3′ paralog), but the level of differentiation remains unexpectedly high across the intergenic region. Observed patterns of diversity and haplotype structure are difficult to reconcile with expectations of a two-locus selection model with multiplicative fitness. PMID:22042573

  1. Evolution of tuf genes: ancient duplication, differential loss and gene conversion.

    PubMed

    Lathe, W C; Bork, P

    2001-08-03

    The tuf gene of eubacteria, encoding the EF-tu elongation factor, was duplicated early in the evolution of the taxon. Phylogenetic and genomic location analysis of 20 complete eubacterial genomes suggests that this ancient duplication has been differentially lost and maintained in eubacteria.

  2. Levels of duplicate gene expression in armoured catfishes.

    PubMed

    Dunham, R A; Philipp, D P; Whitt, G S

    1980-01-01

    Species of armoured catfishes differ significantly in their cellular DNA content and chromosome number. Starch gel electrophoresis of isozymes was used to determine whether each of 16 enzyme loci was expressed in a single or duplicate state. The percent of enzyme loci exhibiting duplicate locus expression in Corydoras aeneus, Corydoras julii, Corydoras melanistius, and Corydoras myersi was 37.5 percent, 18.75 percent, 12.5 percent, and 6.25 percent, respectively. The percentage of loci expressed in duplicate is higher in the species with higher haploid DNA contents, which are 4.4 pg, 3.0 pg, and 2.3 pg, respectively. These differences in DNA contents are also associated with differences in chromosome number. These data are consistent with the hypothesis that increases in DNA contents and enzyme loci occur both by tetraploidization and by regional gene duplication and that these increases are then followed by a partial loss of DNA and a reduction in the number of the duplicate isozyme loci expressed. Such analyses provide insight into the mechanisms of genome amplification and reduction as well as insights into the fats of duplicate genes.

  3. Inferring evolution of gene duplicates using probabilistic models and nonparametric belief propagation.

    PubMed

    Zeng, Jia; Hannenhalli, Sridhar

    2013-01-01

    Gene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. In turn, gene expression evolution is a critical component of overall functional evolution of paralogs. Inferring evolutionary history of gene expression among paralogs is therefore a problem of considerable interest. It also represents significant challenges. The standard approaches of evolutionary reconstruction assume that at an internal node of the duplication tree, the two duplicates evolve independently. However, because of various selection pressures functional evolution of the two paralogs may be coupled. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF). Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. These two interrelated problems of inferring gene expression and evolutionary fates of gene duplicates have not been studied together previously and motivate the present study. Here we propose a novel probabilistic framework and algorithm to simultaneously infer (i) ancestral gene expression and (ii) the likely fate (SF, NF, CF) at each duplication event during the evolution of gene family. Using tissue-specific gene expression data, we develop a nonparametric belief propagation (NBP) algorithm to predict the ancestral expression level as a proxy for function, and describe a novel probabilistic model that relates the predicted and known expression levels to the possible evolutionary fates. We validate our model using simulation and then apply it to a genome-wide set of gene duplicates in human. Our results suggest that SF tends to be more frequent at the earlier stage of gene family expansion, while NF occurs more frequently later on.

  4. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene.

    PubMed

    Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil

    2007-11-29

    Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains.

  5. Duplicated genes evolve independently in allopolyploid cotton.

    Treesearch

    Richard C. Cronn; Randall L. Small; Jonathan F. Wendel

    1999-01-01

    Of the many processes that generate gene duplications, polyploidy is unique in that entire genomes are duplicated. This process has been important in the evolution of many eukaryotic groups, and it occurs with high frequency in plants. Recent evidence suggests that polyploidization may be accompanied by rapid genomic changes, but the evolutionary fate of discrete loci...

  6. The origins and impact of primate segmental duplications.

    PubMed

    Marques-Bonet, Tomas; Girirajan, Santhosh; Eichler, Evan E

    2009-10-01

    Duplicated sequences are substrates for the emergence of new genes and are an important source of genetic instability associated with rare and common diseases. Analyses of primate genomes have shown an increase in the proportion of interspersed segmental duplications (SDs) within the genomes of humans and great apes. This contrasts with other mammalian genomes that seem to have their recently duplicated sequences organized in a tandem configuration. In this review, we focus on the mechanistic origin and impact of this difference with respect to evolution, genetic diversity and primate phenotype. Although many genomes will be sequenced in the future, resolution of this aspect of genomic architecture still requires high quality sequences and detailed analyses.

  7. Biased exonization of transposed elements in duplicated genes: A lesson from the TIF-IA gene

    PubMed Central

    Amit, Maayan; Sela, Noa; Keren, Hadas; Melamed, Ze'ev; Muler, Inna; Shomron, Noam; Izraeli, Shai; Ast, Gil

    2007-01-01

    Background Gene duplication and exonization of intronic transposed elements are two mechanisms that enhance genomic diversity. We examined whether there is less selection against exonization of transposed elements in duplicated genes than in single-copy genes. Results Genome-wide analysis of exonization of transposed elements revealed a higher rate of exonization within duplicated genes relative to single-copy genes. The gene for TIF-IA, an RNA polymerase I transcription initiation factor, underwent a humanoid-specific triplication, all three copies of the gene are active transcriptionally, although only one copy retains the ability to generate the TIF-IA protein. Prior to TIF-IA triplication, an Alu element was inserted into the first intron. In one of the non-protein coding copies, this Alu is exonized. We identified a single point mutation leading to exonization in one of the gene duplicates. When this mutation was introduced into the TIF-IA coding copy, exonization was activated and the level of the protein-coding mRNA was reduced substantially. A very low level of exonization was detected in normal human cells. However, this exonization was abundant in most leukemia cell lines evaluated, although the genomic sequence is unchanged in these cancerous cells compared to normal cells. Conclusion The definition of the Alu element within the TIF-IA gene as an exon is restricted to certain types of cancers; the element is not exonized in normal human cells. These results further our understanding of the delicate interplay between gene duplication and alternative splicing and of the molecular evolutionary mechanisms leading to genetic innovations. This implies the existence of purifying selection against exonization in single copy genes, with duplicate genes free from such constrains. PMID:18047649

  8. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates

    PubMed Central

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-01-01

    Abstract The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. PMID:28981708

  9. Circular DNA Intermediate in the Duplication of Nile Tilapia vasa Genes

    PubMed Central

    Fujimura, Koji; Conte, Matthew A.; Kocher, Thomas D.

    2011-01-01

    vasa is a highly conserved RNA helicase involved in animal germ cell development. Among vertebrate species, it is typically present as a single copy per genome. Here we report the isolation and sequencing of BAC clones for Nile tilapia vasa genes. Contrary to a previous report that Nile tilapia have a single copy of the vasa gene, we find evidence for at least three vasa gene loci. The vasa gene locus was duplicated from the original site and integrated into two distant novel sites. For one of these insertions we find evidence that the duplication was mediated by a circular DNA intermediate. This mechanism of gene duplication may explain the origin of isolated gene duplicates during the evolution of fish genomes. These data provide a foundation for studying the role of multiple vasa genes in the development of tilapia gonads, and will contribute to investigations of the molecular mechanisms of sex determination and evolution in cichlid fishes. PMID:22216289

  10. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

    PubMed

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-03-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.

  11. Genome Mutational and Transcriptional Hotspots Are Traps for Duplicated Genes and Sources of Adaptations.

    PubMed

    Fares, Mario A; Sabater-Muñoz, Beatriz; Toft, Christina

    2017-05-01

    Gene duplication generates new genetic material, which has been shown to lead to major innovations in unicellular and multicellular organisms. A whole-genome duplication occurred in the ancestor of Saccharomyces yeast species but 92% of duplicates returned to single-copy genes shortly after duplication. The persisting duplicated genes in Saccharomyces led to the origin of major metabolic innovations, which have been the source of the unique biotechnological capabilities in the Baker's yeast Saccharomyces cerevisiae. What factors have determined the fate of duplicated genes remains unknown. Here, we report the first demonstration that the local genome mutation and transcription rates determine the fate of duplicates. We show, for the first time, a preferential location of duplicated genes in the mutational and transcriptional hotspots of S. cerevisiae genome. The mechanism of duplication matters, with whole-genome duplicates exhibiting different preservation trends compared to small-scale duplicates. Genome mutational and transcriptional hotspots are rich in duplicates with large repetitive promoter elements. Saccharomyces cerevisiae shows more tolerance to deleterious mutations in duplicates with repetitive promoter elements, which in turn exhibit higher transcriptional plasticity against environmental perturbations. Our data demonstrate that the genome traps duplicates through the accelerated regulatory and functional divergence of their gene copies providing a source of novel adaptations in yeast. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Gene and domain duplication in the chordate Otx gene family: insights from amphioxus Otx.

    PubMed

    Williams, N A; Holland, P W

    1998-05-01

    We report the genomic organization and deduced protein sequence of a cephalochordate member of the Otx homeobox gene family (AmphiOtx) and show its probable single-copy state in the genome. We also present molecular phylogenetic analysis indicating that there was single ancestral Otx gene in the first chordates which was duplicated in the vertebrate lineage after it had split from the lineage leading to the cephalochordates. Duplication of a C-terminal protein domain has occurred specifically in the vertebrate lineage, strengthening the case for a single Otx gene in an ancestral chordate whose gene structure has been retained in an extant cephalochordate. Comparative analysis of protein sequences and published gene expression patterns suggest that the ancestral chordate Otx gene had roles in patterning the anterior mesendoderm and central nervous system. These roles were elaborated following Otx gene duplication in vertebrates, accompanied by regulatory and structural divergence, particularly of Otx1 descendant genes.

  13. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

    PubMed

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-11-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Models for loosely linked gene duplicates suggest lengthy persistence of both copies.

    PubMed

    O'Hely, Martin; Wockner, Leesa

    2007-06-21

    Consider the appearance of a duplicate copy of a gene at a locus linked loosely, if at all, to the locus at which the gene is usually found. If all copies of the gene are subject to non-functionalizing mutations, then two fates are possible: loss of functional copies at the duplicate locus (loss of duplicate expression), or loss of functional copies at the original locus (map change). This paper proposes a simple model to address the probability of map change, the time taken for a map change and/or loss of duplicate expression, and considers where in the spectrum between loss of duplicate expression and map change such a duplicate complex is likely to be found. The findings are: the probability of map change is always half the reciprocal of the population size N, the time for a map change to occur is order NlogN generations, and that there is a marked tendency for duplicates to remain near equi-frequency with the gene at the original locus for a large portion of that time. This is in excellent agreement with simulations.

  15. Evolutionary history and functional divergence of the cytochrome P450 gene superfamily between Arabidopsis thaliana and Brassica species uncover effects of whole genome and tandem duplications.

    PubMed

    Yu, Jingyin; Tehrim, Sadia; Wang, Linhai; Dossa, Komivi; Zhang, Xiurong; Ke, Tao; Liao, Boshou

    2017-09-18

    The cytochrome P450 monooxygenase (P450) superfamily is involved in the biosynthesis of various primary and secondary metabolites. However, little is known about the effects of whole genome duplication (WGD) and tandem duplication (TD) events on the evolutionary history and functional divergence of P450s in Brassica after splitting from a common ancestor with Arabidopsis thaliana. Using Hidden Markov Model search and manual curation, we detected that Brassica species have nearly 1.4-fold as many P450 members as A. thaliana. Most P450s in A. thaliana and Brassica species were located on pseudo-chromosomes. The inferred phylogeny indicated that all P450s were clustered into two different subgroups. Analysis of WGD event revealed that different P450 gene families had appeared after evolutionary events of species. For the TD event analyses, the P450s from TD events in Brassica species can be divided into ancient and recent parts. Our comparison of influence of WGD and TD events on the P450 gene superfamily between A. thaliana and Brassica species indicated that the family-specific evolution in the Brassica lineage can be attributed to both WGD and TD, whereas WGD was recognized as the major mechanism for the recent evolution of the P450 super gene family. Expression analysis of P450s from A. thaliana and Brassica species indicated that WGD-type P450s showed the same expression pattern but completely different expression with TD-type P450s across different tissues in Brassica species. Selection force analysis suggested that P450 orthologous gene pairs between A. thaliana and Brassica species underwent negative selection, but no significant differences were found between P450 orthologous gene pairs in A. thaliana-B. rapa and A. thaliana-B. oleracea lineages, as well as in different subgenomes in B. rapa or B. oleracea compared with A. thaliana. This study is the first to investigate the effects of WGD and TD on the evolutionary history and functional divergence of P450

  16. Long-Read Single Molecule Sequencing to Resolve Tandem Gene Copies: The Mst77Y Region on the Drosophila melanogaster Y Chromosome

    PubMed Central

    Krsticevic, Flavia J.; Schrago, Carlos G.; Carvalho, A. Bernardo

    2015-01-01

    The autosomal gene Mst77F of Drosophila melanogaster is essential for male fertility. In 2010, Krsticevic et al. (Genetics 184: 295−307) found 18 Y-linked copies of Mst77F (“Mst77Y”), which collectively account for 20% of the functional Mst77F-like mRNA. The Mst77Y genes were severely misassembled in the then-available genome assembly and were identified by cloning and sequencing polymerase chain reaction products. The genomic structure of the Mst77Y region and the possible existence of additional copies remained unknown. The recent publication of two long-read assemblies of D. melanogaster prompted us to reinvestigate this challenging region of the Y chromosome. We found that the Illumina Synthetic Long Reads assembly failed in the Mst77Y region, most likely because of its tandem duplication structure. The PacBio MHAP assembly of the Mst77Y region seems to be very accurate, as revealed by comparisons with the previously found Mst77Y genes, a bacterial artificial chromosome sequence, and Illumina reads of the same strain. We found that the Mst77Y region spans 96 kb and originated from a 3.4-kb transposition from chromosome 3L to the Y chromosome, followed by tandem duplications inside the Y chromosome and invasion of transposable elements, which account for 48% of its length. Twelve of the 18 Mst77Y genes found in 2010 were confirmed in the PacBio assembly, the remaining six being polymerase chain reaction−induced artifacts. There are several identical copies of some Mst77Y genes, coincidentally bringing the total copy number to 18. Besides providing a detailed picture of the Mst77Y region, our results highlight the utility of PacBio technology in assembling difficult genomic regions such as tandemly repeated genes. PMID:25858959

  17. Evolutionary origins of a novel host plant detoxification gene in butterflies.

    PubMed

    Fischer, Hanna M; Wheat, Christopher W; Heckel, David G; Vogel, Heiko

    2008-05-01

    Chemical interactions between plants and their insect herbivores provide an excellent opportunity to study the evolution of species interactions on a molecular level. Here, we investigate the molecular evolutionary events that gave rise to a novel detoxifying enzyme (nitrile-specifier protein [NSP]) in the butterfly family Pieridae, previously identified as a coevolutionary key innovation. By generating and sequencing expressed sequence tags, genomic libraries, and screening databases we found NSP to be a member of an insect-specific gene family, which we characterized and named the NSP-like gene family. Members consist of variable tandem repeats, are gut expressed, and are found across Insecta evolving in a dynamic, ongoing birth-death process. In the Lepidoptera, multiple copies of single-domain major allergen genes are present and originate via tandem duplications. Multiple domain genes are found solely within the brassicaceous-feeding Pieridae butterflies, one of them being NSP and another called major allergen (MA). Analyses suggest that NSP and its paralog MA have a unique single-domain evolutionary origin, being formed by intragenic domain duplication followed by tandem whole-gene duplication. Duplicates subsequently experienced a period of relaxed constraint followed by an increase in constraint, perhaps after neofunctionalization. NSP and its ortholog MA are still experiencing high rates of change, reflecting a dynamic evolution consistent with the known role of NSP in plant-insect interactions. Our results provide direct evidence to the hypothesis that gene duplication is one of the driving forces for speciation and adaptation, showing that both within- and whole-gene tandem duplications are a powerful force underlying evolutionary adaptation.

  18. Evolution of a Novel Antiviral Immune-Signaling Interaction by Partial-Gene Duplication

    PubMed Central

    Korithoski, Bryan; Kolaczkowski, Oralia; Mukherjee, Krishanu; Kola, Reema; Earl, Chandra; Kolaczkowski, Bryan

    2015-01-01

    The RIG-like receptors (RLRs) are related proteins that identify viral RNA in the cytoplasm and activate cellular immune responses, primarily through direct protein-protein interactions with the signal transducer, IPS1. Although it has been well established that the RLRs, RIG-I and MDA5, activate IPS1 through binding between the twin caspase activation and recruitment domains (CARDs) on the RLR and a homologous CARD on IPS1, it is less clear which specific RLR CARD(s) are required for this interaction, and almost nothing is known about how the RLR-IPS1 interaction evolved. In contrast to what has been observed in the presence of immune-modulating K63-linked polyubiquitin, here we show that—in the absence of ubiquitin—it is the first CARD domain of human RIG-I and MDA5 (CARD1) that binds directly to IPS1 CARD, and not the second (CARD2). Although the RLRs originated in the earliest animals, both the IPS1 gene and the twin-CARD domain architecture of RIG-I and MDA5 arose much later in the deuterostome lineage, probably through a series of tandem partial-gene duplication events facilitated by tight clustering of RLRs and IPS1 in the ancestral deuterostome genome. Functional differentiation of RIG-I CARD1 and CARD2 appears to have occurred early during this proliferation of RLR and related CARDs, potentially driven by adaptive coevolution between RIG-I CARD domains and IPS1 CARD. However, functional differentiation of MDA5 CARD1 and CARD2 occurred later. These results fit a general model in which duplications of protein-protein interaction domains into novel gene contexts could facilitate the expansion of signaling networks and suggest a potentially important role for functionally-linked gene clusters in generating novel immune-signaling pathways. PMID:26356745

  19. Autosomal Genes of Autosomal/X-Linked Duplicated Gene Pairs and Germ-Line Proliferation in Caenorhabditis elegans

    PubMed Central

    Maciejowski, John; Ahn, James Hyungsoo; Cipriani, Patricia Giselle; Killian, Darrell J.; Chaudhary, Aisha L.; Lee, Ji Inn; Voutev, Roumen; Johnsen, Robert C.; Baillie, David L.; Gunsalus, Kristin C.; Fitch, David H. A.; Hubbard, E. Jane Albert

    2005-01-01

    We report molecular genetic studies of three genes involved in early germ-line proliferation in Caenorhabditis elegans that lend unexpected insight into a germ-line/soma functional separation of autosomal/X-linked duplicated gene pairs. In a genetic screen for germ-line proliferation-defective mutants, we identified mutations in rpl-11.1 (L11 protein of the large ribosomal subunit), pab-1 [a poly(A)-binding protein], and glp-3/eft-3 (an elongation factor 1-α homolog). All three are members of autosome/X gene pairs. Consistent with a germ-line-restricted function of rpl-11.1 and pab-1, mutations in these genes extend life span and cause gigantism. We further examined the RNAi phenotypes of the three sets of rpl genes (rpl-11, rpl-24, and rpl-25) and found that for the two rpl genes with autosomal/X-linked pairs (rpl-11 and rpl-25), zygotic germ-line function is carried by the autosomal copy. Available RNAi results for highly conserved autosomal/X-linked gene pairs suggest that other duplicated genes may follow a similar trend. The three rpl and the pab-1/2 duplications predate the divergence between C. elegans and C. briggsae, while the eft-3/4 duplication appears to have occurred in the lineage to C. elegans after it diverged from C. briggsae. The duplicated C. briggsae orthologs of the three C. elegans autosomal/X-linked gene pairs also display functional differences between paralogs. We present hypotheses for evolutionary mechanisms that may underlie germ-line/soma subfunctionalization of duplicated genes, taking into account the role of X chromosome silencing in the germ line and analogous mammalian phenomena. PMID:15687263

  20. The Natural History of Class I Primate Alcohol Dehydrogenases Includes Gene Duplication, Gene Loss, and Gene Conversion

    PubMed Central

    Carrigan, Matthew A.; Uryasev, Oleg; Davis, Ross P.; Zhai, LanMin; Hurley, Thomas D.; Benner, Steven A.

    2012-01-01

    Background Gene duplication is a source of molecular innovation throughout evolution. However, even with massive amounts of genome sequence data, correlating gene duplication with speciation and other events in natural history can be difficult. This is especially true in its most interesting cases, where rapid and multiple duplications are likely to reflect adaptation to rapidly changing environments and life styles. This may be so for Class I of alcohol dehydrogenases (ADH1s), where multiple duplications occurred in primate lineages in Old and New World monkeys (OWMs and NWMs) and hominoids. Methodology/Principal Findings To build a preferred model for the natural history of ADH1s, we determined the sequences of nine new ADH1 genes, finding for the first time multiple paralogs in various prosimians (lemurs, strepsirhines). Database mining then identified novel ADH1 paralogs in both macaque (an OWM) and marmoset (a NWM). These were used with the previously identified human paralogs to resolve controversies relating to dates of duplication and gene conversion in the ADH1 family. Central to these controversies are differences in the topologies of trees generated from exonic (coding) sequences and intronic sequences. Conclusions/Significance We provide evidence that gene conversions are the primary source of difference, using molecular clock dating of duplications and analyses of microinsertions and deletions (micro-indels). The tree topology inferred from intron sequences appear to more correctly represent the natural history of ADH1s, with the ADH1 paralogs in platyrrhines (NWMs) and catarrhines (OWMs and hominoids) having arisen by duplications shortly predating the divergence of OWMs and NWMs. We also conclude that paralogs in lemurs arose independently. Finally, we identify errors in database interpretation as the source of controversies concerning gene conversion. These analyses provide a model for the natural history of ADH1s that posits four ADH1 paralogs in

  1. Gene Duplication, Population Genomics, and Species-Level Differentiation within a Tropical Mountain Shrub

    PubMed Central

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H.; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C.

    2014-01-01

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. PMID:25223767

  2. Whole-genome sequencing in patients with ciliopathies uncovers a novel recurrent tandem duplication in IFT140.

    PubMed

    Geoffroy, Véronique; Stoetzel, Corinne; Scheidecker, Sophie; Schaefer, Elise; Perrault, Isabelle; Bär, Séverine; Kröll, Ariane; Delbarre, Marion; Antin, Manuela; Leuvrey, Anne-Sophie; Henry, Charline; Blanché, Hélène; Decker, Eva; Kloth, Katja; Klaus, Günter; Mache, Christoph; Martin-Coignard, Dominique; McGinn, Steven; Boland, Anne; Deleuze, Jean-François; Friant, Sylvie; Saunier, Sophie; Rozet, Jean-Michel; Bergmann, Carsten; Dollfus, Hélène; Muller, Jean

    2018-04-24

    Ciliopathies represent a wide spectrum of rare diseases with overlapping phenotypes and a high genetic heterogeneity. Among those, IFT140 is implicated in a variety of phenotypes ranging from isolated retinis pigmentosa to more syndromic cases. Using whole-genome sequencing in patients with uncharacterized ciliopathies, we identified a novel recurrent tandem duplication of exon 27-30 (6.7 kb) in IFT140, c.3454-488_4182+2588dup p.(Tyr1152_Thr1394dup), missed by whole-exome sequencing. Pathogenicity of the mutation was assessed on the patients' skin fibroblasts. Several hundreds of patients with a ciliopathy phenotype were screened and biallelic mutations were identified in 11 families representing 12 pathogenic variants of which seven are novel. Among those unrelated families especially with a Mainzer-Saldino syndrome, eight carried the same tandem duplication (two at the homozygous state and six at the heterozygous state). In conclusion, we demonstrated the implication of structural variations in IFT140-related diseases expanding its mutation spectrum. We also provide evidences for a unique genomic event mediated by an Alu-Alu recombination occurring on a shared haplotype. We confirm that whole-genome sequencing can be instrumental in the ability to detect structural variants for genomic disorders. © 2018 Wiley Periodicals, Inc.

  3. Successful treatment of post-transplant relapsed acute myeloid leukemia with FLT3 internal tandem duplication using the combination of induction chemotherapy, donor lymphocyte infusion, sorafenib and azacitidine. Report of three cases

    PubMed Central

    Campregher, Paulo Vidal; de Mattos, Vinicius Renan Pinto; Salvino, Marco Aurélio; Santos, Fabio Pires de Souza; Hamerschlak, Nelson

    2017-01-01

    ABSTRACT Acute myeloid leukemia is a hematopoietic stem cell neoplastic disease associated with high morbidity and mortality. The presence of FLT3 internal tandem duplication mutations leads to high rates of relapse and decreased overall survival. Patients with FLT3 internal tandem duplication are normally treated with hematopoietic stem cell transplantation in first complete remission. Nevertheless, the incidence of post-transplant relapse is considerable in this group of patients, and the management of this clinical condition is challenging. The report describes the outcomes of patients with FLT3 internal tandem duplication positive acute myeloid leukemia who relapsed after allogeneic hematopoietic stem cell transplantation and were treated with the combination of re-induction chemotherapy, donor lymphocyte infusion, sorafenib and azacitidine. Three cases are described and all patients achieved prolonged complete remission with the combined therapy. The combination of induction chemotherapy followed by donor lymphocyte infusion, and the maintenance with azacitidine and sorafenib can be effective approaches in the treatment of post-hematopoietic stem cell transplant and relapsed FLT3 internal tandem duplication positive acute myeloid leukemia patients. This strategy should be further explored in the context of clinical trials. PMID:28746590

  4. Gene duplication, population genomics, and species-level differentiation within a tropical mountain shrub.

    PubMed

    Mastretta-Yanes, Alicia; Zamudio, Sergio; Jorgensen, Tove H; Arrigo, Nils; Alvarez, Nadir; Piñero, Daniel; Emerson, Brent C

    2014-09-14

    Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Gene duplication in the major insecticide target site, Rdl, in Drosophila melanogaster

    PubMed Central

    Remnant, Emily J.; Good, Robert T.; Schmidt, Joshua M.; Lumb, Christopher; Robin, Charles; Daborn, Phillip J.; Batterham, Philip

    2013-01-01

    The Resistance to Dieldrin gene, Rdl, encodes a GABA-gated chloride channel subunit that is targeted by cyclodiene and phenylpyrazole insecticides. The gene was first characterized in Drosophila melanogaster by genetic mapping of resistance to the cyclodiene dieldrin. The 4,000-fold resistance observed was due to a single amino acid replacement, Ala301 to Ser. The equivalent change was subsequently identified in Rdl orthologs of a large range of resistant insect species. Here, we report identification of a duplication at the Rdl locus in D. melanogaster. The 113-kb duplication contains one WT copy of Rdl and a second copy with two point mutations: an Ala301 to Ser resistance mutation and Met360 to Ile replacement. Individuals with this duplication exhibit intermediate dieldrin resistance compared with single copy Ser301 homozygotes, reduced temperature sensitivity, and altered RNA editing associated with the resistant allele. Ectopic recombination between Roo transposable elements is involved in generating this genomic rearrangement. The duplication phenotypes were confirmed by construction of a transgenic, artificial duplication integrating the 55.7-kb Rdl locus with a Ser301 change into an Ala301 background. Gene duplications can contribute significantly to the evolution of insecticide resistance, most commonly by increasing the amount of gene product produced. Here however, duplication of the Rdl target site creates permanent heterozygosity, providing unique potential for adaptive mutations to accrue in one copy, without abolishing the endogenous role of an essential gene. PMID:23959864

  6. Evolution of vertebrate central nervous system is accompanied by novel expression changes of duplicate genes.

    PubMed

    Chen, Yuan; Ding, Yun; Zhang, Zuming; Wang, Wen; Chen, Jun-Yuan; Ueno, Naoto; Mao, Bingyu

    2011-12-20

    The evolution of the central nervous system (CNS) is one of the most striking changes during the transition from invertebrates to vertebrates. As a major source of genetic novelties, gene duplication might play an important role in the functional innovation of vertebrate CNS. In this study, we focused on a group of CNS-biased genes that duplicated during early vertebrate evolution. We investigated the tempo-spatial expression patterns of 33 duplicate gene families and their orthologs during the embryonic development of the vertebrate Xenopus laevis and the cephalochordate Brachiostoma belcheri. Almost all the identified duplicate genes are differentially expressed in the CNS in Xenopus embryos, and more than 50% and 30% duplicate genes are expressed in the telencephalon and mid-hindbrain boundary, respectively, which are mostly considered as two innovations in the vertebrate CNS. Interestingly, more than 50% of the amphioxus orthologs do not show apparent expression in the CNS in amphioxus embryos as detected by in situ hybridization, indicating that some of the vertebrate CNS-biased duplicate genes might arise from non-CNS genes in invertebrates. Our data accentuate the functional contribution of gene duplication in the CNS evolution of vertebrate and uncover an invertebrate non-CNS history for some vertebrate CNS-biased duplicate genes. Copyright © 2011. Published by Elsevier Ltd.

  7. Extensive Local Gene Duplication and Functional Divergence among Paralogs in Atlantic Salmon

    PubMed Central

    Warren, Ian A.; Ciborowski, Kate L.; Casadei, Elisa; Hazlerigg, David G.; Martin, Sam; Jordan, William C.; Sumner, Seirian

    2014-01-01

    Many organisms can generate alternative phenotypes from the same genome, enabling individuals to exploit diverse and variable environments. A prevailing hypothesis is that such adaptation has been favored by gene duplication events, which generate redundant genomic material that may evolve divergent functions. Vertebrate examples of recent whole-genome duplications are sparse although one example is the salmonids, which have undergone a whole-genome duplication event within the last 100 Myr. The life-cycle of the Atlantic salmon, Salmo salar, depends on the ability to produce alternating phenotypes from the same genome, to facilitate migration and maintain its anadromous life history. Here, we investigate the hypothesis that genome-wide and local gene duplication events have contributed to the salmonid adaptation. We used high-throughput sequencing to characterize the transcriptomes of three key organs involved in regulating migration in S. salar: Brain, pituitary, and olfactory epithelium. We identified over 10,000 undescribed S. salar sequences and designed an analytic workflow to distinguish between paralogs originating from local gene duplication events or from whole-genome duplication events. These data reveal that substantial local gene duplications took place shortly after the whole-genome duplication event. Many of the identified paralog pairs have either diverged in function or become noncoding. Future functional genomics studies will reveal to what extent this rich source of divergence in genetic sequence is likely to have facilitated the evolution of extreme phenotypic plasticity required for an anadromous life-cycle. PMID:24951567

  8. Dating and functional characterization of duplicated genes in the apple (Malus domestica Borkh.) by analyzing EST data.

    PubMed

    Sanzol, Javier

    2010-05-14

    Gene duplication is central to genome evolution. In plants, genes can be duplicated through small-scale events and large-scale duplications often involving polyploidy. The apple belongs to the subtribe Pyrinae (Rosaceae), a diverse lineage that originated via allopolyploidization. Both small-scale duplications and polyploidy may have been important mechanisms shaping the genome of this species. This study evaluates the gene duplication and polyploidy history of the apple by characterizing duplicated genes in this species using EST data. Overall, 68% of the apple genes were clustered into families with a mean copy-number of 4.6. Analysis of the age distribution of gene duplications supported a continuous mode of small-scale duplications, plus two episodes of large-scale duplicates of vastly different ages. The youngest was consistent with the polyploid origin of the Pyrinae 37-48 MYBP, whereas the older may be related to gamma-triplication; an ancient hexapolyploidization previously characterized in the four sequenced eurosid genomes and basal to the eurosid-asterid divergence. Duplicated genes were studied for functional diversification with an emphasis on young paralogs; those originated during or after the formation of the Pyrinae lineage. Unequal assignment of single-copy genes and gene families to Gene Ontology categories suggested functional bias in the pattern of gene retention of paralogs. Young paralogs related to signal transduction, metabolism, and energy pathways have been preferentially retained. Non-random retention of duplicated genes seems to have mediated the expansion of gene families, some of which may have substantially increased their members after the origin of the Pyrinae. The joint analysis of over-duplicated functional categories and phylogenies, allowed evaluation of the role of both polyploidy and small-scale duplications during this process. Finally, gene expression analysis indicated that 82% of duplicated genes, including 80% of young

  9. Evolution dynamics of a model for gene duplication under adaptive conflict

    NASA Astrophysics Data System (ADS)

    Ancliff, Mark; Park, Jeong-Man

    2014-06-01

    We present and solve the dynamics of a model for gene duplication showing escape from adaptive conflict. We use a Crow-Kimura quasispecies model of evolution where the fitness landscape is a function of Hamming distances from two reference sequences, which are assumed to optimize two different gene functions, to describe the dynamics of a mixed population of individuals with single and double copies of a pleiotropic gene. The evolution equations are solved through a spin coherent state path integral, and we find two phases: one is an escape from an adaptive conflict phase, where each copy of a duplicated gene evolves toward subfunctionalization, and the other is a duplication loss of function phase, where one copy maintains its pleiotropic form and the other copy undergoes neutral mutation. The phase is determined by a competition between the fitness benefits of subfunctionalization and the greater mutational load associated with maintaining two gene copies. In the escape phase, we find a dynamics of an initial population of single gene sequences only which escape adaptive conflict through gene duplication and find that there are two time regimes: until a time t* single gene sequences dominate, and after t* double gene sequences outgrow single gene sequences. The time t* is identified as the time necessary for subfunctionalization to evolve and spread throughout the double gene sequences, and we show that there is an optimum mutation rate which minimizes this time scale.

  10. Duplicated growth hormone genes in a passerine bird, the jungle crow (Corvus macrorhynchos).

    PubMed

    Arai, Natsumi; Iigo, Masayuki

    2010-07-02

    Molecular cloning, molecular phylogeny, gene structure and expression analyses of growth hormone (GH) were performed in a passerine bird, the jungle crow (Corvus macrorhynchos). Unexpectedly, duplicated GH cDNA and genes were identified and designated as GH1A and GH1B. In silico analyses identified the zebra finch orthologs. Both GH genes encode 217 amino acid residues and consist of five exons and four introns, spanning 5.2 kbp in GH1A and 4.2 kbp in GH1B. Predicted GH proteins of the jungle crow and zebra finch contain four conserved cysteine residues, suggesting duplicated GH genes are functional. Molecular phylogenetic analysis revealed that duplication of GH genes occur after divergence of the passerine lineage from the other avian orders as has been suggested from partial genomic DNA sequences of passerine GH genes. RT-PCR analyses confirmed expression of GH1A and GH1B in the pituitary gland. In addition, GH1A gene is expressed in all the tissues examined. However, expression of GH1B is confined to several brain areas and blood cells. These results indicate that the regulatory mechanisms of duplicated GH genes are different and that duplicated GH genes exert both endocrine and autocrine/paracrine functions. Copyright 2010 Elsevier Inc. All rights reserved.

  11. A diffusion approach to approximating preservation probabilities for gene duplicates.

    PubMed

    O'Hely, Martin

    2006-08-01

    Consider a haploid population and, within its genome, a gene whose presence is vital for the survival of any individual. Each copy of this gene is subject to mutations which destroy its function. Suppose one member of the population somehow acquires a duplicate copy of the gene, where the duplicate is fully linked to the original gene's locus. Preservation is said to occur if eventually the entire population consists of individuals descended from this one which initially carried the duplicate. The system is modelled by a finite state-space Markov process which in turn is approximated by a diffusion process, whence an explicit expression for the probability of preservation is derived. The event of preservation can be compared to the fixation of a selectively neutral gene variant initially present in a single individual, the probability of which is the reciprocal of the population size. For very weak mutation, this and the probability of preservation are equal, while as mutation becomes stronger, the preservation probability tends to double this reciprocal. This is in excellent agreement with simulation studies.

  12. Genomic analysis reveals extensive gene duplication within the bovine TRB locus

    PubMed Central

    Connelley, Timothy; Aerts, Jan; Law, Andy; Morrison, W Ivan

    2009-01-01

    Background Diverse TR and IG repertoires are generated by V(D)J somatic recombination. Genomic studies have been pivotal in cataloguing the V, D, J and C genes present in the various TR/IG loci and describing how duplication events have expanded the number of these genes. Such studies have also provided insights into the evolution of these loci and the complex mechanisms that regulate TR/IG expression. In this study we analyze the sequence of the third bovine genome assembly to characterize the germline repertoire of bovine TRB genes and compare the organization, evolution and regulatory structure of the bovine TRB locus with that of humans and mice. Results The TRB locus in the third bovine genome assembly is distributed over 5 scaffolds, extending to ~730 Kb. The available sequence contains 134 TRBV genes, assigned to 24 subgroups, and 3 clusters of DJC genes, each comprising a single TRBD gene, 5–7 TRBJ genes and a single TRBC gene. Seventy-nine of the TRBV genes are predicted to be functional. Comparison with the human and murine TRB loci shows that the gene order, as well as the sequences of non-coding elements that regulate TRB expression, are highly conserved in the bovine. Dot-plot analyses demonstrate that expansion of the genomic TRBV repertoire has occurred via a complex and extensive series of duplications, predominantly involving DNA blocks containing multiple genes. These duplication events have resulted in massive expansion of several TRBV subgroups, most notably TRBV6, 9 and 21 which contain 40, 35 and 16 members respectively. Similarly, duplication has lead to the generation of a third DJC cluster. Analyses of cDNA data confirms the diversity of the TRBV genes and, in addition, identifies a substantial number of TRBV genes, predominantly from the larger subgroups, which are still absent from the genome assembly. The observed gene duplication within the bovine TRB locus has created a repertoire of phylogenetically diverse functional TRBV genes

  13. Impact of gene gains, losses and duplication modes on the origin and diversification of vertebrates.

    PubMed

    Cañestro, Cristian; Albalat, Ricard; Irimia, Manuel; Garcia-Fernàndez, Jordi

    2013-02-01

    The study of the evolutionary origin of vertebrates has been linked to the study of genome duplications since Susumo Ohno suggested that the successful diversification of vertebrate innovations was facilitated by two rounds of whole-genome duplication (2R-WGD) in the stem vertebrate. Since then, studies on the functional evolution of many genes duplicated in the vertebrate lineage have provided the grounds to support experimentally this link. This article reviews cases of gene duplications derived either from the 2R-WGD or from local gene duplication events in vertebrates, analyzing their impact on the evolution of developmental innovations. We analyze how gene regulatory networks can be rewired by the activity of transposable elements after genome duplications, discuss how different mechanisms of duplication might affect the fate of duplicated genes, and how the loss of gene duplicates might influence the fate of surviving paralogs. We also discuss the evolutionary relationships between gene duplication and alternative splicing, in particular in the vertebrate lineage. Finally, we discuss the role that the 2R-WGD might have played in the evolution of vertebrate developmental gene networks, paying special attention to those related to vertebrate key features such as neural crest cells, placodes, and the complex tripartite brain. In this context, we argue that current evidences points that the 2R-WGD may not be linked to the origin of vertebrate innovations, but to their subsequent diversification in a broad variety of complex structures and functions that facilitated the successful transition from peaceful filter-feeding non-vertebrate ancestors to voracious vertebrate predators. Copyright © 2013 Elsevier Ltd. All rights reserved.

  14. Dose-sensitivity, conserved non-coding sequences, and duplicate gene retention through multiple tetraploidies in the grasses.

    PubMed

    Schnable, James C; Pedersen, Brent S; Subramaniam, Sabarinath; Freeling, Michael

    2011-01-01

    Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy.

  15. Dose–Sensitivity, Conserved Non-Coding Sequences, and Duplicate Gene Retention Through Multiple Tetraploidies in the Grasses

    PubMed Central

    Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael

    2011-01-01

    Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525

  16. PGDD: a database of gene and genome duplication in plants

    PubMed Central

    Lee, Tae-Ho; Tang, Haibao; Wang, Xiyin; Paterson, Andrew H.

    2013-01-01

    Genome duplication (GD) has permanently shaped the architecture and function of many higher eukaryotic genomes. The angiosperms (flowering plants) are outstanding models in which to elucidate consequences of GD for higher eukaryotes, owing to their propensity for chromosomal duplication or even triplication in a few cases. Duplicated genome structures often require both intra- and inter-genome alignments to unravel their evolutionary history, also providing the means to deduce both obvious and otherwise-cryptic orthology, paralogy and other relationships among genes. The burgeoning sets of angiosperm genome sequences provide the foundation for a host of investigations into the functional and evolutionary consequences of gene and GD. To provide genome alignments from a single resource based on uniform standards that have been validated by empirical studies, we built the Plant Genome Duplication Database (PGDD; freely available at http://chibba.agtec.uga.edu/duplication/), a web service providing synteny information in terms of colinearity between chromosomes. At present, PGDD contains data for 26 plants including bryophytes and chlorophyta, as well as angiosperms with draft genome sequences. In addition to the inclusion of new genomes as they become available, we are preparing new functions to enhance PGDD. PMID:23180799

  17. Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes.

    PubMed

    Studer, Romain A; Penel, Simon; Duret, Laurent; Robinson-Rechavi, Marc

    2008-09-01

    A stringent branch-site codon model was used to detect positive selection in vertebrate evolution. We show that the test is robust to the large evolutionary distances involved. Positive selection was detected in 77% of 884 genes studied. Most positive selection concerns a few sites on a single branch of the phylogenetic tree: Between 0.9% and 4.7% of sites are affected by positive selection depending on the branches. No functional category was overrepresented among genes under positive selection. Surprisingly, whole genome duplication had no effect on the prevalence of positive selection, whether the fish-specific genome duplication or the two rounds at the origin of vertebrates. Thus positive selection has not been limited to a few gene classes, or to specific evolutionary events such as duplication, but has been pervasive during vertebrate evolution.

  18. Ancient and Recent Duplications Support Functional Diversity of Daphnia Opsins.

    PubMed

    Brandon, Christopher S; Greenwold, Matthew J; Dudycha, Jeffry L

    2017-01-01

    Daphnia pulex has the largest known family of opsins, genes critical for photoreception and vision in animals. This diversity may be functionally redundant, arising from recent processes, or ancient duplications may have been preserved due to distinct functions and independent contributions to fitness. We analyzed opsins in D. pulex and its distant congener Daphnia magna. We identified 48 opsins in the D. pulex genome and 32 in D. magna. We inferred the complement of opsins in the last common ancestor of all Daphnia and evaluated the history of opsin duplication and loss. We further analyzed sequence variation to assess possible functional diversification among Daphnia opsins. Much of the opsin expansion occurred before the D. pulex-D. magna split more than 145 Mya, and both Daphnia lineages preserved most ancient opsins. More recent expansion occurred in pteropsins and long-wavelength visual opsins in both species, particularly D. pulex. Recent duplications were not random: the same ancestral genes duplicated independently in each modern species. Most ancient and some recent duplications involved differentiation at residues known to influence spectral tuning of visual opsins. Arthropsins show evidence of gene conversion between tandemly arrayed paralogs in functionally important domains. Intron-exon gene structure was generally conserved within clades inferred from sequences, although pteropsins showed substantial intron size variation. Overall, our analyses support the hypotheses that diverse opsins are maintained due to diverse functional roles in photoreception and vision, that functional diversification is both ancient and recent, and that multiple evolutionary processes have influenced different types of opsins.

  19. Complete mitochondrial DNA sequence of oyster Crassostrea hongkongensis-a case of "Tandem duplication-random loss" for genome rearrangement in Crassostrea?

    PubMed Central

    Yu, Ziniu; Wei, Zhengpeng; Kong, Xiaoyu; Shi, Wei

    2008-01-01

    pairs. There exists significant codon bias, favoring codons ending in A or T and against those ending with C. Pair analysis of genome rearrangements showed that the rearrangement distance is great between C. gigas-C. hongkongensis and C. virginica, indicating a high degree of rearrangements within Crassostrea. The determination of complete mt-genome of C. hongkongensis has yielded useful insight into features of gene order, variation, and evolution of Crassostrea and bivalve mt-genomes. Conclusion The mt-genome of C. hongkongensis shares some similarity with, and interesting differences to, other Crassostrea species and bivalves. The absence of trnC and trnN genes and duplicated or split rRNA genes from the C. hongkongensis genome is a completely novel feature not previously reported in Crassostrea species. The phenomenon is likely due to the loss of a segment that is present in other Crassostrea species and was present in ancestor of C. hongkongensis, thus a case of "tandem duplication-random loss (TDRL)". The mt-genome and new feature presented here reveal and underline the high level variation of gene order and gene content in Crassostrea and bivalves, inspiring more research to gain understanding to mechanisms underlying gene and genome evolution in bivalves and mollusks. PMID:18847502

  20. Autopolyploidy genome duplication preserves other ancient genome duplications in Atlantic salmon (Salmo salar).

    PubMed

    Christensen, Kris A; Davidson, William S

    2017-01-01

    Salmonids (e.g. Atlantic salmon, Pacific salmon, and trouts) have a long legacy of genome duplication. In addition to three ancient genome duplications that all teleosts are thought to share, salmonids have had one additional genome duplication. We explored a methodology for untangling these duplications from each other to better understand them in Atlantic salmon. In this methodology, homeologous regions (paralogous/duplicated genomic regions originating from a whole genome duplication) from the most recent genome duplication were assumed to have duplicated genes at greater density and have greater sequence similarity. This assumption was used to differentiate duplicated gene pairs in Atlantic salmon that are either from the most recent genome duplication or from earlier duplications. From a comparison with multiple vertebrate species, it is clear that Atlantic salmon have retained more duplicated genes from ancient genome duplications than other vertebrates--often at higher density in the genome and containing fewer synonymous mutations. It may be that polysomic inheritance is the mechanism responsible for maintaining ancient gene duplicates in salmonids. Polysomic inheritance (when multiple chromosomes pair during meiosis) is thought to be relatively common in salmonids compared to other vertebrate species. These findings illuminate how genome duplications may not only increase the number of duplicated genes, but may also be involved in the maintenance of them from previous genome duplications as well.

  1. The circadian clock of teleost fish: a comparative analysis reveals distinct fates for duplicated genes.

    PubMed

    Toloza-Villalobos, Jessica; Arroyo, José Ignacio; Opazo, Juan C

    2015-01-01

    The circadian clock is a central oscillator that coordinates endogenous rhythms. Members of six gene families underlie the metabolic machinery of this system. Although this machinery appears to correspond to a highly conserved genetic system in metazoans, it has been recognized that vertebrates possess a more diverse gene inventory than that of non-vertebrates. This difference could have originated in the two successive rounds of whole-genome duplications that took place in the common ancestor of the group. Teleost fish underwent an extra event of whole-genome duplication, which is thought to have provided an abundance of raw genetic material for the biological innovations that facilitated the radiation of the group. In this study, we assessed the relative contributions of whole-genome duplication and small-scale gene duplication to generate the repertoire of genes associated with the circadian clock of teleost fish. To achieve this goal, we annotated genes from six gene families associated with the circadian clock in eight teleost fish species, and we reconstructed their evolutionary history by inferring phylogenetic relationships. Our comparative analysis indicated that teleost species possess a variable repertoire of genes related to the circadian clock gene families and that the actual diversity of these genes has been shaped by a variety of phenomena, such as the complete deletion of ohnologs, the differential retention of genes, and lineage-specific gene duplications. From a functional perspective, the subfunctionalization of two ohnolog genes (PER1a and PER1b) in zebrafish highlights the power of whole-genome duplications to generate biological diversity.

  2. A duplicated PLP gene causing Pelizaeus-Merzbacher disease detected by comparative multiplex PCR

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Inoue, K.; Sugiyama, N.; Kawanishi, C.

    1996-07-01

    Pelizaeus-Merzbacher disease (PMD) is an X-linked dysmyelinating disorder caused by abnormalities in the proteolipid protein (PLP) gene, which is essential for oligodendrocyte differentiation and CNS myelin formation. Although linkage analysis has shown the homogeneity at the PLP locus in patients with PMD, exonic mutations in the PLP gene have been identified in only 10% - 25% of all cases, which suggests the presence of other genetic aberrations, including gene duplication. In this study, we examined five families with PMD not carrying exonic mutations in PLP gene, using comparative multiplex PCR (CM-PCR) as a semiquantitative assay of gene dosage. PLP genemore » duplications were identified in four families by CM-PCR and confirmed in three families by densitometric RFLP analysis. Because a homologous myelin protein gene, PMP22, is duplicated in the majority of patients with Charcot-Marie-Tooth 1A, PLP gene overdosage may be an important genetic abnormality in PMD and affect myelin formation. 38 ref., 5 figs., 2 tabs.« less

  3. Evolution of homeobox genes.

    PubMed

    Holland, Peter W H

    2013-01-01

    Many homeobox genes encode transcription factors with regulatory roles in animal and plant development. Homeobox genes are found in almost all eukaryotes, and have diversified into 11 gene classes and over 100 gene families in animal evolution, and 10 to 14 gene classes in plants. The largest group in animals is the ANTP class which includes the well-known Hox genes, plus other genes implicated in development including ParaHox (Cdx, Xlox, Gsx), Evx, Dlx, En, NK4, NK3, Msx, and Nanog. Genomic data suggest that the ANTP class diversified by extensive tandem duplication to generate a large array of genes, including an NK gene cluster and a hypothetical ProtoHox gene cluster that duplicated to generate Hox and ParaHox genes. Expression and functional data suggest that NK, Hox, and ParaHox gene clusters acquired distinct roles in patterning the mesoderm, nervous system, and gut. The PRD class is also diverse and includes Pax2/5/8, Pax3/7, Pax4/6, Gsc, Hesx, Otx, Otp, and Pitx genes. PRD genes are not generally arranged in ancient genomic clusters, although the Dux, Obox, and Rhox gene clusters arose in mammalian evolution as did several non-clustered PRD genes. Tandem duplication and genome duplication expanded the number of homeobox genes, possibly contributing to the evolution of developmental complexity, but homeobox gene loss must not be ignored. Evolutionary changes to homeobox gene expression have also been documented, including Hox gene expression patterns shifting in concert with segmental diversification in vertebrates and crustaceans, and deletion of a Pitx1 gene enhancer in pelvic-reduced sticklebacks. WIREs Dev Biol 2013, 2:31-45. doi: 10.1002/wdev.78 For further resources related to this article, please visit the WIREs website. The author declares that he has no conflicts of interest. Copyright © 2012 Wiley Periodicals, Inc.

  4. Rapid diversification of FoxP2 in teleosts through gene duplication in the teleost-specific whole genome duplication event.

    PubMed

    Song, Xiaowei; Wang, Yajun; Tang, Yezhong

    2013-01-01

    As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication.

  5. Rapid Diversification of FoxP2 in Teleosts through Gene Duplication in the Teleost-Specific Whole Genome Duplication Event

    PubMed Central

    Song, Xiaowei; Wang, Yajun; Tang, Yezhong

    2013-01-01

    As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication. PMID:24349554

  6. Many gene and domain families have convergent fates following independent whole-genome duplication events in Arabidopsis, Oryza, Saccharomyces and Tetraodon.

    PubMed

    Paterson, Andrew H; Chapman, Brad A; Kissinger, Jessica C; Bowers, John E; Feltus, Frank A; Estill, James C

    2006-11-01

    Genome duplication is potentially a good source of new genes, but such genes take time to evolve. We have found a group of "duplication-resistant" genes, which have undergone convergent restoration to singleton status following several independent genome duplications. Restoration of duplication-resistant genes to singleton status could be important to long-term survival of a polyploid lineage. Angiosperms show more frequent polyploidization and a higher degree of duplicate gene preservation than other paleopolyploids, making them well-suited to further study of duplication-resistant genes.

  7. [Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

    PubMed

    Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

    2002-01-01

    To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.

  8. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution

    PubMed Central

    Clarke, Thomas H.; Garb, Jessica E.; Hayashi, Cheryl Y.; Arensburger, Peter; Ayoub, Nadia A.

    2015-01-01

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). PMID:26058392

  9. Chromosomal duplications in bacteria, fruit flies, and humans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lupski, J.R.; Weinstock, G.M.; Roth, J.R.

    1996-01-01

    Tandem duplication of chromosomal segments has been recognized as a frequent mutational mechanism in several genetic model systems. In bacteria, fruit flies, and humans, duplications form by similar molecular mechanisms and appear to be important in genome evolution. 80 refs.

  10. Restriction and Recruitment—Gene Duplication and the Origin and Evolution of Snake Venom Toxins

    PubMed Central

    Hargreaves, Adam D.; Swain, Martin T.; Hegarty, Matthew J.; Logan, Darren W.; Mulley, John F.

    2014-01-01

    Snake venom has been hypothesized to have originated and diversified through a process that involves duplication of genes encoding body proteins with subsequent recruitment of the copy to the venom gland, where natural selection acts to develop or increase toxicity. However, gene duplication is known to be a rare event in vertebrate genomes, and the recruitment of duplicated genes to a novel expression domain (neofunctionalization) is an even rarer process that requires the evolution of novel combinations of transcription factor binding sites in upstream regulatory regions. Therefore, although this hypothesis concerning the evolution of snake venom is very unlikely and should be regarded with caution, it is nonetheless often assumed to be established fact, hindering research into the true origins of snake venom toxins. To critically evaluate this hypothesis, we have generated transcriptomic data for body tissues and salivary and venom glands from five species of venomous and nonvenomous reptiles. Our comparative transcriptomic analysis of these data reveals that snake venom does not evolve through the hypothesized process of duplication and recruitment of genes encoding body proteins. Indeed, our results show that many proposed venom toxins are in fact expressed in a wide variety of body tissues, including the salivary gland of nonvenomous reptiles and that these genes have therefore been restricted to the venom gland following duplication, not recruited. Thus, snake venom evolves through the duplication and subfunctionalization of genes encoding existing salivary proteins. These results highlight the danger of the elegant and intuitive “just-so story” in evolutionary biology. PMID:25079342

  11. Hox gene duplications correlate with posterior heteronomy in scorpions

    PubMed Central

    Sharma, Prashant P.; Schwager, Evelyn E.; Extavour, Cassandra G.; Wheeler, Ward C.

    2014-01-01

    The evolutionary success of the largest animal phylum, Arthropoda, has been attributed to tagmatization, the coordinated evolution of adjacent metameres to form morphologically and functionally distinct segmental regions called tagmata. Specification of regional identity is regulated by the Hox genes, of which 10 are inferred to be present in the ancestor of arthropods. With six different posterior segmental identities divided into two tagmata, the bauplan of scorpions is the most heteronomous within Chelicerata. Expression domains of the anterior eight Hox genes are conserved in previously surveyed chelicerates, but it is unknown how Hox genes regionalize the three tagmata of scorpions. Here, we show that the scorpion Centruroides sculpturatus has two paralogues of all Hox genes except Hox3, suggesting cluster and/or whole genome duplication in this arachnid order. Embryonic anterior expression domain boundaries of each of the last four pairs of Hox genes (two paralogues each of Antp, Ubx, abd-A and Abd-B) are unique and distinguish segmental groups, such as pectines, book lungs and the characteristic tail, while maintaining spatial collinearity. These distinct expression domains suggest neofunctionalization of Hox gene paralogues subsequent to duplication. Our data reconcile previous understanding of Hox gene function across arthropods with the extreme heteronomy of scorpions. PMID:25122224

  12. The HOPA Gene Dodecamer Duplication Is Not a Significant Etiological Factor in Autism.

    ERIC Educational Resources Information Center

    Michaelis, Ron C.; Copeland-Yates, Susan A.; Sossey-Alaoui, Khalid; Skinner, Cindy; Friez, Michael J.; Longshore, John W.; Simensen, Richard J.; Schroer, Richard J.; Stevenson, Roger E.

    2000-01-01

    A study of 202 patients with autism found the incidence of a dodecamer duplication in the HOPA gene was not significantly different between patients and controls. Three female patients inherited the duplication from nonautistic fathers. Also, there was no systematic skewing of X inactivation in female patients with the duplication. (Contains…

  13. Ascorbate peroxidase-related (APx-R) is not a duplicable gene.

    PubMed

    Dunand, Christophe; Mathé, Catherine; Lazzarotto, Fernanda; Margis, Rogério; Margis-Pinheiro, Marcia

    2011-12-01

    Phylogenetic, genomic and functional analyses have allowed the identification of a new class of putative heme peroxidases, so called APx-R (APx-Related). These new class, mainly present in the green lineage (including green algae and land plants), can also be detected in other unicellular chloroplastic organisms. Except for recent polyploid organisms, only single-copy of APx-R gene was detected in each genome, suggesting that the majority of the APx-R extra-copies were lost after chromosomal or segmental duplications. In a similar way, most APx-R co-expressed genes in Arabidopsis genome do not have conserved extra-copies after chromosomal duplications and are predicted to be localized in organelles, as are the APx-R. The member of this gene network can be considered as unique gene, well conserved through the evolution due to a strong negative selection pressure and a low evolution rate. © 2011 Landes Bioscience

  14. Sex-Linked Pheromone Receptor Genes of the European Corn Borer, Ostrinia nubilalis, Are in Tandem Arrays

    PubMed Central

    Yasukochi, Yuji; Miura, Nami; Nakano, Ryo; Sahara, Ken; Ishikawa, Yukio

    2011-01-01

    Background Tuning of the olfactory system of male moths to conspecific female sex pheromones is crucial for correct species recognition; however, little is known about the genetic changes that drive speciation in this system. Moths of the genus Ostrinia are good models to elucidate this question, since significant differences in pheromone blends are observed within and among species. Odorant receptors (ORs) play a critical role in recognition of female sex pheromones; eight types of OR genes expressed in male antennae were previously reported in Ostrinia moths. Methodology/Principal Findings We screened an O. nubilalis bacterial artificial chromosome (BAC) library by PCR, and constructed three contigs from isolated clones containing the reported OR genes. Fluorescence in situ hybridization (FISH) analysis using these clones as probes demonstrated that the largest contig, which contained eight OR genes, was located on the Z chromosome; two others harboring two and one OR genes were found on two autosomes. Sequence determination of BAC clones revealed the Z-linked OR genes were closely related and tandemly arrayed; moreover, four of them shared 181-bp direct repeats spanning exon 7 and intron 7. Conclusions/Significance This is the first report of tandemly arrayed sex pheromone receptor genes in Lepidoptera. The localization of an OR gene cluster on the Z chromosome agrees with previous findings for a Z-linked locus responsible for O. nubilalis male behavioral response to sex pheromone. The 181-bp direct repeats might enhance gene duplications by unequal crossovers. An autosomal locus responsible for male response to sex pheromone in Heliothis virescens and H. subflexa was recently reported to contain at least four OR genes. Taken together, these findings support the hypothesis that generation of additional copies of OR genes can increase the potential for male moths to acquire altered specificity for pheromone components, and accordingly, facilitate differentiation

  15. On the Complexity of Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.

    PubMed

    Kordi, Misagh; Bansal, Mukul S

    2017-01-01

    Duplication-Transfer-Loss (DTL) reconciliation has emerged as a powerful technique for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation takes as input a gene family phylogeny and the corresponding species phylogeny, and reconciles the two by postulating speciation, gene duplication, horizontal gene transfer, and gene loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. However, gene trees are frequently non-binary. With such non-binary gene trees, the reconciliation problem seeks to find a binary resolution of the gene tree that minimizes the reconciliation cost. Given the prevalence of non-binary gene trees, many efficient algorithms have been developed for this problem in the context of the simpler Duplication-Loss (DL) reconciliation model. Yet, no efficient algorithms exist for DTL reconciliation with non-binary gene trees and the complexity of the problem remains unknown. In this work, we resolve this open question by showing that the problem is, in fact, NP-hard. Our reduction applies to both the dated and undated formulations of DTL reconciliation. By resolving this long-standing open problem, this work will spur the development of both exact and heuristic algorithms for this important problem.

  16. A salmonid EST genomic study: genes, duplications, phylogeny and microarrays

    USDA-ARS?s Scientific Manuscript database

    Background: Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most wide...

  17. Root hairs, trichomes and the evolution of duplicate genes.

    PubMed

    Kellogg, E A

    2001-12-01

    The MYB-class proteins WEREWOLF and GLABRA1 are functionally interchangeable, even though one is normally expressed solely in roots and the other only in shoots. This shows that their different functions are the result of the modification of cis-regulatory sequences over evolutionary time. The two genes thus provide an example of morphological diversification created by gene duplication and changes in regulation.

  18. On Computing Breakpoint Distances for Genomes with Duplicate Genes.

    PubMed

    Shao, Mingfu; Moret, Bernard M E

    2017-06-01

    A fundamental problem in comparative genomics is to compute the distance between two genomes in terms of its higher level organization (given by genes or syntenic blocks). For two genomes without duplicate genes, we can easily define (and almost always efficiently compute) a variety of distance measures, but the problem is NP-hard under most models when genomes contain duplicate genes. To tackle duplicate genes, three formulations (exemplar, maximum matching, and any matching) have been proposed, all of which aim to build a matching between homologous genes so as to minimize some distance measure. Of the many distance measures, the breakpoint distance (the number of nonconserved adjacencies) was the first one to be studied and remains of significant interest because of its simplicity and model-free property. The three breakpoint distance problems corresponding to the three formulations have been widely studied. Although we provided last year a solution for the exemplar problem that runs very fast on full genomes, computing optimal solutions for the other two problems has remained challenging. In this article, we describe very fast, exact algorithms for these two problems. Our algorithms rely on a compact integer-linear program that we further simplify by developing an algorithm to remove variables, based on new results on the structure of adjacencies and matchings. Through extensive experiments using both simulations and biological data sets, we show that our algorithms run very fast (in seconds) on mammalian genomes and scale well beyond. We also apply these algorithms (as well as the classic orthology tool MSOAR) to create orthology assignment, then compare their quality in terms of both accuracy and coverage. We find that our algorithm for the "any matching" formulation significantly outperforms other methods in terms of accuracy while achieving nearly maximum coverage.

  19. Spider Transcriptomes Identify Ancient Large-Scale Gene Duplication Event Potentially Important in Silk Gland Evolution.

    PubMed

    Clarke, Thomas H; Garb, Jessica E; Hayashi, Cheryl Y; Arensburger, Peter; Ayoub, Nadia A

    2015-06-08

    The evolution of specialized tissues with novel functions, such as the silk synthesizing glands in spiders, is likely an influential driver of adaptive success. Large-scale gene duplication events and subsequent paralog divergence are thought to be required for generating evolutionary novelty. Such an event has been proposed for spiders, but not tested. We de novo assembled transcriptomes from three cobweb weaving spider species. Based on phylogenetic analyses of gene families with representatives from each of the three species, we found numerous duplication events indicative of a whole genome or segmental duplication. We estimated the age of the gene duplications relative to several speciation events within spiders and arachnids and found that the duplications likely occurred after the divergence of scorpions (order Scorpionida) and spiders (order Araneae), but before the divergence of the spider suborders Mygalomorphae and Araneomorphae, near the evolutionary origin of spider silk glands. Transcripts that are expressed exclusively or primarily within black widow silk glands are more likely to have a paralog descended from the ancient duplication event and have elevated amino acid replacement rates compared with other transcripts. Thus, an ancient large-scale gene duplication event within the spider lineage was likely an important source of molecular novelty during the evolution of silk gland-specific expression. This duplication event may have provided genetic material for subsequent silk gland diversification in the true spiders (Araneomorphae). © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots.

    PubMed

    Wang, Yupeng; Ficklin, Stephen P; Wang, Xiyin; Feltus, F Alex; Paterson, Andrew H

    2016-01-01

    Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots.

  1. Large-Scale Gene Relocations following an Ancient Genome Triplication Associated with the Diversification of Core Eudicots

    PubMed Central

    Wang, Yupeng; Ficklin, Stephen P.; Wang, Xiyin; Feltus, F. Alex; Paterson, Andrew H.

    2016-01-01

    Different modes of gene duplication including whole-genome duplication (WGD), and tandem, proximal and dispersed duplications are widespread in angiosperm genomes. Small-scale, stochastic gene relocations and transposed gene duplications are widely accepted to be the primary mechanisms for the creation of dispersed duplicates. However, here we show that most surviving ancient dispersed duplicates in core eudicots originated from large-scale gene relocations within a narrow window of time following a genome triplication (γ) event that occurred in the stem lineage of core eudicots. We name these surviving ancient dispersed duplicates as relocated γ duplicates. In Arabidopsis thaliana, relocated γ, WGD and single-gene duplicates have distinct features with regard to gene functions, essentiality, and protein interactions. Relative to γ duplicates, relocated γ duplicates have higher non-synonymous substitution rates, but comparable levels of expression and regulation divergence. Thus, relocated γ duplicates should be distinguished from WGD and single-gene duplicates for evolutionary investigations. Our results suggest large-scale gene relocations following the γ event were associated with the diversification of core eudicots. PMID:27195960

  2. Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.

    PubMed

    Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H

    2009-07-01

    Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via

  3. Calcium-Activated Potassium (BK) Channels Are Encoded by Duplicate slo1 Genes in Teleost Fishes

    PubMed Central

    Deitcher, David L.; Bass, Andrew H.

    2009-01-01

    Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via

  4. Rapid bursts of androgen-binding protein (Abp) gene duplication occurred independently in diverse mammals

    PubMed Central

    2008-01-01

    Background The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) α, β and γ subunits. Further investigation of 14 α-like (Abpa) and 13 β- or γ-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Results Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. Conclusion We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification. PMID:18269759

  5. Rapid bursts of androgen-binding protein (Abp) gene duplication occurred independently in diverse mammals.

    PubMed

    Laukaitis, Christina M; Heger, Andreas; Blakley, Tyler D; Munclinger, Pavel; Ponting, Chris P; Karn, Robert C

    2008-02-12

    The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) alpha, beta and gamma subunits. Further investigation of 14 alpha-like (Abpa) and 13 beta- or gamma-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification.

  6. An ancient genome duplication contributed to the abundance of metabolic genes in the moss Physcomitrella patens

    PubMed Central

    Rensing, Stefan A; Ick, Julia; Fawcett, Jeffrey A; Lang, Daniel; Zimmer, Andreas; Van de Peer, Yves; Reski, Ralf

    2007-01-01

    Background: Analyses of complete genomes and large collections of gene transcripts have shown that most, if not all seed plants have undergone one or more genome duplications in their evolutionary past. Results: In this study, based on a large collection of EST sequences, we provide evidence that the haploid moss Physcomitrella patens is a paleopolyploid as well. Based on the construction of linearized phylogenetic trees we infer the genome duplication to have occurred between 30 and 60 million years ago. Gene Ontology and pathway association of the duplicated genes in P. patens reveal different biases of gene retention compared with seed plants. Conclusion: Metabolic genes seem to have been retained in excess following the genome duplication in P. patens. This might, at least partly, explain the versatility of metabolism, as described for P. patens and other mosses, in comparison to other land plants. PMID:17683536

  7. The ace-1 Locus Is Amplified in All Resistant Anopheles gambiae Mosquitoes: Fitness Consequences of Homogeneous and Heterogeneous Duplications

    PubMed Central

    Djogbénou, Luc S.; Berthomieu, Arnaud; Makoundou, Patrick; Baba-Moussa, Lamine S.; Fiston-Lavier, Anna-Sophie; Belkhir, Khalid; Labbé, Pierrick; Weill, Mylène

    2016-01-01

    Gene copy-number variations are widespread in natural populations, but investigating their phenotypic consequences requires contemporary duplications under selection. Such duplications have been found at the ace-1 locus (encoding the organophosphate and carbamate insecticides’ target) in the mosquito Anopheles gambiae (the major malaria vector); recent studies have revealed their intriguing complexity, consistent with the involvement of various numbers and types (susceptible or resistant to insecticide) of copies. We used an integrative approach, from genome to phenotype level, to investigate the influence of duplication architecture and gene-dosage on mosquito fitness. We found that both heterogeneous (i.e., one susceptible and one resistant ace-1 copy) and homogeneous (i.e., identical resistant copies) duplications segregated in field populations. The number of copies in homogeneous duplications was variable and positively correlated with acetylcholinesterase activity and resistance level. Determining the genomic structure of the duplicated region revealed that, in both types of duplication, ace-1 and 11 other genes formed tandem 203kb amplicons. We developed a diagnostic test for duplications, which showed that ace-1 was amplified in all 173 resistant mosquitoes analyzed (field-collected in several African countries), in heterogeneous or homogeneous duplications. Each type was associated with different fitness trade-offs: heterogeneous duplications conferred an intermediate phenotype (lower resistance and fitness costs), whereas homogeneous duplications tended to increase both resistance and fitness cost, in a complex manner. The type of duplication selected seemed thus to depend on the intensity and distribution of selection pressures. This versatility of trade-offs available through gene duplication highlights the importance of large mutation events in adaptation to environmental variation. This impressive adaptability could have a major impact on vector

  8. Three neuropeptide Y receptor genes in the spiny dogfish, Squalus acanthias, support en bloc duplications in early vertebrate evolution.

    PubMed

    Salaneck, Erik; Ardell, David H; Larson, Earl T; Larhammar, Dan

    2003-08-01

    It has been debated whether the increase in gene number during early vertebrate evolution was due to multiple independent gene duplications or synchronous duplications of many genes. We describe here the cloning of three neuropeptide Y (NPY) receptor genes belonging to the Y1 subfamily in the spiny dogfish, Squalus acanthias, a cartilaginous fish. The three genes are orthologs of the mammalian subtypes Y1, Y4, and Y6, which are located in paralogous gene regions on different chromosomes in mammals. Thus, these genes arose by duplications of a chromosome region before the radiation of gnathostomes (jawed vertebrates). Estimates of duplication times from linearized trees together with evidence from other gene families supports two rounds of chromosome duplications or tetraploidizations early in vertebrate evolution. The anatomical distribution of mRNA was determined by reverse-transcriptase PCR and was found to differ from mammals, suggesting differential functional diversification of the new gene copies during the radiation of the vertebrate classes.

  9. Molecular evolution accompanying functional divergence of duplicated genes along the plant starch biosynthesis pathway

    PubMed Central

    2014-01-01

    Background Starch is the main source of carbon storage in the Archaeplastida. The starch biosynthesis pathway (sbp) emerged from cytosolic glycogen metabolism shortly after plastid endosymbiosis and was redirected to the plastid stroma during the green lineage divergence. The SBP is a complex network of genes, most of which are members of large multigene families. While some gene duplications occurred in the Archaeplastida ancestor, most were generated during the sbp redirection process, and the remaining few paralogs were generated through compartmentalization or tissue specialization during the evolution of the land plants. In the present study, we tested models of duplicated gene evolution in order to understand the evolutionary forces that have led to the development of SBP in angiosperms. We combined phylogenetic analyses and tests on the rates of evolution along branches emerging from major duplication events in six gene families encoding sbp enzymes. Results We found evidence of positive selection along branches following cytosolic or plastidial specialization in two starch phosphorylases and identified numerous residues that exhibited changes in volume, polarity or charge. Starch synthases, branching and debranching enzymes functional specializations were also accompanied by accelerated evolution. However, none of the sites targeted by selection corresponded to known functional domains, catalytic or regulatory. Interestingly, among the 13 duplications tested, 7 exhibited evidence of positive selection in both branches emerging from the duplication, 2 in only one branch, and 4 in none of the branches. Conclusions The majority of duplications were followed by accelerated evolution targeting specific residues along both branches. This pattern was consistent with the optimization of the two sub-functions originally fulfilled by the ancestral gene before duplication. Our results thereby provide strong support to the so-called “Escape from Adaptive Conflict

  10. Duplication and expression of CYC2-like genes in the origin and maintenance of corolla zygomorphy in Lamiales.

    PubMed

    Zhong, Jinshun; Kellogg, Elizabeth A

    2015-01-01

    Duplication, retention, and expression of CYCLOIDEA2 (CYC2)-like genes are thought to affect evolution of corolla symmetry. However, exactly what and how changes in CYC2-like genes correlate with the origin of corolla zygomorphy are poorly understood. We inferred and calibrated a densely sampled phylogeny of CYC2-like genes across the Lamiales and examined their expression in early diverging (EDL) and higher core clades (HCL). CYC2-like genes duplicated extensively in Lamiales, at least six times in core Lamiales (CL) around the Cretaceous-Paleogene (K-Pg) boundary, and seven more in EDL relatively more recently. Nested duplications and losses of CYC2-like paralogs are pervasive but may not correlate with transitions in corolla symmetry. We found evidence for dN/dS (ω) variation following gene duplications. CYC2-like paralogs in HCL show differential expression with higher expression in adaxial petals. Asymmetric expression but not recurrent duplication of CYC2-like genes correlates with the origin of corolla zygomorphy. Changes in both cis-regulatory and coding domains of CYC2-like genes are probably crucial for the evolution of corolla zygomorphy. Multiple selection regimes appear likely to play important roles in gene retention. The parallel duplications of CYC2-like genes are after the initial diversification of bumble bees and Euglossine bees. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  11. A Lossy Compression Technique Enabling Duplication-Aware Sequence Alignment

    PubMed Central

    Freschi, Valerio; Bogliolo, Alessandro

    2012-01-01

    In spite of the recognized importance of tandem duplications in genome evolution, commonly adopted sequence comparison algorithms do not take into account complex mutation events involving more than one residue at the time, since they are not compliant with the underlying assumption of statistical independence of adjacent residues. As a consequence, the presence of tandem repeats in sequences under comparison may impair the biological significance of the resulting alignment. Although solutions have been proposed, repeat-aware sequence alignment is still considered to be an open problem and new efficient and effective methods have been advocated. The present paper describes an alternative lossy compression scheme for genomic sequences which iteratively collapses repeats of increasing length. The resulting approximate representations do not contain tandem duplications, while retaining enough information for making their comparison even more significant than the edit distance between the original sequences. This allows us to exploit traditional alignment algorithms directly on the compressed sequences. Results confirm the validity of the proposed approach for the problem of duplication-aware sequence alignment. PMID:22518086

  12. Effects of Gene Duplication, Positive Selection, and Shifts in Gene Expression on the Evolution of the Venom Gland Transcriptome in Widow Spiders

    PubMed Central

    Haney, Robert A.; Clarke, Thomas H.; Gadgil, Rujuta; Fitzpatrick, Ryan; Hayashi, Cheryl Y.; Ayoub, Nadia A.; Garb, Jessica E.

    2016-01-01

    Gene duplication and positive selection can be important determinants of the evolution of venom, a protein-rich secretion used in prey capture and defense. In a typical model of venom evolution, gene duplicates switch to venom gland expression and change function under the action of positive selection, which together with further duplication produces large gene families encoding diverse toxins. Although these processes have been demonstrated for individual toxin families, high-throughput multitissue sequencing of closely related venomous species can provide insights into evolutionary dynamics at the scale of the entire venom gland transcriptome. By assembling and analyzing multitissue transcriptomes from the Western black widow spider and two closely related species with distinct venom toxicity phenotypes, we do not find that gene duplication and duplicate retention is greater in gene families with venom gland biased expression in comparison with broadly expressed families. Positive selection has acted on some venom toxin families, but does not appear to be in excess for families with venom gland biased expression. Moreover, we find 309 distinct gene families that have single transcripts with venom gland biased expression, suggesting that the switching of genes to venom gland expression in numerous unrelated gene families has been a dominant mode of evolution. We also find ample variation in protein sequences of venom gland–specific transcripts, lineage-specific family sizes, and ortholog expression among species. This variation might contribute to the variable venom toxicity of these species. PMID:26733576

  13. Gene Duplication and Evolutionary Innovations in Hemoglobin-Oxygen Transport

    PubMed Central

    2016-01-01

    During vertebrate evolution, duplicated hemoglobin (Hb) genes diverged with respect to functional properties as well as the developmental timing of expression. For example, the subfamilies of genes that encode the different subunit chains of Hb are ontogenetically regulated such that functionally distinct Hb isoforms are expressed during different developmental stages. In some vertebrate taxa, functional differentiation between co-expressed Hb isoforms may also contribute to physiologically important divisions of labor. PMID:27053736

  14. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies

    PubMed Central

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D.

    2016-01-01

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella. We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes—and that the butterfly proboscis is involved in digestive enzyme production. PMID:27553646

  15. GENE-dosage effects on fitness in recent adaptive duplications: ace-1 in the mosquito Culex pipiens.

    PubMed

    Labbé, Pierrick; Milesi, Pascal; Yébakima, André; Pasteur, Nicole; Weill, Mylène; Lenormand, Thomas

    2014-07-01

    Gene duplications have long been advocated to contribute to the evolution of new functions. The role of selection in their early spread is more controversial. Unless duplications are favored for a direct benefit of increased expression, they are likely detrimental. In this article, we investigated the case of duplications favored because they combine already functionally divergent alleles. Their gene-dosage/fitness relations are poorly known because selection may operate on both overall expression and duplicates relative dosage. Using the well-documented case of Culex pipiens resistance to insecticides, we compared strains with various ace-1 allele combinations, including two duplicated alleles carrying both susceptible and resistant copies. The overall protein activity was nearly additive, but, surprisingly, fitness correlated better with the relative proportion of susceptible and resistant copies rather than any absolute measure of activity. Gene dosage is thus crucial, duplications stabilizing a "heterozygote" phenotype. It corroborates the view that these were favored because they fix a permanent heterosis, thereby solving the irreducible trade-off between resistance and synaptic transmission. Moreover, we showed that the contrasted successes of the two duplicated alleles in natural populations depend on genetic changes unrelated to ace-1, confirming the probable implication of recessive sublethal mutations linked to structural rearrangements in some duplications. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.

  16. STRIDE: Species Tree Root Inference from Gene Duplication Events.

    PubMed

    Emms, David M; Kelly, Steven

    2017-12-01

    The correct interpretation of any phylogenetic tree is dependent on that tree being correctly rooted. We present STRIDE, a fast, effective, and outgroup-free method for identification of gene duplication events and species tree root inference in large-scale molecular phylogenetic analyses. STRIDE identifies sets of well-supported in-group gene duplication events from a set of unrooted gene trees, and analyses these events to infer a probability distribution over an unrooted species tree for the location of its root. We show that STRIDE correctly identifies the root of the species tree in multiple large-scale molecular phylogenetic data sets spanning a wide range of timescales and taxonomic groups. We demonstrate that the novel probability model implemented in STRIDE can accurately represent the ambiguity in species tree root assignment for data sets where information is limited. Furthermore, application of STRIDE to outgroup-free inference of the origin of the eukaryotic tree resulted in a root probability distribution that provides additional support for leading hypotheses for the origin of the eukaryotes. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. Gene Duplication and Gene Expression Changes Play a Role in the Evolution of Candidate Pollen Feeding Genes in Heliconius Butterflies.

    PubMed

    Smith, Gilbert; Macias-Muñoz, Aide; Briscoe, Adriana D

    2016-09-02

    Heliconius possess a unique ability among butterflies to feed on pollen. Pollen feeding significantly extends their lifespan, and is thought to have been important to the diversification of the genus. We used RNA sequencing to examine feeding-related gene expression in the mouthparts of four species of Heliconius and one nonpollen feeding species, Eueides isabella We hypothesized that genes involved in morphology and protein metabolism might be upregulated in Heliconius because they have longer proboscides than Eueides, and because pollen contains more protein than nectar. Using de novo transcriptome assemblies, we tested these hypotheses by comparing gene expression in mouthparts against antennae and legs. We first looked for genes upregulated in mouthparts across all five species and discovered several hundred genes, many of which had functional annotations involving metabolism of proteins (cocoonase), lipids, and carbohydrates. We then looked specifically within Heliconius where we found eleven common upregulated genes with roles in morphology (CPR cuticle proteins), behavior (takeout-like), and metabolism (luciferase-like). Closer examination of these candidates revealed that cocoonase underwent several duplications along the lineage leading to heliconiine butterflies, including two Heliconius-specific duplications. Luciferase-like genes also underwent duplication within lepidopterans, and upregulation in Heliconius mouthparts. Reverse-transcription PCR confirmed that three cocoonases, a peptidase, and one luciferase-like gene are expressed in the proboscis with little to no expression in labial palps and salivary glands. Our results suggest pollen feeding, like other dietary specializations, was likely facilitated by adaptive expansions of preexisting genes-and that the butterfly proboscis is involved in digestive enzyme production. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.

    PubMed

    Shao, Mingfu; Lin, Yu; Moret, Bernard M E

    2015-05-01

    Computing the edit distance between two genomes is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be computed in linear time for genomes without duplicate genes, while the problem becomes NP-hard in the presence of duplicate genes. In this article, we propose an integer linear programming (ILP) formulation to compute the DCJ distance between two genomes with duplicate genes. We also provide an efficient preprocessing approach to simplify the ILP formulation while preserving optimality. Comparison on simulated genomes demonstrates that our method outperforms MSOAR in computing the edit distance, especially when the genomes contain long duplicated segments. We also apply our method to assign orthologous gene pairs among human, mouse, and rat genomes, where once again our method outperforms MSOAR.

  19. Duplicated Enhancer Region Increases Expression of CTSB and Segregates with Keratolytic Winter Erythema in South African and Norwegian Families.

    PubMed

    Ngcungcu, Thandiswa; Oti, Martin; Sitek, Jan C; Haukanes, Bjørn I; Linghu, Bolan; Bruccoleri, Robert; Stokowy, Tomasz; Oakeley, Edward J; Yang, Fan; Zhu, Jiang; Sultan, Marc; Schalkwijk, Joost; van Vlijmen-Willems, Ivonne M J J; von der Lippe, Charlotte; Brunner, Han G; Ersland, Kari M; Grayson, Wayne; Buechmann-Moller, Stine; Sundnes, Olav; Nirmala, Nanguneri; Morgan, Thomas M; van Bokhoven, Hans; Steen, Vidar M; Hull, Peter R; Szustakowski, Joseph; Staedtler, Frank; Zhou, Huiqing; Fiskerstrand, Torunn; Ramsay, Michele

    2017-05-04

    Keratolytic winter erythema (KWE) is a rare autosomal-dominant skin disorder characterized by recurrent episodes of palmoplantar erythema and epidermal peeling. KWE was previously mapped to 8p23.1-p22 (KWE critical region) in South African families. Using targeted resequencing of the KWE critical region in five South African families and SNP array and whole-genome sequencing in two Norwegian families, we identified two overlapping tandem duplications of 7.67 kb (South Africans) and 15.93 kb (Norwegians). The duplications segregated with the disease and were located upstream of CTSB, a gene encoding cathepsin B, a cysteine protease involved in keratinocyte homeostasis. Included in the 2.62 kb overlapping region of these duplications is an enhancer element that is active in epidermal keratinocytes. The activity of this enhancer correlated with CTSB expression in normal differentiating keratinocytes and other cell lines, but not with FDFT1 or NEIL2 expression. Gene expression (qPCR) analysis and immunohistochemistry of the palmar epidermis demonstrated significantly increased expression of CTSB, as well as stronger staining of cathepsin B in the stratum granulosum of affected individuals than in that of control individuals. Analysis of higher-order chromatin structure data and RNA polymerase II ChIA-PET data from MCF-7 cells did not suggest remote effects of the enhancer. In conclusion, KWE in South African and Norwegian families is caused by tandem duplications in a non-coding genomic region containing an active enhancer element for CTSB, resulting in upregulation of this gene in affected individuals. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  20. Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.

    PubMed

    Kordi, Misagh; Bansal, Mukul S

    2017-06-01

    Duplication-Transfer-Loss (DTL) reconciliation is a powerful method for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation seeks to reconcile gene trees with species trees by postulating speciation, duplication, transfer, and loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. In practice, however, gene trees are often non-binary due to uncertainty in the gene tree topologies, and DTL reconciliation with non-binary gene trees is known to be NP-hard. In this paper, we present the first exact algorithms for DTL reconciliation with non-binary gene trees. Specifically, we (i) show that the DTL reconciliation problem for non-binary gene trees is fixed-parameter tractable in the maximum degree of the gene tree, (ii) present an exponential-time, but in-practice efficient, algorithm to track and enumerate all optimal binary resolutions of a non-binary input gene tree, and (iii) apply our algorithms to a large empirical data set of over 4700 gene trees from 100 species to study the impact of gene tree uncertainty on DTL-reconciliation and to demonstrate the applicability and utility of our algorithms. The new techniques and algorithms introduced in this paper will help biologists avoid incorrect evolutionary inferences caused by gene tree uncertainty.

  1. Comparative genomic organization and tissue-specific transcription of the duplicated fabp7 and fabp10 genes in teleost fishes.

    PubMed

    Parmar, Manoj B; Wright, Jonathan M

    2013-11-01

    A whole-genome duplication (WGD) early in the teleost fish lineage makes fish ideal organisms to study the fate of duplicated genes and underlying evolutionary trajectories that have led to the retention of ohnologous gene duplicates in fish genomes. Here, we compare the genomic organization and tissue-specific transcription of the ohnologous fabp7 and fabp10 genes in medaka, three-spined stickleback, and spotted green pufferfish to the well-studied duplicated fabp7 and fabp10 genes of zebrafish. Teleost fabp7 and fabp10 genes contain four exons interrupted by three introns. Polypeptide sequences of Fabp7 and Fabp10 show the highest sequence identity and similarity with their orthologs from vertebrates. Orthology was evident as the ohnologous Fabp7 and Fabp10 polypeptides of teleost fishes each formed distinct clades and clustered together with their orthologs from other vertebrates in a phylogenetic tree. Furthermore, ohnologous teleost fabp7 and fabp10 genes exhibit conserved gene synteny with human FABP7 and chicken FABP10, respectively, which provides compelling evidence that the duplicated fabp7 and fabp10 genes of teleost fishes most likely arose from the well-documented WGD. The tissue-specific distribution of fabp7a, fabp7b, fabp10a, and fabp10b transcripts provides evidence of diverged spatial transcriptional regulation between ohnologous gene duplicates of fabp7 and fabp10 in teleost fishes.

  2. Theobroma cacao L. pathogenesis-related gene tandem array members show diverse expression dynamics in response to pathogen colonization.

    PubMed

    Fister, Andrew S; Mejia, Luis C; Zhang, Yufan; Herre, Edward Allen; Maximova, Siela N; Guiltinan, Mark J

    2016-05-17

    The pathogenesis-related (PR) group of proteins are operationally defined as polypeptides that increase in concentration in plant tissues upon contact with a pathogen. To date, 17 classes of highly divergent proteins have been described that act through multiple mechanisms of pathogen resistance. Characterizing these families in cacao, an economically important tree crop, and comparing the families to those in other species, is an important step in understanding cacao's immune response. Using publically available resources, all members of the 17 recognized pathogenesis-related gene families in the genome of Theobroma cacao were identified and annotated resulting in a set of ~350 members in both published cacao genomes. Approximately 50 % of these genes are organized in tandem arrays scattered throughout the genome. This feature was observed in five additional plant taxa (three dicots and two monocots), suggesting that tandem duplication has played an important role in the evolution of the PR genes in higher plants. Expression profiling captured the dynamics and complexity of PR genes expression at basal levels and after induction by two cacao pathogens (the oomycete, Phytophthora palmivora, and the fungus, Colletotrichum theobromicola), identifying specific genes within families that are more responsive to pathogen challenge. Subsequent qRT-PCR validated the induction of several PR-1, PR-3, PR-4, and PR-10 family members, with greater than 1000 fold induction detected for specific genes. We describe candidate genes that are likely to be involved in cacao's defense against Phytophthora and Colletotrichum infection and could be potentially useful for marker-assisted selection for breeding of disease resistant cacao varieties. The data presented here, along with existing cacao-omics resources, will enable targeted functional genetic screening of defense genes likely to play critical functions in cacao's defense against its pathogens.

  3. Assessment and Reconstruction of Novel HSP90 Genes: Duplications, Gains and Losses in Fungal and Animal Lineages

    PubMed Central

    Pantzartzi, Chrysoula N.; Drosopoulou, Elena; Scouras, Zacharias G.

    2013-01-01

    Hsp90s, members of the Heat Shock Protein class, protect the structure and function of proteins and play a significant task in cellular homeostasis and signal transduction. In order to determine the number of hsp90 gene copies and encoded proteins in fungal and animal lineages and through that key duplication events that this family has undergone, we collected and evaluated Hsp90 protein sequences and corresponding Expressed Sequence Tags and analyzed available genomes from various taxa. We provide evidence for duplication events affecting either single species or wider taxonomic groups. With regard to Fungi, duplicated genes have been detected in several lineages. In invertebrates, we demonstrate key duplication events in certain clades of Arthropoda and Mollusca, and a possible gene loss event in a hymenopteran family. Finally, we infer that the duplication event responsible for the two (a and b) isoforms in vertebrates occurred probably shortly after the split of Hyperoartia and Gnathostomata. PMID:24066039

  4. Neofunctionalization of Duplicated P450 Genes Drives the Evolution of Insecticide Resistance in the Brown Planthopper.

    PubMed

    Zimmer, Christoph T; Garrood, William T; Singh, Kumar Saurabh; Randall, Emma; Lueke, Bettina; Gutbrod, Oliver; Matthiesen, Svend; Kohler, Maxie; Nauen, Ralf; Davies, T G Emyr; Bass, Chris

    2018-01-22

    Gene duplication is a major source of genetic variation that has been shown to underpin the evolution of a wide range of adaptive traits [1, 2]. For example, duplication or amplification of genes encoding detoxification enzymes has been shown to play an important role in the evolution of insecticide resistance [3-5]. In this context, gene duplication performs an adaptive function as a result of its effects on gene dosage and not as a source of functional novelty [3, 6-8]. Here, we show that duplication and neofunctionalization of a cytochrome P450, CYP6ER1, led to the evolution of insecticide resistance in the brown planthopper. Considerable genetic variation was observed in the coding sequence of CYP6ER1 in populations of brown planthopper collected from across Asia, but just two sequence variants are highly overexpressed in resistant strains and metabolize imidacloprid. Both variants are characterized by profound amino-acid alterations in substrate recognition sites, and the introduction of these mutations into a susceptible P450 sequence is sufficient to confer resistance. CYP6ER1 is duplicated in resistant strains with individuals carrying paralogs with and without the gain-of-function mutations. Despite numerical parity in the genome, the susceptible and mutant copies exhibit marked asymmetry in their expression with the resistant paralogs overexpressed. In the primary resistance-conferring CYP6ER1 variant, this results from an extended region of novel sequence upstream of the gene that provides enhanced expression. Our findings illustrate the versatility of gene duplication in providing opportunities for functional and regulatory innovation during the evolution of an adaptive trait. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.

  5. Functional diversification upon leader protease domain duplication in the Citrus tristeza virus genome: Role of RNA sequences and the encoded proteins.

    PubMed

    Kang, Sung-Hwan; Atallah, Osama O; Sun, Yong-Duo; Folimonova, Svetlana Y

    2018-01-15

    Viruses from the family Closteroviridae show an example of intra-genome duplications of more than one gene. In addition to the hallmark coat protein gene duplication, several members possess a tandem duplication of papain-like leader proteases. In this study, we demonstrate that domains encoding the L1 and L2 proteases in the Citrus tristeza virus genome underwent a significant functional divergence at the RNA and protein levels. We show that the L1 protease is crucial for viral accumulation and establishment of initial infection, whereas its coding region is vital for virus transport. On the other hand, the second protease is indispensable for virus infection of its natural citrus host, suggesting that L2 has evolved an important adaptive function that mediates virus interaction with the woody host. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Functional characterization of duplicated Suppressor of Overexpression of Constans 1-like genes in petunia.

    PubMed

    Preston, Jill C; Jorgensen, Stacy A; Jha, Suryatapa G

    2014-01-01

    Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae), many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene Suppressor Of Overexpression of Constans 1 (SOC1) in the short-lived perennial Petunia hybrida (petunia, Solanaceae). Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes Unshaven (UNS) and Floral Binding Protein 21 (FBP21), but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods.

  7. Functional Characterization of Duplicated SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1-Like Genes in Petunia

    PubMed Central

    Preston, Jill C.; Jorgensen, Stacy A.; Jha, Suryatapa G.

    2014-01-01

    Flowering time is strictly controlled by a combination of internal and external signals that match seed set with favorable environmental conditions. In the model plant species Arabidopsis thaliana (Brassicaceae), many of the genes underlying development and evolution of flowering have been discovered. However, much remains unknown about how conserved the flowering gene networks are in plants with different growth habits, gene duplication histories, and distributions. Here we functionally characterize three homologs of the flowering gene SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) in the short-lived perennial Petunia hybrida (petunia, Solanaceae). Similar to A. thaliana soc1 mutants, co-silencing of duplicated petunia SOC1-like genes results in late flowering. This phenotype is most severe when all three SOC1-like genes are silenced. Furthermore, expression levels of the SOC1-like genes UNSHAVEN (UNS) and FLORAL BINDING PROTEIN 21 (FBP21), but not FBP28, are positively correlated with developmental age. In contrast to A. thaliana, petunia SOC1-like gene expression did not increase with longer photoperiods, and FBP28 transcripts were actually more abundant under short days. Despite evidence of functional redundancy, differential spatio-temporal expression data suggest that SOC1-like genes might fine-tune petunia flowering in response to photoperiod and developmental stage. This likely resulted from modification of SOC1-like gene regulatory elements following recent duplication, and is a possible mechanism to ensure flowering under both inductive and non-inductive photoperiods. PMID:24787903

  8. Simulating evolution of protein complexes through gene duplication and co-option.

    PubMed

    Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter

    2016-06-21

    We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. The butterfly plant arms-race escalated by gene and genome duplications.

    PubMed

    Edger, Patrick P; Heidel-Fischer, Hanna M; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E; Heckel, David G; Der, Joshua P; Wafula, Eric K; Tang, Michelle; Hofberger, Johannes A; Smithson, Ann; Hall, Jocelyn C; Blanchette, Matthieu; Bureau, Thomas E; Wright, Stephen I; dePamphilis, Claude W; Eric Schranz, M; Barker, Michael S; Conant, Gavin C; Wahlberg, Niklas; Vogel, Heiko; Pires, J Chris; Wheat, Christopher W

    2015-07-07

    Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits.

  10. The butterfly plant arms-race escalated by gene and genome duplications

    PubMed Central

    Edger, Patrick P.; Heidel-Fischer, Hanna M.; Bekaert, Michaël; Rota, Jadranka; Glöckner, Gernot; Platts, Adrian E.; Heckel, David G.; Der, Joshua P.; Wafula, Eric K.; Tang, Michelle; Hofberger, Johannes A.; Smithson, Ann; Hall, Jocelyn C.; Blanchette, Matthieu; Bureau, Thomas E.; Wright, Stephen I.; dePamphilis, Claude W.; Eric Schranz, M.; Barker, Michael S.; Conant, Gavin C.; Wahlberg, Niklas; Vogel, Heiko; Pires, J. Chris; Wheat, Christopher W.

    2015-01-01

    Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits. PMID:26100883

  11. Evolutionary history and stress regulation of plant receptor-like kinase/pelle genes.

    PubMed

    Lehti-Shiu, Melissa D; Zou, Cheng; Hanada, Kousuke; Shiu, Shin-Han

    2009-05-01

    Receptor-Like Kinase (RLK)/Pelle genes play roles ranging from growth regulation to defense response, and the dramatic expansion of this family has been postulated to be crucial for plant-specific adaptations. Despite this, little is known about the history of or the factors that contributed to the dramatic expansion of this gene family. In this study, we show that expansion coincided with the establishment of land plants and that RLK/Pelle subfamilies were established early in land plant evolution. The RLK/Pelle family expanded at a significantly higher rate than other kinases, due in large part to expansion of a few subfamilies by tandem duplication. Interestingly, these subfamilies tend to have members with known roles in defense response, suggesting that their rapid expansion was likely a consequence of adaptation to fast-evolving pathogens. Arabidopsis (Arabidopsis thaliana) expression data support the importance of RLK/Pelles in biotic stress response. We found that hundreds of RLK/Pelles are up-regulated by biotic stress. Furthermore, stress responsiveness is correlated with the degree of tandem duplication in RLK/Pelle subfamilies. Our findings suggest a link between stress response and tandem duplication and provide an explanation for why a large proportion of the RLK/Pelle gene family is found in tandem repeats. In addition, our findings provide a useful framework for potentially predicting RLK/Pelle stress functions based on knowledge of expansion pattern and duplication mechanism. Finally, we propose that the detection of highly variable molecular patterns associated with specific pathogens/parasites is the main reason for the up-regulation of hundreds of RLK/Pelles under biotic stress.

  12. Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals.

    PubMed

    Popova, Olga V; Mikhailov, Kirill V; Nikitin, Mikhail A; Logacheva, Maria D; Penin, Aleksey A; Muntyan, Maria S; Kedrova, Olga S; Petrov, Nikolai B; Panchin, Yuri V; Aleoshin, Vladimir V

    2016-01-01

    Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha-an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia.

  13. Mitochondrial Genomes of Kinorhyncha: trnM Duplication and New Gene Orders within Animals

    PubMed Central

    Popova, Olga V.; Mikhailov, Kirill V.; Nikitin, Mikhail A.; Logacheva, Maria D.; Penin, Aleksey A.; Muntyan, Maria S.; Kedrova, Olga S.; Petrov, Nikolai B.; Panchin, Yuri V.

    2016-01-01

    Many features of mitochondrial genomes of animals, such as patterns of gene arrangement, nucleotide content and substitution rate variation are extensively used in evolutionary and phylogenetic studies. Nearly 6,000 mitochondrial genomes of animals have already been sequenced, covering the majority of animal phyla. One of the groups that escaped mitogenome sequencing is phylum Kinorhyncha—an isolated taxon of microscopic worm-like ecdysozoans. The kinorhynchs are thought to be one of the early-branching lineages of Ecdysozoa, and their mitochondrial genomes may be important for resolving evolutionary relations between major animal taxa. Here we present the results of sequencing and analysis of mitochondrial genomes from two members of Kinorhyncha, Echinoderes svetlanae (Cyclorhagida) and Pycnophyes kielensis (Allomalorhagida). Their mitochondrial genomes are circular molecules approximately 15 Kbp in size. The kinorhynch mitochondrial gene sequences are highly divergent, which precludes accurate phylogenetic inference. The mitogenomes of both species encode a typical metazoan complement of 37 genes, which are all positioned on the major strand, but the gene order is distinct and unique among Ecdysozoa or animals as a whole. We predict four types of start codons for protein-coding genes in E. svetlanae and five in P. kielensis with a consensus DTD in single letter code. The mitochondrial genomes of E. svetlanae and P. kielensis encode duplicated methionine tRNA genes that display compensatory nucleotide substitutions. Two distant species of Kinorhyncha demonstrate similar patterns of gene arrangements in their mitogenomes. Both genomes have duplicated methionine tRNA genes; the duplication predates the divergence of two species. The kinorhynchs share a few features pertaining to gene order that align them with Priapulida. Gene order analysis reveals that gene arrangement specific of Priapulida may be ancestral for Scalidophora, Ecdysozoa, and even Protostomia

  14. A gene duplication/loss event in the ribulose-1,5-bisphosphate-carboxylase/oxygenase (rubisco) small subunit gene family among accessions of Arabidopsis thaliana.

    PubMed

    Schwarte, Sandra; Tiedemann, Ralph

    2011-06-01

    Rubisco (ribulose-1,5-bisphosphate carboxylase/oxygenase; EC 4.1.1.39), the most abundant protein in nature, catalyzes the assimilation of CO(2) (worldwide about 10(11) t each year) by carboxylation of ribulose-1,5-bisphosphate. It is a hexadecamer consisting of eight large and eight small subunits. Although the Rubisco large subunit (rbcL) is encoded by a single gene on the multicopy chloroplast genome, the Rubisco small subunits (rbcS) are encoded by a family of nuclear genes. In Arabidopsis thaliana, the rbcS gene family comprises four members, that is, rbcS-1a, rbcS-1b, rbcS-2b, and rbcS-3b. We sequenced all Rubisco genes in 26 worldwide distributed A. thaliana accessions. In three of these accessions, we detected a gene duplication/loss event, where rbcS-1b was lost and substituted by a duplicate of rbcS-2b (called rbcS-2b*). By screening 74 additional accessions using a specific polymerase chain reaction assay, we detected five additional accessions with this duplication/loss event. In summary, we found the gene duplication/loss in 8 of 100 A. thaliana accessions, namely, Bch, Bu, Bur, Cvi, Fei, Lm, Sha, and Sorbo. We sequenced an about 1-kb promoter region for all Rubisco genes as well. This analysis revealed that the gene duplication/loss event was associated with promoter alterations (two insertions of 450 and 850 bp, one deletion of 730 bp) in rbcS-2b and a promoter deletion (2.3 kb) in rbcS-2b* in all eight affected accessions. The substitution of rbcS-1b by a duplicate of rbcS-2b (i.e., rbcS-2b*) might be caused by gene conversion. All four Rubisco genes evolve under purifying selection, as expected for central genes of the highly conserved photosystem of green plants. We inferred a single positive selected site, a tyrosine to aspartic acid substitution at position 72 in rbcS-1b. Exactly the same substitution compromises carboxylase activity in the cyanobacterium Anacystis nidulans. In A. thaliana, this substitution is associated with an inferred

  15. Duplication 16p13.3 and the CREBBP gene: confirmation of the phenotype.

    PubMed

    Demeer, Bénédicte; Andrieux, Joris; Receveur, Aline; Morin, Gilles; Petit, Florence; Julia, Sophie; Plessis, Ghislaine; Martin-Coignard, Dominique; Delobel, Bruno; Firth, Helen V; Thuresson, Ann C; Lanco Dosen, Sandrine; Sjörs, Kerstin; Le Caignec, Cedric; Devriendt, Koenraad; Mathieu-Dramard, Michèle

    2013-01-01

    The introduction of molecular karyotyping technologies into the diagnostic work-up of patients with congenital disorders permitted the identification and delineation of novel microdeletion and microduplication syndromes. Interstitial 16p13.3 duplication, encompassing the CREBBP gene, which is mutated or deleted in the Rubinstein-Taybi syndrome, have been proposed to cause a recognisable syndrome with variable intellectual disability, normal growth, mild facial dysmorphism, mild anomalies of the extremities, and occasional findings such as developmental defects of the heart, genitalia, palate or the eyes. We here report the phenotypic and genotypic delineation of 9 patients carrying a submicroscopic 16p13.3 duplication, including the smallest 16p13.3 duplication reported so far. Careful clinical assessment confirms the distinctive clinical phenotype and also defines frequent associated features : marked speech problems, frequent ocular region involvement with upslanting of the eyes, narrow palpebral fissures, ptosis and strabismus, frequent proximal implantation of thumbs, cleft palate/bifid uvula and inguinal hernia. It also confirms that CREBBP is the critical gene involved in the duplication 16p13.3 syndrome. Copyright © 2012 Elsevier Masson SAS. All rights reserved.

  16. Gene duplications in prokaryotes can be associated with environmental adaptation.

    PubMed

    Bratlie, Marit S; Johansen, Jostein; Sherman, Brad T; Huang, Da Wei; Lempicki, Richard A; Drabløs, Finn

    2010-10-20

    Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. Paralogs and singletons dominate

  17. Evolution of Homospermidine Synthase in the Convolvulaceae: A Story of Gene Duplication, Gene Loss, and Periods of Various Selection Pressures[C][W][OA

    PubMed Central

    Kaltenegger, Elisabeth; Eich, Eckart; Ober, Dietrich

    2013-01-01

    Homospermidine synthase (HSS), the first pathway-specific enzyme of pyrrolizidine alkaloid biosynthesis, is known to have its origin in the duplication of a gene encoding deoxyhypusine synthase. To study the processes that followed this gene duplication event and gave rise to HSS, we identified sequences encoding HSS and deoxyhypusine synthase from various species of the Convolvulaceae. We show that HSS evolved only once in this lineage. This duplication event was followed by several losses of a functional gene copy attributable to gene loss or pseudogenization. Statistical analyses of sequence data suggest that, in those lineages in which the gene copy was successfully recruited as HSS, the gene duplication event was followed by phases of various selection pressures, including purifying selection, relaxed functional constraints, and possibly positive Darwinian selection. Site-specific mutagenesis experiments have confirmed that the substitution of sites predicted to be under positive Darwinian selection is sufficient to convert a deoxyhypusine synthase into a HSS. In addition, analyses of transcript levels have shown that HSS and deoxyhypusine synthase have also diverged with respect to their regulation. The impact of protein–protein interaction on the evolution of HSS is discussed with respect to current models of enzyme evolution. PMID:23572540

  18. Cheetahs have 4 serum amyloid a genes evolved through repeated duplication events.

    PubMed

    Chen, Lei; Une, Yumi; Higuchi, Keiichi; Mori, Masayuki

    2012-01-01

    Amyloid A (AA) amyloidosis is a leading cause of mortality in captive cheetahs (Acinonyx jubatus). We performed genome walking and PCR cloning and revealed that cheetahs have 4 SAA genes (provisionally named SAA1A, SAA1B, SAA3A, and SAA3B). In addition, we identified multiple nucleotide polymorphisms in the 4 SAA genes by screening 51 cheetahs. The polymorphisms defined 4, 7, 6, and 4 alleles for SAA1A, SAA3A, SAA1B, and SAA3B, respectively. Pedigree analysis of the inheritance of genotypes for the SAA genes revealed that specific combinations of alleles for the 4 SAA genes cosegregated as a unit (haplotype) in pedigrees, indicating that the 4 genes were linked on the same chromosome. Notably, cheetah SAA1A and SAA1B were highly homologous in their nucleotide sequences. Likewise, SAA3A and SAA3B genes were homologous. These observations suggested a model for the evolution of the 4 SAA genes in cheetahs in which duplication of an ancestral SAA gene first gave rise to SAA1 and SAA3. Subsequently, each gene duplicated one more time, uniquely making 4 genes in the cheetah genome. The monomorphism of the cheetah SAA1A protein might be one of the factors responsible for the high incidence of AA amyloidosis in this species.

  19. Comparative Transcriptome Analyses Reveal Core Parasitism Genes and Suggest Gene Duplication and Repurposing as Sources of Structural Novelty

    PubMed Central

    Yang, Zhenzhen; Wafula, Eric K.; Honaas, Loren A.; Zhang, Huiting; Das, Malay; Fernandez-Aparicio, Monica; Huang, Kan; Bandaranayake, Pradeepa C.G.; Wu, Biao; Der, Joshua P.; Clarke, Christopher R.; Ralph, Paula E.; Landherr, Lena; Altman, Naomi S.; Timko, Michael P.; Yoder, John I.; Westwood, James H.; dePamphilis, Claude W.

    2015-01-01

    The origin of novel traits is recognized as an important process underlying many major evolutionary radiations. We studied the genetic basis for the evolution of haustoria, the novel feeding organs of parasitic flowering plants, using comparative transcriptome sequencing in three species of Orobanchaceae. Around 180 genes are upregulated during haustorial development following host attachment in at least two species, and these are enriched in proteases, cell wall modifying enzymes, and extracellular secretion proteins. Additionally, about 100 shared genes are upregulated in response to haustorium inducing factors prior to host attachment. Collectively, we refer to these newly identified genes as putative “parasitism genes.” Most of these parasitism genes are derived from gene duplications in a common ancestor of Orobanchaceae and Mimulus guttatus, a related nonparasitic plant. Additionally, the signature of relaxed purifying selection and/or adaptive evolution at specific sites was detected in many haustorial genes, and may play an important role in parasite evolution. Comparative analysis of gene expression patterns in parasitic and nonparasitic angiosperms suggests that parasitism genes are derived primarily from root and floral tissues, but with some genes co-opted from other tissues. Gene duplication, often taking place in a nonparasitic ancestor of Orobanchaceae, followed by regulatory neofunctionalization, was an important process in the origin of parasitic haustoria. PMID:25534030

  20. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    PubMed

    Guo, Yong; Qiu, Li-Juan

    2013-01-01

    The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  1. A local duplication of the Melanocortin receptor 1 locus in Astyanax

    PubMed Central

    Gross, Joshua B.; Weagley, James; Stahl, Bethany A.; Ma, Li; Espinasa, Luis; McGaugh, Suzanne E.

    2017-01-01

    In this study, we report evidence of a novel duplication of Melanocortin receptor 1 (Mc1r) in the cavefish genome. This locus was discovered following the observation of excessive allelic diversity in a ~820 bp fragment of Mc1r amplified via degenerate PCR from a natural population of Astyanax aeneus fish from Guerrero, Mexico. The cavefish genome reveals the presence of two closely related Mc1r open reading frames separated by a 1.46 kb intergenic region. One open reading frame corresponds to the previously reported Mc1r receptor, and the other open reading frame (duplicate copy) is 975 bp in length, encoding a receptor of 325 amino acids. Sequence similarity analyses position both copies in the syntenic region of the single Mc1r locus in 16 representative craniate genomes spanning bony fish (including Astyanax) to mammals, suggesting we discovered tandem duplicates of this important gene. The two Mc1r copies share ~89% sequence similarity, and, within Astyanax, are more similar to one another compared to other melanocortin family members. Future studies will inform the precise functional significance of the duplicated Mc1r locus, and if this novel copy number variant may have adaptive significance for the Astyanax lineage. PMID:28738163

  2. Adaptations to High Salt in a Halophilic Protist: Differential Expression and Gene Acquisitions through Duplications and Gene Transfers

    PubMed Central

    Harding, Tommy; Roger, Andrew J.; Simpson, Alastair G. B.

    2017-01-01

    The capacity of halophiles to thrive in extreme hypersaline habitats derives partly from the tight regulation of ion homeostasis, the salt-dependent adjustment of plasma membrane fluidity, and the increased capability to manage oxidative stress. Halophilic bacteria, and archaea have been intensively studied, and substantial research has been conducted on halophilic fungi, and the green alga Dunaliella. By contrast, there have been very few investigations of halophiles that are phagotrophic protists, i.e., protozoa. To gather fundamental knowledge about salt adaptation in these organisms, we studied the transcriptome-level response of Halocafeteria seosinensis (Stramenopiles) grown under contrasting salinities. We provided further evolutionary context to our analysis by identifying genes that underwent recent duplications. Genes that were highly responsive to salinity variations were involved in stress response (e.g., chaperones), ion homeostasis (e.g., Na+/H+ transporter), metabolism and transport of lipids (e.g., sterol biosynthetic genes), carbohydrate metabolism (e.g., glycosidases), and signal transduction pathways (e.g., transcription factors). A significantly high proportion (43%) of duplicated genes were also differentially expressed, accentuating the importance of gene expansion in adaptation by H. seosinensis to high salt environments. Furthermore, we found two genes that were lateral acquisitions from bacteria, and were also highly up-regulated and highly expressed at high salt, suggesting that this evolutionary mechanism could also have facilitated adaptation to high salt. We propose that a transition toward high-salt adaptation in the ancestors of H. seosinensis required the acquisition of new genes via duplication, and some lateral gene transfers (LGTs), as well as the alteration of transcriptional programs, leading to increased stress resistance, proper establishment of ion gradients, and modification of cell structure properties like membrane

  3. Comparative and evolutionary analysis of the HES/HEY gene family reveal exon/intron loss and teleost specific duplication events.

    PubMed

    Zhou, Mi; Yan, Jun; Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan

    2012-01-01

    HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication.

  4. Whole-Gene Positive Selection, Elevated Synonymous Substitution Rates, Duplication, and Indel Evolution of the Chloroplast clpP1 Gene

    PubMed Central

    Erixon, Per; Oxelman, Bengt

    2008-01-01

    Background Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. Methodology/Principle Findings We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family) and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family). Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying) selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. Conclusions/Significance We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the controversial issue

  5. Identification of a duplication within the GDF9 gene and novel candidate genes for primary ovarian insufficiency (POI) by a customized high-resolution array comparative genomic hybridization platform.

    PubMed

    Norling, A; Hirschberg, A L; Rodriguez-Wallberg, K A; Iwarsson, E; Wedell, A; Barbaro, M

    2014-08-01

    Can high-resolution array comparative genomic hybridization (CGH) analysis of DNA samples from women with primary ovarian insufficiency (POI) improve the diagnosis of the condition and identify novel candidate genes for POI? A mutation affecting the regulatory region of growth differentiation factor 9 (GDF9) was identified for the first time together with several novel candidate genes for POI. Most patients with POI do not receive a molecular diagnosis despite a significant genetic component in the pathogenesis. We performed a case-control study. Twenty-six patients were analyzed by array CGH for identification of copy number variants. Novel changes were investigated in 95 controls and in a separate population of 28 additional patients with POI. The experimental procedures were performed during a 1-year period. DNA samples from 26 patients with POI were analyzed by a customized 1M array-CGH platform with whole genome coverage and probe enrichment targeting 78 genes in sex development. By PCR amplification and sequencing, the breakpoint of an identified partial GDF9 gene duplication was characterized. A multiplex ligation-dependent probe amplification (MLPA) probe set for specific identification of deletions/duplications affecting GDF9 was developed. An MLPA probe set for the identification of additional cases or controls carrying novel candidate regions identified by array-CGH was developed. Sequencing of three candidate genes was performed. Eleven unique copy number changes were identified in a total of 11 patients, including a tandem duplication of 475 bp, containing part of the GDF9 gene promoter region. The duplicated region contains three NOBOX-binding elements and an E-box, important for GDF9 gene regulation. This aberration is likely causative of POI. Fifty-four patients were investigated for copy number changes within GDF9, but no additional cases were found. Ten aberrations constituting novel candidate regions were detected, including a second DNAH6

  6. Expansion by whole genome duplication and evolution of the sox gene family in teleost fish

    PubMed Central

    Naville, Magali; Volff, Jean-Nicolas

    2017-01-01

    It is now recognized that several rounds of whole genome duplication (WGD) have occurred during the evolution of vertebrates, but the link between WGDs and phenotypic diversification remains unsolved. We have investigated in this study the impact of the teleost-specific WGD on the evolution of the sox gene family in teleostean fishes. The sox gene family, which encodes for transcription factors, has essential role in morphology, physiology and behavior of vertebrates and teleosts, the current largest group of vertebrates. We have first redrawn the evolution of all sox genes identified in eleven teleost genomes using a comparative genomic approach including phylogenetic and synteny analyses. We noticed, compared to tetrapods, an important expansion of the sox family: 58% (11/19) of sox genes are duplicated in teleost genomes. Furthermore, all duplicated sox genes, except sox17 paralogs, are derived from the teleost-specific WGD. Then, focusing on five sox genes, analyzing the evolution of coding and non-coding sequences, as well as the expression patterns in fish embryos and adult tissues, we demonstrated that these paralogs followed lineage-specific evolutionary trajectories in teleost genomes. This work, based on whole genome data from multiple teleostean species, supports the contribution of WGDs to the expansion of gene families, as well as to the emergence of genomic differences between lineages that might promote genetic and phenotypic diversity in teleosts. PMID:28738066

  7. Sorting cancer karyotypes using double-cut-and-joins, duplications and deletions.

    PubMed

    Zeira, Ron; Shamir, Ron

    2018-05-03

    Problems of genome rearrangement are central in both evolution and cancer research. Most genome rearrangement models assume that the genome contains a single copy of each gene and the only changes in the genome are structural, i.e., reordering of segments. In contrast, tumor genomes also undergo numerical changes such as deletions and duplications, and thus the number of copies of genes varies. Dealing with unequal gene content is a very challenging task, addressed by few algorithms to date. More realistic models are needed to help trace genome evolution during tumorigenesis. Here we present a model for the evolution of genomes with multiple gene copies using the operation types double-cut-and-joins, duplications and deletions. The events supported by the model are reversals, translocations, tandem duplications, segmental deletions, and chromosomal amplifications and deletions, covering most types of structural and numerical changes observed in tumor samples. Our goal is to find a series of operations of minimum length that transform one karyotype into the other. We show that the problem is NP-hard and give an integer linear programming formulation that solves the problem exactly under some mild assumptions. We test our method on simulated genomes and on ovarian cancer genomes. Our study advances the state of the art in two ways: It allows a broader set of operations than extant models, thus being more realistic, and it is the first study attempting to reconstruct the full sequence of structural and numerical events during cancer evolution. Code and data are available in https://github.com/Shamir-Lab/Sorting-Cancer-Karyotypes. ronzeira@post.tau.ac.il, rshamir@tau.ac.il. Supplementary data are available at Bioinformatics online.

  8. Evolutionary history of glucose-6-phosphatase encoding genes in vertebrate lineages: towards a better understanding of the functions of multiple duplicates.

    PubMed

    Marandel, Lucie; Panserat, Stéphane; Plagnes-Juan, Elisabeth; Arbenoits, Eva; Soengas, José Luis; Bobe, Julien

    2017-05-02

    Glucose-6-phosphate (G6pc) is a key enzyme involved in the regulation of the glucose homeostasis. The present study aims at revisiting and clarifying the evolutionary history of g6pc genes in vertebrates. g6pc duplications happened by successive rounds of whole genome duplication that occurred during vertebrate evolution. g6pc duplicated before or around Osteichthyes/Chondrichthyes radiation, giving rise to g6pca and g6pcb as a consequence of the second vertebrate whole genome duplication. g6pca was lost after this duplication in Sarcopterygii whereas both g6pca and g6pcb then duplicated as a consequence of the teleost-specific whole genome duplication. One g6pca duplicate was lost after this duplication in teleosts. Similarly one g6pcb2 duplicate was lost at least in the ancestor of percomorpha. The analysis of the evolution of spatial expression patterns of g6pc genes in vertebrates showed that all g6pc were mainly expressed in intestine and liver whereas teleost-specific g6pcb2 genes were mainly and surprisingly expressed in brain and heart. g6pcb2b, one gene previously hypothesised to be involved in the glucose intolerant phenotype in trout, was unexpectedly up-regulated (as it was in liver) by carbohydrates in trout telencephalon without showing significant changes in other brain regions. This up-regulation is in striking contrast with expected glucosensing mechanisms suggesting that its positive response to glucose relates to specific unknown processes in this brain area. Our results suggested that the fixation and the divergence of g6pc duplicated genes during vertebrates' evolution may lead to adaptive novelty and probably to the emergence of novel phenotypes related to glucose homeostasis.

  9. Sequence divergence in the 3'-untranslated region has an effect on the subfunctionalization of duplicate genes.

    PubMed

    Tong, Ying; Zheng, Kang; Zhao, Shufang; Xiao, Guanxiu; Luo, Chen

    2012-11-01

    Recent studies demonstrated that sequence divergence in both transcriptional regulatory region and coding region contributes to the subfunctionalization of duplicate gene. However, whether sequence divergence in the 3'-untranslated region (3'-UTR) has an impact on the subfunctionalization of duplicate genes remains unclear. Here, we identified two diverging duplicate vsx1 (visual system homeobox-1) loci in goldfish, named vsx1A1 and vsx1A2. Phylogenetic analysis suggests that vsx1A1 and vsx1A2 may arise from a duplication of vsx1 after the separation of goldfish and zebrafish. Sequence comparison revealed that divergence in both transcriptional and translational regulatory regions is higher than divergence in the introns. vsx1A2 expresses during blastula and gastrula stages and in adult retina but silences from segmentation stage to hatching stage, vsx1A1 starts expression from segmentation onward. Comparing to that zebrafish vsx1 expresses in all the developmental stages and in the adult retina, it appears that goldfish vsx1A1 and vsx1A2 are under going to share the functions of ancestral vsx1. The different but overlapping temporal expression patterns of vsx1A1 and vsx1A2 suggest that sequence divergence in the promoter region of duplicate vsx1 is not sufficient for partitioning the functions of ancestral vsx1. By comparing vsx1A1 and vsx1A2 3'-UTR-linked green fluorescent protein gene expression patterns, we demonstrated that the 3'-UTR of vsx1A1 remains but the 3'-UTR of vsx1A2 has lost the capability of mediating bipolar cell specific expression during retina development. These results indicate that sequence divergence in the 3'-UTRs has a clear effect on subfunctionalization of the duplicate genes. © 2012 WILEY PERIODICALS, INC.

  10. Duplication polymorphisms in exon 4 of κ-casein gene in yak breeds/populations.

    PubMed

    Pingcuo, S; Gao, J; Jiang, Z R; Jin, S Y; Fu, C Y; Liu, X; Huang, L; Zheng, Y C

    2015-08-28

    The objective of this study was to compare 12 bp-duplication polymorphisms in exon 4 of the κ-casein gene among 3 breeds/populations of yak (Bos grunniens). Genomic DNA was extracted from yak blood or muscle samples (N = 211) and a partial sequence of exon 4 of κ-casein gene was amplified by polymerase chain reaction. A polyacrylamide gel electrophoresis assay of the products (169 bp) revealed 2 variants. These variants differed in a 12-bp duplication of the nucleotide sequence corresponding to amino acids 147-150 (Glu-Ala-Ser-Pro) or 148-151 (Ala-Ser-Pro-Glu). The genotype frequency and gene frequency of the 2 κ-casein variants differed among the 3 yak breeds/populations. The long form of the κ-casein gene was the predominant allele, and the Jiulong yak showed the highest frequency of the short form variant of the κ-casein gene. In addition, 2 nucleotide differences resulting in amino acid substitutions were also identified in yaks. These results are significant for designing a breeding strategy to improve the genetic makeup of yak herds.

  11. A Tandem Duplicate of Anti-Müllerian Hormone with a Missense SNP on the Y Chromosome Is Essential for Male Sex Determination in Nile Tilapia, Oreochromis niloticus

    PubMed Central

    Li, Minghui; Sun, Yunlv; Zhao, Jiue; Shi, Hongjuan; Zeng, Sheng; Ye, Kai; Jiang, Dongneng; Zhou, Linyan; Sun, Lina; Tao, Wenjing; Nagahama, Yoshitaka; Kocher, Thomas D.; Wang, Deshou

    2015-01-01

    Variation in the TGF-β signaling pathway is emerging as an important mechanism by which gonadal sex determination is controlled in teleosts. Here we show that amhy, a Y-specific duplicate of the anti-Müllerian hormone (amh) gene, induces male sex determination in Nile tilapia. amhy is a tandem duplicate located immediately downstream of amhΔ-y on the Y chromosome. The coding sequence of amhy was identical to the X-linked amh (amh) except a missense SNP (C/T) which changes an amino acid (Ser/Leu92) in the N-terminal region. amhy lacks 5608 bp of promoter sequence that is found in the X-linked amh homolog. The amhΔ-y contains several insertions and deletions in the promoter region, and even a 5 bp insertion in exonVI that results in a premature stop codon and thus a truncated protein product lacking the TGF-β binding domain. Both amhy and amhΔ-y expression is restricted to XY gonads from 5 days after hatching (dah) onwards. CRISPR/Cas9 knockout of amhy in XY fish resulted in male to female sex reversal, while mutation of amhΔ-y alone could not. In contrast, overexpression of Amhy in XX fish, using a fosmid transgene that carries the amhy/amhΔ-y haplotype or a vector containing amhy ORF under the control of CMV promoter, resulted in female to male sex reversal, while overexpression of AmhΔ-y alone in XX fish could not. Knockout of the anti-Müllerian hormone receptor type II (amhrII) in XY fish also resulted in 100% complete male to female sex reversal. Taken together, these results strongly suggest that the duplicated amhy with a missense SNP is the candidate sex determining gene and amhy/amhrII signal is essential for male sex determination in Nile tilapia. These findings highlight the conserved roles of TGF-β signaling pathway in fish sex determination. PMID:26588702

  12. Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species

    PubMed Central

    Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; Kelly, Steven

    2016-01-01

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. Key words: C4 photosynthesis, gene duplication, gene families, parallel evolution. PMID:27016024

  13. Recurrent duplications of the annexin A1 gene (ANXA1) in autism spectrum disorders.

    PubMed

    Correia, Catarina T; Conceição, Inês C; Oliveira, Bárbara; Coelho, Joana; Sousa, Inês; Sequeira, Ana F; Almeida, Joana; Café, Cátia; Duque, Frederico; Mouga, Susana; Roberts, Wendy; Gao, Kun; Lowe, Jennifer K; Thiruvahindrapuram, Bhooma; Walker, Susan; Marshall, Christian R; Pinto, Dalila; Nurnberger, John I; Scherer, Stephen W; Geschwind, Daniel H; Oliveira, Guiomar; Vicente, Astrid M

    2014-04-10

    Validating the potential pathogenicity of copy number variants (CNVs) identified in genome-wide studies of autism spectrum disorders (ASD) requires detailed assessment of case/control frequencies, inheritance patterns, clinical correlations, and functional impact. Here, we characterize a small recurrent duplication in the annexin A1 (ANXA1) gene, identified by the Autism Genome Project (AGP) study. From the AGP CNV genomic screen in 2,147 ASD individuals, we selected for characterization an ANXA1 gene duplication that was absent in 4,964 population-based controls. We further screened the duplication in a follow-up sample including 1,496 patients and 410 controls, and evaluated clinical correlations and family segregation. Sequencing of exonic/downstream ANXA1 regions was performed in 490 ASD patients for identification of additional variants. The ANXA1 duplication, overlapping the last four exons and 3'UTR region, had an overall prevalence of 11/3,643 (0.30%) in unrelated ASD patients but was not identified in 5,374 controls. Duplication carriers presented no distinctive clinical phenotype. Family analysis showed neuropsychiatric deficits and ASD traits in multiple relatives carrying the duplication, suggestive of a complex genetic inheritance. Sequencing of exonic regions and the 3'UTR identified 11 novel changes, but no obvious variants with clinical significance. We provide multilevel evidence for a role of ANXA1 in ASD etiology. Given its important role as mediator of glucocorticoid function in a wide variety of brain processes, including neuroprotection, apoptosis, and control of the neuroendocrine system, the results add ANXA1 to the growing list of rare candidate genetic etiological factors for ASD.

  14. A rare FANCA gene variation as a breast cancer susceptibility allele in an Iranian population

    PubMed Central

    Abbasi, Sakineh; Rasouli, Mina

    2017-01-01

    Fanconi Anemia (FA) is an autosomal recessive syndrome characterized by congenital abnormalities, progressive bone marrow failure and Fanconi anemia complementation group A (FANCA) is also a potential breast and ovarian cancer susceptibility gene. A novel allele with tandem duplication of 13 base pair sequence in promoter region was identified. To investigate whether the 13 base pair sequence of tandem duplication in promoter region of the FANCA gene is of high penetrance in patients with breast cancer and to determine if the presence of the duplicated allele was associated with an altered risk of breast cancer, the present study screened DNA in blood samples from 304 breast cancer patients and 295 normal individuals as controls. The duplication allele had a frequency of 35.4 and 21.2% in patients with breast cancer and normal controls, respectively. There was a significant increase in the frequency of the duplication allele in patients with familial breast cancer compared with controls (45.1%, P=0.001). Furthermore, the estimated risk of breast cancer in individuals with a homozygote [odds ratio (OR), 4.093; 95% confidence intervals (CI), 1.957–8.561] or heterozygote duplicated genotype (OR, 3.315; 95% CI, 1.996–5.506) was higher compared with the corresponding normal homozygote genotype. In conclusion, the present study indicated that the higher the frequency of the duplicated allele, the higher the risk of breast cancer. To the best of our knowledge, the present study is the first to report FANCA gene duplication in patients with breast cancer. PMID:28440412

  15. A rare FANCA gene variation as a breast cancer susceptibility allele in an Iranian population.

    PubMed

    Abbasi, Sakineh; Rasouli, Mina

    2017-06-01

    Fanconi Anemia (FA) is an autosomal recessive syndrome characterized by congenital abnormalities, progressive bone marrow failure and Fanconi anemia complementation group A (FANCA) is also a potential breast and ovarian cancer susceptibility gene. A novel allele with tandem duplication of 13 base pair sequence in promoter region was identified. To investigate whether the 13 base pair sequence of tandem duplication in promoter region of the FANCA gene is of high penetrance in patients with breast cancer and to determine if the presence of the duplicated allele was associated with an altered risk of breast cancer, the present study screened DNA in blood samples from 304 breast cancer patients and 295 normal individuals as controls. The duplication allele had a frequency of 35.4 and 21.2% in patients with breast cancer and normal controls, respectively. There was a significant increase in the frequency of the duplication allele in patients with familial breast cancer compared with controls (45.1%, P=0.001). Furthermore, the estimated risk of breast cancer in individuals with a homozygote [odds ratio (OR), 4.093; 95% confidence intervals (CI), 1.957‑8.561] or heterozygote duplicated genotype (OR, 3.315; 95% CI, 1.996‑5.506) was higher compared with the corresponding normal homozygote genotype. In conclusion, the present study indicated that the higher the frequency of the duplicated allele, the higher the risk of breast cancer. To the best of our knowledge, the present study is the first to report FANCA gene duplication in patients with breast cancer.

  16. Tandemly arranged chalcone synthase A genes contribute to the spatially regulated expression of siRNA and the natural bicolor floral phenotype in Petunia hybrida.

    PubMed

    Morita, Yasumasa; Saito, Ryoko; Ban, Yusuke; Tanikawa, Natsu; Kuchitsu, Kazuyuki; Ando, Toshio; Yoshikawa, Manabu; Habu, Yoshiki; Ozeki, Yoshihiro; Nakayama, Masayoshi

    2012-06-01

    The natural bicolor floral traits of the horticultural petunia (Petunia hybrida) cultivars Picotee and Star are caused by the spatial repression of the chalcone synthase A (CHS-A) gene, which encodes an anthocyanin biosynthetic enzyme. Here we show that Picotee and Star petunias carry the same short interfering RNA (siRNA)-producing locus, consisting of two intact CHS-A copies, PhCHS-A1 and PhCHS-A2, in a tandem head-to-tail orientation. The precursor CHS mRNAs are transcribed from the two CHS-A copies throughout the bicolored petals, but the mature CHS mRNAs are not found in the white tissues. An analysis of small RNAs revealed the accumulation of siRNAs of 21 nucleotides that originated from the exon 2 region of both CHS-A copies. This accumulation is closely correlated with the disappearance of the CHS mRNAs, indicating that the bicolor floral phenotype is caused by the spatially regulated post-transcriptional silencing of both CHS-A genes. Linkage between the tandemly arranged CHS-A allele and the bicolor floral trait indicates that the CHS-A allele is a necessary factor to confer the trait. We suppose that the spatially regulated production of siRNAs in Picotee and Star flowers is triggered by another putative regulatory locus, and that the silencing mechanism in this case may be different from other known mechanisms of post-transcriptional gene silencing in plants. A sequence analysis of wild Petunia species indicated that these tandem CHS-A genes originated from Petunia integrifolia and/or Petunia inflata, the parental species of P. hybrida, as a result of a chromosomal rearrangement rather than a gene duplication event. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  17. Gene duplications in prokaryotes can be associated with environmental adaptation

    PubMed Central

    2010-01-01

    Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism

  18. Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate

    PubMed Central

    Adomako-Ankomah, Yaw; English, Elizabeth D.; Danielson, Jeffrey J.; Pernas, Lena F.; Parker, Michelle L.; Boulanger, Martin J.; Dubey, Jitender P.; Boyle, Jon P.

    2016-01-01

    In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum. Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA+ paralogs. Additionally, we found that exogenous expression of an HMA+ paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. PMID:26920761

  19. Clinical and molecular characterization of duplications encompassing the human SHOX gene reveal a variable effect on stature.

    PubMed

    Thomas, N Simon; Harvey, John F; Bunyan, David J; Rankin, Julia; Grigelioniene, Giedre; Bruno, Damien L; Tan, Tiong Y; Tomkins, Susan; Hastings, Robert

    2009-07-01

    Deletions of the SHOX gene are well documented and cause disproportionate short stature and variable skeletal abnormalities. In contrast interstitial SHOX duplications limited to PAR1 appear to be very rare and the clinical significance of the only case report in the literature is unclear. Mapping of this duplication has now shown that it includes the entire SHOX gene but little flanking sequence and so will not encompass any of the long-range enhancers required for SHOX transcription. We now describe the clinical and molecular characterization of three additional cases. The duplications all included the SHOX coding sequence but varied in the amount of flanking sequence involved. The probands were ascertained for a variety of reasons: hypotonia and features of Asperger syndrome, Leri-Weill dyschondrosteosis (LWD), and a family history of cleft palate. However, the presence of a duplication did not correlate with any of these features or with evidence of skeletal abnormality. Remarkably, the proband with LWD had inherited both a SHOX deletion and a duplication. The effect of the duplications on stature was variable: height appeared to be elevated in some carriers, particularly in those with the largest duplications, but was still within the normal range. SHOX duplications are likely to be under ascertained and more cases need to be identified and characterized in detail in order to accurately determine their phenotypic consequences.

  20. Atlantic salmon populations reveal adaptive divergence of immune related genes - a duplicated genome under selection.

    PubMed

    Kjærner-Semb, Erik; Ayllon, Fernando; Furmanek, Tomasz; Wennevik, Vidar; Dahle, Geir; Niemelä, Eero; Ozerov, Mikhail; Vähä, Juha-Pekka; Glover, Kevin A; Rubin, Carl J; Wargelius, Anna; Edvardsen, Rolf B

    2016-08-11

    Populations of Atlantic salmon display highly significant genetic differences with unresolved molecular basis. These differences may result from separate postglacial colonization patterns, diversifying natural selection and adaptation, or a combination. Adaptation could be influenced or even facilitated by the recent whole genome duplication in the salmonid lineage which resulted in a partly tetraploid species with duplicated genes and regions. In order to elucidate the genes and genomic regions underlying the genetic differences, we conducted a genome wide association study using whole genome resequencing data from eight populations from Northern and Southern Norway. From a total of ~4.5 million sequencing-derived SNPs, more than 10 % showed significant differentiation between populations from these two regions and ten selective sweeps on chromosomes 5, 10, 11, 13-15, 21, 24 and 25 were identified. These comprised 59 genes, of which 15 had one or more differentiated missense mutation. Our analysis showed that most sweeps have paralogous regions in the partially tetraploid genome, each lacking the high number of significant SNPs found in the sweeps. The most significant sweep was found on Chr 25 and carried several missense mutations in the antiviral mx genes, suggesting that these populations have experienced differing viral pressures. Interestingly the second most significant sweep, found on Chr 5, contains two genes involved in the NF-KB pathway (nkap and nkrf), which is also a known pathogen target that controls a large number of processes in animals. Our results show that natural selection acting on immune related genes has contributed to genetic divergence between salmon populations in Norway. The differences between populations may have been facilitated by the plasticity of the salmon genome. The observed signatures of selection in duplicated genomic regions suggest that the recently duplicated genome has provided raw material for evolutionary adaptation.

  1. Comparative and Evolutionary Analysis of the HES/HEY Gene Family Reveal Exon/Intron Loss and Teleost Specific Duplication Events

    PubMed Central

    Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan

    2012-01-01

    Background HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. Methods and Findings In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Conclusions Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and

  2. Differential retention of metabolic genes following whole-genome duplication.

    PubMed

    Gout, Jean-François; Duret, Laurent; Kahn, Daniel

    2009-05-01

    Classical studies in Metabolic Control Theory have shown that metabolic fluxes usually exhibit little sensitivity to changes in individual enzyme activity, yet remain sensitive to global changes of all enzymes in a pathway. Therefore, little selective pressure is expected on the dosage or expression of individual metabolic genes, yet entire pathways should still be constrained. However, a direct estimate of this selective pressure had not been evaluated. Whole-genome duplications (WGDs) offer a good opportunity to address this question by analyzing the fates of metabolic genes during the massive gene losses that follow. Here, we take advantage of the successive rounds of WGD that occurred in the Paramecium lineage. We show that metabolic genes exhibit different gene retention patterns than nonmetabolic genes. Contrary to what was expected for individual genes, metabolic genes appeared more retained than other genes after the recent WGD, which was best explained by selection for gene expression operating on entire pathways. Metabolic genes also tend to be less retained when present at high copy number before WGD, contrary to other genes that show a positive correlation between gene retention and preduplication copy number. This is rationalized on the basis of the classical concave relationship relating metabolic fluxes with enzyme expression.

  3. Independent and Parallel Evolution of New Genes by Gene Duplication in Two Origins of C4 Photosynthesis Provides New Insight into the Mechanism of Phloem Loading in C4 Species.

    PubMed

    Emms, David M; Covshoff, Sarah; Hibberd, Julian M; Kelly, Steven

    2016-07-01

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes is enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Furthermore, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species. C4 photosynthesis, gene duplication, gene families, parallel evolution. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  4. Recurrent duplications of the annexin A1 gene (ANXA1) in autism spectrum disorders

    PubMed Central

    2014-01-01

    Background Validating the potential pathogenicity of copy number variants (CNVs) identified in genome-wide studies of autism spectrum disorders (ASD) requires detailed assessment of case/control frequencies, inheritance patterns, clinical correlations, and functional impact. Here, we characterize a small recurrent duplication in the annexin A1 (ANXA1) gene, identified by the Autism Genome Project (AGP) study. Methods From the AGP CNV genomic screen in 2,147 ASD individuals, we selected for characterization an ANXA1 gene duplication that was absent in 4,964 population-based controls. We further screened the duplication in a follow-up sample including 1,496 patients and 410 controls, and evaluated clinical correlations and family segregation. Sequencing of exonic/downstream ANXA1 regions was performed in 490 ASD patients for identification of additional variants. Results The ANXA1 duplication, overlapping the last four exons and 3’UTR region, had an overall prevalence of 11/3,643 (0.30%) in unrelated ASD patients but was not identified in 5,374 controls. Duplication carriers presented no distinctive clinical phenotype. Family analysis showed neuropsychiatric deficits and ASD traits in multiple relatives carrying the duplication, suggestive of a complex genetic inheritance. Sequencing of exonic regions and the 3’UTR identified 11 novel changes, but no obvious variants with clinical significance. Conclusions We provide multilevel evidence for a role of ANXA1 in ASD etiology. Given its important role as mediator of glucocorticoid function in a wide variety of brain processes, including neuroprotection, apoptosis, and control of the neuroendocrine system, the results add ANXA1 to the growing list of rare candidate genetic etiological factors for ASD. PMID:24720851

  5. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae.

    PubMed

    Shen, Danyu; Liu, Tingli; Ye, Wenwu; Liu, Li; Liu, Peihan; Wu, Yuren; Wang, Yuanchao; Dou, Daolong

    2013-01-01

    Phytophthora and other oomycetes secrete a large number of putative host cytoplasmic effectors with conserved FLAK motifs following signal peptides, termed crinkling and necrosis inducing proteins (CRN), or Crinkler. Here, we first investigated the evolutionary patterns and mechanisms of CRN effectors in Phytophthora sojae and compared them to two other Phytophthora species. The genes encoding CRN effectors could be divided into 45 orthologous gene groups (OGG), and most OGGs unequally distributed in the three species, in which each underwent large number of gene gains or losses, indicating that the CRN genes expanded after species evolution in Phytophthora and evolved through pathoadaptation. The 134 expanded genes in P. sojae encoded family proteins including 82 functional genes and expressed at higher levels while the other 68 genes encoding orphan proteins were less expressed and contained 50 pseudogenes. Furthermore, we demonstrated that most expanded genes underwent gene duplication or/and fragment recombination. Three different mechanisms that drove gene duplication or recombination were identified. Finally, the expanded CRN effectors exhibited varying pathogenic functions, including induction of programmed cell death (PCD) and suppression of PCD through PAMP-triggered immunity or/and effector-triggered immunity. Overall, these results suggest that gene duplication and fragment recombination may be two mechanisms that drive the expansion and neofunctionalization of the CRN family in P. sojae, which aids in understanding the roles of CRN effectors within each oomycete pathogen.

  6. Comparative genomics of ParaHox clusters of teleost fishes: gene cluster breakup and the retention of gene sets following whole genome duplications

    PubMed Central

    Siegel, Nicol; Hoegg, Simone; Salzburger, Walter; Braasch, Ingo; Meyer, Axel

    2007-01-01

    Background The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. Results We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. Conclusion There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular – but possibly clusters of genes more generally – might be linked to the presence of promoter, enhancer or inhibitor

  7. Differential accumulation of retroelements and diversification of NB-LRR disease resistance genes in duplicated regions following polyploidy in the ancestor of soybean.

    PubMed

    Innes, Roger W; Ameline-Torregrosa, Carine; Ashfield, Tom; Cannon, Ethalinda; Cannon, Steven B; Chacko, Ben; Chen, Nicolas W G; Couloux, Arnaud; Dalwani, Anita; Denny, Roxanne; Deshpande, Shweta; Egan, Ashley N; Glover, Natasha; Hans, Christian S; Howell, Stacy; Ilut, Dan; Jackson, Scott; Lai, Hongshing; Mammadov, Jafar; Del Campo, Sara Martin; Metcalf, Michelle; Nguyen, Ashley; O'Bleness, Majesta; Pfeil, Bernard E; Podicheti, Ram; Ratnaparkhe, Milind B; Samain, Sylvie; Sanders, Iryna; Ségurens, Béatrice; Sévignac, Mireille; Sherman-Broyles, Sue; Thareau, Vincent; Tucker, Dominic M; Walling, Jason; Wawrzynski, Adam; Yi, Jing; Doyle, Jeff J; Geffroy, Valérie; Roe, Bruce A; Maroof, M A Saghai; Young, Nevin D

    2008-12-01

    The genomes of most, if not all, flowering plants have undergone whole genome duplication events during their evolution. The impact of such polyploidy events is poorly understood, as is the fate of most duplicated genes. We sequenced an approximately 1 million-bp region in soybean (Glycine max) centered on the Rpg1-b disease resistance gene and compared this region with a region duplicated 10 to 14 million years ago. These two regions were also compared with homologous regions in several related legume species (a second soybean genotype, Glycine tomentella, Phaseolus vulgaris, and Medicago truncatula), which enabled us to determine how each of the duplicated regions (homoeologues) in soybean has changed following polyploidy. The biggest change was in retroelement content, with homoeologue 2 having expanded to 3-fold the size of homoeologue 1. Despite this accumulation of retroelements, over 77% of the duplicated low-copy genes have been retained in the same order and appear to be functional. This finding contrasts with recent analyses of the maize (Zea mays) genome, in which only about one-third of duplicated genes appear to have been retained over a similar time period. Fluorescent in situ hybridization revealed that the homoeologue 2 region is located very near a centromere. Thus, pericentromeric localization, per se, does not result in a high rate of gene inactivation, despite greatly accelerated retrotransposon accumulation. In contrast to low-copy genes, nucleotide-binding-leucine-rich repeat disease resistance gene clusters have undergone dramatic species/homoeologue-specific duplications and losses, with some evidence for partitioning of subfamilies between homoeologues.

  8. Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate.

    PubMed

    Adomako-Ankomah, Yaw; English, Elizabeth D; Danielson, Jeffrey J; Pernas, Lena F; Parker, Michelle L; Boulanger, Martin J; Dubey, Jitender P; Boyle, Jon P

    2016-05-01

    In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA(+) paralogs. Additionally, we found that exogenous expression of an HMA(+) paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous. Copyright © 2016 by the Genetics Society of America.

  9. Genome-wide identification and expression analysis of sulfate transporter (SULTR) genes in potato (Solanum tuberosum L.).

    PubMed

    Vatansever, Recep; Koc, Ibrahim; Ozyigit, Ibrahim Ilker; Sen, Ugur; Uras, Mehmet Emin; Anjum, Naser A; Pereira, Eduarda; Filiz, Ertugrul

    2016-12-01

    Solanum tuberosum genome analysis revealed 12 StSULTR genes encoding 18 transcripts. Among genes annotated at group level ( StSULTR I-IV), group III members formed the largest SULTRs-cluster and were potentially involved in biotic/abiotic stress responses via various regulatory factors, and stress and signaling proteins. Employing bioinformatics tools, this study performed genome-wide identification and expression analysis of SULTR (StSULTR) genes in potato (Solanum tuberosum L.). Very strict homology search and subsequent domain verification with Hidden Markov Model revealed 12 StSULTR genes encoding 18 transcripts. StSULTR genes were mapped on seven S. tuberosum chromosomes. Annotation of StSULTR genes was also done as StSULTR I-IV at group level based mainly on the phylogenetic distribution with Arabidopsis SULTRs. Several tandem and segmental duplications were identified between StSULTR genes. Among these duplications, Ka/Ks ratios indicated neutral nature of mutations that might not be causing any selection. Two segmental and one-tandem duplications were calculated to occur around 147.69, 180.80 and 191.00 million years ago (MYA), approximately corresponding to the time of monocot/dicot divergence. Two other segmental duplications were found to occur around 61.23 and 67.83 MYA, which is very close to the origination of monocotyledons. Most cis-regulatory elements in StSULTRs were found associated with major hormones (such as abscisic acid and methyl jasmonate), and defense and stress responsiveness. The cis-element distribution in duplicated gene pairs indicated the contribution of duplication events in conferring the neofunctionalization/s in StSULTR genes. Notably, RNAseq data analyses unveiled expression profiles of StSULTR genes under different stress conditions. In particular, expression profiles of StSULTR III members suggested their involvement in plant stress responses. Additionally, gene co-expression networks of these group members included various

  10. Genome-Wide Identification and Expression Analysis of NBS-Encoding Genes in Malus x domestica and Expansion of NBS Genes Family in Rosaceae

    PubMed Central

    Arya, Preeti; Kumar, Gulshan; Acharya, Vishal; Singh, Anil K.

    2014-01-01

    Nucleotide binding site leucine-rich repeats (NBS-LRR) disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR) and coiled coil (CC) (1∶1) was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR) revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple. PMID:25232838

  11. The Use of Duplication-Generating Rearrangements for Studying Heterokaryon Incompatibility Genes in Neurospora

    PubMed Central

    Perkins, David D.

    1975-01-01

    Heterokaryon (vegetative) incompatibility, governing the fusion of somatic hyphal filaments to form stable heterokaryons, is of interest because of its widespread occurrence in fungi and its bearing on cellular recognition. Conventional investigations of the genetic basis of heterokaryon incompatibility in N. crassa are difficult because in commonly used stocks differences are present at several het loci, all with similar incompatibility phenotypes. This difficulty is overcome by using duplications (partial diploids) that are unlikely to contain more than one het locus. A phenotypically expressed incompatibility reaction occurs when unlike het alleles are present within the same somatic nucleus, and this parallels the heterokaryon incompatibility reaction that occurs when unlike alleles in different haploid nuclei are introduced into the same somatic hypha by mycelial fusion.—Nontandem duplications were used to confirm that the incompatibility reactions in heterokaryons and in duplications are alternate expressions of the same genes. This was demonstrated for three loci which had previously been established by conventional heterokaryon tests—het-e, het-c and mt. These were each obtained in duplications as recombinant meiotic segregants from crosses heterozygous for duplication-generating chromosome rearrangements. The particular method of producing the duplications is irrelevant so long as the incompatibility alleles are heterozygous.—The duplication technique has made it possible to determine easily the het-e and het-c genotypes of numerous laboratory and wild strains of unknown constitution. In laboratory strains both loci are represented simply by two alleles. Analysis of het-c is more complicated in some wild strains, where differences have been demonstrated at one or more additional het loci within the duplication used and multiple allelism is also possible.—The results show that the duplication method can be used to identify and map additional

  12. Duplications and losses in gene families of rust pathogens highlight putative effectors.

    PubMed

    Pendleton, Amanda L; Smith, Katherine E; Feau, Nicolas; Martin, Francis M; Grigoriev, Igor V; Hamelin, Richard; Nelson, C Dana; Burleigh, J Gordon; Davis, John M

    2014-01-01

    Rust fungi are a group of fungal pathogens that cause some of the world's most destructive diseases of trees and crops. A shared characteristic among rust fungi is obligate biotrophy, the inability to complete a lifecycle without a host. This dependence on a host species likely affects patterns of gene expansion, contraction, and innovation within rust pathogen genomes. The establishment of disease by biotrophic pathogens is reliant upon effector proteins that are encoded in the fungal genome and secreted from the pathogen into the host's cell apoplast or within the cells. This study uses a comparative genomic approach to elucidate putative effectors and determine their evolutionary histories. We used OrthoMCL to identify nearly 20,000 gene families in proteomes of 16 diverse fungal species, which include 15 basidiomycetes and one ascomycete. We inferred patterns of duplication and loss for each gene family and identified families with distinctive patterns of expansion/contraction associated with the evolution of rust fungal genomes. To recognize potential contributors for the unique features of rust pathogens, we identified families harboring secreted proteins that: (i) arose or expanded in rust pathogens relative to other fungi, or (ii) contracted or were lost in rust fungal genomes. While the origin of rust fungi appears to be associated with considerable gene loss, there are many gene duplications associated with each sampled rust fungal genome. We also highlight two putative effector gene families that have expanded in Cqf that we hypothesize have roles in pathogenicity.

  13. Extensive horizontal gene transfer, duplication, and loss of chlorophyll synthesis genes in the algae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hunsperger, Heather M.; Randhawa, Tejinder; Cattolico, Rose Ann

    Two non-homologous, isofunctional enzymes catalyze the penultimate step of chlorophyll a synthesis in oxygenic photosynthetic organisms such as cyanobacteria, eukaryotic algae and land plants: the light independent (LIPOR) and light-dependent (POR) protochlorophyllide oxidoreductases. Whereas the distribution of these enzymes in cyanobacteria and land plants is well understood, the presence, loss, duplication, and replacement of these genes have not been surveyed in the polyphyletic and remarkably diverse eukaryotic algal lineages.

  14. Extensive horizontal gene transfer, duplication, and loss of chlorophyll synthesis genes in the algae

    DOE PAGES

    Hunsperger, Heather M.; Randhawa, Tejinder; Cattolico, Rose Ann

    2015-02-10

    Two non-homologous, isofunctional enzymes catalyze the penultimate step of chlorophyll a synthesis in oxygenic photosynthetic organisms such as cyanobacteria, eukaryotic algae and land plants: the light independent (LIPOR) and light-dependent (POR) protochlorophyllide oxidoreductases. Whereas the distribution of these enzymes in cyanobacteria and land plants is well understood, the presence, loss, duplication, and replacement of these genes have not been surveyed in the polyphyletic and remarkably diverse eukaryotic algal lineages.

  15. Intraspecific rearrangement of duplicated mitochondrial control regions in the Luzon Tarictic Hornbill Penelopides manillae (Aves: Bucerotidae).

    PubMed

    Sammler, Svenja; Ketmaier, Valerio; Havenstein, Katja; Tiedemann, Ralph

    2013-12-01

    Philippine hornbills of the genera Aceros and Penelopides (Bucerotidae) are known to possess a large tandemly duplicated fragment in their mitochondrial genome, whose paralogous parts largely evolve in concert. In the present study, we surveyed the two distinguishable duplicated control regions in several individuals of the Luzon Tarictic Hornbill Penelopides manillae, compare their characteristics within and across individuals, and report on an intraspecific mitochondrial gene rearrangement found in one single specimen, i.e., an interchange between the two control regions. To our knowledge, this is the first observation of two distinct mitochondrial genome rearrangements within a bird species. We briefly discuss a possible evolutionary mechanism responsible for this pattern, and highlight potential implications for the application of control region sequences as a marker in population genetics and phylogeography.

  16. Evolutionary history of the alpha2,8-sialyltransferase (ST8Sia) gene family: Tandem duplications in early deuterostomes explain most of the diversity found in the vertebrate ST8Sia genes

    PubMed Central

    2008-01-01

    invertebrates and vertebrates. The initial expansion and subsequent divergence of the ST8Sia genes resulted as a consequence of a series of ancient duplications and translocations in the invertebrate genome long before the emergence of vertebrates. A second subset of ST8sia genes in the vertebrate genome arose from whole genome duplication (WGD) R1 and R2. Subsequent selective ST8Sia gene loss is responsible for the characteristic ST8Sia gene expression pattern observed today in individual species. PMID:18811928

  17. Evolutionary history of the alpha2,8-sialyltransferase (ST8Sia) gene family: tandem duplications in early deuterostomes explain most of the diversity found in the vertebrate ST8Sia genes.

    PubMed

    Harduin-Lepers, Anne; Petit, Daniel; Mollicone, Rosella; Delannoy, Philippe; Petit, Jean-Michel; Oriol, Rafael

    2008-09-23

    initial expansion and subsequent divergence of the ST8Sia genes resulted as a consequence of a series of ancient duplications and translocations in the invertebrate genome long before the emergence of vertebrates. A second subset of ST8sia genes in the vertebrate genome arose from whole genome duplication (WGD) R1 and R2. Subsequent selective ST8Sia gene loss is responsible for the characteristic ST8Sia gene expression pattern observed today in individual species.

  18. Independent and parallel evolution of new genes by gene duplication in two origins of C4 photosynthesis provides new insight into the mechanism of phloem loading in C4 species

    DOE PAGES

    Emms, David M.; Covshoff, Sarah; Hibberd, Julian M.; ...

    2016-03-24

    C4 photosynthesis is considered one of the most remarkable examples of evolutionary convergence in eukaryotes. However, it is unknown whether the evolution of C4 photosynthesis required the evolution of new genes. Genome-wide gene-tree species-tree reconciliation of seven monocot species that span two origins of C4 photosynthesis revealed that there was significant parallelism in the duplication and retention of genes coincident with the evolution of C4 photosynthesis in these lineages. Specifically, 21 orthologous genes were duplicated and retained independently in parallel at both C4 origins. Analysis of this gene cohort revealed that the set of parallel duplicated and retained genes ismore » enriched for genes that are preferentially expressed in bundle sheath cells, the cell type in which photosynthesis was activated during C4 evolution. Moreover, functional analysis of the cohort of parallel duplicated genes identified SWEET-13 as a potential key transporter in the evolution of C4 photosynthesis in grasses, and provides new insight into the mechanism of phloem loading in these C4 species.« less

  19. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies.

    PubMed

    Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D

    2009-05-13

    The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins,EF-1 and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3-35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7-13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5-26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate

  20. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies

    PubMed Central

    Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D

    2009-01-01

    Background The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins,EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya). Our family-level results are congruent with recent estimates found in

  1. Gene duplications are extensive and contribute significantly to the toxic proteome of nematocysts isolated from Acropora digitifera (Cnidaria: Anthozoa: Scleractinia).

    PubMed

    Gacesa, Ranko; Chung, Ray; Dunn, Simon R; Weston, Andrew J; Jaimes-Becerra, Adrian; Marques, Antonio C; Morandini, André C; Hranueli, Daslav; Starcevic, Antonio; Ward, Malcolm; Long, Paul F

    2015-10-13

    Gene duplication followed by adaptive selection is a well-accepted process leading to toxin diversification in venoms. However, emergent genomic, transcriptomic and proteomic evidence now challenges this role to be at best equivocal to other processess . Cnidaria are arguably the most ancient phylum of the extant metazoa that are venomous and such provide a definitive ancestral anchor to examine the evolution of this trait. Here we compare predicted toxins from the translated genome of the coral Acropora digitifera to putative toxins revealed by proteomic analysis of soluble proteins discharged from nematocysts, to determine the extent to which gene duplications contribute to venom innovation in this reef-building coral species. A new bioinformatics tool called HHCompare was developed to detect potential gene duplications in the genomic data, which is made freely available ( https://github.com/rgacesa/HHCompare ). A total of 55 potential toxin encoding genes could be predicted from the A. digitifera genome, of which 36 (65 %) had likely arisen by gene duplication as evinced using the HHCompare tool and verified using two standard phylogeny methods. Surprisingly, only 22 % (12/55) of the potential toxin repertoire could be detected following rigorous proteomic analysis, for which only half (6/12) of the toxin proteome could be accounted for as peptides encoded by the gene duplicates. Biological activities of these toxins are dominatedby putative phospholipases and toxic peptidases. Gene expansions in A. digitifera venom are the most extensive yet described in any venomous animal, and gene duplication plays a significant role leading to toxin diversification in this coral species. Since such low numbers of toxins were detected in the proteome, it is unlikely that the venom is evolving rapidly by prey-driven positive natural selection. Rather we contend that the venom has a defensive role deterring predation or harm from interspecific competition and overgrowth by

  2. Regulation of Chlamydia Gene Expression by Tandem Promoters with Different Temporal Patterns.

    PubMed

    Rosario, Christopher J; Tan, Ming

    2016-01-15

    Chlamydia is a genus of pathogenic bacteria with an unusual intracellular developmental cycle marked by temporal waves of gene expression. The three main temporal groups of chlamydial genes are proposed to be controlled by separate mechanisms of transcriptional regulation. However, we have noted genes with discrepancies, such as the early gene dnaK and the midcycle genes bioY and pgk, which have promoters controlled by the late transcriptional regulators EUO and σ(28). To resolve this issue, we analyzed the promoters of these three genes in vitro and in Chlamydia trachomatis bacteria grown in cell culture. Transcripts from the σ(28)-dependent promoter of each gene were detected only at late times in the intracellular infection, bolstering the role of σ(28) RNA polymerase in late gene expression. In each case, however, expression prior to late times was due to a second promoter that was transcribed by σ(66) RNA polymerase, which is the major form of chlamydial polymerase. These results demonstrate that chlamydial genes can be transcribed from tandem promoters with different temporal profiles, leading to a composite expression pattern that differs from the expression profile of a single promoter. In addition, tandem promoters allow a gene to be regulated by multiple mechanisms of transcriptional regulation, such as DNA supercoiling or late regulation by EUO and σ(28). We discuss how tandem promoters broaden the repertoire of temporal gene expression patterns in the chlamydial developmental cycle and can be used to fine-tune the expression of specific genes. Chlamydia is a pathogenic bacterium that is responsible for the majority of infectious disease cases reported to the CDC each year. It causes an intracellular infection that is characterized by coordinated expression of chlamydial genes in temporal waves. Chlamydial transcription has been shown to be regulated by DNA supercoiling, alternative forms of RNA polymerase, and transcription factors, but the number

  3. Chromosome I duplications in Caenorhabditis elegans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McKim, K.S.; Rose, A.M.

    1990-01-01

    We have isolated and characterized 76 duplications of chromosome I in the genome of Caenorhabditis elegans. The region studied is the 20 map unit left half of the chromosome. Sixty-two duplications were induced with gamma radiation and 14 arose spontaneously. The latter class was apparently the result of spontaneous breaks within the parental duplication. The majority of duplications behave as if they are free. Three duplications are attached to identifiable sequences from other chromosomes. The duplication breakpoints have been mapped by complementation analysis relative to genes on chromosome I. Nineteen duplication breakpoints and seven deficiency breakpoints divide the left halfmore » of the chromosome into 24 regions. We have studied the relationship between duplication size and segregational stability. While size is an important determinant of mitotic stability, it is not the only one. We observed clear exceptions to a size-stability correlation. In addition to size, duplication stability may be influenced by specific sequences or chromosome structure. The majority of the duplications were stable enough to be powerful tools for gene mapping. Therefore the duplications described here will be useful in the genetic characterization of chromosome I and the techniques we have developed can be adapted to other regions of the genome.« less

  4. Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

    PubMed

    Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

    2010-10-07

    PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out

  5. FLT3 D835/I836 mutations are associated with poor disease-free survival and a distinct gene-expression signature among younger adults with de novo cytogenetically normal acute myeloid leukemia lacking FLT3 internal tandem duplications

    PubMed Central

    Ruppert, Amy S.; Radmacher, Michael D.; Mrózek, Krzysztof; Paschka, Peter; Langer, Christian; Baldus, Claudia D.; Wen, Jing; Racke, Frederick; Powell, Bayard L.; Kolitz, Jonathan E.; Larson, Richard A.; Caligiuri, Michael A.; Marcucci, Guido; Bloomfield, Clara D.

    2008-01-01

    The prognostic relevance of FLT3 D835/I836 mutations (FLT3-TKD) in cytogenetically normal acute myeloid leukemia (CN-AML) remains to be established. After excluding patients with FLT3 internal tandem duplications, we compared treatment outcome of 16 de novo CN-AML patients with FLT3-TKD with that of 123 patients with wild-type FLT3 (FLT3-WT), less than 60 years of age and similarly treated on Cancer and Leukemia Group B protocols. All FLT3-TKD+ patients and 85% of FLT3-WT patients achieved a complete remission (P = .13). Disease-free survival (DFS) of FLT3-TKD+ patients was worse than DFS of FLT3-WT patients (P = .01; estimated 3-year DFS rates, 31% vs 60%, respectively). In a multivariable analysis, FLT3-TKD was associated with worse DFS (P = .02) independent of NPM1 status and percentage of bone marrow blasts. To gain further biologic insights, a gene-expression signature differentiating FLT3-TKD+ from FLT3-WT patients was identified. The signature (333 probe sets) included overexpression of VNN1, C3AR1, PTPN6, and multiple other genes involved in monocarboxylate transport activity, and underexpression of genes involved in signal transduction regulation. These associations with outcome, other prognostic markers, and the elucidated expression signature enhance our understanding of FLT3-TKD–associated biology and may lead to development of novel therapies that improve clinical outcome of CN-AML patients with FLT3-TKD. PMID:17940205

  6. Evolution and functional divergence of NLRP genes in mammalian reproductive systems

    PubMed Central

    2009-01-01

    Background NLRPs (Nucleotide-binding oligomerization domain, Leucine rich Repeat and Pyrin domain containing Proteins) are members of NLR (Nod-like receptors) protein family. Recent researches have shown that NLRP genes play important roles in both mammalian innate immune system and reproductive system. Several of NLRP genes were shown to be specifically expressed in the oocyte in mammals. The aim of the present work was to study how these genes evolved and diverged after their duplication, as well as whether natural selection played a role during their evolution. Results By using in silico methods, we have evaluated the evolution and functional divergence of NLRP genes, in particular of mouse reproduction-related Nlrp genes. We found that (1) major NLRP genes have been duplicated before the divergence of mammals, with certain lineage-specific duplications in primates (NLRP7 and 11) and in rodents (Nlrp1, 4 and 9 duplicates); (2) tandem duplication events gave rise to a mammalian reproduction-related NLRP cluster including NLRP2, 4, 5, 7, 8, 9, 11, 13 and 14 genes; (3) the function of mammalian oocyte-specific NLRP genes (NLRP4, 5, 9 and 14) might have diverged during gene evolution; (4) recent segmental duplications concerning Nlrp4 copies and vomeronasal 1 receptor encoding genes (V1r) have been undertaken in the mouse; and (5) duplicates of Nlrp4 and 9 in the mouse might have been subjected to adaptive evolution. Conclusion In conclusion, this study brings us novel information on the evolution of mammalian reproduction-related NLRPs. On the one hand, NLRP genes duplicated and functionally diversified in mammalian reproductive systems (such as NLRP4, 5, 9 and 14). On the other hand, during evolution, different lineages adapted to develop their own NLRP genes, particularly in reproductive function (such as the specific expansion of Nlrp4 and Nlrp9 in the mouse). PMID:19682372

  7. Partners in crime: The role of tandem modules in gene transcription.

    PubMed

    Sharma, Rajal; Zhou, Ming-Ming

    2015-09-01

    Histones and their modifications play an important role in the regulation of gene transcription. Numerous modifications, such as acetylation, phosphorylation, methylation, ubiquitination, and SUMOylation, have been described. These modifications almost always co-occur and thereby increase the combinatorial complexity of post-translational modification detection. The domains that recognize these histone modifications often occur in tandem in the context of larger proteins and complexes. The presence of multiple modifications can positively or negatively regulate the binding of these tandem domains, influencing downstream cellular function. Alternatively, these tandem domains can have novel functions from their independent parts. Here we summarize structural and functional information known about major tandem domains and their histone binding properties. An understanding of these interactions is key for the development of epigenetic therapy. © 2015 The Protein Society.

  8. Sorting by Cuts, Joins, and Whole Chromosome Duplications.

    PubMed

    Zeira, Ron; Shamir, Ron

    2017-02-01

    Genome rearrangement problems have been extensively studied due to their importance in biology. Most studied models assumed a single copy per gene. However, in reality, duplicated genes are common, most notably in cancer. In this study, we make a step toward handling duplicated genes by considering a model that allows the atomic operations of cut, join, and whole chromosome duplication. Given two linear genomes, [Formula: see text] with one copy per gene and [Formula: see text] with two copies per gene, we give a linear time algorithm for computing a shortest sequence of operations transforming [Formula: see text] into [Formula: see text] such that all intermediate genomes are linear. We also show that computing an optimal sequence with fewest duplications is NP-hard.

  9. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function.

    PubMed

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estele; Bertrand, Daniel; Leonard, Sherry

    2011-10-15

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca(2+), that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [(125)I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. Copyright © 2011 Elsevier Inc. All rights reserved.

  10. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function

    PubMed Central

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estelle; Bertrand, Daniel; Leonard, Sherry

    2011-01-01

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca++, that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [125I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. PMID:21718690

  11. Independent Origin and Global Distribution of Distinct Plasmodium vivax Duffy Binding Protein Gene Duplications

    PubMed Central

    Hostetler, Jessica B.; Lo, Eugenia; Kanjee, Usheer; Amaratunga, Chanaki; Suon, Seila; Sreng, Sokunthea; Mao, Sivanna; Yewhalaw, Delenasaw; Mascarenhas, Anjali; Kwiatkowski, Dominic P.; Ferreira, Marcelo U.; Rathod, Pradipsinh K.; Yan, Guiyun; Fairhurst, Rick M.; Duraisingh, Manoj T.; Rayner, Julian C.

    2016-01-01

    Background Plasmodium vivax causes the majority of malaria episodes outside Africa, but remains a relatively understudied pathogen. The pathology of P. vivax infection depends critically on the parasite’s ability to recognize and invade human erythrocytes. This invasion process involves an interaction between P. vivax Duffy Binding Protein (PvDBP) in merozoites and the Duffy antigen receptor for chemokines (DARC) on the erythrocyte surface. Whole-genome sequencing of clinical isolates recently established that some P. vivax genomes contain two copies of the PvDBP gene. The frequency of this duplication is particularly high in Madagascar, where there is also evidence for P. vivax infection in DARC-negative individuals. The functional significance and global prevalence of this duplication, and whether there are other copy number variations at the PvDBP locus, is unknown. Methodology/Principal Findings Using whole-genome sequencing and PCR to study the PvDBP locus in P. vivax clinical isolates, we found that PvDBP duplication is widespread in Cambodia. The boundaries of the Cambodian PvDBP duplication differ from those previously identified in Madagascar, meaning that current molecular assays were unable to detect it. The Cambodian PvDBP duplication did not associate with parasite density or DARC genotype, and ranged in prevalence from 20% to 38% over four annual transmission seasons in Cambodia. This duplication was also present in P. vivax isolates from Brazil and Ethiopia, but not India. Conclusions/Significance PvDBP duplications are much more widespread and complex than previously thought, and at least two distinct duplications are circulating globally. The same duplication boundaries were identified in parasites from three continents, and were found at high prevalence in human populations where DARC-negativity is essentially absent. It is therefore unlikely that PvDBP duplication is associated with infection of DARC-negative individuals, but functional tests

  12. Neurodevelopmental disorders among individuals with duplication of 4p13 to 4p12 containing a GABAA receptor subunit gene cluster

    PubMed Central

    Polan, Michelle B; Pastore, Matthew T; Steingass, Katherine; Hashimoto, Sayaka; Thrush, Devon L; Pyatt, Robert; Reshmi, Shalini; Gastier-Foster, Julie M; Astbury, Caroline; McBride, Kim L

    2014-01-01

    Recent studies have shown that certain copy number variations (CNV) are associated with a wide range of neurodevelopmental disorders, including autism spectrum disorders (ASD), bipolar disorder and intellectual disabilities. Implicated regions and genes have comprised a variety of post synaptic complex proteins and neurotransmitter receptors, including gamma-amino butyric acid A (GABAA). Clusters of GABAA receptor subunit genes are found on chromosomes 4p12, 5q34, 6q15 and 15q11-13. Maternally inherited 15q11-13 duplications among individuals with neurodevelopmental disorders are well described, but few case reports exist for the other regions. We describe a family with a 2.42 Mb duplication at chromosome 4p13 to 4p12, identified in the index case and other family members by oligonucleotide array comparative genomic hybridization, that contains 13 genes including a cluster of four GABAA receptor subunit genes. Fluorescent in-situ hybridization was used to confirm the duplication. The duplication segregates with a variety of neurodevelopmental disorders in this family, including ASD (index case), developmental delay, dyspraxia and ADHD (brother), global developmental delays (brother), learning disabilities (mother) and bipolar disorder (maternal grandmother). In addition, we identified and describe another individual unrelated to this family, with a similar duplication, who was diagnosed with ASD, ADHD and borderline intellectual disability. The 4p13 to 4p12 duplication appears to confer a susceptibility to a variety of neurodevelopmental disorders in these two families. We hypothesize that the duplication acts through a dosage effect of GABAA receptor subunit genes, adding evidence for alterations in the GABAergic system in the etiology of neurodevelopmental disorders. PMID:23695283

  13. Pericentromeric Effects Shape the Patterns of Divergence, Retention, and Expression of Duplicated Genes in the Paleopolyploid Soybean[C][W

    PubMed Central

    Du, Jianchang; Tian, Zhixi; Sui, Yi; Zhao, Meixia; Song, Qijian; Cannon, Steven B.; Cregan, Perry; Ma, Jianxin

    2012-01-01

    The evolutionary forces that govern the divergence and retention of duplicated genes in polyploids are poorly understood. In this study, we first investigated the rates of nonsynonymous substitution (Ka) and the rates of synonymous substitution (Ks) for a nearly complete set of genes in the paleopolyploid soybean (Glycine max) by comparing the orthologs between soybean and its progenitor species Glycine soja and then compared the patterns of gene divergence and expression between pericentromeric regions and chromosomal arms in different gene categories. Our results reveal strong associations between duplication status and Ka and gene expression levels and overall low Ks and low levels of gene expression in pericentromeric regions. It is theorized that deleterious mutations can easily accumulate in recombination-suppressed regions, because of Hill-Robertson effects. Intriguingly, the genes in pericentromeric regions—the cold spots for meiotic recombination in soybean—showed significantly lower Ka and higher levels of expression than their homoeologs in chromosomal arms. This asymmetric evolution of two members of individual whole genome duplication (WGD)-derived gene pairs, echoing the biased accumulation of singletons in pericentromeric regions, suggests that distinct genomic features between the two distinct chromatin types are important determinants shaping the patterns of divergence and retention of WGD-derived genes. PMID:22227891

  14. Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

    PubMed Central

    Benslimane, A A; Dron, M; Hartmann, C; Rode, A

    1986-01-01

    Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553

  15. Functional analysis of duplicated Symbiosis Receptor Kinase (SymRK) genes during nodulation and mycorrhizal infection in soybean (Glycine max).

    PubMed

    Indrasumunar, Arief; Wilde, Julia; Hayashi, Satomi; Li, Dongxue; Gresshoff, Peter M

    2015-03-15

    Association between legumes and rhizobia results in the formation of root nodules, where symbiotic nitrogen fixation occurs. The early stages of this association involve a complex of signalling events between the host and microsymbiont. Several genes dealing with early signal transduction have been cloned, and one of them encodes the leucine-rich repeat (LRR) receptor kinase (SymRK; also termed NORK). The Symbiosis Receptor Kinase gene is required by legumes to establish a root endosymbiosis with Rhizobium bacteria as well as mycorrhizal fungi. Using degenerate primer and BAC sequencing, we cloned duplicated SymRK homeologues in soybean called GmSymRKα and GmSymRKβ. These duplicated genes have high similarity of nucleotide (96%) and amino acid sequence (95%). Sequence analysis predicted a malectin-like domain within the extracellular domain of both genes. Several putative cis-acting elements were found in promoter regions of GmSymRKα and GmSymRKβ, suggesting a participation in lateral root development, cell division and peribacteroid membrane formation. The mutant of SymRK genes is not available in soybean; therefore, to know the functions of these genes, RNA interference (RNAi) of these duplicated genes was performed. For this purpose, RNAi construct of each gene was generated and introduced into the soybean genome by Agrobacterium rhizogenes-mediated hairy root transformation. RNAi of GmSymRKβ gene resulted in an increased reduction of nodulation and mycorrhizal infection than RNAi of GmSymRKα, suggesting it has the major activity of the duplicated gene pair. The results from the important crop legume soybean confirm the joint phenotypic action of GmSymRK genes in both mycorrhizal and rhizobial infection seen in model legumes. Copyright © 2015 Elsevier GmbH. All rights reserved.

  16. Quantitative fluorescence-polymerase chain reaction assay for the detection of the duplication of the Charcot Marie Tooth disease type 1A critical region.

    PubMed

    De Toffol, Simona; Bellone, Emilia; Dulcetti, Francesca; Ruggeri, Anna Maria; Maggio, Pietro Paolo; Pulimeno, Maria Rosaria; Mandich, Paola; Maggi, Federico; Simoni, Giuseppe; Grati, Francesca Romana

    2010-04-01

    Charcot Marie Tooth (CMT) syndrome is the most common hereditary peripheral neuropathy, with an incidence of about 1 in 2500. The subtype 1A (CMT1A) is caused by a tandem duplication of a 1.5-Mb region encompassing the PMP22 gene. Conventional short tandem repeat (STR) analysis can reveal this imbalance if a triallelic pattern, defining with certainty the presence of duplication, is present. In case of duplication with a biallelic pattern, it can only indicate a semiquantitative dosage of the fluorescence intensity ratio of the two fragments. In this study we developed a quantitative fluorescence-PCR using seven highly informative STRs within the CMT1A critical region that successfully disclosed or excluded the presence of the pathogenic imbalance in a cohort of 60 samples including 40 DNAs from samples with the CMT1A duplication previously characterized with two different molecular approaches, and 20 diagnostic samples from 10 members of a five-generation pedigree segregating CMT1A, 8 unrelated cases and 2 prenatal samples. The application of the quantitative fluorescence-PCR using STRs located in the critical region could be a reliable method to evaluate the presence of the PMP22 duplication for the diagnosis and classification of hereditary neuropathies in asymptomatic subjects with a family history of inherited neuropathy, in prenatal samples in cases with one affected parent, and in unrelated patients with a sporadic demyelinating neuropathy with clinical features resembling CMT (i.e., pes cavus with hammer toes) or with conduction velocities in the range of CMT1A.

  17. High time for a roll call: gene duplication and phylogenetic relationships of TCP-like genes in monocots

    PubMed Central

    Mondragón-Palomino, Mariana; Trontin, Charlotte

    2011-01-01

    Background and Aims The TCP family is an ancient group of plant developmental transcription factors that regulate cell division in vegetative and reproductive structures and are essential in the establishment of flower zygomorphy. In-depth research on eudicot TCPs has documented their evolutionary and developmental role. This has not happened to the same extent in monocots, although zygomorphy has been critical for the diversification of Orchidaceae and Poaceae, the largest families of this group. Investigating the evolution and function of TCP-like genes in a wider group of monocots requires a detailed phylogenetic analysis of all available sequence information and a system that facilitates comparing genetic and functional information. Methods The phylogenetic relationships of TCP-like genes in monocots were investigated by analysing sequences from the genomes of Zea mays, Brachypodium distachyon, Oryza sativa and Sorghum bicolor, as well as EST data from several other monocot species. Key Results All available monocot TCP-like sequences are associated in 20 major groups with an average identity ≥64 % and most correspond to well-supported clades of the phylogeny. Their sequence motifs and relationships of orthology were documented and it was found that 67 % of the TCP-like genes of Sorghum, Oryza, Zea and Brachypodium are in microsyntenic regions. This analysis suggests that two rounds of whole genome duplication drove the expansion of TCP-like genes in these species. Conclusions A system of classification is proposed where putative or recognized monocot TCP-like genes are assigned to a specific clade of PCF-, CIN- or CYC/tb1-like genes. Specific biases in sequence data of this family that must be tackled when studying its molecular evolution and phylogeny are documented. Finally, the significant retention of duplicated TCP genes from Zea mays is considered in the context of balanced gene drive. PMID:21444336

  18. Gene duplication and fragmentation in the zebra finch major histocompatibility complex

    PubMed Central

    2010-01-01

    Background Due to its high polymorphism and importance for disease resistance, the major histocompatibility complex (MHC) has been an important focus of many vertebrate genome projects. Avian MHC organization is of particular interest because the chicken Gallus gallus, the avian species with the best characterized MHC, possesses a highly streamlined minimal essential MHC, which is linked to resistance against specific pathogens. It remains unclear the extent to which this organization describes the situation in other birds and whether it represents a derived or ancestral condition. The sequencing of the zebra finch Taeniopygia guttata genome, in combination with targeted bacterial artificial chromosome (BAC) sequencing, has allowed us to characterize an MHC from a highly divergent and diverse avian lineage, the passerines. Results The zebra finch MHC exhibits a complex structure and history involving gene duplication and fragmentation. The zebra finch MHC includes multiple Class I and Class II genes, some of which appear to be pseudogenes, and spans a much more extensive genomic region than the chicken MHC, as evidenced by the presence of MHC genes on each of seven BACs spanning 739 kb. Cytogenetic (FISH) evidence and the genome assembly itself place core MHC genes on as many as four chromosomes with TAP and Class I genes mapping to different chromosomes. MHC Class II regions are further characterized by high endogenous retroviral content. Lastly, we find strong evidence of selection acting on sites within passerine MHC Class I and Class II genes. Conclusion The zebra finch MHC differs markedly from that of the chicken, the only other bird species with a complete genome sequence. The apparent lack of synteny between TAP and the expressed MHC Class I locus is in fact reminiscent of a pattern seen in some mammalian lineages and may represent convergent evolution. Our analyses of the zebra finch MHC suggest a complex history involving chromosomal fission, gene

  19. Gene duplication and fragmentation in the zebra finch major histocompatibility complex.

    PubMed

    Balakrishnan, Christopher N; Ekblom, Robert; Völker, Martin; Westerdahl, Helena; Godinez, Ricardo; Kotkiewicz, Holly; Burt, David W; Graves, Tina; Griffin, Darren K; Warren, Wesley C; Edwards, Scott V

    2010-04-01

    Due to its high polymorphism and importance for disease resistance, the major histocompatibility complex (MHC) has been an important focus of many vertebrate genome projects. Avian MHC organization is of particular interest because the chicken Gallus gallus, the avian species with the best characterized MHC, possesses a highly streamlined minimal essential MHC, which is linked to resistance against specific pathogens. It remains unclear the extent to which this organization describes the situation in other birds and whether it represents a derived or ancestral condition. The sequencing of the zebra finch Taeniopygia guttata genome, in combination with targeted bacterial artificial chromosome (BAC) sequencing, has allowed us to characterize an MHC from a highly divergent and diverse avian lineage, the passerines. The zebra finch MHC exhibits a complex structure and history involving gene duplication and fragmentation. The zebra finch MHC includes multiple Class I and Class II genes, some of which appear to be pseudogenes, and spans a much more extensive genomic region than the chicken MHC, as evidenced by the presence of MHC genes on each of seven BACs spanning 739 kb. Cytogenetic (FISH) evidence and the genome assembly itself place core MHC genes on as many as four chromosomes with TAP and Class I genes mapping to different chromosomes. MHC Class II regions are further characterized by high endogenous retroviral content. Lastly, we find strong evidence of selection acting on sites within passerine MHC Class I and Class II genes. The zebra finch MHC differs markedly from that of the chicken, the only other bird species with a complete genome sequence. The apparent lack of synteny between TAP and the expressed MHC Class I locus is in fact reminiscent of a pattern seen in some mammalian lineages and may represent convergent evolution. Our analyses of the zebra finch MHC suggest a complex history involving chromosomal fission, gene duplication and translocation in the

  20. Duplication and amplification of antibiotic resistance genes enable increased resistance in isolates of multidrug-resistant Salmonella Typhimurium

    USDA-ARS?s Scientific Manuscript database

    During normal bacterial DNA replication, gene duplication and amplification (GDA) events occur randomly at a low frequency in the genome throughout a population. In the absence of selection, GDA events that increase the number of copies of a bacterial gene (or a set of genes) are lost. Antibiotic ...

  1. Antagonistic Roles for KNOX1 and KNOX2 Genes in Patterning the Land Plant Body Plan Following an Ancient Gene Duplication

    PubMed Central

    Furumizu, Chihiro; Alvarez, John Paul; Sakakibara, Keiko; Bowman, John L.

    2015-01-01

    Neofunctionalization following gene duplication is thought to be one of the key drivers in generating evolutionary novelty. A gene duplication in a common ancestor of land plants produced two classes of KNOTTED-like TALE homeobox genes, class I (KNOX1) and class II (KNOX2). KNOX1 genes are linked to tissue proliferation and maintenance of meristematic potentials of flowering plant and moss sporophytes, and modulation of KNOX1 activity is implicated in contributing to leaf shape diversity of flowering plants. While KNOX2 function has been shown to repress the gametophytic (haploid) developmental program during moss sporophyte (diploid) development, little is known about KNOX2 function in flowering plants, hindering syntheses regarding the relationship between two classes of KNOX genes in the context of land plant evolution. Arabidopsis plants harboring loss-of-function KNOX2 alleles exhibit impaired differentiation of all aerial organs and have highly complex leaves, phenocopying gain-of-function KNOX1 alleles. Conversely, gain-of-function KNOX2 alleles in conjunction with a presumptive heterodimeric BELL TALE homeobox partner suppressed SAM activity in Arabidopsis and reduced leaf complexity in the Arabidopsis relative Cardamine hirsuta, reminiscent of loss-of-function KNOX1 alleles. Little evidence was found indicative of epistasis or mutual repression between KNOX1 and KNOX2 genes. KNOX proteins heterodimerize with BELL TALE homeobox proteins to form functional complexes, and contrary to earlier reports based on in vitro and heterologous expression, we find high selectivity between KNOX and BELL partners in vivo. Thus, KNOX2 genes confer opposing activities rather than redundant roles with KNOX1 genes, and together they act to direct the development of all above-ground organs of the Arabidopsis sporophyte. We infer that following the KNOX1/KNOX2 gene duplication in an ancestor of land plants, neofunctionalization led to evolution of antagonistic biochemical

  2. Teleost Fish-Specific Preferential Retention of Pigmentation Gene-Containing Families After Whole Genome Duplications in Vertebrates

    PubMed Central

    Lorin, Thibault; Brunet, Frédéric G.; Laudet, Vincent; Volff, Jean-Nicolas

    2018-01-01

    Vertebrate pigmentation is a highly diverse trait mainly determined by neural crest cell derivatives. It has been suggested that two rounds (1R/2R) of whole-genome duplications (WGDs) at the basis of vertebrates allowed changes in gene regulation associated with neural crest evolution. Subsequently, the teleost fish lineage experienced other WGDs, including the teleost-specific Ts3R before teleost radiation and the more recent Ss4R at the basis of salmonids. As the teleost lineage harbors the highest number of pigment cell types and pigmentation diversity in vertebrates, WGDs might have contributed to the evolution and diversification of the pigmentation gene repertoire in teleosts. We have compared the impact of the basal vertebrate 1R/2R duplications with that of the teleost-specific Ts3R and salmonid-specific Ss4R WGDs on 181 gene families containing genes involved in pigmentation. We show that pigmentation genes (PGs) have been globally more frequently retained as duplicates than other genes after Ts3R and Ss4R but not after the early 1R/2R. This is also true for non-pigmentary paralogs of PGs, suggesting that the function in pigmentation is not the sole key driver of gene retention after WGDs. On the long-term, specific categories of PGs have been repeatedly preferentially retained after ancient 1R/2R and Ts3R WGDs, possibly linked to the molecular nature of their proteins (e.g., DNA binding transcriptional regulators) and their central position in protein-protein interaction networks. Taken together, our results support a major role of WGDs in the diversification of the pigmentation gene repertoire in the teleost lineage, with a possible link with the diversity of pigment cell lineages observed in these animals compared to other vertebrates. PMID:29599177

  3. Neofunctionalization of a duplicate hatching enzyme gene during the evolution of teleost fishes.

    PubMed

    Sano, Kaori; Kawaguchi, Mari; Watanabe, Satoshi; Yasumasu, Shigeki

    2014-10-19

    Duplication and subsequent neofunctionalization of the teleostean hatching enzyme gene occurred in the common ancestor of Euteleostei and Otocephala, producing two genes belonging to different phylogenetic clades (clade I and II). In euteleosts, the clade I enzyme inherited the activity of the ancestral enzyme of swelling the egg envelope by cleavage of the N-terminal region of egg envelope proteins. The clade II enzyme gained two specific cleavage sites, N-ZPd and mid-ZPd but lost the ancestral activity. Thus, euteleostean clade II enzymes assumed a new function; solubilization of the egg envelope by the cooperative action with clade I enzyme. However, in Otocephala, the clade II gene was lost during evolution. Consequently, in a late group of Otocephala, only the clade I enzyme is present to swell the egg envelope. We evaluated the egg envelope digestion properties of clade I and II enzymes in Gonorynchiformes, an early diverging group of Otocephala, using milkfish, and compared their digestion with those of other fishes. Finally, we propose a hypothesis of the neofunctionalization process. The milkfish clade II enzyme cleaved N-ZPd but not mid-ZPd, and did not cause solubilization of the egg envelope. We conclude that neofunctionalization is incomplete in the otocephalan clade II enzymes. Comparison of clade I and clade II enzyme characteristics implies that the specificity of the clade II enzymes gradually changed during evolution after the duplication event, and that a change in substrate was required for the addition of the mid-ZPd site and loss of activity at the N-terminal region. We infer the process of neofunctionalization of the clade II enzyme after duplication of the gene. The ancestral clade II gene gained N-ZPd cleavage activity in the common ancestral lineage of the Euteleostei and Otocephala. Subsequently, acquisition of cleavage activity at the mid-ZPd site and loss of cleavage activity in the N-terminal region occurred during the evolution of

  4. SHOX gene and conserved noncoding element deletions/duplications in Colombian patients with idiopathic short stature.

    PubMed

    Sandoval, Gloria Tatiana Vinasco; Jaimes, Giovanna Carola; Barrios, Mauricio Coll; Cespedes, Camila; Velasco, Harvy Mauricio

    2014-03-01

    SHOX gene mutations or haploinsufficiency cause a wide range of phenotypes such as Leri Weill dyschondrosteosis (LWD), Turner syndrome, and disproportionate short stature (DSS). However, this gene has also been found to be mutated in cases of idiopathic short stature (ISS) with a 3-15% frequency. In this study, the multiplex ligation-dependent probe amplification (MLPA) technique was employed to determine the frequency of SHOX gene mutations and their conserved noncoding elements (CNE) in Colombian patients with ISS. Patients were referred from different centers around the county. From a sample of 62 patients, 8.1% deletions and insertions in the intragenic regions and in the CNE were found. This result is similar to others published in other countries. Moreover, an isolated case of CNE 9 duplication and a new intron 6b deletion in another patient, associated with ISS, are described. This is one of the first studies of a Latin American population in which deletions/duplications of the SHOX gene and its CNE are examined in patients with ISS.

  5. SHOX gene and conserved noncoding element deletions/duplications in Colombian patients with idiopathic short stature

    PubMed Central

    Sandoval, Gloria Tatiana Vinasco; Jaimes, Giovanna Carola; Barrios, Mauricio Coll; Cespedes, Camila; Velasco, Harvy Mauricio

    2014-01-01

    SHOX gene mutations or haploinsufficiency cause a wide range of phenotypes such as Leri Weill dyschondrosteosis (LWD), Turner syndrome, and disproportionate short stature (DSS). However, this gene has also been found to be mutated in cases of idiopathic short stature (ISS) with a 3–15% frequency. In this study, the multiplex ligation-dependent probe amplification (MLPA) technique was employed to determine the frequency of SHOX gene mutations and their conserved noncoding elements (CNE) in Colombian patients with ISS. Patients were referred from different centers around the county. From a sample of 62 patients, 8.1% deletions and insertions in the intragenic regions and in the CNE were found. This result is similar to others published in other countries. Moreover, an isolated case of CNE 9 duplication and a new intron 6b deletion in another patient, associated with ISS, are described. This is one of the first studies of a Latin American population in which deletions/duplications of the SHOX gene and its CNE are examined in patients with ISS. PMID:24689071

  6. Association of an α-globin gene cluster duplication and heterozygous β-thalassemia in a patient with a severe thalassemia syndrome.

    PubMed

    Jiang, Hua; Liu, Sha; Zhang, Yong-Ling; Wan, Jun-Hui; Li, Ru; Li, Dong-Zhi

    2015-01-01

    We describe a new case of a β-thalassemia (β-thal) heterozygote with the mutation IVS-II-654 (C>T) presenting with a transfusion-dependent phenotype. Multiplex ligation-dependent probe amplification (MLPA) and array comparative genomic hybridization (CGH) analyses of the α-globin gene cluster revealed a full duplication of the α-globin genes including the upstream regulatory element. The duplicated allele and the normal allele in trans resulted in a total of six active α-globin genes. The severe clinical phenotype seemed to be related to the considerable excess of the α- and β-globin deficit caused by the presence of the β-thal. α-Globin cluster duplication should be considered in patients heterozygous for β-thal who show a more severe phenotype than β-thal trait.

  7. Expression, subcellular localization, and cis-regulatory structure of duplicated phytoene synthase genes in melon (Cucumis melo L.).

    PubMed

    Qin, Xiaoqiong; Coku, Ardian; Inoue, Kentaro; Tian, Li

    2011-10-01

    Carotenoids perform many critical functions in plants, animals, and humans. It is therefore important to understand carotenoid biosynthesis and its regulation in plants. Phytoene synthase (PSY) catalyzes the first committed and rate-limiting step in carotenoid biosynthesis. While PSY is present as a single copy gene in Arabidopsis, duplicated PSY genes have been identified in many economically important monocot and dicot crops. CmPSY1 was previously identified from melon (Cucumis melo L.), but was not functionally characterized. We isolated a second PSY gene, CmPSY2, from melon in this work. CmPSY2 possesses a unique intron/exon structure that has not been observed in other plant PSYs. Both CmPSY1 and CmPSY2 are functional in vitro, but exhibit distinct expression patterns in different melon tissues and during fruit development, suggesting differential regulation of the duplicated melon PSY genes. In vitro chloroplast import assays verified the plastidic localization of CmPSY1 and CmPSY2 despite the lack of an obvious plastid target peptide in CmPSY2. Promoter motif analysis of the duplicated melon and tomato PSY genes and the Arabidopsis PSY revealed distinctive cis-regulatory structures of melon PSYs and identified gibberellin-responsive motifs in all PSYs except for SlPSY1, which has not been reported previously. Overall, these data provide new insights into the evolutionary history of plant PSY genes and the regulation of PSY expression by developmental and environmental signals that may involve different regulatory networks.

  8. Genome-wide comparative analysis of NBS-encoding genes between Brassica species and Arabidopsis thaliana.

    PubMed

    Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi

    2014-01-03

    Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after split from A. thaliana. Here we present genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among 3 species suggested that orthologous genes in B. rapa species have undergone stronger negative selection than those in B .oleracea species. But for TNL type, there are no significant differences in the orthologous gene pairs between the two species. This study is first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome

  9. Evolution history of duplicated smad3 genes in teleost: insights from Japanese flounder, Paralichthys olivaceus

    PubMed Central

    Du, Xinxin; Liu, Yuezhong; Liu, Jinxiang; Zhang, Quanqi

    2016-01-01

    Following the two rounds of whole-genome duplication (WGD) during deuterosome evolution, a third genome duplication occurred in the ray-fined fish lineage and is considered to be responsible for the teleost-specific lineage diversification and regulation mechanisms. As a receptor-regulated SMAD (R-SMAD), the function of SMAD3 was widely studied in mammals. However, limited information of its role or putative paralogs is available in ray-finned fishes. In this study, two SMAD3 paralogs were first identified in the transcriptome and genome of Japanese flounder (Paralichthys olivaceus). We also explored SMAD3 duplication in other selected species. Following identification, genomic structure, phylogenetic reconstruction, and synteny analyses performed by MrBayes and online bioinformatic tools confirmed that smad3a/3b most likely originated from the teleost-specific WGD. Additionally, selection pressure analysis and expression pattern of the two genes performed by PAML and quantitative real-time PCR (qRT-PCR) revealed evidence of subfunctionalization of the two SMAD3 paralogs in teleost. Our results indicate that two SMAD3 genes originate from teleost-specific WGD, remain transcriptionally active, and may have likely undergone subfunctionalization. This study provides novel insights to the evolution fates of smad3a/3b and draws attentions to future function analysis of SMAD3 gene family. PMID:27703851

  10. The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups

    PubMed Central

    2013-01-01

    Background WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. Results We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. Conclusions In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have

  11. The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups.

    PubMed

    Yin, Guangjun; Xu, Hongliang; Xiao, Shuyang; Qin, Yajuan; Li, Yaxuan; Yan, Yueming; Hu, Yingkao

    2013-10-03

    WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species. We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean. In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have different evolutionary rates

  12. Digital gene expression analysis with sample multiplexing and PCR duplicate detection: A straightforward protocol.

    PubMed

    Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph

    2016-01-01

    Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.

  13. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster

    PubMed Central

    2012-01-01

    Background The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. Results We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. Conclusions These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster. PMID:22577841

  14. Phylogenomics of the benzoxazinoid biosynthetic pathway of Poaceae: gene duplications and origin of the Bx cluster.

    PubMed

    Dutartre, Leslie; Hilliou, Frédérique; Feyereisen, René

    2012-05-11

    The benzoxazinoids 2,4-dihydroxy-1,4-benzoxazin-3-one (DIBOA) and 2,4-dihydroxy-7- methoxy-1,4-benzoxazin-3-one (DIMBOA), are key defense compounds present in major agricultural crops such as maize and wheat. Their biosynthesis involves nine enzymes thought to form a linear pathway leading to the storage of DI(M)BOA as glucoside conjugates. Seven of the genes (Bx1-Bx6 and Bx8) form a cluster at the tip of the short arm of maize chromosome 4 that includes four P450 genes (Bx2-5) belonging to the same CYP71C subfamily. The origin of this cluster is unknown. We show that the pathway appeared following several duplications of the TSA gene (α-subunit of tryptophan synthase) and of a Bx2-like ancestral CYP71C gene and the recruitment of Bx8 before the radiation of Poaceae. The origins of Bx6 and Bx7 remain unclear. We demonstrate that the Bx2-like CYP71C ancestor was not committed to the benzoxazinoid pathway and that after duplications the Bx2-Bx5 genes were under positive selection on a few sites and underwent functional divergence, leading to the current specific biochemical properties of the enzymes. The absence of synteny between available Poaceae genomes involving the Bx gene regions is in contrast with the conserved synteny in the TSA gene region. These results demonstrate that rearrangements following duplications of an IGL/TSA gene and of a CYP71C gene probably resulted in the clustering of the new copies (Bx1 and Bx2) at the tip of a chromosome in an ancestor of grasses. Clustering favored cosegregation and tip chromosomal location favored gene rearrangements that allowed the further recruitment of genes to the pathway. These events, a founding event and elongation events, may have been the key to the subsequent evolution of the benzoxazinoid biosynthetic cluster.

  15. Segmental duplications and evolutionary acquisition of UV damage response in the SPATA31 gene family of primates and humans.

    PubMed

    Bekpen, Cemalettin; Künzel, Sven; Xie, Chen; Eaaswarkhanth, Muthukrishnan; Lin, Yen-Lung; Gokcumen, Omer; Akdis, Cezmi A; Tautz, Diethard

    2017-03-06

    Segmental duplications are an abundant source for novel gene functions and evolutionary adaptations. This mechanism of generating novelty was very active during the evolution of primates particularly in the human lineage. Here, we characterize the evolution and function of the SPATA31 gene family (former designation FAM75A), which was previously shown to be among the gene families with the strongest signal of positive selection in hominoids. The mouse homologue for this gene family is a single copy gene expressed during spermatogenesis. We show that in primates, the SPATA31 gene duplicated into SPATA31A and SPATA31C types and broadened the expression into many tissues. Each type became further segmentally duplicated in the line towards humans with the largest number of full-length copies found for SPATA31A in humans. Copy number estimates of SPATA31A based on digital PCR show an average of 7.5 with a range of 5-11 copies per diploid genome among human individuals. The primate SPATA31 genes also acquired new protein domains that suggest an involvement in UV response and DNA repair. We generated antibodies and show that the protein is re-localized from the nucleolus to the whole nucleus upon UV-irradiation suggesting a UV damage response. We used CRISPR/Cas mediated mutagenesis to knockout copies of the gene in human primary fibroblast cells. We find that cell lines with reduced functional copies as well as naturally occurring low copy number HFF cells show enhanced sensitivity towards UV-irradiation. The acquisition of new SPATA31 protein functions and its broadening of expression may be related to the evolution of the diurnal life style in primates that required a higher UV tolerance. The increased segmental duplications in hominoids as well as its fast evolution suggest the acquisition of further specific functions particularly in humans.

  16. An ace-1 gene duplication resorbs the fitness cost associated with resistance in Anopheles gambiae, the main malaria mosquito.

    PubMed

    Assogba, Benoît S; Djogbénou, Luc S; Milesi, Pascal; Berthomieu, Arnaud; Perez, Julie; Ayala, Diego; Chandre, Fabrice; Makoutodé, Michel; Labbé, Pierrick; Weill, Mylène

    2015-10-05

    Widespread resistance to pyrethroids threatens malaria control in Africa. Consequently, several countries switched to carbamates and organophophates insecticides for indoor residual spraying. However, a mutation in the ace-1 gene conferring resistance to these compounds (ace-1(R) allele), is already present. Furthermore, a duplicated allele (ace-1(D)) recently appeared; characterizing its selective advantage is mandatory to evaluate the threat. Our data revealed that a unique duplication event, pairing a susceptible and a resistant copy of the ace-1 gene spread through West Africa. Further investigations revealed that, while ace-1(D) confers less resistance than ace-1(R), the high fitness cost associated with ace-1(R) is almost completely suppressed by the duplication for all traits studied. ace-1 duplication thus represents a permanent heterozygote phenotype, selected, and thus spreading, due to the mosaic nature of mosquito control. It provides malaria mosquito with a new evolutionary path that could hamper resistance management.

  17. An ace-1 gene duplication resorbs the fitness cost associated with resistance in Anopheles gambiae, the main malaria mosquito

    PubMed Central

    Assogba, Benoît S.; Djogbénou, Luc S.; Milesi, Pascal; Berthomieu, Arnaud; Perez, Julie; Ayala, Diego; Chandre, Fabrice; Makoutodé, Michel; Labbé, Pierrick; Weill, Mylène

    2015-01-01

    Widespread resistance to pyrethroids threatens malaria control in Africa. Consequently, several countries switched to carbamates and organophophates insecticides for indoor residual spraying. However, a mutation in the ace-1 gene conferring resistance to these compounds (ace-1R allele), is already present. Furthermore, a duplicated allele (ace-1D) recently appeared; characterizing its selective advantage is mandatory to evaluate the threat. Our data revealed that a unique duplication event, pairing a susceptible and a resistant copy of the ace-1 gene spread through West Africa. Further investigations revealed that, while ace-1D confers less resistance than ace-1R, the high fitness cost associated with ace-1R is almost completely suppressed by the duplication for all traits studied. ace-1 duplication thus represents a permanent heterozygote phenotype, selected, and thus spreading, due to the mosaic nature of mosquito control. It provides malaria mosquito with a new evolutionary path that could hamper resistance management. PMID:26434951

  18. De novo interstitial tandem duplication of chromosome 4(q21-q28)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Navarro, E.G.; Ramon, F.J.H.; Jimenez, R.D.

    1996-03-29

    We describe a girl with a previously unreported de novo duplication of chromosome 4q involving segment q21-q28. Clinical manifestations included growth and psychomotor retardation, facial asymmetry, hypotelorism, epicanthic folds, mongoloid slant of palpebral fissures, apparently low-set auricles, high nasal bridge, long philtrum, small mouth, short neck, low-set thumbs, and bilateral club foot. This phenotype is compared with that of previously reported cases of duplication 4q. 12 refs., 3 figs., 1 tab.

  19. Speciation of polyploid Cyprinidae fish of common carp, crucian carp, and silver crucian carp derived from duplicated Hox genes.

    PubMed

    Yuan, Jian; He, Zhuzi; Yuan, Xiangnan; Jiang, Xiayun; Sun, Xiaowen; Zou, Shuming

    2010-09-15

    Recent studies on comparative genomics have suggested that a round of fish-specific whole genome duplication (3R) in ray-finned fishes might have occurred around 226-316 Mya. Additional genome duplication, specifically in cyprinids, may have occurred more recently after the divergence of the teleosts. The timing of this event, however, is unknown. To address this question, we sequenced four Hox genes from taxa representing the polyploid Cyprinidae fish, common carp (Cyprinus carpio, 2n=100), crucian carp (Carassius auratus auratus, 2n=100), and silver crucian carp (C. auratus gibelio, 2n=156), and then compared them with known sequences from the diploid Cyprinidae fish, blunt snout bream (Megalobrama amblycephala, 2n=48). Our results showed the presence of two distinct Hox duplicates in the genomes of common and crucian carp. Three distinct Hox sequences, one of them orthologous to a Hox gene in common carp and the other two orthologous to a Hox gene in crucian carp, were isolated in silver crucian carp, indicating a possible hybrid origin of silver crucian carp from crucian and common carp. The gene duplication resulting in the origin of the common ancestor of common and crucian carp likely occurred around 10.9-13.2 Mya. The speciations of common vs. crucian carp and silver crucian vs. crucian carp likely occurred around 8.1-11.4 and 2.3-3.0 Mya, respectively. Finally, nonfunctionalization resulting from point mutations in the coding region is a probable fate for some Hox duplicates. Taken together, these results suggested an evolutionary model for polyploidization in speciation and diversification of polyploid fish. (c) 2010 Wiley-Liss, Inc.

  20. Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate

    PubMed Central

    Dehal, Paramvir; Boore, Jeffrey L

    2005-01-01

    The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish–tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage. PMID:16128622

  1. β2-microglobulin gene duplication in cetartiodactyla remains intact only in pigs and possibly confers selective advantage to the species.

    PubMed

    Le, Thong Minh; Le, Quy Van Chanh; Truong, Dung Minh; Lee, Hye-Jeong; Choi, Min-Kyeung; Cho, Hyesun; Chung, Hak-Jae; Kim, Jin-Hoi; Do, Jeong-Tae; Song, Hyuk; Park, Chankyu

    2017-01-01

    Several β2-microglobulin (B2M) -bound protein complexes undertake key roles in various immune system pathways, including the neonatal Fc receptor (FcRn), cluster of differentiation 1 (CD1) protein, non-classical major histocompatibility complex (MHC), and well-known MHC class I molecules. Therefore, the duplication of B2M may lead to an increase in the biological competence of organisms to the environment. Based on the pig genome assembly SSC10.2, a segmental duplication of ~45.5 kb, encoding the entire B2M protein, was identified in pig chromosome 1. Through experimental validation, we confirmed the functional duplication of the B2M gene with a completely identical coding sequence between two copies in pigs. Considering the importance of B2M in the immune system, we performed the phylogenetic analysis of B2M duplication in ten mammalian species, confirming the presence of B2M duplication in cetartioldactyls, like cattle, sheep, goats, pigs and whales, but non-cetartiodactyl species, like mice, cats, dogs, horses, and humans. The density of long interspersed nuclear element (LINE) at the edges of duplicated blocks (39 to 66%) was found to be 2 to 3-fold higher than the average (20.12%) of the pig genome, suggesting its role in the duplication event. The B2M mRNA expression level in pigs was 12.71 and 7.57 times (2-ΔΔCt values) higher than humans and mice, respectively. However, we were unable to experimentally demonstrate the difference in the level of B2M protein because species specific anti-B2M antibodies are not available. We reported, for the first time, the functional duplication of the B2M gene in animals. The identification of partially remaining duplicated B2M sequences in the genomes of only cetartiodactyls indicates that the event was lineage specific. B2M duplication could be beneficial to the immune system of pigs by increasing the availability of MHC class I light chain protein, B2M, to complex with the proteins encoded by the relatively large

  2. Analysis of the 9p21.3 sequence associated with coronary artery disease reveals a tendency for duplication in a CAD patient

    PubMed Central

    Kouprina, Natalay; Noskov, Vladimir N.; Waterfall, Joshua J.; Walker, Robert L.; Meltzer, Paul S.; Topol, Eric J.; Larionov, Vladimir

    2018-01-01

    Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SDs organization within these blocks is often poorly characterized as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered as long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and normal individuals. Four tandem head-to-tail duplicons, each ∼50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats varied by 10-15 SNPs between each other and by 82 SNPs between the human genome sequence (version hg19). SNPs polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular diseases susceptibility, and maybe relevant to other cases of regional amplification, including cancer. PMID:29632643

  3. Evolutionary interplay between sister cytochrome P450 genes shapes plasticity in plant metabolism.

    PubMed

    Liu, Zhenhua; Tavares, Raquel; Forsythe, Evan S; André, François; Lugan, Raphaël; Jonasson, Gabriella; Boutet-Mercey, Stéphanie; Tohge, Takayuki; Beilstein, Mark A; Werck-Reichhart, Danièle; Renault, Hugues

    2016-10-07

    Expansion of the cytochrome P450 gene family is often proposed to have a critical role in the evolution of metabolic complexity, in particular in microorganisms, insects and plants. However, the molecular mechanisms underlying the evolution of this complexity are poorly understood. Here we describe the evolutionary history of a plant P450 retrogene, which emerged and underwent fixation in the common ancestor of Brassicales, before undergoing tandem duplication in the ancestor of Brassicaceae. Duplication leads first to gain of dual functions in one of the copies. Both sister genes are retained through subsequent speciation but eventually return to a single copy in two of three diverging lineages. In the lineage in which both copies are maintained, the ancestral functions are split between paralogs and a novel function arises in the copy under relaxed selection. Our work illustrates how retrotransposition and gene duplication can favour the emergence of novel metabolic functions.

  4. Brain evolution by brain pathway duplication

    PubMed Central

    Chakraborty, Mukta; Jarvis, Erich D.

    2015-01-01

    Understanding the mechanisms of evolution of brain pathways for complex behaviours is still in its infancy. Making further advances requires a deeper understanding of brain homologies, novelties and analogies. It also requires an understanding of how adaptive genetic modifications lead to restructuring of the brain. Recent advances in genomic and molecular biology techniques applied to brain research have provided exciting insights into how complex behaviours are shaped by selection of novel brain pathways and functions of the nervous system. Here, we review and further develop some insights to a new hypothesis on one mechanism that may contribute to nervous system evolution, in particular by brain pathway duplication. Like gene duplication, we propose that whole brain pathways can duplicate and the duplicated pathway diverge to take on new functions. We suggest that one mechanism of brain pathway duplication could be through gene duplication, although other mechanisms are possible. We focus on brain pathways for vocal learning and spoken language in song-learning birds and humans as example systems. This view presents a new framework for future research in our understanding of brain evolution and novel behavioural traits. PMID:26554045

  5. Functional diversification of B MADS-box homeotic regulators of flower development: Adaptive evolution in protein-protein interaction domains after major gene duplication events.

    PubMed

    Hernández-Hernández, Tania; Martínez-Castilla, León Patricio; Alvarez-Buylla, Elena R

    2007-02-01

    B-class MADS-box genes have been shown to be the key regulators of petal and stamen specification in several eudicot model species such as Arabidopsis thaliana, Antirrhinum majus, and Petunia hybrida. Orthologs of these genes have been found across angiosperms and gymnosperms, and it is thought that the basic regulatory function of B proteins is conserved in seed plant lineages. The evolution of B genes is characterized by numerous duplications that might represent key elements fostering the functional diversification of duplicates with a deep impact on their role in the evolution of the floral developmental program. To evaluate this, we performed a rigorous statistical analysis with B gene sequences. Using maximum likelihood and Bayesian methods, we estimated molecular substitution rates and determined the selective regimes operating at each residue of B proteins. We implemented tests that rely on phylogenetic hypotheses and codon substitution models to detect significant differences in substitution rates (DSRs) and sites under positive adaptive selection (PS) in specific lineages before and after duplication events. With these methods, we identified several protein residues fixed by PS shortly after the origin of PISTILLATA-like and APETALA3-like lineages in angiosperms and shortly after the origin of the euAP3-like lineage in core eudicots, the 2 main B gene duplications. The residues inferred to have been fixed by positive selection lie mostly within the K domain of the protein, which is key to promote heterodimerization. Additionally, we used a likelihood method that accommodates DSRs among lineages to estimate duplication dates for AP3-PI and euAP3-TM6, calibrating with data from the fossil record. The dates obtained are consistent with angiosperm origins and diversification of core eudicots. Our results strongly suggest that novel multimer formation with other MADS proteins could have been crucial for the functional divergence of B MADS-box genes. We thus

  6. A case report of two male siblings with autism and duplication of Xq13-q21, a region including three genes predisposing for autism.

    PubMed

    Wentz, Elisabet; Vujic, Mihailo; Kärrstedt, Ewa-Lotta; Erlandsson, Anna; Gillberg, Christopher

    2014-05-01

    Autism spectrum disorder, severe behaviour problems and duplication of the Xq12 to Xq13 region have recently been described in three male relatives. To describe the psychiatric comorbidity and dysmorphic features, including craniosynostosis, of two male siblings with autism and duplication of the Xq13 to Xq21 region, and attempt to narrow down the number of duplicated genes proposed to be leading to global developmental delay and autism. We performed DNA sequencing of certain exons of the TWIST1 gene, the FGFR2 gene and the FGFR3 gene. We also performed microarray analysis of the DNA. In addition to autism, the two male siblings exhibited severe learning disability, self-injurious behaviour, temper tantrums and hyperactivity, and had no communicative language. Chromosomal analyses were normal. Neither of the two siblings showed mutations of the sequenced exons known to produce craniosynostosis. The microarray analysis detected an extra copy of a region on the long arm of chromosome X, chromosome band Xq13.1-q21.1. Comparison of our two cases with previously described patients allowed us to identify three genes predisposing for autism in the duplicated chromosomal region. Sagittal craniosynostosis is also a new finding linked to the duplication.

  7. The nuclear OXPHOS genes in insecta: a common evolutionary origin, a common cis-regulatory motif, a common destiny for gene duplicates

    PubMed Central

    Porcelli, Damiano; Barsanti, Paolo; Pesole, Graziano; Caggese, Corrado

    2007-01-01

    Background When orthologous sequences from species distributed throughout an optimal range of divergence times are available, comparative genomics is a powerful tool to address problems such as the identification of the forces that shape gene structure during evolution, although the functional constraints involved may vary in different genes and lineages. Results We identified and annotated in the MitoComp2 dataset the orthologs of 68 nuclear genes controlling oxidative phosphorylation in 11 Drosophilidae species and in five non-Drosophilidae insects, and compared them with each other and with their counterparts in three vertebrates (Fugu rubripes, Danio rerio and Homo sapiens) and in the cnidarian Nematostella vectensis, taking into account conservation of gene structure and regulatory motifs, and preservation of gene paralogs in the genome. Comparative analysis indicates that the ancestral insect OXPHOS genes were intron rich and that extensive intron loss and lineage-specific intron gain occurred during evolution. Comparison with vertebrates and cnidarians also shows that many OXPHOS gene introns predate the cnidarian/Bilateria evolutionary split. The nuclear respiratory gene element (NRG) has played a key role in the evolution of the insect OXPHOS genes; it is constantly conserved in the OXPHOS orthologs of all the insect species examined, while their duplicates either completely lack the element or possess only relics of the motif. Conclusion Our observations reinforce the notion that the common ancestor of most animal phyla had intron-rich gene, and suggest that changes in the pattern of expression of the gene facilitate the fixation of duplications in the genome and the development of novel genetic functions. PMID:18315839

  8. New insights into the nutritional regulation of gluconeogenesis in carnivorous rainbow trout (Oncorhynchus mykiss): a gene duplication trail.

    PubMed

    Marandel, Lucie; Seiliez, Iban; Véron, Vincent; Skiba-Cassy, Sandrine; Panserat, Stéphane

    2015-07-01

    The rainbow trout (Oncorhynchus mykiss) is considered to be a strictly carnivorous fish species that is metabolically adapted for high catabolism of proteins and low utilization of dietary carbohydrates. This species consequently has a "glucose-intolerant" phenotype manifested by persistent hyperglycemia when fed a high-carbohydrate diet. Gluconeogenesis in adult fish is also poorly, if ever, regulated by carbohydrates, suggesting that this metabolic pathway is involved in this specific phenotype. In this study, we hypothesized that the fate of duplicated genes after the salmonid-specific 4th whole genome duplication (Ss4R) may have led to adaptive innovation and that their study might provide new elements to enhance our understanding of gluconeogenesis and poor dietary carbohydrate use in this species. Our evolutionary analysis of gluconeogenic genes revealed that pck1, pck2, fbp1a, and g6pca were retained as singletons after Ss4r, while g6pcb1, g6pcb2, and fbp1b ohnolog pairs were maintained. For all genes, duplication may have led to sub- or neofunctionalization. Expression profiles suggest that the gluconeogenesis pathway remained active in trout fed a no-carbohydrate diet. When trout were fed a high-carbohydrate diet (30%), most of the gluconeogenic genes were non- or downregulated, except for g6pbc2 ohnologs, whose RNA levels were surprisingly increased. This study demonstrates that Ss4R in trout involved adaptive innovation via gene duplication and via the outcome of the resulting ohnologs. Indeed, maintenance of ohnologous g6pcb2 pair may contribute in a significant way to the glucose-intolerant phenotype of trout and may partially explain its poor use of dietary carbohydrates. Copyright © 2015 the American Physiological Society.

  9. Generation of megabase-scale deletions, inversions and duplications involving the Contactin-6 gene in mice by CRISPR/Cas9 technology.

    PubMed

    Korablev, Alexei N; Serova, Irina A; Serov, Oleg L

    2017-12-28

    Copy Number Variation (CNV) of the human CNTN6 gene (encoding the contactin-6 protein), caused by deletions or duplications, is responsible for severe neurodevelopmental impairments, often in combination with facial dysmorphias. Conversely, deleterious point mutations of this gene do not show any clinical phenotypes. The aim of this study is to generate mice carrying large deletions, duplications and inversions involving the Cntn6 gene as a new experimental model to study CNV of the human CNTN6 locus. To generate large chromosomal rearrangements on mouse chromosome 6, we applied CRISPR/Cas9 technology in zygotes. Two guide RNAs (gRNAs) (flanking a DNA fragment of 1137 Mb) together with Cas9 mRNA and single-stranded DNA oligonucleotides (ssODN) were microinjected into the cytoplasm of 599 zygotes of F1 (C57BL x CBA) mice, and 256 of them were transplanted into oviducts of CD-1 females. As a result, we observed the birth of 41 viable F0 offspring. Genotyping of these mice was performed by PCR analysis and sequencing of PCR products. Among the 41 F0 offspring, we identified seven mice with deletions, two animals carrying duplications of the gene and four carrying inversions. Interestingly, two F0 offspring had both deletions and duplications. It is important to note that while three of seven deletion carriers showed expected sequences at the new joint sites, in another three, we identified an absence of 1-10 nucleotides at the CRISPR/Cas9 cut sites, and in one animal, 103 bp were missing, presumably due to error-prone non-homologous end joining. In addition, we detected the absence of 5 and 13 nucleotides at these sites in two F0 duplication carriers. Similar sequence changes at CRISPR/Cas9 cut sites were observed at the right and left boundaries of inversions. Thus, megabase-scale deletions, duplications and inversions were identified in 11 F0 offspring among 41 analyzed, i.e., approximately 25% efficiency. All genetically modified F0 offspring were viable and

  10. Mirror-image duplication of the primary axis and heart in Xenopus embryos by the overexpression of Msx-1 gene.

    PubMed

    Chen, Y; Solursh, M

    1995-10-01

    The Msx-1 gene (formerly known as Hox-7) is a member of a discrete subclass of homeobox-containing genes. Examination of the expression pattern of Msx-1 in murine and avian embryos suggests that this gene may be involved in the regionalization of the medio-lateral axis during earlier development. We have examined the possible functions of Xenopus Msx-1 during early Xenopus embryonic development by overexpression of the Msx-1 gene. Overexpression of Msx-1 causes a left-right mirror-image duplication of primary axial structures, including notochord, neural tube, somites, suckers, and foregut. The embryonic developing heart is also mirror-image duplicated, including looping directions and polarity. These results indicate that Msx-1 may be involved in the mesoderm formation as well as left-right patterning in the early Xenopus embryonic development.

  11. Orsomucoid: A new variant and additional duplicated ORM1 gene in Qatari population

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sebetan, I.M.; Alali, K.A.; Alzaman, A.

    1994-09-01

    A new genetically determined ORM2 variant and additional duplicated ORM1 gene were observed in Qatari population using isoelectric focusing in ultra thin layer polyacrylamide gels. The studied population samples indicate occurence of six ORM1 alleles and three ORM2 ones. A simple reliable method for separation of orsomucoid variations with comparison of different reported methods will be presented.

  12. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.).

    PubMed

    Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D

    2015-02-01

    The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.

  13. JGI Plant Genomics Gene Annotation Pipeline

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shu, Shengqiang; Rokhsar, Dan; Goodstein, David

    2014-07-14

    Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward thismore » aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.« less

  14. A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli.

    PubMed

    Li, Mingji; Wang, Junshu; Geng, Yanping; Li, Yikui; Wang, Qian; Liang, Quanfeng; Qi, Qingsheng

    2012-02-06

    For metabolic engineering, many rate-limiting steps may exist in the pathways of accumulating the target metabolites. Increasing copy number of the desired genes in these pathways is a general method to solve the problem, for example, the employment of the multi-copy plasmid-based expression system. However, this method may bring genetic instability, structural instability and metabolic burden to the host, while integrating of the desired gene into the chromosome may cause inadequate transcription or expression. In this study, we developed a strategy for obtaining gene overexpression by engineering promoter clusters consisted of multiple core-tac-promoters (MCPtacs) in tandem. Through a uniquely designed in vitro assembling process, a series of promoter clusters were constructed. The transcription strength of these promoter clusters showed a stepwise enhancement with the increase of tandem repeats number until it reached the critical value of five. Application of the MCPtacs promoter clusters in polyhydroxybutyrate (PHB) production proved that it was efficient. Integration of the phaCAB genes with the 5CPtacs promoter cluster resulted in an engineered E.coli that can accumulate 23.7% PHB of the cell dry weight in batch cultivation. The transcription strength of the MCPtacs promoter cluster can be greatly improved by increasing the tandem repeats number of the core-tac-promoter. By integrating the desired gene together with the MCPtacs promoter cluster into the chromosome of E. coli, we can achieve high and stale overexpression with only a small size. This strategy has an application potential in many fields and can be extended to other bacteria.

  15. Analysis of phylogenomic datasets reveals conflict, concordance, and gene duplications with examples from animals and plants.

    PubMed

    Smith, Stephen A; Moore, Michael J; Brown, Joseph W; Yang, Ya

    2015-08-05

    The use of transcriptomic and genomic datasets for phylogenetic reconstruction has become increasingly common as researchers attempt to resolve recalcitrant nodes with increasing amounts of data. The large size and complexity of these datasets introduce significant phylogenetic noise and conflict into subsequent analyses. The sources of conflict may include hybridization, incomplete lineage sorting, or horizontal gene transfer, and may vary across the phylogeny. For phylogenetic analysis, this noise and conflict has been accommodated in one of several ways: by binning gene regions into subsets to isolate consistent phylogenetic signal; by using gene-tree methods for reconstruction, where conflict is presumed to be explained by incomplete lineage sorting (ILS); or through concatenation, where noise is presumed to be the dominant source of conflict. The results provided herein emphasize that analysis of individual homologous gene regions can greatly improve our understanding of the underlying conflict within these datasets. Here we examined two published transcriptomic datasets, the angiosperm group Caryophyllales and the aculeate Hymenoptera, for the presence of conflict, concordance, and gene duplications in individual homologs across the phylogeny. We found significant conflict throughout the phylogeny in both datasets and in particular along the backbone. While some nodes in each phylogeny showed patterns of conflict similar to what might be expected with ILS alone, the backbone nodes also exhibited low levels of phylogenetic signal. In addition, certain nodes, especially in the Caryophyllales, had highly elevated levels of strongly supported conflict that cannot be explained by ILS alone. This study demonstrates that phylogenetic signal is highly variable in phylogenomic data sampled across related species and poses challenges when conducting species tree analyses on large genomic and transcriptomic datasets. Further insight into the conflict and processes

  16. Duplication of an upstream silencer of FZP increases grain yield in rice.

    PubMed

    Bai, Xufeng; Huang, Yong; Hu, Yong; Liu, Haiyang; Zhang, Bo; Smaczniak, Cezary; Hu, Gang; Han, Zhongmin; Xing, Yongzhong

    2017-11-01

    Transcriptional silencer and copy number variants (CNVs) are associated with gene expression. However, their roles in generating phenotypes have not been well studied. Here we identified a rice quantitative trait locus, SGDP7 (Small Grain and Dense Panicle 7). SGDP7 is identical to FZP (FRIZZY PANICLE), which represses the formation of axillary meristems. The causal mutation of SGDP7 is an 18-bp fragment, named CNV-18bp, which was inserted ~5.3 kb upstream of FZP and resulted in a tandem duplication in the cultivar Chuan 7. The CNV-18bp duplication repressed FZP expression, prolonged the panicle branching period and increased grain yield by more than 15% through substantially increasing the number of spikelets per panicle (SPP) and slightly decreasing the 1,000-grain weight (TGW). The transcription repressor OsBZR1 binds the CGTG motifs in CNV-18bp and thereby represses FZP expression, indicating that CNV-18bp is the upstream silencer of FZP. These findings showed that the silencer CNVs coordinate a trade-off between SPP and TGW by fine-tuning FZP expression, and balancing the trade-off could enhance yield potential.

  17. Directed evolution induces tributyrin hydrolysis in a virulence factor of Xylella fastidiosa using a duplicated gene as a template.

    PubMed

    Gouran, Hossein; Chakraborty, Sandeep; Rao, Basuthkar J; Asgeirsson, Bjarni; Dandekar, Abhaya

    2014-01-01

    Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction.

  18. Directed evolution induces tributyrin hydrolysis in a virulence factor of Xylella fastidiosa using a duplicated gene as a template

    PubMed Central

    Rao, Basuthkar J.; Asgeirsson, Bjarni; Dandekar, Abhaya

    2014-01-01

    Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction. PMID:25717364

  19. Detection of a large duplication mutation in the myosin-binding protein C3 gene in a case of hypertrophic cardiomyopathy.

    PubMed

    Meyer, Thomas; Pankuweit, Sabine; Richter, Anette; Maisch, Bernhard; Ruppert, Volker

    2013-09-15

    Hypertrophic cardiomyopathy (HCM) is a cardiovascular disease with autosomal dominant inheritance caused by mutations in genes coding for sarcomeric and/or regulatory proteins expressed in cardiomyocytes. In a small cohort of HCM patients (n=8), we searched for mutations in the two most common genes responsible for HCM and found four missense mutations in the MYH7 gene encoding cardiac β-myosin heavy chain (R204H, M493V, R719W, and R870H) and three mutations in the myosin-binding protein C3 gene (MYBPC3) including one missense (A848V) and two frameshift mutations (c.3713delTG and c.702ins26bp). The c.702ins26bp insertion resulted from the duplication of a 26-bp fragment in a 54-year-old female HCM patient presenting with clinical signs of heart failure due to diastolic dysfunction. Although such large duplications (>10 bp) in the MYBPC3 gene are very rare and have been identified only in 4 families reported so far, the identical duplication mutation was found earlier in a Dutch patient, demonstrating that it may constitute a hitherto unknown founder mutation in central European populations. This observation underscores the significance of insertions into the coding sequence of the MYBPC3 gene for the development and pathogenesis of HCM. © 2013 Elsevier B.V. All rights reserved.

  20. Mitochondrial Genome Sequences of Nematocera (Lower Diptera): Evidence of Rearrangement following a Complete Genome Duplication in a Winter Crane Fly

    PubMed Central

    Beckenbach, Andrew T.

    2012-01-01

    The complete mitochondrial DNA sequences of eight representatives of lower Diptera, suborder Nematocera, along with nearly complete sequences from two other species, are presented. These taxa represent eight families not previously represented by complete mitochondrial DNA sequences. Most of the sequences retain the ancestral dipteran mitochondrial gene arrangement, while one sequence, that of the midge Arachnocampa flava (family Keroplatidae), has an inversion of the trnE gene. The most unusual result is the extensive rearrangement of the mitochondrial genome of a winter crane fly, Paracladura trichoptera (family Trichocera). The pattern of rearrangement indicates that the mechanism of rearrangement involved a tandem duplication of the entire mitochondrial genome, followed by random and nonrandom loss of one copy of each gene. Another winter crane fly retains the ancestral diperan gene arrangement. A preliminary mitochondrial phylogeny of the Diptera is also presented. PMID:22155689

  1. A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors

    PubMed Central

    Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa

    2006-01-01

    Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets. PMID:16606702

  2. Topography of the Duchenne muscular dystrophy (DMD) gene: FIGE and cDNA analysis of 194 cases reveals 115 deletions and 13 duplications.

    PubMed Central

    Den Dunnen, J T; Grootscholten, P M; Bakker, E; Blonden, L A; Ginjaar, H B; Wapenaar, M C; van Paassen, H M; van Broeckhoven, C; Pearson, P L; van Ommen, G J

    1989-01-01

    We have studied 34 Becker and 160 Duchenne muscular dystrophy (DMD) patients with the dystrophin cDNA, using conventional blots and FIGE analysis. One hundred twenty-eight mutations (65%) were found, 115 deletions and 13 duplications, of which 106 deletions and 11 duplications could be precisely mapped in relation to both the mRNA and the major and minor mutation hot spots. Junction fragments, ideal markers for carrier detection, were found in 23 (17%) of the 128 cases. We identified eight new cDNA RFLPs within the DMD gene. With the use of cDNA probes we have completed the long-range map of the DMD gene, by the identification of a 680-kb SfiI fragment containing the gene's 3' end. The size of the DMD gene is now determined to be about 2.3 million basepairs. The combination of cDNA hybridizations with long-range analysis of deletion and duplication patients yields a global picture of the exon spacing within the dystrophin gene. The gene shows a large variability of intron size, ranging from only a few kilobases to 160-180 kb for the P20 intron. Images Figure 1 Figure 4 PMID:2573997

  3. F-box genes: Genome-wide expansion, evolution and their contribution to pollen growth in pear (Pyrus bretschneideri).

    PubMed

    Wang, Guo-Ming; Yin, Hao; Qiao, Xin; Tan, Xu; Gu, Chao; Wang, Bao-Hua; Cheng, Rui; Wang, Ying-Zhen; Zhang, Shao-Ling

    2016-12-01

    F-box gene family, as one of the largest gene families in plants, plays crucial roles in regulating plant development, reproduction, cellular protein degradation and responses to biotic and abiotic stresses. However, comprehensive analysis of the F-box gene family in pear (Pyrus bretschneideri Rehd.) and other Rosaceae species has not been reported yet. Herein, we identified a total of 226 full-length F-box genes in pear for the first time. And these genes were further divided into various subgroups based on specific domains and phylogenetic analysis. Intriguingly, we observed that whole-genome duplication and dispersed duplication have a major contribution to F-box family expansion. Furthermore, the dynamic evolution for different modes of gene duplication was dissected. Interestingly, we found that dispersed and tandem duplicate have been evolving at a high rate. In addition, we found that F-box genes exhibited functional specificity based on GO analysis, and most of the F-box genes were significantly enriched in the protein binding (GO: 0005515) term, supporting that F-box genes might play a critical role for gene regulation in pear. Transcriptome and digital expression profiles revealed that F-box genes are involved in the development of multiple pear tissues. Overall, these results will set stage for elaborating the biological role of F-box genes in pear and other plants. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  4. A yeast gene essential for regulation of spindle pole duplication.

    PubMed Central

    Baum, P; Yip, C; Goetsch, L; Byers, B

    1988-01-01

    In eucaryotic cells, duplication of spindle poles must be coordinated with other cell cycle functions. We report here the identification in Saccharomyces cerevisiae of a temperature-sensitive lethal mutation, esp1, that deregulates spindle pole duplication. Mutant cells transferred to the nonpermissive temperature became unable to continue DNA synthesis and cell division but displayed repeated duplication of their spindle pole bodies. Although entry into this state after transient challenge by the nonpermissive temperature was largely lethal, rare survivors were recovered and found to have become increased in ploidy. If the mutant cells were held in G0 or G1 during exposure to the elevated temperature, they remained viable and maintained normal numbers of spindle poles. These results suggest dual regulation of spindle pole duplication, including a mechanism that promotes duplication as cells enter the division cycle and a negative regulatory mechanism, controlled by ESP1, that limits duplication to a single occurrence in each cell division cycle. Tetrad analysis has revealed that ESP1 resides at a previously undescribed locus on the right arm of chromosome VII. Images PMID:3072479

  5. Tandem duplication dup(X)(q13q22) in a male proband inherited from the mother showing mosaicism of X-inactivation.

    PubMed

    Steinbach, P; Horstmann, W; Scholz, W

    1980-01-01

    An aberrant X chromosome containing extra material in the long arm was observed in a psychomotoric retarded boy and his healthy, short-statured mother. The proband showed generalized muscular hypotony, growth retardation, and somatic anomalies including hypoplastic genitalia and cryptorchism. Chromosomal banding techniques suggested a tandem duplication of the segment Xq13 leads to Xq22. In the mother the vast majority of lymphocytes showed late replication of the aberrant X chromosome. Some of her cells, however, contained an apparently active aberrant X. Both the early- and late-replicating aberrant X exhibited late replication patterns very similar to those described for normal X chromosomes in lymphocytes. Asynchrony of DNA replication among the two segments Xq13 leads to Xq22 in the dup(X) was never observed. We consider that the clinical picture of the proband is caused by an excess of active X material.

  6. Gene Duplication and Transference of Function in the paleoAP3 Lineage of Floral Organ Identity Genes

    PubMed Central

    Galimba, Kelsey D.; Martínez-Gómez, Jesús; Di Stilio, Verónica S.

    2018-01-01

    The floral organ identity gene APETALA3 (AP3) is a MADS-box transcription factor involved in stamen and petal identity that belongs to the B-class of the ABC model of flower development. Thalictrum (Ranunculaceae), an emerging model in the non-core eudicots, has AP3 homologs derived from both ancient and recent gene duplications. Prior work has shown that petals have been lost repeatedly and independently in Ranunculaceae in correlation with the loss of a specific AP3 paralog, and Thalictrum represents one of these instances. The main goal of this study was to conduct a functional analysis of the three AP3 orthologs present in Thalictrum thalictroides, representing the paleoAP3 gene lineage, to determine the degree of redundancy versus divergence after gene duplication. Because Thalictrum lacks petals, and has lost the petal-specific AP3, we also asked whether heterotopic expression of the remaining AP3 genes contributes to the partial transference of petal function to the first whorl found in insect-pollinated species. To address these questions, we undertook functional characterization by virus-induced gene silencing (VIGS), protein–protein interaction and binding site analyses. Our results illustrate partial redundancy among Thalictrum AP3s, with deep conservation of B-class function in stamen identity and a novel role in ectopic petaloidy of sepals. Certain aspects of petal function of the lost AP3 locus have apparently been transferred to the other paralogs. A novel result is that the protein products interact not only with each other, but also as homodimers. Evidence presented here also suggests that expression of the different ThtAP3 paralogs is tightly integrated, with an apparent disruption of B function homeostasis upon silencing of one of the paralogs that codes for a truncated protein. To explain this result, we propose two testable alternative scenarios: that the truncated protein is a dominant negative mutant or that there is a compensational

  7. Yeast Interspecies Comparative Proteomics Reveals Divergence in Expression Profiles and Provides Insights into Proteome Resource Allocation and Evolutionary Roles of Gene Duplication*

    PubMed Central

    Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke

    2016-01-01

    Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under

  8. A conserved segmental duplication within ELA.

    PubMed

    Brinkmeyer-Langford, C L; Murphy, W J; Childers, C P; Skow, L C

    2010-12-01

    The assembled genomic sequence of the horse major histocompatibility complex (MHC) (equine lymphocyte antigen, ELA) is very similar to the homologous human HLA, with the notable exception of a large segmental duplication at the boundary of ELA class I and class III that is absent in HLA. The segmental duplication consists of a ∼ 710 kb region of at least 11 repeated blocks: 10 blocks each contain an MHC class I-like sequence and the helicase domain portion of a BAT1-like sequence, and the remaining unit contains the full-length BAT1 gene. Similar genomic features were found in other Perissodactyls, indicating an ancient origin, which is consistent with phylogenetic analyses. Reverse-transcriptase PCR (RT-PCR) of mRNA from peripheral white blood cells of healthy and chronically or acutely infected horses detected transcription from predicted open reading frames in several of the duplicated blocks. This duplication is not present in the sequenced MHCs of most other mammals, although a similar feature at the same relative position is present in the feline MHC (FLA). Striking sequence conservation throughout Perissodactyl evolution is consistent with a functional role for at least some of the genes included within this segmental duplication. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.

  9. Cis-acting mutation and duplication: History of molecular evolution in a P450 haplotype responsible for insecticide resistance in Culex quinquefasciatus.

    PubMed

    Itokawa, Kentaro; Komagata, Osamu; Kasai, Shinji; Masada, Masahiro; Tomita, Takashi

    2011-07-01

    A cytochrome P450 gene, Cyp9m10, is more than 200-fold overexpressed in a pyrethroid resistant strain of Culex quinquefasciatus, JPal-per. The haplotype of this strain contains two copies of Cyp9m10 resulted from recent tandem duplication. In this study, we discovered and isolated a Cyp9m10 haplotype closely related to this duplicated Cyp9m10 haplotype from JHB, a strain used for the recent genome project for this mosquito species. The isolated haplotype (JHB-NIID-B haplotype) shared the same insertion of a transposable element upstream of the coding region with JPal-per strain but not duplicated. The JHB-NIID-B haplotype was considered to have diverged from the JPal-per lineage just before the duplication event. Cyp9m10 was moderately overexpressed in larvae with the JHB-NIID-B haplotype. The overexpressions in JHB-NIID-B and JPal-per haplotypes were developmentally regulated in similar pattern indicating both haplotypes share a common cis-acting mutation responsible for the overexpressions. The isolated moderately overexpressed haplotype conferred resistance, however, its efficacy was relatively small. We hypothesized that the first cis-acting mutation modified the consequence of the subsequent duplication in JPal-per lineage to confer stronger phenotypic effect than that if it occurred before the first cis-acting mutation. Copyright © 2011 Elsevier Ltd. All rights reserved.

  10. Divergent Evolutionary Patterns of NAC Transcription Factors Are Associated with Diversification and Gene Duplications in Angiosperm

    PubMed Central

    Jin, Xiaoli; Ren, Jing; Nevo, Eviatar; Yin, Xuegui; Sun, Dongfa; Peng, Junhua

    2017-01-01

    NAC (NAM/ATAF/CUC) proteins constitute one of the biggest plant-specific transcription factor (TF) families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1) uneven constitution of Clusters of Orthologous Groups (COGs) and contrasting birth/death rates among subfamilies, and (2) two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses. PMID:28713414

  11. Assessing duplication and loss of APETALA1/FRUITFULL homologs in Ranunculales

    PubMed Central

    Pabón-Mora, Natalia; Hidalgo, Oriane; Gleissberg, Stefan; Litt, Amy

    2013-01-01

    Gene duplication and loss provide raw material for evolutionary change within organismal lineages as functional diversification of gene copies provide a mechanism for phenotypic variation. Here we focus on the APETALA1/FRUITFULL MADS-box gene lineage evolution. AP1/FUL genes are angiosperm-specific and have undergone several duplications. By far the most significant one is the core-eudicot duplication resulting in the euAP1 and euFUL clades. Functional characterization of several euAP1 and euFUL genes has shown that both function in proper floral meristem identity, and axillary meristem repression. Independently, euAP1 genes function in floral meristem and sepal identity, whereas euFUL genes control phase transition, cauline leaf growth, compound leaf morphogenesis and fruit development. Significant functional variation has been detected in the function of pre-duplication basal-eudicot FUL-like genes, but the underlying mechanisms for change have not been identified. FUL-like genes in the Papaveraceae encode all functions reported for euAP1 and euFUL genes, whereas FUL-like genes in Aquilegia (Ranunculaceae) function in inflorescence development and leaf complexity, but not in flower or fruit development. Here we isolated FUL-like genes across the Ranunculales and used phylogenetic approaches to analyze their evolutionary history. We identified an early duplication resulting in the RanFL1 and RanFL2 clades. RanFL1 genes were present in all the families sampled and are mostly under strong negative selection in the MADS, I and K domains. RanFL2 genes were only identified from Eupteleaceae, Papaveraceae s.l., Menispermaceae and Ranunculaceae and show relaxed purifying selection at the I and K domains. We discuss how asymmetric sequence diversification, new motifs, differences in codon substitutions and likely protein-protein interactions resulting from this Ranunculiid-specific duplication can help explain the functional differences among basal-eudicot FUL-like genes

  12. Positive selection and ancient duplications in the evolution of class B floral homeotic genes of orchids and grasses

    PubMed Central

    Mondragón-Palomino, Mariana; Hiese, Luisa; Härter, Andrea; Koch, Marcus A; Theißen, Günter

    2009-01-01

    Background Positive selection is recognized as the prevalence of nonsynonymous over synonymous substitutions in a gene. Models of the functional evolution of duplicated genes consider neofunctionalization as key to the retention of paralogues. For instance, duplicate transcription factors are specifically retained in plant and animal genomes and both positive selection and transcriptional divergence appear to have played a role in their diversification. However, the relative impact of these two factors has not been systematically evaluated. Class B MADS-box genes, comprising DEF-like and GLO-like genes, encode developmental transcription factors essential for establishment of perianth and male organ identity in the flowers of angiosperms. Here, we contrast the role of positive selection and the known divergence in expression patterns of genes encoding class B-like MADS-box transcription factors from monocots, with emphasis on the family Orchidaceae and the order Poales. Although in the monocots these two groups are highly diverse and have a strongly canalized floral morphology, there is no information on the role of positive selection in the evolution of their distinctive flower morphologies. Published research shows that in Poales, class B-like genes are expressed in stamens and in lodicules, the perianth organs whose identity might also be specified by class B-like genes, like the identity of the inner tepals of their lily-like relatives. In orchids, however, the number and pattern of expression of class B-like genes have greatly diverged. Results The DEF-like genes from Orchidaceae form four well-supported, ancient clades of orthologues. In contrast, orchid GLO-like genes form a single clade of ancient orthologues and recent paralogues. DEF-like genes from orchid clade 2 (OMADS3-like genes) are under less stringent purifying selection than the other orchid DEF-like and GLO-like genes. In comparison with orchids, purifying selection was less stringent in DEF

  13. Recombination and evolution of duplicate control regions in the mitochondrial genome of the Asian big-headed turtle, Platysternon megacephalum.

    PubMed

    Zheng, Chenfei; Nie, Liuwang; Wang, Jue; Zhou, Huaxing; Hou, Huazhen; Wang, Hao; Liu, Juanjuan

    2013-01-01

    Complete mitochondrial (mt) genome sequences with duplicate control regions (CRs) have been detected in various animal species. In Testudines, duplicate mtCRs have been reported in the mtDNA of the Asian big-headed turtle, Platysternon megacephalum, which has three living subspecies. However, the evolutionary pattern of these CRs remains unclear. In this study, we report the completed sequences of duplicate CRs from 20 individuals belonging to three subspecies of this turtle and discuss the micro-evolutionary analysis of the evolution of duplicate CRs. Genetic distances calculated with MEGA 4.1 using the complete duplicate CR sequences revealed that within turtle subspecies, genetic distances between orthologous copies from different individuals were 0.63% for CR1 and 1.2% for CR2app:addword:respectively, and the average distance between paralogous copies of CR1 and CR2 was 4.8%. Phylogenetic relationships were reconstructed from the CR sequences, excluding the variable number of tandem repeats (VNTRs) at the 3' end using three methods: neighbor-joining, maximum likelihood algorithm, and Bayesian inference. These data show that any two CRs within individuals were more genetically distant from orthologous genes in different individuals within the same subspecies. This suggests independent evolution of the two mtCRs within each P. megacephalum subspecies. Reconstruction of separate phylogenetic trees using different CR components (TAS, CD, CSB, and VNTRs) suggested the role of recombination in the evolution of duplicate CRs. Consequently, recombination events were detected using RDP software with break points at ≈290 bp and ≈1,080 bp. Based on these results, we hypothesize that duplicate CRs in P. megacephalum originated from heterological ancestral recombination of mtDNA. Subsequent recombination could have resulted in homogenization during independent evolutionary events, thus maintaining the functions of duplicate CRs in the mtDNA of P. megacephalum.

  14. Recombination and Evolution of Duplicate Control Regions in the Mitochondrial Genome of the Asian Big-Headed Turtle, Platysternon megacephalum

    PubMed Central

    Zheng, Chenfei; Nie, Liuwang; Wang, Jue; Zhou, Huaxing; Hou, Huazhen; Wang, Hao; Liu, Juanjuan

    2013-01-01

    Complete mitochondrial (mt) genome sequences with duplicate control regions (CRs) have been detected in various animal species. In Testudines, duplicate mtCRs have been reported in the mtDNA of the Asian big-headed turtle, Platysternon megacephalum, which has three living subspecies. However, the evolutionary pattern of these CRs remains unclear. In this study, we report the completed sequences of duplicate CRs from 20 individuals belonging to three subspecies of this turtle and discuss the micro-evolutionary analysis of the evolution of duplicate CRs. Genetic distances calculated with MEGA 4.1 using the complete duplicate CR sequences revealed that within turtle subspecies, genetic distances between orthologous copies from different individuals were 0.63% for CR1 and 1.2% for CR2app:addword:respectively, and the average distance between paralogous copies of CR1 and CR2 was 4.8%. Phylogenetic relationships were reconstructed from the CR sequences, excluding the variable number of tandem repeats (VNTRs) at the 3′ end using three methods: neighbor-joining, maximum likelihood algorithm, and Bayesian inference. These data show that any two CRs within individuals were more genetically distant from orthologous genes in different individuals within the same subspecies. This suggests independent evolution of the two mtCRs within each P. megacephalum subspecies. Reconstruction of separate phylogenetic trees using different CR components (TAS, CD, CSB, and VNTRs) suggested the role of recombination in the evolution of duplicate CRs. Consequently, recombination events were detected using RDP software with break points at ≈290 bp and ≈1,080 bp. Based on these results, we hypothesize that duplicate CRs in P. megacephalum originated from heterological ancestral recombination of mtDNA. Subsequent recombination could have resulted in homogenization during independent evolutionary events, thus maintaining the functions of duplicate CRs in the mtDNA of P. megacephalum. PMID

  15. Evolutionary genomics of LysM genes in land plants.

    PubMed

    Zhang, Xue-Cheng; Cannon, Steven B; Stacey, Gary

    2009-08-03

    The ubiquitous LysM motif recognizes peptidoglycan, chitooligosaccharides (chitin) and, presumably, other structurally-related oligosaccharides. LysM-containing proteins were first shown to be involved in bacterial cell wall degradation and, more recently, were implicated in perceiving chitin (one of the established pathogen-associated molecular patterns) and lipo-chitin (nodulation factors) in flowering plants. However, the majority of LysM genes in plants remain functionally uncharacterized and the evolutionary history of complex LysM genes remains elusive. We show that LysM-containing proteins display a wide range of complex domain architectures. However, only a simple core architecture is conserved across kingdoms. Each individual kingdom appears to have evolved a distinct array of domain architectures. We show that early plant lineages acquired four characteristic architectures and progressively lost several primitive architectures. We report plant LysM phylogenies and associated gene, protein and genomic features, and infer the relative timing of duplications of LYK genes. We report a domain architecture catalogue of LysM proteins across all kingdoms. The unique pattern of LysM protein domain architectures indicates the presence of distinctive evolutionary paths in individual kingdoms. We describe a comparative and evolutionary genomics study of LysM genes in plant kingdom. One of the two groups of tandemly arrayed plant LYK genes likely resulted from an ancient genome duplication followed by local genomic rearrangement, while the origin of the other groups of tandemly arrayed LYK genes remains obscure. Given the fact that no animal LysM motif-containing genes have been functionally characterized, this study provides clues to functional characterization of plant LysM genes and is also informative with regard to evolutionary and functional studies of animal LysM genes.

  16. The Sequence and Analysis of Duplication Rich Human Chromosome 16

    DOE R&D Accomplishments Database

    Martin, Joel; Han, Cliff; Gordon, Laurie A.; Terry, Astrid; Prabhakar, Shyam; She, Xinwei; Xie, Gary; Hellsten, Uffe; Man Chan, Yee; Altherr, Michael; Couronne, Olivier; Aerts, Andrea; Bajorek, Eva; Black, Stacey; Blumer, Heather; Branscomb, Elbert; Brown, Nancy C.; Bruno, William J.; Buckingham, Judith M.; Callen, David F.; Campbell, Connie S.; Campbell, Mary L.; Campbell, Evelyn W.; Caoile, Chenier; Challacombe, Jean F.; Chasteen, Leslie A.; Chertkov, Olga; Chi, Han C.; Christensen, Mari; Clark, Lynn M.; Cohn, Judith D.; Denys, Mirian; Detter, John C.; Dickson, Mark; Dimitrijevic-Bussod, Mira; Escobar, Julio; Fawcett, Joseph J.; Flowers, Dave; Fotopulos, Dea; Glavina, Tijana; Gomez, Maria; Gonzales, Eidelyn; Goodstein, David; Goodwin, Lynne A.; Grady, Deborah L.; Grigoriev, Igor; Groza, Matthew; Hammon, Nancy; Hawkins, Trevor; Haydu, Lauren; Hildebrand, Carl E.; Huang, Wayne; Israni, Sanjay; Jett, Jamie; Jewett, Phillip E.; Kadner, Kristen; Kimball, Heather; Kobayashi, Arthur; Krawczyk, Marie-Claude; Leyba, Tina; Longmire, Jonathan L.; Lopez, Frederick; Lou, Yunian; Lowry, Steve; Ludeman, Thom; Mark, Graham A.; Mcmurray, Kimberly L.; Meincke, Linda J.; Morgan, Jenna; Moyzis, Robert K.; Mundt, Mark O.; Munk, A. Christine; Nandkeshwar, Richard D.; Pitluck, Sam; Pollard, Martin; Predki, Paul; Parson-Quintana, Beverly; Ramirez, Lucia; Rash, Sam; Retterer, James; Ricke, Darryl O.; Robinson, Donna L.; Rodriguez, Alex; Salamov, Asaf; Saunders, Elizabeth H.; Scott, Duncan; Shough, Timothy; Stallings, Raymond L.; Stalvey, Malinda; Sutherland, Robert D.; Tapia, Roxanne; Tesmer, Judith G.; Thayer, Nina; Thompson, Linda S.; Tice, Hope; Torney, David C.; Tran-Gyamfi, Mary; Tsai, Ming; Ulanovsky, Levy E.; Ustaszewska, Anna; Vo, Nu; White, P. Scott; Williams, Albert L.; Wills, Patricia L.; Wu, Jung-Rung; Wu, Kevin; Yang, Joan; DeJong, Pieter; Bruce, David; Doggett, Norman; Deaven, Larry; Schmutz, Jeremy; Grimwood, Jane; Richardson, Paul; et al.

    2004-01-01

    We report here the 78,884,754 base pairs of finished human chromosome 16 sequence, representing over 99.9 percent of its euchromatin. Manual annotation revealed 880 protein coding genes confirmed by 1,637 aligned transcripts, 19 tRNA genes, 341 pseudogenes and 3 RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukemia. Several large-scale structural polymorphisms spanning hundreds of kilobasepairs were identified and result in gene content differences across humans. One of the unique features of chromosome 16 is its high level of segmental duplication, ranked among the highest of the human autosomes. While the segmental duplications are enriched in the relatively gene poor pericentromere of the p-arm, some are involved in recent gene duplication and conversion events which are likely to have had an impact on the evolution of primates and human disease susceptibility.

  17. A 1.5-Mb cosmid contig of the CMT1A duplication/HNPP deletion critical region in 17p11.2-p12

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Murakami, Tatsufumi; Lupski, J.R.

    1996-05-15

    Charcot-Marie-Tooth disease type 1A (CMT1A) is associated with a 1.5-Mb tandem duplication in chromosome 17p11.2-p12, and hereditary neuropathy with liability to pressure palsies (HNPP) is associated with a 1.5-Mb deletion at this locus. Both diseases appear to result from an altered copy number of the peripheral myelin protein-22 gene, PMP22, which maps within the critical region. To identify additional genes and characterize chromosomal elements, a 1.5-Mb cosmid contig of the CMT1A duplication/HNPP deletion critical region was assembled using a yeast artificial chromosome (YAC)-based isolation and binning strategy. Whole YAC probes were used for screening a high-density arrayed chromosome 17-specific cosmidmore » library. Selected cosmids were spotted on dot blots and assigned to bins defined by YACs. This binning of cosmids facilitated the subsequent fingerprint analysis. The 1.5-Mb region was covered by 137 cosmids with a minimum overlap set of 52 cosmids assigned to 17 bins and 9 contigs. 20 refs., 2 figs.« less

  18. Frequent loss of lineages and deficient duplications accounted for low copy number of disease resistance genes in Cucurbitaceae

    PubMed Central

    2013-01-01

    Background The sequenced genomes of cucumber, melon and watermelon have relatively few R-genes, with 70, 75 and 55 copies only, respectively. The mechanism for low copy number of R-genes in Cucurbitaceae genomes remains unknown. Results Manual annotation of R-genes in the sequenced genomes of Cucurbitaceae species showed that approximately half of them are pseudogenes. Comparative analysis of R-genes showed frequent loss of R-gene loci in different Cucurbitaceae species. Phylogenetic analysis, data mining and PCR cloning using degenerate primers indicated that Cucurbitaceae has limited number of R-gene lineages (subfamilies). Comparison between R-genes from Cucurbitaceae and those from poplar and soybean suggested frequent loss of R-gene lineages in Cucurbitaceae. Furthermore, the average number of R-genes per lineage in Cucurbitaceae species is approximately 1/3 that in soybean or poplar. Therefore, both loss of lineages and deficient duplications in extant lineages accounted for the low copy number of R-genes in Cucurbitaceae. No extensive chimeras of R-genes were found in any of the sequenced Cucurbitaceae genomes. Nevertheless, one lineage of R-genes from Trichosanthes kirilowii, a wild Cucurbitaceae species, exhibits chimeric structures caused by gene conversions, and may contain a large number of distinct R-genes in natural populations. Conclusions Cucurbitaceae species have limited number of R-gene lineages and each genome harbors relatively few R-genes. The scarcity of R-genes in Cucurbitaceae species was due to frequent loss of R-gene lineages and infrequent duplications in extant lineages. The evolutionary mechanisms for large variation of copy number of R-genes in different plant species were discussed. PMID:23682795

  19. Analysis of LMNB1 Duplications in Autosomal Dominant Leukodystrophy Provides Insights into Duplication Mechanisms and Allele-Specific Expression

    PubMed Central

    Giorgio, Elisa; Rolyan, Harshvardhan; Kropp, Laura; Chakka, Anish Baswanth; Yatsenko, Svetlana; Gregorio, Eleonora Di; Lacerenza, Daniela; Vaula, Giovanna; Talarico, Flavia; Mandich, Paola; Toro, Camilo; Pierre, Eleonore Eymard; Labauge, Pierre; Capellari, Sabina; Cortelli, Pietro; Vairo, Filippo Pinto; Miguel, Diego; Stubbolo, Danielle; Marques, Lourenco Charles; Gahl, William; Boespflug-Tanguy, Odile; Melberg, Atle; Hassin-Baer, Sharon; Cohen, Oren S; Pjontek, Rastislav; Grau, Armin; Klopstock, Thomas; Fogel, Brent; Meijer, Inge; Rouleau, Guy; Bouchard, Jean-Pierre L; Ganapathiraju, Madhavi; Vanderver, Adeline; Dahl, Niklas; Hobson, Grace; Brusco, Alfredo; Brussino, Alessandro; Padiath, Quasar Saleem

    2013-01-01

    ABSTRACT Autosomal dominant leukodystrophy (ADLD) is an adult onset demyelinating disorder that is caused by duplications of the lamin B1 (LMNB1) gene. However, as only a few cases have been analyzed in detail, the mechanisms underlying LMNB1 duplications are unclear. We report the detailed molecular analysis of the largest collection of ADLD families studied, to date. We have identified the minimal duplicated region necessary for the disease, defined all the duplication junctions at the nucleotide level and identified the first inverted LMNB1 duplication. We have demonstrated that the duplications are not recurrent; patients with identical duplications share the same haplotype, likely inherited from a common founder and that the duplications originated from intrachromosomal events. The duplication junction sequences indicated that nonhomologous end joining or replication-based mechanisms such fork stalling and template switching or microhomology-mediated break induced repair are likely to be involved. LMNB1 expression was increased in patients’ fibroblasts both at mRNA and protein levels and the three LMNB1 alleles in ADLD patients show equal expression, suggesting that regulatory regions are maintained within the rearranged segment. These results have allowed us to elucidate duplication mechanisms and provide insights into allele-specific LMNB1 expression levels. PMID:23649844

  20. Yeast Interspecies Comparative Proteomics Reveals Divergence in Expression Profiles and Provides Insights into Proteome Resource Allocation and Evolutionary Roles of Gene Duplication.

    PubMed

    Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke

    2016-01-01

    Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under

  1. Whole-Genome Duplication and the Functional Diversification of Teleost Fish Hemoglobins

    PubMed Central

    Opazo, Juan C.; Butts, G. Tyler; Nery, Mariana F.; Storz, Jay F.; Hoffmann, Federico G.

    2013-01-01

    Subsequent to the two rounds of whole-genome duplication that occurred in the common ancestor of vertebrates, a third genome duplication occurred in the stem lineage of teleost fishes. This teleost-specific genome duplication (TGD) is thought to have provided genetic raw materials for the physiological, morphological, and behavioral diversification of this highly speciose group. The extreme physiological versatility of teleost fish is manifest in their diversity of blood–gas transport traits, which reflects the myriad solutions that have evolved to maintain tissue O2 delivery in the face of changing metabolic demands and environmental O2 availability during different ontogenetic stages. During the course of development, regulatory changes in blood–O2 transport are mediated by the expression of multiple, functionally distinct hemoglobin (Hb) isoforms that meet the particular O2-transport challenges encountered by the developing embryo or fetus (in viviparous or oviparous species) and in free-swimming larvae and adults. The main objective of the present study was to assess the relative contributions of whole-genome duplication, large-scale segmental duplication, and small-scale gene duplication in producing the extraordinary functional diversity of teleost Hbs. To accomplish this, we integrated phylogenetic reconstructions with analyses of conserved synteny to characterize the genomic organization and evolutionary history of the globin gene clusters of teleosts. These results were then integrated with available experimental data on functional properties and developmental patterns of stage-specific gene expression. Our results indicate that multiple α- and β-globin genes were present in the common ancestor of gars (order Lepisoteiformes) and teleosts. The comparative genomic analysis revealed that teleosts possess a dual set of TGD-derived globin gene clusters, each of which has undergone lineage-specific changes in gene content via repeated duplication and

  2. Screening of duplicated loci reveals hidden divergence patterns in a complex salmonid genome

    USGS Publications Warehouse

    Limborg, Morten T.; Larson, Wesley; Seeb, Lisa W.; Seeb, James E.

    2017-01-01

    A whole-genome duplication (WGD) doubles the entire genomic content of a species and is thought to have catalysed adaptive radiation in some polyploid-origin lineages. However, little is known about general consequences of a WGD because gene duplicates (i.e., paralogs) are commonly filtered in genomic studies; such filtering may remove substantial portions of the genome in data sets from polyploid-origin species. We demonstrate a new method that enables genome-wide scans for signatures of selection at both nonduplicated and duplicated loci by taking locus-specific copy number into account. We apply this method to RAD sequence data from different ecotypes of a polyploid-origin salmonid (Oncorhynchus nerka) and reveal signatures of divergent selection that would have been missed if duplicated loci were filtered. We also find conserved signatures of elevated divergence at pairs of homeologous chromosomes with residual tetrasomic inheritance, suggesting that joint evolution of some nondiverged gene duplicates may affect the adaptive potential of these genes. These findings illustrate that including duplicated loci in genomic analyses enables novel insights into the evolutionary consequences of WGDs and local segmental gene duplications.

  3. On the need for widespread horizontal gene transfers under genome size constraint.

    PubMed

    Isambert, Hervé; Stein, Richard R

    2009-08-25

    While eukaryotes primarily evolve by duplication-divergence expansion (and reduction) of their own gene repertoire with only rare horizontal gene transfers, prokaryotes appear to evolve under both gene duplications and widespread horizontal gene transfers over long evolutionary time scales. But, the evolutionary origin of this striking difference in the importance of horizontal gene transfers remains by and large a mystery. We propose that the abundance of horizontal gene transfers in free-living prokaryotes is a simple but necessary consequence of two opposite effects: i) their apparent genome size constraint compared to typical eukaryote genomes and ii) their underlying genome expansion dynamics through gene duplication-divergence evolution, as demonstrated by the presence of many tandem and block repeated genes. In principle, this combination of genome size constraint and underlying duplication expansion should lead to a coalescent-like process with extensive turnover of functional genes. This would, however, imply the unlikely, systematic reinvention of functions from discarded genes within independent phylogenetic lineages. Instead, we propose that the long-term evolutionary adaptation of free-living prokaryotes must have resulted in the emergence of efficient non-phylogenetic pathways to circumvent gene loss. This need for widespread horizontal gene transfers due to genome size constraint implies, in particular, that prokaryotes must remain under strong selection pressure in order to maintain the long-term evolutionary adaptation of their "mutualized" gene pool, beyond the inevitable turnover of individual prokaryote species. By contrast, the absence of genome size constraint for typical eukaryotes has presumably relaxed their need for widespread horizontal gene transfers and strong selection pressure. Yet, the resulting loss of genetic functions, due to weak selection pressure and inefficient gene recovery mechanisms, must have ultimately favored the

  4. Sucrose metabolism gene families and their biological functions

    PubMed Central

    Jiang, Shu-Ye; Chi, Yun-Hua; Wang, Ji-Zhou; Zhou, Jun-Xia; Cheng, Yan-Song; Zhang, Bao-Lan; Ma, Ali; Vanitha, Jeevanandam; Ramachandran, Srinivasan

    2015-01-01

    Sucrose, as the main product of photosynthesis, plays crucial roles in plant development. Although studies on general metabolism pathway were well documented, less information is available on the genome-wide identification of these genes, their expansion and evolutionary history as well as their biological functions. We focused on four sucrose metabolism related gene families including sucrose synthase, sucrose phosphate synthase, sucrose phosphate phosphatase and UDP-glucose pyrophosphorylase. These gene families exhibited different expansion and evolutionary history as their host genomes experienced differentiated rates of the whole genome duplication, tandem and segmental duplication, or mobile element mediated gene gain and loss. They were evolutionarily conserved under purifying selection among species and expression divergence played important roles for gene survival after expansion. However, we have detected recent positive selection during intra-species divergence. Overexpression of 15 sorghum genes in Arabidopsis revealed their roles in biomass accumulation, flowering time control, seed germination and response to high salinity and sugar stresses. Our studies uncovered the molecular mechanisms of gene expansion and evolution and also provided new insight into the role of positive selection in intra-species divergence. Overexpression data revealed novel biological functions of these genes in flowering time control and seed germination under normal and stress conditions. PMID:26616172

  5. Expansion of banana (Musa acuminata) gene families involved in ethylene biosynthesis and signalling after lineage-specific whole-genome duplications.

    PubMed

    Jourda, Cyril; Cardi, Céline; Mbéguié-A-Mbéguié, Didier; Bocs, Stéphanie; Garsmeur, Olivier; D'Hont, Angélique; Yahiaoui, Nabila

    2014-05-01

    Whole-genome duplications (WGDs) are widespread in plants, and three lineage-specific WGDs occurred in the banana (Musa acuminata) genome. Here, we analysed the impact of WGDs on the evolution of banana gene families involved in ethylene biosynthesis and signalling, a key pathway for banana fruit ripening. Banana ethylene pathway genes were identified using comparative genomics approaches and their duplication modes and expression profiles were analysed. Seven out of 10 banana ethylene gene families evolved through WGD and four of them (1-aminocyclopropane-1-carboxylate synthase (ACS), ethylene-insensitive 3-like (EIL), ethylene-insensitive 3-binding F-box (EBF) and ethylene response factor (ERF)) were preferentially retained. Banana orthologues of AtEIN3 and AtEIL1, two major genes for ethylene signalling in Arabidopsis, were particularly expanded. This expansion was paralleled by that of EBF genes which are responsible for control of EIL protein levels. Gene expression profiles in banana fruits suggested functional redundancy for several MaEBF and MaEIL genes derived from WGD and subfunctionalization for some of them. We propose that EIL and EBF genes were co-retained after WGD in banana to maintain balanced control of EIL protein levels and thus avoid detrimental effects of constitutive ethylene signalling. In the course of evolution, subfunctionalization was favoured to promote finer control of ethylene signalling. © 2014 CIRAD New Phytologist © 2014 New Phytologist Trust.

  6. Molecular evolution of the odorant and gustatory receptor genes in lepidopteran insects: implications for their adaptation and speciation.

    PubMed

    Engsontia, Patamarerk; Sangket, Unitsa; Chotigeat, Wilaiwan; Satasook, Chutamas

    2014-08-01

    Lepidoptera (comprised of butterflies and moths) is one of the largest groups of insects, including more than 160,000 described species. Chemoreception plays important roles in the adaptation of these species to a wide range of niches, e.g., plant hosts, egg-laying sites, and mates. This study investigated the molecular evolution of the lepidopteran odorant (Or) and gustatory receptor (Gr) genes using recently identified genes from Bombyx mori, Danaus plexippus, Heliconius melpomene, Plutella xylostella, Heliothis virescens, Manduca sexta, Cydia pomonella, and Spodoptera littoralis. A limited number of cases of large lineage-specific gene expansion are observed (except in the P. xylostella lineage), possibly due to selection against tandem gene duplication. There has been strong purifying selection during the evolution of both lepidopteran odorant and gustatory genes, as shown by the low ω values estimated through CodeML analysis, ranging from 0.0093 to 0.3926. However, purifying selection has been relaxed on some amino acid sites in these receptors, leading to sequence divergence, which is a precursor of positive selection on these sequences. Signatures of positive selection were detected only in a few loci from the lineage-specific analysis. Estimation of gene gains and losses suggests that the common ancestor of the Lepidoptera had fewer Or genes compared to extant species and an even more reduced number of Gr genes, particularly within the bitter receptor clade. Multiple gene gains and a few gene losses occurred during the evolution of Lepidoptera. Gene family expansion may be associated with the adaptation of lepidopteran species to plant hosts, especially after angiosperm radiation. Phylogenetic analysis of the moth sex pheromone receptor genes suggested that chromosomal translocations have occurred several times. New sex pheromone receptors have arisen through tandem gene duplication. Positive selection was detected at some amino acid sites predicted to be

  7. Comparative genomics of duplicate γ-glutamyl transferase genes in teleosts: medaka (Oryzias latipes), stickleback (Gasterosteus aculeatus), green spotted pufferfish (Tetraodon nigroviridis), fugu (Takifugu rubripes), and zebrafish (Danio rerio).

    PubMed

    Law, Sheran Hiu Wan; Redelings, Benjamin David; Kullman, Seth William

    2012-01-15

    The availability of multiple teleost (bony fish) genomes is providing unprecedented opportunities to understand the diversity and function of gene duplication events using comparative genomics. Here we examine multiple paralogous genes of γ-glutamyl transferase (GGT) in several distantly related teleost species including medaka, stickleback, green spotted pufferfish, fugu, and zebrafish. Through mining genome databases, we have identified multiple GGT orthologs. Duplicate (paralogous) GGT sequences for GGT1 (GGT1 a and b), GGTL1 (GGTL1 a and b), and GGTL3 (GGTL3 a and b) were identified for each species. Phylogenetic analysis suggests that GGTs are ancient proteins conserved across most metazoan phyla and those paralogous GGTs in teleosts likely arose from the serial 3R genome duplication events. A third GGTL1 gene (GGTL1c) was found in green spotted pufferfish; however, this gene is not present in medaka, stickleback, or fugu. Similarly, one or both paralogs of GGTL3 appear to have been lost in green spotted pufferfish, fugu, and zebrafish. Syntenic relationships were highly maintained between duplicated teleost chromosomes, among teleosts and across ray-finned (Actinopterygii) and lobe-finned (Sarcopterygii) species. To assess subfunction partitioning, six medaka GGT genes were cloned and assessed for developmental and tissue-specific expression. On the basis of these data, we propose a modification of the "duplication-degeneration-complementation" model of subfunction partitioning where quantitative differences rather than absolute differences in gene expression are observed between gene paralogs. Our results demonstrate that multiple GGT genes have been retained within teleost genomes. Questions remain, however, regarding the functional roles of multiple GGTs in these species. Copyright © 2011 Wiley Periodicals, Inc., A Wiley Company.

  8. Evolution of Antifreeze Protein Genes in the Diatom Genus Fragilariopsis: Evidence for Horizontal Gene Transfer, Gene Duplication and Episodic Diversifying Selection

    PubMed Central

    Sorhannus, Ulf

    2011-01-01

    Hypotheses about horizontal transfer of antifreeze protein genes to ice-living diatoms were addressed using two different statistical methods available in the program Prunier. The role of diversifying selection in driving the differentiation of a set of antifreeze protein genes in the diatom genus Fragilariopsis was also investigated. Four horizontal gene transfer events were identified. Two of these took place between two major eukaryote lineages, that is from the diatom Chaetoceros neogracile to the copepod Stephos longipes and from a basidiomycete clade to a monophyletic group, consisting of the diatom species Fragilariopsis curta and Fragilariopsis cylindrus. The remaining two events included transfers from an ascomycete lineage to the proteobacterium Stigmatella aurantiaca and from the proteobacterium Polaribacter irgensii to a group composed of 4 proteobacterium species. After the Fragilariopsis lineage acquired the antifreeze protein gene from the basidiomycetes, it duplicated and went through episodic evolution, characterized by strong positive selection acting on short segments of the branches in the tree. This selection pattern suggests that the paralogs differentiated functionally over relatively short time periods. Taken together, the results obtained here indicate that the group of antifreeze protein genes considered here have a complex evolutionary history. PMID:22253534

  9. Root of the universal tree of life based on ancient aminoacyl-tRNA synthetase gene duplications.

    PubMed

    Brown, J R; Doolittle, W F

    1995-03-28

    Universal trees based on sequences of single gene homologs cannot be rooted. Iwabe et al. [Iwabe, N., Kuma, K.-I., Hasegawa, M., Osawa, S. & Miyata, T. (1989) Proc. Natl. Acad. Sci. USA 86, 9355-9359] circumvented this problem by using ancient gene duplications that predated the last common ancestor of all living things. Their separate, reciprocally rooted gene trees for elongation factors and ATPase subunits showed Bacteria (eubacteria) as branching first from the universal tree with Archaea (archaebacteria) and Eucarya (eukaryotes) as sister groups. Given its topical importance to evolutionary biology and concerns about the appropriateness of the ATPase data set, an evaluation of the universal tree root using other ancient gene duplications is essential. In this study, we derive a rooting for the universal tree using aminoacyl-tRNA synthetase genes, an extensive multigene family whose divergence likely preceded that of prokaryotes and eukaryotes. An approximately 1600-bp conserved region was sequenced from the isoleucyl-tRNA synthetases of several species representing deep evolutionary branches of eukaryotes (Nosema locustae), Bacteria (Aquifex pyrophilus and Thermotoga maritima) and Archaea (Pyrococcus furiosus and Sulfolobus acidocaldarius). In addition, a new valyl-tRNA synthetase was characterized from the protist Trichomonas vaginalis. Different phylogenetic methods were used to generate trees of isoleucyl-tRNA synthetases rooted by valyl- and leucyl-tRNA synthetases. All isoleucyl-tRNA synthetase trees showed Archaea and Eucarya as sister groups, providing strong confirmation for the universal tree rooting reported by Iwabe et al. As well, there was strong support for the monophyly (sensu Hennig) of Archaea. The valyl-tRNA synthetase gene from Tr. vaginalis clustered with other eukaryotic ValRS genes, which may have been transferred from the mitochondrial genome to the nuclear genome, suggesting that this amitochondrial trichomonad once harbored an

  10. Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

    PubMed

    van der Ley, P

    1988-11-01

    Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.

  11. Evolution of Rubisco activase gene in plants.

    PubMed

    Nagarajan, Ragupathi; Gill, Kulvinder S

    2018-01-01

    Rubisco activase of plants evolved in a stepwise manner without losing its function to adapt to the major evolutionary events including endosymbiosis and land colonization. Rubisco activase is an essential enzyme for photosynthesis, which removes inhibitory sugar phosphates from the active sites of Rubisco, a process necessary for Rubisco activation and carbon fixation. The gene probably evolved in cyanobacteria as different species differ for its presence. However, the gene is present in all other plant species. At least a single gene copy was maintained throughout plant evolution; but various genome and gene duplication events, which occurred during plant evolution, increased its copy number in some species. The exons and exon-intron junctions of present day higher plant's Rca, which is conserved in most species seem to have evolved in charophytes. A unique tandem duplication of Rca gene occurred in a common grass ancestor, and the two genes evolved differently for gene structure, sequence, and expression pattern. At the protein level, starting with a primitive form in cyanobacteria, RCA of chlorophytes evolved by integrating chloroplast transit peptide (cTP), and N-terminal domains to the ATPase, Rubisco recognition and C-terminal domains. The redox regulated C-terminal extension (CTE) and the associated alternate splicing mechanism, which splices the RCA-α and RCA-β isoforms were probably gained from another gene in charophytes, conserved in most species except the members of Solanaceae family.

  12. Characterization of various promoter regions of the human DNA helicase-encoding genes and identification of duplicated ets (GGAA) motifs as an essential transcription regulatory element.

    PubMed

    Uchiumi, Fumiaki; Watanabe, Takeshi; Tanuma, Sei-ichi

    2010-05-15

    DNA helicases are important in the regulation of DNA transaction and thereby various cellular functions. In this study, we developed a cost-effective multiple DNA transfection assay with DEAE-dextran reagent and analyzed the promoter activities of the human DNA helicases. The 5'-flanking regions of the human DNA helicase-encoding genes were isolated and subcloned into luciferase (Luc) expression plasmids. They were coated onto 96-well plate and used for co-transfection with a renilla-Luc expression vector into various cells, and dual-Luc assays were performed. The profiles of promoter activities were dependent on cell lines used. Among these human DNA helicase genes, XPB, RecQL5, and RTEL promoters were activated during TPA-induced HL-60 cell differentiation. Interestingly, duplicated ets (GGAA) elements are commonly located around the transcription start sites of these genes. The duplicated GGAA motifs are also found in the promoters of DNA replication/repair synthesis factor genes including PARG, ATR, TERC, and Rb1. Mutation analyses suggested that the duplicated GGAA-motifs are necessary for the basal promoter activity in various cells and some of them positively respond to TPA in HL-60 cells. TPA-induced response of 44-bp in the RTEL promoter was attenuated by co-transfection of the PU.1 expression vector. These findings suggest that the duplicated ets motifs regulate DNA-repair associated gene expressions during macrophage-like differentiation of HL-60 cells. Copyright 2010 Elsevier Inc. All rights reserved.

  13. RANGER-DTL 2.0: Rigorous Reconstruction of Gene-Family Evolution by Duplication, Transfer, and Loss.

    PubMed

    Bansal, Mukul S; Kellis, Manolis; Kordi, Misagh; Kundu, Soumya

    2018-04-24

    RANGER-DTL 2.0 is a software program for inferring gene family evolution using Duplication-Transfer-Loss reconciliation. This new software is highly scalable and easy to use, and offers many new features not currently available in any other reconciliation program. RANGER-DTL 2.0 has a particular focus on reconciliation accuracy and can account for many sources of reconciliation uncertainty including uncertain gene tree rooting, gene tree topological uncertainty, multiple optimal reconciliations, and alternative event cost assignments. RANGER-DTL 2.0 is open-source and written in C ++ and Python. Pre-compiled executables, source code (open-source under GNU GPL), and a detailed manual are freely available from http://compbio.engr.uconn.edu/software/RANGER-DTL/. mukul.bansal@uconn.edu.

  14. The Evolutionary Fates of a Large Segmental Duplication in Mouse

    PubMed Central

    Morgan, Andrew P.; Holt, J. Matthew; McMullan, Rachel C.; Bell, Timothy A.; Clayshulte, Amelia M.-F.; Didion, John P.; Yadgary, Liran; Thybert, David; Odom, Duncan T.; Flicek, Paul; McMillan, Leonard; de Villena, Fernando Pardo-Manuel

    2016-01-01

    Gene duplication and loss are major sources of genetic polymorphism in populations, and are important forces shaping the evolution of genome content and organization. We have reconstructed the origin and history of a 127-kbp segmental duplication, R2d, in the house mouse (Mus musculus). R2d contains a single protein-coding gene, Cwc22. De novo assembly of both the ancestral (R2d1) and the derived (R2d2) copies reveals that they have been subject to nonallelic gene conversion events spanning tens of kilobases. R2d2 is also a hotspot for structural variation: its diploid copy number ranges from zero in the mouse reference genome to >80 in wild mice sampled from around the globe. Hemizygosity for high copy-number alleles of R2d2 is associated in cis with meiotic drive; suppression of meiotic crossovers; and copy-number instability, with a mutation rate in excess of 1 per 100 transmissions in some laboratory populations. Our results provide a striking example of allelic diversity generated by duplication and demonstrate the value of de novo assembly in a phylogenetic context for understanding the mutational processes affecting duplicate genes. PMID:27371833

  15. Long-read sequencing uncovers the adaptive topography of a carnivorous plant genome

    PubMed Central

    Lan, Tianying; Renner, Tanya; Ibarra-Laclette, Enrique; Farr, Kimberly M.; Chang, Tien-Hao; Cervantes-Pérez, Sergio Alan; Zheng, Chunfang; Sankoff, David; Tang, Haibao; Purbojati, Rikky W.; Putra, Alexander; Drautz-Moses, Daniela I.; Schuster, Stephan C.; Herrera-Estrella, Luis; Albert, Victor A.

    2017-01-01

    Utricularia gibba, the humped bladderwort, is a carnivorous plant that retains a tiny nuclear genome despite at least two rounds of whole genome duplication (WGD) since common ancestry with grapevine and other species. We used a third-generation genome assembly with several complete chromosomes to reconstruct the two most recent lineage-specific ancestral genomes that led to the modern U. gibba genome structure. Patterns of subgenome dominance in the most recent WGD, both architectural and transcriptional, are suggestive of allopolyploidization, which may have generated genomic novelty and led to instantaneous speciation. Syntenic duplicates retained in polyploid blocks are enriched for transcription factor functions, whereas gene copies derived from ongoing tandem duplication events are enriched in metabolic functions potentially important for a carnivorous plant. Among these are tandem arrays of cysteine protease genes with trap-specific expression that evolved within a protein family known to be useful in the digestion of animal prey. Further enriched functions among tandem duplicates (also with trap-enhanced expression) include peptide transport (intercellular movement of broken-down prey proteins), ATPase activities (bladder-trap acidification and transmembrane nutrient transport), hydrolase and chitinase activities (breakdown of prey polysaccharides), and cell-wall dynamic components possibly associated with active bladder movements. Whereas independently polyploid Arabidopsis syntenic gene duplicates are similarly enriched for transcriptional regulatory activities, Arabidopsis tandems are distinct from those of U. gibba, while still metabolic and likely reflecting unique adaptations of that species. Taken together, these findings highlight the special importance of tandem duplications in the adaptive landscapes of a carnivorous plant genome. PMID:28507139

  16. Identification and genetic effect of a variable duplication in the promoter region of the cattle ADIPOQ gene

    USDA-ARS?s Scientific Manuscript database

    The ADIPOQ gene of cattle, is located in the vicinity of the quantitative trait locus (QTL) wich effects marbling, the rib eye muscle area and fat thickness on BTA1. In our study, a novel variable duplication (NW_003103812.1:g.9232067_9232133 dup) in the bovine ADIPOQ promoter region was identified ...

  17. A 21 Nucleotide Duplication on the α1- and α2-Globin Genes Involves a Variety of Hypochromic Microcytic Anemias, From Mild to Hb H Disease.

    PubMed

    Farashi, Samaneh; Faramarzi Garous, Negin; Zeinali, Fatemeh; Vakili, Shadi; Ashki, Mehri; Imanian, Hashem; Najmabadi, Hossein; Azarkeivan, Azita; Tamaddoni, Ahmad

    2015-01-01

    α-Thalassemia (α-thal) is a common genetic disorder in Iran and many parts of the world. Genetic defects in the α-globin gene cluster can result in α-thal that may develop into a clinical phenotype varying from almost asymptomatic to a lethal hemolytic anemia. Loss of one functional α gene, indicated as heterozygous α(+)-thal, shows minor hematological abnormalities. Homozygosity for α(+)- or heterozygosity for α(0)-thal have more severe hematological abnormalities due to a markedly reduced α chain output. At the molecular level, the absence of three α-globin genes resulting from the compound heterozygous state for α(0)- and α(+)-thal, lead to Hb H disease. Here we present a 21 nucleotide (nt) duplication consisting of six amino acids and 3 bp of intronic sequence at the exon-intron boundary, in both the α-globin genes, detected by direct DNA sequencing. This duplication was identified in three patients originating from two different Iranian ethnic groups and one Arab during more than 12 years. The clinical presentation of these individuals varies widely from a mild asymptomatic anemia (heterozygote in α1-globin gene) to a severely anemic state, diagnosed as an Hb H individual requiring blood transfusion (duplication on the α2-globin gene in combination with the - -(MED) double α-globin gene deletion). The third individual, who was homozygous for this nt duplication on the α1-globin gene, showed severe hypochromic microcytic anemia and splenomegaly. In the last decade, numerous α-globin mutations have demonstrated the necessity of prenatal diagnosis (PND) for α-thal, and this study has contributed another mutation as important enough that needs to be considered.

  18. Are duplicated genes responsible for anthracnose resistance in common bean?

    PubMed

    Costa, Larissa Carvalho; Nalin, Rafael Storto; Ramalho, Magno Antonio Patto; de Souza, Elaine Aparecida

    2017-01-01

    The race 65 of Colletotrichum lindemuthianum, etiologic agent of anthracnose in common bean, is distributed worldwide, having great importance in breeding programs for anthracnose resistance. Several resistance alleles have been identified promoting resistance to this race. However, the variability that has been detected within race has made it difficult to obtain cultivars with durable resistance, because cultivars may have different reactions to each strain of race 65. Thus, this work aimed at studying the resistance inheritance of common bean lines to different strains of C. lindemuthianum, race 65. We used six C. lindemuthianum strains previously characterized as belonging to the race 65 through the international set of differential cultivars of anthracnose and nine commercial cultivars, adapted to the Brazilian growing conditions and with potential ability to discriminate the variability within this race. To obtain information on the resistance inheritance related to nine commercial cultivars to six strains of race 65, these cultivars were crossed two by two in all possible combinations, resulting in 36 hybrids. Segregation in the F2 generations revealed that the resistance to each strain is conditioned by two independent genes with the same function, suggesting that they are duplicated genes, where the dominant allele promotes resistance. These results indicate that the specificity between host resistance genes and pathogen avirulence genes is not limited to races, it also occurs within strains of the same race. Further research may be carried out in order to establish if the alleles identified in these cultivars are different from those described in the literature.

  19. Are duplicated genes responsible for anthracnose resistance in common bean?

    PubMed Central

    2017-01-01

    The race 65 of Colletotrichum lindemuthianum, etiologic agent of anthracnose in common bean, is distributed worldwide, having great importance in breeding programs for anthracnose resistance. Several resistance alleles have been identified promoting resistance to this race. However, the variability that has been detected within race has made it difficult to obtain cultivars with durable resistance, because cultivars may have different reactions to each strain of race 65. Thus, this work aimed at studying the resistance inheritance of common bean lines to different strains of C. lindemuthianum, race 65. We used six C. lindemuthianum strains previously characterized as belonging to the race 65 through the international set of differential cultivars of anthracnose and nine commercial cultivars, adapted to the Brazilian growing conditions and with potential ability to discriminate the variability within this race. To obtain information on the resistance inheritance related to nine commercial cultivars to six strains of race 65, these cultivars were crossed two by two in all possible combinations, resulting in 36 hybrids. Segregation in the F2 generations revealed that the resistance to each strain is conditioned by two independent genes with the same function, suggesting that they are duplicated genes, where the dominant allele promotes resistance. These results indicate that the specificity between host resistance genes and pathogen avirulence genes is not limited to races, it also occurs within strains of the same race. Further research may be carried out in order to establish if the alleles identified in these cultivars are different from those described in the literature. PMID:28296933

  20. Duplication within the SEPT9 gene associated with a founder effect in North American families with hereditary neuralgic amyotrophy

    PubMed Central

    Landsverk, Megan L.; Ruzzo, Elizabeth K.; Mefford, Heather C.; Buysse, Karen; Buchan, Jillian G.; Eichler, Evan E.; Petty, Elizabeth M.; Peterson, Esther A.; Knutzen, Dana M.; Barnett, Karen; Farlow, Martin R.; Caress, Judy; Parry, Gareth J.; Quan, Dianna; Gardner, Kathy L.; Hong, Ming; Simmons, Zachary; Bird, Thomas D.; Chance, Phillip F.; Hannibal, Mark C.

    2009-01-01

    Hereditary neuralgic amyotrophy (HNA) is an autosomal dominant disorder associated with recurrent episodes of focal neuropathy primarily affecting the brachial plexus. Point mutations in the SEPT9 gene have been previously identified as the molecular basis of HNA in some pedigrees. However in many families, including those from North America demonstrating a genetic founder haplotype, no sequence mutations have been detected. We report an intragenic 38 Kb SEPT9 duplication that is linked to HNA in 12 North American families that share the common founder haplotype. Analysis of the breakpoints showed that the duplication is identical in all pedigrees, and molecular analysis revealed that the duplication includes the 645 bp exon in which previous HNA mutations were found. The SEPT9 transcript variants that span this duplication contain two in-frame repeats of this exon, and immunoblotting demonstrates larger molecular weight SEPT9 protein isoforms. This exon also encodes for a majority of the SEPT9 N-terminal proline rich region suggesting that this region plays a role in the pathogenesis of HNA. PMID:19139049

  1. Duplication within the SEPT9 gene associated with a founder effect in North American families with hereditary neuralgic amyotrophy.

    PubMed

    Landsverk, Megan L; Ruzzo, Elizabeth K; Mefford, Heather C; Buysse, Karen; Buchan, Jillian G; Eichler, Evan E; Petty, Elizabeth M; Peterson, Esther A; Knutzen, Dana M; Barnett, Karen; Farlow, Martin R; Caress, Judy; Parry, Gareth J; Quan, Dianna; Gardner, Kathy L; Hong, Ming; Simmons, Zachary; Bird, Thomas D; Chance, Phillip F; Hannibal, Mark C

    2009-04-01

    Hereditary neuralgic amyotrophy (HNA) is an autosomal dominant disorder associated with recurrent episodes of focal neuropathy primarily affecting the brachial plexus. Point mutations in the SEPT9 gene have been previously identified as the molecular basis of HNA in some pedigrees. However in many families, including those from North America demonstrating a genetic founder haplotype, no sequence mutations have been detected. We report an intragenic 38 Kb SEPT9 duplication that is linked to HNA in 12 North American families that share the common founder haplotype. Analysis of the breakpoints showed that the duplication is identical in all pedigrees, and molecular analysis revealed that the duplication includes the 645 bp exon in which previous HNA mutations were found. The SEPT9 transcript variants that span this duplication contain two in-frame repeats of this exon, and immunoblotting demonstrates larger molecular weight SEPT9 protein isoforms. This exon also encodes for a majority of the SEPT9 N-terminal proline rich region suggesting that this region plays a role in the pathogenesis of HNA.

  2. Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database.

    PubMed

    Pasquier, Jeremy; Cabau, Cédric; Nguyen, Thaovi; Jouanno, Elodie; Severac, Dany; Braasch, Ingo; Journot, Laurent; Pontarotti, Pierre; Klopp, Christophe; Postlethwait, John H; Guiguen, Yann; Bobe, Julien

    2016-05-18

    With more than 30,000 species, ray-finned fish represent approximately half of vertebrates. The evolution of ray-finned fish was impacted by several whole genome duplication (WGD) events including a teleost-specific WGD event (TGD) that occurred at the root of the teleost lineage about 350 million years ago (Mya) and more recent WGD events in salmonids, carps, suckers and others. In plants and animals, WGD events are associated with adaptive radiations and evolutionary innovations. WGD-spurred innovation may be especially relevant in the case of teleost fish, which colonized a wide diversity of habitats on earth, including many extreme environments. Fish biodiversity, the use of fish models for human medicine and ecological studies, and the importance of fish in human nutrition, fuel an important need for the characterization of gene expression repertoires and corresponding evolutionary histories of ray-finned fish genes. To this aim, we performed transcriptome analyses and developed the PhyloFish database to provide (i) de novo assembled gene repertoires in 23 different ray-finned fish species including two holosteans (i.e. a group that diverged from teleosts before TGD) and 21 teleosts (including six salmonids), and (ii) gene expression levels in ten different tissues and organs (and embryos for many) in the same species. This resource was generated using a common deep RNA sequencing protocol to obtain the most exhaustive gene repertoire possible in each species that allows between-species comparisons to study the evolution of gene expression in different lineages. The PhyloFish database described here can be accessed and searched using RNAbrowse, a simple and efficient solution to give access to RNA-seq de novo assembled transcripts.

  3. Asymmetric histone modifications between the original and derived loci of human segmental duplications

    PubMed Central

    Zheng, Deyou

    2008-01-01

    Background Sequencing and annotation of several mammalian genomes have revealed that segmental duplications are a common architectural feature of primate genomes; in fact, about 5% of the human genome is composed of large blocks of interspersed segmental duplications. These segmental duplications have been implicated in genomic copy-number variation, gene novelty, and various genomic disorders. However, the molecular processes involved in the evolution and regulation of duplicated sequences remain largely unexplored. Results In this study, the profile of about 20 histone modifications within human segmental duplications was characterized using high-resolution, genome-wide data derived from a ChIP-Seq study. The analysis demonstrates that derivative loci of segmental duplications often differ significantly from the original with respect to many histone methylations. Further investigation showed that genes are present three times more frequently in the original than in the derivative, whereas pseudogenes exhibit the opposite trend. These asymmetries tend to increase with the age of segmental duplications. The uneven distribution of genes and pseudogenes does not, however, fully account for the asymmetry in the profile of histone modifications. Conclusion The first systematic analysis of histone modifications between segmental duplications demonstrates that two seemingly 'identical' genomic copies are distinct in their epigenomic properties. Results here suggest that local chromatin environments may be implicated in the discrimination of derived copies of segmental duplications from their originals, leading to a biased pseudogenization of the new duplicates. The data also indicate that further exploration of the interactions between histone modification and sequence degeneration is necessary in order to understand the divergence of duplicated sequences. PMID:18598352

  4. Evidence for the involvement of Globosa-like gene duplications and expression divergence in the evolution of floral morphology in the Zingiberales.

    PubMed

    Bartlett, Madelaine E; Specht, Chelsea D

    2010-07-01

    *The MADS box transcription factor family has long been identified as an important contributor to the control of floral development. It is often hypothesized that the evolution of floral development across angiosperms and within specific lineages may occur as a result of duplication, functional diversification, and changes in regulation of MADS box genes. Here we examine the role of Globosa (GLO)-like genes, members of the B-class MADS box gene lineage, in the evolution of floral development within the monocot order Zingiberales. *We assessed changes in perianth and stamen whorl morphology in a phylogenetic framework. We identified GLO homologs (ZinGLO1-4) from 50 Zingiberales species and investigated the evolution of this gene lineage. Expression of two GLO homologs was assessed in Costus spicatus and Musa basjoo. *Based on the phylogenetic data and expression results, we propose several family-specific losses and gains of GLO homologs that appear to be associated with key morphological changes. The GLO-like gene lineage has diversified concomitant with the evolution of the dimorphic perianth and the staminodial labellum. *Duplications and expression divergence within the GLO-like gene lineage may have played a role in floral diversification in the Zingiberales.

  5. Ancient Expansion of the Hox Cluster in Lepidoptera Generated Four Homeobox Genes Implicated in Extra-Embryonic Tissue Formation

    PubMed Central

    Taylor, William R.; Gibbs, Melanie; Breuker, Casper J.; Holland, Peter W. H.

    2014-01-01

    Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes) has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina) plus a caddisfly outgroup (Glyphotaelius pellucidus) to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of large clade Ditrysia; Shx are not found in a caddisfly and a member of the basally diverging Hepialidae (swift moths). Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria), with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks. PMID:25340822

  6. Human-Specific Duplication and Mosaic Transcripts: The Recent Paralogous Structure of Chromosome 22

    PubMed Central

    Bailey, Jeffrey A. ; Yavor, Amy M. ; Viggiano, Luigi ; Misceo, Doriana ; Horvath, Juliann E. ; Archidiacono, Nicoletta ; Schwartz, Stuart ; Rocchi, Mariano ; Eichler, Evan E. 

    2002-01-01

    In recent decades, comparative chromosomal banding, chromosome painting, and gene-order studies have shown strong conservation of gross chromosome structure and gene order in mammals. However, findings from the human genome sequence suggest an unprecedented degree of recent (<35 million years ago) segmental duplication. This dynamism of segmental duplications has important implications in disease and evolution. Here we present a chromosome-wide view of the structure and evolution of the most highly homologous duplications (⩾1 kb and ⩾90%) on chromosome 22. Overall, 10.8% (3.7/33.8 Mb) of chromosome 22 is duplicated, with an average sequence identity of 95.4%. To organize the duplications into tractable units, intron-exon structure and well-defined duplication boundaries were used to define 78 duplicated modules (minimally shared evolutionary segments) with 157 copies on chromosome 22. Analysis of these modules provides evidence for the creation or modification of 11 novel transcripts. Comparative FISH analyses of human, chimpanzee, gorilla, orangutan, and macaque reveal qualitative and quantitative differences in the distribution of these duplications—consistent with their recent origin. Several duplications appear to be human specific, including a ∼400-kb duplication (99.4%–99.8% sequence identity) that transposed from chromosome 14 to the most proximal pericentromeric region of chromosome 22. Experimental and in silico data further support a pericentromeric gradient of duplications where the most recent duplications transpose adjacent to the centromere. Taken together, these data suggest that segmental duplications have been an ongoing process of primate genome evolution, contributing to recent gene innovation and the dynamic transformation of genome architecture within and among closely related species. PMID:11731936

  7. Variation in GABA-A subunit gene copy number in an autistic patient with mosaic 4 p duplication (p12p16).

    PubMed

    Kakinuma, Hiroaki; Ozaki, Mamoru; Sato, Hitoshi; Takahashi, Hiroaki

    2008-09-05

    Autism has been associated with chromosomal aberrations, including duplications at chromosome 4, and the identification of genetic factors contributing to the etiology of this disease is the focus of much research. Here we report a Japanese girl with mosaic of chromosome 4p duplication, mos 46,XX,dup(4)(p12p16)[54]/46,XX[6], who was diagnosed with autism at 3 years of age. Fluorescence in situ hybridization (FISH) with probes covering the region spanning a cluster of the gamma aminobutyric acid A (GABA-A) receptor subunit genes in the proximal short arm of chromosome 4 demonstrated total three signals for the GABRG1, GABRA4, and GABRA2 genes, but only two signals for GABRB1. This suggests that aberrant copy number of the GABA-A receptor subunit genes may contribute to the etiology of autism in this patient. 2007 Wiley-Liss, Inc.

  8. [Genome-wide identification and expression analysis of auxin-related gene families in grape].

    PubMed

    Yuan, Hua-zhao; Zhao, Mi-zhen; Wu, Wei-min; Yu, Hong-Mei; Qian, Ya-ming; Wang, Zhuang-wei; Wang, Xi-cheng

    2015-07-01

    The auxin response gene family adjusts the auxin balance and the growth hormone signaling pathways in plants. Using bioinformatics methods, the auxin-response genes from the grape genome database are identified and their chromosomal location, gene collinearity and phylogenetic analysis are performed. Probable genes include 25 AUX_IAA, 19 ARF, 9 GH3 and 42 LBD genes, which are unevenly distributed on all 19 chromosomes and some of them formed distinct tandem duplicate gene clusters. The available grape microarray databases show that all of the auxin-response genes are expressed in fruit and leaf buds, and significant overexpressed during fruit color-changing, bud break and bud dormancy periods. This paper provides a resource for functional studies of auxin-response genes in grape leaf and fruit development.

  9. Targeted Mutagenesis of Duplicated Genes in Soybean with Zinc-Finger Nucleases1[W][OA

    PubMed Central

    Curtin, Shaun J.; Zhang, Feng; Sander, Jeffry D.; Haun, William J.; Starker, Colby; Baltes, Nicholas J.; Reyon, Deepak; Dahlborg, Elizabeth J.; Goodwin, Mathew J.; Coffman, Andrew P.; Dobbs, Drena; Joung, J. Keith; Voytas, Daniel F.; Stupar, Robert M.

    2011-01-01

    We performed targeted mutagenesis of a transgene and nine endogenous soybean (Glycine max) genes using zinc-finger nucleases (ZFNs). A suite of ZFNs were engineered by the recently described context-dependent assembly platform—a rapid, open-source method for generating zinc-finger arrays. Specific ZFNs targeting DICER-LIKE (DCL) genes and other genes involved in RNA silencing were cloned into a vector under an estrogen-inducible promoter. A hairy-root transformation system was employed to investigate the efficiency of ZFN mutagenesis at each target locus. Transgenic roots exhibited somatic mutations localized at the ZFN target sites for seven out of nine targeted genes. We next introduced a ZFN into soybean via whole-plant transformation and generated independent mutations in the paralogous genes DCL4a and DCL4b. The dcl4b mutation showed efficient heritable transmission of the ZFN-induced mutation in the subsequent generation. These findings indicate that ZFN-based mutagenesis provides an efficient method for making mutations in duplicate genes that are otherwise difficult to study due to redundancy. We also developed a publicly accessible Web-based tool to identify sites suitable for engineering context-dependent assembly ZFNs in the soybean genome. PMID:21464476

  10. Duplication and selection in the evolution of primate β-defensin genes

    PubMed Central

    Semple, Colin AM; Rolfe, Mark; Dorin, Julia R

    2003-01-01

    Background Innate immunity is the first line of defense against microorganisms in vertebrates and acts by providing an initial barrier to microorganisms and triggering adaptive immune responses. Peptides such as β-defensins are an important component of this defense, providing a broad spectrum of antimicrobial activity against bacteria, fungi, mycobacteria and several enveloped viruses. β-defensins are small cationic peptides that vary in their expression patterns and spectrum of pathogen specificity. Disruptions in β-defensin function have been implicated in human diseases, including cystic fibrosis, and a fuller understanding of the variety, function and evolution of human β-defensins might form the basis for novel therapies. Here we use a combination of laboratory and computational techniques to characterize the main human β-defensin locus on chromosome 8p22-p23. Results In addition to known genes in the region we report the genomic structures and expression patterns of four novel human β-defensin genes and a related pseudogene. These genes show an unusual pattern of evolution, with rapid divergence between second exon sequences that encode the mature β-defensin peptides matched by relative stasis in first exons that encode signal peptides. Conclusions We conclude that the 8p22-p23 locus has evolved by successive rounds of duplication followed by substantial divergence involving positive selection, to produce a diverse cluster of paralogous genes established before the human-baboon divergence more than 23 million years ago. Positive selection, disproportionately favoring alterations in the charge of amino-acid residues, is implicated as driving second exon divergence in these genes. PMID:12734011

  11. Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

    PubMed

    Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

    2010-08-01

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.

  12. Chlorophyll Biosynthesis Gene Evolution Indicates Photosystem Gene Duplication, Not Photosystem Merger, at the Origin of Oxygenic Photosynthesis

    PubMed Central

    Sousa, Filipa L.; Shavit-Grievink, Liat; Allen, John F.; Martin, William F.

    2013-01-01

    An open question regarding the evolution of photosynthesis is how cyanobacteria came to possess the two reaction center (RC) types, Type I reaction center (RCI) and Type II reaction center (RCII). The two main competing theories in the foreground of current thinking on this issue are that either 1) RCI and RCII are related via lineage divergence among anoxygenic photosynthetic bacteria and became merged in cyanobacteria via an event of large-scale lateral gene transfer (also called "fusion" theories) or 2) the two RC types are related via gene duplication in an ancestral, anoxygenic but protocyanobacterial phototroph that possessed both RC types before making the transition to using water as an electron donor. To distinguish between these possibilities, we studied the evolution of the core (bacterio)chlorophyll biosynthetic pathway from protoporphyrin IX (Proto IX) up to (bacterio)chlorophyllide a. The results show no dichotomy of chlorophyll biosynthesis genes into RCI- and RCII-specific chlorophyll biosynthetic clades, thereby excluding models of fusion at the origin of cyanobacteria and supporting the selective-loss hypothesis. By considering the cofactor demands of the pathway and the source genes from which several steps in chlorophyll biosynthesis are derived, we infer that the cell that first synthesized chlorophyll was a cobalamin-dependent, heme-synthesizing, diazotrophic anaerobe. PMID:23258841

  13. Chlorophyll biosynthesis gene evolution indicates photosystem gene duplication, not photosystem merger, at the origin of oxygenic photosynthesis.

    PubMed

    Sousa, Filipa L; Shavit-Grievink, Liat; Allen, John F; Martin, William F

    2013-01-01

    An open question regarding the evolution of photosynthesis is how cyanobacteria came to possess the two reaction center (RC) types, Type I reaction center (RCI) and Type II reaction center (RCII). The two main competing theories in the foreground of current thinking on this issue are that either 1) RCI and RCII are related via lineage divergence among anoxygenic photosynthetic bacteria and became merged in cyanobacteria via an event of large-scale lateral gene transfer (also called "fusion" theories) or 2) the two RC types are related via gene duplication in an ancestral, anoxygenic but protocyanobacterial phototroph that possessed both RC types before making the transition to using water as an electron donor. To distinguish between these possibilities, we studied the evolution of the core (bacterio)chlorophyll biosynthetic pathway from protoporphyrin IX (Proto IX) up to (bacterio)chlorophyllide a. The results show no dichotomy of chlorophyll biosynthesis genes into RCI- and RCII-specific chlorophyll biosynthetic clades, thereby excluding models of fusion at the origin of cyanobacteria and supporting the selective-loss hypothesis. By considering the cofactor demands of the pathway and the source genes from which several steps in chlorophyll biosynthesis are derived, we infer that the cell that first synthesized chlorophyll was a cobalamin-dependent, heme-synthesizing, diazotrophic anaerobe.

  14. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    PubMed

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.

  15. Evolution by selection, recombination, and gene duplication in MHC class I genes of two Rhacophoridae species

    PubMed Central

    2013-01-01

    Background Comparison of major histocompatibility complex (MHC) genes across vertebrate species can reveal molecular mechanisms underlying the evolution of adaptive immunity-related proteins. As the first terrestrial tetrapods, amphibians deserve special attention because of their exposure to probably increased spectrum of microorganisms compared with ancestral aquatic fishes. Knowledge regarding the evolutionary patterns and mechanisms associated with amphibian MHC genes remains limited. The goal of the present study was to isolate MHC class I genes from two Rhacophoridae species (Rhacophorus omeimontis and Polypedates megacephalus) and examine their evolution. Results We identified 27 MHC class I alleles spanning the region from exon 2 to 4 in 38 tree frogs. The available evidence suggests that these 27 sequences all belong to classical MHC class I (MHC Ia) genes. Although several anuran species only display one MHC class Ia locus, at least two or three loci were observed in P. megacephalus and R. omeimontis, indicating that the number of MHC class Ia loci varies among anuran species. Recombination events, which mainly involve the entire exons, played an important role in shaping the genetic diversity of the 27 MHC class Ia alleles. In addition, signals of positive selection were found in Rhacophoridae MHC class Ia genes. Amino acid sites strongly suggested by program to be under positive selection basically accorded with the putative antigen binding sites deduced from crystal structure of human HLA. Phylogenetic relationships among MHC class I alleles revealed the presence of trans-species polymorphisms. Conclusions In the two Rhacophoridae species (1) there are two or three MHC class Ia loci; (2) recombination mainly occurs between the entire exons of MHC class Ia genes; (3) balancing selection, gene duplication and recombination all contribute to the diversity of MHC class Ia genes. These findings broaden our knowledge on the evolution of amphibian MHC systems

  16. Structure of a double-domain phosphagen kinase reveals an asymmetric arrangement of the tandem domains.

    PubMed

    Wang, Zhiming; Qiao, Zhu; Ye, Sheng; Zhang, Rongguang

    2015-04-01

    Tandem duplications and fusions of single genes have led to magnificent expansions in the divergence of protein structures and functions over evolutionary timescales. One of the possible results is polydomain enzymes with interdomain cooperativities, few examples of which have been structurally characterized at the full-length level to explore their innate synergistic mechanisms. This work reports the crystal structures of a double-domain phosphagen kinase in both apo and ligand-bound states, revealing a novel asymmetric L-shaped arrangement of the two domains. Unexpectedly, the interdomain connections are not based on a flexible hinge linker but on a rigid secondary-structure element: a long α-helix that tethers the tandem domains in relatively fixed positions. Besides the connective helix, the two domains also contact each other directly and form an interdomain interface in which hydrogen bonds and hydrophobic interactions further stabilize the L-shaped domain arrangement. Molecular-dynamics simulations show that the interface is generally stable, suggesting that the asymmetric domain arrangement crystallographically observed in the present study is not a conformational state simply restrained by crystal-packing forces. It is possible that the asymmetrically arranged tandem domains could provide a structural basis for further studies of the interdomain synergy.

  17. Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress

    PubMed Central

    Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming

    2017-01-01

    The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance. PMID:28417911

  18. Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress.

    PubMed

    Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming

    2017-04-12

    The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance.

  19. Saccharomyces cerevisiae ribosomal protein L37 is encoded by duplicate genes that are differentially expressed.

    PubMed

    Tornow, J; Santangelo, G M

    1994-06-01

    A duplicate copy of the RPL37A gene (encoding ribosomal protein L37) was cloned and sequenced. The coding region of RPL37B is very similar to that of RPL37A, with only one conservative amino-acid difference. However, the intron and flanking sequences of the two genes are extremely dissimilar. Disruption experiments indicate that the two loci are not functionally equivalent: disruption of RPL37B was insignificant, but disruption of RPL37A severely impaired the growth rate of the cell. When both RPL37 loci are disrupted, the cell is unable to grow at all, indicating that rpL37 is an essential protein. The functional disparity between the two RPL37 loci could be explained by differential gene expression. The results of two experiments support this idea: gene fusion of RPL37A to a reporter gene resulted in six-fold higher mRNA levels than was generated by the same reporter gene fused to RPL37B, and a modest increase in gene dosage of RPL37B overcame the lack of a functional RPL37A gene.

  20. MLL duplication in a pediatric patient with B-cell lymphoblastic lymphoma.

    PubMed

    Mater, David Van; Goodman, Barbara K; Wang, Endi; Gaca, Ana M; Wechsler, Daniel S

    2012-04-01

    Lymphoblastic lymphoma is the second most common type of non-Hodgkin lymphoma seen in children. Approximately, 90% of lymphoblastic lymphomas arise from T cells, with the remaining 10% being B-cell-lineage derived. Although T-cell lymphoblastic lymphoma most frequently occurs in the anterior mediastinum (thymus), B-cell lymphoblastic lymphoma (B-LBL) predominates in extranodal sites such as skin and bone. Here, we describe a pediatric B-LBL patient who presented with extensive abdominal involvement and whose lymphoma cells displayed segmental duplication of the mixed lineage leukemia (MLL) gene. MLL duplication/amplification has been described primarily in acute myeloid leukemia and myelodysplastic syndrome with no published reports of discrete MLL duplication/amplification events in B-LBL. The MLL gene duplication noted in this case may represent a novel mechanism for tumorigenesis in B-LBL.

  1. Genotype-phenotype characterization in 13 individuals with chromosome Xp11.22 duplications.

    PubMed

    Grams, Sarah E; Argiropoulos, Bob; Lines, Matthew; Chakraborty, Pranesh; Mcgowan-Jordan, Jean; Geraghty, Michael T; Tsang, Marilyn; Eswara, Marthand; Tezcan, Kamer; Adams, Kelly L; Linck, Leesa; Himes, Patricia; Kostiner, Dana; Zand, Dina J; Stalker, Heather; Driscoll, Daniel J; Huang, Taosheng; Rosenfeld, Jill A; Li, Xu; Chen, Emily

    2016-04-01

    We report 13 new individuals with duplications in Xp11.22-p11.23. The index family has one male and two female members in three generations with mild-severe intellectual disability (ID), speech delay, dysmorphic features, early puberty, constipation, and/or hand and foot abnormalities. Affected individuals were found to have two small duplications in Xp11.22 at nucleotide position (hg19) 50,112,063-50,456,458 bp (distal) and 53,160,114-53,713,154 bp (proximal). Collectively, these two regions include 14 RefSeq genes, prompting collection of a larger cohort of patients, in an attempt to delineate critical genes associated with the observed phenotype. In total, we have collected data on nine individuals with duplications overlapping the distal duplication region containing SHROOM4 and DGKK and eight individuals overlapping the proximal region including HUWE1. Duplications of HUWE1 have been previously associated with non-syndromic ID. Our data, with previously published reports, suggest that duplications involving SHROOM4 and DGKK may represent a new syndromic X-linked ID critical region associated with mild to severe ID, speech delay +/- dysarthria, attention deficit disorder, precocious puberty, constipation, and motor delay. We frequently observed foot abnormalities, 5th finger clinodactyly, tapering fingers, constipation, and exercise intolerance in patients with duplications of these two genes. Regarding duplications including the proximal region, our observations agree with previous studies, which have found associations with intellectual disability. In addition, expressive language delay, failure to thrive, motor delay, and 5th finger clinodactyly were also frequently observed in patients with the proximal duplication. © 2015 Wiley Periodicals, Inc.

  2. Increased expression of LD1 genes transcribed by RNA polymerase I in Leishmania donovani as a result of duplication into the rRNA gene locus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lodes, M.J.; Merlin, G.; DeVos, T.

    1995-12-01

    This report investigates the duplication of two LD1 genes into the rRNA locus and the resultant transcription by RNA polymerase I, which has a faster transcription rate than that of RNA polymerase II. This was conducted using a 2.2-Mb chromosome in Leishmania donovani. 55 refs., 6 figs.

  3. Tandem alternative polyadenylation events of genes in non-eosinophilic nasal polyp tissue identified by high-throughput sequencing analysis

    PubMed Central

    TIAN, PENG; LI, JIE; LIU, XIANG; LI, YUXI; CHEN, MEIHENG; MA, YUN; ZHENG, YI QING; FU, YONGGUI; ZOU, HUA

    2014-01-01

    Nasal polyps (NP) is highly associated with the disorder of immune cells. Alternative polyadenylation (APA) produces mRNA isoforms with different length of 3′-untranslated region (UTR) and regulates gene expression. It has been proven that this APA-mediated regulation of 3′UTR length is an immune-associated phenomenon. The aim of this study was to investigate the genome-wide alternative tandem 3′UTR length switching events in non-eosinophilic nasal polyp tissue. Thirteen patients diagnosed as having non-eosinophilic nasal polyps were included in this study. Nasal polyp tissue and control mucosa were collected during surgery. The 3′ end library of cDNA was constructed. The recovered libraries were sequenced with second sequencing technology, and the sequencing data were analyzed by an in-house bioinformatics pipeline. Tandem 3′UTR length switching between samples was detected by a test of linear trend alternative to independence. We found a significant alteration in the tandem 3′UTR length in 1,920 genes in nasal polyp samples. Functional annotation results showed that several gene ontology (GO) terms were enriched in the list of genes with switched APA sites, including regulation of transcription, macromolecule catabolic localization and mRNA processing. The results suggested that APA-mediated alternative 3′UTR regulation plays an important role in the post-transcriptional regulation of gene expression in non-eosinophilic nasal polyps. PMID:24715051

  4. Characterization of DNA methyltransferase and demethylase genes in Fragaria vesca.

    PubMed

    Gu, Tingting; Ren, Shuai; Wang, Yuanhua; Han, Yuhui; Li, Yi

    2016-06-01

    DNA methylation is an epigenetic modification essential for gene regulations in plants, but understanding on how it is involved in fruit development, especially in non-climacteric fleshy fruit, is limited. The diploid woodland strawberry (Fragaria vesca) is an important model for non-climacteric fruit crops. In this study, we identified DNA methyltransferase genes and demethylase genes in Fragaria vesca and other angiosperm species. In accordance with previous studies, our phylogenetic analyses of those DNA methylation modifiers support the clustering of those genes into several classes. Our data indicate that whole-genome duplications and tandem duplications contributed to the expansion of those DNA methylation modifiers in angiosperms. We have further demonstrated that some DNA methylase and demethylase genes reach their highest expression levels in strawberry fleshy fruits when turning from white to red, suggesting that DNA methylation might undergo a dramatic change at the onset of fleshy fruit-ripening process. In addition, we have observed that expression of some DNA demethylase genes increases in response to various abiotic stresses including heat, cold, drought and salinity. Collectively, our study indicates a regulatory role of DNA methylation in the turning stage of non-climacteric fleshy fruit and responses to environment stimuli, and would facilitate functional studies of DNA methylation in the growth and development of non-climacteric fruits.

  5. The evolution of genes encoding for green fluorescent proteins: insights from cephalochordates (amphioxus)

    NASA Astrophysics Data System (ADS)

    Yue, Jia-Xing; Holland, Nicholas D.; Holland, Linda Z.; Deheyn, Dimitri D.

    2016-06-01

    Green Fluorescent Protein (GFP) was originally found in cnidarians, and later in copepods and cephalochordates (amphioxus) (Branchiostoma spp). Here, we looked for GFP-encoding genes in Asymmetron, an early-diverged cephalochordate lineage, and found two such genes closely related to some of the Branchiostoma GFPs. Dim fluorescence was found throughout the body in adults of Asymmetron lucayanum, and, as in Branchiostoma floridae, was especially intense in the ripe ovaries. Spectra of the fluorescence were similar between Asymmetron and Branchiostoma. Lineage-specific expansion of GFP-encoding genes in the genus Branchiostoma was observed, largely driven by tandem duplications. Despite such expansion, purifying selection has strongly shaped the evolution of GFP-encoding genes in cephalochordates, with apparent relaxation for highly duplicated clades. All cephalochordate GFP-encoding genes are quite different from those of copepods and cnidarians. Thus, the ancestral cephalochordates probably had GFP, but since GFP appears to be lacking in more early-diverged deuterostomes (echinoderms, hemichordates), it is uncertain whether the ancestral cephalochordates (i.e. the common ancestor of Asymmetron and Branchiostoma) acquired GFP by horizontal gene transfer (HGT) from copepods or cnidarians or inherited it from the common ancestor of copepods and deuterostomes, i.e. the ancestral bilaterians.

  6. Soybean kinome: functional classification and gene expression patterns

    PubMed Central

    Liu, Jinyi; Chen, Nana; Grant, Joshua N.; Cheng, Zong-Ming (Max); Stewart, C. Neal; Hewezi, Tarek

    2015-01-01

    The protein kinase (PK) gene family is one of the largest and most highly conserved gene families in plants and plays a role in nearly all biological functions. While a large number of genes have been predicted to encode PKs in soybean, a comprehensive functional classification and global analysis of expression patterns of this large gene family is lacking. In this study, we identified the entire soybean PK repertoire or kinome, which comprised 2166 putative PK genes, representing 4.67% of all soybean protein-coding genes. The soybean kinome was classified into 19 groups, 81 families, and 122 subfamilies. The receptor-like kinase (RLK) group was remarkably large, containing 1418 genes. Collinearity analysis indicated that whole-genome segmental duplication events may have played a key role in the expansion of the soybean kinome, whereas tandem duplications might have contributed to the expansion of specific subfamilies. Gene structure, subcellular localization prediction, and gene expression patterns indicated extensive functional divergence of PK subfamilies. Global gene expression analysis of soybean PK subfamilies revealed tissue- and stress-specific expression patterns, implying regulatory functions over a wide range of developmental and physiological processes. In addition, tissue and stress co-expression network analysis uncovered specific subfamilies with narrow or wide interconnected relationships, indicative of their association with particular or broad signalling pathways, respectively. Taken together, our analyses provide a foundation for further functional studies to reveal the biological and molecular functions of PKs in soybean. PMID:25614662

  7. Retention of duplicated ITAM-containing transmembrane signaling subunits in the tetraploid amphibian species Xenopus laevis

    PubMed Central

    Guselnikov, S.V.; Grayfer, L.; De Jesús Andino, F.; Rogozin, I.B.; Robert, J.; Taranin, A.V.

    2015-01-01

    The ITAM-bearing transmembrane signaling subunits (TSS) are indispensable components of activating leukocyte receptor complexes. The TSS-encoding genes map to paralogous chromosomal regions, which are thought to arise from ancient genome tetraploidization(s). To assess a possible role of tetraploidization in the TSS evolution, we studied TSS and other functionally linked genes in the amphibian species Xenopus laevis whose genome was duplicated about 40 MYR ago. We found that X. laevis has retained a duplicated set of sixteen TSS genes, all except one being transcribed. Furthermore, duplicated TCRα loci and genes encoding TSS-coupling protein kinases have also been retained. No clear evidence for functional divergence of the TSS paralogs was obtained from gene expression and sequence analyses. We suggest that the main factor of maintenance of duplicated TSS genes in X. laevis was a protein dosage effect and that this effect might have facilitated the TSS set expansion in early vertebrates. PMID:26170006

  8. A 20-basepair duplication in the human thyroid peroxidase gene results in a total iodide organification defect and congenital hypothyroidism

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bikker, H.; Hartog, M.T. den; Gons, M.H.

    1994-07-01

    In this study, the authors present the molecular basis of a total iodide organification defect causing severe congenital hypothyroidism. In the thyroid gland of the patient, thyroid peroxidase (TPO) activity and the iodination degree of thyroglobulin were below detection limits, and no TPO messenger ribonucleic acid was detectable by Northern blot analysis. Denaturing gradient gel electrophoretic analysis of the TPO gene of the patient revealed a homozygous mutation in exon 2. Sequence analysis showed the presence of a 20-basepair duplication, 47 basepairs down-stream of the ATG start codon. This duplication generates a frame shift, resulting in a termination signal inmore » exon 3, compatible with the complete absence of TPO. Both parents of the patient are heterozygous for the same duplication, confirming the recessive mode of inheritance of the mutation. 32 refs., 4 figs.« less

  9. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Glodzik, Dominik; Morganella, Sandro; Davies, Helen

    Somatic rearrangements contribute to the mutagenized landscape of cancer genomes. Here, we systematically interrogated rearrangements in 560 breast cancers by using a piecewise constant fitting approach. We identified 33 hotspots of large (>100 kb) tandem duplications, a mutational signature associated with homologous-recombination-repair deficiency. Notably, these tandem-duplication hotspots were enriched in breast cancer germline susceptibility loci (odds ratio (OR) = 4.28) and breast-specific 'super-enhancer' regulatory elements (OR = 3.54). These hotspots may be sites of selective susceptibility to double-strand-break damage due to high transcriptional activity or, through incrementally increasing copy number, may be sites of secondary selective pressure. Furthermore, the transcriptomicmore » consequences ranged from strong individual oncogene effects to weak but quantifiable multigene expression effects. We thus present a somatic-rearrangement mutational process affecting coding sequences and noncoding regulatory elements and contributing a continuum of driver consequences, from modest to strong effects, thereby supporting a polygenic model of cancer development.« less

  10. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers

    DOE PAGES

    Glodzik, Dominik; Morganella, Sandro; Davies, Helen; ...

    2017-01-23

    Somatic rearrangements contribute to the mutagenized landscape of cancer genomes. Here, we systematically interrogated rearrangements in 560 breast cancers by using a piecewise constant fitting approach. We identified 33 hotspots of large (>100 kb) tandem duplications, a mutational signature associated with homologous-recombination-repair deficiency. Notably, these tandem-duplication hotspots were enriched in breast cancer germline susceptibility loci (odds ratio (OR) = 4.28) and breast-specific 'super-enhancer' regulatory elements (OR = 3.54). These hotspots may be sites of selective susceptibility to double-strand-break damage due to high transcriptional activity or, through incrementally increasing copy number, may be sites of secondary selective pressure. Furthermore, the transcriptomicmore » consequences ranged from strong individual oncogene effects to weak but quantifiable multigene expression effects. We thus present a somatic-rearrangement mutational process affecting coding sequences and noncoding regulatory elements and contributing a continuum of driver consequences, from modest to strong effects, thereby supporting a polygenic model of cancer development.« less

  11. Natural non-homologous recombination led to the emergence of a duplicated V3-NS5A region in HCV-1b strains associated with hepatocellular carcinoma.

    PubMed

    Le Guillou-Guillemette, Hélène; Pivert, Adeline; Bouthry, Elise; Henquell, Cécile; Petsaris, Odile; Ducancelle, Alexandra; Veillon, Pascal; Vallet, Sophie; Alain, Sophie; Thibault, Vincent; Abravanel, Florence; Rosenberg, Arielle A; André-Garnier, Elisabeth; Bour, Jean-Baptiste; Baazia, Yazid; Trimoulet, Pascale; André, Patrice; Gaudy-Graffin, Catherine; Bettinger, Dominique; Larrat, Sylvie; Signori-Schmuck, Anne; Saoudin, Hénia; Pozzetto, Bruno; Lagathu, Gisèle; Minjolle-Cha, Sophie; Stoll-Keller, Françoise; Pawlotsky, Jean-Michel; Izopet, Jacques; Payan, Christopher; Lunel-Fabiani, Françoise; Lemaire, Christophe

    2017-01-01

    The emergence of new strains in RNA viruses is mainly due to mutations or intra and inter-genotype homologous recombination. Non-homologous recombinations may be deleterious and are rarely detected. In previous studies, we identified HCV-1b strains bearing two tandemly repeated V3 regions in the NS5A gene without ORF disruption. This polymorphism may be associated with an unfavorable course of liver disease and possibly involved in liver carcinogenesis. Here we aimed at characterizing the origin of these mutant strains and identifying the evolutionary mechanism on which the V3 duplication relies. Direct sequencing of the entire NS5A and E1 genes was performed on 27 mutant strains. Quasispecies analyses in consecutive samples were also performed by cloning and sequencing the NS5A gene for all mutant and wild strains. We analyzed the mutant and wild-type sequence polymorphisms using Bayesian methods to infer the evolutionary history of and the molecular mechanism leading to the duplication-like event. Quasispecies were entirely composed of exclusively mutant or wild-type strains respectively. Mutant quasispecies were found to have been present since contamination and had persisted for at least 10 years. This V3 duplication-like event appears to have resulted from non-homologous recombination between HCV-1b wild-type strains around 100 years ago. The association between increased liver disease severity and these HCV-1b mutants may explain their persistence in chronically infected patients. These results emphasize the possible consequences of non-homologous recombination in the emergence and severity of new viral diseases.

  12. Evolution of developmental roles of Pax2/5/8 paralogs after independent duplication in urochordate and vertebrate lineages

    PubMed Central

    Bassham, Susan; Cañestro, Cristian; Postlethwait, John H

    2008-01-01

    Background Gene duplication provides opportunities for lineage diversification and evolution of developmental novelties. Duplicated genes generally either disappear by accumulation of mutations (nonfunctionalization), or are preserved either by the origin of positively selected functions in one or both duplicates (neofunctionalization), or by the partitioning of original gene subfunctions between the duplicates (subfunctionalization). The Pax2/5/8 family of important developmental regulators has undergone parallel expansion among chordate groups. After the divergence of urochordate and vertebrate lineages, two rounds of independent gene duplications resulted in the Pax2, Pax5, and Pax8 genes of most vertebrates (the sister group of the urochordates), and an additional duplication provided the pax2a and pax2b duplicates in teleost fish. Separate from the vertebrate genome expansions, a duplication also created two Pax2/5/8 genes in the common ancestor of ascidian and larvacean urochordates. Results To better understand mechanisms underlying the evolution of duplicated genes, we investigated, in the larvacean urochordate Oikopleura dioica, the embryonic gene expression patterns of Pax2/5/8 paralogs. We compared the larvacean and ascidian expression patterns to infer modular subfunctions present in the single pre-duplication Pax2/5/8 gene of stem urochordates, and we compared vertebrate and urochordate expression to infer the suite of Pax2/5/8 gene subfunctions in the common ancestor of olfactores (vertebrates + urochordates). Expression pattern differences of larvacean and ascidian Pax2/5/8 orthologs in the endostyle, pharynx and hindgut suggest that some ancestral gene functions have been partitioned differently to the duplicates in the two urochordate lineages. Novel expression in the larvacean heart may have resulted from the neofunctionalization of a Pax2/5/8 gene in the urochordates. Expression of larvacean Pax2/5/8 in the endostyle, in sites of epithelial

  13. Evolution of developmental roles of Pax2/5/8 paralogs after independent duplication in urochordate and vertebrate lineages.

    PubMed

    Bassham, Susan; Cañestro, Cristian; Postlethwait, John H

    2008-08-22

    Gene duplication provides opportunities for lineage diversification and evolution of developmental novelties. Duplicated genes generally either disappear by accumulation of mutations (nonfunctionalization), or are preserved either by the origin of positively selected functions in one or both duplicates (neofunctionalization), or by the partitioning of original gene subfunctions between the duplicates (subfunctionalization). The Pax2/5/8 family of important developmental regulators has undergone parallel expansion among chordate groups. After the divergence of urochordate and vertebrate lineages, two rounds of independent gene duplications resulted in the Pax2, Pax5, and Pax8 genes of most vertebrates (the sister group of the urochordates), and an additional duplication provided the pax2a and pax2b duplicates in teleost fish. Separate from the vertebrate genome expansions, a duplication also created two Pax2/5/8 genes in the common ancestor of ascidian and larvacean urochordates. To better understand mechanisms underlying the evolution of duplicated genes, we investigated, in the larvacean urochordate Oikopleura dioica, the embryonic gene expression patterns of Pax2/5/8 paralogs. We compared the larvacean and ascidian expression patterns to infer modular subfunctions present in the single pre-duplication Pax2/5/8 gene of stem urochordates, and we compared vertebrate and urochordate expression to infer the suite of Pax2/5/8 gene subfunctions in the common ancestor of olfactores (vertebrates + urochordates). Expression pattern differences of larvacean and ascidian Pax2/5/8 orthologs in the endostyle, pharynx and hindgut suggest that some ancestral gene functions have been partitioned differently to the duplicates in the two urochordate lineages. Novel expression in the larvacean heart may have resulted from the neofunctionalization of a Pax2/5/8 gene in the urochordates. Expression of larvacean Pax2/5/8 in the endostyle, in sites of epithelial remodeling, and in

  14. Ancestral and more recently acquired syntenic relationships of MADS-box genes uncovered by the Physcomitrella patens pseudochromosomal genome assembly.

    PubMed

    Barker, Elizabeth I; Ashton, Neil W

    2016-03-01

    The Physcomitrella pseudochromosomal genome assembly revealed previously invisible synteny enabling realisation of the full potential of shared synteny as a tool for probing evolution of this plant's MADS-box gene family. Assembly of the sequenced genome of Physcomitrella patens into 27 mega-scaffolds (pseudochromosomes) has confirmed the major predictions of our earlier model of expansion of the MADS-box gene family in the Physcomitrella lineage. Additionally, microsynteny has been conserved in the immediate vicinity of some recent duplicates of MADS-box genes. However, comparison of non-syntenic MIKC MADS-box genes and neighbouring genes indicates that chromosomal rearrangements and/or sequence degeneration have destroyed shared synteny over longer distances (macrosynteny) around MADS-box genes despite subsets comprising two or three MIKC genes having remained syntenic. In contrast, half of the type I MADS-box genes have been transposed creating new syntenic relations with MIKC genes. This implies that conservation of ancient ancestral synteny of MIKC genes and of more recently acquired synteny of type I and MIKC genes may be selectively advantageous. Our revised model predicts the birth rate of MIKC genes in Physcomitrella is higher than that of type I genes. However, this difference is attributable to an early tandem duplication and an early segmental duplication of MIKC genes prior to the two polyploidisations that account for most of the expansion of the MADS-box gene family in Physcomitrella. Furthermore, this early segmental duplication spawned two chromosomal lineages: one with a MIKC (C) gene, belonging to the PPM2 clade, in close proximity to one or a pair of MIKC* genes and another with a MIKC (C) gene, belonging to the PpMADS-S clade, characterised by greater separation from syntenic MIKC* genes. Our model has evolutionary implications for the Physcomitrella karyotype.

  15. Sequencing of Pax6 Loci from the Elephant Shark Reveals a Family of Pax6 Genes in Vertebrate Genomes, Forged by Ancient Duplications and Divergences

    PubMed Central

    Gautier, Philippe; Loosli, Felix; Tay, Boon-Hui; Tay, Alice; Murdoch, Emma; Coutinho, Pedro; van Heyningen, Veronica; Brenner, Sydney; Venkatesh, Byrappa; Kleinjan, Dirk A.

    2013-01-01

    Pax6 is a developmental control gene essential for eye development throughout the animal kingdom. In addition, Pax6 plays key roles in other parts of the CNS, olfactory system, and pancreas. In mammals a single Pax6 gene encoding multiple isoforms delivers these pleiotropic functions. Here we provide evidence that the genomes of many other vertebrate species contain multiple Pax6 loci. We sequenced Pax6-containing BACs from the cartilaginous elephant shark (Callorhinchus milii) and found two distinct Pax6 loci. Pax6.1 is highly similar to mammalian Pax6, while Pax6.2 encodes a paired-less Pax6. Using synteny relationships, we identify homologs of this novel paired-less Pax6.2 gene in lizard and in frog, as well as in zebrafish and in other teleosts. In zebrafish two full-length Pax6 duplicates were known previously, originating from the fish-specific genome duplication (FSGD) and expressed in divergent patterns due to paralog-specific loss of cis-elements. We show that teleosts other than zebrafish also maintain duplicate full-length Pax6 loci, but differences in gene and regulatory domain structure suggest that these Pax6 paralogs originate from a more ancient duplication event and are hence renamed as Pax6.3. Sequence comparisons between mammalian and elephant shark Pax6.1 loci highlight the presence of short- and long-range conserved noncoding elements (CNEs). Functional analysis demonstrates the ancient role of long-range enhancers for Pax6 transcription. We show that the paired-less Pax6.2 ortholog in zebrafish is expressed specifically in the developing retina. Transgenic analysis of elephant shark and zebrafish Pax6.2 CNEs with homology to the mouse NRE/Pα internal promoter revealed highly specific retinal expression. Finally, morpholino depletion of zebrafish Pax6.2 resulted in a “small eye” phenotype, supporting a role in retinal development. In summary, our study reveals that the pleiotropic functions of Pax6 in vertebrates are served by a divergent

  16. A rare complex DNA rearrangement in the murine Steel gene results in exon duplication and a lethal phenotype.

    PubMed

    Chandra, Saurabh; Kapur, Reuben; Chuzhanova, Nadia; Summey, Victoria; Prentice, David; Barker, Jane; Cooper, David N; Williams, David A

    2003-11-15

    Kit ligand (Kitl), encoded by the Steel (Sl) locus, plays an essential role in hematopoiesis, gametogenesis, and melanogenesis during both embryonic and adult life. We have characterized a new spontaneous mutant of the Sl locus in mice designated KitlSl-20J that arose in the breeding colony at Jackson Laboratories. Heterozygous KitlSl-20J mice display a white belly spot and intercrossing results in an embryonic lethal phenotype in the homozygous state. Analysis of homozygous embryos demonstrated a significant reduction in fetal liver cellularity, colony forming unit-erythroid (CFU-E) progenitors, and a total absence of germ cells. Although expressed in vivo, recombinant mutant protein demonstrated loss of bioactivity that was correlated with lack of receptor binding. Analysis of the Sl gene transcripts in heterozygous KitlSl-20J mice revealed an in-frame tandem duplication of exon 3. A long-range polymerase chain reaction (PCR) strategy using overlapping primers in exon 3 amplified an approximately 7-kilobase (kb) product from DNA isolated from heterozygous KitlSl-20J mice but not from wild-type DNA that contained sequences from both introns 2 and 3 and an inverted intron 2 sequence, suggesting a complex rearrangement as the mechanism of the mutation. "Complexity analysis" of the sequence of the amplified product strongly suggests that local DNA motifs may have contributed to the generation of this spontaneous KitlSl-20J allele, likely mediated by a 2-step process. The KitlSl-20J mutation is a unique KitlSl allele and represents an unusual mechanism of mutation.

  17. Insertional translocation leading to a 4q13 duplication including the EPHA5 gene in two siblings with attention-deficit hyperactivity disorder.

    PubMed

    Matoso, Eunice; Melo, Joana B; Ferreira, Susana I; Jardim, Ana; Castelo, Teresa M; Weise, Anja; Carreira, Isabel M

    2013-08-01

    An insertional translocation (IT) can result in pure segmental aneusomy for the inserted genomic segment allowing to define a more accurate clinical phenotype. Here, we report on two siblings sharing an unbalanced IT inherited from the mother with a history of learning difficulty. An 8-year-old girl with developmental delay, speech disability, and attention-deficit hyperactivity disorder (ADHD), showed by GTG banding analysis a subtle interstitial alteration in 21q21. Oligonucleotide array comparative genomic hybridization (array-CGH) analysis showed a 4q13.1-q13.3 duplication spanning 8.6 Mb. Fluorescence in situ hybridization (FISH) with bacterial artificial chromosome (BAC) clones confirmed the rearrangement, a der(21)ins(21;4)(q21;q13.1q13.3). The duplication described involves 50 RefSeq genes including the EPHA5 gene that encodes for the EphA5 receptor involved in embryonic development of the brain and also in synaptic remodeling and plasticity thought to underlie learning and memory. The same rearrangement was observed in a younger brother with behavioral problems and also exhibiting ADHD. ADHD is among the most heritable of neuropsychiatric disorders. There are few reports of patients with duplications involving the proximal region of 4q and a mild phenotype. To the best of our knowledge this is the first report of a duplication restricted to band 4q13. This abnormality could be easily missed in children who have nonspecific cognitive impairment. The presence of this behavioral disorder in the two siblings reinforces the hypothesis that the region involved could include genes involved in ADHD. Copyright © 2013 Wiley Periodicals, Inc.

  18. 6-mercaptopurine influences TPMT gene transcription in a TPMT gene promoter variable number of tandem repeats-dependent manner.

    PubMed

    Kotur, Nikola; Stankovic, Biljana; Kassela, Katerina; Georgitsi, Marianthi; Vicha, Anna; Leontari, Iliana; Dokmanovic, Lidija; Janic, Dragana; Krstovski, Nada; Klaassen, Kristel; Radmilovic, Milena; Stojiljkovic, Maja; Nikcevic, Gordana; Simeonidis, Argiris; Sivolapenko, Gregory; Pavlovic, Sonja; Patrinos, George P; Zukic, Branka

    2012-02-01

    TPMT activity is characterized by a trimodal distribution, namely low, intermediate and high methylator. TPMT gene promoter contains a variable number of GC-rich tandem repeats (VNTRs), namely A, B and C, ranging from three to nine repeats in length in an A(n)B(m)C architecture. We have previously shown that the VNTR architecture in the TPMT gene promoter affects TPMT gene transcription. MATERIALS, METHODS & RESULTS: Here we demonstrate, using reporter assays, that 6-mercaptopurine (6-MP) treatment results in a VNTR architecture-dependent decrease of TPMT gene transcription, mediated by the binding of newly recruited protein complexes to the TPMT gene promoter, upon 6-MP treatment. We also show that acute lymphoblastic leukemia patients undergoing 6-MP treatment display a VNTR architecture-dependent response to 6-MP. These data suggest that the TPMT gene promoter VNTR architecture can be potentially used as a pharmacogenomic marker to predict toxicity due to 6-MP treatment in acute lymphoblastic leukemia patients.

  19. Diversification of Genes Encoding Granule-Bound Starch Synthase in Monocots and Dicots Is Marked by Multiple Genome-Wide Duplication Events

    PubMed Central

    Qiu, Wen-Ming; Li, Jing; Zhou, Hui; Zhang, Qiong; Guo, Wenwu; Zhu, Tingting; Peng, Junhua; Sun, Fengjie; Li, Shaohua; Korban, Schuyler S.; Han, Yuepeng

    2012-01-01

    Starch is one of the major components of cereals, tubers, and fruits. Genes encoding granule-bound starch synthase (GBSS), which is responsible for amylose synthesis, have been extensively studied in cereals but little is known about them in fruits. Due to their low copy gene number, GBSS genes have been used to study plant phylogenetic and evolutionary relationships. In this study, GBSS genes have been isolated and characterized in three fruit trees, including apple, peach, and orange. Moreover, a comprehensive evolutionary study of GBSS genes has also been conducted between both monocots and eudicots. Results have revealed that genomic structures of GBSS genes in plants are conserved, suggesting they all have evolved from a common ancestor. In addition, the GBSS gene in an ancestral angiosperm must have undergone genome duplication ∼251 million years ago (MYA) to generate two families, GBSSI and GBSSII. Both GBSSI and GBSSII are found in monocots; however, GBSSI is absent in eudicots. The ancestral GBSSII must have undergone further divergence when monocots and eudicots split ∼165 MYA. This is consistent with expression profiles of GBSS genes, wherein these profiles are more similar to those of GBSSII in eudicots than to those of GBSSI genes in monocots. In dicots, GBSSII must have undergone further divergence when rosids and asterids split from each other ∼126 MYA. Taken together, these findings suggest that it is GBSSII rather than GBSSI of monocots that have orthologous relationships with GBSS genes of eudicots. Moreover, diversification of GBSS genes is mainly associated with genome-wide duplication events throughout the evolutionary course of history of monocots and eudicots. PMID:22291904

  20. Using Paleogenomics to Study the Evolution of Gene Families: Origin and Duplication History of the Relaxin Family Hormones and Their Receptors

    PubMed Central

    Yegorov, Sergey; Good, Sara

    2012-01-01

    Recent progress in the analysis of whole genome sequencing data has resulted in the emergence of paleogenomics, a field devoted to the reconstruction of ancestral genomes. Ancestral karyotype reconstructions have been used primarily to illustrate the dynamic nature of genome evolution. In this paper, we demonstrate how they can also be used to study individual gene families by examining the evolutionary history of relaxin hormones (RLN/INSL) and relaxin family peptide receptors (RXFP). Relaxin family hormones are members of the insulin superfamily, and are implicated in the regulation of a variety of primarily reproductive and neuroendocrine processes. Their receptors are G-protein coupled receptors (GPCR's) and include members of two distinct evolutionary groups, an unusual characteristic. Although several studies have tried to elucidate the origins of the relaxin peptide family, the evolutionary origin of their receptors and the mechanisms driving the diversification of the RLN/INSL-RXFP signaling systems in non-placental vertebrates has remained elusive. Here we show that the numerous vertebrate RLN/INSL and RXFP genes are products of an ancestral receptor-ligand system that originally consisted of three genes, two of which apparently trace their origins to invertebrates. Subsequently, diversification of the system was driven primarily by whole genome duplications (WGD, 2R and 3R) followed by almost complete retention of the ligand duplicates in most vertebrates but massive loss of receptor genes in tetrapods. Interestingly, the majority of 3R duplicates retained in teleosts are potentially involved in neuroendocrine regulation. Furthermore, we infer that the ancestral AncRxfp3/4 receptor may have been syntenically linked to the AncRln-like ligand in the pre-2R genome, and show that syntenic linkages among ligands and receptors have changed dynamically in different lineages. This study ultimately shows the broad utility, with some caveats, of incorporating

  1. The fate of the duplicated androgen receptor in fishes: a late neofunctionalization event?

    PubMed Central

    2008-01-01

    Background Based on the observation of an increased number of paralogous genes in teleost fishes compared with other vertebrates and on the conserved synteny between duplicated copies, it has been shown that a whole genome duplication (WGD) occurred during the evolution of Actinopterygian fish. Comparative phylogenetic dating of this duplication event suggests that it occurred early on, specifically in teleosts. It has been proposed that this event might have facilitated the evolutionary radiation and the phenotypic diversification of the teleost fish, notably by allowing the sub- or neo-functionalization of many duplicated genes. Results In this paper, we studied in a wide range of Actinopterygians the duplication and fate of the androgen receptor (AR, NR3C4), a nuclear receptor known to play a key role in sex-determination in vertebrates. The pattern of AR gene duplication is consistent with an early WGD event: it has been duplicated into two genes AR-A and AR-B after the split of the Acipenseriformes from the lineage leading to teleost fish but before the divergence of Osteoglossiformes. Genomic and syntenic analyses in addition to lack of PCR amplification show that one of the duplicated copies, AR-B, was lost in several basal Clupeocephala such as Cypriniformes (including the model species zebrafish), Siluriformes, Characiformes and Salmoniformes. Interestingly, we also found that, in basal teleost fish (Osteoglossiformes and Anguilliformes), the two copies remain very similar, whereas, specifically in Percomorphs, one of the copies, AR-B, has accumulated substitutions in both the ligand binding domain (LBD) and the DNA binding domain (DBD). Conclusion The comparison of the mutations present in these divergent AR-B with those known in human to be implicated in complete, partial or mild androgen insensitivity syndrome suggests that the existence of two distinct AR duplicates may be correlated to specific functional differences that may be connected to the well

  2. Xq28 duplication presenting with intestinal and bladder dysfunction and a distinctive facial appearance

    PubMed Central

    Clayton-Smith, Jill; Walters, Sarah; Hobson, Emma; Burkitt-Wright, Emma; Smith, Rupert; Toutain, Annick; Amiel, Jeanne; Lyonnet, Stanislas; Mansour, Sahar; Fitzpatrick, David; Ciccone, Roberto; Ricca, Ivana; Zuffardi, Orsetta; Donnai, Dian

    2009-01-01

    Xq28 duplications encompassing MECP2 have been described in male patients with a severe neurodevelopmental disorder associated with hypotonia and spasticity, severe learning disability and recurrent pneumonia. We identified an Xq28 duplication in three families where several male patients had presented with intestinal pseudo-obstruction or bladder distension. The affected boys had similar dysmorphic facial appearances. Subsequently, we ascertained seven further families where the proband presented with similar features. We demonstrated duplications of the Xq28 region in five of these additional families. In addition to MECP2, these duplications encompassed several other genes already known to be associated with diseases including SLC6A8, L1CAM and Filamin A (FLNA). The two remaining families were shown to have intragenic duplications of FLNA only. We discuss which elements of the Xq28 duplication phenotype may be associated with the various genes in the duplication. We propose that duplication of FLNA may contribute to the bowel and bladder phenotype seen in these seven families. PMID:18854860

  3. Genome-wide identification and characterization of five MyD88 duplication genes in Yesso scallop (Patinopecten yessoensis) and expression changes in response to bacterial challenge.

    PubMed

    Ning, Xianhui; Wang, Ruijia; Li, Xue; Wang, Shuyue; Zhang, Mengran; Xing, Qiang; Sun, Yan; Wang, Shi; Zhang, Lingling; Hu, Xiaoli; Bao, Zhenmin

    2015-10-01

    Myeloid differentiation factor 88 (MyD88) is a pivotal adaptor in the TLR/IL-1R signaling pathway, which plays an important role in activating the innate immune system. Although MyD88 genes have been identified in a variety of species, they have not been systematically characterized in scallops. In this study, five MyD88 genes were identified in Yesso scallop (Patinopecten yessoensis), PyMyD88-1, PyMyD88-2a, PyMyD88-2b, PyMyD88-3 and PyMyD88-4, which consisted of two pairs of tandem duplications located on the same chromosome. To our knowledge, this is the largest number of MyD88 genes found in an invertebrate. Phylogenetic and protein structural analyses were carried out to determine the identities and evolutionary relationships of these genes. PyMyD88s have highly conserved structures compared to MyD88 genes from other invertebrate species, except for PyMyD88-4, which contains only a DD domain, suggesting the evolutionarily conserved form of this particular gene member. We investigated the expression profiles of PyMyD88 genes at different developmental stages and in healthy adult tissues and hemocytes after Micrococcus luteus and Vibrio anguillarum infection using quantitative real-time PCR (qRT-PCR). The expression of most PyMyD88s was significantly induced in the acute phase (3-6 h) after infection with both gram-positive (M. luteus) and gram-negative (V. anguillarum) bacteria, with much more dramatic changes in PyMyD88 expression being observed after V. anguillarum challenge. Collectively, the abundance of MyD88s and their specific expression patterns provide insight into their versatile roles in the response of the bivalve innate immune system to gram-negative bacterial pathogens. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. High mature grain phytase activity in the Triticeae has evolved by duplication followed by neofunctionalization of the purple acid phosphatase phytase (PAPhy) gene

    PubMed Central

    Brinch-Pedersen, Henrik

    2013-01-01

    The phytase activity in food and feedstuffs is an important nutritional parameter. Members of the Triticeae tribe accumulate purple acid phosphatase phytases (PAPhy) during grain filling. This accumulation elevates mature grain phytase activities (MGPA) up to levels between ~650 FTU/kg for barley and 6000 FTU/kg for rye. This is notably more than other cereals. For instance, rice, maize, and oat have MGPAs below 100 FTU/kg. The cloning and characterization of the PAPhy gene complement from wheat, barley, rye, einkorn, and Aegilops tauschii is reported here. The Triticeae PAPhy genes generally consist of a set of paralogues, PAPhy_a and PAPhy_b, and have been mapped to Triticeae chromosomes 5 and 3, respectively. The promoters share a conserved core but the PAPhy_a promoter have acquired a novel cis-acting regulatory element for expression during grain filling while the PAPhy_b promoter has maintained the archaic function and drives expression during germination. Brachypodium is the only sequenced Poaceae sharing the PAPhy duplication. As for the Triticeae, the duplication is reflected in a high MGPA of ~4200 FTU/kg in Brachypodium. The sequence conservation of the paralogous loci on Brachypodium chromosomes 1 and 2 does not extend beyond the PAPhy gene. The results indicate that a single-gene segmental duplication may have enabled the evolution of high MGPA by creating functional redundancy of the parent PAPhy gene. This implies that similar MGPA levels may be out of reach in breeding programs for some Poaceae, e.g. maize and rice, whereas Triticeae breeders should focus on PAPhy_a. PMID:23918958

  5. History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis

    PubMed Central

    2014-01-01

    Background The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway. Results We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models. Conclusions Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages. PMID:25250171

  6. History of a prolific family: the Hes/Hey-related genes of the annelid Platynereis.

    PubMed

    Gazave, Eve; Guillou, Aurélien; Balavoine, Guillaume

    2014-01-01

    The Hes superfamily or Hes/Hey-related genes encompass a variety of metazoan-specific bHLH genes, with somewhat fuzzy phylogenetic relationships. Hes superfamily members are involved in a variety of major developmental mechanisms in metazoans, notably in neurogenesis and segmentation processes, in which they often act as direct effector genes of the Notch signaling pathway. We have investigated the molecular and functional evolution of the Hes superfamily in metazoans using the lophotrochozoan Platynereis dumerilii as model. Our phylogenetic analyses of more than 200 Metazoan Hes/Hey-related genes revealed the presence of five families, three of them (Hes, Hey and Helt) being pan-metazoan. Those families were likely composed of a unique representative in the last common metazoan ancestor. The evolution of the Hes family was shaped by many independent lineage specific tandem duplication events. The expression patterns of 13 of the 15 Hes/Hey-related genes in Platynereis indicate a broad functional diversification. Nevertheless, a majority of these genes are involved in two crucial developmental processes in annelids: neurogenesis and segmentation, resembling functions highlighted in other animal models. Combining phylogenetic and expression data, our study suggests an unusual evolutionary history for the Hes superfamily. An ancestral multifunctional annelid Hes gene may have undergone multiples rounds of duplication-degeneration-complementation processes in the lineage leading to Platynereis, each gene copies ensuring their maintenance in the genome by subfunctionalisation. Similar but independent waves of duplications are at the origin of the multiplicity of Hes genes in other metazoan lineages.

  7. Expansion of the receptor-like kinase/Pelle gene family and receptor-like proteins in Arabidopsis.

    PubMed

    Shiu, Shin Han; Bleecker, Anthony B

    2003-06-01

    Receptor-like kinases (RLKs) are a family of transmembrane proteins with versatile N-terminal extracellular domains and C-terminal intracellular kinases. They control a wide range of physiological responses in plants and belong to one of the largest gene families in the Arabidopsis genome with more than 600 members. Interestingly, this gene family constitutes 60% of all kinases in Arabidopsis and accounts for nearly all transmembrane kinases in Arabidopsis. Analysis of four fungal, six metazoan, and two Plasmodium sp. genomes indicates that the family was represented in all but fungal genomes, indicating an ancient origin for the family with a more recent expansion only in the plant lineages. The RLK/Pelle family can be divided into several subfamilies based on three independent criteria: the phylogeny based on kinase domain sequences, the extracellular domain identities, and intron locations and phases. A large number of receptor-like proteins (RLPs) resembling the extracellular domains of RLKs are also found in the Arabidopsis genome. However, not all RLK subfamilies have corresponding RLPs. Several RLK/Pelle subfamilies have undergone differential expansions. More than 33% of the RLK/Pelle members are found in tandem clusters, substantially higher than the genome average. In addition, 470 of the RLK/Pelle family members are located within the segmentally duplicated regions in the Arabidopsis genome and 268 of them have a close relative in the corresponding regions. Therefore, tandem duplications and segmental/whole-genome duplications represent two of the major mechanisms for the expansion of the RLK/Pelle family in Arabidopsis.

  8. Long-term disease-free survivors with cytogenetically normal acute myeloid leukemia and MLL partial tandem duplication: a Cancer and Leukemia Group B study

    PubMed Central

    Ruppert, Amy S.; Marcucci, Guido; Mrózek, Krzysztof; Paschka, Peter; Langer, Christian; Baldus, Claudia D.; Wen, Jing; Vukosavljevic, Tamara; Powell, Bayard L.; Carroll, Andrew J.; Kolitz, Jonathan E.; Larson, Richard A.; Caligiuri, Michael A.; Bloomfield, Clara D.

    2007-01-01

    The clinical impact of MLL partial tandem duplication (MLL-PTD) was evaluated in 238 adults aged 18 to 59 years with cytogenetically normal (CN) de novo acute myeloid leukemia (AML) who were treated intensively on similar Cancer and Leukemia Group B protocols 9621 and 19808. Twenty-four (10.1%) patients harbored an MLL-PTD. Of those, 92% achieved complete remission (CR) compared with 83% of patients without MLL-PTD (P = .39). Neither overall survival nor disease-free survival significantly differed between the 2 groups (P = .67 and P = .55, respectively). Thirteen MLL-PTD+ patients relapsed within 1.4 years of achieving CR. MLL-PTD+ patients who relapsed more often had other adverse CN-AML–associated molecular markers. In contrast with previously reported studies, 9 (41%) MLL-PTD+ patients continue in long-term first remission (CR1; range, 2.5-7.7 years). Intensive consolidation therapy that included autologous peripheral stem-cell transplantation during CR1 may have contributed to the better outcome of this historically poor-prognosis group of CN-AML patients with MLL-PTD. PMID:17341662

  9. Divergent evolution of part of the involucrin gene in the hominoids: Unique intragenic duplications in the gorilla and human

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Teumer, J.; Green, H.

    1989-02-01

    The gene for involucrin, an epidermal protein, has been remodeled in the higher primates. Most of the coding region of the human gene consists of a modern segment of repeats derived from a 10-codon sequence present in the ancestral segment of the gene. The modern segment can be divided into early, middle, and late regions. The authors report here the nucleotide sequence of three alleles of the gorilla involucrin gene. Each possesses a modern segment homologous to that of the human and consisting of 10-codon repeats. The early and middle regions are similar to the corresponding regions of the humanmore » allele and are nearly identical among the different gorilla alleles. The late region consists of recent duplications whose pattern is unique in each of the gorilla alleles and in the human allele. The early region is located in what is now the 3{prime} third of the modern segment, and the late, polymorphic region is located in what is now the 5{prime} third. Therefore, as the modern segment expanded during evolution, its 3{prime} end became stabilized, and continuing duplications became confined to its 5{prime} end. The expansion of the involucrin coding region, which began long before the separation of the gorilla and human, has continued in both species after their separation.« less

  10. Recent gene duplication and subfunctionalization produced a mitochondrial GrpE, the nucleotide exchange factor of the Hsp70 complex, specialized in thermotolerance to chronic heat stress in Arabidopsis.

    PubMed

    Hu, Catherine; Lin, Siou-ying; Chi, Wen-tzu; Charng, Yee-yung

    2012-02-01

    The duplication and divergence of heat stress (HS) response genes might help plants adapt to varied HS conditions, but little is known on the topic. Here, we examined the evolution and function of Arabidopsis (Arabidopsis thaliana) mitochondrial GrpE (Mge) proteins. GrpE acts as a nucleotide-exchange factor in the Hsp70/DnaK chaperone machinery. Genomic data show that AtMge1 and AtMge2 arose from a recent whole-genome duplication event. Phylogenetic analysis indicated that duplication and preservation of Mges occurred independently in many plant species, which suggests a common tendency in the evolution of the genes. Intron retention contributed to the divergence of the protein structure of Mge paralogs in higher plants. In both Arabidopsis and tomato (Solanum lycopersicum), Mge1 is induced by ultraviolet B light and Mge2 is induced by heat, which suggests regulatory divergence of the genes. Consistently, AtMge2 but not AtMge1 is under the control of HsfA1, the master regulator of the HS response. Heterologous expression of AtMge2 but not AtMge1 in the temperature-sensitive Escherichia coli grpE mutant restored its growth at 43°C. Arabidopsis T-DNA knockout lines under different HS regimes revealed that Mge2 is specifically required for tolerating prolonged exposure to moderately high temperature, as compared with the need of the heat shock protein 101 and the HS-associated 32-kD protein for short-term extreme heat. Therefore, with duplication and subfunctionalization, one copy of the Arabidopsis Mge genes became specialized in a distinct type of HS. We provide direct evidence supporting the connection between gene duplication and adaptation to environmental stress.

  11. Breakup of a homeobox cluster after genome duplication in teleosts

    PubMed Central

    Mulley, John F.; Chiu, Chi-hua; Holland, Peter W. H.

    2006-01-01

    Several families of homeobox genes are arranged in genomic clusters in metazoan genomes, including the Hox, ParaHox, NK, Rhox, and Iroquois gene clusters. The selective pressures responsible for maintenance of these gene clusters are poorly understood. The ParaHox gene cluster is evolutionarily conserved between amphioxus and human but is fragmented in teleost fishes. We show that two basal ray-finned fish, Polypterus and Amia, each possess an intact ParaHox cluster; this implies that the selective pressure maintaining clustering was lost after whole-genome duplication in teleosts. Cluster breakup is because of gene loss, not transposition or inversion, and the total number of ParaHox genes is the same in teleosts, human, mouse, and frog. We propose that this homeobox gene cluster is held together in chordates by the existence of interdigitated control regions that could be separated after locus duplication in the teleost fish. PMID:16801555

  12. Genome-Wide Distribution, Organisation and Functional Characterization of Disease Resistance and Defence Response Genes across Rice Species

    PubMed Central

    Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj

    2015-01-01

    The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R -genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyanth a genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species. PMID:25902056

  13. A dynamic history of gene duplications and losses characterizes the evolution of the SPARC family in eumetazoans.

    PubMed

    Bertrand, Stephanie; Fuentealba, Jaime; Aze, Antoine; Hudson, Clare; Yasuo, Hitoyoshi; Torrejon, Marcela; Escriva, Hector; Marcellini, Sylvain

    2013-04-22

    The vertebrates share the ability to produce a skeleton made of mineralized extracellular matrix. However, our understanding of the molecular changes that accompanied their emergence remains scarce. Here, we describe the evolutionary history of the SPARC (secreted protein acidic and rich in cysteine) family, because its vertebrate orthologues are expressed in cartilage, bones and teeth where they have been proposed to bind calcium and act as extracellular collagen chaperones, and because further duplications of specific SPARC members produced the small calcium-binding phosphoproteins (SCPP) family that is crucial for skeletal mineralization to occur. Both phylogeny and synteny conservation analyses reveal that, in the eumetazoan ancestor, a unique ancestral gene duplicated to give rise to SPARC and SPARCB described here for the first time. Independent losses have eliminated one of the two paralogues in cnidarians, protostomes and tetrapods. Hence, only non-tetrapod deuterostomes have conserved both genes. Remarkably, SPARC and SPARCB paralogues are still linked in the amphioxus genome. To shed light on the evolution of the SPARC family members in chordates, we performed a comprehensive analysis of their embryonic expression patterns in amphioxus, tunicates, teleosts, amphibians and mammals. Our results show that in the chordate lineage SPARC and SPARCB family members were recurrently recruited in a variety of unrelated tissues expressing collagen genes. We propose that one of the earliest steps of skeletal evolution involved the co-expression of SPARC paralogues with collagenous proteins.

  14. Spotting and validation of a genome wide oligonucleotide chip with duplicate measurement of each gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thomassen, Mads; Skov, Vibe; Eiriksdottir, Freyja

    2006-06-16

    The quality of DNA microarray based gene expression data relies on the reproducibility of several steps in a microarray experiment. We have developed a spotted genome wide microarray chip with oligonucleotides printed in duplicate in order to minimise undesirable biases, thereby optimising detection of true differential expression. The validation study design consisted of an assessment of the microarray chip performance using the MessageAmp and FairPlay labelling kits. Intraclass correlation coefficient (ICC) was used to demonstrate that MessageAmp was significantly more reproducible than FairPlay. Further examinations with MessageAmp revealed the applicability of the system. The linear range of the chips wasmore » three orders of magnitude, the precision was high, as 95% of measurements deviated less than 1.24-fold from the expected value, and the coefficient of variation for relative expression was 13.6%. Relative quantitation was more reproducible than absolute quantitation and substantial reduction of variance was attained with duplicate spotting. An analysis of variance (ANOVA) demonstrated no significant day-to-day variation.« less

  15. Multiple independent origins of mitochondrial control region duplications in the order Psittaciformes

    PubMed Central

    Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.; Eberhard, Jessica R.; Miyaki, Cristina Y.; Sanchez, Juan J.; Hernandez, Alexis; Müeller, Heinrich; Graves, Gary R.; Fleischer, Robert C.; Wright, Timothy F.

    2012-01-01

    Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified with increasing frequency, particularly in birds (Class Aves). In this study, we investigate the evolutionary history of mitochondrial control region states within the avian order Psittaciformes (parrots and cockatoos). To this aim, we reconstructed a comprehensive multi-locus phylogeny of parrots, used PCR of three diagnostic fragments to classify the mitochondrial control region state as single or duplicated, and mapped these states onto the phylogeny. We further sequenced 44 selected species to validate these inferences of control region state. Ancestral state reconstruction using a range of weighting schemes identified six independent origins of mitochondrial control region duplications within Psittaciformes. Analysis of sequence data showed that varying levels of mitochondrial gene and tRNA homology and degradation were present within a given clade exhibiting duplications. Levels of divergence between control regions within an individual varied from 0–10.9% with the differences occurring mainly between 51 and 225 nucleotides 3′ of the goose hairpin in domain I. Further investigations into the fates of duplicated mitochondrial genes, the potential costs and benefits of having a second control region, and the complex relationship between evolutionary rates, selection, and time since duplication are needed to fully explain these patterns in the mitochondrial genome. PMID:22543055

  16. Host plant-dependent phenotypic reversion of Ralstonia solanacearum from non-pathogenic to pathogenic forms via alterations in the phcA gene.

    PubMed

    Poussier, Stéphane; Thoquet, Philippe; Trigalet-Demery, Danièle; Barthet, Séverine; Meyer, Damien; Arlat, Matthieu; Trigalet, André

    2003-08-01

    Ralstonia solanacearum is a plant pathogenic bacterium that undergoes a spontaneous phenotypic conversion (PC) from a wild-type pathogenic to a non-pathogenic form. PC is often associated with mutations in phcA, which is a key virulence regulatory gene. Until now, reversion to the wild-type pathogenic form has not been observed for PC variants and the biological significance of PC has been questioned. In this study, we characterized various alterations in phcA (eight IS element insertions, three tandem duplications, seven deletions and a base substitution) in 19 PC mutants from the model strain GMI1000. In five of these variants, reversion to the pathogenic form was observed in planta, while no reversion was ever noticed in vitro whatever culture media used. However, reversion was observed for a 64 bp tandem duplication in vitro in the presence of tomato root exudate. This is the first report showing a complete cycle of phenotypic conversion/reversion in a plant pathogenic bacterium.

  17. The polyphenol oxidase gene family in land plants: Lineage-specific duplication and expansion

    PubMed Central

    2012-01-01

    Background Plant polyphenol oxidases (PPOs) are enzymes that typically use molecular oxygen to oxidize ortho-diphenols to ortho-quinones. These commonly cause browning reactions following tissue damage, and may be important in plant defense. Some PPOs function as hydroxylases or in cross-linking reactions, but in most plants their physiological roles are not known. To better understand the importance of PPOs in the plant kingdom, we surveyed PPO gene families in 25 sequenced genomes from chlorophytes, bryophytes, lycophytes, and flowering plants. The PPO genes were then analyzed in silico for gene structure, phylogenetic relationships, and targeting signals. Results Many previously uncharacterized PPO genes were uncovered. The moss, Physcomitrella patens, contained 13 PPO genes and Selaginella moellendorffii (spike moss) and Glycine max (soybean) each had 11 genes. Populus trichocarpa (poplar) contained a highly diversified gene family with 11 PPO genes, but several flowering plants had only a single PPO gene. By contrast, no PPO-like sequences were identified in several chlorophyte (green algae) genomes or Arabidopsis (A. lyrata and A. thaliana). We found that many PPOs contained one or two introns often near the 3’ terminus. Furthermore, N-terminal amino acid sequence analysis using ChloroP and TargetP 1.1 predicted that several putative PPOs are synthesized via the secretory pathway, a unique finding as most PPOs are predicted to be chloroplast proteins. Phylogenetic reconstruction of these sequences revealed that large PPO gene repertoires in some species are mostly a consequence of independent bursts of gene duplication, while the lineage leading to Arabidopsis must have lost all PPO genes. Conclusion Our survey identified PPOs in gene families of varying sizes in all land plants except in the genus Arabidopsis. While we found variation in intron numbers and positions, overall PPO gene structure is congruent with the phylogenetic relationships based on

  18. The cytochrome P450 2AA gene cluster in zebrafish (Danio rerio): Expression of CYP2AA1 and CYP2AA2 and response to phenobarbital-type inducers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kubota, Akira; Bainy, Afonso C.D.; Departamento de Bioquímica, CCB, Universidade Federal de Santa Catarina, Florianopolis, SC 88040-900

    2013-10-01

    The cytochrome P450 (CYP) 2 gene family is the largest and most diverse CYP gene family in vertebrates. In zebrafish, we have identified 10 genes in a new subfamily, CYP2AA, which does not show orthology to any human or other mammalian CYP genes. Here we report evolutionary and structural relationships of the 10 CYP2AA genes and expression of the first two genes, CYP2AA1 and CYP2AA2. Parsimony reconstruction of the tandem duplication pattern for the CYP2AA cluster suggests that CYP2AA1, CYP2AA2 and CYP2AA3 likely arose in the earlier duplication events and thus are most diverged in function from the other CYP2AAs.more » On the other hand, CYP2AA8 and CYP2AA9 are genes that arose in the latest duplication event, implying functional similarity between these two CYPs. A molecular model of CYP2AA1 showing the sequence conservation across the CYP2AA cluster reveals that the regions with the highest variability within the cluster map onto CYP2AA1 near the substrate access channels, suggesting differing substrate specificities. Zebrafish CYP2AA1 transcript was expressed predominantly in the intestine, while CYP2AA2 was most highly expressed in the kidney, suggesting differing roles in physiology. In the liver CYP2AA2 expression but not that of CYP2AA1, was increased by 1,4-bis [2-(3,5-dichloropyridyloxy)] benzene (TCPOBOP) and, to a lesser extent, by phenobarbital (PB). In contrast, pregnenolone 16α-carbonitrile (PCN) increased CYP2AA1 expression, but not CYP2AA2 in the liver. The results identify a CYP2 subfamily in zebrafish that includes genes apparently induced by PB-type chemicals and PXR agonists, the first concrete in vivo evidence for a PB-type response in fish. - Highlights: • A tandemly duplicated cluster of ten CYP2AA genes was described in zebrafish. • Parsimony and duplication analyses suggest pathways to CYP2AA diversity. • Homology models reveal amino acid positions possibly related to functional diversity. • The CYP2AA locus does not share

  19. Phylogenetic investigation of human FGFR-bearing paralogons favors piecemeal duplication theory of vertebrate genome evolution.

    PubMed

    Ajmal, Wajya; Khan, Hiba; Abbasi, Amir Ali

    2014-12-01

    Understanding the genetic mechanisms underlying the organismal complexity and origin of novelties during vertebrate history is one of the central goals of evolutionary biology. Ohno (1970) was the first to postulate that whole genome duplications (WGD) have played a vital role in the evolution of new gene functions: permitting an increase in morphological, physiological and anatomical complexity during early vertebrate history. Here, we analyze the evolutionary history of human FGFR-bearing paralogon (human autosome 4/5/8/10) by the phylogenetic analysis of multigene families with triplicate and quadruplicate distribution on these chromosomes. Our results categorized the histories of 21 families into discrete co-duplicated groups. Genes of a particular co-duplicated group exhibit identical evolutionary history and have duplicated in concert with each other, whereas genes belonging to different groups have dissimilar histories and have not duplicated concurrently. Taken together with our previously published data, we submit that there is sufficient empirical evidence to disprove the 1R/2R hypothesis and to support the general prediction that vertebrate genome evolved by relatively small-scale, regional duplication events that spread across the history of life. Copyright © 2014 Elsevier Inc. All rights reserved.

  20. The detection of large deletions or duplications in genomic DNA.

    PubMed

    Armour, J A L; Barton, D E; Cockburn, D J; Taylor, G R

    2002-11-01

    While methods for the detection of point mutations and small insertions or deletions in genomic DNA are well established, the detection of larger (>100 bp) genomic duplications or deletions can be more difficult. Most mutation scanning methods use PCR as a first step, but the subsequent analyses are usually qualitative rather than quantitative. Gene dosage methods based on PCR need to be quantitative (i.e., they should report molar quantities of starting material) or semi-quantitative (i.e., they should report gene dosage relative to an internal standard). Without some sort of quantitation, heterozygous deletions and duplications may be overlooked and therefore be under-ascertained. Gene dosage methods provide the additional benefit of reporting allele drop-out in the PCR. This could impact on SNP surveys, where large-scale genotyping may miss null alleles. Here we review recent developments in techniques for the detection of this type of mutation and compare their relative strengths and weaknesses. We emphasize that comprehensive mutation analysis should include scanning for large insertions and deletions and duplications. Copyright 2002 Wiley-Liss, Inc.

  1. Genetics Home Reference: MECP2 duplication syndrome

    MedlinePlus

    ... of autism spectrum disorders that affect communication and social interaction. Females with a MECP2 gene duplication tend to ... Accessibility FOIA Viewers & Players U.S. Department of Health & Human Services National Institutes of Health National Library of ...

  2. Duplication and Whorl-Specific Down-Regulation of the Obligate AP3-PI Heterodimer Genes Explain the Origin of Paeonia lactiflora Plants with Spontaneous Corolla Mutation.

    PubMed

    Gong, Pichang; Ao, Xiang; Liu, Gaixiu; Cheng, Fangyun; He, Chaoying

    2017-03-01

    Herbaceous peony (Paeonia lactiflora) is a globally important ornamental plant. Spontaneous floral mutations occur frequently during cultivation, and are selected as a way to release new cultivars, but the underlying evolutionary developmental genetics remain largely elusive. Here, we investigated a collection of spontaneous corolla mutational plants (SCMPs) whose other floral organs were virtually unaffected. Unlike the corolla in normal plants (NPs) that withered soon after fertilization, the transformed corolla (petals) in SCMPs was greenish and persistent similar to the calyx (sepals). Epidermal cellular morphology of the SCMP corolla was also similar to that of calyx cells, further suggesting a sepaloid corolla in SCMPs. Ten floral MADS-box genes from these Paeonia plants were comparatively characterized with respect to sequence and expression. Codogenic sequence variation of these MADS-box genes was not linked to corolla changes in SCMPs. However, we found that both APETALA3 (AP3) and PISTILLATA (PI) lineages of B-class MADS-box genes were duplicated, and subsequent selective expression alterations of these genes were closely associated with the origin of SCMPs. AP3-PI obligate heterodimerization, essential for organ identity of corolla and stamens, was robustly detected. However, selective down-regulation of these duplicated genes might result in a reduction of this obligate heterodimer concentration in a corolla-specific manner, leading to the sepaloid corolla in SCMPs, thus representing a new sepaloid corolla model taking advantage of gene duplication. Our work suggests that modifying floral MADS-box genes could facilitate the breeding of novel cultivars with distinct floral morphology in ornamental plants, and also provides new insights into the functional evolution of the MADS-box genes in plants. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please

  3. Norrie disease and MAO genes: nearest neighbors.

    PubMed

    Chen, Z Y; Denney, R M; Breakefield, X O

    1995-01-01

    The Norrie disease and MAO genes are tandemly arranged in the p11.4-p11.3 region of the human X chromosome in the order tel-MAOA-MAOB-NDP-cent. This relationship is conserved in the mouse in the order tel-MAOB-MAOA-NDP-cent. The MAO genes appear to have arisen by tandem duplication of an ancestral MAO gene, but their positional relationship to NDP appears to be random. Distinctive X-linked syndromes have been described for mutations in the MAOA and NDP genes, and in addition, individuals have been identified with contiguous gene syndromes due to chromosomal deletions which encompass two or three of these genes. Loss of function of the NDP gene causes a syndrome of congenital blindness and progressive hearing loss, sometimes accompanied by signs of CNS dysfunction, including variable mental retardation and psychiatric symptoms. Other mutations in the NDP gene have been found to underlie another X-linked eye disease, exudative vitreo-retinopathy. An MAOA deficiency state has been described in one family to date, with features of altered amine and amine metabolite levels, low normal intelligence, apparent difficulty in impulse control and cardiovascular difficulty in affected males. A contiguous gene syndrome in which all three genes are lacking, as well as other as yet unidentified flanking genes, results in severe mental retardation, small stature, seizures and congenital blindness, as well as altered amine and amine metabolites. Issues that remain to be resolved are the function of the NDP gene product, the frequency and phenotype of the MAOA deficiency state, and the possible occurrence and phenotype of an MAOB deficiency state.

  4. Case report of individual with cutaneous immunodeficiency and novel 1p36 duplication.

    PubMed

    Hatter, Alyn D; Soler, David C; Curtis, Christine; Cooper, Kevin D; McCormick, Thomas S

    2016-01-01

    Crusted or Norwegian scabies is an infectious skin dermatopathology usually associated with an underlying immunodeficiency condition. It is caused when the mite Sarcoptes scabiei infects the skin, and the immune system is unable to control its spread, leading to a massive hyperinfestation with a simultaneous inflammatory and hyperkeratotic reaction. This is the first report of a novel 1p36 duplication associated with a recurrent infection of crusted scabies. We describe a 34-year-old patient with a cutaneous immunodeficiency characterized by recurrent crusted scabies infestation, diffuse tinea, and recurrent staphylococcal cellulitis, who we suspected had an undiagnosed syndrome. The patient also suffered from mental retardation, renal failure, and premature senescence. A cytogenetic fluorescence in situ hybridization analysis revealed a 9.34 Mb duplication within the short (p) arm of chromosome 1, precisely from 1p36.11 to 1p36.21, with an adjacent 193 kb copy gain entirely within 1p36.11. In addition, chromosome 4 had a 906 kb gain in 4p16.1 and chromosome 9 had a 81 kb copy gain in 9p24.3. Over 100 genes localized within these duplicated regions. Gene expression array revealed 82 genes whose expression changed >1.5-fold compared to a healthy age-matched skin control, but among them only the lipolytic enzyme arylacetamide deacetylase-like 3 was found within the duplicated 1p36 region of chromosome 1. Although genetic duplications in the 1p36 region have been previously described, our report describes a novel duplicative variant within the 1p36 region. The patient did not have a past history of immunosuppression but was afflicted by a recurrent case of crusted scabies, raising the possibility that the recurrent infection was associated with the 1p36 genetic duplication. To our knowledge, the specific duplicated sequence between 1p36.11 and p36.21 found in our patient has never been previously reported. We reviewed and compared the clinical, genotyping, and gene

  5. Case report of individual with cutaneous immunodeficiency and novel 1p36 duplication

    PubMed Central

    Hatter, Alyn D; Soler, David C; Curtis, Christine; Cooper, Kevin D; McCormick, Thomas S

    2016-01-01

    Introduction Crusted or Norwegian scabies is an infectious skin dermatopathology usually associated with an underlying immunodeficiency condition. It is caused when the mite Sarcoptes scabiei infects the skin, and the immune system is unable to control its spread, leading to a massive hyperinfestation with a simultaneous inflammatory and hyperkeratotic reaction. This is the first report of a novel 1p36 duplication associated with a recurrent infection of crusted scabies. Case report We describe a 34-year-old patient with a cutaneous immunodeficiency characterized by recurrent crusted scabies infestation, diffuse tinea, and recurrent staphylococcal cellulitis, who we suspected had an undiagnosed syndrome. The patient also suffered from mental retardation, renal failure, and premature senescence. A cytogenetic fluorescence in situ hybridization analysis revealed a 9.34 Mb duplication within the short (p) arm of chromosome 1, precisely from 1p36.11 to 1p36.21, with an adjacent 193 kb copy gain entirely within 1p36.11. In addition, chromosome 4 had a 906 kb gain in 4p16.1 and chromosome 9 had a 81 kb copy gain in 9p24.3. Over 100 genes localized within these duplicated regions. Gene expression array revealed 82 genes whose expression changed >1.5-fold compared to a healthy age-matched skin control, but among them only the lipolytic enzyme arylacetamide deacetylase-like 3 was found within the duplicated 1p36 region of chromosome 1. Discussion Although genetic duplications in the 1p36 region have been previously described, our report describes a novel duplicative variant within the 1p36 region. The patient did not have a past history of immunosuppression but was afflicted by a recurrent case of crusted scabies, raising the possibility that the recurrent infection was associated with the 1p36 genetic duplication. Conclusion To our knowledge, the specific duplicated sequence between 1p36.11 and p36.21 found in our patient has never been previously reported. We reviewed and

  6. Characterization of the first intragenic SATB2 duplication in a girl with intellectual disability, nearly absent speech and suspected hypodontia.

    PubMed

    Kaiser, Ann-Sophie; Maas, Bianca; Wolff, Anna; Sutter, Christian; Janssen, Johannes W G; Hinderhofer, Katrin; Moog, Ute

    2015-05-01

    SATB2, a gene encoding a highly conserved DNA-binding protein, is known to have an important role in craniofacial and neuronal development. Only a few patients with SATB2 variants have been described so far. Recently, Döcker et al provided a summary of these patients and delineated the SAS (SATB2-associated syndrome). We here report on a girl with intellectual disability, nearly absent speech and suspected hypodontia who was shown to carry an intragenic SATB2 tandem duplication hypothesized to lead to haploinsufficiency of SATB2. Preliminary information on this patient had already been included in the article by Döcker et al. We want to give a detailed description of the patient's phenotype and genotype, providing further insight into the spectrum of the molecular mechanisms leading to SAS.

  7. MECP2 duplications in six patients with complex sex chromosome rearrangements

    PubMed Central

    Breman, Amy M; Ramocki, Melissa B; Kang, Sung-Hae L; Williams, Misti; Freedenberg, Debra; Patel, Ankita; Bader, Patricia I; Cheung, Sau Wai

    2011-01-01

    Duplications of the Xq28 chromosome region resulting in functional disomy are associated with a distinct clinical phenotype characterized by infantile hypotonia, severe developmental delay, progressive neurological impairment, absent speech, and proneness to infections. Increased expression of the dosage-sensitive MECP2 gene is considered responsible for the severe neurological impairments observed in affected individuals. Although cytogenetically visible duplications of Xq28 are well documented in the published literature, recent advances using array comparative genomic hybridization (CGH) led to the detection of an increasing number of microduplications spanning MECP2. In rare cases, duplication results from intrachromosomal rearrangement between the X and Y chromosomes. We report six cases with sex chromosome rearrangements involving duplication of MECP2. Cases 1–4 are unbalanced rearrangements between X and Y, resulting in MECP2 duplication. The additional Xq material was translocated to Yp in three cases (cases 1–3), and to the heterochromatic region of Yq12 in one case (case 4). Cases 5 and 6 were identified by array CGH to have a loss in copy number at Xp and a gain in copy number at Xq28 involving the MECP2 gene. In both cases, fluorescent in situ hybridization (FISH) analysis revealed a recombinant X chromosome containing the duplicated material from Xq28 on Xp, resulting from a maternal pericentric inversion. These cases add to a growing number of MECP2 duplications that have been detected by array CGH, while demonstrating the value of confirmatory chromosome and FISH studies for the localization of the duplicated material and the identification of complex rearrangements. PMID:21119712

  8. The duplication mutation of Quebec platelet disorder dysregulates PLAU, but not C10orf55, selectively increasing production of normal PLAU transcripts by megakaryocytes but not granulocytes.

    PubMed

    Hayward, Catherine P M; Liang, Minggao; Tasneem, Subia; Soomro, Asim; Waye, John S; Paterson, Andrew D; Rivard, Georges E; Wilson, Michael D

    2017-01-01

    Quebec Platelet disorder (QPD) is a unique bleeding disorder that markedly increases urokinase plasminogen activator (uPA) in megakaryocytes and platelets but not in plasma or urine. The cause is tandem duplication of a 78 kb region of chromosome 10 containing PLAU (the uPA gene) and C10orf55, a gene of unknown function. QPD increases uPA in platelets and megakaryocytes >100 fold, far more than expected for a gene duplication. To investigate the tissue-specific effect that PLAU duplication has on gene expression and transcript structure in QPD, we tested if QPD leads to: 1) overexpression of normal or unique PLAU transcripts; 2) increased uPA in leukocytes; 3) altered levels of C10orf55 mRNA and/or protein in megakaryocytes and leukocytes; and 4) global changes in megakaryocyte gene expression. Primary cells and cultured megakaryocytes from donors were prepared for quantitative reverse polymerase chain reaction analyses, RNA-seq and protein expression analyses. Rapidly isolated blood leukocytes from QPD subjects showed only a 3.9 fold increase in PLAU transcript levels, in keeping with the normal to minimally increased uPA in affinity purified, QPD leukocytes. All subjects had more uPA in granulocytes than monocytes and minimal uPA in lymphocytes. QPD leukocytes expressed PLAU alleles in proportions consistent with an extra copy of PLAU on the disease chromosome, unlike QPD megakaryocytes. QPD PLAU transcripts were consistent with reference gene models, with a much higher proportion of reads originating from the disease chromosome in megakaryocytes than granulocytes. QPD and control megakaryocytes contained minimal reads for C10orf55, and C10orf55 protein was not increased in QPD megakaryocytes or platelets. Finally, our QPD megakaryocyte transcriptome analysis revealed a global down regulation of the interferon type 1 pathway. We suggest that the low endogenous levels of uPA in blood are actively regulated, and that the regulatory mechanisms are disrupted in QPD

  9. The duplication mutation of Quebec platelet disorder dysregulates PLAU, but not C10orf55, selectively increasing production of normal PLAU transcripts by megakaryocytes but not granulocytes

    PubMed Central

    Soomro, Asim; Waye, John S.; Paterson, Andrew D.; Rivard, Georges E.; Wilson, Michael D.

    2017-01-01

    Quebec Platelet disorder (QPD) is a unique bleeding disorder that markedly increases urokinase plasminogen activator (uPA) in megakaryocytes and platelets but not in plasma or urine. The cause is tandem duplication of a 78 kb region of chromosome 10 containing PLAU (the uPA gene) and C10orf55, a gene of unknown function. QPD increases uPA in platelets and megakaryocytes >100 fold, far more than expected for a gene duplication. To investigate the tissue-specific effect that PLAU duplication has on gene expression and transcript structure in QPD, we tested if QPD leads to: 1) overexpression of normal or unique PLAU transcripts; 2) increased uPA in leukocytes; 3) altered levels of C10orf55 mRNA and/or protein in megakaryocytes and leukocytes; and 4) global changes in megakaryocyte gene expression. Primary cells and cultured megakaryocytes from donors were prepared for quantitative reverse polymerase chain reaction analyses, RNA-seq and protein expression analyses. Rapidly isolated blood leukocytes from QPD subjects showed only a 3.9 fold increase in PLAU transcript levels, in keeping with the normal to minimally increased uPA in affinity purified, QPD leukocytes. All subjects had more uPA in granulocytes than monocytes and minimal uPA in lymphocytes. QPD leukocytes expressed PLAU alleles in proportions consistent with an extra copy of PLAU on the disease chromosome, unlike QPD megakaryocytes. QPD PLAU transcripts were consistent with reference gene models, with a much higher proportion of reads originating from the disease chromosome in megakaryocytes than granulocytes. QPD and control megakaryocytes contained minimal reads for C10orf55, and C10orf55 protein was not increased in QPD megakaryocytes or platelets. Finally, our QPD megakaryocyte transcriptome analysis revealed a global down regulation of the interferon type 1 pathway. We suggest that the low endogenous levels of uPA in blood are actively regulated, and that the regulatory mechanisms are disrupted in QPD

  10. Myelodysplastic syndrome in an infant with constitutional pure duplication 1q41-qter.

    PubMed

    Morokawa, Hirokazu; Kamiya, Motoko; Wakui, Keiko; Kobayashi, Mikiko; Kurata, Takashi; Matsuda, Kazuyuki; Kawamura, Rie; Kanno, Hiroyuki; Fukushima, Yoshimitsu; Nakazawa, Yozo; Kosho, Tomoki

    2018-01-01

    We report on a Japanese female infant as the fourth patient with the constitutional pure duplication 1q41-qter confirmed by chromosomal microarray and as the first who developed myelodysplastic syndrome (MDS) among those with the constitutional 1q duplication. Common clinical features of the constitutional pure duplication 1q41-qter include developmental delay, craniofacial characteristics, foot malformation, hypertrichosis, and respiratory insufficiency. The association between MDS and the duplication of the genes in the 1q41-qter region remains unknown.

  11. Xp22.33p22.12 Duplication in a Patient with Intellectual Disability and Dysmorphic Facial Features

    PubMed Central

    Lintas, Carla; Picinelli, Chiara; Piras, Ignazio S.; Sacco, Roberto; Gabriele, Stefano; Verdecchia, Magda; Persico, Antonio M.

    2016-01-01

    A novel 19.98-Mb duplication in chromosome Xp22.33p22.12 was detected by array CGH in a 30-year-old man affected by intellectual disability, congenital hypotonia and dysmorphic features. The duplication encompasses more than 100 known genes. Many of these genes (such as neuroligin 4, cyclin-dependent kinase like 5, and others) have already correlated with X-linked intellectual disability and/or neurodevelopmental disorders. Due to the high number of potentially pathogenic genes involved in the reported duplication, we cannot correlate the clinical phenotype to a single gene. Indeed, we suggest that the resulting clinical phenotype may have arisen from the overexpression and consequent perturbation of fine gene dosage. PMID:26997944

  12. Xp22.33p22.12 Duplication in a Patient with Intellectual Disability and Dysmorphic Facial Features.

    PubMed

    Lintas, Carla; Picinelli, Chiara; Piras, Ignazio S; Sacco, Roberto; Gabriele, Stefano; Verdecchia, Magda; Persico, Antonio M

    2016-02-01

    A novel 19.98-Mb duplication in chromosome Xp22.33p22.12 was detected by array CGH in a 30-year-old man affected by intellectual disability, congenital hypotonia and dysmorphic features. The duplication encompasses more than 100 known genes. Many of these genes (such as neuroligin 4, cyclin-dependent kinase like 5, and others) have already correlated with X-linked intellectual disability and/or neurodevelopmental disorders. Due to the high number of potentially pathogenic genes involved in the reported duplication, we cannot correlate the clinical phenotype to a single gene. Indeed, we suggest that the resulting clinical phenotype may have arisen from the overexpression and consequent perturbation of fine gene dosage.

  13. The house spider genome reveals an ancient whole-genome duplication during arachnid evolution.

    PubMed

    Schwager, Evelyn E; Sharma, Prashant P; Clarke, Thomas; Leite, Daniel J; Wierschin, Torsten; Pechmann, Matthias; Akiyama-Oda, Yasuko; Esposito, Lauren; Bechsgaard, Jesper; Bilde, Trine; Buffry, Alexandra D; Chao, Hsu; Dinh, Huyen; Doddapaneni, HarshaVardhan; Dugan, Shannon; Eibner, Cornelius; Extavour, Cassandra G; Funch, Peter; Garb, Jessica; Gonzalez, Luis B; Gonzalez, Vanessa L; Griffiths-Jones, Sam; Han, Yi; Hayashi, Cheryl; Hilbrant, Maarten; Hughes, Daniel S T; Janssen, Ralf; Lee, Sandra L; Maeso, Ignacio; Murali, Shwetha C; Muzny, Donna M; Nunes da Fonseca, Rodrigo; Paese, Christian L B; Qu, Jiaxin; Ronshaugen, Matthew; Schomburg, Christoph; Schönauer, Anna; Stollewerk, Angelika; Torres-Oliva, Montserrat; Turetzek, Natascha; Vanthournout, Bram; Werren, John H; Wolff, Carsten; Worley, Kim C; Bucher, Gregor; Gibbs, Richard A; Coddington, Jonathan; Oda, Hiroki; Stanke, Mario; Ayoub, Nadia A; Prpic, Nikola-Michael; Flot, Jean-François; Posnien, Nico; Richards, Stephen; McGregor, Alistair P

    2017-07-31

    The duplication of genes can occur through various mechanisms and is thought to make a major contribution to the evolutionary diversification of organisms. There is increasing evidence for a large-scale duplication of genes in some chelicerate lineages including two rounds of whole genome duplication (WGD) in horseshoe crabs. To investigate this further, we sequenced and analyzed the genome of the common house spider Parasteatoda tepidariorum. We found pervasive duplication of both coding and non-coding genes in this spider, including two clusters of Hox genes. Analysis of synteny conservation across the P. tepidariorum genome suggests that there has been an ancient WGD in spiders. Comparison with the genomes of other chelicerates, including that of the newly sequenced bark scorpion Centruroides sculpturatus, suggests that this event occurred in the common ancestor of spiders and scorpions, and is probably independent of the WGDs in horseshoe crabs. Furthermore, characterization of the sequence and expression of the Hox paralogs in P. tepidariorum suggests that many have been subject to neo-functionalization and/or sub-functionalization since their duplication. Our results reveal that spiders and scorpions are likely the descendants of a polyploid ancestor that lived more than 450 MYA. Given the extensive morphological diversity and ecological adaptations found among these animals, rivaling those of vertebrates, our study of the ancient WGD event in Arachnopulmonata provides a new comparative platform to explore common and divergent evolutionary outcomes of polyploidization events across eukaryotes.

  14. Molecular Characterization of Soybean Pterocarpan 2-Dimethylallyltransferase in Glyceollin Biosynthesis: Local Gene and Whole-Genome Duplications of Prenyltransferase Genes Led to the Structural Diversity of Soybean Prenylated Isoflavonoids

    PubMed Central

    Yoneyama, Keisuke; Akashi, Tomoyoshi; Aoki, Toshio

    2016-01-01

    Soybean (Glycine max) accumulates several prenylated isoflavonoid phytoalexins, collectively referred to as glyceollins. Glyceollins (I, II, III, IV and V) possess modified pterocarpan skeletons with C5 moieties from dimethylallyl diphosphate, and they are commonly produced from (6aS, 11aS)-3,9,6a-trihydroxypterocarpan [(−)-glycinol]. The metabolic fate of (−)-glycinol is determined by the enzymatic introduction of a dimethylallyl group into C-4 or C-2, which is reportedly catalyzed by regiospecific prenyltransferases (PTs). 4-Dimethylallyl (−)-glycinol and 2-dimethylallyl (−)-glycinol are precursors of glyceollin I and other glyceollins, respectively. Although multiple genes encoding (−)-glycinol biosynthetic enzymes have been identified, those involved in the later steps of glyceollin formation mostly remain unidentified, except for (−)-glycinol 4-dimethylallyltransferase (G4DT), which is involved in glyceollin I biosynthesis. In this study, we identified four genes that encode isoflavonoid PTs, including (−)-glycinol 2-dimethylallyltransferase (G2DT), using homology-based in silico screening and biochemical characterization in yeast expression systems. Transcript analyses illustrated that changes in G2DT gene expression were correlated with the induction of glyceollins II, III, IV and V in elicitor-treated soybean cells and leaves, suggesting its involvement in glyceollin biosynthesis. Moreover, the genomic signatures of these PT genes revealed that G4DT and G2DT are paralogs derived from whole-genome duplications of the soybean genome, whereas other PT genes [isoflavone dimethylallyltransferase 1 (IDT1) and IDT2] were derived via local gene duplication on soybean chromosome 11. PMID:27986914

  15. Genome-wide analysis of the WRKY gene family in cotton.

    PubMed

    Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

    2014-12-01

    WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.

  16. A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers

    PubMed Central

    2012-01-01

    Background Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. Results To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Conclusions Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants. PMID:23102090

  17. A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers.

    PubMed

    Pavy, Nathalie; Pelgas, Betty; Laroche, Jérôme; Rigault, Philippe; Isabel, Nathalie; Bousquet, Jean

    2012-10-26

    Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants.

  18. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus

    PubMed Central

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions

  19. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus.

    PubMed

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions

  20. Expansion of signal transduction pathways in fungi by extensive genome duplication

    PubMed Central

    Corrochano, Luis M.; Kuo, Alan; Marcet-Houben, Marina; Polaino, Silvia; Salamov, Asaf; Villalobos-Escobedo, José M.; Grimwood, Jane; Álvarez, M. Isabel; Avalos, Javier; Bauer, Diane; Benito, Ernesto P.; Benoit, Isabelle; Burger, Gertraud; Camino, Lola P.; Cánovas, David; Cerdá-Olmedo, Enrique; Cheng, Jan-Fang; Domínguez, Angel; Eliáš, Marek; Eslava, Arturo P.; Glaser, Fabian; Gutiérrez, Gabriel; Heitman, Joseph; Henrissat, Bernard; Iturriaga, Enrique A.; Lang, B. Franz; Lavín, José L.; Lee, Soo Chan; Li, Wenjun; Lindquist, Erika; López-García, Sergio; Luque, Eva M.; Marcos, Ana T.; Martin, Joel; McCluskey, Kevin; Medina, Humberto R.; Miralles-Durán, Alejandro; Miyazaki, Atsushi; Muñoz-Torres, Elisa; Oguiza, José A.; Ohm, Robin A.; Orejas, Margarita; Ortiz-Castellanos, Lucila; Pisabarro, Antonio G.; Rodríguez-Romero, Julio; Ruiz-Herrera, José; Ruiz-Vázquez, Rosa; Sanz, Catalina; Schackwitz, Wendy; Shahriari, Mahdi; Shelest, Ekaterina; Silva-Franco, Fátima; Soanes, Darren; Syed, Khajamohiddin; Tagua, Víctor G.; Talbot, Nicholas J.; Thon, Michael R.; Tice, Hope; de Vries, Ronald P.; Wiebenga, Ad; Yadav, Jagjit S.; Braun, Edward L.; Baker, Scott E.; Garre, Victoriano; Schmutz, Jeremy; Horwitz, Benjamin A.; Torres-Martínez, Santiago; Idnurm, Alexander; Herrera-Estrella, Alfredo; Gabaldón, Toni; Grigoriev, Igor V.

    2016-01-01

    Summary Plants and fungi use light and other signals to regulate development, growth, and metabolism. The fruiting bodies of the fungus Phycomyces blakesleeanus are single cells that react to environmental cues, including light, but the mechanisms are largely unknown [1]. The related fungus Mucor circinelloides is an opportunistic human pathogen that changes its mode of growth upon receipt of signals from the environment to facilitate pathogenesis [2]. Understanding how these organisms respond to environmental cues should provide insights into the mechanisms of sensory perception and signal transduction by a single eukaryotic cell, and their role in pathogenesis. We sequenced the genomes of P. blakesleeanus and M. circinelloides, and show that they have been shaped by an extensive genome duplication or, most likely, a whole genome duplication (WGD), which is rarely observed in fungi [3-6]. We show that the genome duplication has expanded gene families, including those involved in signal transduction, and that duplicated genes have specialized, as evidenced by differences in their regulation by light. The transcriptional response to light varies with the developmental stage and is still observed in a photoreceptor mutant of P. blakesleeanus. A phototropic mutant of P. blakesleeanus with a heterozygous mutation in the photoreceptor gene madA demonstrates that photosensor dosage is important for the magnitude of signal transduction. We conclude that the genome duplication provided the means to improve signal transduction for enhanced perception of environmental signals. Our results will help to understand the role of genome dynamics in the evolution of sensory perception in eukaryotes. PMID:27238284

  1. Duplication in the Microtubule-Actin Cross-linking Factor 1 gene causes a novel neuromuscular condition

    PubMed Central

    Jørgensen, Louise H.; Mosbech, Mai-Britt; Færgeman, Nils J.; Graakjaer, Jesper; Jacobsen, Søren V.; Schrøder, Henrik D.

    2014-01-01

    Spectrins and plakins are important communicators linking cytoskeletal components to each other and to cellular junctions. Microtubule-actin cross-linking factor 1 (MACF1) belongs to the spectraplakin family and is involved in control of microtubule dynamics. Complete knock out of MACF1 in mice is associated with developmental retardation and embryonic lethality. Here we present a family with a novel neuromuscular condition. Genetic analyses show a heterozygous duplication resulting in reduced MACF1 gene product. The functional consequence is affected motility observed as periodic hypotonia, lax muscles and diminished motor skills, with heterogeneous presentation among the affected family members. To corroborate these findings we used RNA interference to knock down the VAB-10 locus containing the MACF1 homologue in C. elegans, and we could show that this also causes movement disturbances. These findings suggest that changes in the MACF1 gene is implicated in this neuromuscular condition, which is an important observation since MACF1 has not previously been associated with any human disease and thus presents a key to understanding the essential nature of this gene. PMID:24899269

  2. Duplication in the microtubule-actin cross-linking factor 1 gene causes a novel neuromuscular condition.

    PubMed

    Jørgensen, Louise H; Mosbech, Mai-Britt; Færgeman, Nils J; Graakjaer, Jesper; Jacobsen, Søren V; Schrøder, Henrik D

    2014-06-05

    Spectrins and plakins are important communicators linking cytoskeletal components to each other and to cellular junctions. Microtubule-actin cross-linking factor 1 (MACF1) belongs to the spectraplakin family and is involved in control of microtubule dynamics. Complete knock out of MACF1 in mice is associated with developmental retardation and embryonic lethality. Here we present a family with a novel neuromuscular condition. Genetic analyses show a heterozygous duplication resulting in reduced MACF1 gene product. The functional consequence is affected motility observed as periodic hypotonia, lax muscles and diminished motor skills, with heterogeneous presentation among the affected family members. To corroborate these findings we used RNA interference to knock down the VAB-10 locus containing the MACF1 homologue in C. elegans, and we could show that this also causes movement disturbances. These findings suggest that changes in the MACF1 gene is implicated in this neuromuscular condition, which is an important observation since MACF1 has not previously been associated with any human disease and thus presents a key to understanding the essential nature of this gene.

  3. Evolutionary Patterns of RNA-Based Duplication in Non-Mammalian Chordates

    PubMed Central

    Li, Xin; Vibranovski, Maria D.; Gan, Xiaoni; Wang, Dengqiang; Wang, Wen; Long, Manyuan; He, Shunping

    2011-01-01

    The role of RNA-based duplication, or retroposition, in the evolution of new gene functions in mammals, plants, and Drosophila has been widely reported. However, little is known about RNA-based duplication in non-mammalian chordates. In this study, we screened ten non-mammalian chordate genomes for retrocopies and investigated their evolutionary patterns. We identified numerous retrocopies in these species. Examination of the age distribution of these retrocopies revealed no burst of young retrocopies in ancient chordate species. Upon comparing these non-mammalian chordate species to the mammalian species, we observed that a larger fraction of the non-mammalian retrocopies was under strong evolutionary constraints than mammalian retrocopies are, as evidenced by signals of purifying selection and expression profiles. For the Western clawed frog, Medaka, and Sea squirt, many retrogenes have evolved gonad and brain expression patterns, similar to what was observed in human. Testing of retrogene movement in the Medaka genome, where the nascent sex chrosomes have been well assembled, did not reveal any significant gene movement. Taken together, our analyses demonstrate that RNA-based duplication generates many functional genes and can make a significant contribution to the evolution of non-mammalian genomes. PMID:21779328

  4. Co-Circulation of 72bp Duplication Group A and 60bp Duplication Group B Respiratory Syncytial Virus (RSV) Strains in Riyadh, Saudi Arabia during 2014.

    PubMed

    Ahmed, Anwar; Haider, Shakir H; Parveen, Shama; Arshad, Mohammed; Alsenaidy, Hytham A; Baaboud, Alawi Omar; Mobaireek, Khalid Fahad; AlSaadi, Muslim Mohammed; Alsenaidy, Abdulrahman M; Sullender, Wayne

    2016-01-01

    Respiratory syncytial virus (RSV) is an important viral pathogen of acute respiratory tract infection (ARI). Limited data are available on molecular epidemiology of RSV from Saudi Arabia. A total of 130 nasopharyngeal aspirates were collected from children less than 5 years of age with ARI symptoms attending the Emergency Department at King Khalid University Hospital and King Fahad Medical City, Riyadh, Saudi Arabia between October and December, 2014. RSV was identified in the 26% of the hospitalized children by reverse transcriptase PCR. Group A RSV (77%) predominated during the study as compared to group B RSV (23%). The phylogenetic analysis of 28 study strains clustered group A RSV in NA1 and ON1 genotypes and group B viruses in BA (BA9) genotype. Interestingly, 26% of the positive samples clustered in genotypes with duplication in the G protein gene (ON1 for group A and BA for group B). Both the genotypes showed enhanced O-linked glycosylation in the duplicated region, with 10 and 2 additional sites in ON1 and BA respectively. Selection pressure analysis revealed purifying selection in both the ON1 and BA genotypes. One codon each in the ON1 (position 274) and BA genotypes (position 219) were positively selected and had high entropy values indicating variations at these amino acid positions. This is the first report describing the presence of ON1 genotype and the first report on co-circulation of two different genotypes of RSV with duplication in the G protein gene from Saudi Arabia. The clinical implications of the simultaneous occurrence of genotypes with duplication in G protein gene in a given population especially in the concurrent infections should be investigated in future. Further, the ongoing surveillance of RSV in this region will reveal the evolutionary trajectory of these two genotypes with duplication in G protein gene from largest country in the Middle East.

  5. Co-Circulation of 72bp Duplication Group A and 60bp Duplication Group B Respiratory Syncytial Virus (RSV) Strains in Riyadh, Saudi Arabia during 2014

    PubMed Central

    Ahmed, Anwar; Haider, Shakir H.; Parveen, Shama; Arshad, Mohammed; Alsenaidy, Hytham A.; Baaboud, Alawi Omar; Mobaireek, Khalid Fahad; AlSaadi, Muslim Mohammed; Alsenaidy, Abdulrahman M.; Sullender, Wayne

    2016-01-01

    Respiratory syncytial virus (RSV) is an important viral pathogen of acute respiratory tract infection (ARI). Limited data are available on molecular epidemiology of RSV from Saudi Arabia. A total of 130 nasopharyngeal aspirates were collected from children less than 5 years of age with ARI symptoms attending the Emergency Department at King Khalid University Hospital and King Fahad Medical City, Riyadh, Saudi Arabia between October and December, 2014. RSV was identified in the 26% of the hospitalized children by reverse transcriptase PCR. Group A RSV (77%) predominated during the study as compared to group B RSV (23%). The phylogenetic analysis of 28 study strains clustered group A RSV in NA1 and ON1 genotypes and group B viruses in BA (BA9) genotype. Interestingly, 26% of the positive samples clustered in genotypes with duplication in the G protein gene (ON1 for group A and BA for group B). Both the genotypes showed enhanced O-linked glycosylation in the duplicated region, with 10 and 2 additional sites in ON1 and BA respectively. Selection pressure analysis revealed purifying selection in both the ON1 and BA genotypes. One codon each in the ON1 (position 274) and BA genotypes (position 219) were positively selected and had high entropy values indicating variations at these amino acid positions. This is the first report describing the presence of ON1 genotype and the first report on co-circulation of two different genotypes of RSV with duplication in the G protein gene from Saudi Arabia. The clinical implications of the simultaneous occurrence of genotypes with duplication in G protein gene in a given population especially in the concurrent infections should be investigated in future. Further, the ongoing surveillance of RSV in this region will reveal the evolutionary trajectory of these two genotypes with duplication in G protein gene from largest country in the Middle East. PMID:27835664

  6. Duplication in CHIT1 gene and the risk for Aspergillus lung disease in CF patients.

    PubMed

    Livnat, Galit; Bar-Yoseph, Ronen; Mory, Adi; Dagan, Efrat; Elias, Nael; Gershoni, Ruth; Bentur, Lea

    2014-01-01

    Aspergillus often persists in the respiratory tract of patients with Cystic Fibrosis (CF) and may cause allergic broncho-pulmonary aspergillosis (ABPA). Chitinases are enzymes that digest the chitin polymer. Plants use chitinase as a defense mechanism against fungi. Chitotriosidase (CHIT1) is the major chitinase in human airways. Variation in the coding region with 24-bp duplication allele results in reduced CHIT1 activity. Recently, CHIT1 duplication heterozygocity was found in 6/6 patients with severe asthma and fungal sensitization (SAFS). Our aim was to evaluate the link between CHIT1 duplication in CF patients and the predisposition to Allergic broncho-pulmonary mycosis (ABPM) or persistent Aspergillus positive sputum (APS). CHIT1 duplication was assessed in three CF groups. Group 1: patients who had neither ABPM nor APS in the past (control group). Group 2: patients with persistent APS (≥2/year), without ABPA. Group 3: patients with current or past ABPM. Forty patients with CF were included in the analysis, CHIT1 duplication heterozygocity was found in 3/6 (50%) of the patients in the ABPM group, 3/12 (25%) in the APS group, and 7/22 (31.8%) in the control group (P > 0.05). Eleven patients carried W1282X mutation, 90.9% were negative for CHIT1 duplication, five of them were homozygous for W1282X; none of them had CHIT1 duplication or ABPM. CHIT1 duplication is not found in all CF patients with ABPM in contrast to patients with SAFS. These results suggest that CHIT1 duplication cannot be the sole explanation for Aspergillus positive sputum in CF patients. © 2013 Wiley Periodicals, Inc.

  7. Expansion and stress responses of the AP2/EREBP superfamily in cotton.

    PubMed

    Liu, Chunxiao; Zhang, Tianzhen

    2017-01-31

    The allotetraploid cotton originated from one hybridization event between an extant progenitor of Gosssypium herbaceum (A 1 ) or G. arboreum (A 2 ) and another progenitor, G. raimondii Ulbrich (D 5 ) 1-1.5 million years ago (Mya). The APETALA2/ethylene-responsive element binding protein (AP2/EREBP) transcription factors constitute one of the largest and most conserved gene families in plants. They are characterized by their AP2 domain, which comprises 60-70 amino acids, and are classified into four main subfamilies: the APETALA2 (AP2), Related to ABI3/VP1 (RAV), Dehydration-Responsive Element Binding protein (DREB) and Ethylene-Responsive Factor (ERF) subfamilies. The AP2/EREBP genes play crucial roles in plant growth, development and biotic and abiotic stress responses. Hence, understanding the molecular characteristics of cotton stress tolerance and gene family expansion would undoubtedly facilitate cotton resistance breeding and evolution research. A total of 269 AP2/EREBP genes were identified in the G. raimondii (D5) cotton genome. The protein domain architecture and intron/exon structure are simple and relatively conserved within each subfamily. They are distributed throughout all chromosomes but are clustered on various chromosomes due to genomic tandem duplication. We identified 73 tandem duplicated genes and 221 segmental duplicated gene pairs which contributed to the expansion of AP2/EREBP superfamily. Of them, tandem duplication was the most important force of the expansion of the B3 group. Transcriptome analysis showed that 504 AP2/EREBP genes were expressed in at least one tested G. hirsutum TM-1 tissues. In G. hirsutum, 151 non-repeated genes of the DREB and ERF subfamily genes were responsive to different stresses: 132 genes were induced by cold, 63 genes by drought and 94 genes by heat. qRT-PCR confirmed that 13 GhDREB and 15 GhERF genes were induced by cold and/or drought. No transcripts detected for 53 of the 111 tandem duplicated genes in TM-1

  8. A Synergism between Adaptive Effects and Evolvability Drives Whole Genome Duplication to Fixation

    PubMed Central

    Cuypers, Thomas D.; Hogeweg, Paulien

    2014-01-01

    Whole genome duplication has shaped eukaryotic evolutionary history and has been associated with drastic environmental change and species radiation. While the most common fate of WGD duplicates is a return to single copy, retained duplicates have been found enriched for highly interacting genes. This pattern has been explained by a neutral process of subfunctionalization and more recently, dosage balance selection. However, much about the relationship between environmental change, WGD and adaptation remains unknown. Here, we study the duplicate retention pattern postWGD, by letting virtual cells adapt to environmental changes. The virtual cells have structured genomes that encode a regulatory network and simple metabolism. Populations are under selection for homeostasis and evolve by point mutations, small indels and WGD. After populations had initially adapted fully to fluctuating resource conditions re-adaptation to a broad range of novel environments was studied by tracking mutations in the line of descent. WGD was established in a minority (≈30%) of lineages, yet, these were significantly more successful at re-adaptation. Unexpectedly, WGD lineages conserved more seemingly redundant genes, yet had higher per gene mutation rates. While WGD duplicates of all functional classes were significantly over-retained compared to a model of neutral losses, duplicate retention was clearly biased towards highly connected TFs. Importantly, no subfunctionalization occurred in conserved pairs, strongly suggesting that dosage balance shaped retention. Meanwhile, singles diverged significantly. WGD, therefore, is a powerful mechanism to cope with environmental change, allowing conservation of a core machinery, while adapting the peripheral network to accommodate change. PMID:24743268

  9. A synergism between adaptive effects and evolvability drives whole genome duplication to fixation.

    PubMed

    Cuypers, Thomas D; Hogeweg, Paulien

    2014-04-01

    Whole genome duplication has shaped eukaryotic evolutionary history and has been associated with drastic environmental change and species radiation. While the most common fate of WGD duplicates is a return to single copy, retained duplicates have been found enriched for highly interacting genes. This pattern has been explained by a neutral process of subfunctionalization and more recently, dosage balance selection. However, much about the relationship between environmental change, WGD and adaptation remains unknown. Here, we study the duplicate retention pattern postWGD, by letting virtual cells adapt to environmental changes. The virtual cells have structured genomes that encode a regulatory network and simple metabolism. Populations are under selection for homeostasis and evolve by point mutations, small indels and WGD. After populations had initially adapted fully to fluctuating resource conditions re-adaptation to a broad range of novel environments was studied by tracking mutations in the line of descent. WGD was established in a minority (≈30%) of lineages, yet, these were significantly more successful at re-adaptation. Unexpectedly, WGD lineages conserved more seemingly redundant genes, yet had higher per gene mutation rates. While WGD duplicates of all functional classes were significantly over-retained compared to a model of neutral losses, duplicate retention was clearly biased towards highly connected TFs. Importantly, no subfunctionalization occurred in conserved pairs, strongly suggesting that dosage balance shaped retention. Meanwhile, singles diverged significantly. WGD, therefore, is a powerful mechanism to cope with environmental change, allowing conservation of a core machinery, while adapting the peripheral network to accommodate change.

  10. Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta).

    PubMed

    Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan

    2016-07-01

    The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks ) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  11. Duplication of 20p12.3 associated with familial Wolff-Parkinson-White syndrome.

    PubMed

    Mills, Kimberly I; Anderson, Jacqueline; Levy, Philip T; Cole, F Sessions; Silva, Jennifer N A; Kulkarni, Shashikant; Shinawi, Marwan

    2013-01-01

    Wolff-Parkinson-White (WPW) syndrome is caused by preexcitation of the ventricular myocardium via an accessory pathway which increases the risk for paroxysmal supraventricular tachycardia. The condition is often sporadic and of unknown etiology in the majority of cases. Autosomal dominant inheritance and association with congenital heart defects or ventricular hypertrophy were described. Microdeletions of 20p12.3 have been associated with WPW syndrome with either cognitive dysfunction or Alagille syndrome. Here, we describe the association of 20p12.3 duplication with WPW syndrome in a patient who presented with non-immune hydrops. Her paternal uncle carries the duplication and has attention-deficit hyperactivity disorder and electrocardiographic findings consistent with WPW. The 769 kb duplication was detected by the Affymetrix Whole Genome-Human SNP Array 6.0 and encompasses two genes and the first two exons of a third gene. We discuss the potential role of the genes in the duplicated region in the pathogenesis of WPW and possible neurobehavioral abnormalities. Our data provide additional support for a significant role of 20p12.3 chromosomal rearrangements in the etiology of WPW syndrome. Copyright © 2012 Wiley Periodicals, Inc.

  12. Age distribution patterns of human gene families: divergent for Gene Ontology categories and concordant between different subcellular localizations.

    PubMed

    Liu, Gangbiao; Zou, Yangyun; Cheng, Qiqun; Zeng, Yanwu; Gu, Xun; Su, Zhixi

    2014-04-01

    The age distribution of gene duplication events within the human genome exhibits two waves of duplications along with an ancient component. However, because of functional constraint differences, genes in different functional categories might show dissimilar retention patterns after duplication. It is known that genes in some functional categories are highly duplicated in the early stage of vertebrate evolution. However, the correlations of the age distribution pattern of gene duplication between the different functional categories are still unknown. To investigate this issue, we developed a robust pipeline to date the gene duplication events in the human genome. We successfully estimated about three-quarters of the duplication events within the human genome, along with the age distribution pattern in each Gene Ontology (GO) slim category. We found that some GO slim categories show different distribution patterns when compared to the whole genome. Further hierarchical clustering of the GO slim functional categories enabled grouping into two main clusters. We found that human genes located in the duplicated copy number variant regions, whose duplicate genes have not been fixed in the human population, were mainly enriched in the groups with a high proportion of recently duplicated genes. Moreover, we used a phylogenetic tree-based method to date the age of duplications in three signaling-related gene superfamilies: transcription factors, protein kinases and G-protein coupled receptors. These superfamilies were expressed in different subcellular localizations. They showed a similar age distribution as the signaling-related GO slim categories. We also compared the differences between the age distributions of gene duplications in multiple subcellular localizations. We found that the distribution patterns of the major subcellular localizations were similar to that of the whole genome. This study revealed the whole picture of the evolution patterns of gene functional

  13. A genome-wide survey of homeodomain-leucine zipper genes and analysis of cold-responsive HD-Zip I members' expression in tomato.

    PubMed

    Zhang, Zhenzhu; Chen, Xiuling; Guan, Xin; Liu, Yang; Chen, Hongyu; Wang, Tingting; Mouekouba, Liana Dalcantara Ongouya; Li, Jingfu; Wang, Aoxue

    2014-01-01

    Homeodomain-leucine zipper (HD-Zip) proteins are a kind of transcriptional factors that play a vital role in plant growth and development. However, no detailed information of HD-Zip family in tomato has been reported till now. In this study, 51 HD-Zip genes (SlHZ01-51) in this family were identified and categorized into 4 classes by exon-intron and protein structure in tomato (Solanum lycopersicum) genome. The synthetical phylogenetic tree of tomato, Arabidopsis and rice HD-Zip genes were established for an insight into their evolutionary relationships and putative functions. The results showed that the contribution of segmental duplication was larger than that of tandem duplication for expansion and evolution of genes in this family of tomato. The expression profile results under abiotic stress suggested that all SlHZ I genes were responsive to cold stress. This study will provide a clue for the further investigation of functional identification and the role of tomato HD-Zip I subfamily in plant cold stress responses and developmental events.

  14. Bivalve-specific gene expansion in the pearl oyster genome: implications of adaptation to a sessile lifestyle.

    PubMed

    Takeuchi, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Kanda, Miyuki; Hisata, Kanako; Fujie, Manabu; Goto, Hiroki; Yamasaki, Shinichi; Nagai, Kiyohito; Morino, Yoshiaki; Miyamoto, Hiroshi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Noriyuki; Kawashima, Takeshi

    2016-01-01

    Bivalve molluscs have flourished in marine environments, and many species constitute important aquatic resources. Recently, whole genome sequences from two bivalves, the pearl oyster, Pinctada fucata, and the Pacific oyster, Crassostrea gigas, have been decoded, making it possible to compare genomic sequences among molluscs, and to explore general and lineage-specific genetic features and trends in bivalves. In order to improve the quality of sequence data for these purposes, we have updated the entire P. fucata genome assembly. We present a new genome assembly of the pearl oyster, Pinctada fucata (version 2.0). To update the assembly, we conducted additional sequencing, obtaining accumulated sequence data amounting to 193× the P. fucata genome. Sequence redundancy in contigs that was caused by heterozygosity was removed in silico, which significantly improved subsequent scaffolding. Gene model version 2.0 was generated with the aid of manual gene annotations supplied by the P. fucata research community. Comparison of mollusc and other bilaterian genomes shows that gene arrangements of Hox, ParaHox, and Wnt clusters in the P. fucata genome are similar to those of other molluscs. Like the Pacific oyster, P. fucata possesses many genes involved in environmental responses and in immune defense. Phylogenetic analyses of heat shock protein70 and C1q domain-containing protein families indicate that extensive expansion of genes occurred independently in each lineage. Several gene duplication events prior to the split between the pearl oyster and the Pacific oyster are also evident. In addition, a number of tandem duplications of genes that encode shell matrix proteins are also well characterized in the P. fucata genome. Both the Pinctada and Crassostrea lineages have expanded specific gene families in a lineage-specific manner. Frequent duplication of genes responsible for shell formation in the P. fucata genome explains the diversity of mollusc shell structures. These

  15. Prevalence of exon 11 internal tandem duplications in the C-KIT proto-oncogene in Australian canine mast cell tumours.

    PubMed

    Tamlin, V S; Kessell, A E; Mccoy, R J; Dobson, E C; Smith, T S; Hebart, M; Brown, L; Mitrovic, D; Peaston, A E

    2017-10-01

    To measure the prevalence of internal tandem duplications (ITDs) in exon 11 of the proto-oncogene C-KIT in a sample of Australian cutaneous canine mast cell tumours (MCTs) drawn from general practice and to evaluate relationships between tumour mutation status and prognostic factors including signalment, tumour histological grade, tumour anatomical location and tumour size. C-KIT exon 11 ITDs were detected by PCR in DNA extracted from formalin-fixed, paraffin-embedded canine MCTs sourced from three veterinary diagnostic laboratories in Adelaide and Melbourne. Tumours were graded according to two different systems (Patnaik and Kiupel systems) by board-certified anatomical pathologists blinded to the PCR results. Relationships between tumour mutation status and prognostic factors were evaluated using a generalised binary logistic regression analysis. ITDs were identified in 13 of 74 cutaneous canine MCT samples, giving an overall prevalence of 17.6% (95% confidence interval: 8.9-26.2%). ITDs were detected in 10 of 18 Patnaik grade III MCTs (55.6%) and 11 of 22 Kiupel high-grade MCTs (50%). Wald chi-square analysis revealed that detection of tumour ITDs was significantly associated with both Patnaik's and Kiupel's histologic grading systems (each: P < 0.001). The presence of the ITDs in MCTs was not associated with signalment, tumour anatomical location or tumour size. The prevalence of C-KIT exon 11 ITDs in Australian canine MCTs is similar to the prevalence in overseas canine populations (overall prevalence in Australia approximately 18%). ITDs were more frequently identified in higher grade MCTs. © 2017 Australian Veterinary Association.

  16. Tandem repeats of the 5' non-transcribed spacer of Tetrahymena rDNA function as high copy number autonomous replicons in the macronucleus but do not prevent rRNA gene dosage regulation.

    PubMed Central

    Pan, W J; Blackburn, E H

    1995-01-01

    The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number. Images PMID:7784211

  17. The Histone Modification H3K27me3 Is Retained after Gene Duplication and Correlates with Conserved Noncoding Sequences in Arabidopsis

    PubMed Central

    Berke, Lidija; Snel, Berend

    2014-01-01

    The histone modification H3K27me3 is involved in repression of transcription and plays a crucial role in developmental transitions in both animals and plants. It is deposited by PRC2 (Polycomb repressive complex 2), a conserved protein complex. In Arabidopsis thaliana, H3K27me3 is found at 15% of all genes. These tend to encode transcription factors and other regulators important for development. However, it is not known how PRC2 is recruited to target loci nor how this set of target genes arose during Arabidopsis evolution. To resolve the latter, we integrated A. thaliana gene families with five independent genome-wide H3K27me3 data sets. Gene families were either significantly enriched or depleted of H3K27me3, showing a strong impact of shared ancestry to H3K27me3 distribution. To quantify this, we performed ancestral state reconstruction of H3K27me3 on phylogenetic trees of gene families. The set of H3K27me3-marked genes changed less than expected by chance, suggesting that H3K27me3 was retained after gene duplication. This retention suggests that the PRC2-recruiting signal could be encoded in the DNA and also conserved among certain duplicated genes. Indeed, H3K27me3-marked genes were overrepresented among paralogs sharing conserved noncoding sequences (CNSs) that are enriched with transcription factor binding sites. The association of upstream CNSs with H3K27me3-marked genes represents the first genome-wide connection between H3K27me3 and potential regulatory elements in plants. Thus, we propose that CNSs likely function as part of the PRC2 recruitment in plants. PMID:24567304

  18. Adaptations to Endosymbiosis in a Cnidarian-Dinoflagellate Association: Differential Gene Expression and Specific Gene Duplications

    PubMed Central

    Magnone, Virginie; Allemand, Denis; Furla, Paola; Sabourault, Cécile

    2011-01-01

    Trophic endosymbiosis between anthozoans and photosynthetic dinoflagellates forms the key foundation of reef ecosystems. Dysfunction and collapse of symbiosis lead to bleaching (symbiont expulsion), which is responsible for the severe worldwide decline of coral reefs. Molecular signals are central to the stability of this partnership and are therefore closely related to coral health. To decipher inter-partner signaling, we developed genomic resources (cDNA library and microarrays) from the symbiotic sea anemone Anemonia viridis. Here we describe differential expression between symbiotic (also called zooxanthellate anemones) or aposymbiotic (also called bleached) A. viridis specimens, using microarray hybridizations and qPCR experiments. We mapped, for the first time, transcript abundance separately in the epidermal cell layer and the gastrodermal cells that host photosynthetic symbionts. Transcriptomic profiles showed large inter-individual variability, indicating that aposymbiosis could be induced by different pathways. We defined a restricted subset of 39 common genes that are characteristic of the symbiotic or aposymbiotic states. We demonstrated that transcription of many genes belonging to this set is specifically enhanced in the symbiotic cells (gastroderm). A model is proposed where the aposymbiotic and therefore heterotrophic state triggers vesicular trafficking, whereas the symbiotic and therefore autotrophic state favors metabolic exchanges between host and symbiont. Several genetic pathways were investigated in more detail: i) a key vitamin K–dependant process involved in the dinoflagellate-cnidarian recognition; ii) two cnidarian tissue-specific carbonic anhydrases involved in the carbon transfer from the environment to the intracellular symbionts; iii) host collagen synthesis, mostly supported by the symbiotic tissue. Further, we identified specific gene duplications and showed that the cnidarian-specific isoform was also up-regulated both in the

  19. High Fractional Occupancy of a Tandem Maf Recognition Element and Its Role in Long-Range β-Globin Gene Regulation

    PubMed Central

    Stees, Jared R.; Hossain, Mir A.; Sunose, Tomoki; Kudo, Yasushi; Pardo, Carolina E.; Nabilsi, Nancy H.; Darst, Russell P.; Poudyal, Rosha; Igarashi, Kazuhiko; Kladde, Michael P.

    2015-01-01

    Enhancers and promoters assemble protein complexes that ultimately regulate the recruitment and activity of RNA polymerases. Previous work has shown that at least some enhancers form stable protein complexes, leading to the formation of enhanceosomes. We analyzed protein-DNA interactions in the murine β-globin gene locus using the methyltransferase accessibility protocol for individual templates (MAPit). The data show that a tandem Maf recognition element (MARE) in locus control region (LCR) hypersensitive site 2 (HS2) reveals a remarkably high degree of occupancy during differentiation of mouse erythroleukemia cells. Most of the other transcription factor binding sites in LCR HS2 or in the adult β-globin gene promoter regions exhibit low fractional occupancy, suggesting highly dynamic protein-DNA interactions. Targeting of an artificial zinc finger DNA-binding domain (ZF-DBD) to the HS2 tandem MARE caused a reduction in the association of MARE-binding proteins and transcription complexes at LCR HS2 and the adult βmajor-globin gene promoter but did not affect expression of the βminor-globin gene. The data demonstrate that a stable MARE-associated footprint in LCR HS2 is important for the recruitment of transcription complexes to the adult βmajor-globin gene promoter during erythroid cell differentiation. PMID:26503787

  20. Duplicate retention in signalling proteins and constraints from network dynamics.

    PubMed

    Soyer, O S; Creevey, C J

    2010-11-01

    Duplications are a major driving force behind evolution. Most duplicates are believed to fix through genetic drift, but it is not clear whether this process affects all duplications equally or whether there are certain gene families that are expected to show neutral expansions under certain circumstances. Here, we analyse the neutrality of duplications in different functional classes of signalling proteins based on their effects on response dynamics. We find that duplications involving intermediary proteins in a signalling network are neutral more often than those involving receptors. Although the fraction of neutral duplications in all functional classes increase with decreasing population size and selective pressure on dynamics, this effect is most pronounced for receptors, indicating a possible expansion of receptors in species with small population size. In line with such an expectation, we found a statistically significant increase in the number of receptors as a fraction of genome size in eukaryotes compared with prokaryotes. Although not confirmative, these results indicate that neutral processes can be a significant factor in shaping signalling networks and affect proteins from different functional classes differently. © 2010 The Authors. Journal Compilation © 2010 European Society For Evolutionary Biology.

  1. Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals

    PubMed Central

    Patel, Vidushi S; Cooper, Steven JB; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer AM

    2008-01-01

    Background Vertebrate alpha (α)- and beta (β)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the α- and β-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil β-globin gene (ω) in the marsupial α-cluster, however, suggested that duplication of the α-β cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous α- and β-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. Results The platypus α-globin cluster (chromosome 21) contains embryonic and adult α- globin genes, a β-like ω-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-ζ-ζ'-αD-α3-α2-α1-ω-GBY-3'. The platypus β-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-ε-β-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate α-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal β-globin clusters are embedded in olfactory genes. Thus, the mammalian α- and β-globin clusters are orthologous to the bird α- and β-globin clusters respectively. Conclusion We propose that α- and β-globin clusters evolved from an ancient MPG-C16orf35-α-β-GBY-LUC7L arrangement 410 million years ago. A copy of the original β (represented by ω in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of β-globin genes with different expression profiles in different lineages. PMID:18657265

  2. Platypus globin genes and flanking loci suggest a new insertional model for beta-globin evolution in birds and mammals.

    PubMed

    Patel, Vidushi S; Cooper, Steven J B; Deakin, Janine E; Fulton, Bob; Graves, Tina; Warren, Wesley C; Wilson, Richard K; Graves, Jennifer A M

    2008-07-25

    Vertebrate alpha (alpha)- and beta (beta)-globin gene families exemplify the way in which genomes evolve to produce functional complexity. From tandem duplication of a single globin locus, the alpha- and beta-globin clusters expanded, and then were separated onto different chromosomes. The previous finding of a fossil beta-globin gene (omega) in the marsupial alpha-cluster, however, suggested that duplication of the alpha-beta cluster onto two chromosomes, followed by lineage-specific gene loss and duplication, produced paralogous alpha- and beta-globin clusters in birds and mammals. Here we analyse genomic data from an egg-laying monotreme mammal, the platypus (Ornithorhynchus anatinus), to explore haemoglobin evolution at the stem of the mammalian radiation. The platypus alpha-globin cluster (chromosome 21) contains embryonic and adult alpha- globin genes, a beta-like omega-globin gene, and the GBY globin gene with homology to cytoglobin, arranged as 5'-zeta-zeta'-alphaD-alpha3-alpha2-alpha1-omega-GBY-3'. The platypus beta-globin cluster (chromosome 2) contains single embryonic and adult globin genes arranged as 5'-epsilon-beta-3'. Surprisingly, all of these globin genes were expressed in some adult tissues. Comparison of flanking sequences revealed that all jawed vertebrate alpha-globin clusters are flanked by MPG-C16orf35 and LUC7L, whereas all bird and mammal beta-globin clusters are embedded in olfactory genes. Thus, the mammalian alpha- and beta-globin clusters are orthologous to the bird alpha- and beta-globin clusters respectively. We propose that alpha- and beta-globin clusters evolved from an ancient MPG-C16orf35-alpha-beta-GBY-LUC7L arrangement 410 million years ago. A copy of the original beta (represented by omega in marsupials and monotremes) was inserted into an array of olfactory genes before the amniote radiation (>315 million years ago), then duplicated and diverged to form orthologous clusters of beta-globin genes with different expression

  3. Partial duplication of the CRYBB1-CRYBA4 locus is associated with autosomal dominant congenital cataract

    PubMed Central

    Siggs, Owen M; Javadiyan, Shari; Sharma, Shiwani; Souzeau, Emmanuelle; Lower, Karen M; Taranath, Deepa A; Black, Jo; Pater, John; Willoughby, John G; Burdon, Kathryn P; Craig, Jamie E

    2017-01-01

    Congenital cataract is a rare but severe paediatric visual impediment, often caused by variants in one of several crystallin genes that produce the bulk of structural proteins in the lens. Here we describe a pedigree with autosomal dominant isolated congenital cataract and linkage to the crystallin gene cluster on chromosome 22. No rare single nucleotide variants or short indels were identified by exome sequencing, yet copy number variant analysis revealed a duplication spanning both CRYBB1 and CRYBA4. While the CRYBA4 duplication was complete, the CRYBB1 duplication was not, with the duplicated CRYBB1 product predicted to create a gain of function allele. This association suggests a new genetic mechanism for the development of isolated congenital cataract. PMID:28272538

  4. SHOX duplications found in some cases with type I Mayer-Rokitansky-Kuster-Hauser syndrome.

    PubMed

    Gervasini, Cristina; Grati, Francesca Romana; Lalatta, Faustina; Tabano, Silvia; Gentilin, Barbara; Colapietro, Patrizia; De Toffol, Simona; Frontino, Giada; Motta, Francesca; Maitz, Silvia; Bernardini, Laura; Dallapiccola, Bruno; Fedele, Luigi; Larizza, Lidia; Miozzo, Monica

    2010-10-01

    The Mayer-Rokitansky-Küster-Hauser syndrome is defined as congenital aplasia of müllerian ducts derived structures in females with a normal female chromosomal and gonadal sex. Most cases with Mayer-Rokitansky-Küster-Hauser syndrome are sporadic, although familial cases have been reported. The genetic basis of Mayer-Rokitansky-Küster-Hauser syndrome is largely unknown and seems heterogeneous, and a small number of cases were found to have mutations in the WNT4 gene. The aim of this study was to identify possible recurrent submicroscopic imbalances in a cohort of familial and sporadic cases with Mayer-Rokitansky-Küster-Hauser syndrome. Multiplex ligation-dependent probe amplification was used to screen the subtelomeric sequences of all chromosomes in 30 patients with Mayer-Rokitansky-Küster-Hauser syndrome (sporadic, n = 27 and familial, n = 3). Segregation analysis and pyrosequencing were applied to validate the MLPA results in the informative family. Partial duplication of the Xpter pseudoautosomal region 1 containing the short stature homeobox (SHOX) gene was detected in five patients with Mayer-Rokitansky-Küster-Hauser syndrome (familial, n = 3 and sporadic, n = 2) and not in 53 healthy controls. The duplications were not overlapping, and SHOX was never entirely duplicated. Haplotyping in the informative family revealed that SHOX gene duplication was inherited from the unaffected father and was absent in two healthy sisters. Partial duplication of SHOX gene is found in some cases with both familial and sporadic Mayer-Rokitansky-Küster-Hauser type I syndrome.

  5. The major resistance gene cluster in lettuce is highly duplicated and spans several megabases.

    PubMed Central

    Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W

    1998-01-01

    At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over. PMID:9811791

  6. Formation of new chromatin domains determines pathogenicity of genomic duplications.

    PubMed

    Franke, Martin; Ibrahim, Daniel M; Andrey, Guillaume; Schwarzer, Wibke; Heinrich, Verena; Schöpflin, Robert; Kraft, Katerina; Kempfer, Rieke; Jerković, Ivana; Chan, Wing-Lee; Spielmann, Malte; Timmermann, Bernd; Wittler, Lars; Kurth, Ingo; Cambiaso, Paola; Zuffardi, Orsetta; Houge, Gunnar; Lambie, Lindsay; Brancati, Francesco; Pombo, Ana; Vingron, Martin; Spitz, Francois; Mundlos, Stefan

    2016-10-13

    Chromosome conformation capture methods have identified subchromosomal structures of higher-order chromatin interactions called topologically associated domains (TADs) that are separated from each other by boundary regions. By subdividing the genome into discrete regulatory units, TADs restrict the contacts that enhancers establish with their target genes. However, the mechanisms that underlie partitioning of the genome into TADs remain poorly understood. Here we show by chromosome conformation capture (capture Hi-C and 4C-seq methods) that genomic duplications in patient cells and genetically modified mice can result in the formation of new chromatin domains (neo-TADs) and that this process determines their molecular pathology. Duplications of non-coding DNA within the mouse Sox9 TAD (intra-TAD) that cause female to male sex reversal in humans, showed increased contact of the duplicated regions within the TAD, but no change in the overall TAD structure. In contrast, overlapping duplications that extended over the next boundary into the neighbouring TAD (inter-TAD), resulted in the formation of a new chromatin domain (neo-TAD) that was isolated from the rest of the genome. As a consequence of this insulation, inter-TAD duplications had no phenotypic effect. However, incorporation of the next flanking gene, Kcnj2, in the neo-TAD resulted in ectopic contacts of Kcnj2 with the duplicated part of the Sox9 regulatory region, consecutive misexpression of Kcnj2, and a limb malformation phenotype. Our findings provide evidence that TADs are genomic regulatory units with a high degree of internal stability that can be sculptured by structural genomic variations. This process is important for the interpretation of copy number variations, as these variations are routinely detected in diagnostic tests for genetic disease and cancer. This finding also has relevance in an evolutionary setting because copy-number differences are thought to have a crucial role in the evolution of

  7. Characterization of Conserved Tandem Donor Sites and Intronic Motifs Required for Alternative Splicing in Corticosteroid Receptor Genes

    PubMed Central

    Qian, Xiaoxiao; Matthews, Laura; Lightman, Stafford; Ray, David; Norman, Michael

    2015-01-01

    Alternative splicing events from tandem donor sites result in mRNA variants coding for additional amino acids in the DNA binding domain of both the glucocorticoid (GR) and mineralocorticoid (MR) receptors. We now show that expression of both splice variants is extensively conserved in mammalian species, providing strong evidence for their functional significance. An exception to the conservation of the MR tandem splice site (an A at position +5 of the MR+12 donor site in the mouse) was predicted to decrease U1 small nuclear RNA binding. In accord with this prediction, we were unable to detect the MR+12 variant in this species. The one exception to the conservation of the GR tandem splice site, an A at position +3 of the platypus GRγ donor site that was predicted to enhance binding of U1 snRNA, was unexpectedly associated with decreased expression of the variant from the endogenous gene as well as a minigene. An intronic pyrimidine motif present in both GR and MR genes was found to be critical for usage of the downstream donor site, and overexpression of TIA1/TIAL1 RNA binding proteins, which are known to bind such motifs, led to a marked increase in the proportion of GRγ and MR+12. These results provide striking evidence for conservation of a complex splicing mechanism that involves processes other than stochastic spliceosome binding and identify a mechanism that would allow regulation of variant expression. PMID:19819975

  8. Intratypic variability of a tandem repeat locus within the DNA polymerase gene of human herpes simplex virus type 2.

    PubMed

    Sun, Yongjiang; Chan, Roy Kum Wah; Tan, Suat Hoon

    2004-01-01

    In this study, the irntratypic variability of a tandem repeat locus within the DNA polymerase (pol) gene of human herpes simplex virus type 2 (HSV2) was uncovered. The locus contained variable numbers of tandem dodecanucleotide (5'-GAC GAG GAC GGG-3') repetitive units. Our result showed that approximately 95% of analyzed HSV2 clinical isolates and the current GenBank HSV2 strains contained two copies of the repetitive units. From genital herpes specimens, three new HSV2 strains, which respectively contained 1, 3, and 4 copies of the repetitive units, were identified. This variable number of tandem repeat (VNTR) locus is absent in HSV1, and thus it also contributes to the intertypic variability of HSV1 and HSV2. The intratypic variability of the locus may be useful for HSV2 strain genotyping and this application is discussed.

  9. Systematic Analysis of Sequences and Expression Patterns of Drought-Responsive Members of the HD-Zip Gene Family in Maize

    PubMed Central

    Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu

    2011-01-01

    Background Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). Methods and Findings In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Conclusions Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development. PMID:22164299

  10. Systematic analysis of sequences and expression patterns of drought-responsive members of the HD-Zip gene family in maize.

    PubMed

    Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu

    2011-01-01

    Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development.

  11. The evolutionary duplication and probable demise of an endodermal GATA factor in Caenorhabditis elegans.

    PubMed

    Fukushige, Tetsunari; Goszczynski, Barbara; Tian, Helen; McGhee, James D

    2003-10-01

    WGATAR sequences); ELT-4 protein neither enhanced nor inhibited the strong sequence-specific binding of the ELT-2 protein. Whereas ELT-2 protein is a strong transcriptional activator in yeast, ELT-4 protein has no such activity under similar conditions, nor does it influence the transcriptional activity of coexpressed ELT-2 protein. Although an elt-2 homolog was easily identified in the genomic sequence of the related nematode C. briggsae, no elt-4 homolog could be identified. Analysis of the changes in silent third codon positions within the DNA-binding domains indicates that elt-4 arose as a duplication of elt-2, some 25-55 MYA. Thus, elt-4 has survived far longer than the average duplicated gene in C. elegans, even though no obvious biological function could be detected. elt-4 provides an interesting example of a tandemly duplicated gene that may originally have been the same size as elt-2 but has gradually been whittled down to its present size of little more than a zinc finger. Although elt-4 must confer (or must have conferred) some selective advantage to C. elegans, we suggest that its ultimate evolutionary fate will be disappearance from the C. elegans genome.

  12. Gene Duplication Leads to Altered Membrane Topology of a Cytochrome P450 Enzyme in Seed Plants

    PubMed Central

    Renault, Hugues; De Marothy, Minttu; Jonasson, Gabriella; Lara, Patricia; Nelson, David R.; Nilsson, IngMarie; André, François; von Heijne, Gunnar; Werck-Reichhart, Danièle

    2017-01-01

    Abstract Evolution of the phenolic metabolism was critical for the transition of plants from water to land. A cytochrome P450, CYP73, with cinnamate 4-hydroxylase (C4H) activity, catalyzes the first plant-specific and rate-limiting step in this pathway. The CYP73 gene is absent from green algae, and first detected in bryophytes. A CYP73 duplication occurred in the ancestor of seed plants and was retained in Taxaceae and most angiosperms. In spite of a clear divergence in primary sequence, both paralogs can fulfill comparable cinnamate hydroxylase roles both in vitro and in vivo. One of them seems dedicated to the biosynthesis of lignin precursors. Its N-terminus forms a single membrane spanning helix and its properties and length are highly constrained. The second is characterized by an elongated and variable N-terminus, reminiscent of ancestral CYP73s. Using as proxies the Brachypodium distachyon proteins, we show that the elongation of the N-terminus does not result in an altered subcellular localization, but in a distinct membrane topology. Insertion in the membrane of endoplasmic reticulum via a double-spanning open hairpin structure allows reorientation to the lumen of the catalytic domain of the protein. In agreement with participation to a different functional unit and supramolecular organization, the protein displays modified heme proximal surface. These data suggest the evolution of divergent C4H enzymes feeding different branches of the phenolic network in seed plants. It shows that specialization required for retention of gene duplicates may result from altered protein topology rather than change in enzyme activity. PMID:28505373

  13. FGFR3 gene mutation plus GRB10 gene duplication in a patient with achondroplasia plus growth delay with prenatal onset.

    PubMed

    Yuan, Haiming; Huang, Linhuan; Hu, Xizi; Li, Qian; Sun, Xiaofang; Xie, Yingjun; Kong, Shu; Wang, Xiaoman

    2016-07-02

    Achondroplasia is a well-defined and common bone dysplasia. Genotype- and phenotype-level correlations have been found between the clinical symptoms of achondroplasia and achondroplasia-specific FGFR3 mutations. A 2-year-old boy with clinical features consistent with achondroplasia and Silver-Russell syndrome-like symptoms was found to carry a mutation in the fibroblast growth factor receptor-3 (FGFR3) gene at c.1138G > A (p.Gly380Arg) and a de novo 574 kb duplication at chromosome 7p12.1 that involved the entire growth-factor receptor bound protein 10 (GRB10) gene. Using quantitative real-time PCR analysis, GRB10 was over-expressed, and, using enzyme-linked immunosorbent assays for IGF1 and IGF-binding protein-3 (IGFBP3), we found that IGF1 and IGFBP3 were low-expressed in this patient. We demonstrate that a combination of uncommon, rare and exceptional molecular defects related to the molecular bases of particular birth defects can be analyzed and diagnosed to potentially explain the observed variability in the combination of molecular defects.

  14. Gene duplication and divergence affecting drug content in Cannabis sativa.

    PubMed

    Weiblen, George D; Wenger, Jonathan P; Craft, Kathleen J; ElSohly, Mahmoud A; Mehmedic, Zlatko; Treiber, Erin L; Marks, M David

    2015-12-01

    Cannabis sativa is an economically important source of durable fibers, nutritious seeds, and psychoactive drugs but few economic plants are so poorly understood genetically. Marijuana and hemp were crossed to evaluate competing models of cannabinoid inheritance and to explain the predominance of tetrahydrocannabinolic acid (THCA) in marijuana compared with cannabidiolic acid (CBDA) in hemp. Individuals in the resulting F2 population were assessed for differential expression of cannabinoid synthase genes and were used in linkage mapping. Genetic markers associated with divergent cannabinoid phenotypes were identified. Although phenotypic segregation and a major quantitative trait locus (QTL) for the THCA/CBDA ratio were consistent with a simple model of codominant alleles at a single locus, the diversity of THCA and CBDA synthase sequences observed in the mapping population, the position of enzyme coding loci on the map, and patterns of expression suggest multiple linked loci. Phylogenetic analysis further suggests a history of duplication and divergence affecting drug content. Marijuana is distinguished from hemp by a nonfunctional CBDA synthase that appears to have been positively selected to enhance psychoactivity. An unlinked QTL for cannabinoid quantity may also have played a role in the recent escalation of drug potency. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  15. Duplication and concerted evolution in a master sex determiner under balancing selection.

    PubMed

    Privman, Eyal; Wurm, Yannick; Keller, Laurent

    2013-05-07

    The transformer (tra) gene is a key regulator in the signalling hierarchy controlling all aspects of somatic sexual differentiation in Drosophila and other insects. Here, we show that six of the seven sequenced ants have two copies of tra. Surprisingly, the two paralogues are always more similar within species than among species. Comparative sequence analyses indicate that this pattern is owing to the ongoing concerted evolution after an ancestral duplication rather than independent duplications in each of the six species. In particular, there was strong support for inter-locus recombination between the paralogues of the ant Atta cephalotes. In the five species where the location of paralogues is known, they are adjacent to each other in four cases and separated by only few genes in the fifth case. Because there have been extensive genomic rearrangements in these lineages, this suggests selection acting to conserve their synteny. In three species, we also find a signature of positive selection in one of the paralogues. In three bee species where information is available, the tra gene is also duplicated, the copies are adjacent and in at least one species there was recombination between paralogues. These results suggest that concerted evolution plays an adaptive role in the evolution of this gene family.

  16. Novel variable number of tandem repeats of gibbon MAOA gene and its evolutionary significance.

    PubMed

    Choi, Yuri; Jung, Yi-Deun; Ayarpadikannan, Selvam; Koga, Akihiko; Imai, Hiroo; Hirai, Hirohisa; Roos, Christian; Kim, Heui-Soo

    2014-08-01

    Variable number of tandem repeats (VNTRs) are scattered throughout the primate genome, and genetic variation of these VNTRs have been accumulated during primate radiation. Here, we analyzed VNTRs upstream of the monoamine oxidase A (MAOA) gene in 11 different gibbon species. An abundance of truncated VNTR sequences and copy number differences were observed compared to those of human VNTR sequences. To better understand the biological role of these VNTRs, a luciferase activity assay was conducted and results indicated that selected VNTR sequences of the MAOA gene from human and three different gibbon species (Hylobates klossii, Hylobates lar, and Nomascus concolor) showed silencing ability. Together, these data could be useful for understanding the evolutionary history and functional significance of MAOA VNTR sequences in gibbon species.

  17. Mutation screening of patients with Alzheimer disease identifies APP locus duplication in a Swedish patient

    PubMed Central

    2011-01-01

    Background Missense mutations in three different genes encoding amyloid-β precursor protein, presenilin 1 and presenilin 2 are recognized to cause familial early-onset Alzheimer disease. Also duplications of the amyloid precursor protein gene have been shown to cause the disease. At the Dept. of Geriatric Medicine, Karolinska University Hospital, Sweden, patients are referred for mutation screening for the identification of nucleotide variations and for determining copy-number of the APP locus. Methods We combined the method of microsatellite marker genotyping with a quantitative real-time PCR analysis to detect duplications in patients with Alzheimer disease. Results In 22 DNA samples from individuals diagnosed with clinical Alzheimer disease, we identified one patient carrying a duplication on chromosome 21 which included the APP locus. Further mapping of the chromosomal region by array-comparative genome hybridization showed that the duplication spanned a maximal region of 1.09 Mb. Conclusions This is the first report of an APP duplication in a Swedish Alzheimer patient and describes the use of quantitative real-time PCR as a tool for determining copy-number of the APP locus. PMID:22044463

  18. Mutation screening of patients with Alzheimer disease identifies APP locus duplication in a Swedish patient.

    PubMed

    Thonberg, Håkan; Fallström, Marie; Björkström, Jenny; Schoumans, Jacqueline; Nennesmo, Inger; Graff, Caroline

    2011-11-01

    Missense mutations in three different genes encoding amyloid-β precursor protein, presenilin 1 and presenilin 2 are recognized to cause familial early-onset Alzheimer disease. Also duplications of the amyloid precursor protein gene have been shown to cause the disease. At the Dept. of Geriatric Medicine, Karolinska University Hospital, Sweden, patients are referred for mutation screening for the identification of nucleotide variations and for determining copy-number of the APP locus. We combined the method of microsatellite marker genotyping with a quantitative real-time PCR analysis to detect duplications in patients with Alzheimer disease. In 22 DNA samples from individuals diagnosed with clinical Alzheimer disease, we identified one patient carrying a duplication on chromosome 21 which included the APP locus. Further mapping of the chromosomal region by array-comparative genome hybridization showed that the duplication spanned a maximal region of 1.09 Mb. This is the first report of an APP duplication in a Swedish Alzheimer patient and describes the use of quantitative real-time PCR as a tool for determining copy-number of the APP locus.

  19. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

    PubMed

    Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

    2015-02-01

    WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during fiber development process. In conclusion, this study provides important information on the evolution and function of WRKY gene family in cotton species.

  20. Amplification of the BCR/ABL fusion gene clustered on a masked Philadelphia chromosome in a patient with myeloblastic crisis of chronic myelocytic leukemia.

    PubMed

    Gargallo, Patricia M; Cuello, Maria Teresa; Aranguren, Pedro Negri; Larripa, Irene B

    2003-06-01

    Although the chronic phase of chronic myelocytic leukemia (CML) is characterized by the Philadelphia (Ph) chromosome creating a hybrid BCR/ABL gene, additional genetic changes involved in blast crisis are poorly understood. We report a 4-8-fold amplification by tandem duplication of the BCR/ABL fusion gene clustered on a masked Ph chromosome in a 61-year-old male patient with CML in myeloblastic crisis. Our finding suggests that the BCR/ABL amplification may play a role as a novel mechanism in the progression to an aggressive blast transformation in some cases of Ph-positive CML.

  1. Xq28 duplications including MECP2 in five females: Expanding the phenotype to severe mental retardation.

    PubMed

    Bijlsma, E K; Collins, A; Papa, F T; Tejada, M I; Wheeler, P; Peeters, E A J; Gijsbers, A C J; van de Kamp, J M; Kriek, M; Losekoot, M; Broekma, A J; Crolla, J A; Pollazzon, M; Mucciolo, M; Katzaki, E; Disciglio, V; Ferreri, M I; Marozza, A; Mencarelli, M A; Castagnini, C; Dosa, L; Ariani, F; Mari, F; Canitano, R; Hayek, G; Botella, M P; Gener, B; Mínguez, M; Renieri, A; Ruivenkamp, C A L

    2012-06-01

    Duplications leading to functional disomy of chromosome Xq28, including MECP2 as the critical dosage-sensitive gene, are associated with a distinct clinical phenotype in males, characterized by severe mental retardation, infantile hypotonia, progressive neurologic impairment, recurrent infections, bladder dysfunction, and absent speech. Female patients with Xq duplications including MECP2 are rare. Only recently submicroscopic duplications of this region on Xq28 have been recognized in four females, and a triplication in a fifth, all in combination with random X-chromosome inactivation (XCI). Based on this small series, it was concluded that in females with MECP2 duplication and random XCI, the typical symptoms of affected boys are not present. We present clinical and molecular data on a series of five females with an Xq28 duplication including the MECP2 gene, both isolated and as the result of a translocation, and compare them with the previously reported cases of small duplications in females. The collected data indicate that the associated phenotype in females is distinct from males with similar duplications, but the clinical effects may be as severe as seen in males. Copyright © 2012 Elsevier Masson SAS. All rights reserved.

  2. Xq28 duplications including MECP2 in five females: Expanding the phenotype to severe mental retardation

    PubMed Central

    Bijlsma, E.K.; Collins, A.; Papa, F.T.; Tejada, M.I.; Wheeler, P.; Peeters, E.A.J.; Gijsbers, A.C.J.; van de Kamp, J.M.; Kriek, M.; Losekoot, M.; Broekma, A.J.; Crolla, J.A.; Pollazzon, M.; Mucciolo, M.; Katzaki, E.; Disciglio, V.; Ferreri, M.I.; Marozza, A.; Mencarelli, M.A.; Castagnini, C.; Dosa, L.; Ariani, F.; Mari, F.; Canitano, R.; Hayek, G.; Botella, M.P.; Gener, B.; Mínguez, M.; Renieri, A.; Ruivenkamp, C.A.L.

    2012-01-01

    Duplications leading to functional disomy of chromosome Xq28, including MECP2 as the critical dosage-sensitive gene, are associated with a distinct clinical phenotype in males, characterized by severe mental retardation, infantile hypotonia, progressive neurologic impairment, recurrent infections, bladder dysfunction, and absent speech. Female patients with Xq duplications including MECP2 are rare. Only recently submicroscopic duplications of this region on Xq28 have been recognized in four females, and a triplication in a fifth, all in combination with random X-chromosome inactivation (XCI). Based on this small series, it was concluded that in females with MECP2 duplication and random XCI, the typical symptoms of affected boys are not present. We present clinical and molecular data on a series of five females with an Xq28 duplication including the MECP2 gene, both isolated and as the result of a translocation, and compare them with the previously reported cases of small duplications in females. The collected data indicate that the associated phenotype in females is distinct from males with similar duplications, but the clinical effects may be as severe as seen in males. PMID:22522176

  3. XX male sex reversal with genital abnormalities associated with a de novo SOX3 gene duplication.

    PubMed

    Moalem, Sharon; Babul-Hirji, Riyana; Stavropolous, Dmitri J; Wherrett, Diane; Bägli, Darius J; Thomas, Paul; Chitayat, David

    2012-07-01

    Differentiation of the bipotential gonad into testis is initiated by the Y chromosome-linked gene SRY (Sex-determining Region Y) through upregulation of its autosomal direct target gene SOX9 (Sry-related HMG box-containing gene 9). Sequence and chromosome homology studies have shown that SRY most probably evolved from SOX3, which in humans is located at Xq27.1. Mutations causing SOX3 loss-of-function do not affect the sex determination in mice or humans. However, transgenic mouse studies have shown that ectopic expression of Sox3 in the bipotential gonad results in upregulation of Sox9, resulting in testicular induction and XX male sex reversal. However, the mechanism by which these rearrangements cause sex reversal and the frequency with which they are associated with disorders of sex development remains unclear. Rearrangements of the SOX3 locus were identified recently in three cases of human XX male sex reversal. We report on a case of XX male sex reversal associated with a novel de novo duplication of the SOX3 gene. These data provide additional evidence that SOX3 gain-of-function in the XX bipotential gonad causes XX male sex reversal and further support the hypothesis that SOX3 is the evolutionary antecedent of SRY. Copyright © 2012 Wiley Periodicals, Inc.

  4. Genome-wide survey and expression analysis of F-box genes in chickpea.

    PubMed

    Gupta, Shefali; Garg, Vanika; Kant, Chandra; Bhatia, Sabhyata

    2015-02-13

    The F-box genes constitute one of the largest gene families in plants involved in degradation of cellular proteins. F-box proteins can recognize a wide array of substrates and regulate many important biological processes such as embryogenesis, floral development, plant growth and development, biotic and abiotic stress, hormonal responses and senescence, among others. However, little is known about the F-box genes in the important legume crop, chickpea. The available draft genome sequence of chickpea allowed us to conduct a genome-wide survey of the F-box gene family in chickpea. A total of 285 F-box genes were identified in chickpea which were classified based on their C-terminal domain structures into 10 subfamilies. Thirteen putative novel motifs were also identified in F-box proteins with no known functional domain at their C-termini. The F-box genes were physically mapped on the 8 chickpea chromosomes and duplication events were investigated which revealed that the F-box gene family expanded largely due to tandem duplications. Phylogenetic analysis classified the chickpea F-box genes into 9 clusters. Also, maximum syntenic relationship was observed with soybean followed by Medicago truncatula, Lotus japonicus and Arabidopsis. Digital expression analysis of F-box genes in various chickpea tissues as well as under abiotic stress conditions utilizing the available chickpea transcriptome data revealed differential expression patterns with several F-box genes specifically expressing in each tissue, few of which were validated by using quantitative real-time PCR. The genome-wide analysis of chickpea F-box genes provides new opportunities for characterization of candidate F-box genes and elucidation of their function in growth, development and stress responses for utilization in chickpea improvement.

  5. p53 protects against genome instability following centriole duplication failure

    PubMed Central

    Lambrus, Bramwell G.; Uetake, Yumi; Clutario, Kevin M.; Daggubati, Vikas; Snyder, Michael; Sluder, Greenfield

    2015-01-01

    Centriole function has been difficult to study because of a lack of specific tools that allow persistent and reversible centriole depletion. Here we combined gene targeting with an auxin-inducible degradation system to achieve rapid, titratable, and reversible control of Polo-like kinase 4 (Plk4), a master regulator of centriole biogenesis. Depletion of Plk4 led to a failure of centriole duplication that produced an irreversible cell cycle arrest within a few divisions. This arrest was not a result of a prolonged mitosis, chromosome segregation errors, or cytokinesis failure. Depleting p53 allowed cells that fail centriole duplication to proliferate indefinitely. Washout of auxin and restoration of endogenous Plk4 levels in cells that lack centrioles led to the penetrant formation of de novo centrioles that gained the ability to organize microtubules and duplicate. In summary, we uncover a p53-dependent surveillance mechanism that protects against genome instability by preventing cell growth after centriole duplication failure. PMID:26150389

  6. Distal 7q11.23 Duplication, an Emerging Microduplication Syndrome: A Case Report and Further Characterisation.

    PubMed

    Faundes, Víctor; Santa María, Lorena; Morales, Paulina; Curotto, Bianca; Parraguez, María M

    2016-10-01

    Chromosome 7q11.23 duplication syndrome is a well-recognised syndrome which involves the duplication of the same genes located in the Williams-Beuren critical region. However, in 2010, 4 patients were reported with a microduplication only in the HIP1 and YWHAG genes. We refer to this as a distal 7q11.23 duplication (dup7q11.23D). Here, we report the fifth de novo patient with dup7q11.23D, whose symptoms may be explained by YWHAG overexpression as was demonstrated recently in mice and obese patients. Finally, further studies will be necessary to delineate this emerging microduplication syndrome.

  7. Phylogeny of C4-photosynthesis enzymes based on algal transcriptomic and genomic data supports an archaeal/proteobacterial origin and multiple duplication for most C4-related genes.

    PubMed

    Chi, Shan; Wu, Shuangxiu; Yu, Jun; Wang, Xumin; Tang, Xuexi; Liu, Tao

    2014-01-01

    Both Calvin-Benson-Bassham (C3) and Hatch-Slack (C4) cycles are most important autotrophic CO2 fixation pathways on today's Earth. C3 cycle is believed to be originated from cyanobacterial endosymbiosis. However, studies on evolution of different biochemical variants of C4 photosynthesis are limited to tracheophytes and origins of C4-cycle genes are not clear till now. Our comprehensive analyses on bioinformatics and phylogenetics of novel transcriptomic sequencing data of 21 rhodophytes and 19 Phaeophyceae marine species and public genomic data of more algae, tracheophytes, cyanobacteria, proteobacteria and archaea revealed the origin and evolution of C4 cycle-related genes. Almost all of C4-related genes were annotated in extensive algal lineages with proteobacterial or archaeal origins, except for phosphoenolpyruvate carboxykinase (PCK) and aspartate aminotransferase (AST) with both cyanobacterial and archaeal/proteobacterial origin. Notably, cyanobacteria may not possess complete C4 pathway because of the flawed annotation of pyruvate orthophosphate dikinase (PPDK) genes in public data. Most C4 cycle-related genes endured duplication and gave rise to functional differentiation and adaptation in different algal lineages. C4-related genes of NAD-ME (NAD-malic enzyme) and PCK subtypes exist in most algae and may be primitive ones, while NADP-ME (NADP-malic enzyme) subtype genes might evolve from NAD-ME subtype by gene duplication in chlorophytes and tracheophytes.

  8. Genome-wide identification, characterisation and expression analysis of the MADS-box gene family in Prunus mume.

    PubMed

    Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia

    2014-10-01

    MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might experience a species-specific evolution process in P. mume. The MADS-box gene family might experience an evolution process from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and the functions of duplicated genes diverged after the duplication events. In addition to its involvement in the development of female gametophytes, type I genes also play roles in male gametophytes development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes played in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.

  9. The evolutionary implications of knox-I gene duplications in conifers: correlated evidence from phylogeny, gene mapping, and analysis of functional divergence.

    PubMed

    Guillet-Claude, Carine; Isabel, Nathalie; Pelgas, Betty; Bousquet, Jean

    2004-12-01

    Class I knox genes code for transcription factors that play an essential role in plant growth and development as central regulators of meristem cell identity. Based on the analysis of new cDNA sequences from various tissues and genomic DNA sequences, we identified a highly diversified group of class I knox genes in conifers. Phylogenetic analyses of complete amino acid sequences from various seed plants indicated that all conifer sequences formed a monophyletic group. Within conifers, four subgroups here named genes KN1 to KN4 were well delineated, each regrouping pine and spruce sequences. KN4 was sister group to KN3, which was sister group to KN1 and KN2. Genetic mapping on the genomes of two divergent Picea species indicated that KN1 and KN2 are located close to each other on the same linkage group, whereas KN3 and KN4 mapped on different linkage groups, correlating the more ancient divergence of these two genes. The proportion of synonymous and nonsynonymous substitutions suggested intense purifying selection for the four genes. However, rates of substitution per year indicated an evolution in two steps: faster rates were noted after gene duplications, followed subsequently by lower rates. Positive directional selection was detected for most of the internal branches harboring an accelerated rate of evolution. In addition, many sites with highly significant amino acid rate shift were identified between these branches. However, the tightly linked KN1 and KN2 did not diverge as much from each other. The implications of the correlation between phylogenetic, structural, and functional information are discussed in relation to the diversification of the knox-I gene family in conifers.

  10. Gene duplication and phylogeography of North American members of the Hart Park serogroup of avian rhabdoviruses.

    PubMed

    Allison, Andrew B; Mead, Daniel G; Palacios, Gustavo F; Tesh, Robert B; Holmes, Edward C

    2014-01-05

    Flanders virus (FLAV) and Hart Park virus (HPV) are rhabdoviruses that circulate in mosquito-bird cycles in the eastern and western United States, respectively, and constitute the only two North American representatives of the Hart Park serogroup. Previously, it was suggested that FLAV is unique among the rhabdoviruses in that it contains two pseudogenes located between the P and M genes, while the cognate sequence for HPV has been lacking. Herein, we demonstrate that FLAV and HPV do not contain pseudogenes in this region, but encode three small functional proteins designated as U1-U3 that apparently arose by gene duplication. To further investigate the U1-U3 region, we conducted the first large-scale evolutionary analysis of a member of the Hart Park serogroup by analyzing over 100 spatially and temporally distinct FLAV isolates. Our phylogeographic analysis demonstrates that although FLAV appears to be slowly evolving, phylogenetically divergent lineages co-circulate sympatrically. © 2013 Published by Elsevier Inc.

  11. Exonic duplication CNV of NDRG1 associated with autosomal-recessive HMSN-Lom/CMT4D.

    PubMed

    Okamoto, Yuji; Goksungur, Meryem Tuba; Pehlivan, Davut; Beck, Christine R; Gonzaga-Jauregui, Claudia; Muzny, Donna M; Atik, Mehmed M; Carvalho, Claudia M B; Matur, Zeliha; Bayraktar, Serife; Boone, Philip M; Akyuz, Kaya; Gibbs, Richard A; Battaloglu, Esra; Parman, Yesim; Lupski, James R

    2014-05-01

    Copy-number variations as a mutational mechanism contribute significantly to human disease. Approximately one-half of the patients with Charcot-Marie-Tooth (CMT) disease have a 1.4 Mb duplication copy-number variation as the cause of their neuropathy. However, non-CMT1A neuropathy patients rarely have causative copy-number variations, and to date, autosomal-recessive disease has not been associated with copy-number variation as a mutational mechanism. We performed Agilent 8 × 60 K array comparative genomic hybridization on DNA from 12 recessive Turkish families with CMT disease. Additional molecular studies were conducted to detect breakpoint junctions and to evaluate gene expression levels in a family in which we detected an intragenic duplication copy-number variation. We detected an ~6.25 kb homozygous intragenic duplication in NDRG1, a gene known to be causative for recessive HMSNL/CMT4D, in three individuals from a Turkish family with CMT neuropathy. Further studies showed that this intragenic copy-number variation resulted in a homozygous duplication of exons 6-8 that caused decreased mRNA expression of NDRG1. Exon-focused high-resolution array comparative genomic hybridization enables the detection of copy-number variation carrier states in recessive genes, particularly small copy-number variations encompassing or disrupting single genes. In families for whom a molecular diagnosis has not been elucidated by conventional clinical assays, an assessment for copy-number variations in known CMT genes might be considered.

  12. Copy Number Heterogeneity, Large Origin Tandem Repeats, and Interspecies Recombination in Human Herpesvirus 6A (HHV-6A) and HHV-6B Reference Strains

    PubMed Central

    Roychoudhury, Pavitra; Makhsous, Negar; Hanson, Derek; Chase, Jill; Krueger, Gerhard; Xie, Hong; Huang, Meei-Li; Saunders, Lindsay; Ablashi, Dharam; Koelle, David M.; Cook, Linda; Jerome, Keith R.

    2018-01-01

    ABSTRACT Quantitative PCR is a diagnostic pillar for clinical virology testing, and reference materials are necessary for accurate, comparable quantitation between clinical laboratories. Accurate quantitation of human herpesvirus 6A/B (HHV-6A/B) is important for detection of viral reactivation and inherited chromosomally integrated HHV-6A/B in immunocompromised patients. Reference materials in clinical virology commonly consist of laboratory-adapted viral strains that may be affected by the culture process. We performed next-generation sequencing to make relative copy number measurements at single nucleotide resolution of eight candidate HHV-6A and seven HHV-6B reference strains and DNA materials from the HHV-6 Foundation and Advanced Biotechnologies Inc. Eleven of 17 (65%) HHV-6A/B candidate reference materials showed multiple copies of the origin of replication upstream of the U41 gene by next-generation sequencing. These large tandem repeats arose independently in culture-adapted HHV-6A and HHV-6B strains, measuring 1,254 bp and 983 bp, respectively. The average copy number measured was between 5 and 10 times the number of copies of the rest of the genome. We also report the first interspecies recombinant HHV-6A/B strain with a HHV-6A backbone and a >5.5-kb region from HHV-6B, from U41 to U43, that covered the origin tandem repeat. Specific HHV-6A reference strains demonstrated duplication of regions at U1/U2, U87, and U89, as well as deletion in the U12-to-U24 region and the U94/U95 genes. HHV-6A/B strains derived from cord blood mononuclear cells from different laboratories on different continents with fewer passages revealed no copy number differences throughout the viral genome. These data indicate that large origin tandem duplications are an adaptation of both HHV-6A and HHV-6B in culture and show interspecies recombination is possible within the Betaherpesvirinae. IMPORTANCE Anything in science that needs to be quantitated requires a standard unit of

  13. Prevalence and Spectrum of Large Deletions or Duplications in the Major Long QT Syndrome-Susceptibility Genes and Implications for Long QT Syndrome Genetic Testing

    PubMed Central

    Tester, David J.; Benton, Amber J.; Train, Laura; Deal, Barbara; Baudhuin, Linnea M.; Ackerman, Michael J.

    2010-01-01

    Long QT Syndrome (LQTS) is a cardiac channelopathy associated with syncope, seizures, and sudden death. Approximately 75% of LQTS is due to mutations in genes encoding for three cardiac ion channel alpha-subunits (LQT1-3). However, traditional mutational analyses have limited detection capabilities for atypical mutations such as large gene rearrangements. Here, we set out to determine the prevalence and spectrum of large deletions/duplications in the major LQTS-susceptibility genes among unrelated patients who were mutation-negative following point mutation analysis of LQT1-12-susceptibility genes. Forty-two unrelated clinically strong LQTS patients were analyzed using multiplex ligation-dependent probe amplification (MLPA), a quantitative fluorescent technique for detecting multiple exon deletions and duplications. The SALSA-MLPA LQTS Kit from MRC-Holland was used to analyze the three major LQTS-associated genes: KCNQ1, KCNH2, and SCN5A and the two minor genes: KCNE1 and KCNE2. Overall, 2 gene rearrangements were found in 2/42 (4.8%, CI, 1.7–11%) unrelated patients. A deletion of KCNQ1 exon 3 was identified in a 10 year-old Caucasian boy with a QTc of 660 milliseconds (ms), a personal history of exercise-induced syncope, and a family history of syncope. A deletion of KCNQ1 exon 7 was identified in a 17 year-old Caucasian girl with a QTc of 480 ms, a personal history of exercise-induced syncope, and a family history of sudden cardiac death. In conclusion, since nearly 5% of patients with genetically elusive LQTS had large genomic rearrangements involving the canonical LQTS-susceptibility genes, reflex genetic testing to investigate genomic rearrangements may be of clinical value. PMID:20920651

  14. Gene Duplication Leads to Altered Membrane Topology of a Cytochrome P450 Enzyme in Seed Plants.

    PubMed

    Renault, Hugues; De Marothy, Minttu; Jonasson, Gabriella; Lara, Patricia; Nelson, David R; Nilsson, IngMarie; André, François; von Heijne, Gunnar; Werck-Reichhart, Danièle

    2017-08-01

    Evolution of the phenolic metabolism was critical for the transition of plants from water to land. A cytochrome P450, CYP73, with cinnamate 4-hydroxylase (C4H) activity, catalyzes the first plant-specific and rate-limiting step in this pathway. The CYP73 gene is absent from green algae, and first detected in bryophytes. A CYP73 duplication occurred in the ancestor of seed plants and was retained in Taxaceae and most angiosperms. In spite of a clear divergence in primary sequence, both paralogs can fulfill comparable cinnamate hydroxylase roles both in vitro and in vivo. One of them seems dedicated to the biosynthesis of lignin precursors. Its N-terminus forms a single membrane spanning helix and its properties and length are highly constrained. The second is characterized by an elongated and variable N-terminus, reminiscent of ancestral CYP73s. Using as proxies the Brachypodium distachyon proteins, we show that the elongation of the N-terminus does not result in an altered subcellular localization, but in a distinct membrane topology. Insertion in the membrane of endoplasmic reticulum via a double-spanning open hairpin structure allows reorientation to the lumen of the catalytic domain of the protein. In agreement with participation to a different functional unit and supramolecular organization, the protein displays modified heme proximal surface. These data suggest the evolution of divergent C4H enzymes feeding different branches of the phenolic network in seed plants. It shows that specialization required for retention of gene duplicates may result from altered protein topology rather than change in enzyme activity. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. Genome-wide characterization of phenylalanine ammonia-lyase gene family in watermelon (Citrullus lanatus).

    PubMed

    Dong, Chun-Juan; Shang, Qing-Mao

    2013-07-01

    Phenylalanine ammonia-lyase (PAL), the first enzyme in the phenylpropanoid pathway, plays a critical role in plant growth, development, and adaptation. PAL enzymes are encoded by a gene family in plants. Here, we report a genome-wide search for PAL genes in watermelon. A total of 12 PAL genes, designated ClPAL1-12, are identified . Nine are arranged in tandem in two duplication blocks located on chromosomes 4 and 7, and the other three ClPAL genes are distributed as single copies on chromosomes 2, 3, and 8. Both the cDNA and protein sequences of ClPALs share an overall high identity with each other. A phylogenetic analysis places 11 of the ClPALs into a separate cucurbit subclade, whereas ClPAL2, which belongs to neither monocots nor dicots, may serve as an ancestral PAL in plants. In the cucurbit subclade, seven ClPALs form homologous pairs with their counterparts from cucumber. Expression profiling reveals that 11 of the ClPAL genes are expressed and show preferential expression in the stems and male and female flowers. Six of the 12 ClPALs are moderately or strongly expressed in the fruits, particularly in the pulp, suggesting the potential roles of PAL in the development of fruit color and flavor. A promoter motif analysis of the ClPAL genes implies redundant but distinctive cis-regulatory structures for stress responsiveness. Finally, duplication events during the evolution and expansion of the ClPAL gene family are discussed, and the relationships between the ClPAL genes and their cucumber orthologs are estimated.

  16. Ongoing resolution of duplicate gene functions shapes the diversification of a metabolic network

    PubMed Central

    Kuang, Meihua Christina; Hutchins, Paul D; Russell, Jason D; Coon, Joshua J; Hittinger, Chris Todd

    2016-01-01

    The evolutionary mechanisms leading to duplicate gene retention are well understood, but the long-term impacts of paralog differentiation on the regulation of metabolism remain underappreciated. Here we experimentally dissect the functions of two pairs of ancient paralogs of the GALactose sugar utilization network in two yeast species. We show that the Saccharomyces uvarum network is more active, even as over-induction is prevented by a second co-repressor that the model yeast Saccharomyces cerevisiae lacks. Surprisingly, removal of this repression system leads to a strong growth arrest, likely due to overly rapid galactose catabolism and metabolic overload. Alternative sugars, such as fructose, circumvent metabolic control systems and exacerbate this phenotype. We further show that S. cerevisiae experiences homologous metabolic constraints that are subtler due to how the paralogs have diversified. These results show how the functional differentiation of paralogs continues to shape regulatory network architectures and metabolic strategies long after initial preservation. DOI: http://dx.doi.org/10.7554/eLife.19027.001 PMID:27690225

  17. Ongoing resolution of duplicate gene functions shapes the diversification of a metabolic network

    DOE PAGES

    Kuang, Meihua Christina; Hutchins, Paul D.; Russell, Jason D.; ...

    2016-09-30

    The evolutionary mechanisms leading to duplicate gene retention are well understood, but the long-term impacts of paralog differentiation on the regulation of metabolism remain underappreciated. Here we experimentally dissect the functions of two pairs of ancient paralogs of theGALactose sugar utilization network in two yeast species. Here, we show that theSaccharomyces uvarumnetwork is more active, even as over-induction is prevented by a second co-repressor that the model yeastSaccharomyces cerevisiaelacks. Surprisingly, removal of this repression system leads to a strong growth arrest, likely due to overly rapid galactose catabolism and metabolic overload. Alternative sugars, such as fructose, circumvent metabolic control systemsmore » and exacerbate this phenotype. Furthermore, we show thatS. cerevisiaeexperiences homologous metabolic constraints that are subtler due to how the paralogs have diversified. Our results show how the functional differentiation of paralogs continues to shape regulatory network architectures and metabolic strategies long after initial preservation.« less

  18. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    PubMed

    Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

    2013-01-01

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  19. Evolution of Genes Involved in Gamete Interaction: Evidence for Positive Selection, Duplications and Losses in Vertebrates

    PubMed Central

    Callebaut, Isabelle; Laurin, Michel; Pascal, Géraldine; Poupon, Anne; Goudet, Ghylène; Monget, Philippe

    2012-01-01

    Genes encoding proteins involved in sperm-egg interaction and fertilization exhibit a particularly fast evolution and may participate in prezygotic species isolation [1], [2]. Some of them (ZP3, ADAM1, ADAM2, ACR and CD9) have individually been shown to evolve under positive selection [3], [4], suggesting a role of positive Darwinian selection on sperm-egg interaction. However, the genes involved in this biological function have not been systematically and exhaustively studied with an evolutionary perspective, in particular across vertebrates with internal and external fertilization. Here we show that 33 genes among the 69 that have been experimentally shown to be involved in fertilization in at least one taxon in vertebrates are under positive selection. Moreover, we identified 17 pseudogenes and 39 genes that have at least one duplicate in one species. For 15 genes, we found neither positive selection, nor gene copies or pseudogenes. Genes of teleosts, especially genes involved in sperm-oolemma fusion, appear to be more frequently under positive selection than genes of birds and eutherians. In contrast, pseudogenization, gene loss and gene gain are more frequent in eutherians. Thus, each of the 19 studied vertebrate species exhibits a unique signature characterized by gene gain and loss, as well as position of amino acids under positive selection. Reflecting these clade-specific signatures, teleosts and eutherian mammals are recovered as clades in a parsimony analysis. Interestingly the same analysis places Xenopus apart from teleosts, with which it shares the primitive external fertilization, and locates it along with amniotes (which share internal fertilization), suggesting that external or internal environmental conditions of germ cell interaction may not be the unique factors that drive the evolution of fertilization genes. Our work should improve our understanding of the fertilization process and on the establishment of reproductive barriers, for example by

  20. Creation of Mice Bearing a Partial Duplication of HPRT Gene Marked with a GFP Gene and Detection of Revertant Cells In Situ as GFP-Positive Somatic Cells.

    PubMed

    Noda, Asao; Suemori, Hirofumi; Hirai, Yuko; Hamasaki, Kanya; Kodama, Yoshiaki; Mitani, Hiroshi; Landes, Reid D; Nakamura, Nori

    2015-01-01

    It is becoming clear that apparently normal somatic cells accumulate mutations. Such accumulations or propagations of mutant cells are thought to be related to certain diseases such as cancer. To better understand the nature of somatic mutations, we developed a mouse model that enables in vivo detection of rare genetically altered cells via GFP positive cells. The mouse model carries a partial duplication of 3' portion of X-chromosomal HPRT gene and a GFP gene at the end of the last exon. In addition, although HPRT gene expression was thought ubiquitous, the expression level was found insufficient in vivo to make the revertant cells detectable by GFP positivity. To overcome the problem, we replaced the natural HPRT-gene promoter with a CAG promoter. In such animals, termed HPRT-dup-GFP mouse, losing one duplicated segment by crossover between the two sister chromatids or within a single molecule of DNA reactivates gene function, producing hybrid HPRT-GFP proteins which, in turn, cause the revertant cells to be detected as GFP-positive cells in various tissues. Frequencies of green mutant cells were measured using fixed and frozen sections (liver and pancreas), fixed whole mount (small intestine), or by means of flow cytometry (unfixed splenocytes). The results showed that the frequencies varied extensively among individuals as well as among tissues. X-ray exposure (3 Gy) increased the frequency moderately (~2 times) in the liver and small intestine. Further, in two animals out of 278 examined, some solid tissues showed too many GFP-positive cells to score (termed extreme jackpot mutation). Present results illustrated a complex nature of somatic mutations occurring in vivo. While the HPRT-dup-GFP mouse may have a potential for detecting tissue-specific environmental mutagens, large inter-individual variations of mutant cell frequency cause the results unstable and hence have to be reduced. This future challenge will likely involve lowering the background mutation

  1. Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping.

    PubMed

    U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul

    2007-03-30

    The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous typing methods. Given the health

  2. Genome-wide identification and analysis of the SBP-box family genes in apple (Malus × domestica Borkh.).

    PubMed

    Li, Jun; Hou, Hongmin; Li, Xiaoqin; Xiang, Jiang; Yin, Xiangjing; Gao, Hua; Zheng, Yi; Bassett, Carole L; Wang, Xiping

    2013-09-01

    SQUAMOSA promoter binding protein (SBP)-box genes encode a family of plant-specific transcription factors and play many crucial roles in plant development. In this study, 27 SBP-box gene family members were identified in the apple (Malus × domestica Borkh.) genome, 15 of which were suggested to be putative targets of MdmiR156. Plant SBPs were classified into eight groups according to the phylogenetic analysis of SBP-domain proteins. Gene structure, gene chromosomal location and synteny analyses of MdSBP genes within the apple genome demonstrated that tandem and segmental duplications, as well as whole genome duplications, have likely contributed to the expansion and evolution of the SBP-box gene family in apple. Additionally, synteny analysis between apple and Arabidopsis indicated that several paired homologs of MdSBP and AtSPL genes were located in syntenic genomic regions. Tissue-specific expression analysis of MdSBP genes in apple demonstrated their diversified spatiotemporal expression patterns. Most MdmiR156-targeted MdSBP genes, which had relatively high transcript levels in stems, leaves, apical buds and some floral organs, exhibited a more differential expression pattern than most MdmiR156-nontargeted MdSBP genes. Finally, expression analysis of MdSBP genes in leaves upon various plant hormone treatments showed that many MdSBP genes were responsive to different plant hormones, indicating that MdSBP genes may be involved in responses to hormone signaling during stress or in apple development. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  3. Duplicate editorial on duplicate publication.

    PubMed

    Corson, Stephen L; Decherney, Alan H

    2005-04-01

    The authors define and discuss the various forms taken by duplicate publications, and provide suggested remedies to help authors, editors, reviewers, and readers avoid this form of internal plagiarism.

  4. Prevalence and spectrum of large deletions or duplications in the major long QT syndrome-susceptibility genes and implications for long QT syndrome genetic testing.

    PubMed

    Tester, David J; Benton, Amber J; Train, Laura; Deal, Barbara; Baudhuin, Linnea M; Ackerman, Michael J

    2010-10-15

    Long QT syndrome (LQTS) is a cardiac channelopathy associated with syncope, seizures, and sudden death. Approximately 75% of LQTS is due to mutations in genes encoding for 3 cardiac ion channel α-subunits (LQT1 to LQT3). However, traditional mutational analyses have limited detection capabilities for atypical mutations such as large gene rearrangements. We set out to determine the prevalence and spectrum of large deletions/duplications in the major LQTS-susceptibility genes in unrelated patients who were mutation negative after point mutation analysis of LQT1- to LQT12-susceptibility genes. Forty-two unrelated, clinically strong LQTS patients were analyzed using multiplex ligation-dependent probe amplification, a quantitative fluorescent technique for detecting multiple exon deletions and duplications. The SALSA multiplex ligation-dependent probe amplification LQTS kit from MRC-Holland was used to analyze the 3 major LQTS-associated genes, KCNQ1, KCNH2, and SCN5A, and the 2 minor genes, KCNE1 and KCNE2. Overall, 2 gene rearrangements were found in 2 of 42 unrelated patients (4.8%, confidence interval 1.7 to 11). A deletion of KCNQ1 exon 3 was identified in a 10-year-old Caucasian boy with a corrected QT duration of 660 ms, a personal history of exercise-induced syncope, and a family history of syncope. A deletion of KCNQ1 exon 7 was identified in a 17-year-old Caucasian girl with a corrected QT duration of 480 ms, a personal history of exercise-induced syncope, and a family history of sudden cardiac death. In conclusion, because nearly 5% of patients with genetically elusive LQTS had large genomic rearrangements involving the canonical LQTS-susceptibility genes, reflex genetic testing to investigate genomic rearrangements may be of clinical value. Copyright © 2010 Elsevier Inc. All rights reserved.

  5. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types which consist of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on its 11 linkage groups (LGs). The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. A case report: Becker muscular dystrophy presenting with epilepsy and dysgnosia induced by duplication mutation of Dystrophin gene.

    PubMed

    Miao, Jing; Feng, Jia-Chun; Zhu, Dan; Yu, Xue-Fan

    2016-12-12

    Becker muscular dystrophy (BMD), a genetic disorder of X-linked recessive inheritance, typically presents with gradually progressive muscle weakness. The condition is caused by mutations of Dystrophin gene located at Xp21.2. Epilepsy is an infrequent manifestation of BMD, while cases of BMD with dysgnosia are extremely rare. We describe a 9-year-old boy with BMD, who presented with epilepsy and dysgnosia. Serum creatine kinase level was markedly elevated (3665 U/L). Wechsler intelligence tests showed a low intelligence quotient (IQ = 65). Electromyogram showed slight myogenic changes and skeletal muscle biopsy revealed muscular dystrophy. Immunohistochemical staining showed partial positivity of sarcolemma for dystrophin-N. Multiplex ligation-dependent probe amplification revealed a duplication mutation in exons 37-44 in the Dystrophin gene. The present case report helps to better understand the clinical and genetic features of BMD.

  7. Transcriptome analysis reveals the time of the fourth round of genome duplication in common carp (Cyprinus carpio)

    PubMed Central

    2012-01-01

    Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280

  8. Comprehensive review of the duplication 3q syndrome and report of a patient with Currarino syndrome and de novo duplication 3q26.32-q27.2.

    PubMed

    Dworschak, G C; Crétolle, C; Hilger, A; Engels, H; Korsch, E; Reutter, H; Ludwig, M

    2017-05-01

    Partial duplications of the long arm of chromosome 3, dup(3q), are a rare but well-described condition, sharing features of Cornelia de Lange syndrome. Around two thirds of cases are derived from unbalanced translocations, whereas pure dup(3q) have rarely been reported. Here, we provide an extensive review of the literature on dup(3q). This search revealed several patients with caudal malformations and anomalies, suggesting that caudal malformations or anomalies represent an inherent phenotypic feature of dup(3q). In this context, we report a patient with a pure de novo duplication 3q26.32-q27.2. The patient had the clinical diagnosis of Currarino syndrome (CS) (characterized by the triad of sacral anomalies, anorectal malformations and a presacral mass) and additional features, frequently detected in patients with a dup(3q). Mutations within the MNX1 gene were found to be causative in CS but no MNX1 mutation could be detected in our patient. Our comprehensive search for candidate genes located in the critical region of the duplication 3q syndrome, 3q26.3-q27, revealed a so far neglected phenotypic overlap of dup(3q) and the Pierpont syndrome, associated with a mutation of the TBL1XR1 gene on 3q26.32. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  9. Phylogenetic analysis of IDD gene family and characterization of its expression in response to flower induction in Malus.

    PubMed

    Fan, Sheng; Zhang, Dong; Xing, Libo; Qi, Siyan; Du, Lisha; Wu, Haiqin; Shao, Hongxia; Li, Youmei; Ma, Juanjuan; Han, Mingyu

    2017-08-01

    Although INDETERMINATE DOMAIN (IDD) genes encoding specific plant transcription factors have important roles in plant growth and development, little is known about apple IDD (MdIDD) genes and their potential functions in the flower induction. In this study, we identified 20 putative IDD genes in apple and named them according to their chromosomal locations. All identified MdIDD genes shared a conserved IDD domain. A phylogenetic analysis separated MdIDDs and other plant IDD genes into four groups. Bioinformatic analysis of chemical characteristics, gene structure, and prediction of protein-protein interactions demonstrated the functional and structural diversity of MdIDD genes. To further uncover their potential functions, we performed analysis of tandem, synteny, and gene duplications, which indicated several paired homologs of IDD genes between apple and Arabidopsis. Additionally, genome duplications also promoted the expansion and evolution of the MdIDD genes. Quantitative real-time PCR revealed that all the MdIDD genes showed distinct expression levels in five different tissues (stems, leaves, buds, flowers, and fruits). Furthermore, the expression levels of candidate MdIDD genes were also investigated in response to various circumstances, including GA treatment (decreased the flowering rate), sugar treatment (increased the flowering rate), alternate-bearing conditions, and two varieties with different-flowering intensities. Parts of them were affected by exogenous treatments and showed different expression patterns. Additionally, changes in response to alternate-bearing and different-flowering varieties of apple trees indicated that they were also responsive to flower induction. Taken together, our comprehensive analysis provided valuable information for further analysis of IDD genes aiming at flower induction.

  10. Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes.

    PubMed

    Karn, Robert C; Laukaitis, Christina M

    2014-08-01

    In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.

  11. Duplication in DNA Sequences

    NASA Astrophysics Data System (ADS)

    Ito, Masami; Kari, Lila; Kincaid, Zachary; Seki, Shinnosuke

    The duplication and repeat-deletion operations are the basis of a formal language theoretic model of errors that can occur during DNA replication. During DNA replication, subsequences of a strand of DNA may be copied several times (resulting in duplications) or skipped (resulting in repeat-deletions). As formal language operations, iterated duplication and repeat-deletion of words and languages have been well studied in the literature. However, little is known about single-step duplications and repeat-deletions. In this paper, we investigate several properties of these operations, including closure properties of language families in the Chomsky hierarchy and equations involving these operations. We also make progress toward a characterization of regular languages that are generated by duplicating a regular language.

  12. The sea lamprey meiotic map improves resolution of ancient vertebrate genome duplications.

    PubMed

    Smith, Jeramiah J; Keinath, Melissa C

    2015-08-01

    It is generally accepted that many genes present in vertebrate genomes owe their origin to two whole-genome duplications that occurred deep in the ancestry of the vertebrate lineage. However, details regarding the timing and outcome of these duplications are not well resolved. We present high-density meiotic and comparative genomic maps for the sea lamprey (Petromyzon marinus), a representative of an ancient lineage that diverged from all other vertebrates ∼550 million years ago. Linkage analyses yielded a total of 95 linkage groups, similar to the estimated number of germline chromosomes (1n ∼ 99), spanning a total of 5570.25 cM. Comparative mapping data yield strong support for the hypothesis that a single whole-genome duplication occurred in the basal vertebrate lineage, but do not strongly support a hypothetical second event. Rather, these comparative maps reveal several evolutionarily independent segmental duplications occurring over the last 600+ million years of chordate evolution. This refined history of vertebrate genome duplication should permit more precise investigations of vertebrate evolution. © 2015 Smith and Keinath; Published by Cold Spring Harbor Laboratory Press.

  13. Two Functional Copies of the DGCR6 Gene Are Present on Human Chromosome 22q11 Due to a Duplication of an Ancestral Locus

    PubMed Central

    Edelmann, Lisa; Stankiewicz, Pavel; Spiteri, Elizabeth; Pandita, Raj K.; Shaffer, Lisa; Lupski, James; Morrow, Bernice E.

    2001-01-01

    The DGCR6 (DiGeorge critical region) gene encodes a putative protein with sequence similarity to gonadal (gdl), a Drosophila melanogaster gene of unknown function. We mapped the DGCR6 gene to chromosome 22q11 within a low copy repeat, termed sc11.1a, and identified a second copy of the gene, DGCR6L, within the duplicate locus, termed sc11.1b. Both sc11.1 repeats are deleted in most persons with velo-cardio-facial syndrome/DiGeorge syndrome (VCFS/DGS), and they map immediately adjacent and internal to the low copy repeats, termed LCR22, that mediate the deletions associated with VCFS/DGS. We sequenced genomic clones from both loci and determined that the putative initiator methionine is located further upstream than originally described, but in a position similar to the mouse and chicken orthologs. DGCR6L encodes a highly homologous, functional copy of DGCR6, with some base changes rendering amino acid differences. Expression studies of the two genes indicate that both genes are widely expressed in fetal and adult tissues. Evolutionary studies using FISH mapping in several different species of ape combined with sequence analysis of DGCR6 in a number of different primate species indicate that the duplication is at least 12 million years old and may date back to before the divergence of Catarrhines from Platyrrhines, 35 mya. These data suggest that there has been selective evolutionary pressure toward the functional maintenance of both paralogs. Interestingly, a full-length HERV-K provirus integrated into the sc11.1a locus after the divergence of chimpanzees and humans. PMID:11157784

  14. The central domain of bovine submaxillary mucin consists of over 50 tandem repeats of 329 amino acids. Chromosomal localization of the BSM1 gene and relations to ovine and porcine counterparts.

    PubMed

    Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P

    2000-04-01

    We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.

  15. Characterization and Evolution of Conserved MicroRNA through Duplication Events in Date Palm (Phoenix dactylifera)

    PubMed Central

    Yang, Yaodong; Mason, Annaliese S.; Lei, Xintao; Ma, Zilong

    2013-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events. PMID:23951162

  16. Characterization and evolution of conserved MicroRNA through duplication events in date palm (Phoenix dactylifera).

    PubMed

    Xiao, Yong; Xia, Wei; Yang, Yaodong; Mason, Annaliese S; Lei, Xintao; Ma, Zilong

    2013-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression at the post-transcriptional level in a wide range of species. Highly conserved miRNAs regulate ancestral transcription factors common to all plants, and control important basic processes such as cell division and meristem function. We selected 21 conserved miRNA families to analyze the distribution and maintenance of miRNAs. Recently, the first genome sequence in Palmaceae was released: date palm (Phoenix dactylifera). We conducted a systematic miRNA analysis in date palm, computationally identifying and characterizing the distribution and duplication of conserved miRNAs in this species compared to other published plant genomes. A total of 81 miRNAs belonging to 18 miRNA families were identified in date palm. The majority of miRNAs in date palm and seven other well-studied plant species were located in intergenic regions and located 4 to 5 kb away from the nearest protein-coding genes. Sequence comparison showed that 67% of date palm miRNA members were present in duplicated segments, and that 135 pairs of miRNA-containing segments were duplicated in Arabidopsis, tomato, orange, rice, apple, poplar and soybean with a high similarity of non coding sequences between duplicated segments, indicating genomic duplication was a major force for expansion of conserved miRNAs. Duplicated miRNA pairs in date palm showed divergence in pre-miRNA sequence and in number of promoters, implying that these duplicated pairs may have undergone divergent evolution. Comparisons between date palm and the seven other plant species for the gain/loss of miR167 loci in an ancient segment shared between monocots and dicots suggested that these conserved miRNAs were highly influenced by and diverged as a result of genomic duplication events.

  17. A computational method for estimating the PCR duplication rate in DNA and RNA-seq experiments.

    PubMed

    Bansal, Vikas

    2017-03-14

    PCR amplification is an important step in the preparation of DNA sequencing libraries prior to high-throughput sequencing. PCR amplification introduces redundant reads in the sequence data and estimating the PCR duplication rate is important to assess the frequency of such reads. Existing computational methods do not distinguish PCR duplicates from "natural" read duplicates that represent independent DNA fragments and therefore, over-estimate the PCR duplication rate for DNA-seq and RNA-seq experiments. In this paper, we present a computational method to estimate the average PCR duplication rate of high-throughput sequence datasets that accounts for natural read duplicates by leveraging heterozygous variants in an individual genome. Analysis of simulated data and exome sequence data from the 1000 Genomes project demonstrated that our method can accurately estimate the PCR duplication rate on paired-end as well as single-end read datasets which contain a high proportion of natural read duplicates. Further, analysis of exome datasets prepared using the Nextera library preparation method indicated that 45-50% of read duplicates correspond to natural read duplicates likely due to fragmentation bias. Finally, analysis of RNA-seq datasets from individuals in the 1000 Genomes project demonstrated that 70-95% of read duplicates observed in such datasets correspond to natural duplicates sampled from genes with high expression and identified outlier samples with a 2-fold greater PCR duplication rate than other samples. The method described here is a useful tool for estimating the PCR duplication rate of high-throughput sequence datasets and for assessing the fraction of read duplicates that correspond to natural read duplicates. An implementation of the method is available at https://github.com/vibansal/PCRduplicates .

  18. Genome specific PPARαB duplicates in salmonids and insights into estrogenic regulation in brown trout.

    PubMed

    Madureira, Tânia Vieira; Pinheiro, Ivone; de Paula Freire, Rafaelle; Rocha, Eduardo; Castro, Luis Filipe; Urbatzka, Ralph

    2017-06-01

    Peroxisome proliferator-activated receptors (PPARs) are key regulators of many processes in vertebrates, such as carbohydrate and lipid metabolism. PPARα, a member of the PPAR nuclear receptor gene subfamily (NR1C1), is involved in fatty acid metabolism, namely in peroxisomal β-oxidation. Two gene paralogues, pparαA and pparαB, were described in several teleost species with their origin dating back to the teleost-specific genome duplication (3R). Given the additional salmonid-specific genome duplication (4R), four genes could be theoretically anticipated for this gene subfamily. In this work, we examined the pparα gene repertoire in brown trout, Salmo trutta f. fario. Data disclosed two pparα-like sequences in brown trout. Phylogenetic analyses further revealed that the isolated genes are most likely genome pparαB duplicates, pparαBa and pparαBb, while pparαA is apparently absent in salmonids. Both genes showed a ubiquitous mRNA expression across a panel of 11 different organs. In vitro exposed primary brown trout hepatocytes strongly suggest that pparα gene paralogues are differently regulated by ethinylestradiol (EE2). PparαBb mRNA expression significantly decreased with dosage, reaching significance after exposure to 50μM EE2, while pparαBa mRNA increased, significant at 1μM EE2. The present data enhances the understanding of pparα function and evolution in teleost, and reinforces the evidence of a potential crosstalk between estrogenic and pparα signaling pathways. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. Mitogenomes of two neotropical bird species and the multiple independent origin of mitochondrial gene orders in Passeriformes.

    PubMed

    Caparroz, Renato; Rocha, Amanda V; Cabanne, Gustavo S; Tubaro, Pablo; Aleixo, Alexandre; Lemmon, Emily M; Lemmon, Alan R

    2018-06-01

    At least four mitogenome arrangements occur in Passeriformes and differences among them are derived from an initial tandem duplication involving a segment containing the control region (CR), followed by loss or reduction of some parts of this segment. However, it is still unclear how often duplication events have occurred in this bird order. In this study, the mitogenomes from two species of Neotropical passerines (Sicalis olivascens and Lepidocolaptes angustirostris) with different gene arrangements were first determined. We also estimated how often duplication events occurred in Passeriformes and if the two CR copies demonstrate a pattern of concerted evolution in Sylvioidea. One tissue sample for each species was used to obtain the mitogenomes as a byproduct using next generation sequencing. The evolutionary history of mitogenome rearrangements was reconstructed mapping these characters onto a mitogenome Bayesian phylogenetic tree of Passeriformes. Finally, we performed a Bayesian analysis for both CRs from some Sylvioidea species in order to evaluate the evolutionary process involving these two copies. Both mitogenomes described comprise 2 rRNAs, 22 tRNAs, 13 protein-codon genes and the CR. However, S. olivascens has 16,768 bp showing the ancestral avian arrangement, while L. angustirostris has 16,973 bp and the remnant CR2 arrangement. Both species showed the expected gene order compared to their closest relatives. The ancestral state reconstruction suggesting at least six independent duplication events followed by partial deletions or loss of one copy in some lineages. Our results also provide evidence that both CRs in some Sylvioidea species seem to be maintained in an apparently functional state, perhaps by concerted evolution, and that this mechanism may be important for the evolution of the bird mitogenome.

  20. Comparison and Evaluation of Clustering Algorithms for Tandem Mass Spectra.

    PubMed

    Rieder, Vera; Schork, Karin U; Kerschke, Laura; Blank-Landeshammer, Bernhard; Sickmann, Albert; Rahnenführer, Jörg

    2017-11-03

    In proteomics, liquid chromatography-tandem mass spectrometry (LC-MS/MS) is established for identifying peptides and proteins. Duplicated spectra, that is, multiple spectra of the same peptide, occur both in single MS/MS runs and in large spectral libraries. Clustering tandem mass spectra is used to find consensus spectra, with manifold applications. First, it speeds up database searches, as performed for instance by Mascot. Second, it helps to identify novel peptides across species. Third, it is used for quality control to detect wrongly annotated spectra. We compare different clustering algorithms based on the cosine distance between spectra. CAST, MS-Cluster, and PRIDE Cluster are popular algorithms to cluster tandem mass spectra. We add well-known algorithms for large data sets, hierarchical clustering, DBSCAN, and connected components of a graph, as well as the new method N-Cluster. All algorithms are evaluated on real data with varied parameter settings. Cluster results are compared with each other and with peptide annotations based on validation measures such as purity. Quality control, regarding the detection of wrongly (un)annotated spectra, is discussed for exemplary resulting clusters. N-Cluster proves to be highly competitive. All clustering results benefit from the so-called DISMS2 filter that integrates additional information, for example, on precursor mass.

  1. Ancestral genomic duplication of the insulin gene in tilapia: An analysis of possible implications for clinical islet xenotransplantation using donor islets from transgenic tilapia expressing a humanized insulin gene.

    PubMed

    Hrytsenko, Olga; Pohajdak, Bill; Wright, James R

    2016-07-03

    Tilapia, a teleost fish, have multiple large anatomically discrete islets which are easy to harvest, and when transplanted into diabetic murine recipients, provide normoglycemia and mammalian-like glucose tolerance profiles. Tilapia insulin differs structurally from human insulin which could preclude their use as islet donors for xenotransplantation. Therefore, we produced transgenic tilapia with islets expressing a humanized insulin gene. It is now known that fish genomes may possess an ancestral duplication and so tilapia may have a second insulin gene. Therefore, we cloned, sequenced, and characterized the tilapia insulin 2 transcript and found that its expression is negligible in islets, is not islet-specific, and would not likely need to be silenced in our transgenic fish.

  2. Ancestral genomic duplication of the insulin gene in tilapia: An analysis of possible implications for clinical islet xenotransplantation using donor islets from transgenic tilapia expressing a humanized insulin gene

    PubMed Central

    Hrytsenko, Olga; Pohajdak, Bill; Wright, James R.

    2016-01-01

    ABSTRACT Tilapia, a teleost fish, have multiple large anatomically discrete islets which are easy to harvest, and when transplanted into diabetic murine recipients, provide normoglycemia and mammalian-like glucose tolerance profiles. Tilapia insulin differs structurally from human insulin which could preclude their use as islet donors for xenotransplantation. Therefore, we produced transgenic tilapia with islets expressing a humanized insulin gene. It is now known that fish genomes may possess an ancestral duplication and so tilapia may have a second insulin gene. Therefore, we cloned, sequenced, and characterized the tilapia insulin 2 transcript and found that its expression is negligible in islets, is not islet-specific, and would not likely need to be silenced in our transgenic fish. PMID:27222321

  3. A decamer duplication in the 3′ region of the BRI gene originates an amyloid peptide that is associated with dementia in a Danish kindred

    PubMed Central

    Vidal, Ruben; Révész, Tamas; Rostagno, Agueda; Kim, Eugene; Holton, Janice L.; Bek, Toke; Bojsen-Møller, Marie; Braendgaard, Hans; Plant, Gordon; Ghiso, Jorge; Frangione, Blas

    2000-01-01

    Familial Danish dementia (FDD), also known as heredopathia ophthalmo-oto-encephalica, is an autosomal dominant disorder characterized by cataracts, deafness, progressive ataxia, and dementia. Neuropathological findings include severe widespread cerebral amyloid angiopathy, hippocampal plaques, and neurofibrillary tangles, similar to Alzheimer's disease. N-terminal sequence analysis of isolated leptomeningeal amyloid fibrils revealed homology to ABri, the peptide originated by a point mutation at the stop codon of gene BRI in familial British dementia. Molecular genetic analysis of the BRI gene in the Danish kindred showed a different defect, namely the presence of a 10-nt duplication (795–796insTTTAATTTGT) between codons 265 and 266, one codon before the normal stop codon 267. The decamer duplication mutation produces a frame-shift in the BRI sequence generating a larger-than-normal precursor protein, of which the amyloid subunit (designated ADan) comprises the last 34 C-terminal amino acids. This de novo-created amyloidogenic peptide, associated with a genetic defect in the Danish kindred, stresses the importance of amyloid formation as a causative factor in neurodegeneration and dementia. PMID:10781099

  4. Facial duplication: case, review, and embryogenesis.

    PubMed

    Barr, M

    1982-04-01

    The craniofacial anatomy of an infant with facial duplication is described. There were four eyes, two noses, two maxillae, and one mandible. Anterior to the single pituitary the brain was duplicated and there was bilateral arhinencephaly. Portions of the brain were extruded into a large frontal encephalocele. Cases of symmetrical facial duplication reported in the literature range from two complete faces on a single head (diprosopus) to simple nasal duplication. The variety of patterns of duplication suggests that the doubling of facial components arises in several different ways: Forking of the notochord, duplication of the prosencephalon, duplication of the olfactory placodes, and duplication of maxillary and/or mandibular growth centers around the margins of the stomatodeal plate. Among reported cases, the female:male ratio is 2:1.

  5. Genomic organization, phylogenetic comparison, and expression profiles of the SPL family genes and their regulation in soybean.

    PubMed

    Tripathi, Rajiv K; Goel, Ridhi; Kumari, Sweta; Dahuja, Anil

    2017-03-01

    SQUAMOSA Promoter-Binding Protein-Like (SPL) genes form a major family of plant-specific transcription factors and play an important role in plant growth and development. In this study, we report the identification of 41 SPL genes (GmSPLs) in the soybean genome. Phylogenetic analysis revealed that these genes were divided into five groups (groups 1-5). Further, exon/intron structure and motif composition revealed that the GmSPL genes are conserved within their same group. The N-terminal zinc finger 1 (Zn1) of the SBP domain was a CCCH (Cys3His1) and the C terminus zinc finger 2 (Zn2) was a CCHC (Cys2HisCys) type. The 41 GmSPL genes were distributed unevenly on 17 of the 20 chromosomes, with tandem and segmental duplication events. We found that segmental duplication has made an important contribution to soybean SPL gene family expansion. The Ka/Ks ratios revealed that the duplicated GmSPL genes evolved under the effect of purifying selection. In addition, 17 of the 41 GmSPLs were found as targets of miR156; these might be involved in their posttranscriptional regulation through miR156. Importantly, RLM-RACE analysis confirmed the GmmiR156-mediated cleavage of GmSPL2a transcript in 2-4 mm stage of soybean seed. Alternative splicing events in 9 GmSPLs were detected which produces transcripts and proteins of different lengths that may modulate protein signaling, binding, localization, stability, and other properties. Expression analysis of the soybean SPL genes in various tissues and different developmental stages of seed suggested distinct spatiotemporal patterns. Differences in the expression patterns of miR156-targeted and miR156-non-targeted soybean SPL genes suggest that miR156 plays key functions in soybean development. Our results provide an important foundation for further uncovering the crucial roles of GmSPLs in the development of soybean and other biological processes.

  6. Pyloric duplications: review and case study.

    PubMed

    Cooper, S; Abrams, R S; Carbaugh, R A

    1995-12-01

    Gastric duplications are unusual congenital anomalies that often require surgical treatment. Pyloric duplications are particularly rare; few are reported in the English literature. This article reviews the literature on pyloric duplications and describes a pyloric duplication associated with hypertrophic pyloric stenosis in a 5-week-old child and a duplication that recurred 7 years later.

  7. Segmental duplications: evolution and impact among the current Lepidoptera genomes.

    PubMed

    Zhao, Qian; Ma, Dongna; Vasseur, Liette; You, Minsheng

    2017-07-06

    Structural variation among genomes is now viewed to be as important as single nucleoid polymorphisms in influencing the phenotype and evolution of a species. Segmental duplication (SD) is defined as segments of DNA with homologous sequence. Here, we performed a systematic analysis of segmental duplications (SDs) among five lepidopteran reference genomes (Plutella xylostella, Danaus plexippus, Bombyx mori, Manduca sexta and Heliconius melpomene) to understand their potential impact on the evolution of these species. We find that the SDs content differed substantially among species, ranging from 1.2% of the genome in B. mori to 15.2% in H. melpomene. Most SDs formed very high identity (similarity higher than 90%) blocks but had very few large blocks. Comparative analysis showed that most of the SDs arose after the divergence of each linage and we found that P. xylostella and H. melpomene showed more duplications than other species, suggesting they might be able to tolerate extensive levels of variation in their genomes. Conserved ancestral and species specific SD events were assessed, revealing multiple examples of the gain, loss or maintenance of SDs over time. SDs content analysis showed that most of the genes embedded in SDs regions belonged to species-specific SDs ("Unique" SDs). Functional analysis of these genes suggested their potential roles in the lineage-specific evolution. SDs and flanking regions often contained transposable elements (TEs) and this association suggested some involvement in SDs formation. Further studies on comparison of gene expression level between SDs and non-SDs showed that the expression level of genes embedded in SDs was significantly lower, suggesting that structure changes in the genomes are involved in gene expression differences in species. The results showed that most of the SDs were "unique SDs", which originated after species formation. Functional analysis suggested that SDs might play different roles in different species. Our

  8. De novo direct duplication of chromosome segment 22q11.2-q13.1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fujimoto, Atsuko; Lin, Ming S.

    Lindsay et al. [1995] reported a case of de novo duplication of the segment 22q11-q12. Molecular cytogenetics studies showed that the segment includes the regions responsible for the {open_quotes}cat eye,{close_quotes} DiGeorge, and velo-cardio-facial syndrome, and extends distal to the breakpoint cluster region. The phenotype was milder than that of complete trisomy 22 and der(22)t(11;22) (q23;q11) syndrome and was similar in type and severity to that of {open_quotes}cat eye{close_quotes} syndrome (CES). They suggested that trisomy of gene(s) responsible for the CES might have a predominant phenotypic effect over other genes present in the region duplicated in their patient. 3 refs., 2more » figs.« less

  9. Cyclic nucleotide-gated ion channel gene family in rice, identification, characterization and experimental analysis of expression response to plant hormones, biotic and abiotic stresses.

    PubMed

    Nawaz, Zarqa; Kakar, Kaleem Ullah; Saand, Mumtaz A; Shu, Qing-Yao

    2014-10-04

    Cyclic nucleotide-gated channels (CNGCs) are Ca2+-permeable cation transport channels, which are present in both animal and plant systems. They have been implicated in the uptake of both essential and toxic cations, Ca2+ signaling, pathogen defense, and thermotolerance in plants. To date there has not been a genome-wide overview of the CNGC gene family in any economically important crop, including rice (Oryza sativa L.). There is an urgent need for a thorough genome-wide analysis and experimental verification of this gene family in rice. In this study, a total of 16 full length rice CNGC genes distributed on chromosomes 1-6, 9 and 12, were identified by employing comprehensive bioinformatics analyses. Based on phylogeny, the family of OsCNGCs was classified into four major groups (I-IV) and two sub-groups (IV-A and IV- B). Likewise, the CNGCs from all plant lineages clustered into four groups (I-IV), where group II was conserved in all land plants. Gene duplication analysis revealed that both chromosomal segmentation (OsCNGC1 and 2, 10 and 11, 15 and 16) and tandem duplications (OsCNGC1 and 2) significantly contributed to the expansion of this gene family. Motif composition and protein sequence analysis revealed that the CNGC specific domain "cyclic nucleotide-binding domain (CNBD)" comprises a "phosphate binding cassette" (PBC) and a "hinge" region that is highly conserved among the OsCNGCs. In addition, OsCNGC proteins also contain various other functional motifs and post-translational modification sites. We successively built a stringent motif: (LI-X(2)-[GS]-X-[FV]-X-G-[1]-ELL-X-W-X(12,22)-SA-X(2)-T-X(7)-[EQ]-AF-X-L) that recognizes the rice CNGCs specifically. Prediction of cis-acting regulatory elements in 5' upstream sequences and expression analyses through quantitative qPCR demonstrated that OsCNGC genes were highly responsive to multiple stimuli including hormonal (abscisic acid, indoleacetic acid, kinetin and ethylene), biotic (Pseudomonas fuscovaginae

  10. The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.

    PubMed

    Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M

    2013-05-29

    Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in

  11. Duplication of 17(p11.2p11.2) in a male child with autism and severe language delay.

    PubMed

    Nakamine, Alisa; Ouchanov, Leonid; Jiménez, Patricia; Manghi, Elina R; Esquivel, Marcela; Monge, Silvia; Fallas, Marietha; Burton, Barbara K; Szomju, Barbara; Elsea, Sarah H; Marshall, Christian R; Scherer, Stephen W; McInnes, L Alison

    2008-03-01

    Duplications of 17(p11.2p11.2) have been associated with various behavioral manifestations including attention deficits, obsessive-compulsive symptoms, autistic traits, and language delay. We are conducting a genetic study of autism and are screening all cases for submicroscopic chromosomal abnormalities, in addition to standard karyotyping, and fragile X testing. Using array-based comparative genomic hybridization analysis of data from the Affymetrix GeneChip(R) Human Mapping Array set, we detected a duplication of approximately 3.3 Mb on chromosome 17p11.2 in a male child with autism and severe expressive language delay. The duplication was confirmed by measuring the copy number of genomic DNA using quantitative polymerase chain reaction. Gene expression analyses revealed increased expression of three candidate genes for the Smith-Magenis neurobehavioral phenotype, RAI1, DRG2, and RASD1, in transformed lymphocytes from Case 81A, suggesting gene dosage effects. Our results add to a growing body of evidence suggesting that duplications of 17(p11.2p11.2) result in language delay as well as autism and related phenotypes. As Smith-Magenis syndrome is also associated with language delay, a gene involved in acquisition of language may lie within this interval. Whether a parent of origin effect, gender of the case, the presence of allelic variation, or changes in expression of genes outside the breakpoints influence the resultant phenotype remains to be determined. (c) 2007 Wiley-Liss, Inc.

  12. Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

    PubMed

    Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

    1991-11-20

    A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynopus pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.

  13. Parental Origin of Interstitial Duplications at 15q11.2-q13.3 in Schizophrenia and Neurodevelopmental Disorders

    PubMed Central

    Isles, Anthony R.; Ingason, Andrés; Lowther, Chelsea; Gawlick, Micha; Stöber, Gerald; Potter, Harry; Georgieva, Lyudmila; Pizzo, Lucilla; Ozaki, Norio; Kushima, Itaru; Ikeda, Masashi; Iwata, Nakao; Levinson, Douglas F.; Gejman, Pablo V.; Shi, Jianxin; Sanders, Alan R.; Duan, Jubao; Sisodiya, Sanjay; Costain, Gregory; Degenhardt, Franziska; Giegling, Ina; Rujescu, Dan; Hreidarsson, Stefan J.; Saemundsen, Evald; Ahn, Joo Wook; Ogilvie, Caroline; Stefansson, Hreinn; Stefansson, Kari; O’Donovan, Michael C.; Owen, Michael J.; Bassett, Anne; Kirov, George

    2016-01-01

    Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally

  14. Parental Origin of Interstitial Duplications at 15q11.2-q13.3 in Schizophrenia and Neurodevelopmental Disorders.

    PubMed

    Isles, Anthony R; Ingason, Andrés; Lowther, Chelsea; Walters, James; Gawlick, Micha; Stöber, Gerald; Rees, Elliott; Martin, Joanna; Little, Rosie B; Potter, Harry; Georgieva, Lyudmila; Pizzo, Lucilla; Ozaki, Norio; Aleksic, Branko; Kushima, Itaru; Ikeda, Masashi; Iwata, Nakao; Levinson, Douglas F; Gejman, Pablo V; Shi, Jianxin; Sanders, Alan R; Duan, Jubao; Willis, Joseph; Sisodiya, Sanjay; Costain, Gregory; Werge, Thomas M; Degenhardt, Franziska; Giegling, Ina; Rujescu, Dan; Hreidarsson, Stefan J; Saemundsen, Evald; Ahn, Joo Wook; Ogilvie, Caroline; Girirajan, Santhosh D; Stefansson, Hreinn; Stefansson, Kari; O'Donovan, Michael C; Owen, Michael J; Bassett, Anne; Kirov, George

    2016-05-01

    Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally

  15. Plasticity and innovation of regulatory mechanisms underlying seed oil content mediated by duplicated genes in the palaeopolyploid soybean.

    PubMed

    Zhang, Dajian; Zhao, Meixia; Li, Shuai; Sun, Lianjun; Wang, Weidong; Cai, Chunmei; Dierking, Emily C; Ma, Jianxin

    2017-06-01

    Many plants have undergone whole genome duplication (WGD). However, how regulatory networks underlying a particular trait are reshaped in polyploids has not been experimentally investigated. Here we show that the regulatory pathways modulating seed oil content, which involve WRINKLED1 (WRI1), LEAFY COTYLEDON1 (LEC1), and LEC2 in Arabidopsis, have been modified in the palaeopolyploid soybean. Such modifications include functional reduction of GmWRI1b of the GmWRI1a/GmWRI1b homoeologous pair relevant to WRI1, complementary non-allelic dosage effects of the GmLEC1a/GmLEC1b homoeologous pair relevant to LEC1, pseudogenization of the singleton GmLEC2 relevant to LEC2, and the rise of the LEC2-like function of GmABI3b, contrasting to its homoeolog GmABI3a, which maintains the ABSCISIC ACID INSENSITIVE 3 (ABI3)-like function in modulating seed maturation and dormancy. The function of GmABI3b in modulating seed oil biosynthesis was fulfilled by direct binding to a RY (CATGCA) cis-regulatory element in the GmWRI1a promoter, which was absent in the GmWRI1b promoter, resulting in reduction of the GmWRI1b expression. Nevertheless, the three regulators each exhibited similar intensities of purifying selection to their respective duplicates since these pairs were formed by a WGD event that is proposed to have occurred approximately 13 million years ago (mya), suggesting that the differentiation in spatiotemporal expression between the duplicated genes is more likely to be the outcome of neutral variation in regulatory sequences. This study thus exemplifies the plasticity, dynamics, and novelty of regulatory networks mediated by WGD. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  16. Craniofacial duplication: a case report.

    PubMed

    Suryawanshi, Pradeep; Deshpande, Mandar; Verma, Nitin; Mahendrakar, Vivek; Mahendrakar, Sandhya

    2013-09-01

    A craniofacial duplication or diprosopus is an unusual variant of conjoined twinning. The reported incidence is one in 180,000-15 million births and 35 cases have been reported till date. The phenotype is wide, with the partial duplication of a few facial structures to complete dicephalus. A complete duplication is associated with a high incidence of anomalies in the central nervous system, cardiovascular system, gastrointestinal system and the respiratory system, whereas no major anomalies are found in the infants with a partial duplication. A term baby with the features of a craniofacial duplication has been described, with the proposed theories on embryogenesis and a brief review of the literature.

  17. Specific duplication and dorsoventrally asymmetric expression patterns of Cycloidea-like genes in zygomorphic species of Ranunculaceae.

    PubMed

    Jabbour, Florian; Cossard, Guillaume; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine

    2014-01-01

    Floral bilateral symmetry (zygomorphy) has evolved several times independently in angiosperms from radially symmetrical (actinomorphic) ancestral states. Homologs of the Antirrhinum majus Cycloidea gene (Cyc) have been shown to control floral symmetry in diverse groups in core eudicots. In the basal eudicot family Ranunculaceae, there is a single evolutionary transition from actinomorphy to zygomorphy in the stem lineage of the tribe Delphinieae. We characterized Cyc homologs in 18 genera of Ranunculaceae, including the four genera of Delphinieae, in a sampling that represents the floral morphological diversity of this tribe, and reconstructed the evolutionary history of this gene family in Ranunculaceae. Within each of the two RanaCyL (Ranunculaceae Cycloidea-like) lineages previously identified, an additional duplication possibly predating the emergence of the Delphinieae was found, resulting in up to four gene copies in zygomorphic species. Expression analyses indicate that the RanaCyL paralogs are expressed early in floral buds and that the duration of their expression varies between species and paralog class. At most one RanaCyL paralog was expressed during the late stages of floral development in the actinomorphic species studied whereas all paralogs from the zygomorphic species were expressed, composing a species-specific identity code for perianth organs. The contrasted asymmetric patterns of expression observed in the two zygomorphic species is discussed in relation to their distinct perianth architecture.

  18. Specific Duplication and Dorsoventrally Asymmetric Expression Patterns of Cycloidea-Like Genes in Zygomorphic Species of Ranunculaceae

    PubMed Central

    Jabbour, Florian; Cossard, Guillaume; Le Guilloux, Martine; Sannier, Julie; Nadot, Sophie; Damerval, Catherine

    2014-01-01

    Floral bilateral symmetry (zygomorphy) has evolved several times independently in angiosperms from radially symmetrical (actinomorphic) ancestral states. Homologs of the Antirrhinum majus Cycloidea gene (Cyc) have been shown to control floral symmetry in diverse groups in core eudicots. In the basal eudicot family Ranunculaceae, there is a single evolutionary transition from actinomorphy to zygomorphy in the stem lineage of the tribe Delphinieae. We characterized Cyc homologs in 18 genera of Ranunculaceae, including the four genera of Delphinieae, in a sampling that represents the floral morphological diversity of this tribe, and reconstructed the evolutionary history of this gene family in Ranunculaceae. Within each of the two RanaCyL (Ranunculaceae Cycloidea-like) lineages previously identified, an additional duplication possibly predating the emergence of the Delphinieae was found, resulting in up to four gene copies in zygomorphic species. Expression analyses indicate that the RanaCyL paralogs are expressed early in floral buds and that the duration of their expression varies between species and paralog class. At most one RanaCyL paralog was expressed during the late stages of floral development in the actinomorphic species studied whereas all paralogs from the zygomorphic species were expressed, composing a species-specific identity code for perianth organs. The contrasted asymmetric patterns of expression observed in the two zygomorphic species is discussed in relation to their distinct perianth architecture. PMID:24752428

  19. Genomic evidence of gene duplication and adaptive evolution of Toll like receptors (TLR2 and TLR4) in reptiles.

    PubMed

    Shang, Shuai; Zhong, Huaming; Wu, Xiaoyang; Wei, Qinguo; Zhang, Huanxin; Chen, Jun; Chen, Yao; Tang, Xuexi; Zhang, Honghai

    2018-04-01

    Toll-like receptors (TLRs) encoded by the TLR multigene family play an important role in initial pathogen recognition in vertebrates. Among the TLRs, TLR2 and TLR4 may be of particular importance to reptiles. In order to study the evolutionary patterns and structural characteristics of TLRs, we explored the available genomes of several representative members of reptiles. 25 TLR2 genes and 19 TLR4 genes from reptiles were obtained in this study. Phylogenetic results showed that the TLR2 gene duplication occurred in several species. Evolutionary analysis by at least two methods identified 30 and 13 common positively selected codons in TLR2 and TLR4, respectively. Most positively selected sites of TLR2 and TLR4 were located in the Leucine-rich repeat (LRRs). Branch model analysis showed that TLR2 genes were under different evolutionary forces in reptiles, while the TLR4 genes showed no significant selection pressure. The different evolutionary adaptation of TLR2 and TLR4 among the reptiles might be due to their different function in recognizing bacteria. Overall, we explored the structure and evolution of TLR2 and TLR4 genes in reptiles for the first time. Our study revealed valuable information regarding TLR2 and TLR4 in reptiles, and provided novel insights into the conservation concern of natural populations. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Recommended nomenclature for five mammalian carboxylesterase gene families: human, mouse, and rat genes and proteins.

    PubMed

    Holmes, Roger S; Wright, Matthew W; Laulederkind, Stanley J F; Cox, Laura A; Hosokawa, Masakiyo; Imai, Teruko; Ishibashi, Shun; Lehner, Richard; Miyazaki, Masao; Perkins, Everett J; Potter, Phillip M; Redinbo, Matthew R; Robert, Jacques; Satoh, Tetsuo; Yamashita, Tetsuro; Yan, Bingfan; Yokoi, Tsuyoshi; Zechner, Rudolf; Maltais, Lois J

    2010-10-01

    Mammalian carboxylesterase (CES or Ces) genes encode enzymes that participate in xenobiotic, drug, and lipid metabolism in the body and are members of at least five gene families. Tandem duplications have added more genes for some families, particularly for mouse and rat genomes, which has caused confusion in naming rodent Ces genes. This article describes a new nomenclature system for human, mouse, and rat carboxylesterase genes that identifies homolog gene families and allocates a unique name for each gene. The guidelines of human, mouse, and rat gene nomenclature committees were followed and "CES" (human) and "Ces" (mouse and rat) root symbols were used followed by the family number (e.g., human CES1). Where multiple genes were identified for a family or where a clash occurred with an existing gene name, a letter was added (e.g., human CES4A; mouse and rat Ces1a) that reflected gene relatedness among rodent species (e.g., mouse and rat Ces1a). Pseudogenes were named by adding "P" and a number to the human gene name (e.g., human CES1P1) or by using a new letter followed by ps for mouse and rat Ces pseudogenes (e.g., Ces2d-ps). Gene transcript isoforms were named by adding the GenBank accession ID to the gene symbol (e.g., human CES1_AB119995 or mouse Ces1e_BC019208). This nomenclature improves our understanding of human, mouse, and rat CES/Ces gene families and facilitates research into the structure, function, and evolution of these gene families. It also serves as a model for naming CES genes from other mammalian species.