COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures
Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.; ...
2016-09-20
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
COOLAIR Antisense RNAs Form Evolutionarily Conserved Elaborate Secondary Structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hawkes, Emily J.; Hennelly, Scott P.; Novikova, Irina V.
There is considerable debate about the functionality of long non-coding RNAs (lncRNAs). Lack of sequence conservation has been used to argue against functional relevance. Here, we investigated antisense lncRNAs, called COOLAIR, at the A. thaliana FLC locus and experimentally determined their secondary structure. The major COOLAIR variants are highly structured, organized by exon. The distally polyadenylated transcript has a complex multi-domain structure, altered by a single non-coding SNP defining a functionally distinct A. thaliana FLC haplotype. The A. thaliana COOLAIR secondary structure was used to predict COOLAIR exons in evolutionarily divergent Brassicaceae species. These predictions were validated through chemical probingmore » and cloning. Despite the relatively low nucleotide sequence identity, the structures, including multi-helix junctions, show remarkable evolutionary conservation. In a number of places, the structure is conserved through covariation of a non-contiguous DNA sequence. This structural conservation supports a functional role for COOLAIR transcripts rather than, or in addition to, antisense transcription.« less
The sequence, structure and evolutionary features of HOTAIR in mammals
2011-01-01
Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. Conclusions HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
Vouille, V; Amiche, M; Nicolas, P
1997-09-01
We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.
Crystal Structure of the CLOCK Transactivation Domain Exon19 in Complex with a Repressor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hou, Zhiqiang; Su, Lijing; Pei, Jimin
In the canonical clock model, CLOCK:BMAL1-mediated transcriptional activation is feedback regulated by its repressors CRY and PER and, in association with other coregulators, ultimately generates oscillatory gene expression patterns. How CLOCK:BMAL1 interacts with coregulator(s) is not well understood. Here we report the crystal structures of the mouse CLOCK transactivating domain Exon19 in complex with CIPC, a potent circadian repressor that functions independently of CRY and PER. The Exon19:CIPC complex adopts a three-helical coiled-coil bundle conformation containing two Exon19 helices and one CIPC. Unique to Exon19:CIPC, three highly conserved polar residues, Asn341 of CIPC and Gln544 of the two Exon19 helices,more » are located at the mid-section of the coiled-coil bundle interior and form hydrogen bonds with each other. Combining results from protein database search, sequence analysis, and mutagenesis studies, we discovered for the first time that CLOCK Exon19:CIPC interaction is a conserved transcription regulatory mechanism among mammals, fish, flies, and other invertebrates.« less
Genomic structure of the human D-site binding protein (DBP) gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shutler, G.; Glassco, T.; Kang, Xiaolin
1996-06-15
The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions the DBP protein. Exon 1 contains a long 5{prime} UTR, and conservation between themore » rat and the human genes of the presence of small open reading frames within this region suggests that is may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.« less
Intron-loss evolution of hatching enzyme genes in Teleostei
2010-01-01
Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321
Graveley, Brenton R.
2008-01-01
Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2015-01-01
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Samson, Marie-Laure
2008-01-01
Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K
2016-06-01
The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A
1996-01-01
In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa
2015-01-01
Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-01-01
ABSTRACT Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure. PMID:28277933
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing.
Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng
2017-10-03
Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dyer, K.D.; Handen, J.S.; Rosenberg, H.F.
The Charcot-Leyden crystal (CLC) protein, or eosinophil lysophospholipase, is a characteristic protein of human eosinophils and basophils; recent work has demonstrated that the CLC protein is both structurally and functionally related to the galectin family of {beta}-galactoside binding proteins. The galectins as a group share a number of features in common, including a linear ligand binding site encoded on a single exon. In this work, we demonstrate that the intron-exon structure of the gene encoding CLC is analogous to those encoding the galectins. The coding sequence of the CLC gene is divided into four exons, with the entire {beta}-galactoside bindingmore » site encoded by exon III. We have isolated CLC {beta}-galactoside binding sites from both orangutan (Pongo pygmaeus) and murine (Mus musculus) genomic DNAs, both encoded on single exons, and noted conservation of the amino acids shown to interact directly with the {beta}-galactoside ligand. The most likely interpretation of these results suggests the occurrence of one or more exon duplication and insertion events, resulting in the distribution of this lectin domain to CLC as well as to the multiple galectin genes. 35 refs., 3 figs.« less
Menzerath-Altmann law in mammalian exons reflects the dynamics of gene structure evolution.
Nikolaou, Christoforos
2014-12-01
Genomic sequences exhibit self-organization properties at various hierarchical levels. One such is the gene structure of higher eukaryotes with its complex exon/intron arrangement. Exon sizes and exon numbers in genes have been shown to conform to a law derived from statistical linguistics and formulated by Menzerath and Altmann, according to which the mean size of the constituents of an entity is inversely related to the number of these constituents. We herein perform a detailed analysis of this property in the complete exon set of the mouse genome in correlation to the sequence conservation of each exon and the transcriptional complexity of each gene locus. We show that extensive linear fits, representative of accordance to Menzerath-Altmann law are restricted to a particular subset of genes that are formed by exons under low or intermediate sequence constraints and have a small number of alternative transcripts. Based on this observation we propose a hypothesis for the law of Menzerath-Altmann in mammalian genes being predominantly due to genes that are more versatile in function and thus, more prone to undergo changes in their structure. To this end we demonstrate one test case where gene categories of different functionality also show differences in the extent of conformity to Menzerath-Altmann law. Copyright © 2014 Elsevier Ltd. All rights reserved.
Mo, Fan; Hong, Xu; Gao, Feng; Du, Lin; Wang, Jun; Omenn, Gilbert S; Lin, Biaoyang
2008-12-16
Alternative splicing is an important gene regulation mechanism. It is estimated that about 74% of multi-exon human genes have alternative splicing. High throughput tandem (MS/MS) mass spectrometry provides valuable information for rapidly identifying potentially novel alternatively-spliced protein products from experimental datasets. However, the ability to identify alternative splicing events through tandem mass spectrometry depends on the database against which the spectra are searched. We wrote scripts in perl, Bioperl, mysql and Ensembl API and built a theoretical exon-exon junction protein database to account for all possible combinations of exons for a gene while keeping the frame of translation (i.e., keeping only in-phase exon-exon combinations) from the Ensembl Core Database. Using our liver cancer MS/MS dataset, we identified a total of 488 non-redundant peptides that represent putative exon skipping events. Our exon-exon junction database provides the scientific community with an efficient means to identify novel alternatively spliced (exon skipping) protein isoforms using mass spectrometry data. This database will be useful in annotating genome structures using rapidly accumulating proteomics data.
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
Intron self-complementarity enforces exon inclusion in a yeast pre-mRNA
Howe, Kenneth James; Ares, Manuel
1997-01-01
Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing. PMID:9356473
nGASP - the nematode genome annotation assessment project
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coghlan, A; Fiedler, T J; McKay, S J
2008-12-19
While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner'more » algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders. While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders.« less
Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays
Sugnet, Charles W; Srinivasan, Karpagam; Clark, Tyson A; O'Brien, Georgeann; Cline, Melissa S; Wang, Hui; Williams, Alan; Kulp, David; Blume, John E; Haussler, David; Ares, Manuel
2006-01-01
Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families. PMID:16424921
Yao, Q; Fischer, K P; Tyrrell, D L; Gutfreund, K S
2015-04-01
Programmed death ligand-1 (PD-L1) plays an important role in the attenuation of adaptive immune responses in higher vertebrates. Here, we describe the identification of the Pekin duck PD-L1 orthologue (duPD-L1) and its gene structure. The duPD-L1 cDNA encodes a 311-amino acid protein that has an amino acid identity of 78% and 42% with chicken and human PD-L1, respectively. Mapping of the duPD-L1 cDNA with duck genomic sequences revealed an exonic structure of its coding sequence similar to those of other vertebrates but lacked a noncoding exon 1. Homology modelling of the duPD-L1 extracellular domain was compatible with the tandem IgV-like and IgC-like IgSF domain structure of human PD-L1 (PDB ID: 3BIS). Residues known to be important for receptor binding of human PD-L1 were mostly conserved in duPD-L1 within the N-terminus and the G sheet, and partially conserved within the F sheet but not within sheets C and C'. DuPD-L1 mRNA was constitutively expressed in all tissues examined with highest expression levels in lung and spleen and very low levels of expression in muscle, kidney and brain. Mitogen stimulation of duck peripheral blood mononuclear cells transiently increased duPD-L1 mRNA expression. Our observations demonstrate evolutionary conservation of the exonic structure of its coding sequence, the extracellular domain structure and residues implicated in receptor binding, but the role of the longer cytoplasmic tail in avian PD-L1 proteins remains to be determined. © 2014 John Wiley & Sons Ltd.
Molecular evolution of the leptin exon 3 in some species of the family Canidae.
Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek
2003-01-01
The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris)--16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) were studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical.
Zhao, G; Hortsch, M
1998-07-17
Members of the L1 family of neural cell adhesion molecules consist of multiple extracellular immunoglobulin and fibronectin type III domains that mediate the adhesive properties of this group of transmembrane proteins. In vertebrate genomes, these protein domains are separated by introns, and it has been suggested that L1-type genes might have been subject to exon-shuffling events during evolution. However, comparison of the human L1-CAM and the chicken neurofascin gene with the genomic structure of their Drosophila homologue, neuroglian, indicates that no major rearrangement of protein domains has taken place subsequent to the split of the arthropod and chordate phyla. The Drosophila neuroglian gene appears to have lost most of the introns that have been conserved in the human L1-CAM and the chicken neurofascin gene. Nevertheless, exon shuffling or the generation of new exons by mutational changes might have been responsible for the generation of additional, alternatively spliced exons in L1-type genes.
Molecular evolution of the leptin exon 3 in some species of the family Canidae
Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek
2003-01-01
The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris) – 16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) were studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical. PMID:12939206
Comparative genomic analysis of the false killer whale (Pseudorca crassidens) LMBR1 locus.
Kim, Dae-Won; Choi, Sang-Haeng; Kim, Ryong Nam; Kim, Sun-Hong; Paik, Sang-Gi; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Aeri; Kang, Aram; Park, Hong-Seog
2010-09-01
The sequencing and comparative genomic analysis of LMBR1 loci in mammals or other species, including human, would be very important in understanding evolutionary genetic changes underlying the evolution of limb development. In this regard, comparative genomic annotation of the false killer whale LMBR1 locus could shed new light on the evolution of limb development. We sequenced two false killer whale BAC clones, corresponding to 156 kb and 144 kb, respectively, harboring the tightly linked RNF32, LMBR1, and NOM1 genes. Our annotation of the false killer whale LMBR1 gene showed that it consists of 17 exons (1473 bp), in contrast to 18 exons (1596 bp) in human, and it displays 93.1% and 95.6% nucleotide and amino acid sequence similarity, respectively, compared with the human gene. In particular, we discovered that exon 10, deleted in the false killer whale LMBR1 gene, is present only in primates, and this fact strongly implies that exon 10 might be crucial in determining primate-specific limb development. ZRS and TFBS sequences have been well conserved across 11 species, suggesting that these regions could be involved in an important function of limb development and limb patterning. The neighboring gene RNF32 showed several lineage-conserved exons, such as exons 2 through 9 conserved in eutherian mammals, exons 3 through 9 conserved in mammals, and exons 5 through 9 conserved in vertebrates. The other neighboring gene, NOM1, had undergone a substitution (ATG→GTA) at the start codon, giving rise to a 36 bp shorter N-terminal sequence compared with the human sequence. Our comparative analysis of the false killer whale LMBR1 genomic locus provides important clues regarding the genetic regions that may play crucial roles in limb development and patterning.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wong, E.C.C.; Mullersman, J.E.; Thomas, M.L.
1993-07-01
The leukocyte common antigen-related protein tyrosine phosphatase (LRP) is a widely expressed transmembrane glycoprotein thought to be involved in cell growth and differentiation. Similar to most other transmembrane protein tyrosine phosphatases, LRP contains two tandem cytoplasmic phosphatase domains. To understand further the regulation and evolution of LRP, the authors have isolated and characterized mouse [lambda] genomic clones. Thirteen genomic clones could be divided into two non-overlapping clusters. The first cluster contained the transcription initiation site and the exon encoding most of the 5[prime] untranslated region. The second cluster contained the remaining exons encoding the protein and the 3[prime] untranslated region.more » The gene consists of 22 exons spanning over 75 kb. The distance between exon 1 and exon 2 is at least 25 kb. Characterization of the 5[prime] ends of LRP mRNA by S1 nuclease protection identifies putative initiation start sites within a G/C-rich region. The upstream region does not contain a TATA box. Comparison of the LRP gene structure to the mammalian protein tyrosine phosphatase gene, CD45, shows striking similarities in size and genomic organization. 29 refs., 5 figs., 1 tab.« less
Remarkable sequence conservation of the last intron in the PKD1 gene.
Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P
2003-10-01
The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
A Plant 5S Ribosomal RNA Mimic Regulates Alternative Splicing of Transcription Factor IIIA Pre-mRNAs
Hammond, Ming C.; Wachter, Andreas; Breaker, Ronald R.
2009-01-01
Transcription factor IIIA (TFIIIA) is required for eukaryotic synthesis of 5S ribosomal RNA by RNA polymerase III. Here we report the discovery of a structured RNA element with striking resemblance to 5S rRNA that is conserved within TFIIIA precursor mRNAs (pre-mRNAs) from diverse plant lineages. TFIIIA protein expression is controlled by alternative splicing of the exon containing the plant 5S rRNA mimic (P5SM). P5SM triggers exon skipping upon binding of ribosomal protein L5, a natural partner of 5S rRNA, which demonstrates the functional adaptation of its structural mimicry. Since the exon-skipped splice product encodes full-length TFIIIA protein, these results reveal a ribosomal protein-mRNA interaction that is involved in 5S rRNA synthesis and has implications for cross-coordination of ribosomal components. This study also provides insight into the origin and function of a newfound class of structured RNA that regulates alternative splicing. PMID:19377483
Hammond, Ming C; Wachter, Andreas; Breaker, Ronald R
2009-05-01
Transcription factor IIIA (TFIIIA) is required for eukaryotic synthesis of 5S ribosomal RNA by RNA polymerase III. Here we report the discovery of a structured RNA element with clear resemblance to 5S rRNA that is conserved within TFIIIA precursor mRNAs from diverse plant lineages. TFIIIA protein expression is controlled by alternative splicing of the exon containing the plant 5S rRNA mimic (P5SM). P5SM triggers exon skipping upon binding of ribosomal protein L5, a natural partner of 5S rRNA, which demonstrates the functional adaptation of its structural mimicry. As the exon-skipped splice product encodes full-length TFIIIA protein, these results reveal a ribosomal protein-mRNA interaction that is involved in 5S rRNA synthesis and has implications for cross-coordination of ribosomal components. This study also provides insight into the origin and function of a newfound class of structured RNA that regulates alternative splicing.
Terenzi, Fulvia; Ladd, Andrea N
2010-01-01
Muscleblind-like (MBNL) proteins have been shown to regulate pre-mRNA alternative splicing, and MBNL1 has been implicated in regulating fetal-to-adult transitions in alternative splicing in the heart. MBNL1 is highly conserved, exhibiting more than 95% identity at the amino acid level between birds and mammals. To investigate MBNL1 expression during embryonic heart development, we examined MBNL1 transcript and protein expression in the embryonic chicken heart from the formation of the primitive heart tube through cardiac morphogenesis (embryonic days 1.5 through 8). MBNL1 transcript levels remained steady throughout these stages, whereas MBNL1 protein levels increased and exhibited a shift in isoforms. MBNL1 has several alternatively spliced exons. Using RT-PCR, we determined that the inclusion of one of these, exon 5, decreases dramatically during cardiac morphogenesis. This developmental transition is conserved in mice. Functional analyses of MBNL1 isoforms containing or lacking exon 5-encoded sequences revealed that exon 5 is important for the regulation of the subcellular localization, RNA binding affinity, and alternative splicing activity of MBNL1 proteins. A second MBNL protein, MBNL2, is also expressed in the embryonic heart. We found that MBNL2 exon 5, which is paralogous to MBNL1 exon 5, is similarly regulated during embryonic heart development. Analysis of MBNL1 and MBNL2 transcripts in several embryonic tissues in chicken and mouse indicate that exon 5 alternative splicing is highly conserved and tissue-specific. Thus, we propose that conserved developmental stage- and tissue-specific alternative splicing of MBNL transcripts is an important mechanism by which MBNL activity is regulated during embryonic development.
Bottomless barrel-sponge species in the Indo-Pacific?
Setiawan, Edwin; Voogd, Nicole J De; Wörheide, Gert; Erpenbeck, Dirk
2016-07-06
The use of nuclear markers, in addition to traditional mitochondrial markers, helps to clarify hidden patterns of genetic structure in natural populations (Palumbi & Baker, 1994). This is particularly evident among demosponges that possess slow mitochondrial evolutionary rates compared to Bilateria, where nuclear intron markers can aid in the understanding of shallow level phylogenetic relationships (Shearer et al., 2002). Ideally, these nuclear markers (i) are evolutionary well-conserved across different lineages, (ii) produce amplicons holding a number of sites with sufficient variability to answer the relevant phylogenetic question, (iii) derive from single copy genes (see review in Zhang & Hewitt, 2003). A popular method to amplify intron markers uses EPIC (Exon-Primed, Intron-Crossing) primers that anneal to the more conserved flanking exon regions and subsequently bridge the intron during amplification (Palumbi & Baker, 1994).
Alternative splicing of anciently exonized 5S rRNA regulates plant transcription factor TFIIIA
Fu, Yan; Bannach, Oliver; Chen, Hao; Teune, Jan-Hendrik; Schmitz, Axel; Steger, Gerhard; Xiong, Liming; Barbazuk, W. Brad
2009-01-01
Identifying conserved alternative splicing (AS) events among evolutionarily distant species can prioritize AS events for functional characterization and help uncover relevant cis- and trans-regulatory factors. A genome-wide search for conserved cassette exon AS events in higher plants revealed the exonization of 5S ribosomal RNA (5S rRNA) within the gene of its own transcription regulator, TFIIIA (transcription factor for polymerase III A). The 5S rRNA-derived exon in TFIIIA gene exists in all representative land plant species but not in green algae and nonplant species, suggesting it is specific to land plants. TFIIIA is essential for RNA polymerase III-based transcription of 5S rRNA in eukaryotes. Integrating comparative genomics and molecular biology revealed that the conserved cassette exon derived from 5S rRNA is coupled with nonsense-mediated mRNA decay. Utilizing multiple independent Arabidopsis overexpressing TFIIIA transgenic lines under osmotic and salt stress, strong accordance between phenotypic and molecular evidence reveals the biological relevance of AS of the exonized 5S rRNA in quantitative autoregulation of TFIIIA homeostasis. Most significantly, this study provides the first evidence of ancient exaptation of 5S rRNA in plants, suggesting a novel gene regulation model mediated by the AS of an anciently exonized noncoding element. PMID:19211543
Identifcation of a Novel Mutation p.I240T in the FRMD7 gene in a Family with Congenital Nystagmus
NASA Astrophysics Data System (ADS)
Zhu, Yihua; Zhuang, Jianfu; Ge, Xianglian; Zhang, Xiao; Wang, Zheng; Sun, Ji; Yang, Juhua; Gu, Feng
2013-10-01
Congenital Nystagmus (CN) is a genetically heterogeneous ocular disease, which causes a significant proportion of childhood visual impairment. To identify the underlying genetic defect of a CN family, twenty-two members were recruited. Genotype analysis showed that affected individuals shared a common haplotype with markers flanking FRMD7 locus. Sequencing FRMD7 revealed a T > C transition in exon 8, causing a conservative substitution of Isoleucine to Tyrosine at codon 240. By protein structural modeling, we found the mutation may disrupt the hydrophobic core and destabilize the protein structure. We reviewed the literature and found that exons 2, 8, and 9 (11.4% of the sequence of FRMD7 mRNA) represent the majority (55.3%) of the reported FRMD7 mutations. In summary, we identified a novel mutation in FRMD7, showed its molecular consequence, and revealed the mutation-rich exons of the FRMD7 gene. Collectively, this provides molecular insights for future CN clinical genetic diagnosis and treatment.
Identifcation of a novel mutation p.I240T in the FRMD7 gene in a family with congenital nystagmus.
Zhu, Yihua; Zhuang, Jianfu; Ge, Xianglian; Zhang, Xiao; Wang, Zheng; Sun, Ji; Yang, Juhua; Gu, Feng
2013-10-30
Congenital Nystagmus (CN) is a genetically heterogeneous ocular disease, which causes a significant proportion of childhood visual impairment. To identify the underlying genetic defect of a CN family, twenty-two members were recruited. Genotype analysis showed that affected individuals shared a common haplotype with markers flanking FRMD7 locus. Sequencing FRMD7 revealed a T > C transition in exon 8, causing a conservative substitution of Isoleucine to Tyrosine at codon 240. By protein structural modeling, we found the mutation may disrupt the hydrophobic core and destabilize the protein structure. We reviewed the literature and found that exons 2, 8, and 9 (11.4% of the sequence of FRMD7 mRNA) represent the majority (55.3%) of the reported FRMD7 mutations. In summary, we identified a novel mutation in FRMD7, showed its molecular consequence, and revealed the mutation-rich exons of the FRMD7 gene. Collectively, this provides molecular insights for future CN clinical genetic diagnosis and treatment.
Identifcation of a Novel Mutation p.I240T in the FRMD7 gene in a Family with Congenital Nystagmus
Zhu, Yihua; Zhuang, Jianfu; Ge, Xianglian; Zhang, Xiao; Wang, Zheng; Sun, Ji; Yang, Juhua; Gu, Feng
2013-01-01
Congenital Nystagmus (CN) is a genetically heterogeneous ocular disease, which causes a significant proportion of childhood visual impairment. To identify the underlying genetic defect of a CN family, twenty-two members were recruited. Genotype analysis showed that affected individuals shared a common haplotype with markers flanking FRMD7 locus. Sequencing FRMD7 revealed a T > C transition in exon 8, causing a conservative substitution of Isoleucine to Tyrosine at codon 240. By protein structural modeling, we found the mutation may disrupt the hydrophobic core and destabilize the protein structure. We reviewed the literature and found that exons 2, 8, and 9 (11.4% of the sequence of FRMD7 mRNA) represent the majority (55.3%) of the reported FRMD7 mutations. In summary, we identified a novel mutation in FRMD7, showed its molecular consequence, and revealed the mutation-rich exons of the FRMD7 gene. Collectively, this provides molecular insights for future CN clinical genetic diagnosis and treatment. PMID:24169426
Probing the Boundaries of Orthology: The Unanticipated Rapid Evolution of Drosophila centrosomin
Eisman, Robert C.; Kaufman, Thomas C.
2013-01-01
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins. PMID:23749319
Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M
2017-01-01
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
2014-01-01
Background Alternative splicing is an important process in higher eukaryotes that allows obtaining several transcripts from one gene. A specific case of alternative splicing is mutually exclusive splicing, in which exactly one exon out of a cluster of neighbouring exons is spliced into the mature transcript. Recently, a new algorithm for the prediction of these exons has been developed based on the preconditions that the exons of the cluster have similar lengths, sequence homology, and conserved splice sites, and that they are translated in the same reading frame. Description In this contribution we introduce Kassiopeia, a database and web application for the generation, storage, and presentation of genome-wide analyses of mutually exclusive exomes. Currently, Kassiopeia provides access to the mutually exclusive exomes of twelve Drosophila species, the thale cress Arabidopsis thaliana, the flatworm Caenorhabditis elegans, and human. Mutually exclusive spliced exons (MXEs) were predicted based on gene reconstructions from Scipio. Based on the standard prediction values, with which 83.5% of the annotated MXEs of Drosophila melanogaster were reconstructed, the exomes contain surprisingly more MXEs than previously supposed and identified. The user can search Kassiopeia using BLAST or browse the genes of each species optionally adjusting the parameters used for the prediction to reveal more divergent or only very similar exon candidates. Conclusions We developed a pipeline to predict MXEs in the genomes of several model organisms and a web interface, Kassiopeia, for their visualization. For each gene Kassiopeia provides a comprehensive gene structure scheme, the sequences and predicted secondary structures of the MXEs, and, if available, further evidence for MXE candidates from cDNA/EST data, predictions of MXEs in homologous genes of closely related species, and RNA secondary structure predictions. Kassiopeia can be accessed at http://www.motorprotein.de/kassiopeia. PMID:24507667
Splicing-Related Features of Introns Serve to Propel Evolution
Luo, Yuping; Li, Chun; Gong, Xi; Wang, Yanlu; Zhang, Kunshan; Cui, Yaru; Sun, Yi Eve; Li, Siguang
2013-01-01
The role of spliceosomal intronic structures played in evolution has only begun to be elucidated. Comparative genomic analyses of fungal snoRNA sequences, which are often contained within introns and/or exons, revealed that about one-third of snoRNA-associated introns in three major snoRNA gene clusters manifested polymorphisms, likely resulting from intron loss and gain events during fungi evolution. Genomic deletions can clearly be observed as one mechanism underlying intron and exon loss, as well as generation of complex introns where several introns lie in juxtaposition without intercalating exons. Strikingly, by tracking conserved snoRNAs in introns, we found that some introns had moved from one position to another by excision from donor sites and insertion into target sties elsewhere in the genome without needing transposon structures. This study revealed the origin of many newly gained introns. Moreover, our analyses suggested that intron-containing sequences were more prone to sustainable structural changes than DNA sequences without introns due to intron's ability to jump within the genome via unknown mechanisms. We propose that splicing-related structural features of introns serve as an additional motor to propel evolution. PMID:23516505
Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays.
Johnson, Jason M; Castle, John; Garrett-Engele, Philip; Kan, Zhengyan; Loerch, Patrick M; Armour, Christopher D; Santos, Ralph; Schadt, Eric E; Stoughton, Roland; Shoemaker, Daniel D
2003-12-19
Alternative pre-messenger RNA (pre-mRNA) splicing plays important roles in development, physiology, and disease, and more than half of human genes are alternatively spliced. To understand the biological roles and regulation of alternative splicing across different tissues and stages of development, systematic methods are needed. Here, we demonstrate the use of microarrays to monitor splicing at every exon-exon junction in more than 10,000 multi-exon human genes in 52 tissues and cell lines. These genome-wide data provide experimental evidence and tissue distributions for thousands of known and novel alternative splicing events. Adding to previous studies, the results indicate that at least 74% of human multi-exon genes are alternatively spliced.
Genome-wide association between DNA methylation and alternative splicing in an invertebrate
2012-01-01
Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee) and Nasonia vitripennis (jewel wasp) analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq) to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice variants by positively influencing exon inclusion during transcription. The results from our cross-species homology analysis suggest that DNA methylation and alternative splicing are genetic mechanisms whose utilization could contribute to a longer gene length and a slower rate of gene evolution. PMID:22978521
Conservation of CD44 exon v3 functional elements in mammals
Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos
2008-01-01
Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles, may lead to classify CD44v3 as an exclusive mammalian gene trait. PMID:18710510
Structure, synthesis, and molecular cloning of dermaseptins B, a family of skin peptide antibiotics.
Charpentier, S; Amiche, M; Mester, J; Vouille, V; Le Caer, J P; Nicolas, P; Delfour, A
1998-06-12
Analysis of antimicrobial activities that are present in the skin secretions of the South American frog Phyllomedusa bicolor revealed six polycationic (lysine-rich) and amphipathic alpha-helical peptides, 24-33 residues long, termed dermaseptins B1 to B6, respectively. Prepro-dermaseptins B all contain an almost identical signal peptide, which is followed by a conserved acidic propiece, a processing signal Lys-Arg, and a dermaseptin progenitor sequence. The 22-residue signal peptide plus the first 3 residues of the acidic propiece are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The 25-residue amino-terminal region of prepro-dermaseptins B shares 50% identity with the corresponding region of precursors for D-amino acid containing opioid peptides or for antimicrobial peptides originating from the skin of distantly related frog species. The remarkable similarity found between prepro-proteins that encode end products with strikingly different sequences, conformations, biological activities and modes of action suggests that the corresponding genes have evolved through dissemination of a conserved "secretory cassette" exon.
Awan, Ali R; Manfredo, Amanda; Pleiss, Jeffrey A
2013-07-30
Alternative splicing is a potent regulator of gene expression that vastly increases proteomic diversity in multicellular eukaryotes and is associated with organismal complexity. Although alternative splicing is widespread in vertebrates, little is known about the evolutionary origins of this process, in part because of the absence of phylogenetically conserved events that cross major eukaryotic clades. Here we describe a lariat-sequencing approach, which offers high sensitivity for detecting splicing events, and its application to the unicellular fungus, Schizosaccharomyces pombe, an organism that shares many of the hallmarks of alternative splicing in mammalian systems but for which no previous examples of exon-skipping had been demonstrated. Over 200 previously unannotated splicing events were identified, including examples of regulated alternative splicing. Remarkably, an evolutionary analysis of four of the exons identified here as subject to skipping in S. pombe reveals high sequence conservation and perfect length conservation with their homologs in scores of plants, animals, and fungi. Moreover, alternative splicing of two of these exons have been documented in multiple vertebrate organisms, making these the first demonstrations of identical alternative-splicing patterns in species that are separated by over 1 billion y of evolution.
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Schulte, W; Töpfer, R; Stracke, R; Schell, J; Martini, N
1997-04-01
Three genes coding for different multifunctional acetyl-CoA carboxylase (ACCase; EC 6.4.1.2) isoenzymes from Brassica napus were isolated and divided into two major classes according to structural features in their 5' regions: class I comprises two genes with an additional coding exon of approximately 300 bp at the 5' end, and class II is represented by one gene carrying an intron of 586 bp in its 5' untranslated region. Fusion of the peptide sequence encoded by the additional first exon of a class I ACCase gene to the jellyfish Aequorea victoria green fluorescent protein (GFP) and transient expression in tobacco protoplasts targeted GFP to the chloroplasts. In contrast to the deduced primary structure of the biotin carboxylase domain encoded by the class I gene, the corresponding amino acid sequence of the class II ACCase shows higher identity with that of the Arabidopsis ACCase, both lacking a transit peptide. The Arabidopsis ACCase has been proposed to be a cytosolic isoenzyme. These observations indicate that the two classes of ACCase genes encode plastidic and cytosolic isoforms of multi-functional, eukaryotic type, respectively, and that B. napus contains at least one multi-functional ACCase besides the multi-subunit, prokaryotic type located in plastids. Southern blot analysis of genomic DNA from B. napus, Brassica rapa, and Brassica oleracea, the ancestors of amphidiploid rapeseed, using a fragment of a multi-functional ACCase gene as a probe revealed that ACCase is encoded by a multi-gene family of at least five members.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lewis, P.M.; Crosier, K.E.; Crosier, P.S.
The receptor tyrosine kinase Dtk/Tyro 3/Sky/rse/brt/tif is a member of a new subfamily of receptors that also includes Axl/Ufo/Ark and Eyk/Mer. These receptors are characterized by the presence of two immunoglobulin-like loops and two fibronectin type III repeats in their extracellular domains. The structure of the murine Dtk gene has been determined. The gene consists of 21 exons that are distributed over 21 kb of genomic DNA. An isoform of Dtk is generated by differential splicing of exons from the 5{prime} region of the gene. The overall genomic structure of Dtk is virtually identical to that determined for the humanmore » UFO gene. This particular genomic organization is likely to have been duplicated and closely maintained throughout evolution. 38 refs., 3 figs., 1 tab.« less
Cortés-Romero, Celso; Martínez-Hernández, Aída; Mellado-Mojica, Erika; López, Mercedes G; Simpson, June
2012-01-01
Fructans are the main storage polysaccharides found in Agave species. The synthesis of these complex carbohydrates relies on the activities of specific fructosyltransferase enzymes closely related to the hydrolytic invertases. Analysis of Agave tequilana transcriptome data led to the identification of ESTs encoding putative fructosyltransferases and invertases. Based on sequence alignments and structure/function relationships, two different genes were predicted to encode 1-SST and 6G-FFT type fructosyltransferases, in addition, 4 genes encoding putative cell wall invertases and 4 genes encoding putative vacuolar invertases were also identified. Probable functions for each gene, were assigned based on conserved amino acid sequences and confirmed for 2 fructosyltransferases and one invertase by analyzing the enzymatic activity of recombinant Agave protein s expressed and purified from Pichia pastoris. The genome organization of the fructosyltransferase/invertase genes, for which the corresponding cDNA contained the complete open reading frame, was found to be well conserved since all genes were shown to carry a 9 bp mini-exon and all showed a similar structure of 8 exons/7 introns with the exception of a cell wall invertase gene which has 7 exons and 6 introns. Fructosyltransferase genes were strongly expressed in the storage organs of the plants, especially in vegetative stages of development and to lower levels in photosynthetic tissues, in contrast to the invertase genes where higher levels of expression were observed in leaf tissues and in mature plants.
Cortés-Romero, Celso; Martínez-Hernández, Aída; Mellado-Mojica, Erika; López, Mercedes G.; Simpson, June
2012-01-01
Fructans are the main storage polysaccharides found in Agave species. The synthesis of these complex carbohydrates relies on the activities of specific fructosyltransferase enzymes closely related to the hydrolytic invertases. Analysis of Agave tequilana transcriptome data led to the identification of ESTs encoding putative fructosyltransferases and invertases. Based on sequence alignments and structure/function relationships, two different genes were predicted to encode 1-SST and 6G-FFT type fructosyltransferases, in addition, 4 genes encoding putative cell wall invertases and 4 genes encoding putative vacuolar invertases were also identified. Probable functions for each gene, were assigned based on conserved amino acid sequences and confirmed for 2 fructosyltransferases and one invertase by analyzing the enzymatic activity of recombinant Agave protein s expressed and purified from Pichia pastoris. The genome organization of the fructosyltransferase/invertase genes, for which the corresponding cDNA contained the complete open reading frame, was found to be well conserved since all genes were shown to carry a 9 bp mini-exon and all showed a similar structure of 8 exons/7 introns with the exception of a cell wall invertase gene which has 7 exons and 6 introns. Fructosyltransferase genes were strongly expressed in the storage organs of the plants, especially in vegetative stages of development and to lower levels in photosynthetic tissues, in contrast to the invertase genes where higher levels of expression were observed in leaf tissues and in mature plants. PMID:22558253
Genome-wide identification and characterization of aquaporin gene family in Beta vulgaris
Kong, Weilong; Yang, Shaozong; Wang, Yulu; Bendahmane, Mohammed
2017-01-01
Aquaporins (AQPs) are essential channel proteins that execute multi-functions throughout plant growth and development, including water transport, uncharged solutes uptake, stress response, and so on. Here, we report the first genome-wide identification and characterization AQP (BvAQP) genes in sugar beet (Beta vulgaris), an important crop widely cultivated for feed, for sugar production and for bioethanol production. Twenty-eight sugar beet AQPs (BvAQPs) were identified and assigned into five subfamilies based on phylogenetic analyses: seven of plasma membrane (PIPs), eight of tonoplast (TIPs), nine of NOD26-like (NIPs), three of small basic (SIPs), and one of x-intrinsic proteins (XIPs). BvAQP genes unevenly mapped on all chromosomes, except on chromosome 4. Gene structure and motifs analyses revealed that BvAQP have conserved exon-intron organization and that they exhibit conserved motifs within each subfamily. Prediction of BvAQPs functions, based on key protein domains conservation, showed a remarkable difference in substrate specificity among the five subfamilies. Analyses of BvAQPs expression, by mean of RNA-seq, in different plant organs and in response to various abiotic stresses revealed that they were ubiquitously expressed and that their expression was induced by heat and salt stresses. These results provide a reference base to address further the function of sugar beet aquaporins and to explore future applications for plants growth and development improvements as well as in response to environmental stresses. PMID:28948097
Disrupted auto-regulation of the spliceosomal gene SNRPB causes cerebro–costo–mandibular syndrome
Lynch, Danielle C.; Revil, Timothée; Schwartzentruber, Jeremy; Bhoj, Elizabeth J.; Innes, A. Micheil; Lamont, Ryan E.; Lemire, Edmond G.; Chodirker, Bernard N.; Taylor, Juliet P.; Zackai, Elaine H.; McLeod, D. Ross; Kirk, Edwin P.; Hoover-Fong, Julie; Fleming, Leah; Savarirayan, Ravi; Boycott, Kym; MacKenzie, Alex; Brudno, Michael; Bulman, Dennis; Dyment, David; Majewski, Jacek; Jerome-Majewska, Loydie A.; Parboosingh, Jillian S.; Bernier, Francois P.
2014-01-01
Elucidating the function of highly conserved regulatory sequences is a significant challenge in genomics today. Certain intragenic highly conserved elements have been associated with regulating levels of core components of the spliceosome and alternative splicing of downstream genes. Here we identify mutations in one such element, a regulatory alternative exon of SNRPB as the cause of cerebro–costo–mandibular syndrome. This exon contains a premature termination codon that triggers nonsense-mediated mRNA decay when included in the transcript. These mutations cause increased inclusion of the alternative exon and decreased overall expression of SNRPB. We provide evidence for the functional importance of this conserved intragenic element in the regulation of alternative splicing and development, and suggest that the evolution of such a regulatory mechanism has contributed to the complexity of mammalian development. PMID:25047197
Disrupted auto-regulation of the spliceosomal gene SNRPB causes cerebro-costo-mandibular syndrome.
Lynch, Danielle C; Revil, Timothée; Schwartzentruber, Jeremy; Bhoj, Elizabeth J; Innes, A Micheil; Lamont, Ryan E; Lemire, Edmond G; Chodirker, Bernard N; Taylor, Juliet P; Zackai, Elaine H; McLeod, D Ross; Kirk, Edwin P; Hoover-Fong, Julie; Fleming, Leah; Savarirayan, Ravi; Majewski, Jacek; Jerome-Majewska, Loydie A; Parboosingh, Jillian S; Bernier, Francois P
2014-07-22
Elucidating the function of highly conserved regulatory sequences is a significant challenge in genomics today. Certain intragenic highly conserved elements have been associated with regulating levels of core components of the spliceosome and alternative splicing of downstream genes. Here we identify mutations in one such element, a regulatory alternative exon of SNRPB as the cause of cerebro-costo-mandibular syndrome. This exon contains a premature termination codon that triggers nonsense-mediated mRNA decay when included in the transcript. These mutations cause increased inclusion of the alternative exon and decreased overall expression of SNRPB. We provide evidence for the functional importance of this conserved intragenic element in the regulation of alternative splicing and development, and suggest that the evolution of such a regulatory mechanism has contributed to the complexity of mammalian development.
Wang, Chun-Chi; Chang, Jan-Gowth; Chen, Yen-Ling; Jong, Yuh-Jyh; Wu, Shou-Mei
2010-07-01
In this study, we established the first method for simultaneous evaluation of nine exons in the survival motor neuron (SMN) genes for full-scale genotyping. This method was used not only to quantify the copy numbers of highly homogenous telomeric SMN (SMN1)/centromeric SMN genes in exons 7 and 8 but also to determine intragenic mutations in all nine exons for complete diagnosis of spinal muscular atrophy (SMA). Additionally, we utilized the "universal fluorescent PCR" for simultaneously fluorescent labeling of eleven gene fragments (nine exons in SMN and two internal standards). Such technique is very beneficial for multi-exon analysis due to only requirement of one universal fluorescent primer which could fluorescently amplify all gene fragments. Of all 262 detected individuals, three subjects possessing different ratios of SMN1/centromeric SMN in the two exons were determined as gene conversion, and we also detected three interesting intragenic mutations (c.1 -39A>G, c.22_23insA in exon 1, c.84C>T in exon 2a) which were associated with the SMA patients owning one copy of SMN1 including two mutations never reported previously. This high-resolved method provided better potential technique for genotyping and identifying SMA, carrier and normal controls in large population.
Plant Proteins Are Smaller Because They Are Encoded by Fewer Exons than Animal Proteins.
Ramírez-Sánchez, Obed; Pérez-Rodríguez, Paulino; Delaye, Luis; Tiessen, Axel
2016-12-01
Protein size is an important biochemical feature since longer proteins can harbor more domains and therefore can display more biological functionalities than shorter proteins. We found remarkable differences in protein length, exon structure, and domain count among different phylogenetic lineages. While eukaryotic proteins have an average size of 472 amino acid residues (aa), average protein sizes in plant genomes are smaller than those of animals and fungi. Proteins unique to plants are ∼81aa shorter than plant proteins conserved among other eukaryotic lineages. The smaller average size of plant proteins could neither be explained by endosymbiosis nor subcellular compartmentation nor exon size, but rather due to exon number. Metazoan proteins are encoded on average by ∼10 exons of small size [∼176 nucleotides (nt)]. Streptophyta have on average only ∼5.7 exons of medium size (∼230nt). Multicellular species code for large proteins by increasing the exon number, while most unicellular organisms employ rather larger exons (>400nt). Among subcellular compartments, membrane proteins are the largest (∼520aa), whereas the smallest proteins correspond to the gene ontology group of ribosome (∼240aa). Plant genes are encoded by half the number of exons and also contain fewer domains than animal proteins on average. Interestingly, endosymbiotic proteins that migrated to the plant nucleus became larger than their cyanobacterial orthologs. We thus conclude that plants have proteins larger than bacteria but smaller than animals or fungi. Compared to the average of eukaryotic species, plants have ∼34% more but ∼20% smaller proteins. This suggests that photosynthetic organisms are unique and deserve therefore special attention with regard to the evolutionary forces acting on their genomes and proteomes. Copyright © 2016 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Zipper plot: visualizing transcriptional activity of genomic regions.
Avila Cobos, Francisco; Anckaert, Jasper; Volders, Pieter-Jan; Everaert, Celine; Rombaut, Dries; Vandesompele, Jo; De Preter, Katleen; Mestdagh, Pieter
2017-05-02
Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5'-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
The Evolution of COP9 Signalosome in Unicellular and Multicellular Organisms.
Barth, Emanuel; Hübler, Ron; Baniahmad, Aria; Marz, Manja
2016-05-02
The COP9 signalosome (CSN) is a highly conserved protein complex, recently being crystallized for human. In mammals and plants the COP9 complex consists of nine subunits, CSN 1-8 and CSNAP. The CSN regulates the activity of culling ring E3 ubiquitin and plays central roles in pleiotropy, cell cycle, and defense of pathogens. Despite the interesting and essential functions, a thorough analysis of the CSN subunits in evolutionary comparative perspective is missing. Here we compared 61 eukaryotic genomes including plants, animals, and yeasts genomes and show that the most conserved subunits of eukaryotes among the nine subunits are CSN2 and CSN5. This may indicate a strong evolutionary selection for these two subunits. Despite the strong conservation of the protein sequence, the genomic structures of the intron/exon boundaries indicate no conservation at genomic level. This suggests that the gene structure is exposed to a much less selection compared with the protein sequence. We also show the conservation of important active domains, such as PCI (proteasome lid-CSN-initiation factor) and MPN (MPR1/PAD1 amino-terminal). We identified novel exons and alternative splicing variants for all CSN subunits. This indicates another level of complexity of the CSN. Notably, most COP9-subunits were identified in all multicellular and unicellular eukaryotic organisms analyzed, but not in prokaryotes or archaeas. Thus, genes encoding CSN subunits present in all analyzed eukaryotes indicate the invention of the signalosome at the root of eukaryotes. The identification of alternative splice variants indicates possible "mini-complexes" or COP9 complexes with independent subunits containing potentially novel and not yet identified functions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Evolution of COP9 Signalosome in Unicellular and Multicellular Organisms
Barth, Emanuel; Hübler, Ron; Baniahmad, Aria; Marz, Manja
2016-01-01
The COP9 signalosome (CSN) is a highly conserved protein complex, recently being crystallized for human. In mammals and plants the COP9 complex consists of nine subunits, CSN 1–8 and CSNAP. The CSN regulates the activity of culling ring E3 ubiquitin and plays central roles in pleiotropy, cell cycle, and defense of pathogens. Despite the interesting and essential functions, a thorough analysis of the CSN subunits in evolutionary comparative perspective is missing. Here we compared 61 eukaryotic genomes including plants, animals, and yeasts genomes and show that the most conserved subunits of eukaryotes among the nine subunits are CSN2 and CSN5. This may indicate a strong evolutionary selection for these two subunits. Despite the strong conservation of the protein sequence, the genomic structures of the intron/exon boundaries indicate no conservation at genomic level. This suggests that the gene structure is exposed to a much less selection compared with the protein sequence. We also show the conservation of important active domains, such as PCI (proteasome lid-CSN-initiation factor) and MPN (MPR1/PAD1 amino-terminal). We identified novel exons and alternative splicing variants for all CSN subunits. This indicates another level of complexity of the CSN. Notably, most COP9-subunits were identified in all multicellular and unicellular eukaryotic organisms analyzed, but not in prokaryotes or archaeas. Thus, genes encoding CSN subunits present in all analyzed eukaryotes indicate the invention of the signalosome at the root of eukaryotes. The identification of alternative splice variants indicates possible “mini-complexes” or COP9 complexes with independent subunits containing potentially novel and not yet identified functions. PMID:27044515
High resolution structure of cleaved Serpin 42 Da from Drosophila melanogaster.
Ellisdon, Andrew M; Zhang, Qingwei; Henstridge, Michelle A; Johnson, Travis K; Warr, Coral G; Law, Ruby Hp; Whisstock, James C
2014-04-24
The Drosophila melanogaster Serpin 42 Da gene (previously Serpin 4) encodes a serine protease inhibitor that is capable of remarkable functional diversity through the alternative splicing of four different reactive centre loop exons. Eight protein isoforms of Serpin 42 Da have been identified to date, targeting the protease inhibitor to both different proteases and cellular locations. Biochemical and genetic studies suggest that Serpin 42 Da inhibits target proteases through the classical serpin 'suicide' inhibition mechanism, however the crystal structure of a representative Serpin 42 Da isoform remains to be determined. We report two high-resolution crystal structures of Serpin 42 Da representing the A/B isoforms in the cleaved conformation, belonging to two different space-groups and diffracting to 1.7 Å and 1.8 Å. Structural analysis reveals the archetypal serpin fold, with the major elements of secondary structure displaying significant homology to the vertebrate serpin, neuroserpin. Key residues known to have central roles in the serpin inhibitory mechanism are conserved in both the hinge and shutter regions of Serpin 42 Da. Furthermore, these structures identify important conserved interactions that appear to be of crucial importance in allowing the Serpin 42 Da fold to act as a versatile template for multiple reactive centre loops that have different sequences and protease specificities. In combination with previous biochemical and genetic studies, these structures confirm for the first time that the Serpin 42 Da isoforms are typical inhibitory serpin family members with the conserved serpin fold and inhibitory mechanism. Additionally, these data reveal the remarkable structural plasticity of serpins, whereby the basic fold is harnessed as a template for inhibition of a large spectrum of proteases by reactive centre loop exon 'switching'. This is the first structure of a Drosophila serpin reported to date, and will provide a platform for future mutational studies in Drosophila to ascertain the functional role of each of the Serpin 42 Da isoforms.
Structure and polymorphism of the mouse prion protein gene.
Westaway, D; Cooper, C; Turner, S; Da Costa, M; Carlson, G A; Prusiner, S B
1994-01-01
Missense mutations in the prion protein (PrP) gene, overexpression of the cellular isoform of PrP (PrPC), and infection with prions containing the scrapie isoform of PrP (PrPSc) all cause neurodegenerative disease. To understand better the physiology and expression of PrPC, we retrieved mouse PrP gene (Prn-p) yeast artificial chromosome (YAC), cosmid, phage, and cDNA clones. Physical mapping positions Prn-p approximately 300 kb from ecotropic virus integration site number 4 (Evi-4), compatible with failure to detect recombination between Prn-p and Evi-4 in genetic crosses. The Prn-pa allele encompasses three exons, with exons 1 and 2 encoding the mRNA 5' untranslated region. Exon 2 has no equivalent in the Syrian hamster and human PrP genes. The Prn-pb gene shares this intron/exon structure but harbors an approximately 6-kb deletion within intron 2. While the Prn-pb open reading frame encodes two amino acid substitutions linked to prolonged scrapie incubation periods, a deletion of intron 2 sequences also characterizes inbred strains such as RIII/S and MOLF/Ei with shorter incubation periods, making a relationship between intron 2 size and scrapie pathogenesis unlikely. The promoter regions of a and b Prn-p alleles include consensus Sp1 and AP-1 sites, as well as other conserved motifs which may represent binding sites for as yet unidentified transcription factors. Images PMID:7912827
nGASP--the nematode genome annotation assessment project.
Coghlan, Avril; Fiedler, Tristan J; McKay, Sheldon J; Flicek, Paul; Harris, Todd W; Blasiar, Darin; Stein, Lincoln D
2008-12-19
While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets across 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with unusually many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs posed the greatest difficulty for gene-finders. This experiment establishes a baseline of gene prediction accuracy in Caenorhabditis genomes, and has guided the choice of gene-finders for the annotation of newly sequenced genomes of Caenorhabditis and other nematode species. We have created new gene sets for C. briggsae, C. remanei, C. brenneri, C. japonica, and Brugia malayi using some of the best-performing gene-finders.
Lineage-specific splicing of a brain-enriched alternative exon promotes glioblastoma progression
Ferrarese, Roberto; Harsh, Griffith R.; Yadav, Ajay K.; Bug, Eva; Maticzka, Daniel; Reichardt, Wilfried; Dombrowski, Stephen M.; Miller, Tyler E.; Masilamani, Anie P.; Dai, Fangping; Kim, Hyunsoo; Hadler, Michael; Scholtens, Denise M.; Yu, Irene L.Y.; Beck, Jürgen; Srinivasasainagendra, Vinodh; Costa, Fabrizio; Baxan, Nicoleta; Pfeifer, Dietmar; von Elverfeldt, Dominik; Backofen, Rolf; Weyerbrock, Astrid; Duarte, Christine W.; He, Xiaolin; Prinz, Marco; Chandler, James P.; Vogel, Hannes; Chakravarti, Arnab; Rich, Jeremy N.; Carro, Maria S.; Bredel, Markus
2014-01-01
Tissue-specific alternative splicing is critical for the emergence of tissue identity during development, yet the role of this process in malignant transformation is undefined. Tissue-specific splicing involves evolutionarily conserved, alternative exons that represent only a minority of the total alternative exons identified. Many of these conserved exons have functional features that influence signaling pathways to profound biological effect. Here, we determined that lineage-specific splicing of a brain-enriched cassette exon in the membrane-binding tumor suppressor annexin A7 (ANXA7) diminishes endosomal targeting of the EGFR oncoprotein, consequently enhancing EGFR signaling during brain tumor progression. ANXA7 exon splicing was mediated by the ribonucleoprotein PTBP1, which is normally repressed during neuronal development. PTBP1 was highly expressed in glioblastomas due to loss of a brain-enriched microRNA (miR-124) and to PTBP1 amplification. The alternative ANXA7 splicing trait was present in precursor cells, suggesting that glioblastoma cells inherit the trait from a potential tumor-initiating ancestor and that these cells exploit this trait through accumulation of mutations that enhance EGFR signaling. Our data illustrate that lineage-specific splicing of a tissue-regulated alternative exon in a constituent of an oncogenic pathway eliminates tumor suppressor functions and promotes glioblastoma progression. This paradigm may offer a general model as to how tissue-specific regulatory mechanisms can reprogram normal developmental processes into oncogenic ones. PMID:24865424
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Novel mechanism of conjoined gene formation in the human genome.
Kim, Ryong Nam; Kim, Aeri; Choi, Sang-Haeng; Kim, Dae-Soo; Nam, Seong-Hyeuk; Kim, Dae-Won; Kim, Dong-Wook; Kang, Aram; Kim, Min-Young; Park, Kun-Hyang; Yoon, Byoung-Ha; Lee, Kang Seon; Park, Hong-Seog
2012-03-01
Recently, conjoined genes (CGs) have emerged as important genetic factors necessary for understanding the human genome. However, their formation mechanism and precise structures have remained mysterious. Based on a detailed structural analysis of 57 human CG transcript variants (CGTVs, discovered in this study) and all (833) known CGs in the human genome, we discovered that the poly(A) signal site from the upstream parent gene region is completely removed via the skipping or truncation of the final exon; consequently, CG transcription is terminated at the poly(A) signal site of the downstream parent gene. This result led us to propose a novel mechanism of CG formation: the complete removal of the poly(A) signal site from the upstream parent gene is a prerequisite for the CG transcriptional machinery to continue transcribing uninterrupted into the intergenic region and downstream parent gene. The removal of the poly(A) signal sequence from the upstream gene region appears to be caused by a deletion or truncation mutation in the human genome rather than post-transcriptional trans-splicing events. With respect to the characteristics of CG sequence structures, we found that intergenic regions are hot spots for novel exon creation during CGTV formation and that exons farther from the intergenic regions are more highly conserved in the CGTVs. Interestingly, many novel exons newly created within the intergenic and intragenic regions originated from transposable element sequences. Additionally, the CGTVs showed tumor tissue-biased expression. In conclusion, our study provides novel insights into the CG formation mechanism and expands the present concepts of the genetic structural landscape, gene regulation, and gene formation mechanisms in the human genome.
Jayashree, B; Jagadeesh, V T; Hoisington, D
2008-05-01
The availability of complete, annotated genomic sequence information in model organisms is a rich resource that can be extended to understudied orphan crops through comparative genomic approaches. We report here a software tool (cisprimertool) for the identification of conserved intron scanning regions using expressed sequence tag alignments to a completely sequenced model crop genome. The method used is based on earlier studies reporting the assessment of conserved intron scanning primers (called CISP) within relatively conserved exons located near exon-intron boundaries from onion, banana, sorghum and pearl millet alignments with rice. The tool is freely available to academic users at http://www.icrisat.org/gt-bt/CISPTool.htm. © 2007 ICRISAT.
Molecular cloning and characterization of sea bass (Dicentrarchus labrax, L.) Tapasin.
Pinto, Rute D; da Silva, Diogo V; Pereira, Pedro J B; dos Santos, Nuno M S
2012-01-01
Mammalian tapasin (TPN) is a key member of the major histocompatibility complex (MHC) class I antigen presentation pathway, being part of the multi-protein complex called the peptide loading complex (PLC). Several studies describe its important roles in stabilizing empty MHC class I complexes, facilitating peptide loading and editing the repertoire of bound peptides, with impact on CD8(+) T cell immune responses. In this work, the gene and cDNA of the sea bass (Dicentrarchus labrax) glycoprotein TPN have been isolated and characterized. The coding sequence has a 1329 bp ORF encoding a 442-residue precursor protein with a predicted 24-amino acid leader peptide, generating a 418-amino acid mature form that retains a conserved N-glycosylation site, three conserved mammalian tapasin motifs, two Ig superfamily domains, a transmembrane domain and an ER-retention di-lysine motif at the C-terminus, suggestive of a function similar to mammalian tapasins. Similar to the human counterpart, the sea bass TPN gene comprises 8 exons, some of which correspond to separate functional domains of the protein. A three-dimensional homology model of sea bass tapasin was calculated and is consistent with the structural features described for the human molecule. Together, these results support the concept that the basic structure of TPN has been maintained through evolution. Moreover, the present data provides information that will allow further studies on cell-mediated immunity and class I antigen presentation pathway in particular, in this important fish species. Copyright © 2011 Elsevier Ltd. All rights reserved.
Kongchum, Pawapol; Hallerman, Eric M; Hulata, Gideon; David, Lior; Palti, Yniv
2011-01-01
Induction of innate immune pathways is critical for early host defense, but there is limited understanding of how teleost fishes recognize pathogen molecules and activate these pathways. In mammals, cells of the innate immune system detect pathogenic molecular structures using pattern recognition receptors (PRRs). TLR9 functions as a PRR that recognizes CpG motifs in bacterial and viral DNA and requires adaptor molecules MyD88 and TRAF6 for signal transduction. Here we report full-length cDNA isolation, structural characterization and tissue mRNA expression analysis of the common carp (cc) TLR9, MyD88 and TRAF6 gene orthologs. The ccTLR9 open-reading frame (ORF) is predicted to encode a 1064-amino acid (aa) protein. We found that MyD88 and TRAF6 genes are duplicated in common carp. This is the first report of TRAF6 duplication in a vertebrate genome and stronger evidence in support of MyD88 duplication is provided. The ccMyD88a and b ORFs are predicted to encode 288-aa and 284-aa peptides, respectively. They share 91% aa sequence identity between paralogs. The ccTRAF6a and b ORFs are both predicted to encode 543-aa peptides sharing 95% aa sequence identity between paralogs. The ccTLR9 gene is contained in a single large exon. The ccMyD88a and ccMyD88b coding sequences span five exons. The TRAF6b gene spans six exons. PCR amplification to obtain the entire coding sequence of ccTRAF6a gene was not successful. The 2104-bp fragment amplified covers the 3' end of the gene and it contains a partial sequence of one exon and three complete exons. The predicated protein domains of the ccTLR9, ccMyD88 and ccTRAF6 are conserved and resemble orthologs from other vertebrates. Real-time quantitative PCR assays of the ccTLR9, MyD88a and b, and TRAF6a and b gene transcripts in healthy common carp indicated that mRNA expression varied between tissues. Differential expression of duplicate copies were found for ccMyD88 and ccTRAF6 in white and red muscle tissues, suggesting that paralogs may have evolved and attained a new function. The genomic information we describe in this paper provides evidence of sequence and structural conservation of immune response genes in common carp. Published by Elsevier Ltd.
Molecular cloning and characterization of sea bass (Dicentrarchus labrax, L.) calreticulin.
Pinto, Rute D; Moreira, Ana R; Pereira, Pedro J B; dos Santos, Nuno M S
2013-06-01
Mammalian calreticulin (CRT) is a key molecular chaperone and regulator of Ca(2+) homeostasis in endoplasmic reticulum (ER), also being implicated in a variety of physiological/pathological processes outside the ER. Importantly, it is involved in assembly of MHC class I molecules. In this work, sea bass (Dicentrarchus labrax) CRT (Dila-CRT) gene and cDNA have been isolated and characterized. The mature protein retains two conserved motifs, three structural/functional domains (N, P and C), three type 1 and 2 motifs repeated in tandem, a conserved pair of cysteines and ER-retention motif. It is a single-copy gene composed of 9 exons. Dila-CRT three-dimensional homology models are consistent with the structural features described for mammalian molecules. Together, these results are supportive of a highly conserved structure of CRT through evolution. Moreover, the present data provides information that will allow further studies on sea bass CRT involvement in immunity and in particular class I antigen presentation. Copyright © 2013 Elsevier Ltd. All rights reserved.
Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martin, L.H.; Calabi, F.; Lefebvre, F.A.
1987-12-01
The CD1 human antigens are a family of at least three components, CD1a, CD1b, and CD1c, that are characteristic of the cortical stage of thymocyte maturation. CD1a was originally named HTA1 or T6 and thought to be the human equivalent of mouse Tla. The genes coding for all three have not been identified by transfection into mouse cells. The transfectants express the surface antigens that can then be recognized by the corresponding cluster of monoclonal antibodies used to define the three members of CD1. The full sequence of the genomic DNA is described for all three. The intron-exon structure ofmore » CD1a is deduced by comparison with a near-full-length cDNA clone. Similar structures are proposed for the other two, largely based on sequence homology. An unusually long 5'-untranslated exon (280 bases long) is highly conserved between the three genes, suggesting an important but unknown function. CD1c has a duplicated form of this exon that is thought to be spliced out. The major homology between the three antigens is in the ..beta../sub 2/-microglobulin-binding-domain. The general relatedness to major histocompatibility complex class I and class II molecules is significant but low, with no section of higher homology to mouse Tla.« less
RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts
Sanchez-Pulido, Luis; Haerty, Wilfried
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein–protein interactions. PMID:25524026
In silico study of breast cancer associated gene 3 using LION Target Engine and other tools.
León, Darryl A; Cànaves, Jaume M
2003-12-01
Sequence analysis of individual targets is an important step in annotation and validation. As a test case, we investigated human breast cancer associated gene 3 (BCA3) with LION Target Engine and with other bioinformatics tools. LION Target Engine confirmed that the BCA3 gene is located on 11p15.4 and that the two most likely splice variants (lacking exon 3 and exons 3 and 5, respectively) exist. Based on our manual curation of sequence data, it is proposed that an additional variant (missing only exon 5) published in a public sequence repository, is a prediction artifact. A significant number of new orthologs were also identified, and these were the basis for a high-quality protein secondary structure prediction. Moreover, our research confirmed several distinct functional domains as described in earlier reports. Sequence conservation from multiple sequence alignments, splice variant identification, secondary structure predictions, and predicted phosphorylation sites suggest that the removal of interaction sites through alternative splicing might play a modulatory role in BCA3. This in silico approach shows the depth and relevance of an analysis that can be accomplished by including a variety of publicly available tools with an integrated and customizable life science informatics platform.
Multi-step splicing of sphingomyelin synthase linear and circular RNAs.
Filippenkov, Ivan B; Sudarkina, Olga Yu; Limborska, Svetlana A; Dergunova, Lyudmila V
2018-05-15
The SGMS1 gene encodes the enzyme sphingomyelin synthase 1 (SMS1), which is involved in the regulation of lipid metabolism, apoptosis, intracellular vesicular transport and other significant processes. The SGMS1 gene is located on chromosome 10 and has a size of 320 kb. Previously, we showed that dozens of alternative transcripts of the SGMS1 gene are present in various human tissues. In addition to mRNAs that provide synthesis of the SMS1 protein, this gene participates in the synthesis of non-coding transcripts, including circular RNAs (circRNAs), which include exons of the 5'-untranslated region (5'-UTR) and are highly represented in the brain. In this study, using the high-throughput technology RNA-CaptureSeq, many new SGMS1 transcripts were identified, including both intronic unspliced RNAs (premature RNAs) and RNAs formed via alternative splicing. Recursive exons (RS-exons) that can participate in the multi-step splicing of long introns of the gene were also identified. These exons participate in the formation of circRNAs. Thus, multi-step splicing may provide a variety of linear and circular RNAs of eukaryotic genes in tissues. Copyright © 2018 Elsevier B.V. All rights reserved.
Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales.
Palumbi, S R; Baker, C S
1994-05-01
Powerful analyses of population structure require information from multiple genetic loci. To help develop a molecular toolbox for obtaining this information, we have designed universal oligonucleotide primers that span conserved intron-exon junctions in a wide variety of animal phyla. We test the utility of exon-primed, intron-crossing amplifications by analyzing the variability of actin intron sequences from humpback, blue, and bowhead whales and comparing the results with mitochondrial DNA (mtDNA) haplotype data. Humpback actin introns fall into two major clades that exist in different frequencies in different oceanic populations. It is surprising that Hawaii and California populations, which are very distinct in mtDNAs, are similar in actin intron alleles. This discrepancy between mtDNA and nuclear DNA results may be due either to differences in genetic drift in mitochondrial and nuclear genes or to preferential movement of males, which do not transmit mtDNA to offspring, between separate breeding grounds. Opposing mtDNA and nuclear DNA results can help clarify otherwise hidden patterns of structure in natural populations.
Splicing predictions reliably classify different types of alternative splicing
Busch, Anke; Hertel, Klemens J.
2015-01-01
Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853
Li, Yang I; Sanchez-Pulido, Luis; Haerty, Wilfried; Ponting, Chris P
2015-01-01
Ninety-four percent of mammalian protein-coding exons exceed 51 nucleotides (nt) in length. The paucity of micro-exons (≤ 51 nt) suggests that their recognition and correct processing by the splicing machinery present greater challenges than for longer exons. Yet, because thousands of human genes harbor processed micro-exons, specialized mechanisms may be in place to promote their splicing. Here, we survey deep genomic data sets to define 13,085 micro-exons and to study their splicing mechanisms and molecular functions. More than 60% of annotated human micro-exons exhibit a high level of sequence conservation, an indicator of functionality. While most human micro-exons require splicing-enhancing genomic features to be processed, the splicing of hundreds of micro-exons is enhanced by the adjacent binding of splice factors in the introns of pre-messenger RNAs. Notably, splicing of a significant number of micro-exons was found to be facilitated by the binding of RBFOX proteins, which promote their inclusion in the brain, muscle, and heart. Our analyses suggest that accurate regulation of micro-exon inclusion by RBFOX proteins and PTBP1 plays an important role in the maintenance of tissue-specific protein-protein interactions. © 2015 Li et al.; Published by Cold Spring Harbor Laboratory Press.
Ehrmann, Ingrid; Dalgliesh, Caroline; Liu, Yilei; Danilenko, Marina; Crosier, Moira; Overman, Lynn; Arthur, Helen M.; Lindsay, Susan; Clowry, Gavin J.; Venables, Julian P.; Fort, Philippe; Elliott, David J.
2013-01-01
The RNA binding protein T-STAR was created following a gene triplication 520–610 million years ago, which also produced its two parologs Sam68 and SLM-1. Here we have created a T-STAR null mouse to identify the endogenous functions of this RNA binding protein. Mice null for T-STAR developed normally and were fertile, surprisingly, given the high expression of T-STAR in the testis and the brain, and the known infertility and pleiotropic defects of Sam68 null mice. Using a transcriptome-wide search for splicing targets in the adult brain, we identified T-STAR protein as a potent splicing repressor of the alternatively spliced segment 4 (AS4) exons from each of the Neurexin1-3 genes, and exon 23 of the Stxbp5l gene. T-STAR protein was most highly concentrated in forebrain-derived structures like the hippocampus, which also showed maximal Neurexin1-3 AS4 splicing repression. In the absence of endogenous T-STAR protein, Nrxn1-3 AS4 splicing repression dramatically decreased, despite physiological co-expression of Sam68. In transfected cells Neurexin3 AS4 alternative splicing was regulated by either T-STAR or Sam68 proteins. In contrast, Neurexin2 AS4 splicing was only regulated by T-STAR, through a UWAA-rich response element immediately downstream of the regulated exon conserved since the radiation of bony vertebrates. The AS4 exons in the Nrxn1 and Nrxn3 genes were also associated with distinct patterns of conserved UWAA repeats. Consistent with an ancient mechanism of splicing control, human T-STAR protein was able to repress splicing inclusion of the zebrafish Nrxn3 AS4 exon. Although Neurexin1-3 and Stxbp5l encode critical synaptic proteins, T-STAR null mice had no detectable spatial memory deficits, despite an almost complete absence of AS4 splicing repression in the hippocampus. Our work identifies T-STAR as an ancient and potent tissue-specific splicing regulator that uses a concentration-dependent mechanism to co-ordinately regulate regional splicing patterns of the Neurexin1-3 AS4 exons in the mouse brain. PMID:23637638
Evolution of Nova-Dependent Splicing Regulation in the Brain
Živin, Marko; Darnell, Robert B
2007-01-01
A large number of alternative exons are spliced with tissue-specific patterns, but little is known about how such patterns have evolved. Here, we study the conservation of the neuron-specific splicing factors Nova1 and Nova2 and of the alternatively spliced exons they regulate in mouse brain. Whereas Nova RNA binding domains are 94% identical across vertebrate species, Nova-dependent splicing silencer and enhancer elements (YCAY clusters) show much greater divergence, as less than 50% of mouse YCAY clusters are conserved at orthologous positions in the zebrafish genome. To study the relation between the evolution of tissue-specific splicing and YCAY clusters, we compared the brain-specific splicing of Nova-regulated exons in zebrafish, chicken, and mouse. The presence of YCAY clusters in lower vertebrates invariably predicted conservation of brain-specific splicing across species, whereas their absence in lower vertebrates correlated with a loss of alternative splicing. We hypothesize that evolution of Nova-regulated splicing in higher vertebrates proceeds mainly through changes in cis-acting elements, that tissue-specific splicing might in some cases evolve in a single step corresponding to evolution of a YCAY cluster, and that the conservation level of YCAY clusters relates to the functions encoded by the regulated RNAs. PMID:17937501
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-01-01
Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K
2008-10-28
The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Chamala, Srikar; Feng, Guanqiao; Chavarro, Carolina; Barbazuk, W. Brad
2015-01-01
Alternative splicing (AS) plays important roles in many plant functions, but its conservation across the plant kingdom is not known. We describe a methodology to identify AS events and identify conserved AS events across large phylogenetic distances using RNA-Seq datasets. We applied this methodology to transcriptome data from nine angiosperms including Amborella, the single sister species to all other extant flowering plants. AS events within 40–70% of the expressed multi-exonic genes per species were found, 27,120 of which are conserved among two or more of the taxa studied. While many events are species specific, many others are shared across long evolutionary distances suggesting they have functional significance. Conservation of AS event data provides an estimate of the number of ancestral AS events present at each node of the tree representing the nine species studied. Furthermore, the presence or absence of AS isoforms between species with different whole genome duplication (WGD) histories provides the opportunity to examine the impact of WDG on AS potential. Examining AS in gene families identifies those with high rates of AS, and conservation can distinguish ancient events vs. recent or species specific adaptations. The MADS-box and SR protein families are found to represent families with low and high occurrences of AS, respectively, yet their AS events were likely present in the MRCA of angiosperms. PMID:25859541
Functional understanding of the diverse exon-intron structures of human GPCR genes.
Hammond, Dorothy A; Olman, Victor; Xu, Ying
2014-02-01
The GPCR genes have a variety of exon-intron structures even though their proteins are all structurally homologous. We have examined all human GPCR genes with at least two functional protein isoforms, totaling 199, aiming to gain an understanding of what may have contributed to the large diversity of the exon-intron structures of the GPCR genes. The 199 genes have a total of 808 known protein splicing isoforms with experimentally verified functions. Our analysis reveals that 1301 (80.6%) adjacent exon-exon pairs out of the total of 1,613 in the 199 genes have either exactly one exon skipped or the intron in-between retained in at least one of the 808 protein splicing isoforms. This observation has a statistical significance p-value of 2.051762 * e(-09), assuming that the observed splicing isoforms are independent of the exon-intron structures. Our interpretation of this observation is that the exon boundaries of the GPCR genes are not randomly determined; instead they may be selected to facilitate specific alternative splicing for functional purposes.
Multi-Hamiltonian structure of the Born-Infeld equation
NASA Astrophysics Data System (ADS)
Arik, Metin; Neyzi, Fahrünisa; Nutku, Yavuz; Olver, Peter J.; Verosky, John M.
1989-06-01
The multi-Hamiltonian structure, conservation laws, and higher order symmetries for the Born-Infeld equation are exhibited. A new transformation of the Born-Infeld equation to the equations of a Chaplygin gas is presented and explored. The Born-Infeld equation is distinguished among two-dimensional hyperbolic systems by its wealth of such multi-Hamiltonian structures.
Regions of extreme synonymous codon selection in mammalian genes
Schattner, Peter; Diekhans, Mark
2006-01-01
Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Westholm, Jakub O.; Miura, Pedro; Olson, Sara; Shenker, Sol; Joseph, Brian; Sanfilippo, Piero; Celniker, Susan E.; Graveley, Brenton R.; Lai, Eric C.
2014-01-01
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues and cultured cells, to rigorously annotate >2500 fruitfly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs, and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase dramatically relative to linear isoforms during CNS aging, and constitute a novel aging biomarker. PMID:25544350
Westholm, Jakub O.; Miura, Pedro; Olson, Sara; ...
2014-11-26
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and codingmore » conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Westholm, Jakub O.; Miura, Pedro; Olson, Sara
Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and codingmore » conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker.« less
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript
Rose, Dominic; Stadler, Peter F.
2011-01-01
Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H
2006-04-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.
2006-01-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort
Gambin, Tomasz; Akdemir, Zeynep C.; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M.B.; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M.; Eldomery, Mohammad K.; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W.; Boerwinkle, Eric; Beaudet, Arthur L.; Gibbs, Richard A.
2017-01-01
Abstract We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor–Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17–50% of pathogenic CNVs in different disease cohorts where 7.1–11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. PMID:27980096
Structural organization and mutational analysis of the human uncoupling protein-2 (hUCP2) gene.
Tu, N; Chen, H; Winnikes, U; Reinert, I; Marmann, G; Pirke, K M; Lentes, K U
1999-01-01
Uncoupling proteins (UCPs) are mitochondrial membrane transporters which are involved in dissipating the proton electrochemical gradient thereby releasing stored energy as heat. This implies a major role of UCPs in energy metabolism and thermogenesis which when deregulated are key risk factors for the development of obesity and other eating disorders. From the three different human UCPs identified so far by gene cloning both UCP2 and UCP3 were mapped in close proximity (75-150 kb) to regions of human chromosome 11 (11q13) that have been linked to obesity and hyperinsulinaemia. At the amino acid level hUCP2 has about 55% identity to hUCP1 while hUCP3 is 71% identical to hUCP2. In this study we have deduced the genomic structure of the human UCP2 gene by PCR and direct sequence analysis. The hUCP2 gene spans over 8.7 kb distributed on 8 exons. The localization of the exon/intron boundaries within the coding region matches precisely that of the hUCP1 gene and is almost conserved in the recently discovered hUCP3 gene as well. The high degree of homology at the nucleotide level and the conservation of the exon /intron boundaries among the three UCP genes suggests that they may have evolved from a common ancestor or are the result from gene duplication events. Mutational analysis of the hUCP2 gene in a cohort of 172 children (aged 7 - 13) of Caucasian origin revealed a polymorphism in exon 4 (C to T transition at position 164 of the cDNA resulting in the substitution of an alanine by a valine at codon 55) and an insertion polymorphism in exon 8. The insertion polymorphism consists of a 45 bp repeat located 150 bp downstream of the stop codon in the 3'-UTR. The allele frequencies were 0.63 and 0.37 for the alanine and valine encoded alleles, respectively, and 0.71 versus 0.29 for the insertion polymorphism. The allele frequencies of both polymorphisms were not significantly elevated in a subgroup of 25 children characterized by low Resting Metabolic Rates (RMR). So far a direct correlation of the observed genotype with (RMR) and Body Mass Index (BMI) was not evident. Expression studies of the wild type and mutant forms of UCP2 should clarify the functional consequences these polymorphisms may have on energy metabolism and body weight regulation.
Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu
2013-04-01
The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.
ExoLocator--an online view into genetic makeup of vertebrate proteins.
Khoo, Aik Aun; Ogrizek-Tomas, Mario; Bulovic, Ana; Korpar, Matija; Gürler, Ece; Slijepcevic, Ivan; Šikic, Mile; Mihalek, Ivana
2014-01-01
ExoLocator (http://exolocator.eopsf.org) collects in a single place information needed for comparative analysis of protein-coding exons from vertebrate species. The main source of data--the genomic sequences, and the existing exon and homology annotation--is the ENSEMBL database of completed vertebrate genomes. To these, ExoLocator adds the search for ostensibly missing exons in orthologous protein pairs across species, using an extensive computational pipeline to narrow down the search region for the candidate exons and find a suitable template in the other species, as well as state-of-the-art implementations of pairwise alignment algorithms. The resulting complements of exons are organized in a way currently unique to ExoLocator: multiple sequence alignments, both on the nucleotide and on the peptide levels, clearly indicating the exon boundaries. The alignments can be inspected in the web-embedded viewer, downloaded or used on the spot to produce an estimate of conservation within orthologous sets, or functional divergence across paralogues.
Colombo, Elisa Adele; Spaccini, Luigina; Volpi, Ludovica; Negri, Gloria; Cittaro, Davide; Lazarevic, Dejan; Zirpoli, Salvatore; Farolfi, Andrea; Gervasini, Cristina; Cubellis, Maria Vittoria; Larizza, Lidia
2016-10-07
Integrin α3 (ITGA3) gene mutations are associated with Interstitial Lung disease, Nephrotic syndrome and Epidermolysis bullosa (ILNEB syndrome). To date only six patients are reported: all carried homozygous ITGA3 mutations and presented a dramatically severe phenotype leading to death before age 2 years, from multi-organ failure due to interstitial lung disease and congenital nephrotic syndrome. The involvement of skin and cutaneous adnexa was variable with sparse hair and nail dysplasia combined or not to skin lesions ranging from skin fragility to epidermolysis bullosa-like blistering. We report on two siblings of 13 and 9 years born to non-consanguineous healthy parents, who display growth delay, severe pulmonary fibrosis with fatigue, dyspnea on exertion and wheezing, atrophic skin with erythematosus lesions, rare eyelashes/eyebrows and pachyonychia. By exome sequencing, we identified two unreported ITGA3 missense mutations, c.373G>A (p.(G125R)) in exon 3 and c.821G>A (p.(R274Q)) in exon 6, affecting highly conserved residues in the integrin α3 extracellular N-terminal β-propeller domain. Homology modelling of α3β1 heterodimer fragment, encompassing the mutation sites, showed that G125 plays a pivotal structural role in the β-propeller, while R274 might prevent the interaction between integrin and urokinase complex. We report a variant of ILNEB syndrome in two siblings differing from the previously reported patients in the lack of nephrotic impairment and survival beyond childhood. Our siblings are the first reported compound heterozygous for ITGA3 mutations; this state as well as the hypomorphic nature of their p.(R274Q) mutation likely account for their survival.
Bornert, Olivier; Kühl, Tobias; Bremer, Jeroen; van den Akker, Peter C; Pasmooij, Anna MG; Nyström, Alexander
2016-01-01
Genetically evoked deficiency of collagen VII causes dystrophic epidermolysis bullosa (DEB)—a debilitating disease characterized by chronic skin fragility and progressive fibrosis. Removal of exons carrying frame-disrupting mutations can reinstate protein expression in genetic diseases. The therapeutic potential of this approach is critically dependent on gene, protein, and disease intrinsic factors. Naturally occurring exon skipping in COL7A1, translating collagen VII, suggests that skipping of exons containing disease-causing mutations may be feasible for the treatment of DEB. However, despite a primarily in-frame arrangement of exons in the COL7A1 gene, no general conclusion of the aptitude of exon skipping for DEB can be drawn, since regulation of collagen VII functionality is complex involving folding, intra- and intermolecular interactions. To directly address this, we deleted two conceptually important exons located at both ends of COL7A1, exon 13, containing recurrent mutations, and exon 105, predicted to impact folding. The resulting recombinantly expressed proteins showed conserved functionality in biochemical and in vitro assays. Injected into DEB mice, the proteins promoted skin stability. By demonstrating functionality of internally deleted collagen VII variants, our study provides support of targeted exon deletion or skipping as a potential therapy to treat a large number of individuals with DEB. PMID:27157667
MACF1 gene structure: a hybrid of plectin and dystrophin.
Gong, T W; Besirli, C G; Lomax, M I
2001-11-01
Mammalian MACF1 (Macrophin1; previously named ACF7) is a giant cytoskeletal linker protein with three known isoforms that arise by alternative splicing. We isolated a 19.1-kb cDNA encoding a fourth isoform (MACF1-4) with a unique N-terminus. Instead of an N-terminal actin-binding domain found in the other three isoforms, MACF1-4 has eight plectin repeats. The MACF1 gene is located on human Chr 1p32, contains at least 102 exons, spans over 270 kb, and gives rise to four major isoforms with different N-termini. The genomic organization of the actin-binding domain is highly conserved in mammalian genes for both plectin and BPAG1. All eight plectin repeats are encoded by one large exon; this feature is similar to the genomic structure of plectin. The intron positions within spectrin repeats in MACF1 are very similar to those in the dystrophin gene. This demonstrates that MACF1 has characteristic features of genes for two classes of cytoskeletal proteins, i.e., plectin and dystrophin.
Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort.
Gambin, Tomasz; Akdemir, Zeynep C; Yuan, Bo; Gu, Shen; Chiang, Theodore; Carvalho, Claudia M B; Shaw, Chad; Jhangiani, Shalini; Boone, Philip M; Eldomery, Mohammad K; Karaca, Ender; Bayram, Yavuz; Stray-Pedersen, Asbjørg; Muzny, Donna; Charng, Wu-Lin; Bahrambeigi, Vahid; Belmont, John W; Boerwinkle, Eric; Beaudet, Arthur L; Gibbs, Richard A; Lupski, James R
2017-02-28
We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor-Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17-50% of pathogenic CNVs in different disease cohorts where 7.1-11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Biaoyang; Nasir, J.; Kalchman, M.A.
1995-02-10
We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less
Yu, Xianxian; Duan, Xiaoshan; Zhang, Rui; Fu, Xuehao; Ye, Lingling; Kong, Hongzhi; Xu, Guixia; Shan, Hongyan
2016-01-01
AP1/FUL, SEP, AGL6, and FLC subfamily genes play important roles in flower development. The phylogenetic relationships among them, however, have been controversial, which impedes our understanding of the origin and functional divergence of these genes. One possible reason for the controversy may be the problems caused by changes in the exon-intron structure of genes, which, according to recent studies, may generate non-homologous sites and hamper the homology-based sequence alignment. In this study, we first performed exon-by-exon alignments of these and three outgroup subfamilies (SOC1, AG, and STK). Phylogenetic trees reconstructed based on these matrices show improved resolution and better congruence with species phylogeny. In the context of these phylogenies, we traced evolutionary changes of exon-intron structures in each subfamily. We found that structural changes have occurred frequently following gene duplication and speciation events. Notably, exons 7 and 8 (if present) suffered more structural changes than others. With the knowledge of exon-intron structural changes, we generated more reasonable alignments containing all the focal subfamilies. The resulting trees showed that the SEP subfamily is sister to the monophyletic group formed by AP1/FUL and FLC subfamily genes and that the AGL6 subfamily forms a sister group to the three abovementioned subfamilies. Based on this topology, we inferred the evolutionary history of exon-intron structural changes among different subfamilies. Particularly, we found that the eighth exon originated before the divergence of AP1/FUL, FLC, SEP, and AGL6 subfamilies and degenerated in the ancestral FLC-like gene. These results provide new insights into the origin and evolution of the AP1/FUL, FLC, SEP, and AGL6 subfamilies. PMID:27200066
Circular RNAs are abundant, conserved, and associated with ALU repeats
Jeck, William R.; Sorrentino, Jessica A.; Wang, Kai; Slevin, Michael K.; Burd, Christin E.; Liu, Jinze; Marzluff, William F.; Sharpless, Norman E.
2013-01-01
Circular RNAs composed of exonic sequence have been described in a small number of genes. Thought to result from splicing errors, circular RNA species possess no known function. To delineate the universe of endogenous circular RNAs, we performed high-throughput sequencing (RNA-seq) of libraries prepared from ribosome-depleted RNA with or without digestion with the RNA exonuclease, RNase R. We identified >25,000 distinct RNA species in human fibroblasts that contained non-colinear exons (a “backsplice”) and were reproducibly enriched by exonuclease degradation of linear RNA. These RNAs were validated as circular RNA (ecircRNA), rather than linear RNA, and were more stable than associated linear mRNAs in vivo. In some cases, the abundance of circular molecules exceeded that of associated linear mRNA by >10-fold. By conservative estimate, we identified ecircRNAs from 14.4% of actively transcribed genes in human fibroblasts. Application of this method to murine testis RNA identified 69 ecircRNAs in precisely orthologous locations to human circular RNAs. Of note, paralogous kinases HIPK2 and HIPK3 produce abundant ecircRNA from their second exon in both humans and mice. Though HIPK3 circular RNAs contain an AUG translation start, it and other ecircRNAs were not bound to ribosomes. Circular RNAs could be degraded by siRNAs and, therefore, may act as competing endogenous RNAs. Bioinformatic analysis revealed shared features of circularized exons, including long bordering introns that contained complementary ALU repeats. These data show that ecircRNAs are abundant, stable, conserved and nonrandom products of RNA splicing that could be involved in control of gene expression. PMID:23249747
Identification and analysis of multigene families by comparison of exon fingerprints.
Brown, N P; Whittaker, A J; Newell, W R; Rawlings, C J; Beck, S
1995-06-02
Gene families are often recognised by sequence homology using similarity searching to find relationships, however, genomic sequence data provides gene architectural information not used by conventional search methods. In particular, intron positions and phases are expected to be relatively conserved features, because mis-splicing and reading frame shifts should be selected against. A fast search technique capable of detecting possible weak sequence homologies apparent at the intron/exon level of gene organization is presented for comparing spliceosomal genes and gene fragments. FINEX compares strings of exons delimited by intron/exon boundary positions and intron phases (exon fingerprint) using a global dynamic programming algorithm with a combined intron phase identity and exon size dissimilarity score. Exon fingerprints are typically two orders of magnitude smaller than their nucleic acid sequence counterparts giving rise to fast search times: a ranked search against a library of 6755 fingerprints for a typical three exon fingerprint completes in under 30 seconds on an ordinary workstation, while a worst case largest fingerprint of 52 exons completes in just over one minute. The short "sequence" length of exon fingerprints in comparisons is compensated for by the large exon alphabet compounded of intron phase types and a wide range of exon sizes, the latter contributing the most information to alignments. FINEX performs better in some searches than conventional methods, finding matches with similar exon organization, but low sequence homology. A search using a human serum albumin finds all members of the multigene family in the FINEX database at the top of the search ranking, despite very low amino acid percentage identities between family members. The method should complement conventional sequence searching and alignment techniques, offering a means of identifying otherwise hard to detect homologies where genomic data are available.
Dai, Gucan; Sherpa, Tshering; Varnum, Michael D.
2014-01-01
Precursor mRNA encoding CNGA3 subunits of cone photoreceptor cyclic nucleotide-gated (CNG) channels undergoes alternative splicing, generating isoforms differing in the N-terminal cytoplasmic region of the protein. In humans, four variants arise from alternative splicing, but the functional significance of these changes has been a persistent mystery. Heterologous expression of the four possible CNGA3 isoforms alone or with CNGB3 subunits did not reveal significant differences in basic channel properties. However, inclusion of optional exon 3, with or without optional exon 5, produced heteromeric CNGA3 + CNGB3 channels exhibiting an ∼2-fold greater shift in K1/2,cGMP after phosphatidylinositol 4,5-biphosphate or phosphatidylinositol 3,4,5-trisphosphate application compared with channels lacking the sequence encoded by exon 3. We have previously identified two structural features within CNGA3 that support phosphoinositides (PIPn) regulation of cone CNG channels: N- and C-terminal regulatory modules. Specific mutations within these regions eliminated PIPn sensitivity of CNGA3 + CNGB3 channels. The exon 3 variant enhanced the component of PIPn regulation that depends on the C-terminal region rather than the nearby N-terminal region, consistent with an allosteric effect on PIPn sensitivity because of altered N-C coupling. Alternative splicing of CNGA3 occurs in multiple species, although the exact variants are not conserved across CNGA3 orthologs. Optional exon 3 appears to be unique to humans, even compared with other primates. In parallel, we found that a specific splice variant of canine CNGA3 removes a region of the protein that is necessary for high sensitivity to PIPn. CNGA3 alternative splicing may have evolved, in part, to tune the interactions between cone CNG channels and membrane-bound phosphoinositides. PMID:24675082
Dai, Gucan; Sherpa, Tshering; Varnum, Michael D
2014-05-09
Precursor mRNA encoding CNGA3 subunits of cone photoreceptor cyclic nucleotide-gated (CNG) channels undergoes alternative splicing, generating isoforms differing in the N-terminal cytoplasmic region of the protein. In humans, four variants arise from alternative splicing, but the functional significance of these changes has been a persistent mystery. Heterologous expression of the four possible CNGA3 isoforms alone or with CNGB3 subunits did not reveal significant differences in basic channel properties. However, inclusion of optional exon 3, with or without optional exon 5, produced heteromeric CNGA3 + CNGB3 channels exhibiting an ∼2-fold greater shift in K1/2,cGMP after phosphatidylinositol 4,5-biphosphate or phosphatidylinositol 3,4,5-trisphosphate application compared with channels lacking the sequence encoded by exon 3. We have previously identified two structural features within CNGA3 that support phosphoinositides (PIPn) regulation of cone CNG channels: N- and C-terminal regulatory modules. Specific mutations within these regions eliminated PIPn sensitivity of CNGA3 + CNGB3 channels. The exon 3 variant enhanced the component of PIPn regulation that depends on the C-terminal region rather than the nearby N-terminal region, consistent with an allosteric effect on PIPn sensitivity because of altered N-C coupling. Alternative splicing of CNGA3 occurs in multiple species, although the exact variants are not conserved across CNGA3 orthologs. Optional exon 3 appears to be unique to humans, even compared with other primates. In parallel, we found that a specific splice variant of canine CNGA3 removes a region of the protein that is necessary for high sensitivity to PIPn. CNGA3 alternative splicing may have evolved, in part, to tune the interactions between cone CNG channels and membrane-bound phosphoinositides.
The Potential for Double-Loop Learning to Enable Landscape Conservation Efforts
NASA Astrophysics Data System (ADS)
Petersen, Brian; Montambault, Jensen; Koopman, Marni
2014-10-01
As conservation increases its emphasis on implementing change at landscape-level scales, multi-agency, cross-boundary, and multi-stakeholder networks become more important. These elements complicate traditional notions of learning. To investigate this further, we examined structures of learning in the Landscape Conservation Cooperatives (LCCs), which include the entire US and its territories, as well as parts of Canada, Mexico, and Caribbean and Pacific island states. We used semi-structured interviews, transcribed and analyzed using NVivo, as well as a charrette-style workshop to understand the difference between the original stated goals of individual LCCs and the values and purposes expressed as the collaboration matured. We suggest double-loop learning as a theoretical framework appropriate to landscape-scale conservation, recognizing that concerns about accountability are among the valid points of view that must be considered in multi-stakeholder collaborations. Methods from the social sciences and public health sectors provide insights on how such learning might be actualized.
Lex-SVM: exploring the potential of exon expression profiling for disease classification.
Yuan, Xiongying; Zhao, Yi; Liu, Changning; Bu, Dongbo
2011-04-01
Exon expression profiling technologies, including exon arrays and RNA-Seq, measure the abundance of every exon in a gene. Compared with gene expression profiling technologies like 3' array, exon expression profiling technologies could detect alterations in both transcription and alternative splicing, therefore they are expected to be more sensitive in diagnosis. However, exon expression profiling also brings higher dimension, more redundancy, and significant correlation among features. Ignoring the correlation structure among exons of a gene, a popular classification method like L1-SVM selects exons individually from each gene and thus is vulnerable to noise. To overcome this limitation, we present in this paper a new variant of SVM named Lex-SVM to incorporate correlation structure among exons and known splicing patterns to promote classification performance. Specifically, we construct a new norm, ex-norm, including our prior knowledge on exon correlation structure to regularize the coefficients of a linear SVM. Lex-SVM can be solved efficiently using standard linear programming techniques. The advantage of Lex-SVM is that it can select features group-wisely, force features in a subgroup to take equal weihts and exclude the features that contradict the majority in the subgroup. Experimental results suggest that on exon expression profile, Lex-SVM is more accurate than existing methods. Lex-SVM also generates a more compact model and selects genes more consistently in cross-validation. Unlike L1-SVM selecting only one exon in a gene, Lex-SVM assigns equal weights to as many exons in a gene as possible, lending itself easier for further interpretation.
Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B.; Tóth, Gábor; Ortutay, Csaba P.; Patthy, László
2005-01-01
DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21 061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically. PMID:15608291
Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B; Tóth, Gábor; Ortutay, Csaba P; Patthy, László
2005-01-01
DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21,061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically.
Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael
2013-01-01
Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343
Dynamic and Widespread lncRNA Expression in a Sponge and the Origin of Animal Complexity
Gaiti, Federico; Fernandez-Valverde, Selene L.; Nakanishi, Nagayasu; Calcino, Andrew D.; Yanai, Itai; Tanurdzic, Milos; Degnan, Bernard M.
2015-01-01
Long noncoding RNAs (lncRNAs) are important developmental regulators in bilaterian animals. A correlation has been claimed between the lncRNA repertoire expansion and morphological complexity in vertebrate evolution. However, this claim has not been tested by examining morphologically simple animals. Here, we undertake a systematic investigation of lncRNAs in the demosponge Amphimedon queenslandica, a morphologically simple, early-branching metazoan. We combine RNA-Seq data across multiple developmental stages of Amphimedon with a filtering pipeline to conservatively predict 2,935 lncRNAs. These include intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, long intergenic nonprotein coding RNAs, and precursors for small RNAs. Sponge lncRNAs are remarkably similar to their bilaterian counterparts in being relatively short with few exons and having low primary sequence conservation relative to protein-coding genes. As in bilaterians, a majority of sponge lncRNAs exhibit typical hallmarks of regulatory molecules, including high temporal specificity and dynamic developmental expression. Specific lncRNA expression profiles correlate tightly with conserved protein-coding genes likely involved in a range of developmental and physiological processes, such as the Wnt signaling pathway. Although the majority of Amphimedon lncRNAs appears to be taxonomically restricted with no identifiable orthologs, we find a few cases of conservation between demosponges in lncRNAs that are antisense to coding sequences. Based on the high similarity in the structure, organization, and dynamic expression of sponge lncRNAs to their bilaterian counterparts, we propose that these noncoding RNAs are an ancient feature of the metazoan genome. These results are consistent with lncRNAs regulating the development of animals, regardless of their level of morphological complexity. PMID:25976353
Kronert, W A; Edwards, K A; Roche, E S; Wells, L; Bernstein, S I
1991-01-01
We show that the molecular lesions in two homozygousviable mutants of the Drosophila muscle myosin heavy chain gene affect an alternative exon (exon 9a) which encodes a portion of the myosin head that is highly conserved among both cytoplasmic and muscle myosins of all organisms. In situ hybridization and Northern blotting analysis in wild-type organisms indicates that exon 9a is used in indirect flight muscles whereas both exons 9a and 9b are utilized in jump muscles. Alternative exons 9b and 9c are used in other larval and adult muscles. One of the mutations in exon 9a is a nonsense allele that greatly reduces myosin RNA stability. It prevents thick filament accumulation in indirect flight muscles and severely reduces the number of thick filaments in a subset of cells of the jump muscles. The second mutation affects the 5' splice site of exon 9a. This results in production of an aberrantly spliced transcript in indirect flight muscles, which prevents thick filament accumulation. Jump muscles of this mutant substitute exon 9b for exon 9a and consequently have normal levels of thick filaments in this muscle type. This isoform substitution does not obviously affect the ultrastructure or function of the jump muscle. Analysis of this mutant illustrates that indirect flight muscles and jump muscles utilize different mechanisms for alternative RNA splicing. Images PMID:1907912
The mini-exon genes of three Phytomonas isolates that differ in plant tissue tropism.
Sturm, N R; Fernandes, O; Campbell, D A
1995-08-01
The tandem mini-exon gene repeat is an ideal diagnostic target for trypanosomatids because it includes sequences that are conserved absolutely coupled with regions of extreme variability. We have exploited these features and the polymerase chain reaction to differentiate Phytomonas strains isolated from phloem, fruit or latex of various host plants. While the transcribed regions are nearly identical, the intergenic sequences are variable in size and content (130-332 base pairs). The mini-exon genes of these phytomonads can therefore be distinguished from each other and from the corresponding genes in insect trypanosomes, with which they are oft confused.
Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A
2005-01-01
Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
Evolution of the alternative AQP2 gene: Acquisition of a novel protein-coding sequence in dolphins.
Kishida, Takushi; Suzuki, Miwa; Takayama, Asuka
2018-01-01
Taxon-specific de novo protein-coding sequences are thought to be important for taxon-specific environmental adaptation. A recent study revealed that bottlenose dolphins acquired a novel isoform of aquaporin 2 generated by alternative splicing (alternative AQP2), which helps dolphins to live in hyperosmotic seawater. The AQP2 gene consists of four exons, but the alternative AQP2 gene lacks the fourth exon and instead has a longer third exon that includes the original third exon and a part of the original third intron. Here, we show that the latter half of the third exon of the alternative AQP2 arose from a non-protein-coding sequence. Intact ORF of this de novo sequence is shared not by all cetaceans, but only by delphinoids. However, this sequence is conservative in all modern cetaceans, implying that this de novo sequence potentially plays important roles for marine adaptation in cetaceans. Copyright © 2017 Elsevier Inc. All rights reserved.
An Abundant Evolutionarily Conserved CSB-PiggyBac Fusion Protein Expressed in Cockayne Syndrome
Newman, John C.; Bailey, Arnold D.; Fan, Hua-Ying; Pavelitz, Thomas; Weiner, Alan M.
2008-01-01
Cockayne syndrome (CS) is a devastating progeria most often caused by mutations in the CSB gene encoding a SWI/SNF family chromatin remodeling protein. Although all CSB mutations that cause CS are recessive, the complete absence of CSB protein does not cause CS. In addition, most CSB mutations are located beyond exon 5 and are thought to generate only C-terminally truncated protein fragments. We now show that a domesticated PiggyBac-like transposon PGBD3, residing within intron 5 of the CSB gene, functions as an alternative 3′ terminal exon. The alternatively spliced mRNA encodes a novel chimeric protein in which CSB exons 1–5 are joined in frame to the PiggyBac transposase. The resulting CSB-transposase fusion protein is as abundant as CSB protein itself in a variety of human cell lines, and continues to be expressed by primary CS cells in which functional CSB is lost due to mutations beyond exon 5. The CSB-transposase fusion protein has been highly conserved for at least 43 Myr since the divergence of humans and marmoset, and appears to be subject to selective pressure. The human genome contains over 600 nonautonomous PGBD3-related MER85 elements that were dispersed when the PGBD3 transposase was last active at least 37 Mya. Many of these MER85 elements are associated with genes which are involved in neuronal development, and are known to be regulated by CSB. We speculate that the CSB-transposase fusion protein has been conserved for host antitransposon defense, or to modulate gene regulation by MER85 elements, but may cause CS in the absence of functional CSB protein. PMID:18369450
PIECE 2.0: an update for the plant gene structure comparison and evolution database
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparision and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron-exon organization and provides valuable insights into the evolution of gene structure in ...
Mass Conservation in Modeling Moisture Diffusion in Multi-Layer Carbon Composite Structures
NASA Technical Reports Server (NTRS)
Nurge, Mark A.; Youngquist, Robert C.; Starr, Stanley O.
2009-01-01
Moisture diffusion in multi-layer carbon composite structures is difficult to model using finite difference methods due to the discontinuity in concentrations between adjacent layers of differing materials. Applying a mass conserving approach at these boundaries proved to be effective at accurately predicting moisture uptake for a sample exposed to a fixed temperature and relative humidity. Details of the model developed are presented and compared with actual moisture uptake data gathered over 130 days from a graphite epoxy composite sandwich coupon with a Rohacell foam core.
Shekarabi, Masoud; Girard, Nathalie; Rivière, Jean-Baptiste; Dion, Patrick; Houle, Martin; Toulouse, André; Lafrenière, Ronald G; Vercauteren, Freya; Hince, Pascale; Laganiere, Janet; Rochefort, Daniel; Faivre, Laurence; Samuels, Mark; Rouleau, Guy A
2008-07-01
Hereditary sensory and autonomic neuropathy type II (HSANII) is an early-onset autosomal recessive disorder characterized by loss of perception to pain, touch, and heat due to a loss of peripheral sensory nerves. Mutations in hereditary sensory neuropathy type II (HSN2), a single-exon ORF originally identified in affected families in Quebec and Newfoundland, Canada, were found to cause HSANII. We report here that HSN2 is a nervous system-specific exon of the with-no-lysine(K)-1 (WNK1) gene. WNK1 mutations have previously been reported to cause pseudohypoaldosteronism type II but have not been studied in the nervous system. Given the high degree of conservation of WNK1 between mice and humans, we characterized the structure and expression patterns of this isoform in mice. Immunodetections indicated that this Wnk1/Hsn2 isoform was expressed in sensory components of the peripheral nervous system and CNS associated with relaying sensory and nociceptive signals, including satellite cells, Schwann cells, and sensory neurons. We also demonstrate that the novel protein product of Wnk1/Hsn2 was more abundant in sensory neurons than motor neurons. The characteristics of WNK1/HSN2 point to a possible role for this gene in the peripheral sensory perception deficits characterizing HSANII.
Shekarabi, Masoud; Girard, Nathalie; Rivière, Jean-Baptiste; Dion, Patrick; Houle, Martin; Toulouse, André; Lafrenière, Ronald G.; Vercauteren, Freya; Hince, Pascale; Laganiere, Janet; Rochefort, Daniel; Faivre, Laurence; Samuels, Mark; Rouleau, Guy A.
2008-01-01
Hereditary sensory and autonomic neuropathy type II (HSANII) is an early-onset autosomal recessive disorder characterized by loss of perception to pain, touch, and heat due to a loss of peripheral sensory nerves. Mutations in hereditary sensory neuropathy type II (HSN2), a single-exon ORF originally identified in affected families in Quebec and Newfoundland, Canada, were found to cause HSANII. We report here that HSN2 is a nervous system–specific exon of the with-no-lysine(K)–1 (WNK1) gene. WNK1 mutations have previously been reported to cause pseudohypoaldosteronism type II but have not been studied in the nervous system. Given the high degree of conservation of WNK1 between mice and humans, we characterized the structure and expression patterns of this isoform in mice. Immunodetections indicated that this Wnk1/Hsn2 isoform was expressed in sensory components of the peripheral nervous system and CNS associated with relaying sensory and nociceptive signals, including satellite cells, Schwann cells, and sensory neurons. We also demonstrate that the novel protein product of Wnk1/Hsn2 was more abundant in sensory neurons than motor neurons. The characteristics of WNK1/HSN2 point to a possible role for this gene in the peripheral sensory perception deficits characterizing HSANII. PMID:18521183
RNA editing in nascent RNA affects pre-mRNA splicing
Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni
2018-01-01
In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3′ acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. PMID:29724793
RNA editing in nascent RNA affects pre-mRNA splicing.
Hsiao, Yun-Hua Esther; Bahn, Jae Hoon; Yang, Yun; Lin, Xianzhi; Tran, Stephen; Yang, Ei-Wen; Quinones-Valdez, Giovanni; Xiao, Xinshu
2018-06-01
In eukaryotes, nascent RNA transcripts undergo an intricate series of RNA processing steps to achieve mRNA maturation. RNA editing and alternative splicing are two major RNA processing steps that can introduce significant modifications to the final gene products. By tackling these processes in isolation, recent studies have enabled substantial progress in understanding their global RNA targets and regulatory pathways. However, the interplay between individual steps of RNA processing, an essential aspect of gene regulation, remains poorly understood. By sequencing the RNA of different subcellular fractions, we examined the timing of adenosine-to-inosine (A-to-I) RNA editing and its impact on alternative splicing. We observed that >95% A-to-I RNA editing events occurred in the chromatin-associated RNA prior to polyadenylation. We report about 500 editing sites in the 3' acceptor sequences that can alter splicing of the associated exons. These exons are highly conserved during evolution and reside in genes with important cellular function. Furthermore, we identified a second class of exons whose splicing is likely modulated by RNA secondary structures that are recognized by the RNA editing machinery. The genome-wide analyses, supported by experimental validations, revealed remarkable interplay between RNA editing and splicing and expanded the repertoire of functional RNA editing sites. © 2018 Hsiao et al.; Published by Cold Spring Harbor Laboratory Press.
Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong
2015-01-01
Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555
Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong
2015-08-22
Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. © 2015 The Author(s).
Peng, Tao; Xue, Chenghai; Bi, Jianning; Li, Tingting; Wang, Xiaowo; Zhang, Xuegong; Li, Yanda
2008-04-26
Alternative splicing expands transcriptome diversity and plays an important role in regulation of gene expression. Previous studies focus on the regulation of a single cassette exon, but recent experiments indicate that multiple cassette exons within a gene may interact with each other. This interaction can increase the potential to generate various transcripts and adds an extra layer of complexity to gene regulation. Several cases of exon interaction have been discovered. However, the extent to which the cassette exons coordinate with each other remains unknown. Based on EST data, we employed a metric of correlation coefficients to describe the interaction between two adjacent cassette exons and then categorized these exon pairs into three different groups by their interaction (correlation) patterns. Sequence analysis demonstrates that strongly-correlated groups are more conserved and contain a higher proportion of pairs with reading frame preservation in a combinatorial manner. Multiple genome comparison further indicates that different groups of correlated pairs have different evolutionary courses: (1) The vast majority of positively-correlated pairs are old, (2) most of the weakly-correlated pairs are relatively young, and (3) negatively-correlated pairs are a mixture of old and young events. We performed a large-scale analysis of interactions between adjacent cassette exons. Compared with weakly-correlated pairs, the strongly-correlated pairs, including both the positively and negatively correlated ones, show more evidence that they are under delicate splicing control and tend to be functionally important. Additionally, the positively-correlated pairs bear strong resemblance to constitutive exons, which suggests that they may evolve from ancient constitutive exons, while negatively and weakly correlated pairs are more likely to contain newly emerging exons.
An UPF3-based nonsense-mediated decay in Paramecium.
Contreras, Julia; Begley, Victoria; Macias, Sandra; Villalobo, Eduardo
2014-12-01
Nonsense-mediated decay recognises mRNAs containing premature termination codons. One of its components, UPF3, is a molecular link bridging through its binding to the exon junction complex nonsense-mediated decay and splicing. In protists UPF3 has not been identified yet. We report that Paramecium tetraurelia bears an UPF3 gene and that it has a role in nonsense-mediated decay. Interestingly, the identified UPF3 has not conserved the essential amino acids required to bind the exon junction complex. Though, our data indicates that this ciliate bears genes coding for core proteins of the exon junction complex. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Bathige, S D N K; Thulasitha, William Shanthakumar; Umasuthan, Navaneethaiyer; Jayasinghe, J D H E; Wan, Qiang; Nam, Bo-Hye; Lee, Jehee
2017-04-01
Signal transducer and activator of transcription 3 (STAT3) is one of the crucial transcription factors in the Janus kinase (JAK)/STAT signaling pathway, and it was previously considered as acute phase response factor. A number of interleukins (ILs) such as IL-5, IL-6, IL-9, IL-10, IL-12, and IL-22 are known to be involved in activation of STAT3. In addition, various growth factors and pathogenic or oxidative stresses mediate the activation of a wide range of functions via STAT3. In this study, a STAT3 homolog was identified and functionally characterized from rock bream (RbSTAT3), Oplegnathus fasciatus. In silico characterization revealed that the RbSTAT3 amino acid sequence shares highly conserved common domain architectural features including N-terminal domain, coiled coil domain, DNA binding domain, linker domain, and Src homology 2 (SH2) domains. In addition, a fairly conserved transcriptional activation domain (TAD) was located at the C-terminus. Comparison of RbSTAT3 with other counterparts revealed higher identities (>90%) with fish orthologs. The genomic sequence of RbSTAT3 was obtained from a bacterial artificial chromosome (BAC) library, and was identified as a multi-exonic gene (24 exons), as found in other vertebrates. Genomic structural comparison and phylogenetic studies have showed that the evolutionary routes of teleostean and non-teleostean vertebrates were distinct. Quantitative real time PCR (qPCR) analysis revealed that the spatial distribution of RbSTAT3 mRNA expression was ubiquitous and highly detectable in blood, heart, and liver tissues. Transcriptional modulation of RbSTAT3 was examined in blood and liver tissues after challenges with bacteria (Edwardsiella tarda and Streptococcus iniae), rock bream irido virus (RBIV), and immune stimulants (LPS and poly (I:C)). Significant changes in RbSTAT3 transcription were also observed in response to tissue injury. In addition, the transcriptional up-regulation of RbSTAT3 was detected in rock bream heart cells upon recombinant rock bream IL-10 (rRbIL-10) treatment. Subcellular localization and nuclear translocation of rock bream STAT3 following poly (I:C) treatment were also demonstrated. Taken together, the results of the current study provide important evidence for potential roles of rock bream STAT3 in the immune system and wound healing processes. Copyright © 2017 Elsevier B.V. All rights reserved.
The emergence of place-based conservation [Chapter 1
Daniel R. Williams; William P. Stewart; Linda E. Kruger
2013-01-01
Place has emerged as a significant topic within conservation research and practice. The transformative changes connected to contemporary conservation are related to recognition of multi-scaled, social-ecological dynamics; emergent, multiscaled governance structures; and rising importance of place-specific meanings and local knowledge. These transformative changes are...
Piece2.0: an update for the pant gene structure comparison and evolution database
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparison and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron–exon organization and provides valuable insights into the evolution of gene structure in pl...
2010-01-01
β-tubulins are structural components of microtubules and the targets of benzimidazole fungicides used to control many diseases of agricultural importance. Intron polymorphisms in the intron-rich genes of these proteins have been used in phylogeographic investigations of phytopathogenic fungi. In this work, we sequenced 2764 nucleotides of the β-tubulin gene (Pp tubB) in samples of Phakopsora pachyrhizi collected from seven soybean fields in Brazil. Pp tubB contained an open reading frame of 1341 nucleotides, including nine exons and eight introns. Exon length varied from 14 to 880 nucleotides, whereas intron length varied from 76 to 102 nucleotides. The presence of only four polymorphic sites limited the usefulness of Pp tubB for phylogeographic studies in P. pachyrhizi. The gene structures of Pp tubB and orthologous β-tubulin genes of Melampsora lini and Uromyces viciae-fabae were highly conserved. The amino acid substitutions in β-tubulin proteins associated with the onset of benzimidazole resistance in model organisms, especially at His 6 , Glu 198 and Phe 200 , were absent from the predicted sequence of the P. pachyrhizi β-tubulin protein. PMID:21637494
Comparative Reannotation of 21 Aspergillus Genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Salamov, Asaf; Riley, Robert; Kuo, Alan
2013-03-08
We used comparative gene modeling to reannotate 21 Aspergillus genomes. Initial automatic annotation of individual genomes may contain some errors of different nature, e.g. missing genes, incorrect exon-intron structures, 'chimeras', which fuse 2 or more real genes or alternatively splitting some real genes into 2 or more models. The main premise behind the comparative modeling approach is that for closely related genomes most orthologous families have the same conserved gene structure. The algorithm maps all gene models predicted in each individual Aspergillus genome to the other genomes and, for each locus, selects from potentially many competing models, the one whichmore » most closely resembles the orthologous genes from other genomes. This procedure is iterated until no further change in gene models is observed. For Aspergillus genomes we predicted in total 4503 new gene models ( ~;;2percent per genome), supported by comparative analysis, additionally correcting ~;;18percent of old gene models. This resulted in a total of 4065 more genes with annotated PFAM domains (~;;3percent increase per genome). Analysis of a few genomes with EST/transcriptomics data shows that the new annotation sets also have a higher number of EST-supported splice sites at exon-intron boundaries.« less
Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei
2014-09-10
Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Genomic assessment of the evolution of the prion protein gene family in vertebrates.
Harrison, Paul M; Khachane, Amit; Kumar, Manish
2010-05-01
Prion diseases are devastating neurological disorders caused by the propagation of particles containing an alternative beta-sheet-rich form of the prion protein (PrP). Genes paralogous to PrP, called Doppel and Shadoo, have been identified, that also have neuropathological relevance. To aid in the further functional characterization of PrP and its relatives, we annotated completely the PrP gene family (PrP-GF), in the genomes of 42 vertebrates, through combined strategic application of gene prediction programs and advanced remote homology detection techniques (such as HMMs, PSI-TBLASTN and pGenThreader). We have uncovered several previously undescribed paralogous genes and pseudogenes. We find that current high-quality genomic evidence indicates that the PrP relative Doppel, was likely present in the last common ancestor of present-day Tetrapoda, but was lost in the bird lineage, since its divergence from reptiles. Using the new gene annotations, we have defined the consensus of structural features that are characteristic of the PrP and Doppel structures, across diverse Tetrapoda clades. Furthermore, we describe in detail a transcribed pseudogene derived from Shadoo that is conserved across primates, and that overlaps the meiosis gene, SYCE1, thus possibly regulating its expression. In addition, we analysed the locus of PRNP/PRND for significant conservation across the genomic DNA of eleven mammals, and determined the phylogenetic penetration of non-coding exons. The genomic evidence indicates that the second PRNP non-coding exon found in even-toed ungulates and rodents, is conserved in all high-coverage genome assemblies of primates (human, chimp, orang utan and macaque), and is, at least, likely to have fallen out of use during primate speciation. Furthermore, we have demonstrated that the PRNT gene (at the PRNP human locus) is conserved across at least sixteen mammals, and evolves like a long non-coding RNA, fashioned from fragments of ancient, long, interspersed elements. These annotations and evolutionary analyses will be of further use for functional characterisation of the PrP-GF, and will be updatable in a semi-automated fashion as more genomes accumulate. Copyright 2010 Elsevier Inc. All rights reserved.
Mateos, Jesús; Herranz, Raúl; Domingo, Alberto; Sparrow, John; Marco, Roberto
2006-01-01
In Drosophila melanogaster two high molecular weight tropomyosin isoforms, historically named heavy troponins (TnH-33 and TnH-34), are encoded by the Tm1 tropomyosin gene. They are specifically expressed in the indirect flight muscles (IFM). Their N-termini are conventional and complete tropomyosin sequences, but their C-termini consist of different IFM-specific domains that are rich in proline, alanine, glycine and glutamate. The evidence indicates that in Diptera these IFM-specific isoforms are conserved and are not troponins, but heavy tropomyosins (TmH). We report here that they are post-translationally modified by several phosphorylations in their C-termini in mature flies, but not in recently emerged flies that are incapable of flight. From stoichiometric measurements of thin filament proteins and interactions of the TmH isoforms with the standard Drosophila IFM tropomyosin isoform (protein 129), we propose that the TmH N-termini are integrated into the thin filament structural unit as tropomyosin dimers. The phosphorylated C-termini remain unlocated and may be important in IFM stretch-activation. Comparison of the Tm1 and Tm2 gene sequences shows a complete conservation of gene organisation in other Drosophilidae, such as Drosophila pseudoobscura, while in Anopheles gambiae only one exon encodes a single C-terminal domain, though overall gene organization is maintained. Interestingly, in Apis mellifera (hymenopteran), while most of the Tm1 and Tm2 gene features are conserved, the gene lacks any C-terminal exons. Instead these sequences are found at the 3' end of the troponin I gene. In this insect order, as in Lethocerus (hemipteran), the original designation of troponin H (TnH) should be retained. We discuss whether the insertion of the IFM-specific pro-ala-gly-glu-rich domain into the tropomyosin or troponin I genes in different insect orders may be related to proposals that the IFM stretch activation mechanism has evolved independently several times in higher insects.
Plasmodium vivax rhomboid-like protease 1 gene diversity in Thailand.
Mataradchakul, Touchchapol; Uthaipibull, Chairat; Nosten, Francois; Vega-Rodriguez, Joel; Jacobs-Lorena, Marcelo; Lek-Uthai, Usa
2017-10-01
Plasmodium vivax infection remains a major public health problem, especially along the Thailand border regions. We examined the genetic diversity of this parasite by analyzing single-nucleotide polymorphisms (SNPs) of the P. vivax rhomboid-like protease 1 gene (Pvrom1) in parasites collected from western (Tak province, Thai-Myanmar border) and eastern (Chanthaburi province, Thai-Cambodia border) regions. Data were collected by a cross-sectional survey, consisting of 47 and 45 P. vivax-infected filter paper-spotted blood samples from the western and eastern regions of Thailand, respectively during September 2013 to May 2014. Extracted DNA was examined for presence of P. vivax using Plasmodium species-specific nested PCR. Pvrom1 gene was PCR amplified, sequenced and the SNP diversity was analyzed using F-STAT, DnaSP, MEGA and LIAN programs. Comparison of sequences of the 92 Pvrom1 831-base open reading frames with that of a reference sequence (GenBank acc. no. XM001615211) revealed 17 samples with a total of 8 polymorphic sites, consisting of singleton (exon 3, nt 645) and parsimony informative (exon 1, nt 22 and 39; exon 3, nt 336, 537 and 656; and exon 4, nt 719 and 748) sites, which resulted in six different deduced Pvrom1 variants. Non-synonymous to synonymous substitutions ratio estimated by the DnaSP program was 1.65 indicating positive selection, but the Z-tests of selection showed no significant deviations from neutrality for Pvrom1 samples from western region of Thailand. In addition McDonald Kreitman test (MK) showed not significant, and Fst values are not different between the two regions and the regions combined. Interestingly, only Pvrom1 exon 2 was the most conserved sequences among the four exons. The relatively high degree of Pvrom1 polymorphism suggests that the protein is important for parasite survival in face of changes in both insect vector and human populations. These polymorphisms could serve as a sensitive marker for studying plasmodial genetic diversity. The significance of Pvrom1 conserved exon 2 sequence remains to be investigated. Copyright © 2017 Mahidol University. Published by Elsevier Inc. All rights reserved.
Evolution of Rubisco activase gene in plants.
Nagarajan, Ragupathi; Gill, Kulvinder S
2018-01-01
Rubisco activase of plants evolved in a stepwise manner without losing its function to adapt to the major evolutionary events including endosymbiosis and land colonization. Rubisco activase is an essential enzyme for photosynthesis, which removes inhibitory sugar phosphates from the active sites of Rubisco, a process necessary for Rubisco activation and carbon fixation. The gene probably evolved in cyanobacteria as different species differ for its presence. However, the gene is present in all other plant species. At least a single gene copy was maintained throughout plant evolution; but various genome and gene duplication events, which occurred during plant evolution, increased its copy number in some species. The exons and exon-intron junctions of present day higher plant's Rca, which is conserved in most species seem to have evolved in charophytes. A unique tandem duplication of Rca gene occurred in a common grass ancestor, and the two genes evolved differently for gene structure, sequence, and expression pattern. At the protein level, starting with a primitive form in cyanobacteria, RCA of chlorophytes evolved by integrating chloroplast transit peptide (cTP), and N-terminal domains to the ATPase, Rubisco recognition and C-terminal domains. The redox regulated C-terminal extension (CTE) and the associated alternate splicing mechanism, which splices the RCA-α and RCA-β isoforms were probably gained from another gene in charophytes, conserved in most species except the members of Solanaceae family.
Characterization and Expression of the Lucina pectinata Oxygen and Sulfide Binding Hemoglobin Genes
López-Garriga, Juan; Cadilla, Carmen L.
2016-01-01
The clam Lucina pectinata lives in sulfide-rich muds and houses intracellular symbiotic bacteria that need to be supplied with hydrogen sulfide and oxygen. This clam possesses three hemoglobins: hemoglobin I (HbI), a sulfide-reactive protein, and hemoglobin II (HbII) and III (HbIII), which are oxygen-reactive. We characterized the complete gene sequence and promoter regions for the oxygen reactive hemoglobins and the partial structure and promoters of the HbI gene from Lucina pectinata. We show that HbI has two mRNA variants, where the 5’end had either a sequence of 96 bp (long variant) or 37 bp (short variant). The gene structure of the oxygen reactive Hbs is defined by having 4-exons/3-introns with conservation of intron location at B12.2 and G7.0 and the presence of pre-coding introns, while the partial gene structure of HbI has the same intron conservation but appears to have a 5-exon/ 4-intron structure. A search for putative transcription factor binding sites (TFBSs) was done with the promoters for HbII, HbIII, HbI short and HbI long. The HbII, HbIII and HbI long promoters showed similar predicted TFBSs. We also characterized MITE-like elements in the HbI and HbII gene promoters and intronic regions that are similar to sequences found in other mollusk genomes. The gene expression levels of the clam Hbs, from sulfide-rich and sulfide-poor environments showed a significant decrease of expression in the symbiont-containing tissue for those clams in a sulfide-poor environment, suggesting that the sulfide concentration may be involved in the regulation of these proteins. Gene expression evaluation of the two HbI mRNA variants indicated that the longer variant is expressed at higher levels than the shorter variant in both environments. PMID:26824233
Choi, Ye-Na; Oh, Bong-Kyeong; Kawasaki, Ichiro; Oh, Wan-Suk; Lee, Yi; Paik, Young-Ki; Shim, Yhong-Hee
2010-02-28
The cdc25 gene, which is highly conserved in many eukaryotes, encodes a phosphatase that plays essential roles in cell cycle regulation. We identified a cdc25 ortholog in the pinewood nematode, Bursaphelenchus xylophilus. The B. xylophilus ortholog (Bx-cdc25) was found to be highly similar to Caenorhabditis elegans cdc-25.2 in sequence as well as in gene structure, both having long intron 1. The Bx-cdc25 gene was determined to be composed of seven exons and six introns in a 2,580 bp region, and was shown to encode 360 amino acids of a protein containing a highly-conserved phosphatase domain. Bx-cdc25 mRNA was hardly detectable throughout the juvenile stages but was highly expressed in eggs and in both female and male adults. Functional conservation during germline development between C. elegans cdc25 and Bx-cdc25 was revealed by Bx-cdc25 RNA interference in C. elegans.
Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A
2015-04-01
In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cui, Jianbo, E-mail: jianbocui@lsec.cc.ac.cn; Hong, Jialin, E-mail: hjl@lsec.cc.ac.cn; Liu, Zhihui, E-mail: liuzhihui@lsec.cc.ac.cn
We indicate that the nonlinear Schrödinger equation with white noise dispersion possesses stochastic symplectic and multi-symplectic structures. Based on these structures, we propose the stochastic symplectic and multi-symplectic methods, which preserve the continuous and discrete charge conservation laws, respectively. Moreover, we show that the proposed methods are convergent with temporal order one in probability. Numerical experiments are presented to verify our theoretical results.
On a new class of completely integrable nonlinear wave equations. II. Multi-Hamiltonian structure
NASA Astrophysics Data System (ADS)
Nutku, Y.
1987-11-01
The multi-Hamiltonian structure of a class of nonlinear wave equations governing the propagation of finite amplitude waves is discussed. Infinitely many conservation laws had earlier been obtained for these equations. Starting from a (primary) Hamiltonian formulation of these equations the necessary and sufficient conditions for the existence of bi-Hamiltonian structure are obtained and it is shown that the second Hamiltonian operator can be constructed solely through a knowledge of the first Hamiltonian function. The recursion operator which first appears at the level of bi-Hamiltonian structure gives rise to an infinite sequence of conserved Hamiltonians. It is found that in general there exist two different infinite sequences of conserved quantities for these equations. The recursion relation defining higher Hamiltonian structures enables one to obtain the necessary and sufficient conditions for the existence of the (k+1)st Hamiltonian operator which depends on the kth Hamiltonian function. The infinite sequence of conserved Hamiltonians are common to all the higher Hamiltonian structures. The equations of gas dynamics are discussed as an illustration of this formalism and it is shown that in general they admit tri-Hamiltonian structure with two distinct infinite sets of conserved quantities. The isothermal case of γ=1 is an exceptional one that requires separate treatment. This corresponds to a specialization of the equations governing the expansion of plasma into vacuum which will be shown to be equivalent to Poisson's equation in nonlinear acoustics.
O'Hara, William A; Azar, Walid J; Behringer, Richard R; Renfree, Marilyn B; Pask, Andrew J
2011-12-01
Desert hedgehog (DHH) belongs to the hedgehog gene family that act as secreted intercellular signal transducers. DHH is an essential morphogen for normal testicular development and function in both mice and humans but is not present in the avian lineage. Like other hedgehog proteins, DHH signals through the patched (PTCH) receptors 1 and 2. Here we examine the expression and protein distribution of DHH, PTCH1 and PTCH2 in the developing testes of a marsupial mammal (the tammar wallaby) to determine whether DHH signalling is a conserved factor in gonadal development in all therian mammals. DHH, PTCH1 and PTCH2 were present in the marsupial genome and highly conserved with their eutherian orthologues. Phylogenetic analyses indicate that DHH has recently evolved and is a mammal-specific hedgehog orthologue. The marsupial PTCH2 receptor had an additional exon (exon 21a) not annotated in eutherian PTCH2 proteins. Interestingly we found evidence of this exon in humans and show that its translation would result in a truncated protein with functions similar to PTCH1. We also show that DHH expression was not restricted to the testes during gonadal development (as in mice), but was also expressed in the developing ovary. Expression of DHH, PTCH1 and PTCH2 in the adult tammar testis and ovary was consistent with findings in the adult mouse. These data suggest that there is a highly conserved role for DHH signalling in the differentiation and function of the mammalian testis and that DHH may be necessary for marsupial ovarian development. The receptors PTCH1 and PTCH2 are highly conserved mediators of hedgehog signalling in both the developing and adult marsupial gonads. Together these findings indicate DHH is an essential therian mammal-specific morphogen in gonadal development and gametogenesis.
Greig syndrome: Analysis of the GL13 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Grzeschik, K.H.; Gessler, M.; Heid, C.
1994-09-01
Disruption of the zinc finger gene GL13 by translocation events has been implicated as the cause for cephalopolysyndactyly syndrome (GCPS) in several patients. To characterize this genomic region on human chromosome 7p13, we have isolated a YAC contig of more than 1000 kb including the GL13 gene. About 550 kb from this area were subdivided into a cosmid contig with a two- to ten-fold clone coverage. In this region the cloned GL13 cDNA appears to correspond to at least 14 exons spread over a distance of 280 kb. A CpG island defined by two NotI sites and several BssHII andmore » KspI sites is located in a genomic fragment covering the most proximal exon of the cloned GL13 cDNA. Further upstream, five segments conserved between man and mouse were found. In the mouse this region has been characterized as the transgene integration site resulting in the add phenotype. Both the CpG islands and the conserved regions are likely candidates to search for GL13 promoter and control elements. Intron-exon boundaries and breakpoints of the translocation events within the gene region of patients were identified and characterized.« less
Ni, Julie Z.; Grate, Leslie; Donohue, John Paul; Preston, Christine; Nobida, Naomi; O’Brien, Georgeann; Shiue, Lily; Clark, Tyson A.; Blume, John E.; Ares, Manuel
2007-01-01
Many alternative splicing events create RNAs with premature stop codons, suggesting that alternative splicing coupled with nonsense-mediated decay (AS-NMD) may regulate gene expression post-transcriptionally. We tested this idea in mice by blocking NMD and measuring changes in isoform representation using splicing-sensitive microarrays. We found a striking class of highly conserved stop codon-containing exons whose inclusion renders the transcript sensitive to NMD. A genomic search for additional examples identified >50 such exons in genes with a variety of functions. These exons are unusually frequent in genes that encode splicing activators and are unexpectedly enriched in the so-called “ultraconserved” elements in the mammalian lineage. Further analysis show that NMD of mRNAs for splicing activators such as SR proteins is triggered by splicing activation events, whereas NMD of the mRNAs for negatively acting hnRNP proteins is triggered by splicing repression, a polarity consistent with widespread homeostatic control of splicing regulator gene expression. We suggest that the extreme genomic conservation surrounding these regulatory splicing events within splicing factor genes demonstrates the evolutionary importance of maintaining tightly tuned homeostasis of RNA-binding protein levels in the vertebrate cell. PMID:17369403
Sim, Vivian X Y; Dafforn, Katherine A; Simpson, Stuart L; Kelaher, Brendan P; Johnston, Emma L
2015-01-01
Multi-use marine parks achieve conservation through spatial management of activities. Zoning of marine parks in New South Wales, Australia, includes high conservation areas and special purpose zones (SPZ) where maritime activities are concentrated. Although such measures geographically constrain anthropogenic impacts, we have limited understanding of potential ecological effects. We assessed sediment communities and contaminants adjacent to boating infrastructure (boat ramps, jetties and a marina) in a SPZ from the Clyde Estuary in Batemans Marine Park. Metal concentrations and fines content were elevated at boating structures compared to reference sites. Species richness was higher at sites with boating structures, where capitellid polychaetes and nematodes dominated the communities. Changes associated with boating structures were localised and did not extend beyond breakwalls or to reference sites outside the SPZ. The study highlights the benefits of appropriate zoning in a multi-use marine park and the potential to minimise stress on pristine areas through the application of spatial management.
Sim, Vivian X. Y.; Dafforn, Katherine A.; Simpson, Stuart L.; Kelaher, Brendan P.; Johnston, Emma L.
2015-01-01
Multi-use marine parks achieve conservation through spatial management of activities. Zoning of marine parks in New South Wales, Australia, includes high conservation areas and special purpose zones (SPZ) where maritime activities are concentrated. Although such measures geographically constrain anthropogenic impacts, we have limited understanding of potential ecological effects. We assessed sediment communities and contaminants adjacent to boating infrastructure (boat ramps, jetties and a marina) in a SPZ from the Clyde Estuary in Batemans Marine Park. Metal concentrations and fines content were elevated at boating structures compared to reference sites. Species richness was higher at sites with boating structures, where capitellid polychaetes and nematodes dominated the communities. Changes associated with boating structures were localised and did not extend beyond breakwalls or to reference sites outside the SPZ. The study highlights the benefits of appropriate zoning in a multi-use marine park and the potential to minimise stress on pristine areas through the application of spatial management. PMID:26086427
Hu, H M; Chuang, C K; Lee, M J; Tseng, T C; Tang, T K
2000-11-01
We previously reported two novel testis-specific serine/threonine kinases, Aie1 (mouse) and AIE2 (human), that share high amino acid identities with the kinase domains of fly aurora and yeast Ipl1. Here, we report the entire intron-exon organization of the Aie1 gene and analyze the expression patterns of Aie1 mRNA during testis development. The mouse Aie1 gene spans approximately 14 kb and contains seven exons. The sequences of the exon-intron boundaries of the Aie1 gene conform to the consensus sequences (GT/AG) of the splicing donor and acceptor sites of most eukaryotic genes. Comparative genomic sequencing revealed that the gene structure is highly conserved between mouse Aie1 and human AIE2. However, much less homology was found in the sequence outside the kinase-coding domains. The Aie1 locus was mapped to mouse chromosome 7A2-A3 by fluorescent in situ hybridization. Northern blot analysis indicates that Aie1 mRNA likely is expressed at a low level on day 14 and reaches its plateau on day 21 in the developing postnatal testis. RNA in situ hybridization indicated that the expression of the Aie1 transcript was restricted to meiotically active germ cells, with the highest levels detected in spermatocytes at the late pachytene stage. These findings suggest that Aie1 plays a role in spermatogenesis.
Structural features of diverse Pin-II proteinase inhibitor genes from Capsicum annuum.
Mahajan, Neha S; Dewangan, Veena; Lomate, Purushottam R; Joshi, Rakesh S; Mishra, Manasi; Gupta, Vidya S; Giri, Ashok P
2015-02-01
The proteinase inhibitor (PI) genes from Capsicum annuum were characterized with respect to their UTR, introns and promoter elements. The occurrence of PIs with circularly permuted domain organization was evident. Several potato inhibitor II (Pin-II) type proteinase inhibitor (PI) genes have been analyzed from Capsicum annuum (L.) with respect to their differential expression during plant defense response. However, complete gene characterization of any of these C. annuum PIs (CanPIs) has not been carried out so far. Complete gene architectures of a previously identified CanPI-7 (Beads-on-string, Type A) and a member of newly isolated Bracelet type B, CanPI-69 are reported in this study. The 5' UTR (untranslated region), 3'UTR, and intronic sequences of both the CanPI genes were obtained. The genomic sequence of CanPI-7 exhibited, exon 1 (49 base pair, bp) and exon 2 (740 bp) interrupted by a 294-bp long type I intron. We noted the occurrence of three multi-domain PIs (CanPI-69, 70, 71) with circularly permuted domain organization. CanPI-69 was found to possess exon 1 (49 bp), exon 2 (551 bp) and a 584-bp long type I intron. The upstream sequence analysis of CanPI-7 and CanPI-69 predicted various transcription factor-binding sites including TATA and CAAT boxes, hormone-responsive elements (ABRELATERD1, DOFCOREZM, ERELEE4), and a defense-responsive element (WRKY71OS). Binding of transcription factors such as zinc finger motif MADS-box and MYB to the promoter regions was confirmed using electrophoretic mobility shift assay followed by mass spectrometric identification. The 3' UTR analysis for 25 CanPI genes revealed unique/distinct 3' UTR sequence for each gene. Structures of three domain CanPIs of type A and B were predicted and further analyzed for their attributes. This investigation of CanPI gene architecture will enable the better understanding of the genetic elements present in CanPIs.
Regulation of alternative splicing at the single-cell level.
Faigenbloom, Lior; Rubinstein, Nimrod D; Kloog, Yoel; Mayrose, Itay; Pupko, Tal; Stein, Reuven
2015-12-28
Alternative splicing is a key cellular mechanism for generating distinct isoforms, whose relative abundances regulate critical cellular processes. It is therefore essential that inclusion levels of alternative exons be tightly regulated. However, how the precision of inclusion levels among individual cells is governed is poorly understood. Using single-cell gene expression, we show that the precision of inclusion levels of alternative exons is determined by the degree of evolutionary conservation at their flanking intronic regions. Moreover, the inclusion levels of alternative exons, as well as the expression levels of the transcripts harboring them, also contribute to this precision. We further show that alternative exons whose inclusion levels are considerably changed during stem cell differentiation are also subject to this regulation. Our results imply that alternative splicing is coordinately regulated to achieve accuracy in relative isoform abundances and that such accuracy may be important in determining cell fate. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.
Langin, D; Laurell, H; Holst, L S; Belfrage, P; Holm, C
1993-01-01
The human hormone-sensitive lipase (HSL) gene encodes a 786-aa polypeptide (85.5 kDa). It is composed of nine exons spanning approximately 11 kb, with exons 2-5 clustered in a 1.1-kb region. The putative catalytic site (Ser423) and a possible lipid-binding region in the C-terminal part are encoded by exons 6 and 9, respectively. Exon 8 encodes the phosphorylation site (Ser551) that controls cAMP-mediated activity and a second site (Ser553) that is phosphorylated by 5'-AMP-activated protein kinase. Human HSL showed 83% identity with the rat enzyme and contained a 12-aa deletion immediately upstream of the phosphorylation sites with an unknown effect on the activity control. Besides the catalytic site motif (Gly-Xaa-Ser-Xaa-Gly) found in most lipases, HSL shows no homology with other known lipases or proteins, except for a recently reported unexpected homology between the region surrounding its catalytic site and that of the lipase 2 of Moraxella TA144, an antarctic psychrotrophic bacterium. The gene of lipase 2, which catalyses lipolysis below 4 degrees C, was absent in the genomic DNA of five other Moraxella strains living at 37 degrees C. The lipase 2-like sequence in HSL may reflect an evolutionarily conserved cold adaptability that might be of critical survival value when low-temperature-mobilized endogenous lipids are the primary energy source (e.g., in poikilotherms or hibernators). The finding that HSL at 10 degrees C retained 3- to 5-fold more of its 37 degrees C catalytic activity than lipoprotein lipase or carboxyl ester lipase is consistent with this hypothesis. Images Fig. 5 PMID:8506334
Oxidative Stress Triggers Body-Wide Skipping of Multiple Exons of the Spinal Muscular Atrophy Gene
Seo, Joonbae; Singh, Natalia N.; Ottesen, Eric W.; Sivanesan, Senthilkumar; Shishimorova, Maria; Singh, Ravindra N.
2016-01-01
Humans carry two nearly identical copies of Survival Motor Neuron gene: SMN1 and SMN2. Loss of SMN1 leads to spinal muscular atrophy (SMA), the most frequent genetic cause of infant mortality. While SMN2 cannot compensate for the loss of SMN1 due to predominant skipping of exon 7, correction of SMN2 exon 7 splicing holds the promise of a cure for SMA. Previously, we used cell-based models coupled with a multi-exon-skipping detection assay (MESDA) to demonstrate the vulnerability of SMN2 exons to aberrant splicing under the conditions of oxidative stress (OS). Here we employ a transgenic mouse model and MESDA to examine the OS-induced splicing regulation of SMN2 exons. We induced OS using paraquat that is known to trigger production of reactive oxygen species and cause mitochondrial dysfunction. We show an overwhelming co-skipping of SMN2 exon 5 and exon 7 under OS in all tissues except testis. We also show that OS increases skipping of SMN2 exon 3 in all tissues except testis. We uncover several new SMN2 splice isoforms expressed at elevated levels under the conditions of OS. We analyze cis-elements and transacting factors to demonstrate the diversity of mechanisms for splicing misregulation under OS. Our results of proteome analysis reveal downregulation of hnRNP H as one of the potential consequences of OS in brain. Our findings suggest SMN2 as a sensor of OS with implications to SMA and other diseases impacted by low levels of SMN protein. PMID:27111068
Gotoh, Hiroki; Zinna, Robert A; Warren, Ian; DeNieu, Michael; Niimi, Teruyuki; Dworkin, Ian; Emlen, Douglas J; Miura, Toru; Lavine, Laura C
2016-03-22
Genes in the sex determination pathway are important regulators of sexually dimorphic animal traits, including the elaborate and exaggerated male ornaments and weapons of sexual selection. In this study, we identified and functionally analyzed members of the sex determination gene family in the golden metallic stag beetle Cyclommatus metallifer, which exhibits extreme differences in mandible size between males and females. We constructed a C. metallifer transcriptomic database from larval and prepupal developmental stages and tissues of both males and females. Using Roche 454 pyrosequencing, we generated a de novo assembled database from a total of 1,223,516 raw reads, which resulted in 14,565 isotigs (putative transcript isoforms) contained in 10,794 isogroups (putative identified genes). We queried this database for C. metallifer conserved sex determination genes and identified 14 candidate sex determination pathway genes. We then characterized the roles of several of these genes in development of extreme sexual dimorphic traits in this species. We performed molecular expression analyses with RT-PCR and functional analyses using RNAi on three C. metallifer candidate genes--Sex-lethal (CmSxl), transformer-2 (Cmtra2), and intersex (Cmix). No differences in expression pattern were found between the sexes for any of these three genes. In the RNAi gene-knockdown experiments, we found that only the Cmix had any effect on sexually dimorphic morphology, and these mimicked the effects of Cmdsx knockdown in females. Knockdown of CmSxl had no measurable effects on stag beetle phenotype, while knockdown of Cmtra2 resulted in complete lethality at the prepupal period. These results indicate that the roles of CmSxl and Cmtra2 in the sex determination cascade are likely to have diverged in stag beetles when compared to Drosophila. Our results also suggest that Cmix has a conserved role in this pathway. In addition to those three genes, we also performed a more complete functional analysis of the C. metallifer dsx gene (Cmdsx) to identify the isoforms that regulate dimorphism more fully using exon-specific RNAi. We identified a total of 16 alternative splice variants of the Cmdsx gene that code for up to 14 separate exons. Despite the variation in RNA splice products of the Cmdsx gene, only four protein isoforms are predicted. The results of our exon-specific RNAi indicated that the essential CmDsx isoform for postembryonic male differentiation is CmDsxB, whereas postembryonic female specific differentiation is mainly regulated by CmDsxD. Taken together, our results highlight the importance of studying the function of highly conserved sex determination pathways in numerous insect species, especially those with dramatic and exaggerated sexual dimorphism, because conservation in protein structure does not always translate into conservation in downstream function.
Structural and Functional Characterization of Ribosomal Protein Gene Introns in Sponges
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with “higher” metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales. PMID:22880015
Structural and functional characterization of ribosomal protein gene introns in sponges.
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.
Chuang, Tzu-Wei; Lee, Kuo-Ming; Lou, Yuan-Chao; Lu, Chia-Chen; Tarn, Woan-Yuh
2016-01-01
Eukaryotic mRNA biogenesis involves a series of interconnected steps mediated by RNA-binding proteins. The exon junction complex core protein Y14 is required for nonsense-mediated mRNA decay (NMD) and promotes translation. Moreover, Y14 binds the cap structure of mRNAs and inhibits the activity of the decapping enzyme Dcp2. In this report, we show that an evolutionarily conserved tryptophan residue (Trp-73) of Y14 is critical for its binding to the mRNA cap structure. A Trp-73 mutant (W73V) bound weakly to mRNAs and failed to protect them from degradation. However, this mutant could still interact with the NMD and mRNA degradation factors and retained partial NMD activity. In addition, we found that the W73V mutant could not interact with translation initiation factors. Overexpression of W73V suppressed reporter mRNA translation in vitro and in vivo and reduced the level of a set of nascent proteins. These results reveal a residue of Y14 that confers cap-binding activity and is essential for Y14-mediated enhancement of translation. Finally, we demonstrated that Y14 may selectively and differentially modulate protein biosynthesis. PMID:26887951
Price, M D; Lai, Z
1999-04-01
Competence for cell fate determination and cellular differentiation is under tight control of regulatory genes. Yan, a nuclear target of receptor tyrosine kinase (RTK) signaling, is an E twenty six (ETS) DNA-binding protein that functions as a negative regulator of cell differentiation and proliferation in Drosophila. Most members of RTK signaling pathways are highly conserved through evolution, yet no yan orthologues have been identified to date in vertebrates. To investigate the degree of yan conservation during evolution, we have characterized a yan homologue from a sibling species of D. melanogaster, D. virilis. Our results show that the organization, primary structure and expression pattern of yan are highly conserved. Both genes span over 20 kb and contain four exons with introns at identical positions. The areas with highest amino acid similarity include the Pointed and ETS domain but there are other discrete regions with a high degree of similarity. Phylogenetic analysis reveals that yan's closest relative is the human tel gene, a negative regulator of differentiation in hematopoetic precursors. In both species, Yan is dynamically expressed beginning as early as stage 4/5 and persisting throughout embryogenesis. In third instar larvae, Yan is expressed in and behind the morphogenetic furrow of the eye imaginal disc as well as in the laminar precursor cells of the brain. Ovarian follicle cells also contain Yan protein. Conservation of the structure and expression patterns of yan genes strongly suggests that regulatory mechanisms for their expression are also conserved in these two species.
Unusual splice site mutations disrupt FANCA exon 8 definition.
Mattioli, Chiara; Pianigiani, Giulia; De Rocco, Daniela; Bianco, Anna Monica Rosaria; Cappelli, Enrico; Savoia, Anna; Pagani, Franco
2014-07-01
The pathological role of mutations that affect not conserved splicing regulatory sequences can be difficult to determine. In a patient with Fanconi anemia, we identified two unpredictable splicing mutations that act on either sides of FANCA exon 8. In patients-derived cells and in minigene splicing assay, we showed that both an apparently benign intronic c.710-5T>C transition and the nonsense c.790C>T substitution induce almost complete exon 8 skipping. Site-directed mutagenesis experiments indicated that the c.710-5T>C transition affects a polypyrimidine tract where most of the thymidines cannot be compensated by cytidines. The c.790C>T mutation located in position -3 relative to the donor site induce exon 8 skipping in an NMD-independent manner and complementation experiments with modified U1 snRNAs showed that U1 snRNP is only partially involved in the splicing defect. Our results highlight the importance of performing splicing functional assay for correct identification of disease-causing mechanism of genomic variants and provide mechanistic insights on how these two FANCA mutations affect exon 8 definition. Copyright © 2014 Elsevier B.V. All rights reserved.
Gene structure and mutant alleles of PCDH15: nonsyndromic deafness DFNB23 and type 1 Usher syndrome.
Ahmed, Zubair M; Riazuddin, Saima; Aye, Sandar; Ali, Rana A; Venselaar, Hanka; Anwar, Saima; Belyantseva, Polina P; Qasim, Muhammad; Riazuddin, Sheikh; Friedman, Thomas B
2008-10-01
Mutations of PCDH15, encoding protocadherin 15, can cause either combined hearing and vision impairment (type 1 Usher syndrome; USH1F) or nonsyndromic deafness (DFNB23). Human PCDH15 is reported to be composed of 35 exons and encodes a variety of isoforms with 3-11 ectodomains (ECs), a transmembrane domain and a carboxy-terminal cytoplasmic domain (CD). Building on these observations, we describe an updated gene structure that has four additional exons of PCDH15 and isoforms that can be subdivided into four classes. Human PCDH15 encodes three alternative, evolutionarily conserved unique cytoplasmic domains (CD1, CD2 or CD3). Families ascertained on the basis of prelingual hearing loss were screened for linkage of this phenotype to markers for PCDH15 on chromosome 10q21.1. In seven of twelve families segregating USH1, we identified homozygous mutant alleles (one missense, one splice site, three nonsense and two deletion mutations) of which six are novel. One family was segregating nonsyndromic deafness DFNB23 due to a homozygous missense mutation. To date, in our cohort of 557 Pakistani families, we have found 11 different PCDH15 mutations that account for deafness in 13 families. Molecular modeling provided mechanistic insight into the phenotypic variation in severity of the PCDH15 missense mutations. We did not find pathogenic mutations in five of the twelve USH1 families linked to markers for USH1F, which suggest either the presence of mutations of yet additional undiscovered exons of PCDH15, mutations in the introns or regulatory elements of PCDH15, or an additional locus for type I USH at chromosome 10q21.1.
Gene structure and mutant alleles of PCDH15: nonsyndromic deafness DFNB23 and type 1 Usher syndrome
Ahmed, Zubair M.; Riazuddin, Saima; Aye, Sandar; Ali, Rana A.; Venselaar, Hanka; Anwar, Saima; Belyantseva, Polina P.; Qasim, Muhammad; Riazuddin, Sheikh; Friedman, Thomas B.
2009-01-01
Mutations of PCDH15, encoding protocadherin 15, can cause either combined hearing and vision impairment (type 1 Usher syndrome; USH1F) or nonsyndromic deafness (DFNB23). Human PCDH15 is reported to be comprised of 35 exons and encodes a variety of isoforms with 3 to 11 ectodomains (EC), a transmembrane domain and a carboxy-terminal cytoplasmic domain (CD). Building on these observations we describe an updated gene structure that has four additional exons of PCDH15 and isoforms that can be subdivided into four classes. Human PCDH15 encodes three alternative, evolutionarily conserved unique cytoplasmic domains (CD1, CD2 or CD3). Families ascertained on the basis of prelingual hearing loss were screened for linkage of this phenotype to markers for PCDH15 on chromosome 10q21.1. In seven of twelve families segregating USH1 we identified homozygous mutant alleles (1 missense, 1 splice site, 3 nonsense and 2 deletion mutations) of which six are novel. One family was segregating nonsyndromic deafness DFNB23 due to a homozygous missense mutation. To date in our cohort of 557 Pakistani families, we have found 11 different PCDH15 mutations that account for deafness in 13 families. Molecular modeling provided mechanistic insight into the phenotypic variation in severity of the PCDH15 missense mutations. We did not find pathogenic mutations in five of the twelve USH1 families linked to markers for USH1F, which suggest either the presence of mutations of yet additional undiscovered exons of PCDH15, mutations in the introns or regulatory elements of PCDH15, or an additional locus for type I USH at chromosome 10q21.1. PMID:18719945
Gonçalves, Catarina; Bastos, Margarida; Pignatelli, Duarte; Borges, Teresa; Aragüés, José M; Fonseca, Fernando; Pereira, Bernardo D; Socorro, Sílvia; Lemos, Manuel C
2015-11-01
To determine the prevalence of fibroblast growth factor receptor 1 (FGFR1) mutations and their predicted functional consequences in patients with idiopathic hypogonadotropic hypogonadism (IHH). Cross-sectional study. Multicentric. Fifty unrelated patients with IHH (21 with Kallmann syndrome and 29 with normosmic IHH). None. Patients were screened for mutations in FGFR1. The functional consequences of mutations were predicted by in silico structural and conservation analysis. Heterozygous FGFR1 mutations were identified in six (12%) kindreds. These consisted of frameshift mutations (p.Pro33-Alafs*17 and p.Tyr654*) and missense mutations in the signal peptide (p.Trp4Cys), in the D1 extracellular domain (p.Ser96Cys) and in the cytoplasmic tyrosine kinase domain (p.Met719Val). A missense mutation was identified in the alternatively spliced exon 8A (p.Ala353Thr) that exclusively affects the D3 extracellular domain of FGFR1 isoform IIIb. Structure-based and sequence-based prediction methods and the absence of these variants in 200 normal controls were all consistent with a critical role for the mutations in the activity of the receptor. Oligogenic inheritance (FGFR1/CHD7/PROKR2) was found in one patient. Two FGFR1 isoforms, IIIb and IIIc, result from alternative splicing of exons 8A and 8B, respectively. Loss-of-function of isoform IIIc is a cause of IHH, whereas isoform IIIb is thought to be redundant. Ours is the first report of normosmic IHH associated with a mutation in the alternatively spliced exon 8A and suggests that this disorder can be caused by defects in either of the two alternatively spliced FGFR1 isoforms. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Tissue-selective restriction of RNA editing of CaV1.3 by splicing factor SRSF9.
Huang, Hua; Kapeli, Katannya; Jin, Wenhao; Wong, Yuk Peng; Arumugam, Thiruma Valavan; Koh, Joanne Huifen; Srimasorn, Sumitra; Mallilankaraman, Karthik; Chua, John Jia En; Yeo, Gene W; Soong, Tuck Wah
2018-05-04
Adenosine DeAminases acting on RNA (ADAR) catalyzes adenosine-to-inosine (A-to-I) conversion within RNA duplex structures. While A-to-I editing is often dynamically regulated in a spatial-temporal manner, the mechanisms underlying its tissue-selective restriction remain elusive. We have previously reported that transcripts of voltage-gated calcium channel CaV1.3 are subject to brain-selective A-to-I RNA editing by ADAR2. Here, we show that editing of CaV1.3 mRNA is dependent on a 40 bp RNA duplex formed between exon 41 and an evolutionarily conserved editing site complementary sequence (ECS) located within the preceding intron. Heterologous expression of a mouse minigene that contained the ECS, intermediate intronic sequence and exon 41 with ADAR2 yielded robust editing. Interestingly, editing of CaV1.3 was potently inhibited by serine/arginine-rich splicing factor 9 (SRSF9). Mechanistically, the inhibitory effect of SRSF9 required direct RNA interaction. Selective down-regulation of SRSF9 in neurons provides a basis for the neuron-specific editing of CaV1.3 transcripts.
Monshausen, Michaela; Gehring, Niels H; Kosik, Kenneth S
2004-01-01
Members of the Staufen family of RNA-binding proteins are highly conserved cytoplasmic RNA transporters associated with RNA granules. staufen2 is specifically expressed in neurons where the delivery of RNA to dendrites is thought to have a role in plasticity. We found that Staufen2 interacts with the nuclear pore protein p62, with the RNA export protein Tap and with the exon-exon junction complex (EJC) proteins Y14-Mago. The interaction of Staufen2 with the Y14-Mago heterodimer seems to represent a highly conserved complex as the same proteins are involved in the Staufen-mediated localization of oskar mRNA in Drosophila oocytes. A pool of Staufen2 is present in neuronal nuclei and colocalizes to a large degree with p62 and partly with Tap, Y14, and Mago. We suggest a model whereby a set of conserved genes in the oskar mRNA export pathway may be recruited to direct a dendritic destination for mRNAs originating as a Staufen2 nuclear complex.
Roman-Padilla, J; Rodríguez-Rua, A; Claros, M G; Hachero-Cruzado, I; Manchado, M
2016-01-01
The apolipoprotein A-IV (ApoA-IV) plays a key role in lipid transport and feed intake regulation. In this work, four cDNA sequences encoding ApoA-IV paralogs were identified. Sequence analysis revealed conserved structural features including the common 33-codon block and nine repeated motifs. Gene structure analysis identified four exons and three introns except for apoA-IVAa1 (with only 3 exons). Synteny analysis showed that the four paralogs were structured into two clusters (cluster A containing apoA-IVAa1 and apoA-IVAa2 and cluster B with apoA-IVBa3 and apoA-IVBa4) linked to an apolipoprotein E. Phylogenetic analysis clearly separated the paralogs according to their cluster organization as well as revealed four subclades highly conserved in Acanthopterygii. Whole-mount analyses (WISH) in early larvae (0 and 1day post-hatch (dph)) showed that the four paralogs were mainly expressed in yolk syncytial layer surrounding the oil globules. Later, at 3 and 5dph, the four paralogs were mainly expressed in liver and intestine although with differences in their relative abundance and temporal expression patterns. Diet supply triggered the intensity of WISH signals in the intestine of the four paralogs. Quantification of mRNA abundance by qPCR using whole larvae only detected the induction by diet at 5dph. Moreover, transcript levels increased progressively with age except for apoA-IVAa2, which appeared as a low-expressed isoform. Expression analysis in juvenile tissues confirmed that the four paralogs were mainly expressed in liver and intestine and secondary in other tissues. The role of these ApoA-IV genes in lipid transport and the possible role of apoA-IVAa2 as a regulatory form are discussed. Copyright © 2015 Elsevier Inc. All rights reserved.
Nanoscale studies link amyloid maturity with polyglutamine diseases onset
NASA Astrophysics Data System (ADS)
Ruggeri, F. S.; Vieweg, S.; Cendrowska, U.; Longo, G.; Chiki, A.; Lashuel, H. A.; Dietler, G.
2016-08-01
The presence of expanded poly-glutamine (polyQ) repeats in proteins is directly linked to the pathogenesis of several neurodegenerative diseases, including Huntington’s disease. However, the molecular and structural basis underlying the increased toxicity of aggregates formed by proteins containing expanded polyQ repeats remain poorly understood, in part due to the size and morphological heterogeneity of the aggregates they form in vitro. To address this knowledge gap and technical limitations, we investigated the structural, mechanical and morphological properties of fibrillar aggregates at the single molecule and nanometer scale using the first exon of the Huntingtin protein as a model system (Exon1). Our findings demonstrate a direct correlation of the morphological and mechanical properties of Exon1 aggregates with their structural organization at the single aggregate and nanometric scale and provide novel insights into the molecular and structural basis of Huntingtin Exon1 aggregation and toxicity.
Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S
2010-10-07
PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out to dissect the PHB gene function. The conserved gene evolution indicated that the study in the model species can be translated to human and mammalian studies.
Species-Specific Exon Loss in Human Transcriptomes
Wang, Jinkai; Lu, Zhi-xiang; Tokheim, Collin J.; Miller, Sara E.; Xing, Yi
2015-01-01
Changes in exon–intron structures and splicing patterns represent an important mechanism for the evolution of gene functions and species-specific regulatory networks. Although exon creation is widespread during primate and human evolution and has been studied extensively, much less is known about the scope and potential impact of human-specific exon loss events. Historically, transcriptome data and exon annotations are significantly biased toward humans over nonhuman primates. This ascertainment bias makes it challenging to discover human-specific exon loss events. We carried out a transcriptome-wide search of human-specific exon loss events, by taking advantage of RNA sequencing (RNA-seq) as a powerful and unbiased tool for exon discovery and annotation. Using RNA-seq data of humans, chimpanzees, and other primates, we reconstructed and compared transcript structures across the primate phylogeny. We discovered 33 candidate human-specific exon loss events, among which six exons passed stringent experimental filters for the complete loss of splicing activities in diverse human tissues. These events may result from human-specific deletion of genomic DNA, or small-scale sequence changes that inactivated splicing signals. The impact of human-specific exon loss events is predominantly regulatory. Three of the six events occurred in the 5′ untranslated region (5′-UTR) and affected cis-regulatory elements of mRNA translation. In SLC7A6, a gene encoding an amino acid transporter, luciferase reporter assays suggested that both a human-specific exon loss event and an independent human-specific single nucleotide substitution in the 5′-UTR increased mRNA translational efficiency. Our study provides novel insights into the molecular mechanisms and evolutionary consequences of exon loss during human evolution. PMID:25398629
Structural Disorder Provides Increased Adaptability for Vesicle Trafficking Pathways
Tompa, Peter
2013-01-01
Vesicle trafficking systems play essential roles in the communication between the organelles of eukaryotic cells and also between cells and their environment. Endocytosis and the late secretory route are mediated by clathrin-coated vesicles, while the COat Protein I and II (COPI and COPII) routes stand for the bidirectional traffic between the ER and the Golgi apparatus. Despite similar fundamental organizations, the molecular machinery, functions, and evolutionary characteristics of the three systems are very different. In this work, we compiled the basic functional protein groups of the three main routes for human and yeast and analyzed them from the structural disorder perspective. We found similar overall disorder content in yeast and human proteins, confirming the well-conserved nature of these systems. Most functional groups contain highly disordered proteins, supporting the general importance of structural disorder in these routes, although some of them seem to heavily rely on disorder, while others do not. Interestingly, the clathrin system is significantly more disordered (∼23%) than the other two, COPI (∼9%) and COPII (∼8%). We show that this structural phenomenon enhances the inherent plasticity and increased evolutionary adaptability of the clathrin system, which distinguishes it from the other two routes. Since multi-functionality (moonlighting) is indicative of both plasticity and adaptability, we studied its prevalence in vesicle trafficking proteins and correlated it with structural disorder. Clathrin adaptors have the highest capability for moonlighting while also comprising the most highly disordered members. The ability to acquire tissue specific functions was also used to approach adaptability: clathrin route genes have the most tissue specific exons encoding for protein segments enriched in structural disorder and interaction sites. Overall, our results confirm the general importance of structural disorder in vesicle trafficking and suggest major roles for this structural property in shaping the differences of evolutionary adaptability in the three routes. PMID:23874186
High-resolution phylogenic microbial community profiling
USDA-ARS?s Scientific Manuscript database
PIECE (Plant Intron Exon Comparison and Evolution) is a web-accessible database that houses intron and exon information of plant genes. PIECE serves as a resource for biologists interested in comparing intron–exon organization and provides valuable insights into the evolution of gene structure in pl...
Diversification of the insulin-like growth factor 1 gene in mammals.
Rotwein, Peter
2017-01-01
Insulin-like growth factor 1 (IGF1), a small, secreted peptide growth factor, is involved in a variety of physiological and patho-physiological processes, including somatic growth, tissue repair, and metabolism of carbohydrates, proteins, and lipids. IGF1 gene expression appears to be controlled by several different signaling cascades in the few species in which it has been evaluated, with growth hormone playing a major role by activating a pathway involving the Stat5b transcription factor. Here, genes encoding IGF1 have been evaluated in 25 different mammalian species representing 15 different orders and ranging over ~180 million years of evolutionary diversification. Parts of the IGF1 gene have been fairly well conserved. Like rat Igf1 and human IGF1, 21 of 23 other genes are composed of 6 exons and 5 introns, and all 23 also contain recognizable tandem promoters, each with a unique leader exon. Exon and intron lengths are similar in most species, and DNA sequence conservation is moderately high in orthologous exons and proximal promoter regions. In contrast, putative growth hormone-activated Stat5b-binding enhancers found in analogous locations in rodent Igf1 and in human IGF1 loci, have undergone substantial variation in other mammals, and a processed retro-transposed IGF1 pseudogene is found in the sloth locus, but not in other mammalian genomes. Taken together, the fairly high level of organizational and nucleotide sequence similarity in the IGF1 gene among these 25 species supports the contention that some common regulatory pathways had existed prior to the beginning of mammalian speciation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo
Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding tomore » the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.« less
[Research on non-rigid registration of multi-modal medical image based on Demons algorithm].
Hao, Peibo; Chen, Zhen; Jiang, Shaofeng; Wang, Yang
2014-02-01
Non-rigid medical image registration is a popular subject in the research areas of the medical image and has an important clinical value. In this paper we put forward an improved algorithm of Demons, together with the conservation of gray model and local structure tensor conservation model, to construct a new energy function processing multi-modal registration problem. We then applied the L-BFGS algorithm to optimize the energy function and solve complex three-dimensional data optimization problem. And finally we used the multi-scale hierarchical refinement ideas to solve large deformation registration. The experimental results showed that the proposed algorithm for large de formation and multi-modal three-dimensional medical image registration had good effects.
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Genomic organization of plant aminopropyl transferases.
Rodríguez-Kessler, Margarita; Delgado-Sánchez, Pablo; Rodríguez-Kessler, Gabriela Theresia; Moriguchi, Takaya; Jiménez-Bremont, Juan Francisco
2010-07-01
Aminopropyl transferases like spermidine synthase (SPDS; EC 2.5.1.16), spermine synthase and thermospermine synthase (SPMS, tSPMS; EC 2.5.1.22) belong to a class of widely distributed enzymes that use decarboxylated S-adenosylmethionine as an aminopropyl donor and putrescine or spermidine as an amino acceptor to form in that order spermidine, spermine or thermospermine. We describe the analysis of plant genomic sequences encoding SPDS, SPMS, tSPMS and PMT (putrescine N-methyltransferase; EC 2.1.1.53). Genome organization (including exon size, gain and loss, as well as intron number, size, loss, retention, placement and phase, and the presence of transposons) of plant aminopropyl transferase genes were compared between the genomic sequences of SPDS, SPMS and tSPMS from Zea mays, Oryza sativa, Malus x domestica, Populus trichocarpa, Arabidopsis thaliana and Physcomitrella patens. In addition, the genomic organization of plant PMT genes, proposed to be derived from SPDS during the evolution of alkaloid metabolism, is illustrated. Herein, a particular conservation and arrangement of exon and intron sequences between plant SPDS, SPMS and PMT genes that clearly differs with that of ACL5 genes, is shown. The possible acquisition of the plant SPMS exon II and, in particular exon XI in the monocot SPMS genes, is a remarkable feature that allows their differentiation from SPDS genes. In accordance with our in silico analysis, functional complementation experiments of the maize ZmSPMS1 enzyme (previously considered to be SPDS) in yeast demonstrated its spermine synthase activity. Another significant aspect is the conservation of intron sequences among SPDS and PMT paralogs. In addition the existence of microsynteny among some SPDS paralogs, especially in P. trichocarpa and A. thaliana, supports duplication events of plant SPDS genes. Based in our analysis, we hypothesize that SPMS genes appeared with the divergence of vascular plants by a processes of gene duplication and the acquisition of unique exons of as-yet unknown origin. 2010 Elsevier Masson SAS. All rights reserved.
Toyoda, N; Kleinhaus, N; Larsen, P R
1996-06-01
We analyzed the exon-intron structure of the human type 1 deiodinase gene (dio1) and compared it with that of a patient with suspected congenital type 1 deiodinase (D1) deficiency. The hdio1 gene is identical in exon-intron arrangement to the mouse gene, with coding sequences and a selenocysteine insertion sequence (SECIS) element contained in four exons. There were no mutations in the sequences of exons 1-4 of the patient's genomic DNA. Functional studies by transient expression techniques showed no difference in basal promoter activity or T3 responsiveness between the patient's and the normal dio1 gene. A structural abnormality in the dio1 gene is not a likely explanation for this patient's D1-deficient phenotype.
Moses, Shimon W; Parvari, Ruti
2002-03-01
Glycogen storage disease type IV (GSD-IV), also known as Andersen disease or amylopectinosis (MIM 23250), is a rare autosomal recessive disorder caused by a deficiency of glycogen branching enzyme (GBE) leading to the accumulation of amylopectin-like structures in affected tissues. The disease is extremely heterogeneous in terms of tissue involvement, age of onset and clinical manifestations. The human GBE cDNA is approximately 3-kb in length and encodes a 702-amino acid protein. The GBE amino acid sequence shows a high degree of conservation throughout species. The human GBE gene is located on chromosome 3p14 and consists of 16 exons spanning at least 118 kb of chromosomal DNA. Clinically the classic Andersen disease is a rapidly progressive disorder leading to terminal liver failure unless liver transplantation is performed. Several mutations have been reported in the GBE gene in patients with classic phenotype. Mutations in the GBE gene have also been identified in patients with the milder non-progressive hepatic form of the disease. Several other variants of GSD-IV have been reported: a variant with multi-system involvement including skeletal and cardiac muscle, nerve and liver; a juvenile polysaccharidosis with multi-system involvement but normal GBE activity; and the fatal neonatal neuromuscular form associated with a splice site mutation in the GBE gene. Other presentations include cardiomyopathy, arthrogryposis and even hydrops fetalis. Polyglucosan body disease, characterized by widespread upper and lower motor neuron lesions, can present with or without GBE deficiency indicating that different biochemical defects could result in an identical phenotype. It is evident that this disease exists in multiple forms with enzymatic and molecular heterogeneity unparalleled in the other types of glycogen storage diseases.
Shen, Manli; Bellaousov, Stanislav; Hiller, Michael; de La Grange, Pierre; Creamer, Trevor P.; Malina, Orit; Sperling, Ruth; Mathews, David H.; Stoilov, Peter; Stamm, Stefan
2013-01-01
The serotonin receptor 2C plays a central role in mood and appetite control. It undergoes pre-mRNA editing as well as alternative splicing. The RNA editing suggests that the pre-mRNA forms a stable secondary structure in vivo. To identify substances that promote alternative exons inclusion, we set up a high-throughput screen and identified pyrvinium pamoate as a drug-promoting exon inclusion without editing. Circular dichroism spectroscopy indicates that pyrvinium pamoate binds directly to the pre-mRNA and changes its structure. SHAPE (selective 2′-hydroxyl acylation analysed by primer extension) assays show that part of the regulated 5′-splice site forms intramolecular base pairs that are removed by this structural change, which likely allows splice site recognition and exon inclusion. Genome-wide analyses show that pyrvinium pamoate regulates >300 alternative exons that form secondary structures enriched in A–U base pairs. Our data demonstrate that alternative splicing of structured pre-mRNAs can be regulated by small molecules that directly bind to the RNA, which is reminiscent to an RNA riboswitch. PMID:23393189
Bennici, Carmelo; Biondo, Girolama; Di Natale, Marilena; Masullo, Tiziana; Monastero, Calogera; Ragusa, Maria Antonietta; Tagliavia, Marcello; Cuttitta, Angela
2018-01-01
Gene family encoding translationally controlled tumour protein (TCTP) is defined as highly conserved among organisms; however, there is limited knowledge of non-bilateria. In this study, the first TCTP homologue from anthozoan was characterised in the Mediterranean Sea anemone, Anemonia viridis. The release of the genome sequence of Acropora digitifera, Exaiptasia pallida, Nematostella vectensis and Hydra vulgaris enabled a comprehensive study of the molecular evolution of TCTP family among cnidarians. A comparison among TCTP members from Cnidaria and Bilateria showed conserved intron exon organization, evolutionary conserved TCTP signatures and 3D protein structure. The pattern of mRNA expression profile was also defined in A. viridis. These analyses revealed a constitutive mRNA expression especially in tissues with active proliferation. Additionally, the transcriptional profile of A. viridis TCTP (AvTCTP) after challenges with different abiotic/biotic stresses showed induction by extreme temperatures, heavy metals exposure and immune stimulation. These results suggest the involvement of AvTCTP in the sea anemone defensome taking part in environmental stress and immune responses. PMID:29324689
Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar
2014-12-01
Lysophosphatidyl acyltransferase (LPAT) is one of the major triacylglycerol synthesis enzymes, controlling the metabolic flow of lysophosphatidic acid to phosphatidic acid. Experimental studies in Arabidopsis have shown that LPAT activity is exhibited primarily by three distinct isoforms, namely the plastid-located LPAT1, the endoplasmic reticulum-located LPAT2, and the soluble isoform of LPAT (solLPAT). In this study, 24 putative genes representing all LPAT isoforms were identified from the analysis of 11 complete genomes including green algae, red algae, diatoms and higher plants. We observed LPAT1 and solLPAT genes to be ubiquitously present in nearly all genomes examined, whereas LPAT2 genes to have evolved more recently in the plant lineage. Phylogenetic analysis indicated that LPAT1, LPAT2 and solLPAT have convergently evolved through separate evolutionary paths and belong to three different gene families, which was further evidenced by their wide divergence at gene structure and sequence level. The genome distribution supports the hypothesis that each gene encoding a LPAT is not duplicated. Mapping of exon-intron structure of LPAT genes to the domain structure of proteins across different algal and plant species indicates that exon shuffling plays no role in the evolution of LPAT genes. Besides the previously defined motifs, several conserved consensus sequences were discovered which could be useful to distinguish different LPAT isoforms. Taken together, this study will enable the generation of experimental approximations to better understand the functional role of algal LPAT in lipid accumulation.
Zheng, Hui; Shao, Chong; Zheng, Yan; He, Jin-Wei; Fu, Wen-Zhen; Wang, Chun; Zhang, Zhen-Lin
2016-07-01
Autosomal dominant osteopetrosis type II (ADO-II) is a heritable bone disorder characterized by osteosclerosis, predominantly involving the spine (vertebral end-plate thickening, or rugger-jersey spine), the pelvis ("bone-within-bone" structures) and the skull base. Chloride channel 7 (CLCN7) has been reported to be the causative gene. In this study, we aimed to identify the pathogenic mutation in four Chinese families with ADO-II. All 25 exons of the CLCN7 gene, including the exon-intron boundaries, were amplified and sequenced directly in four probands from the Chinese families with ADO-II. The mutation site was then identified in other family members and 250 healthy controls. In family 1, a known missense mutation c.296A>G in exon 4 of CLCN7 was identified in the proband, resulting in a tyrosine (UAU) to cysteine (UGU) substitution at p.99 (Y99C); the mutation was also identified in his affected father. In family 2, a novel missense mutation c.865G>C in exon 10 was identified in the proband, resulting in a valine (GUC) to leucine (CUC) substitution at p.289 (V289L); the mutation was also identified in her healthy mother and sister. In family 3, a novel missense mutation c.1625C>T in exon 17 of CLCN7 was identified in the proband, resulting in an alanine (GCG) to valine (GUG) substitution at p.542 (A542V); the mutation was also identified in her father. In family 4, a hot spot, R767W (c.2299C>T, CGG>TGG), in exon 24 was found in the proband which once again proved the susceptibility of the site or the similar genetic background in different races. Moreover, two novel mutations, V289L and A542V, occurred at a highly conserved position, found by a comparison of the protein sequences from eight vertebrates, and were predicted to have a pathogenic effect by PolyPhen-2 software, which showed "probably damaging" with a score of approximately 1. These mutation sites were not identified in 250 healthy controls. Our present findings suggest that the novel missense mutations V289L and A542V in the CLCN7 gene were responsible for ADO-II in the two Chinese families.
Sasaki-Haraguchi, Noriko; Ikuyama, Takeshi; Yoshii, Shogo; Takeuchi-Andoh, Tomoko; Frendewey, David; Tani, Tokio
2015-01-01
Exons are ligated in an ordered manner without the skipping of exons in the constitutive splicing of pre-mRNAs with multiple introns. To identify factors ensuring ordered exon joining in constitutive pre-mRNA splicing, we previously screened for exon skipping mutants in Schizosaccharomyces pombe using a reporter plasmid, and characterized three exon skipping mutants named ods1 (ordered splicing 1), ods2, and ods3, the responsible genes of which encode Prp2/U2AF59, U2AF23, and SF1, respectively. They form an SF1-U2AF59-U2AF23 complex involved in recognition of the branch and 3′ splice sites in pre-mRNA. In the present study, we identified a fourth ods mutant, ods4, which was isolated in an exon-skipping screen. The ods4 + gene encodes Cwf16p, which interacts with the NineTeen Complex (NTC), a complex thought to be involved in the first catalytic step of the splicing reaction. We isolated two multi-copy suppressors for the ods4-1 mutation, Srp2p, an SR protein essential for pre-mRNA splicing, and Tif213p, a translation initiation factor, in S. pombe. The overexpression of Srp2p suppressed the exon-skipping phenotype of all ods mutants, whereas Tif213p suppressed only ods4-1, which has a mutation in the translational start codon of the cwf16 gene. We also showed that the decrease in the transcriptional elongation rate induced by drug treatment suppressed exon skipping in ods4-1. We propose that Cwf16p/NTC participates in the early recognition of the branch and 3′ splice sites and cooperates with the SF1-U2AF59-U2AF23 complex to maintain ordered exon joining. PMID:26302002
Criscitiello, Michael F; Ohta, Yuko; Graham, Matthew D; Eubanks, Jeannine O; Chen, Patricia L; Flajnik, Martin F
2012-03-01
The invariant chain (Ii) is the critical third chain required for the MHC class II heterodimer to be properly guided through the cell, loaded with peptide, and expressed on the surface of antigen presenting cells. Here, we report the isolation of the nurse shark Ii gene, and the comparative analysis of Ii splice variants, expression, genomic organization, predicted structure, and function throughout vertebrate evolution. Alternative splicing to yield Ii with and without the putative protease-protective, thyroglobulin-like domain is as ancient as the MHC-based adaptive immune system, as our analyses in shark and lizard further show conservation of this mechanism in all vertebrate classes except bony fish. Remarkable coordinate expression of Ii and class II was found in shark tissues. Conserved Ii residues and cathepsin L orthologs suggest their long co-evolution in the antigen presentation pathway, and genomic analyses suggest 450 million years of conserved Ii exon/intron structure. Other than an extended linker preceding the thyroglobulin-like domain in cartilaginous fish, the Ii gene and protein are predicted to have largely similar physiology from shark to man. Duplicated Ii genes found only in teleosts appear to have become sub-functionalized, as one form is predicted to play the same role as that mediated by Ii mRNA alternative splicing in all other vertebrate classes. No Ii homologs or potential ancestors of any of the functional Ii domains were found in the jawless fish or lower chordates. Copyright © 2011 Elsevier Ltd. All rights reserved.
Nadjar-Boger, Elisabeth; Maccatrozzo, Lisa; Radaelli, Giuseppe; Funkenstein, Bruria
2013-02-01
Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily, known as a negative regulator of skeletal muscle development and growth in mammals. In contrast to mammals, fish possess at least two paralogs of MSTN: MSTN-1 and MSTN-2. Here we describe the cloning and sequence analysis of spliced and precursor (unspliced) transcripts as well as the 5' flanking region of MSTN-2 from the marine fish Umbrina cirrosa (ucMSTN-2). In silico analysis revealed numerous putative cis regulatory elements including several E-boxes known as binding sites to myogenic transcription factors. Transient transfection experiments using non-muscle and muscle cell lines showed high transcriptional activity in muscle cells and in differentiated neural cells, in accordance with our previous findings in MSTN-2 promoter from Sparus aurata. Comparative informatics analysis of MSTN-2 from several fish species revealed high conservation of the predicted amino acid sequence as well as the gene structure (exon length) although intron length varied between species. The proximal promoter of MSTN-2 gene was found to be conserved among Perciforms. In conclusion, this study reinforces our conclusion that MSTN-2 promoter is a very strong promoter, especially in muscle cells. In addition, we show that the MSTN-2 gene structure is highly conserved among fishes as is the predicted amino acid sequence of the peptide. Copyright © 2012 Elsevier Inc. All rights reserved.
Arman, Ahmet; Ozon, Alev; Isguven, Pinar S; Coker, Ajda; Peker, Ismail; Yordam, Nursen
2008-01-01
Growth hormone (GH) is involved in growth, and fat and carbohydrate metabolism. Interaction of GH with the GH receptor (GHR) is necessary for systemic and local production of insulin-like growth factor-I (IGF-I) which mediates GH actions. Mutations in the GHR cause severe postnatal growth failure; the disorder is an autosomal recessive genetic disease resulting in GH insensitivity, called Laron syndrome. It is characterized by dwarfism with elevated serum GH and low levels of IGF-I. We analyzed the GHR gene for mutations and polymorphisms in eight patients with Laron-type dwarfism from six families. We found three missense mutations (S40L, V125A, I526L), one nonsense mutation (W157X), and one splice site mutation in the extracellular domain of GHR. Furthermore, G168G and exon 3 deletion polymorphisms were detected in patients with Laron syndrome. The splice site mutation, which is a novel mutation, was located at the donor splice site of exon 2/ intron 2 within GHR. Although this mutation changed the highly conserved donor splice site consensus sequence GT to GGT by insertion of a G residue, the intron splicing between exon 2 and exon 3 was detected in the patient. These results imply that the splicing occurs arthe GT site in intron 2, leaving the extra inserted G residue at the end of exon 2, thus changing the open reading frame of GHR resulting in a premature termination codon in exon 3.
Characterization and mapping of the mouse NDP (Norrie disease) locus (Ndp).
Battinelli, E M; Boyd, Y; Craig, I W; Breakefield, X O; Chen, Z Y
1996-02-01
Norrie disease is a severe X-linked recessive neurological disorder characterized by congenital blindness with progressive loss of hearing. Over half of Norrie patients also manifest different degrees of mental retardation. The gene for Norrie disease (NDP) has recently been cloned and characterized. With the human NDP cDNA, mouse genomic phage libraries were screened for the homolog of the gene. Comparison between mouse and human genomic DNA blots hybridized with the NDP cDNA, as well as analysis of phage clones, shows that the mouse NDP gene is 29 kb in size (28 kb for the human gene). The organization in the two species is very similar. Both have three exons with similar-sized introns and identical exon-intron boundaries between exon 2 and 3. The mouse open reading frame is 393 bp and, like the human coding sequence, is encoded in exons 2 and 3. The absence of six nucleotides in the second mouse exon results in the encoded protein being two amino acids smaller than its human counterpart. The overall homology between the human and mouse NDP protein is 95% and is particularly high (99%) in exon 3, consistent with the apparent functional importance of this region. Analysis of transcription initiation sites suggests the presence of multiple start sites associated with expression of the mouse NDP gene. Pedigree analysis of an interspecific mouse backcross localizes the mouse NDP gene close to Maoa in the conserved segment, which runs from CYBB to PFC in both human and mouse.
Balu, Rajkamal; Knott, Robert; Cowieson, Nathan P.; Elvin, Christopher M.; Hill, Anita J.; Choudhury, Namita R.; Dutta, Naba K.
2015-01-01
Rec1-resilin is the first recombinant resilin-mimetic protein polymer, synthesized from exon-1 of the Drosophila melanogaster gene CG15920 that has demonstrated unusual multi-stimuli responsiveness in aqueous solution. Crosslinked hydrogels of Rec1-resilin have also displayed remarkable mechanical properties including near-perfect rubber-like elasticity. The structural basis of these extraordinary properties is not clearly understood. Here we combine a computational and experimental investigation to examine structural ensembles of Rec1-resilin in aqueous solution. The structure of Rec1-resilin in aqueous solutions is investigated experimentally using circular dichroism (CD) spectroscopy and small angle X-ray scattering (SAXS). Both bench-top and synchrotron SAXS are employed to extract structural data sets of Rec1-resilin and to confirm their validity. Computational approaches have been applied to these experimental data sets in order to extract quantitative information about structural ensembles including radius of gyration, pair-distance distribution function, and the fractal dimension. The present work confirms that Rec1-resilin is an intrinsically disordered protein (IDP) that displays equilibrium structural qualities between those of a structured globular protein and a denatured protein. The ensemble optimization method (EOM) analysis reveals a single conformational population with partial compactness. This work provides new insight into the structural ensembles of Rec1-resilin in solution. PMID:26042819
Balu, Rajkamal; Knott, Robert; Cowieson, Nathan P; Elvin, Christopher M; Hill, Anita J; Choudhury, Namita R; Dutta, Naba K
2015-06-04
Rec1-resilin is the first recombinant resilin-mimetic protein polymer, synthesized from exon-1 of the Drosophila melanogaster gene CG15920 that has demonstrated unusual multi-stimuli responsiveness in aqueous solution. Crosslinked hydrogels of Rec1-resilin have also displayed remarkable mechanical properties including near-perfect rubber-like elasticity. The structural basis of these extraordinary properties is not clearly understood. Here we combine a computational and experimental investigation to examine structural ensembles of Rec1-resilin in aqueous solution. The structure of Rec1-resilin in aqueous solutions is investigated experimentally using circular dichroism (CD) spectroscopy and small angle X-ray scattering (SAXS). Both bench-top and synchrotron SAXS are employed to extract structural data sets of Rec1-resilin and to confirm their validity. Computational approaches have been applied to these experimental data sets in order to extract quantitative information about structural ensembles including radius of gyration, pair-distance distribution function, and the fractal dimension. The present work confirms that Rec1-resilin is an intrinsically disordered protein (IDP) that displays equilibrium structural qualities between those of a structured globular protein and a denatured protein. The ensemble optimization method (EOM) analysis reveals a single conformational population with partial compactness. This work provides new insight into the structural ensembles of Rec1-resilin in solution.
NASA Astrophysics Data System (ADS)
Balu, Rajkamal; Knott, Robert; Cowieson, Nathan P.; Elvin, Christopher M.; Hill, Anita J.; Choudhury, Namita R.; Dutta, Naba K.
2015-06-01
Rec1-resilin is the first recombinant resilin-mimetic protein polymer, synthesized from exon-1 of the Drosophila melanogaster gene CG15920 that has demonstrated unusual multi-stimuli responsiveness in aqueous solution. Crosslinked hydrogels of Rec1-resilin have also displayed remarkable mechanical properties including near-perfect rubber-like elasticity. The structural basis of these extraordinary properties is not clearly understood. Here we combine a computational and experimental investigation to examine structural ensembles of Rec1-resilin in aqueous solution. The structure of Rec1-resilin in aqueous solutions is investigated experimentally using circular dichroism (CD) spectroscopy and small angle X-ray scattering (SAXS). Both bench-top and synchrotron SAXS are employed to extract structural data sets of Rec1-resilin and to confirm their validity. Computational approaches have been applied to these experimental data sets in order to extract quantitative information about structural ensembles including radius of gyration, pair-distance distribution function, and the fractal dimension. The present work confirms that Rec1-resilin is an intrinsically disordered protein (IDP) that displays equilibrium structural qualities between those of a structured globular protein and a denatured protein. The ensemble optimization method (EOM) analysis reveals a single conformational population with partial compactness. This work provides new insight into the structural ensembles of Rec1-resilin in solution.
Smith, J L; Wells, J D
2017-03-01
Being able to efficiently differentiate between male and female individuals in the immature forms of insects allows for investigations into sexually dimorphic patterns of growth rates and gene expression. For species lacking sex-specific morphological characteristics during these periods, alternative methods must be devised. Commonly, isolation of sex determination genes reveals sex-specific band patterns and allows for markers that can be used in insect control. For blow flies, a family that includes flies of medical and forensic importance, sex has previously been identified in some members using the male-specific exon in the transformer gene. This gene is relatively conserved between members of the genera Cochliomyia and Lucilia (Diptera: Calliphoridae), and we isolated a portion of this gene in an additional forensically and medically important blow fly genus using the widespread Chrysomya megacephala (F.). We found a relatively high level of conservation between exons 1 and 2 of transformer and were able to amplify a region containing the male-specific exon in C. megacephala. A sex-specific molecular diagnostic test based on the presence of sexually dimorphic PCR product bands showed the expected genotype for adults and intrapuparial period specimens of known sex. The same result could be obtained from single third-instar larval specimens, opening up the possibility to not only determine if development rates are sex dependent, but also to investigate the development of sexually dimorphic traits of interest in C. megacephala. © The Authors 2016. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Chuang, Tzu-Wei; Lee, Kuo-Ming; Lou, Yuan-Chao; Lu, Chia-Chen; Tarn, Woan-Yuh
2016-04-15
Eukaryotic mRNA biogenesis involves a series of interconnected steps mediated by RNA-binding proteins. The exon junction complex core protein Y14 is required for nonsense-mediated mRNA decay (NMD) and promotes translation. Moreover, Y14 binds the cap structure of mRNAs and inhibits the activity of the decapping enzyme Dcp2. In this report, we show that an evolutionarily conserved tryptophan residue (Trp-73) of Y14 is critical for its binding to the mRNA cap structure. A Trp-73 mutant (W73V) bound weakly to mRNAs and failed to protect them from degradation. However, this mutant could still interact with the NMD and mRNA degradation factors and retained partial NMD activity. In addition, we found that the W73V mutant could not interact with translation initiation factors. Overexpression of W73V suppressed reporter mRNA translation in vitro and in vivo and reduced the level of a set of nascent proteins. These results reveal a residue of Y14 that confers cap-binding activity and is essential for Y14-mediated enhancement of translation. Finally, we demonstrated that Y14 may selectively and differentially modulate protein biosynthesis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Kim, Dokyoon; Basile, Anna O; Bang, Lisa; Horgusluoglu, Emrin; Lee, Seunggeun; Ritchie, Marylyn D; Saykin, Andrew J; Nho, Kwangsik
2017-05-18
Rapid advancement of next generation sequencing technologies such as whole genome sequencing (WGS) has facilitated the search for genetic factors that influence disease risk in the field of human genetics. To identify rare variants associated with human diseases or traits, an efficient genome-wide binning approach is needed. In this study we developed a novel biological knowledge-based binning approach for rare-variant association analysis and then applied the approach to structural neuroimaging endophenotypes related to late-onset Alzheimer's disease (LOAD). For rare-variant analysis, we used the knowledge-driven binning approach implemented in Bin-KAT, an automated tool, that provides 1) binning/collapsing methods for multi-level variant aggregation with a flexible, biologically informed binning strategy and 2) an option of performing unified collapsing and statistical rare variant analyses in one tool. A total of 750 non-Hispanic Caucasian participants from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort who had both WGS data and magnetic resonance imaging (MRI) scans were used in this study. Mean bilateral cortical thickness of the entorhinal cortex extracted from MRI scans was used as an AD-related neuroimaging endophenotype. SKAT was used for a genome-wide gene- and region-based association analysis of rare variants (MAF (minor allele frequency) < 0.05) and potential confounding factors (age, gender, years of education, intracranial volume (ICV) and MRI field strength) for entorhinal cortex thickness were used as covariates. Significant associations were determined using FDR adjustment for multiple comparisons. Our knowledge-driven binning approach identified 16 functional exonic rare variants in FANCC significantly associated with entorhinal cortex thickness (FDR-corrected p-value < 0.05). In addition, the approach identified 7 evolutionary conserved regions, which were mapped to FAF1, RFX7, LYPLAL1 and GOLGA3, significantly associated with entorhinal cortex thickness (FDR-corrected p-value < 0.05). In further analysis, the functional exonic rare variants in FANCC were also significantly associated with hippocampal volume and cerebrospinal fluid (CSF) Aβ 1-42 (p-value < 0.05). Our novel binning approach identified rare variants in FANCC as well as 7 evolutionary conserved regions significantly associated with a LOAD-related neuroimaging endophenotype. FANCC (fanconi anemia complementation group C) has been shown to modulate TLR and p38 MAPK-dependent expression of IL-1β in macrophages. Our results warrant further investigation in a larger independent cohort and demonstrate that the biological knowledge-driven binning approach is a powerful strategy to identify rare variants associated with AD and other complex disease.
Bahrami, A; Behzadi, Sh; Miraei-Ashtiani, S R; Roh, S-G; Katoh, K
2013-09-15
The somatotropic axis, the control system for growth hormone (GH) secretion and its endogenous factors involved in the regulation of metabolism and energy partitioning, has promising potentials for producing economically valuable traits in farm animals. Here we investigated single nucleotide polymorphisms (SNPs) of the genes of factors involved in the somatotropic axis for growth hormone (GH1), growth hormone receptor (GHR), ghrelin (GHRL), insulin-like growth factor 1 (IGF-I) and leptin (LEP), using polymerase chain reaction-single-strand conformation polymorphism (PCR-SSCP) and DNA sequencing methods in 452 individual Mehraban sheep. A nonradioactive method to allow SSCP detection was used for genomic DNA and PCR amplification of six fragments: exons 4 and 5 of GH1; exon 10 of GH receptor (GHR); exon 1 of ghrelin (GHRL); exon 1 of insulin-like growth factor-I (IGF-I), and exon 3 of leptin (LEP). Polymorphisms were detected in five of the six PCR products. Two electrophoretic patterns were detected for GH1 exon 4. Five conformational patterns were detected for GH1 exon 5 and LEP exon 3, and three for IGF-I exon 1. Only GHR and GHRL were monomorphic. Changes in protein structures due to variable SNPs were also analyzed. The results suggest that Mehraban sheep, a major breed that is important for the animal industry in Middle East countries, has high genetic variability, opening interesting prospects for future selection programs and preservation strategies. Copyright © 2013 Elsevier B.V. All rights reserved.
Conservation and diversification of Msx protein in metazoan evolution.
Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun
2008-01-01
Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
Drosha Promotes Splicing of a Pre-microRNA-like Alternative Exon
Havens, Mallory A.; Reich, Ashley A.; Hastings, Michelle L.
2014-01-01
The ribonuclease III enzyme Drosha has a central role in the biogenesis of microRNA (miRNA) by binding and cleaving hairpin structures in primary RNA transcripts into precursor miRNAs (pre-miRNAs). Many miRNA genes are located within protein-coding host genes and cleaved by Drosha in a manner that is coincident with splicing of introns by the spliceosome. The close proximity of splicing and pre-miRNA biogenesis suggests a potential for co-regulation of miRNA and host gene expression, though this relationship is not completely understood. Here, we describe a cleavage-independent role for Drosha in the splicing of an exon that has a predicted hairpin structure resembling a Drosha substrate. We find that Drosha can cleave the alternatively spliced exon 5 of the eIF4H gene into a pre-miRNA both in vitro and in cells. However, the primary role of Drosha in eIF4H gene expression is to promote the splicing of exon 5. Drosha binds to the exon and enhances splicing in a manner that depends on RNA structure but not on cleavage by Drosha. We conclude that Drosha can function like a splicing enhancer and promote exon inclusion. Our results reveal a new mechanism of alternative splicing regulation involving a cleavage-independent role for Drosha in splicing. PMID:24786770
Exome-wide DNA capture and next generation sequencing in domestic and wild species.
Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon
2011-07-05
Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.
Intergenic mRNA molecules resulting from trans-splicing.
Finta, Csaba; Zaphiropoulos, Peter G
2002-02-22
Accumulated recent evidence is indicating that alternative splicing represents a generalized process that increases the complexity of human gene expression. Here we show that mRNA production may not necessarily be limited to single genes, as human liver also has the potential to produce a variety of hybrid cytochrome P450 3A mRNA molecules. The four known cytochrome P450 3A genes in humans, CYP3A4, CYP3A5, CYP3A7, and CYP3A43, share a high degree of similarity, consist of 13 exons with conserved exon-intron boundaries, and form a cluster on chromosome 7. The chimeric CYP3A mRNA molecules described herein are characterized by CYP3A43 exon 1 joined at canonical splice sites to distinct sets of CYP3A4 or CYP3A5 exons. Because the CYP3A43 gene is in a head-to-head orientation with the CYP3A4 and CYP3A5 genes, bypassing transcriptional termination can not account for the formation of hybrid CYP3A mRNAs. Thus, the mechanism generating these molecules has to be an RNA processing event that joins exons of independent pre-mRNA molecules, i.e. trans-splicing. Using quantitative real-time polymerase chain reaction, the ratio of one CYP3A43/3A4 intergenic combination was estimated to be approximately 0.15% that of the CYP3A43 mRNAs. Moreover, trans-splicing has been found not to interfere with polyadenylation. Heterologous expression of the chimeric species composed of CYP3A43 exon 1 joined to exons 2-13 of CYP3A4 revealed catalytic activity toward testosterone.
Comparative genomics on Norrie disease gene.
Katoh, Masuko; Katoh, Masaru
2005-05-01
DAND1 (NBL1), DAND2 (CKTSF1B1 or GREM1 or GREMLIN), DAND3 (CKTSF1B2 or GREM2 or PRDC), DAND4 (CER1), DAND5 (CKTSF1B3 or GREM3 or DANTE), MUC2, MUC5AC, MUC5B, MUC6, MUC19, WISP1, WISP2, WISP3, VWF, NOV and Norrie disease (NDP or NORRIN) genes encode proteins with cysteine knot domain. Cysteine-knot superfamily proteins regulate ligand-receptor interactions for a variety of signaling pathways implicated in embryogenesis, homeostasis, and carcinogenesis. Although Ndp is unrelated to Wnt family members, Ndp is claimed to function as a ligand for Fzd4. Here, we identified and characterized rat Ndp, cow Ndp, chicken ndp and zebrafish ndp genes by using bioinformatics. Rat Ndp gene, consisting of three exons, was located within AC105563.4 genome sequence. Cow Ndp and chicken ndp complete CDS were derived from CB467544.1 EST and BX932859.2 cDNA, respectively. Zebrafish ndp gene was located within BX572627.5 genome sequence. Rat Ndp (131 aa) was a secreted protein with C-terminal cysteine knot-like (CTCK) domain. Rat Ndp showed 100, 96.9, 95.4, 87.8 and 66.4 total-amino-acid identity with mouse Ndp, cow Ndp, human NDP, chicken ndp and zebrafish ndp, respectively. Exon-intron structure of mammalian Ndp orthologs was well conserved. FOXA2, CUTL1 (CCAAT displacement protein), LMO2, CEBPA (C/EBPalpha)-binding sites and triple POU2F1 (OCT1)-binding sites were conserved among promoters of mammalian Ndp orthologs.
Identification of NADPH oxidase family members associated with cold stress in strawberry.
Zhang, Yunting; Li, Yali; He, Yuwei; Hu, Wenjie; Zhang, Yong; Wang, Xiaorong; Tang, Haoru
2018-04-01
NADPH oxidase is encoded by a small gene family (Respiratory burst oxidase homologs, Rbohs ) and plays an important role in regulating various biological processes. However, little information about this gene family is currently available for strawberry. In this study, a total of seven Rboh genes were identified from strawberry through genomewide analysis. Gene structure analysis showed the number of exons ranged from 10 to 23, implying that this variation occurred in FvRboh genes by the insertion and distribution of introns; the order and approximate size of exons were relatively conserved. FvRbohC was predicted to localize to the thylakoid membrane of the chloroplast, while other members were computed to localize to the plasma membrane, indicating different functions. Amino acid sequence alignment, conserved domain, and motif analysis showed that all identified FvRbohs had typical features of plant Rbohs. Phylogenetic analysis of Rbohs from strawberry, grape, Arabidopsis, and rice suggested that the FvRbohs could be divided into five subgroups and showed a closer relationship with those from grape and Arabidopsis than those from rice. The expression patterns of FvRboh genes in root, stem, leaf, flower, and fruit revealed robust tissue specificity. The expression levels of FvRbohA and FvRbohD were quickly induced by cold stress, followed by an increase in NADPH oxidase activity, leading to O2- accumulation and triggering the antioxidant reaction by the transient increases in SOD activity. This suggested these two genes may be involved in cold stress and defense responses in strawberry.
An Enhancer Near ISL1 and an Ultraconserved Exon of PCBP2 areDerived from a Retroposon
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bejerano, Gill; Lowe, Craig; Ahituv, Nadav
2005-11-27
Hundreds of highly conserved distal cis-regulatory elementshave been characterized to date in vertebrate genomes1. Many thousandsmore are predicted based on comparative genomics2,3. Yet, in starkcontrast to the genes they regulate, virtually none of these regions canbe traced using sequence similarity in invertebrates, leaving theirevolutionary origin obscure. Here we show that a class of conserved,primarily non-coding regions in tetrapods originated from a novel shortinterspersed repetitive element (SINE) retroposon family that was activein Sarcopterygii (lobe-finned fishes and terrestrial vertebrates) in theSilurian at least 410 Mya4, and, remarkably, appears to be recentlyactive in the "living fossil" Indonesian coelacanth, Latimeriamenadoensis. We show that onemore » copy is a distal enhancer, located 500kbfrom the neuro-developmental gene ISL1. Several others represent new,possibly regulatory, alternatively spliced exons in the middle ofpre-existing Sarcopterygian genes. One of these is the>200bpultraconserved region5, 100 percent identical in mammals, and 80 percentidentical to the coelacanth SINE, that contains a 31aa alternativelyspliced exon of the mRNA processing gene PCBP26. These add to a growinglist of examples7 in which relics of transposable elements have acquireda function that serves their host, a process termed "exaptation"8, andprovide an origin for at least some of the highly-conservedvertebrate-specific genomic sequences recently discovered usingcomparative genomics.« less
Chen, Ying; Lei, Yun-Ping; Zheng, Hong-Xiang; Wang, Wei; Cheng, Hong-Bo; Zhang, Jing; Wang, Hong-Yan; Jin, Li; Li, Hong
2009-06-01
Congenital contractural arachnodactyly (Beals syndrome) is a rare autosomal dominantly inherited connective tissue disorder characterized by flexion contractures, arachnodactyly, crumpled ears, and mild muscular hypoplasia. Here, a father and son with congenital contractural arachnodactyly features were identified. After sequencing 15 exons (22 to 36) of the FBN2 gene, a novel mutation (C1425Y) was found in exon 33. This de novo mutation presented first in the father and was transmitted to his son, but not in the other 14 unaffected family members and 365 normal people. The C1425Y mutation occurs at the 19th cbEGF domain. Cysteines in this cbEGF domain are rather conserved in species, from human down to ascidian. The cbEGF12-13 in human FBN1 was employed as the template to perform homology modeling of cbEGF18-19 of human FBN2 protein. The mutation has also been evaluated by further prediction tools, for example, SIFT, Blosum62, biochemical Yu's matrice, and UMD-Predictor tool. In all analysis, the mutation is predicted to be pathogenic. Thus, the structure destabilization by C1425Y might be the cause of the disorder.
Gene organization and alternative splicing of human prohormone convertase PC8.
Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T
1998-01-01
The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811
DLEU2 encodes an antisense RNA for the putative bicistronic RFP2/LEU5 gene in humans and mouse.
Corcoran, Martin M; Hammarsund, Marianne; Zhu, Chaoyong; Lerner, Mikael; Kapanadze, Bagrat; Wilson, Bill; Larsson, Catharina; Forsberg, Lars; Ibbotson, Rachel E; Einhorn, Stefan; Oscier, David G; Grandér, Dan; Sangfelt, Olle
2004-08-01
Our group previously identified two novel genes, RFP2/LEU5 and DLEU2, within a 13q14.3 genomic region of loss seen in various malignancies. However, no specific inactivating mutations were found in these or other genes in the vicinity of the deletion, suggesting that a nonclassical tumor-suppressor mechanism may be involved. Here, we present data showing that the DLEU2 gene encodes a putative noncoding antisense RNA, with one exon directly overlapping the first exon of the RFP2/LEU5 gene in the opposite orientation. In addition, the RFP2/LEU5 transcript can be alternatively spliced to produce either several monocistronic transcripts or a putative bicistronic transcript encoding two separate open-reading frames, adding to the complexity of the locus. The finding that these gene structures are conserved in the mouse, including the putative bicistronic RFP2/LEU5 transcript as well as the antisense relationship with DLEU2, further underlines the significance of this unusual organization and suggests a biological function for DLEU2 in the regulation of RFP2/LEU5. Copyright 2004 Wiley-Liss, Inc.
Solana, Jordi; Irimia, Manuel; Ayoub, Salah; Orejuela, Marta Rodriguez; Zywitza, Vera; Jens, Marvin; Tapial, Javier; Ray, Debashish; Morris, Quaid; Hughes, Timothy R; Blencowe, Benjamin J; Rajewsky, Nikolaus
2016-01-01
In contrast to transcriptional regulation, the function of alternative splicing (AS) in stem cells is poorly understood. In mammals, MBNL proteins negatively regulate an exon program specific of embryonic stem cells; however, little is known about the in vivo significance of this regulation. We studied AS in a powerful in vivo model for stem cell biology, the planarian Schmidtea mediterranea. We discover a conserved AS program comprising hundreds of alternative exons, microexons and introns that is differentially regulated in planarian stem cells, and comprehensively identify its regulators. We show that functional antagonism between CELF and MBNL factors directly controls stem cell-specific AS in planarians, placing the origin of this regulatory mechanism at the base of Bilaterians. Knockdown of CELF or MBNL factors lead to abnormal regenerative capacities by affecting self-renewal and differentiation sets of genes, respectively. These results highlight the importance of AS interactions in stem cell regulation across metazoans. DOI: http://dx.doi.org/10.7554/eLife.16797.001 PMID:27502555
Wang, Xu-Hua; Wang, Yong; Liu, A-Ke; Liu, Xiao-Ting; Zhou, Yang; Yao, Qin; Chen, Ke-Ping
2015-04-01
The basic helix-loop-helix (bHLH) domain is a highly conserved amino acid motif that defines a group of DNA-binding transcription factors. bHLH proteins play essential regulatory roles in a variety of biological processes in animal, plant, and fungus. The domestic dog, Canis lupus familiaris, is a good model organism for genetic, physiological, and behavioral studies. In this study, we identified 115 putative bHLH genes in the dog genome. Based on a phylogenetic analysis, 51, 26, 14, 4, 12, and 4 dog bHLH genes were assigned to six separate groups (A-F); four bHLH genes were categorized as ''orphans''. Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with positional conservation, other conserved domains flanking the bHLH motif, and highly conserved intron/exon patterns in other vertebrates. Our analytical results confirmed the GenBank annotations of 89 dog bHLH proteins and provided information that could be used to update the annotations of the remaining 26 dog bHLH proteins. These data will provide good references for further studies on the structures and regulatory functions of bHLH proteins in the growth and development of dogs, which may help in understanding the mechanisms that underlie the physical and behavioral differences between dogs and wolves.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.
Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Levy-Lahad, E.; Wang, Kai; Fu, Ying Hui
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23, 737 bp. The first 2 exons encode the 5{prime}-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splicemore » acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system. 19 refs., 2 figs., 3 tabs.« less
Zhou, Yong; Hu, Lifang; Jiang, Lunwei; Liu, Shiqiang
2018-06-01
YTH domain-containing RNA-binding proteins are involved in post-transcriptional regulation and play important roles in the growth and development as well as abiotic stress responses of plants. However, YTH genes have not been previously studied in cucumber (Cucumis sativus). In this study, a total of five YTH genes (CsYTH1-CsYTH5) were identified in cucumber, which could be mapped on three out of the seven cucumber chromosomes. All CsYTH proteins had highly conserved C-terminal YTH domains, and two of them (CsYTH1 and CsYTH4) harbored extra CCCH and P/Q/N-rich domains. The phylogenesis, conserved motifs and exon-intron structure of YTH genes from cucumber, Arabidopsis and rice were also analyzed. The phylogenetically closely clustered YTHs shared similar gene structures and conserved motifs. An analysis of the cis-acting regulatory elements in the upstream region of these genes resulted in the identification of many cis-elements related to stress, hormone and development. Expression analysis based on the transcriptome data showed that some CsYTHs had development- or tissue-specific expression. In addition, their expression levels were altered under various stresses such as salt, drought, cold, and abscisic acid (ABA) treatments. These findings lay the foundation for the functional analysis of CsYTHs in the future.
Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus
Gissi, Carmela; Pesole, Graziano; Cattaneo, Elena; Tartari, Marzia
2006-01-01
Background To gain insight into the evolutionary features of the huntingtin (htt) gene in Chordata, we have sequenced and characterized the full-length htt mRNA in the ascidian Ciona intestinalis, a basal chordate emerging as new invertebrate model organism. Moreover, taking advantage of the availability of genomic and EST sequences, the htt gene structure of a number of chordate species, including the cogeneric ascidian Ciona savignyi, and the vertebrates Xenopus and Gallus was reconstructed. Results The C. intestinalis htt transcript exhibits some peculiar features, such as spliced leader trans-splicing in the 98 nt-long 5' untranslated region (UTR), an alternative splicing in the coding region, eight alternative polyadenylation sites, and no similarities of both 5' and 3'UTRs compared to homologs of the cogeneric C. savignyi. The predicted protein is 2946 amino acids long, shorter than its vertebrate homologs, and lacks the polyQ and the polyP stretches found in the the N-terminal regions of mammalian homologs. The exon-intron organization of the htt gene is almost identical among vertebrates, and significantly conserved between Ciona and vertebrates, allowing us to hypothesize an ancestral chordate gene consisting of at least 40 coding exons. Conclusion During chordate diversification, events of gain/loss, sliding, phase changes, and expansion of introns occurred in both vertebrate and ascidian lineages predominantly in the 5'-half of the htt gene, where there is also evidence of lineage-specific evolutionary dynamics in vertebrates. On the contrary, the 3'-half of the gene is highly conserved in all chordates at the level of both gene structure and protein sequence. Between the two Ciona species, a fast evolutionary rate and/or an early divergence time is suggested by the absence of significant similarity between UTRs, protein divergence comparable to that observed between mammals and fishes, and different distribution of repetitive elements. PMID:17092333
Lin, Hsiang-Kai; Boatz, Jennifer C.; Krabbendam, Inge E.; Kodali, Ravindra; Hou, Zhipeng; Wetzel, Ronald; Dolga, Amalia M.; Poirier, Michelle A.; van der Wel, Patrick C. A.
2017-01-01
Polyglutamine expansion in the huntingtin protein is the primary genetic cause of Huntington's disease (HD). Fragments coinciding with mutant huntingtin exon1 aggregate in vivo and induce HD-like pathology in mouse models. The resulting aggregates can have different structures that affect their biochemical behaviour and cytotoxic activity. Here we report our studies of the structure and functional characteristics of multiple mutant htt exon1 fibrils by complementary techniques, including infrared and solid-state NMR spectroscopies. Magic-angle-spinning NMR reveals that fibrillar exon1 has a partly mobile α-helix in its aggregation-accelerating N terminus, and semi-rigid polyproline II helices in the proline-rich flanking domain (PRD). The polyglutamine-proximal portions of these domains are immobilized and clustered, limiting access to aggregation-modulating antibodies. The polymorphic fibrils differ in their flanking domains rather than the polyglutamine amyloid structure. They are effective at seeding polyglutamine aggregation and exhibit cytotoxic effects when applied to neuronal cells. PMID:28537272
NASA Astrophysics Data System (ADS)
Lin, Hsiang-Kai; Boatz, Jennifer C.; Krabbendam, Inge E.; Kodali, Ravindra; Hou, Zhipeng; Wetzel, Ronald; Dolga, Amalia M.; Poirier, Michelle A.; van der Wel, Patrick C. A.
2017-05-01
Polyglutamine expansion in the huntingtin protein is the primary genetic cause of Huntington's disease (HD). Fragments coinciding with mutant huntingtin exon1 aggregate in vivo and induce HD-like pathology in mouse models. The resulting aggregates can have different structures that affect their biochemical behaviour and cytotoxic activity. Here we report our studies of the structure and functional characteristics of multiple mutant htt exon1 fibrils by complementary techniques, including infrared and solid-state NMR spectroscopies. Magic-angle-spinning NMR reveals that fibrillar exon1 has a partly mobile α-helix in its aggregation-accelerating N terminus, and semi-rigid polyproline II helices in the proline-rich flanking domain (PRD). The polyglutamine-proximal portions of these domains are immobilized and clustered, limiting access to aggregation-modulating antibodies. The polymorphic fibrils differ in their flanking domains rather than the polyglutamine amyloid structure. They are effective at seeding polyglutamine aggregation and exhibit cytotoxic effects when applied to neuronal cells.
2012-01-01
Background It is known from recent studies that more than 90% of human multi-exon genes are subject to Alternative Splicing (AS), a key molecular mechanism in which multiple transcripts may be generated from a single gene. It is widely recognized that a breakdown in AS mechanisms plays an important role in cellular differentiation and pathologies. Polymerase Chain Reactions, microarrays and sequencing technologies have been applied to the study of transcript diversity arising from alternative expression. Last generation Affymetrix GeneChip Human Exon 1.0 ST Arrays offer a more detailed view of the gene expression profile providing information on the AS patterns. The exon array technology, with more than five million data points, can detect approximately one million exons, and it allows performing analyses at both gene and exon level. In this paper we describe BEAT, an integrated user-friendly bioinformatics framework to store, analyze and visualize exon arrays datasets. It combines a data warehouse approach with some rigorous statistical methods for assessing the AS of genes involved in diseases. Meta statistics are proposed as a novel approach to explore the analysis results. BEAT is available at http://beat.ba.itb.cnr.it. Results BEAT is a web tool which allows uploading and analyzing exon array datasets using standard statistical methods and an easy-to-use graphical web front-end. BEAT has been tested on a dataset with 173 samples and tuned using new datasets of exon array experiments from 28 colorectal cancer and 26 renal cell cancer samples produced at the Medical Genetics Unit of IRCCS Casa Sollievo della Sofferenza. To highlight all possible AS events, alternative names, accession Ids, Gene Ontology terms and biochemical pathways annotations are integrated with exon and gene level expression plots. The user can customize the results choosing custom thresholds for the statistical parameters and exploiting the available clinical data of the samples for a multivariate AS analysis. Conclusions Despite exon array chips being widely used for transcriptomics studies, there is a lack of analysis tools offering advanced statistical features and requiring no programming knowledge. BEAT provides a user-friendly platform for a comprehensive study of AS events in human diseases, displaying the analysis results with easily interpretable and interactive tables and graphics. PMID:22536968
Intergenic disease-associated regions are abundant in novel transcripts.
Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E
2017-12-28
Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Kück, Ulrich; Choquet, Yves; Schneider, Michel; Dron, Michel; Bennoun, Pierre
1987-01-01
The two homologous genes for the P700 chlorophyll a-apoproteins (ps1A1 and ps1A2) are encoded by the plastom in the green alga Chlamydomonas reinhardii. The structure and organization of the two genes were determined by comparison with the homologous genes from maize using data from heterologous hybridizations as well as from DNA and RNA sequencing. While the ps1A2 (736 codons) gene shows a continuous gene organization, the ps1A1 (754 codons) gene possesses some unusual features. The discontinuous gene is split into three separate exons which are scattered around the circular chloroplast genome. Exon 1 (86 bp) is separated by ∼50 kb from exon 2 (198 bp), which is located ∼ 90 kb apart from exon 3 (1984 bp). All exons are flanked by intronic sequences of group II. Transcription analysis reveals that the ps1A2 gene hybridizes with a 2.8-kb transcript, while all exon regions of the ps1A1 gene are homologous to a mature mRNA of 2.7 kb. From our data we conclude that the three distantly separated exonic sequences of the ps1A1 gene constitute a functional gene which probably operates by a trans-splicing mechanism. ImagesFig. 3.Fig. 5.Fig. 6. PMID:16453785
Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V
2003-09-02
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kawagoe, Kazuyoshi; Takeda, Junji; Kinoshita, Taroh
Many membrane proteins are anchored to the cell membrane by glycosylphosphatidylinositol (GPI). The core structure and biosynthesis of the GPI anchor are well conserved in eukaryote cells. We previously cloned a human PIGA gene that participates in GPI anchor biosynthesis. We have now cloned complementary and genomic DNA of Pig-a, the murine homologue of PIGA, and compared its function and gene structure with those of PIGA. The deduced amino acid sequence of mouse PIG-A is 88% identical with that of human PIG-A. Transfection of Pig-a cDNA complemented the defects of both a PIG-A-deficient murine cell line and a PIG-A-deficient humanmore » cell line, demonstrating that functions of mouse and human PIG-A are conserved. Like human PIGA, the chromosomal Pig-a gene has six exons and spans approximately 16 kb. Moreover, Pig-a was mapped to X-F3/4, which is syntenic to human Xp22.1, where PIGA is located. Thus, murine Pig-a provides a good animal model to study paroxysmal nocturnal hemoglobinuria, a disease caused by a somatic mutation of PIGA. Database analysis demonstrated that a yeast gene, SPT14, is homologous to Pig-a and PIGA and that these genes are members of a glycosyltransferase gene family.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rittig, S.; Siggaard, C.; Pedersen, E.B.
1996-01-01
Familial neurohypophyseal diabetes insipidus (FNDI) is an autosomal dominant disorder characterized by progressive postnatal deficiency of arginine vasopressin as a result of mutation in the gene that encodes the hormone. To determine the extent of mutations in the coding region that produce the phenotype, we studied members of 17 unrelated kindreds with the disorder. We sequenced all 3 exons of the gene by using a rapid, direct dye-terminator method and found the causative mutation in each kindred. In four kindreds, the mutations were each identical to mutations described in other affected families. In the other 13 kindreds each mutation wasmore » unique. There were two missense mutations that altered the cleavage region of the signal peptide, seven missense mutations in exon 2, which codes for the conserved portion of the protein, one nonsense mutation in exon 2, and three nonsense mutations in exon 3. These findings, together with the clinical features of FNDI, suggest that each of the mutations exerts an effect by directing the production of a pre-prohormone that cannot be folded, processed, or degraded properly and eventually destroys vasopressinergic neurons. 63 refs., 5 figs., 6 tabs.« less
Rittig, S.; Robertson, G. L.; Siggaard, C.; Kovács, L.; Gregersen, N.; Nyborg, J.; Pedersen, E. B.
1996-01-01
Familial neurohypophyseal diabetes insipidus (FNDI) is an autosomal dominant disorder characterized by progressive postnatal deficiency of arginine vasopressin as a result of mutation in the gene that encodes the hormone. To determine the extent of mutations in the coding region that produce the phenotype, we studied members of 17 unrelated kindreds with the disorder. We sequenced all 3 exons of the gene by using a rapid, direct dye-terminator method and found the causative mutation in each kindred. In four kindreds, the mutations were each identical to mutations described in other affected families. In the other 13 kindreds each mutation was unique. There were two missense mutations that altered the cleavage region of the signal peptide, seven missense mutations in exon 2, which codes for the conserved portion of the protein, one nonsense mutation in exon 2, and three nonsense mutations in exon 3. These findings, together with the clinical features of FNDI, suggest that each of the mutations exerts an effect by directing the production of a pre-prohormone that cannot be folded, processed, or degraded properly and eventually destroys vasopressinergic neurons. Images Figure 3 PMID:8554046
Abdoli, R; Zamani, P; Deljou, A; Rezvan, H
2013-07-25
BMPR-1B and GDF9 genes are well known due to their important effects on litter size and mechanisms controlling ovulation rate in sheep. In the present study, polymorphisms of BMPR-1B gene exon 8 and GDF9 gene exon 1 were detected by single strand conformational polymorphism (SSCP) analysis and DNA sequencing methods in 100 Mehraban ewes. The PCR reaction forced to amplify 140 and 380-bp fragments of BMPR-1B and GDF9 genes, respectively. Two single nucleotide polymorphisms (SNPS) were identified in two different SSCP patterns of BMPR-1B gene (CC and CA genotypes) that deduced one amino acid exchange. Also, two SNPS were identified in three different SSCP patterns of GDF9 gene (AA, AG and GG genotypes) that deduced one amino acid exchanges. Two different secondary structures of protein were predicted for BMPR-1B exon 8, but the secondary protein structures predicted for GDF9 exon 1 were similar together. The evaluation of the associations between the SSCP patterns and the protein structure changes with reproduction traits showed that BMPR-1B exon 8 genotypes have significant effects on some of reproduction traits but the GDF9 genotypes did not have any significant effect. The CA genotype of BMPR-1B exon 8 had a significant positive effect on reproduction performance and could be considered as an important and new mutation, affecting the ewes reproduction performance. Marker assisted selection using BMPR-IB gene could be noticed to improve the reproduction traits in Mehraban sheep. Copyright © 2013 Elsevier B.V. All rights reserved.
Zouheir Habbal, Mohammad; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F
2014-01-01
Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5-16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband's phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin.
Habbal, Mohammad Zouheir; Bou-Assi, Tarek; Zhu, Jun; Owen, Renius; Chehab, Farid F.
2014-01-01
Alkaptonuria is often diagnosed clinically with episodes of dark urine, biochemically by the accumulation of peripheral homogentisic acid and molecularly by the presence of mutations in the homogentisate 1,2-dioxygenase gene (HGD). Alkaptonuria is invariably associated with HGD mutations, which consist of single nucleotide variants and small insertions/deletions. Surprisingly, the presence of deletions beyond a few nucleotides among over 150 reported deleterious mutations has not been described, raising the suspicion that this gene might be protected against the detrimental mechanisms of gene rearrangements. The quest for an HGD mutation in a proband with AKU revealed with a SNP array five large regions of homozygosity (5–16 Mb), one of which includes the HGD gene. A homozygous deletion of 649 bp deletion that encompasses the 72 nucleotides of exon 2 and surrounding DNA sequences in flanking introns of the HGD gene was unveiled in a proband with AKU. The nature of this deletion suggests that this in-frame deletion could generate a protein without exon 2. Thus, we modeled the tertiary structure of the mutant protein structure to determine the effect of exon 2 deletion. While the two β-pleated sheets encoded by exon 2 were missing in the mutant structure, other β-pleated sheets are largely unaffected by the deletion. However, nine novel α-helical coils substituted the eight coils present in the native HGD crystal structure. Thus, this deletion results in a deleterious enzyme, which is consistent with the proband’s phenotype. Screening for mutations in the HGD gene, particularly in the Middle East, ought to include this exon 2 deletion in order to determine its frequency and uncover its origin. PMID:25233259
Fukao, T; Yamaguchi, S; Wakazono, A; Orii, T; Hoganson, G; Hashimoto, T
1994-01-01
We identified a novel exonic mutation which causes exon skipping in the mitochondrial acetoacetyl-CoA thiolase (T2) gene from a girl with T2 deficiency (GK07). GK07 is a compound heterozygote; the maternal allele has a novel G to T transversion at position 1136 causing Gly379 to Val substitution (G379V) of the T2 precursor. In case of in vivo expression analysis, cells transfected with this mutant cDNA showed no evidence of restored T2 activity. The paternal allele was associated with exon 8 skipping at the cDNA level. At the gene level, a C to T transition causing Gln272 to termination codon (Q272STOP) was identified within exon 8, 13 bp from the 5' splice site of intron 8 in the paternal allele. The mRNA with Q272STOP could not be detected in GK07 fibroblasts, presumably because pre-mRNA with Q272STOP was unstable because of the premature termination. In vivo splicing experiments revealed that the exonic mutation caused partial skipping of exon 8. This substitution was thought to alter the secondary structure of T2 pre-mRNA around exon 8 and thus impede normal splicing. The role of exon sequences in the splicing mechanism is indicated by the exon skipping which occurred with an exonic mutation. Images PMID:7907600
Multi-Hamiltonian structure of equations of hydrodynamic type
NASA Astrophysics Data System (ADS)
Gümral, H.; Nutku, Y.
1990-11-01
The discussion of the Hamiltonian structure of two-component equations of hydrodynamic type is completed by presenting the Hamiltonian operators for Euler's equation governing the motion of plane sound waves of finite amplitude and another quasilinear second-order wave equation. There exists a doubly infinite family of conserved Hamiltonians for the equations of gas dynamics that degenerate into one, namely, the Benney sequence, for shallow-water waves. Infinite sequences of conserved quantities for these equations are also presented. In the case of multicomponent equations of hydrodynamic type, it is shown, that Kodama's generalization of the shallow-water equations admits bi-Hamiltonian structure.
Cloning, structure, and chromosome localization of the mouse glutaryl-CoA dehydrogenase gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koeller, D.M.; DiGiulio, A.; Frerman, F.E.
Glutaryl-CoA dehydrogenase (GCDH) is a nuclear-encoded, mitochondrial matrix enzyme. In humans, deficiency of GCDH leads to glutaric acidemia type I, and inherited disorder of amino acid metabolism characterized by a progressive neurodegenerative disease. In this report we describe the cloning and structure of the mouse GCDH (Gcdh) gene and cDNA and its chromosomal localization. The mouse Gcdh cDNA is 1.75 kb long and contains and open reading frame of 438 amino acids. The amino acid sequences of mouse, human, and pig GCDH are highly conserved. The mouse Gcdh gene contains 11 exons and spans 7 kb of genomic DNA. Gcdhmore » was mapped by backcross analysis to mouse chromosome 8 within a region that is homologous to a region of human chromosome 19, where the human gene was previously mapped. 14 refs., 3 figs.« less
Dynamic hyper-editing underlies temperature adaptation in Drosophila
Ashwal-Fluss, Reut; Pandey, Varun; Levanon, Erez Y.; Kadener, Sebastian
2017-01-01
In Drosophila, A-to-I editing is prevalent in the brain, and mutations in the editing enzyme ADAR correlate with specific behavioral defects. Here we demonstrate a role for ADAR in behavioral temperature adaptation in Drosophila. Although there is a higher level of editing at lower temperatures, at 29°C more sites are edited. These sites are less evolutionarily conserved, more disperse, less likely to be involved in secondary structures, and more likely to be located in exons. Interestingly, hypomorph mutants for ADAR display a weaker transcriptional response to temperature changes than wild-type flies and a highly abnormal behavioral response upon temperature increase. In sum, our data shows that ADAR is essential for proper temperature adaptation, a key behavior trait that is essential for survival of flies in the wild. Moreover, our results suggest a more general role of ADAR in regulating RNA secondary structures in vivo. PMID:28746393
Bredel, Markus; Ferrarese, Roberto; Harsh, Griffith R.; Yadav, Ajay K.; Bug, Eva; Maticzka, Daniel; Reichardt, Wilfried; Masilamani, Anie P.; Dai, Fangping; Kim, Hyunsoo; Hadler, Michael; Scholtens, Denise M.; Yu, Irene L.Y.; Beck, Jürgen; Srinivasasainagendra, Vinodh; Costa, Fabrizio; Baxan, Nicoleta; Pfeifer, Dietmar; Elverfeldt, Dominik v.; Backofen, Rolf; Weyerbrock, Astrid; Duarte, Christine W.; He, Xiaolin; Prinz, Marco; Chandler, James P.; Vogel, Hannes; Chakravarti, Arnab; Rich, Jeremy N.; Carro, Maria S.
2014-01-01
BACKGROUND: Tissue-specific alternative splicing is known to be critical to emergence of tissue identity during development, yet its role in malignant transformation is undefined. Tissue-specific splicing involves evolutionary-conserved, alternative exons, which represent only a minority of total alternative exons. Many, however, have functional features that influence activity in signaling pathways to profound biological effect. Given that tissue-specific splicing has a determinative role in brain development and the enrichment of genes containing tissue-specific exons for proteins with roles in signaling and development, it is thus plausible that changes in such exons could rewire normal neurogenesis towards malignant transformation. METHODS: We used integrated molecular genetic and cell biology analyses, computational biology, animal modeling, and clinical patient profiles to characterize the effect of aberrant splicing of a brain-enriched alternative exon in the membrane-binding tumor suppressor Annexin A7 (ANXA7) on oncogene regulation and brain tumorigenesis. RESULTS: We show that aberrant splicing of a tissue-specific cassette exon in ANXA7 diminishes endosomal targeting and consequent termination of the signal of the EGFR oncoprotein during brain tumorigenesis. Splicing of this exon is mediated by the ribonucleoprotein Polypyrimidine Tract-Binding Protein 1 (PTBP1), which is normally repressed during brain development but, we find, is excessively expressed in glioblastomas through either gene amplification or loss of a neuron-specific microRNA, miR-124. Silencing of PTBP1 attenuates both malignancy and angiogenesis in a stem cell-derived glioblastoma animal model characterized by a high native propensity to generate tumor endothelium or vascular pericytes to support tumor growth. We show that EGFR amplification and PTBP1 overexpression portend a similarly poor clinical outcome, further highlighting the importance of PTBP1-mediated activation of EGFR. CONCLUSIONS: Our data illustrate how anomalous splicing of a tissue-regulated exon in a constituent of an oncogenic signaling pathway eliminates its tumor suppressor function and promotes tumorigenesis. This paradigm of malignant glial transformation as a consequence of tissue-specific alternative exon splicing in a tumor suppressor, may have widespread applicability in explaining how changes in critical tissue-specific regulatory mechanisms reprogram normal development to oncogenesis. SECONDARY CATEGORY: n/a.
Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg
2005-12-01
In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 public available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related with the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats evolutionary related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of potential functional sites varied pronouncedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.
De novo mutation of PHEX in a type 1 diabetes patient.
Fang, Chen; Li, Hui; Li, Xiaozhen; Xiao, Wenjin; Huang, Yun; Cai, Wu; Yang, Yi; Hu, Ji
2016-05-01
A new missense mutation on the X chromosome (PHEX) at exon 4(c.442C>T) in a 4-generation Chinese Han pedigree is reported. The proband and four family members were clinically identified as the X-linked hypophosphatemic rickets (XLH) which is a dominant inherited disorder characterized by renal phosphate wasting, aberrant vitamin D metabolism, and abnormal bone mineralization. The proband is identified as hemizygous with the four female family members to be heterozygous genotypes. The discovery was made through the complete sequencing of the exons and the intron-exon boundaries of the PHEX gene of this family. The mutation caused the S141 residue to change to Phe from Ser which is perfectly conserved among humans, mice, rats, cows and chickens. PolyPhen-2 software analysis of the mutation indicated it was probably damaging. The proband was also diagnosed with type 1 diabetes (T1D) and the relationship between XLH and diabetes phenotypes was discussed in the paper.
Circular RNA biogenesis can proceed through an exon-containing lariat precursor.
Barrett, Steven P; Wang, Peter L; Salzman, Julia
2015-06-09
Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.
The genomic structure: proof of the role of non-coding DNA.
Bouaynaya, Nidhal; Schonfeld, Dan
2006-01-01
We prove that the introns play the role of a decoy in absorbing mutations in the same way hollow uninhabited structures are used by the military to protect important installations. Our approach is based on a probability of error analysis, where errors are mutations which occur in the exon sequences. We derive the optimal exon length distribution, which minimizes the probability of error in the genome. Furthermore, to understand how can Nature generate the optimal distribution, we propose a diffusive random walk model for exon generation throughout evolution. This model results in an alpha stable exon length distribution, which is asymptotically equivalent to the optimal distribution. Experimental results show that both distributions accurately fit the real data. Given that introns also drive biological evolution by increasing the rate of unequal crossover between genes, we conclude that the role of introns is to maintain a genius balance between stability and adaptability in eukaryotic genomes.
Predicting Gene Structure Changes Resulting from Genetic Variants via Exon Definition Features.
Majoros, William H; Holt, Carson; Campbell, Michael S; Ware, Doreen; Yandell, Mark; Reddy, Timothy E
2018-04-25
Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed, and produce functional proteins. We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and noncoding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or noncoding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products, and we propose that they may commonly act as cryptic factors in disease. The software is available from geneprediction.org/SGRF. bmajoros@duke.edu. Supplementary information is available at Bioinformatics online.
Evolution of two Rh blood group-related genes of the amphioxus species Branchiostoma floridae.
Kitano, Takashi; Satou, Masahiro; Saitou, Naruya
2010-04-01
We determined cDNAs of two genes that belong to the Rhesus (Rh) blood group gene family in an amphioxus species (Branchiostoma floridae) and designated them Rh-related-1 (RhR-1) and Rh-related-2 (RhR-2). RhR-1 and RhR-2 consisted of 10 and 11 exons, respectively. 3' UTR sequences of RhR-1 were shorter (220-272 bp) than those of RhR-2 (1,505-1,650 bp). CDS lengths were 1,344 and 1,476 bp for RhR-1 and RhR-2, respectively, and the average nucleotide difference between their CDS regions was 0.33. The corresponding regions of Rh genes from exons 2 to 7 were relatively conserved among the chordate species examined in this study. Length difference numbers were in multiples of three, which implies that codon frames were conserved among them, and the same exon/intron boundary phases were observed in those regions. This region was used for the phylogenetic analyses. RhR-1 and RhR-2 formed a cluster on the phylogenetic tree of the Rh gene family. Gene duplication time of RhR-1 and RhR-2 was estimated to be ca. 500 million years ago. It is likely that the four Rh family genes in vertebrates emerged by gene duplications in the common ancestor of vertebrates, and functional differentiation has occurred after the first gene duplication.
Leiomodins: larger members of the tropomodulin (Tmod) gene family
NASA Technical Reports Server (NTRS)
Conley, C. A.; Fritz-Six, K. L.; Almenar-Queralt, A.; Fowler, V. M.
2001-01-01
The 64-kDa autoantigen D1 or 1D, first identified as a potential autoantigen in Graves' disease, is similar to the tropomodulin (Tmod) family of actin filament pointed end-capping proteins. A novel gene with significant similarity to the 64-kDa human autoantigen D1 has been cloned from both humans and mice, and the genomic sequences of both genes have been identified. These genes form a subfamily closely related to the Tmods and are here named the Leiomodins (Lmods). Both Lmod genes display a conserved intron-exon structure, as do three Tmod genes, but the intron-exon structure of the Lmods and the Tmods is divergent. mRNA expression analysis indicates that the gene formerly known as the 64-kDa autoantigen D1 is most highly expressed in a variety of human tissues that contain smooth muscle, earning it the name smooth muscle Leiomodin (SM-Lmod; HGMW-approved symbol LMOD1). Transcripts encoding the novel Lmod gene are present exclusively in fetal and adult heart and adult skeletal muscle, and it is here named cardiac Leiomodin (C-Lmod; HGMW-approved symbol LMOD2). Human C-Lmod is located near the hypertrophic cardiomyopathy locus CMH6 on human chromosome 7q3, potentially implicating it in this disease. Our data demonstrate that the Lmods are evolutionarily related and display tissue-specific patterns of expression distinct from, but overlapping with, the expression of Tmod isoforms. Copyright 2001 Academic Press.
Simultaneous gene finding in multiple genomes.
König, Stefanie; Romoth, Lars W; Gerischer, Lizzy; Stanke, Mario
2016-11-15
As the tree of life is populated with sequenced genomes ever more densely, the new challenge is the accurate and consistent annotation of entire clades of genomes. We address this problem with a new approach to comparative gene finding that takes a multiple genome alignment of closely related species and simultaneously predicts the location and structure of protein-coding genes in all input genomes, thereby exploiting negative selection and sequence conservation. The model prefers potential gene structures in the different genomes that are in agreement with each other, or-if not-where the exon gains and losses are plausible given the species tree. We formulate the multi-species gene finding problem as a binary labeling problem on a graph. The resulting optimization problem is NP hard, but can be efficiently approximated using a subgradient-based dual decomposition approach. The proposed method was tested on whole-genome alignments of 12 vertebrate and 12 Drosophila species. The accuracy was evaluated for human, mouse and Drosophila melanogaster and compared to competing methods. Results suggest that our method is well-suited for annotation of (a large number of) genomes of closely related species within a clade, in particular, when RNA-Seq data are available for many of the genomes. The transfer of existing annotations from one genome to another via the genome alignment is more accurate than previous approaches that are based on protein-spliced alignments, when the genomes are at close to medium distances. The method is implemented in C ++ as part of Augustus and available open source at http://bioinf.uni-greifswald.de/augustus/ CONTACT: stefaniekoenig@ymail.com or mario.stanke@uni-greifswald.deSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Premraj, Avinash; Nautiyal, Binita; Aleyas, Abi G; Rasool, Thaha Jamal
2015-10-01
Interleukin-26 (IL-26) is a member of the IL-10 family of cytokines. Though conserved across vertebrates, the IL-26 gene is functionally inactivated in a few mammals like rat, mouse and horse. We report here the identification, isolation and cloning of the cDNA of IL-26 from the dromedary camel. The camel cDNA contains a 516 bp open reading frame encoding a 171 amino acid precursor protein, including a 21 amino acid signal peptide. Sequence analysis revealed high similarity with other mammalian IL-26 homologs and the conservation of IL-10 cytokine family domain structure including key amino acid residues. We also report the identification and cloning of four novel transcript variants produced by alternative splicing at the Exon 3-Exon 4 regions of the gene. Three of the alternative splice variants had premature termination codons and are predicted to code for truncated proteins. The transcript variant 4 (Tv4) having an insertion of an extra 120 bp nucleotides in the ORF was predicted to encode a full length protein product with 40 extra amino acid residues. The mRNA transcripts of all the variants were identified in lymph node, where as fewer variants were observed in other tissues like blood, liver and kidney. The expression of Tv2 and Tv3 were found to be up regulated in mitogen induced camel peripheral blood mononuclear cells. IL-26-Tv2 expression was also induced in camel fibroblast cells infected with Camel pox virus in-vitro. The identification of the transcript variants of IL-26 from the dromedary camel is the first report of alternative splicing for IL-26 in a species in which the gene has not been inactivated. Copyright © 2015 Elsevier Ltd. All rights reserved.
Zouhar, Miloslav; Mazakova, Jana; Rysanek, Pavel
2014-01-01
Abstract Phoma stem canker (blackleg) is a disease of world-wide importance on oilseed rape (Brassica napus) and can cause serious losses for crops globally. The disease is caused by dothideomycetous fungus, Leptosphaeria maculans, which is highly virulent/aggressive. Cyclophilins (CYPs) and FK506-binding proteins (FKBPs) are ubiquitous proteins belonging to the peptidyl-prolyl cis/trans isomerase (PPIase) family. They are collectively referred to as immunophilins (IMMs). In the present study, IMM genes, CYP and FKBP in haploid strain v23.1.3 of L. maculans genome, were identified and classified. Twelve CYPs and five FKBPs were determined in total. Domain architecture analysis revealed the presence of a conserved cyclophilin-like domain (CLD) in the case of CYPs and FKBP_C in the case of FKBPs. Interestingly, IMMs in L. maculans also subgrouped into single domain (SD) and multidomain (MD) proteins. They were primarily found to be localized in cytoplasm, nuclei, and mitochondria. Homologous and orthologous gene pairs were also determined by comparison with the model organism Saccharomyces cerevisiae. Remarkably, IMMs of L. maculans contain shorter introns in comparison to exons. Moreover, CYPs, in contrast with FKBPs, contain few exons. However, two CYPs were determined as being intronless. The expression profile of IMMs in both mycelium and infected primary leaves of B. napus demonstrated their potential role during infection. Secondary structure analysis revealed the presence of atypical eight β strands and two α helices fold architecture. Gene ontology analysis of IMMs predicted their significant role in protein folding and PPIase activity. Taken together, our findings for the first time present new prospects of this highly conserved gene family in phytopathogenic fungus. PMID:25259854
Scholthof, Karen-Beth G.
2015-01-01
In eukaryotes, alternative splicing (AS) promotes transcriptome and proteome diversity. The extent of genome-wide AS changes occurring during a plant-microbe interaction is largely unknown. Here, using high-throughput, paired-end RNA sequencing, we generated an isoform-level spliceome map of Brachypodium distachyon infected with Panicum mosaic virus and its satellite virus. Overall, we detected ∼44,443 transcripts in B. distachyon, ∼30% more than those annotated in the reference genome. Expression of ∼28,900 transcripts was ≥2 fragments per kilobase of transcript per million mapped fragments, and ∼42% of multi-exonic genes were alternatively spliced. Comparative analysis of AS patterns in B. distachyon, rice (Oryza sativa), maize (Zea mays), sorghum (Sorghum bicolor), Arabidopsis thaliana, potato (Solanum tuberosum), Medicago truncatula, and poplar (Populus trichocarpa) revealed conserved ratios of the AS types between monocots and dicots. Virus infection quantitatively altered AS events in Brachypodium with little effect on the AS ratios. We discovered AS events for >100 immune-related genes encoding receptor-like kinases, NB-LRR resistance proteins, transcription factors, RNA silencing, and splicing-associated proteins. Cloning and molecular characterization of SCL33, a serine/arginine-rich splicing factor, identified multiple novel intron-retaining splice variants that are developmentally regulated and modulated during virus infection. B. distachyon SCL33 splicing patterns are also strikingly conserved compared with a distant Arabidopsis SCL33 ortholog. This analysis provides new insights into AS landscapes conserved among monocots and dicots and uncovered AS events in plant defense-related genes. PMID:25634987
Expression of exon-8-skipped kindlin-1 does not compensate for defects of Kindler syndrome.
Natsuga, Ken; Nishie, Wataru; Shinkuma, Satoru; Nakamura, Hideki; Matsushima, Yoichiro; Tatsuta, Aya; Komine, Mayumi; Shimizu, Hiroshi
2011-01-01
Kindler syndrome (KS) is a rare, inherited skin disease characterized by blister formation and generalized poikiloderma. Mutations in KIND1, which encodes kindlin-1, are responsible for KS. c.1089del/1089+1del is a recurrent splice-site deletion mutation in KS patients. To elucidate the effects of c.1089del/1089+1del at the mRNA and protein level. Two KS patients with c.1089del/1089+1del were included in this study. Immunofluorescence analysis of KS skin samples using antibodies against the dermo-epidermal junction proteins was performed. Exon-trapping experiments were performed to isolate the mRNA sequences transcribed from genomic DNA harbouring c.1089del/1089+1del. β1 integrin activation in HeLa cells transfected with truncated KIND1 cDNA was analyzed. Immunofluorescence study showed positive expression of kindlin-1 in KS skin with c.1089del/1089+1del mutation. We identified the exon-8-skipped in-frame transcript as the main product among multiple splicing variants derived from that mutation. HeLa cells transfected with KIND1 cDNA without exon 8 showed impaired β1 integrin activation. Exon-8-coding amino acids are located in the FERM F2 domain, which is conserved among species, and the unstructured region between F2 and the pleckstrin homology domain. This study suggests that exon-8-skipped truncated kindlin-1 is functionally defective and does not compensate for the defects of KS, even though kindlin-1 expression in skin is positive. Copyright © 2010 Japanese Society for Investigative Dermatology. Published by Elsevier Ireland Ltd. All rights reserved.
Hantke, Janina; Chandler, David; King, Rosalind; Wanders, Ronald J A; Angelicheva, Dora; Tournev, Ivailo; McNamara, Elyshia; Kwa, Marcel; Guergueltcheva, Velina; Kaneva, Radka; Baas, Frank; Kalaydjieva, Luba
2009-12-01
Hereditary Motor and Sensory Neuropathy -- Russe (HMSNR) is a severe autosomal recessive disorder, identified in the Gypsy population. Our previous studies mapped the gene to 10q22-q23 and refined the gene region to approximately 70 kb. Here we report the comprehensive sequencing analysis and fine mapping of this region, reducing it to approximately 26 kb of fully characterised sequence spanning the upstream exons of Hexokinase 1 (HK1). We identified two sequence variants in complete linkage disequilibrium, a G>C in a novel alternative untranslated exon (AltT2) and a G>A in the adjacent intron, segregating with the disease in affected families and present in the heterozygote state in only 5/790 population controls. Sequence conservation of the AltT2 exon in 16 species with invariable preservation of the G allele at the mutated site, strongly favour the exonic change as the pathogenic mutation. Analysis of the Hk1 upstream region in mouse mRNA from testis and neural tissues showed an abundance of AltT2-containing transcripts generated by extensive, developmentally regulated alternative splicing. Expression is very low compared with ubiquitous Hk1 and all transcripts skip exon1, which encodes the protein domain responsible for binding to the outer mitochondrial membrane, and regulation of energy production and apoptosis. Hexokinase activity measurement and immunohistochemistry of the peripheral nerve showed no difference between patients and controls. The mutational mechanism and functional effects remain unknown and could involve disrupted translational regulation leading to increased anti-apoptotic activity (suggested by the profuse regenerative activity in affected nerves), or impairment of an unknown HK1 function in the peripheral nervous system (PNS).
Hantke, Janina; Chandler, David; King, Rosalind; Wanders, Ronald JA; Angelicheva, Dora; Tournev, Ivailo; McNamara, Elyshia; Kwa, Marcel; Guergueltcheva, Velina; Kaneva, Radka; Baas, Frank; Kalaydjieva, Luba
2009-01-01
Hereditary Motor and Sensory Neuropathy – Russe (HMSNR) is a severe autosomal recessive disorder, identified in the Gypsy population. Our previous studies mapped the gene to 10q22-q23 and refined the gene region to ∼70 kb. Here we report the comprehensive sequencing analysis and fine mapping of this region, reducing it to ∼26 kb of fully characterised sequence spanning the upstream exons of Hexokinase 1 (HK1). We identified two sequence variants in complete linkage disequilibrium, a G>C in a novel alternative untranslated exon (AltT2) and a G>A in the adjacent intron, segregating with the disease in affected families and present in the heterozygote state in only 5/790 population controls. Sequence conservation of the AltT2 exon in 16 species with invariable preservation of the G allele at the mutated site, strongly favour the exonic change as the pathogenic mutation. Analysis of the Hk1 upstream region in mouse mRNA from testis and neural tissues showed an abundance of AltT2-containing transcripts generated by extensive, developmentally regulated alternative splicing. Expression is very low compared with ubiquitous Hk1 and all transcripts skip exon1, which encodes the protein domain responsible for binding to the outer mitochondrial membrane, and regulation of energy production and apoptosis. Hexokinase activity measurement and immunohistochemistry of the peripheral nerve showed no difference between patients and controls. The mutational mechanism and functional effects remain unknown and could involve disrupted translational regulation leading to increased anti-apoptotic activity (suggested by the profuse regenerative activity in affected nerves), or impairment of an unknown HK1 function in the peripheral nervous system (PNS). PMID:19536174
Alekseyenko, Alexander V.; Kim, Namshin; Lee, Christopher J.
2007-01-01
Association of alternative splicing (AS) with accelerated rates of exon evolution in some organisms has recently aroused widespread interest in its role in evolution of eukaryotic gene structure. Previous studies were limited to analysis of exon creation or lost events in mouse and/or human only. Our multigenome approach provides a way for (1) distinguishing creation and loss events on the large scale; (2) uncovering details of the evolutionary mechanisms involved; (3) estimating the corresponding rates over a wide range of evolutionary times and organisms; and (4) assessing the impact of AS on those evolutionary rates. We use previously unpublished independent analyses of alternative splicing in five species (human, mouse, dog, cow, and zebrafish) from the ASAP database combined with genomewide multiple alignment of 17 genomes to analyze exon creation and loss of both constitutively and alternatively spliced exons in mammals, fish, and birds. Our analysis provides a comprehensive database of exon creation and loss events over 360 million years of vertebrate evolution, including tens of thousands of alternative and constitutive exons. We find that exon inclusion level is inversely related to the rate of exon creation. In addition, we provide a detailed in-depth analysis of mechanisms of exon creation and loss, which suggests that a large fraction of nonrepetitive created exons are results of ab initio creation from purely intronic sequences. Our data indicate an important role for alternative splicing in creation of new exons and provide a useful novel database resource for future genome evolution research. PMID:17369312
Hezroni, Hadas; Koppstein, David; Schwartz, Matthew G; Avrutin, Alexandra; Bartel, David P; Ulitsky, Igor
2015-05-19
The inability to predict long noncoding RNAs from genomic sequence has impeded the use of comparative genomics for studying their biology. Here, we develop methods that use RNA sequencing (RNA-seq) data to annotate the transcriptomes of 16 vertebrates and the echinoid sea urchin, uncovering thousands of previously unannotated genes, most of which produce long intervening noncoding RNAs (lincRNAs). Although in each species, >70% of lincRNAs cannot be traced to homologs in species that diverged >50 million years ago, thousands of human lincRNAs have homologs with similar expression patterns in other species. These homologs share short, 5'-biased patches of sequence conservation nested in exonic architectures that have been extensively rewired, in part by transposable element exonization. Thus, over a thousand human lincRNAs are likely to have conserved functions in mammals, and hundreds beyond mammals, but those functions require only short patches of specific sequences and can tolerate major changes in gene architecture. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
A candidate gene for X-linked Ocular Albinism (OA1)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bassi, M.T.; Schiaffino, V.; Rugarli, E.
1994-09-01
Ocular Albinism of the Nettleship-Fall type 1 (OA1) is the most common form of ocular albinism. It is transmitted as an X-linked recessive trait with affected males showing severe reduction of visual acuity, nystagmus, strabismus, photophobia. Ophthalmologic examination reveals foveal hypoplasia, hypopigmentation of the retina and iris translucency. Microscopic examination of melanocytes suggests that the underlying defect in OA1 is an abnormality in melanosome formation. Recently we assembled a 350 kb cosmid contig spanning the entire critical region on Xp22.3, which measures approximately 110 kb. A minimum set of cosmids was used to identify transcribed sequences using both cDNA selectionmore » and exon amplification. Two putative exons recovered by exon amplification strategy were found to be highly conserved throughout evolution and, therefore, they were used as probes for the screening of fetal and adult retina cDNA libraries. This led to the isolation of clones spanning a full-length cDNA which measures 7.6 kb. Sequence analysis revealed that the predicted protein product shows homology with syntrophines and a Xenopus laevis apical protein. The gene covers approximately 170 kb of DNA and spans the entire critical region for OA1, being deleted in two patients with contiguous gene deletion including OA1 and in one patient with isolated OA1. Therefore, this new gene represents a very strong candidate for involvement in OA1 (an alternative, but unlikely possibility to be considered is that the true OA1 gene lies within an intron of the former). Northern analysis revealed very high level of expression in retina and melanoma. Unlike most Xp22.3 genes, this gene is conserved in the mouse. We are currently performing SSCP analysis and direct sequencing of exons on DNAs from approximately 60 unrelated patients with OA1 for mutation detection.« less
Alternative Splicing of a Novel Inducible Exon Diversifies the CASK Guanylate Kinase Domain
Dembowski, Jill A.; An, Ping; Scoulos-Hanson, Maritsa; Yeo, Gene; Han, Joonhee; Fu, Xiang-Dong; Grabowski, Paula J.
2012-01-01
Alternative pre-mRNA splicing has a major impact on cellular functions and development with the potential to fine-tune cellular localization, posttranslational modification, interaction properties, and expression levels of cognate proteins. The plasticity of regulation sets the stage for cells to adjust the relative levels of spliced mRNA isoforms in response to stress or stimulation. As part of an exon profiling analysis of mouse cortical neurons stimulated with high KCl to induce membrane depolarization, we detected a previously unrecognized exon (E24a) of the CASK gene, which encodes for a conserved peptide insertion in the guanylate kinase interaction domain. Comparative sequence analysis shows that E24a appeared selectively in mammalian CASK genes as part of a >3,000 base pair intron insertion. We demonstrate that a combination of a naturally defective 5′ splice site and negative regulation by several splicing factors, including SC35 (SRSF2) and ASF/SF2 (SRSF1), drives E24a skipping in most cell types. However, this negative regulation is countered with an observed increase in E24a inclusion after neuronal stimulation and NMDA receptor signaling. Taken together, E24a is typically a skipped exon, which awakens during neuronal stimulation with the potential to diversify the protein interaction properties of the CASK polypeptide. PMID:23008758
Kazachenko, Konstantin Y; Miropolskaya, Nataliya A; Gening, Leonid V; Tarantul, Vyacheslav Z; Makarova, Alena V
2017-02-01
Y-family DNA polymerase iota (Pol ι) possesses both DNA polymerase and dRP lyase activities and was suggested to be involved in DNA translesion synthesis and base excision repair in mammals. The 129 strain of mice and its derivatives have a natural nonsense codon mutation in the second exon of the Pol ι gene resulting in truncation of the Pol ι protein. These mice were widely used as a Pol ι-null model for in vivo studies of the Pol ι function. However whether 129-derived strains of mice are fully deficient in the Pol ι functions was a subject of discussion since Pol ι mRNA undergoes alternative splicing at exon 2. Here we report purification of mouse Pol ι lacking the region encoded by exon 2, which includes several conserved residues involved in catalysis. We show that the deletion abrogates both the DNA polymerase and dRP lyase activities of Pol ι in the presence of either Mg 2+ or Mn 2+ ions. Thus, 129-derived strains of mice express catalytically inactive alternatively spliced Pol ι variant, whose cellular functions, if any exist, remain to be established. Copyright © 2017 Elsevier B.V. All rights reserved.
Paquet, Nicolas; Bernadet, Marie; Morin, Halima; Traas, Jan; Dron, Michel; Charon, Celine
2005-06-01
Poaceae species present a conserved distichous phyllotaxy (leaf position along the stem) and share common properties with respect to leaf initiation. The goal of this work was to determine if these common traits imply common genes. Therefore, homologues of the maize TERMINAL EAR1 gene in Poaceae were studied. This gene encodes an RNA-binding motif (RRM) protein, that is suggested to regulate leaf initiation. Using degenerate primers, one unique tel (terminal ear1-like) gene from seven Poaceae members, covering almost all the phylogenetic tree of the family, was identified by PCR. These genes present a very high degree of similarity, a much conserved exon-intron structure, and the three RRMs and TEL characteristic motifs. The evolution of tel sequences in Poaceae strongly correlates with the known phylogenetic tree of this family. RT-PCR gene expression analyses show conserved tel expression in the shoot apex in all species, suggesting functional orthology between these genes. In addition, in situ hybridization experiments with specific antisense probes show tel transcript accumulation in all differentiating cells of the leaf, from the recruitment of leaf founder cells to leaf margins cells. Tel expression is not restricted to initiating leaves as it is also found in pro-vascular tissues, root meristems, and immature inflorescences. Therefore, these results suggest that TEL is not only associated with leaf initiation but more generally with cell differentiation in Poaceae.
Functional and evolution characterization of SWEET sugar transporters in Ananas comosus.
Guo, Chengying; Li, Huayang; Xia, Xinyao; Liu, Xiuyuan; Yang, Long
2018-02-05
Sugars will eventually be exported transporters (SWEETs) are a group of recently identified sugar transporters in plants that play important roles in diverse physiological processes. However, currently, limited information about this gene family is available in pineapple (Ananas comosus). The availability of the recently released pineapple genome sequence provides the opportunity to identify SWEET genes in a Bromeliaceae family member at the genome level. In this study, 39 pineapple SWEET genes were identified in two pineapple cultivars (18 AnfSWEET and 21 AnmSWEET) and further phylogenetically classified into five clades. A phylogenetic analysis revealed distinct evolutionary paths for the SWEET genes of the two pineapple cultivars. The MD2 cultivar might have experienced a different expansion than the F153 cultivar because two additional duplications exist, which separately gave rise to clades III and IV. A gene exon/intron structure analysis showed that the pineapple SWEET genes contained highly conserved exon/intron numbers. An analysis of public RNA-seq data and expression profiling showed that SWEET genes may be involved in fruit development and ripening processes. AnmSWEET5 and AnmSWEET11 were highly expressed in the early stages of pineapple fruit development and then decreased. The study increases the understanding of the roles of SWEET genes in pineapple. Copyright © 2018 Elsevier Inc. All rights reserved.
Qian, Ming; Zhang, Yike; Yan, Xiangyan; Han, Mingyu; Li, Jinjin; Li, Fang; Li, Furui; Zhang, Dong; Zhao, Caiping
2016-11-18
Polygalacturonase (PG) is an important hydrolytic enzyme involved in pectin degradation during fruit softening. However, the roles of PG family members in fruit softening remain unclear. We identified 45 PpPG genes in the peach genome which are clustered into six subclasses. PpPGs consist of four to nine exons and three to eight introns, and the exon/intron structure is basically conserved in all but subclass E. Only 16 PpPG genes were expressed in ripening fruit, and their expression profiles were analyzed during storage in two peach cultivars with different softening characteristics. Eight PGs ( PpPG1 , - 10 , - 12 , - 13 , - 15 , - 23 , - 21 , and - 22 ) in fast-softening "Qian Jian Bai" (QJB) fruit and three PGs ( PpPG15 , - 21 , and - 22 ) in slow-softening "Qin Wang" (QW) fruit exhibited softening-associated patterns; which also were affected by ethylene treatment. Our results suggest that the different softening characters in QW and QJB fruit is related to the amount of PG members. While keeping relatively lower levels during QW fruit softening, the expression of six PGs ( PpPG1 , - 10 , - 12 , - 11 , - 14 , and - 35 ) rapidly induced by ethylene. PpPG24 , - 25 and - 38 may not be involved in softening of peach fruit.
Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai
Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung
2016-01-01
An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His69, Asp117, and Ser216. The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5′ donor splice (GT) and 3′ acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai. PMID:27399771
Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai.
Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung
2016-07-05
An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His(69), Asp(117), and Ser(216). The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5' donor splice (GT) and 3' acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai.
Hall, Jennifer R; Clow, Kathy A; Rise, Matthew L; Driedzic, William R
2015-09-01
Aquaglyceroporins (GLPs) are integral membrane proteins that facilitate passive movement of water, glycerol and urea across cellular membranes. In this study, GLP-encoding genes were characterized in rainbow smelt (Osmerus mordax mordax), an anadromous teleost that accumulates high glycerol and modest urea levels in plasma and tissues as an adaptive cryoprotectant mechanism in sub-zero temperatures. We report the gene and promoter sequences for two aqp10b paralogs (aqp10ba, aqp10bb) that are 82% identical at the predicted amino acid level, and aqp9b. Aqp10bb and aqp9b have the 6 exon structure common to vertebrate GLPs. Aqp10ba has 8 exons; there are two additional exons at the 5' end, and the promoter sequence is different from aqp10bb. Molecular phylogenetic analysis suggests that the aqp10b paralogs arose from a gene duplication event specific to the smelt lineage. Smelt GLP transcripts are ubiquitously expressed; however, aqp10ba transcripts were highest in kidney, aqp10bb transcripts were highest in kidney, intestine, pyloric caeca and brain, and aqp9b transcripts were highest in spleen, liver, red blood cells and kidney. In cold-temperature challenge experiments, plasma glycerol and urea levels were significantly higher in cold- compared to warm-acclimated smelt; however, GLP transcript levels were generally either significantly lower or remained constant. The exception was significantly higher aqp10ba transcript levels in kidney. High aqp10ba transcripts in smelt kidney that increase significantly in response to cold temperature in congruence with plasma urea suggest that this gene duplicate may have evolved to allow the re-absorption of urea to concomitantly conserve nitrogen and prevent freezing. Copyright © 2015 Elsevier Inc. All rights reserved.
Developmental expression of high molecular weight tropomyosin isoforms in Mesocestoides corti.
Koziol, Uriel; Costábile, Alicia; Domínguez, María Fernanda; Iriarte, Andrés; Alvite, Gabriela; Kun, Alejandra; Castillo, Estela
2011-02-01
Tropomyosins are a family of actin-binding proteins with diverse roles in actin filament function. One of the best characterized roles is the regulation of muscle contraction. Tropomyosin isoforms can be generated from different genes, and from alternative promoters and alternative splicing from the same gene. In this work, we have isolated sequences for tropomyosin isoforms from the cestode Mesocestoides corti, and searched for tropomyosin genes and isoforms in other flatworms. Two genes are conserved in the cestodes M. corti and Echinococcus multilocularis, and in the trematode Schistosoma mansoni. Both genes have the same structure, and each gene gives rise to at least two different isoforms, a high molecular weight (HMW) and a low molecular weight (LMW) one. Because most exons are duplicated and spliced in a mutually exclusive fashion, isoforms from one gene only share one exon and are highly divergent. The gene duplication preceded the divergence of neodermatans and the planarian Schmidtea mediterranea. Further duplications occurred in Schmidtea, coupled to the selective loss of duplicated exons, resulting in genes that only code for HMW or LMW isoforms. A polyclonal antibody raised against a HMW tropomyosin from Echinococcus granulosus was demonstrated to specifically recognize HMW tropomyosin isoforms of M. corti, and used to study their expression during segmentation. HMW tropomyosins are expressed in muscle layers, with very low or absent levels in other tissues. No expression of HMW tropomyosins is present in early or late genital primordia, and expression only begins once muscle fibers develop in the genital ducts. Therefore, HMW tropomyosins are markers for the development of muscles during the final differentiation of genital primordia. Copyright © 2010 Elsevier B.V. All rights reserved.
Park, Jeenah; Sharma, Neeraj
2014-01-01
Melanocortin-3 receptor (MC3R) is a canonical MSH receptor that plays an essential role in energy homeostasis. Variants in MC3R have been implicated in obesity in humans and mice. However, interpretation of the functional consequences of these variants is challenging because the translational start site of MC3R is unclear. Using 5′ rapid amplification of cDNA ends, we discovered a novel upstream exon that extends the length of the 5′ untranslated region (UTR) in MC3R without changing the open-reading frame. The full-length 5′ UTR directs utilization of an evolutionarily conserved second in-frame ATG as the primary translation start site. MC3R synthesized from the second ATG is localized to apical membranes of polarized Madin-Darby canine kidney cells, consistent with its function as a cell surface mediator of melanocortin signaling. Expression of MC3R causes relocalization of melanocortin receptor accessory protein 2, an accessory factor for melanocortin-2 receptor, to the apical membrane, coincident with the location of MC3R. In contrast, protein synthesized from MC3R cDNAs lacking the 5′ UTR displayed diffuse cytosolic distribution and has no effect on the distribution of melanocortin receptor accessory protein 2. Our findings demonstrate that a previously unannotated 5′ exon directs translation of MC3R protein that localizes to apical membranes of polarized cells. Together, our work provides insight on the structure of human MC3R and reveals a new pathway for regulation of energy metabolism. PMID:25051171
Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro
2008-01-03
The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, Guo-Shun; Grabowski, G.A.
1992-10-01
Gaucher disease is the most frequent lysosomal storage disease and the most prevalent Jewish genetic disease. About 30 identified missense mutations are causal to the defective activity of acid [beta]-glucosidase in this disease. cDNAs were characterized from a moderately affected 9-year-old Ashkenazi Jewish Gaucher disease type 1 patient whose 80-years-old, enzyme-deficient, 1226G (Asn[sup 370][yields]Ser [N370S]) homozygous grandfather was nearly asymptomatic. Sequence analyses revealed four populations of cDNAs with either the 1226G mutation, an exact exon 2 ([Delta] EX2) deletion, a deletion of exon 2 and the first 115 bp of exon 3 ([Delta] EX2-3), or a completely normal sequence. Aboutmore » 50% of the cDNAs were the [Delta] EX2, the [Delta] EX2-3, and the normal cDNAs, in a ratio of 6:3:1. Specific amplification and characterization of exon 2 and 5[prime] and 3[prime] intronic flanking sequences from the structural gene demonstrated clones with either the normal sequence or with a G[sup +1][yields]A[sup +1] transition at the exon 2/intron 2 boundary. This mutation destroyed the splice donor consensus site (U1 binding site) for mRNA processing. This transition also was present at the corresponding exon/intron boundary of the highly homologous pseudogene. This new mutation, termed [open quotes]IVS2 G[sup +1],[close quotes] is the first in the Ashkenazi Jewish population. The occurrence of this [open quotes]pseudogene[close quotes]-type mutation in the structural gene indicates the role of acid [beta]-glucosidase pseudogene and structural gene rearrangements in the pathogenesis of this disease. 33 refs., 8 figs., 1 tab.« less
Multi-subunit RNA polymerases (RNAP) are ornate molecular machines that translocate on a DNA template as they generate a complementary RNA chain. RNAPs are highly conserved in evolution among eukarya, eubacteria, archaea, and some viruses. As such, multi-subunit RNAPs appear to be an irreplaceable advance in the evolution of complex life on earth. Because of their stepwise
Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.
2013-01-01
Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, L.H., E-mail: Luhui.Han@tum.de; Hu, X.Y., E-mail: Xiangyu.Hu@tum.de; Adams, N.A., E-mail: Nikolaus.Adams@tum.de
In this paper we present a scale separation approach for multi-scale modeling of free-surface and two-phase flows with complex interface evolution. By performing a stimulus-response operation on the level-set function representing the interface, separation of resolvable and non-resolvable interface scales is achieved efficiently. Uniform positive and negative shifts of the level-set function are used to determine non-resolvable interface structures. Non-resolved interface structures are separated from the resolved ones and can be treated by a mixing model or a Lagrangian-particle model in order to preserve mass. Resolved interface structures are treated by the conservative sharp-interface model. Since the proposed scale separationmore » approach does not rely on topological information, unlike in previous work, it can be implemented in a straightforward fashion into a given level set based interface model. A number of two- and three-dimensional numerical tests demonstrate that the proposed method is able to cope with complex interface variations accurately and significantly increases robustness against underresolved interface structures.« less
Gu, Wanjun; Gurguis, Christopher I.; Zhou, Jin J.; Zhu, Yihua; Ko, Eun-A.; Ko, Jae-Hong; Wang, Ting; Zhou, Tong
2015-01-01
Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases. PMID:26454016
Rebelling for a Reason: Protein Structural “Outliers”
Arumugam, Gandhimathi; Nair, Anu G.; Hariharaputran, Sridhar; Ramanathan, Sowdhamini
2013-01-01
Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural differences which are functionally relevant. This paper analyses the impact of structural divergence on function for multi-member superfamilies, selected from the PASS2 superfamily alignment database. Functional annotations within superfamilies, with structural outliers or ‘rebels’, are discussed in the context of structural variations. Overall, these data reinforce the idea that functional similarities cannot be extrapolated from mere structural conservation. The implication for fold-function prediction is that the functional annotations can only be inherited with very careful consideration, especially at low sequence identities. PMID:24073209
Conservatism implications of shock test tailoring for multiple design environments
NASA Technical Reports Server (NTRS)
Baca, Thomas J.; Bell, R. Glenn; Robbins, Susan A.
1987-01-01
A method for analyzing shock conservation in test specifications that have been tailored to qualify a structure for multiple design environments is discussed. Shock test conservation is qualified for shock response spectra, shock intensity spectra and ranked peak acceleration data in terms of an Index of Conservation (IOC) and an Overtest Factor (OTF). The multi-environment conservation analysis addresses the issue of both absolute and average conservation. The method is demonstrated in a case where four laboratory tests have been specified to qualify a component which must survive seven different field environments. Final judgment of the tailored test specification is shown to require an understanding of the predominant failure modes of the test item.
Castrignanò, Tiziana; Canali, Alessandro; Grillo, Giorgio; Liuni, Sabino; Mignone, Flavio; Pesole, Graziano
2004-01-01
The identification and characterization of genome tracts that are highly conserved across species during evolution may contribute significantly to the functional annotation of whole-genome sequences. Indeed, such sequences are likely to correspond to known or unknown coding exons or regulatory motifs. Here, we present a web server implementing a previously developed algorithm that, by comparing user-submitted genome sequences, is able to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a coding potential score. The web tool, available at http://www.caspur.it/CSTminer/, is dynamically interconnected with the Ensembl genome resources and produces a graphical output showing a map of detected conserved sequences and annotated gene features. PMID:15215464
Investigating the Structural Impact of the Glutamine Repeat in Huntingtin Assembly
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perevozchikova, Tatiana; Stanley, Christopher B; McWilliams-Koeppen, Helen P
2014-01-01
Acquiring detailed structural information about the various aggregation states of the huntingtin-exon1 protein (Htt-exon1) is crucial not only for identifying the true nature of the neurotoxic species responsible for Huntington s disease (HD) but also for designing effective therapeutics. Using time-resolved small-angle neutron scattering (TR-SANS), we followed the conformational changes that occurred during fibrillization of the pathologic form of Htt-exon1 (NtQ42P10) and compared the results with those obtained for the wild-type (NtQ22P10). Our results show that the aggregation pathway of NtQ22P10 is very different from that of NtQ42P10, as the initial steps require a monomer to 7-mer transition stage. Inmore » contrast, the earliest species identified for NtQ42P10 are monomer and dimer. The divergent pathways ultimately result in NtQ22P10 fibrils that possess a pack- ing arrangement consistent with the common amyloid sterical zipper model, whereas NtQ42P10 fibrils present a better fit to the Perutz b-helix structural model. The structural details obtained by TR-SANS should help to delineate the key mechanisms that underpin Htt-exon1 aggregation leading to HD.« less
Flexible CRISPR library construction using parallel oligonucleotide retrieval
Read, Abigail; Gao, Shaojian; Batchelor, Eric
2017-01-01
Abstract CRISPR/Cas9-based gene knockout libraries have emerged as a powerful tool for functional screens. We present here a set of pre-designed human and mouse sgRNA sequences that are optimized for both high on-target potency and low off-target effect. To maximize the chance of target gene inactivation, sgRNAs were curated to target both 5΄ constitutive exons and exons that encode conserved protein domains. We describe here a robust and cost-effective method to construct multiple small sized CRISPR library from a single oligo pool generated by array synthesis using parallel oligonucleotide retrieval. Together, these resources provide a convenient means for individual labs to generate customized CRISPR libraries of variable size and coverage depth for functional genomics application. PMID:28334828
Chen, Mingchen; Wolynes, Peter G.
2017-01-01
Huntington’s disease (HD) is a neurodegenerative disease caused by an abnormal expansion in the polyglutamine (polyQ) track of the Huntingtin (HTT) protein. The severity of the disease depends on the polyQ repeat length, arising only in patients with proteins having 36 repeats or more. Previous studies have shown that the aggregation of N-terminal fragments (encoded by HTT exon 1) underlies the disease pathology in mouse models and that the HTT exon 1 gene product can self-assemble into amyloid structures. Here, we provide detailed structural mechanisms for aggregation of several protein fragments encoded by HTT exon 1 by using the associative memory, water-mediated, structure and energy model (AWSEM) to construct their free energy landscapes. We find that the addition of the N-terminal 17-residue sequence (NT17) facilitates polyQ aggregation by encouraging the formation of prefibrillar oligomers, whereas adding the C-terminal polyproline sequence (P10) inhibits aggregation. The combination of both terminal additions in HTT exon 1 fragment leads to a complex aggregation mechanism with a basic core that resembles that found for the aggregation of pure polyQ repeats using AWSEM. At the extrapolated physiological concentration, although the grand canonical free energy profiles are uphill for HTT exon 1 fragments having 20 or 30 glutamines, the aggregation landscape for fragments with 40 repeats has become downhill. This computational prediction agrees with the critical length found for the onset of HD and suggests potential therapies based on blocking early binding events involving the terminal additions to the polyQ repeats. PMID:28400517
NASA Astrophysics Data System (ADS)
Yang, Xiao; Gao, Jinning; Ma, Liman; Li, Zan; Wang, Wenji; Wang, Zhongkai; Yu, Haiyang; Qi, Jie; Wang, Xubo; Wang, Zhigang; Zhang, Quanqi
2015-02-01
Cold-inducible RNA-binding protein (CIRP) is a kind of RNA binding proteins that plays important roles in many physiological processes. The CIRP has been widely studied in mammals and amphibians since it was first cloned from mammals. On the contrary, there are little reports in teleosts. In this study, the Po CIRP gene of the Japanese flounder was cloned and sequenced. The genomic sequence consists of seven exons and six introns. The putative PoCIRP protein of flounder was 198 amino acid residues long containing the RNA recognition motif (RRM). Phylogenetic analysis showed that the flounder PoCIRP is highly conserved with other teleost CIRPs. The 5' flanking sequence was cloned by genome walking and many transcription factor binding sites were identified. There is a CpGs region located in promoter and exon I region and the methylation state is low. Quantitative real-time PCR analysis uncovered that Po CIRP gene was widely expressed in adult tissues with the highest expression level in the ovary. The mRNA of the Po CIRP was maternally deposited and the expression level of the gene was regulated up during the gastrula and neurula stages. In order to gain the information how the protein interacts with mRNA, we performed the modeling of the 3D structure of the flounder PoCIRP. The results showed a cleft existing the surface of the molecular. Taken together, the results indicate that the CIRP is a multifunctional molecular in teleosts and the findings about the structure provide valuable information for understanding the basis of this protein's function.
Makeyev, Aleksandr V.; Erdenechimeg, Lkhamsuren; Mungunsukh, Ognoon; Roth, Jutta J.; Enkhmandakh, Badam; Ruddle, Frank H.; Bayarsaihan, Dashzeveg
2004-01-01
Williams–Beuren syndrome (also known as Williams syndrome) is caused by a deletion of a 1.55- to 1.84-megabase region from chromosome band 7q11.23. GTF2IRD1 and GTF2I, located within this critical region, encode proteins of the TFII-I family with multiple helix–loop–helix domains known as I repeats. In the present work, we characterize a third member, GTF2IRD2, which has sequence and structural similarity to the GTF2I and GTF2IRD1 paralogs. The ORF encodes a protein with several features characteristic of regulatory factors, including two I repeats, two leucine zippers, and a single Cys-2/His-2 zinc finger. The genomic organization of human, baboon, rat, and mouse genes is well conserved. Our exon-by-exon comparison has revealed that GTF2IRD2 is more closely related to GTF2I than to GTF2IRD1 and apparently is derived from the GTF2I sequence. The comparison of GTF2I and GTF2IRD2 genes revealed two distinct regions of homology, indicating that the helix–loop–helix domain structure of the GTF2IRD2 gene has been generated by two independent genomic duplications. We speculate that GTF2I is derived from GTF2IRD1 as a result of local duplication and the further evolution of its structure was associated with its functional specialization. Comparison of genomic sequences surrounding GTF2IRD2 genes in mice and humans allows refinement of the centromeric breakpoint position of the primate-specific inversion within the Williams–Beuren syndrome critical region. PMID:15243160
Makeyev, Aleksandr V; Erdenechimeg, Lkhamsuren; Mungunsukh, Ognoon; Roth, Jutta J; Enkhmandakh, Badam; Ruddle, Frank H; Bayarsaihan, Dashzeveg
2004-07-27
Williams-Beuren syndrome (also known as Williams syndrome) is caused by a deletion of a 1.55- to 1.84-megabase region from chromosome band 7q11.23. GTF2IRD1 and GTF2I, located within this critical region, encode proteins of the TFII-I family with multiple helix-loop-helix domains known as I repeats. In the present work, we characterize a third member, GTF2IRD2, which has sequence and structural similarity to the GTF2I and GTF2IRD1 paralogs. The ORF encodes a protein with several features characteristic of regulatory factors, including two I repeats, two leucine zippers, and a single Cys-2/His-2 zinc finger. The genomic organization of human, baboon, rat, and mouse genes is well conserved. Our exon-by-exon comparison has revealed that GTF2IRD2 is more closely related to GTF2I than to GTF2IRD1 and apparently is derived from the GTF2I sequence. The comparison of GTF2I and GTF2IRD2 genes revealed two distinct regions of homology, indicating that the helix-loop-helix domain structure of the GTF2IRD2 gene has been generated by two independent genomic duplications. We speculate that GTF2I is derived from GTF2IRD1 as a result of local duplication and the further evolution of its structure was associated with its functional specialization. Comparison of genomic sequences surrounding GTF2IRD2 genes in mice and humans allows refinement of the centromeric breakpoint position of the primate-specific inversion within the Williams-Beuren syndrome critical region.
Pydiura, Nikolay; Pirko, Yaroslav; Galinousky, Dmitry; Postovoitova, Anastasiia; Yemets, Alla; Kilchevsky, Aleksandr; Blume, Yaroslav
2018-06-08
Flax (Linum usitatissimum L.) is a valuable food and fiber crop cultivated for its quality fiber and seed oil. α-, β-, γ-tubulins and actins are the main structural proteins of the cytoskeleton. α- and γ-tubulin and actin genes have not been characterized yet in the flax genome. In this study, we have identified 6 α-tubulin genes, 13 β-tubulin genes, 2 γ-tubulin genes, and 15 actin genes in the flax genome and analysed the phylogenetic relationships between flax and A. thaliana tubulin and actin genes. Six α-tubulin genes are represented by 3 paralogous pairs, among 13 β-tubulin genes 7 different isotypes can be distinguished, 6 of which are encoded by two paralogous genes each. γ-tubulin is represented by a paralogous pair of genes one of which may be not functional. Fifteen actin genes represent 7 paralogous pairs - 7 actin isotypes and a sequentially duplicated copy of one of the genes of one of the isotypes. Exon-intron structure analysis has shown intron length polymorphism within the β-tubulin genes and intron number variation among the α-tubulin gene: 3 or 4 introns are found in two or four genes, respectively. Intron positioning occurs at conservative sites, as observed in numerous other plant species. Flax actin genes show both intron length polymorphisms and variation in the number of intron that may be 2 or 3. These data will be useful to support further studies on the specificity, functioning, regulation and evolution of the flax cytoskeleton proteins. This article is protected by copyright. All rights reserved.
Ikin, Karen; Barton, Philip S.; Stirnemann, Ingrid A.; Stein, John R.; Michael, Damian; Crane, Mason; Okada, Sachiko; Lindenmayer, David B.
2014-01-01
Improving biodiversity conservation in fragmented agricultural landscapes has become an important global issue. Vegetation at the patch and landscape-scale is important for species occupancy and diversity, yet few previous studies have explored multi-scale associations between vegetation and community assemblages. Here, we investigated how patch and landscape-scale vegetation cover structure woodland bird communities. We asked: (1) How is the bird community associated with the vegetation structure of woodland patches and the amount of vegetation cover in the surrounding landscape? (2) Do species of conservation concern respond to woodland vegetation structure and surrounding vegetation cover differently to other species in the community? And (3) Can the relationships between the bird community and the woodland vegetation structure and surrounding vegetation cover be explained by the ecological traits of the species comprising the bird community? We studied 103 woodland patches (0.5 - 53.8 ha) over two time periods across a large (6,800 km2) agricultural region in southeastern Australia. We found that both patch vegetation and surrounding woody vegetation cover were important for structuring the bird community, and that these relationships were consistent over time. In particular, the occurrence of mistletoe within the patches and high values of woody vegetation cover within 1,000 ha and 10,000 ha were important, especially for bird species of conservation concern. We found that the majority of these species displayed similar, positive responses to patch and landscape vegetation attributes. We also found that these relationships were related to the foraging and nesting traits of the bird community. Our findings suggest that management strategies to increase both remnant vegetation quality and the cover of surrounding woody vegetation in fragmented agricultural landscapes may lead to improved conservation of bird communities. PMID:24830684
Ikin, Karen; Barton, Philip S; Stirnemann, Ingrid A; Stein, John R; Michael, Damian; Crane, Mason; Okada, Sachiko; Lindenmayer, David B
2014-01-01
Improving biodiversity conservation in fragmented agricultural landscapes has become an important global issue. Vegetation at the patch and landscape-scale is important for species occupancy and diversity, yet few previous studies have explored multi-scale associations between vegetation and community assemblages. Here, we investigated how patch and landscape-scale vegetation cover structure woodland bird communities. We asked: (1) How is the bird community associated with the vegetation structure of woodland patches and the amount of vegetation cover in the surrounding landscape? (2) Do species of conservation concern respond to woodland vegetation structure and surrounding vegetation cover differently to other species in the community? And (3) Can the relationships between the bird community and the woodland vegetation structure and surrounding vegetation cover be explained by the ecological traits of the species comprising the bird community? We studied 103 woodland patches (0.5 - 53.8 ha) over two time periods across a large (6,800 km(2)) agricultural region in southeastern Australia. We found that both patch vegetation and surrounding woody vegetation cover were important for structuring the bird community, and that these relationships were consistent over time. In particular, the occurrence of mistletoe within the patches and high values of woody vegetation cover within 1,000 ha and 10,000 ha were important, especially for bird species of conservation concern. We found that the majority of these species displayed similar, positive responses to patch and landscape vegetation attributes. We also found that these relationships were related to the foraging and nesting traits of the bird community. Our findings suggest that management strategies to increase both remnant vegetation quality and the cover of surrounding woody vegetation in fragmented agricultural landscapes may lead to improved conservation of bird communities.
Circular RNA biogenesis can proceed through an exon-containing lariat precursor
Barrett, Steven P; Wang, Peter L; Salzman, Julia
2015-01-01
Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical ‘backsplicing’ event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure. DOI: http://dx.doi.org/10.7554/eLife.07540.001 PMID:26057830
Widespread alternative and aberrant splicing revealed by lariat sequencing
Stepankiw, Nicholas; Raghavan, Madhura; Fogarty, Elizabeth A.; Grimson, Andrew; Pleiss, Jeffrey A.
2015-01-01
Alternative splicing is an important and ancient feature of eukaryotic gene structure, the existence of which has likely facilitated eukaryotic proteome expansions. Here, we have used intron lariat sequencing to generate a comprehensive profile of splicing events in Schizosaccharomyces pombe, amongst the simplest organisms that possess mammalian-like splice site degeneracy. We reveal an unprecedented level of alternative splicing, including alternative splice site selection for over half of all annotated introns, hundreds of novel exon-skipping events, and thousands of novel introns. Moreover, the frequency of these events is far higher than previous estimates, with alternative splice sites on average activated at ∼3% the rate of canonical sites. Although a subset of alternative sites are conserved in related species, implying functional potential, the majority are not detectably conserved. Interestingly, the rate of aberrant splicing is inversely related to expression level, with lowly expressed genes more prone to erroneous splicing. Although we validate many events with RNAseq, the proportion of alternative splicing discovered with lariat sequencing is far greater, a difference we attribute to preferential decay of aberrantly spliced transcripts. Together, these data suggest the spliceosome possesses far lower fidelity than previously appreciated, highlighting the potential contributions of alternative splicing in generating novel gene structures. PMID:26261211
van der Meer-van Kraaij, Cindy; Siezen, Roland; Kramer, Evelien; Reinders, Marjolein; Blokzijl, Hans; van der Meer, Roelof
2007-01-01
Mucosal pentraxin (Mptx), identified in rats, is a short pentraxin of unknown function. Other subfamily members are Serum amyloid P component (SAP), C-reactive protein (CRP) and Jeltraxin. Rat Mptx mRNA is predominantly expressed in colon and in vivo is strongly (30-fold) regulated by dietary heme and calcium, modulators of colon cancer risk. This renders Mptx a potential nutrient sensitive biomarker of gut health. To support a role as biomarker, we examined whether the pentraxin protein structure is conserved, whether Mptx protein is nutrient-sensitively expressed and whether Mptx is expressed in mouse and human. Sequence comparison and 3D modelling showed that rat Mptx is highly homologous to the other pentraxins. The calcium-binding site and subunit interaction sites are highly conserved, while a loop deletion and charged residues contribute to a distinctive “top” face of the pentamer. In accordance with mRNA expression, Mptx protein is strongly down-regulated in rat colon mucosa in response to high dietary heme intake. Mptx mRNA is expressed in rat and mouse colon, but not in human colon. A stop codon at the beginning of human exon two indicates loss of function, which may be related to differences in intestinal cell turnover between man and rodents. PMID:18850182
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pham-Dinh, D.; Gaspera, D.B.; Dautigny, A.
1995-09-20
Myelin/oligodendrocyte glycoprotein (MOG), a special component of the central nervous system localization on the outermost lamellae of mature myelin, is a member of the immunoglobulin superfamily. We report here the organization of the human MOG gene, which spans approximately 17 kb, and the characterization of six MOG mRNA splicing variants. The intron/exon structure of the human MOG gene confirmed the splicing pattern, supporting the hypothesis that mRNA isoforms could arise by alternative splicing of a single gene. In addition to the eight exons coding for the major MOG isoform, the human MOG gene also contains 3` region, a previously unknownmore » alternatively spliced coding exon, VIA. Alternative utilization of two acceptor splicing sites for exon VIII could produce two different C-termini. The nucleotide sequences presented here may be a useful tool to study further possible involvement if the MOG gene in hereditary neurological disorders. 23 refs., 5 figs.« less
Three reasons protein disorder analysis makes more sense in the light of collagen
Oates, Matt E.; Tompa, Peter; Gough, Julian
2016-01-01
Abstract We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen‐encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder‐encoding exons, still hold after considering collagen‐containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix. PMID:26941008
Lorentsen, R H; Graversen, J H; Caterer, N R; Thogersen, H C; Etzerodt, M
2000-01-01
Tetranectin is a homotrimeric plasma and extracellular-matrix protein that binds plasminogen and complex sulphated polysaccharides including heparin. In terms of primary and tertiary structure, tetranectin is related to the collectin family of Ca(2+)-binding C-type lectins. Tetranectin is encoded in three exons. Exon 3 encodes the carbohydrate recognition domain, which binds to kringle 4 in plasminogen at low levels of Ca(2+). Exon 2 encodes an alpha-helix, which is necessary and sufficient to govern the trimerization of tetranectin by assembling into a triple-helical coiled-coil structural element. Here we show that the heparin-binding site in tetranectin resides not in the carbohydrate recognition domain but within the N-terminal region, comprising the 16 amino acid residues encoded by exon 1. In particular, the lysine residues in the decapeptide segment KPKKIVNAKK (tetranectin residues 6-15) are shown to be of primary importance in heparin binding. PMID:10727405
Lorentsen, R H; Graversen, J H; Caterer, N R; Thogersen, H C; Etzerodt, M
2000-04-01
Tetranectin is a homotrimeric plasma and extracellular-matrix protein that binds plasminogen and complex sulphated polysaccharides including heparin. In terms of primary and tertiary structure, tetranectin is related to the collectin family of Ca(2+)-binding C-type lectins. Tetranectin is encoded in three exons. Exon 3 encodes the carbohydrate recognition domain, which binds to kringle 4 in plasminogen at low levels of Ca(2+). Exon 2 encodes an alpha-helix, which is necessary and sufficient to govern the trimerization of tetranectin by assembling into a triple-helical coiled-coil structural element. Here we show that the heparin-binding site in tetranectin resides not in the carbohydrate recognition domain but within the N-terminal region, comprising the 16 amino acid residues encoded by exon 1. In particular, the lysine residues in the decapeptide segment KPKKIVNAKK (tetranectin residues 6-15) are shown to be of primary importance in heparin binding.
Arabidopsis intragenomic conserved noncoding sequence
Thomas, Brian C.; Rapaka, Lakshmi; Lyons, Eric; Pedersen, Brent; Freeling, Michael
2007-01-01
After the most recent tetraploidy in the Arabidopsis lineage, most gene pairs lost one, but not both, of their duplicates. We manually inspected the 3,179 retained gene pairs and their surrounding gene space still present in the genome using a custom-made viewer application. The display of these pairs allowed us to define intragenic conserved noncoding sequences (CNSs), identify exon annotation errors, and discover potentially new genes. Using a strict algorithm to sort high-scoring pair sequences from the bl2seq data, we created a database of 14,944 intragenomic Arabidopsis CNSs. The mean CNS length is 31 bp, ranging from 15 to 285 bp. There are ≈1.7 CNSs associated with a typical gene, and Arabidopsis CNSs are found in all areas around exons, most frequently in the 5′ upstream region. Gene ontology classifications related to transcription, regulation, or “response to …” external or endogenous stimuli, especially hormones, tend to be significantly overrepresented among genes containing a large number of CNSs, whereas protein localization, transport, and metabolism are common among genes with no CNSs. There is a 1.5% overlap between these CNSs and the 218,982 putative RNAs in the Arabidopsis Small RNA Project database, allowing for two mismatches. These CNSs provide a unique set of noncoding sequences enriched for function. CNS function is implied by evolutionary conservation and independently supported because CNS-richness predicts regulatory gene ontology categories. PMID:17301222
A MAYAN FOUNDER MUTATION IS A COMMON CAUSE OF DEAFNESS IN GUATEMALA
Carranza, Claudia; Menendez, Ibis; Herrera, Mariana; Castellanos, Patricia; Amado, Carlos; Maldonado, Fabiola; Rosales, Luisa; Escobar, Nancy; Guerra, Mariela; Alvarez, Darwin; Foster, Joseph; Guo, Shengru; Blanton, Susan H.; Bademci, Guney; Tekin, Mustafa
2017-01-01
SUMMARY Over 5% of the world population have varying degrees of hearing loss. Mutations in GJB2 are the most common cause of autosomal recessive non-syndromic hearing loss (NSHL) in many populations. The frequency and type of mutations are influenced by ethnicity. Guatemala is a multi-ethnic country with four major populations: Maya, Ladino, Xinca, and Garifuna. To determine the mutation profile of GJB2 in a NSHL population from Guatemala, we sequenced both exons of GJB2 in 133 unrelated families. A total of six pathogenic variants were detected. The most frequent pathogenic variant is c.131G>A (p.Trp44*) detected in 21 of 266 alleles. We show that c.131G>A is associated with a conserved haplotype in Guatemala suggesting a single founder. The majority of Mayan population lives in the west region of the country from where all c.131G>A carriers originated. Further analysis of genome-wide variation of individuals carrying the c.131G>A mutation compared to those of Native American, European, and African populations shows a close match with the Mayan population. PMID:26346709
Runaway evolution of the male-specific exon of the doublesex gene in Diptera.
Hughes, Austin L
2011-02-01
In Diptera (Insecta), alternatively spliced male-specific and female-specific products of the doublesex (dsx) gene play a key role in regulating development of the adult genital structures from the genital disc. Analysis of the pattern of nucleotide substitution of different domains of the dsx gene in 29 dipteran species showed that, over short evolutionary times, purifying selection predominated on the domain common to both sexes, the female-specific exons, and the and male-specific exon. However, over longer the evolutionary time frames represented by between-family comparisons, the male-specific exon accumulated nonsynonymous substitutions at a much more rapid rate than either the common domain or the female-specific exon. Overall, the accumulation of nonsynonymous substitutions in the male-specific exon occurred at a significantly greater than linear rate relative to the common domain, whereas the accumulation of nonsynonymous substitutions in the female-specific exon occurred at less than linear rate relative to the common domain. The evolution of the male-specific exon of dsx thus shows a pattern reminiscent of that seen in the "runaway" evolution of male secondary sexual characters at the morphological level, consistent with the hypothesis that female choice is an important factor in the morphological diversification of insect male genitalia. Copyright © 2010 Elsevier B.V. All rights reserved.
Genome-Wide Analysis of the NADK Gene Family in Plants
Li, Wen-Yan; Wang, Xiang; Li, Ri; Li, Wen-Qiang; Chen, Kun-Ming
2014-01-01
Background NAD(H) kinase (NADK) is the key enzyme that catalyzes de novo synthesis of NADP(H) from NAD(H) for NADP(H)-based metabolic pathways. In plants, NADKs form functional subfamilies. Studies of these families in Arabidopsis thaliana indicate that they have undergone considerable evolutionary selection; however, the detailed evolutionary history and functions of the various NADKs in plants are not clearly understood. Principal Findings We performed a comparative genomic analysis that identified 74 NADK gene homologs from 24 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots and eudicots. Phylogenetic and structural analysis classified these NADK genes into four well-conserved subfamilies with considerable variety in the domain organization and gene structure among subfamily members. In addition to the typical NAD_kinase domain, additional domains, such as adenylate kinase, dual-specificity phosphatase, and protein tyrosine phosphatase catalytic domains, were found in subfamily II. Interestingly, NADKs in subfamily III exhibited low sequence similarity (∼30%) in the kinase domain within the subfamily and with the other subfamilies. These observations suggest that gene fusion and exon shuffling may have occurred after gene duplication, leading to specific domain organization seen in subfamilies II and III, respectively. Further analysis of the exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures, during the process of structural evolution of NADK family genes. Finally, both available global microarray data analysis and qRT-RCR experiments revealed that the NADK genes in Arabidopsis and Oryza sativa show different expression patterns in different developmental stages and under several different abiotic/biotic stresses and hormone treatments, underscoring the functional diversity and functional divergence of the NADK family in plants. Conclusions These findings will facilitate further studies of the NADK family and provide valuable information for functional validation of this family in plants. PMID:24968225
Molecular Structure and Transformation of the Glucose Dehydrogenase Gene in Drosophila Melanogaster
Whetten, R.; Organ, E.; Krasney, P.; Cox-Foster, D.; Cavener, D.
1988-01-01
We have precisely mapped and sequenced the three 5' exons of the Drosophila melanogaster Gld gene and have identified the start sites for transcription and translation. The first exon is composed of 335 nucleotides and does not contain any putative translation start codons. The second exon is separated from the first exon by 8 kb and contains the Gld translation start codon. The inferred amino acid sequence of the amino terminus contains two unusual features: three tandem repeats of serine-alanine, and a relatively high density of cysteine residues. P element-mediated transformation experiments demonstrated that a 17.5-kb genomic fragment contains the functional and regulatory components of the Gld gene. PMID:3143620
New Phosphospecific Antibody Reveals Isoform-Specific Phosphorylation of CPEB3 Protein
Sehgal, Kapil; Sylvester, Marc; Skubal, Magdalena; Josten, Michele; Steinhäuser, Christian; De Koninck, Paul; Theis, Martin
2016-01-01
Cytoplasmic Polyadenylation Element Binding proteins (CPEBs) are a family of polyadenylation factors interacting with 3’UTRs of mRNA and thereby regulating gene expression. Various functions of CPEBs in development, synaptic plasticity, and cellular senescence have been reported. Four CPEB family members of partially overlapping functions have been described to date, each containing a distinct alternatively spliced region. This region is highly conserved between CPEBs-2-4 and contains a putative phosphorylation consensus, overlapping with the exon seven of CPEB3. We previously found CPEBs-2-4 splice isoforms containing exon seven to be predominantly present in neurons, and the isoform expression pattern to be cell type-specific. Here, focusing on the alternatively spliced region of CPEB3, we determined that putative neuronal isoforms of CPEB3 are phosphorylated. Using a new phosphospecific antibody directed to the phosphorylation consensus we found Protein Kinase A and Calcium/Calmodulin-dependent Protein Kinase II to robustly phosphorylate CPEB3 in vitro and in primary hippocampal neurons. Interestingly, status epilepticus induced by systemic kainate injection in mice led to specific upregulation of the CPEB3 isoforms containing exon seven. Extensive analysis of CPEB3 phosphorylation in vitro revealed two other phosphorylation sites. In addition, we found plethora of potential kinases that might be targeting the alternatively spliced kinase consensus site of CPEB3. As this site is highly conserved between the CPEB family members, we suggest the existence of a splicing-based regulatory mechanism of CPEB function, and describe a robust phosphospecific antibody to study it in future. PMID:26915047
Bonen, Linda; Boer, Poppo H.; Gray, Michael W.
1984-01-01
We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565
Matus, José Tomás; Aquea, Felipe; Arce-Johnson, Patricio
2008-01-01
Background The MYB superfamily constitutes the most abundant group of transcription factors described in plants. Members control processes such as epidermal cell differentiation, stomatal aperture, flavonoid synthesis, cold and drought tolerance and pathogen resistance. No genome-wide characterization of this family has been conducted in a woody species such as grapevine. In addition, previous analysis of the recently released grape genome sequence suggested expansion events of several gene families involved in wine quality. Results We describe and classify 108 members of the grape R2R3 MYB gene subfamily in terms of their genomic gene structures and similarity to their putative Arabidopsis thaliana orthologues. Seven gene models were derived and analyzed in terms of gene expression and their DNA binding domain structures. Despite low overall sequence homology in the C-terminus of all proteins, even in those with similar functions across Arabidopsis and Vitis, highly conserved motif sequences and exon lengths were found. The grape epidermal cell fate clade is expanded when compared with the Arabidopsis and rice MYB subfamilies. Two anthocyanin MYBA related clusters were identified in chromosomes 2 and 14, one of which includes the previously described grape colour locus. Tannin related loci were also detected with eight candidate homologues in chromosomes 4, 9 and 11. Conclusion This genome wide transcription factor analysis in Vitis suggests that clade-specific grape R2R3 MYB genes are expanded while other MYB genes could be well conserved compared to Arabidopsis. MYB gene abundance, homology and orientation within particular loci also suggests that expanded MYB clades conferring quality attributes of grapes and wines, such as colour and astringency, could possess redundant, overlapping and cooperative functions. PMID:18647406
Lyons, Brendan M; McHenry, Monique A; Barrington, David S
2017-07-01
Cytosolic phosphoglucose isomerase (pgiC) is an enzyme essential to glycolysis found universally in eukaryotes, but broad understanding of variation in the gene coding for pgiC is lacking for ferns. We used a substantially expanded representation of the gene for Andean species of the fern genus Polystichum to characterize pgiC in ferns relative to angiosperms, insects, and an amoebozoan; assess the impact of selection versus neutral evolutionary processes on pgiC; and explore evolutionary relationships of selected Andean species. The dataset of complete sequences comprised nine accessions representing seven species and one hybrid from the Andes and Serra do Mar. The aligned sequences of the full data set comprised 3376 base pairs (70% of the entire gene) including 17 exons and 15 introns from two central areas of the gene. The exons are highly conserved relative to angiosperms and retain substantial homology to insect pgiC, but intron length and structure are unique to the ferns. Average intron size is similar to angiosperms; intron number and location in insects are unlike those of the plants we considered. The introns included an array of indels and, in intron 7, an extensive microsatellite array with potential utility in analyzing population-level histories. Bayesian and maximum-parsimony analysis of 129 variable nucleotides in the Andean polystichums revealed that 59 (1.7% of the 3376 total) were phylogenetically informative; most of these united sister accessions. The phylogenetic trees for the Andean polystichums were incongruent with previously published cpDNA trees for the same taxa, likely the result of rapid evolutionary change in the introns and contrasting stability in the exons. The exons code a total of seven amino-acid substitutions. Comparison of non-synonymous to synonymous substitutions did not suggest that the pgiC gene is under selection in the Andes. Variation in pgiC including two additional accessions represented by incomplete sequences provided new insights into reticulate relationships among Andean taxa. Copyright © 2017 Elsevier Inc. All rights reserved.
Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.
2012-01-01
Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints. PMID:22446687
Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons
Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil
2012-01-01
Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994
MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks
USDA-ARS?s Scientific Manuscript database
Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...
The genomic organization of the Fanconi anemia group A (FAA) gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ianzano, L.; Centra, M.; Savino, M.
1997-05-01
Fanconi anemia (FA) is a genetically heterogeneous disease involving at least five genes on the basis of complementation analysis (FAA to FAE). The FAA gene has been recently isolated by two independent approaches, positional and functional cloning. In the present study we describe the genomic structure of the FAA gene. The gene contains 43 exons spanning approximately 80 kb as determined by the alignment of four cosmids and the fine localization of the first and the last exons in restriction fragments of these clones. Exons range from 34 to 188 bp. All but three of the splice sites were consistentmore » with the ag-gt rule. We also describe three alternative splicing events in cDNA clones that result in the loss of exon 37, a 23-bp deletion at the 5{prime} end of exon 41. Sequence analysis of the 5{prime} region upstream of the putative transcription start site showed no obvious TATA and CAAT boxes, but did show a GC-rich region, typical of housekeeping genes. Knowledge of the structure of the FAA gene will provide an invaluable resource for the discovery of mutations in the gene that accounts for about 60-66% of FA patients. 24 refs., 3 figs., 1 tab.« less
Calpain cleavage within dysferlin exon 40a releases a synaptotagmin-like module for membrane repair
Redpath, G. M. I.; Woolger, N.; Piper, A. K.; Lemckert, F. A.; Lek, A.; Greer, P. A.; North, K. N.; Cooper, S. T.
2014-01-01
Dysferlin and calpain are important mediators of the emergency response to repair plasma membrane injury. Our previous research revealed that membrane injury induces cleavage of dysferlin to release a synaptotagmin-like C-terminal module we termed mini-dysferlinC72. Here we show that injury-activated cleavage of dysferlin is mediated by the ubiquitous calpains via a cleavage motif encoded by alternately spliced exon 40a. An exon 40a–specific antibody recognizing cleaved mini-dysferlinC72 intensely labels the circumference of injury sites, supporting a key role for dysferlinExon40a isoforms in membrane repair and consistent with our evidence suggesting that the calpain-cleaved C-terminal module is the form specifically recruited to injury sites. Calpain cleavage of dysferlin is a ubiquitous response to membrane injury in multiple cell lineages and occurs independently of the membrane repair protein MG53. Our study links calpain and dysferlin in the calcium-activated vesicle fusion of membrane repair, placing calpains as upstream mediators of a membrane repair cascade that elicits cleaved dysferlin as an effector. Of importance, we reveal that myoferlin and otoferlin are also cleaved enzymatically to release similar C-terminal modules, bearing two C2 domains and a transmembrane domain. Evolutionary preservation of this feature highlights its functional importance and suggests that this highly conserved C-terminal region of ferlins represents a functionally specialized vesicle fusion module. PMID:25143396
Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins.
Varadi, Mihaly; Zsolyomi, Fruzsina; Guharoy, Mainak; Tompa, Peter
2015-01-01
Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their functionality through a quantitative description of the evolutionary conservation of disordered segments involved in binding, and investigated the structural implications of flexibility in terms of conformational stability and interface formation. We conclude that the functional role of intrinsically disordered protein segments in RNA-binding is two-fold: first, these regions establish extended, conserved electrostatic interfaces with RNAs via induced fit. Second, conformational flexibility enables them to target different RNA partners, providing multi-functionality, while also ensuring specificity. These findings emphasize the functional importance of intrinsically disordered regions in RNA-binding proteins.
Analysis of nonuniformity in intron phase distribution.
Fedorov, A; Suboch, G; Bujakov, M; Fedorova, L
1992-01-01
The distribution of different intron groups with respect to phases has been analyzed. It has been established that group II introns and nuclear introns have a minimum frequency of phase 2 introns. Since the phase of introns is an extremely conservative measure the observed minimum reflects evolutionary processes. A sample of all known, group I introns was too small to provide a valid characteristic of their phase distribution. The findings observed for the unequal distribution of phases cannot be explained solely on the basis of the mobile properties of introns. One of the most likely explanations for this nonuniformity in the intron phase distribution is the process of exon shuffling. It is proposed that group II introns originated at the early stages of evolution and were involved in the process of exon shuffling. PMID:1598214
Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M
2003-01-01
Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing.
Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M
2003-01-01
Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing. PMID:14519201
Mutation analysis in a German family identified a new cataract-causing allele in the CRYBB2 gene
Pauli, Silke; Söker, Torben; Klopp, Norman; Illig, Thomas; Engel, Wolfgang
2007-01-01
Purpose The study demonstrates the functional candidate gene analysis in a cataract family of German descent. Methods We screened a German family, clinically documented to have congenital cataracts, for mutation in the candidate genes CRYG (A to D) and CRYBB2 through polymerase chain reaction analyses and sequencing. Results Congenital cataract was first observed in a daughter of healthy parents. Her two children (a boy and a girl) also suffer from congenital cataracts and have been operated within the first weeks of birth. Morphologically, the cataract is characterized as nuclear with an additional ring-shaped cortical opacity. Molecular analysis revealed no causative mutation in any of the CRYG genes. However, sequencing of the exons of the CRYBB2 gene identified a sequence variation in exon 5 (383 A>T) with a substitution of Asp to Val at position 128. All three affected family members revealed this change but it was not observed in any of the unaffected persons of the family. The putative mutation creates a restriction site for the enzyme TaiI. This mutation was checked for in controls of randomly selected DNA samples from ophthalmologically normal individuals from the population-based KORA S4 study (n=96) and no mutation was observed. Moreover, the Asp at position 128 is within a stretch of 12 amino acids, which are highly conserved throughout the animal kingdom. For the mutant protein, the isoelectric point is raised from pH 6.50 to 6.75. Additionally, the random coil structure of the protein between the amino acids 126-139 is interrupted by a short extended strand structure. In addition, this region becomes hydrophobic (from neutral to +1) and the electrostatic potential in the region surrounding the exchanged amino acid alters from a mainly negative potential to an enlarged positive potential. Conclusions The D128V mutation segregates only in affected family members and is not seen in representative controls. It represents the first mutation outside exon 6 of the human CRYBB2 gene. PMID:17653036
Relative resolution: A hybrid formalism for fluid mixtures.
Chaimovich, Aviel; Peter, Christine; Kremer, Kurt
2015-12-28
We show here that molecular resolution is inherently hybrid in terms of relative separation. While nearest neighbors are characterized by a fine-grained (geometrically detailed) model, other neighbors are characterized by a coarse-grained (isotropically simplified) model. We notably present an analytical expression for relating the two models via energy conservation. This hybrid framework is correspondingly capable of retrieving the structural and thermal behavior of various multi-component and multi-phase fluids across state space.
Relative resolution: A hybrid formalism for fluid mixtures
NASA Astrophysics Data System (ADS)
Chaimovich, Aviel; Peter, Christine; Kremer, Kurt
2015-12-01
We show here that molecular resolution is inherently hybrid in terms of relative separation. While nearest neighbors are characterized by a fine-grained (geometrically detailed) model, other neighbors are characterized by a coarse-grained (isotropically simplified) model. We notably present an analytical expression for relating the two models via energy conservation. This hybrid framework is correspondingly capable of retrieving the structural and thermal behavior of various multi-component and multi-phase fluids across state space.
Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin
ERIC Educational Resources Information Center
Offner, Susan
2010-01-01
The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.
NASA Astrophysics Data System (ADS)
Li, Shengjie; Bai, Junjie; Wang, Lin
2008-08-01
Myostatin or GDF-8, a member of the transforming growth factor-β (TGF-β) superfamily, has been demonstrated to be a negative regulator of skeletal muscle mass in mammals. In the present study, we obtained a 5.64 kb sequence of myostatin encoding gene and its promoter from largemouth bass ( Micropterus salmoides). The myostatin encoding gene consisted of three exons (488 bp, 371 bp and 1779 bp, respectively) and two introns (390 bp and 855 bp, respectively). The intron-exon boundaries were conservative in comparison with those of mammalian myostatin encoding genes, whereas the size of introns was smaller than that of mammals. Sequence analysis of 1.569 kb of the largemouth bass myostatin gene promoter region revealed that it contained two TATA boxes, one CAAT box and nine putative E-boxes. Putative muscle growth response elements for myocyte enhancer factor 2 (MEF2), serum response factor (SRF), activator protein 1 (AP1), etc., and muscle-specific Mt binding site (MTBF) were also detected. Some of the transcription factor binding sites were conserved among five teleost species. This information will be useful for studying the transcriptional regulation of myostatin in fish.
Tollefson, Ann E.; Ying, Baoling; Doronin, Konstantin; Sidor, Peter D.; Wold, William S. M.
2007-01-01
A short open reading frame named the “U exon,” located on the adenovirus (Ad) l-strand (for leftward transcription) between the early E3 region and the fiber gene, is conserved in mastadenoviruses. We have observed that Ad5 mutants with large deletions in E3 that infringe on the U exon display a mild growth defect, as well as an aberrant Ad E2 DNA-binding protein (DBP) intranuclear localization pattern and an apparent failure to organize replication centers during late infection. Mutants in which the U exon DNA is reconstructed have a reversed phenotype. Chow et al. (L. T. Chow et al., J. Mol. Biol. 134:265-303, 1979) described mRNAs initiating in the region of the U exon and spliced to downstream sequences in the late DBP mRNA leader and the DBP-coding region. We have cloned this mRNA (as cDNA) from Ad5 late mRNA; the predicted protein is 217 amino acids, initiating in the U exon and continuing in frame in the DBP leader and in the DBP-coding region but in a different reading frame from DBP. Polyclonal and monoclonal antibodies generated against the predicted U exon protein (UXP) showed that UXP is ∼24K in size by immunoblot and is a late protein. At 18 to 24 h postinfection, UXP is strongly associated with nucleoli and is found throughout the nucleus; later, UXP is associated with the periphery of replication centers, suggesting a function relevant to Ad DNA replication or RNA transcription. UXP is expressed by all four species C Ads. When expressed in transient transfections, UXP complements the aberrant DBP localization pattern of UXP-negative Ad5 mutants. Our data indicate that UXP is a previously unrecognized protein derived from a novel late l-strand transcription unit. PMID:17881437
Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava
Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian
2016-01-01
The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033
Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava.
Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian
2016-01-01
The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.
Leclercq, Julie; Adams-Phillips, Lori C.; Zegzouti, Hicham; Jones, Brian; Latché, Alain; Giovannoni, James J.; Pech, Jean-Claude; Bouzayen, Mondher
2002-01-01
LeCTR1 was initially isolated by both differential display reverse transcriptase-polymerase chain reaction screening for tomato (Lycopersicon esculentum) fruit ethylene-inducible genes and through homology with the Arabidopsis CTR1 cDNA. LeCTR1 shares strong nucleotide sequence homology with Arabidopsis CTR1, a gene acting downstream of the ethylene receptor and showing similarity to the Raf family of serine/threonine protein kinases. The length of the LeCTR1 transcribed region from ATG to stop codon (12,000 bp) is more than twice that of Arabidopsis CTR1 (4,700 bp). Structural analysis reveals perfect conservation of both the number and position of introns and exons in LeCTR1 and Arabidopsis CTR1. The introns in LeCTR1 are much longer, however. To address whether this structural conservation is indicative of functional conservation of the corresponding proteins, we expressed LeCTR1 in the Arabidopsis ctr1-1 (constitutive triple response 1) mutant under the direction of the 35S promoter. Our data clearly show that ectopic expression of LeCTR1 in the Arabidopsis ctr1-1 mutant can restore normal ethylene signaling. The recovery of normal ethylene sensitivity upon heterologous expression of LeCTR1 was also confirmed by restored glucose sensitivity absent in the Arabidopsis ctr1-1 mutant. Expression studies confirm ethylene responsiveness of LeCTR1 in various tissues, including ripening fruit, and may suggest the evolution of alternate regulatory mechanisms in tomato versus Arabidopsis. PMID:12427980
Hentschel, Julia; Tatun, Dana; Parkhomchuk, Dmitri; Kurth, Ingo; Schimmel, Bettina; Heinrich-Weltzien, Roswitha; Bertzbach, Sabine; Peters, Hartmut; Beetz, Christian
2016-09-15
Amelogenesis imperfecta (AI) is a clinically and genetically heterogeneous disorder of tooth development which is due to aberrant deposition or composition of enamel. Both syndromic and isolated forms exist; they may be inherited in an X-linked, autosomal recessive, or autosomal dominant manner. WDR72 is one of ten currently known genes for recessive isolated AI; nine WDR72 mutations affecting single nucleotides have been described to date. Based on whole exome sequencing in a large consanguineous AI pedigree, we obtained evidence for presence of a multi-exonic WDR72 deletion. A home-made multiplex ligation-dependent probe amplification assay was used to confirm the aberration, to narrow its extent, and to identify heterozygous carriers. Our study extends the mutational spectrum for WDR72 to include large deletions, and supports a relevance of the previously proposed loss-of-function mechanism. It also introduces an easy-to-use and highly sensitive tool for detecting WDR72 copy number alterations. Copyright © 2016. Published by Elsevier B.V.
Combined sequence and structure analysis of the fungal laccase family.
Kumar, S V Suresh; Phale, Prashant S; Durani, S; Wangikar, Pramod P
2003-08-20
Plant and fungal laccases belong to the family of multi-copper oxidases and show much broader substrate specificity than other members of the family. Laccases have consequently been of interest for potential industrial applications. We have analyzed the essential sequence features of fungal laccases based on multiple sequence alignments of more than 100 laccases. This has resulted in identification of a set of four ungapped sequence regions, L1-L4, as the overall signature sequences that can be used to identify the laccases, distinguishing them within the broader class of multi-copper oxidases. The 12 amino acid residues in the enzymes serving as the copper ligands are housed within these four identified conserved regions, of which L2 and L4 conform to the earlier reported copper signature sequences of multi-copper oxidases while L1 and L3 are distinctive to the laccases. The mapping of regions L1-L4 on to the three-dimensional structure of the Coprinus cinerius laccase indicates that many of the non-copper-ligating residues of the conserved regions could be critical in maintaining a specific, more or less C-2 symmetric, protein conformational motif characterizing the active site apparatus of the enzymes. The observed intraprotein homologies between L1 and L3 and between L2 and L4 at both the structure and the sequence levels suggest that the quasi C-2 symmetric active site conformational motif may have arisen from a structural duplication event that neither the sequence homology analysis nor the structure homology analysis alone would have unraveled. Although the sequence and structure homology is not detectable in the rest of the protein, the relative orientation of region L1 with L2 is similar to that of L3 with L4. The structure duplication of first-shell and second-shell residues has become cryptic because the intraprotein sequence homology noticeable for a given laccase becomes significant only after comparing the conservation pattern in several fungal laccases. The identified motifs, L1-L4, can be useful in searching the newly sequenced genomes for putative laccase enzymes. Copyright 2003 Wiley Periodicals, Inc. Biotechnol Bioeng 83: 386-394, 2003.
Probing binding hot spots at protein-RNA recognition sites.
Barik, Amita; Nithin, Chandran; Karampudi, Naga Bhushana Rao; Mukherjee, Sunandan; Bahadur, Ranjit Prasad
2016-01-29
We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein-RNA interfaces to probe the binding hot spots at protein-RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein-protein and protein-RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein-RNA recognition sites with desired affinity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Purdue, P E; Lumb, M J; Allsop, J; Minatogawa, Y; Danpure, C J
1992-05-01
We have synthesized and sequenced alanine:glyoxylate aminotransferase (AGT; HGMW-approved symbol for the gene--AGXT) cDNA from the liver of a primary hyperoxaluria type 1 (PH1) patient who had normal levels of hepatic peroxisomal immunoreactive AGT protein, but no AGT catalytic activity. This revealed the presence of a single point mutation (G----A at cDNA nucleotide 367), which is predicted to cause a glycine-to-glutamate substitution at residue 82 of the AGT protein. This mutation is located in exon 2 of the AGT gene and leads to the loss of an AvaI restriction site. Exon 2-specific PCR followed by AvaI digestion showed that this patient was homozygous for this mutation. In addition, three other PH1 patients, one related to and two unrelated to, but with enzymological phenotype similar to that of the first patient, were also shown to be homozygous for the mutation. However, one other phenotypically similar PH1 patient was shown to lack this mutation. The mechanism by which the glycine-to-glutamate substitution at residue 82 causes loss of catalytic activity remains to be resolved. However, the protein sequence in this region is highly conserved between different mammals, and the substitution at residue 82 is predicted to cause significant local structural alterations.
Nuzzo, F; Bulato, C; Nielsen, B I; Lee, K; Wielders, S J; Simioni, P; Key, N S; Castoldi, E
2015-03-01
Coagulation factor V (FV) deficiency is a rare autosomal recessive bleeding disorder. We investigated a patient with severe FV deficiency (FV:C < 3%) and moderate bleeding symptoms. Thrombin generation experiments showed residual FV expression in the patient's plasma, which was quantified as 0.7 ± 0.3% by a sensitive prothrombinase-based assay. F5 gene sequencing identified a novel missense mutation in exon 4 (c.578G>C, p.Cys193Ser), predicting the abolition of a conserved disulphide bridge, and an apparently synonymous variant in exon 8 (c.1281C>G). The observation that half of the patient's F5 mRNA lacked the last 18 nucleotides of exon 8 prompted us to re-evaluate the c.1281C>G variant for its possible effects on splicing. Bioinformatics sequence analysis predicted that this transversion would activate a cryptic donor splice site and abolish an exonic splicing enhancer. Characterization in a F5 minigene model confirmed that the c.1281C>G variant was responsible for the patient's splicing defect, which could be partially corrected by a mutation-specific morpholino antisense oligonucleotide. The aberrantly spliced F5 mRNA, whose stability was similar to that of the normal mRNA, encoded a putative FV mutant lacking amino acids 427-432. Expression in COS-1 cells indicated that the mutant protein is poorly secreted and not functional. In conclusion, the c.1281C>G mutation, which was predicted to be translationally silent and hence neutral, causes FV deficiency by impairing pre-mRNA splicing. This finding underscores the importance of cDNA analysis for the correct assessment of exonic mutations. © 2014 John Wiley & Sons Ltd.
Diverse alternative back-splicing and alternative splicing landscape of circular RNAs
Zhang, Xiao-Ou; Dong, Rui; Zhang, Yang; Zhang, Jia-Lin; Luo, Zheng; Zhang, Jun; Chen, Ling-Ling; Yang, Li
2016-01-01
Circular RNAs (circRNAs) derived from back-spliced exons have been widely identified as being co-expressed with their linear counterparts. A single gene locus can produce multiple circRNAs through alternative back-splice site selection and/or alternative splice site selection; however, a detailed map of alternative back-splicing/splicing in circRNAs is lacking. Here, with the upgraded CIRCexplorer2 pipeline, we systematically annotated different types of alternative back-splicing and alternative splicing events in circRNAs from various cell lines. Compared with their linear cognate RNAs, circRNAs exhibited distinct patterns of alternative back-splicing and alternative splicing. Alternative back-splice site selection was correlated with the competition of putative RNA pairs across introns that bracket alternative back-splice sites. In addition, all four basic types of alternative splicing that have been identified in the (linear) mRNA process were found within circRNAs, and many exons were predominantly spliced in circRNAs. Unexpectedly, thousands of previously unannotated exons were detected in circRNAs from the examined cell lines. Although these novel exons had similar splice site strength, they were much less conserved than known exons in sequences. Finally, both alternative back-splicing and circRNA-predominant alternative splicing were highly diverse among the examined cell lines. All of the identified alternative back-splicing and alternative splicing in circRNAs are available in the CIRCpedia database (http://www.picb.ac.cn/rnomics/circpedia). Collectively, the annotation of alternative back-splicing and alternative splicing in circRNAs provides a valuable resource for depicting the complexity of circRNA biogenesis and for studying the potential functions of circRNAs in different cells. PMID:27365365
Chang, Chin-I; Zhang, Yong-An; Zou, Jun; Nie, Pin; Secombes, Christopher J.
2006-01-01
Further to the previous finding of the rainbow trout rtCATH_1 gene, this paper describes three more cathelicidin genes found in salmonids: two in Atlantic salmon, named asCATH_1 and asCATH_2, and one in rainbow trout, named rtCATH_2. All the three new salmonid cathelicidin genes share the common characteristics of mammalian cathelicidin genes, such as consisting of four exons and possessing a highly conserved preproregion and four invariant cysteines clustered in the C-terminal region of the cathelin-like domain. The asCATH_1 gene is homologous to the rainbow trout rtCATH_1 gene, in that it possesses three repeat motifs of TGGGGGTGGC in exon IV and two cysteine residues in the predicted mature peptide, while the asCATH_2 gene and rtCATH_2 gene are homologues of each other, with 96% nucleotide identity. Salmonid cathelicidins possess the same elastase-sensitive residue, threonine, as hagfish cathelicidins and the rabbit CAP18 molecule. The cleavage site of the four salmonid cathelicidins is within a conserved amino acid motif of QKIRTRR, which is at the beginning of the sequence encoded by exon IV. Two 36-residue peptides corresponding to the core part of rtCATH_1 and rtCATH_2 were chemically synthesized and shown to exhibit potent antimicrobial activity. rtCATH_2 was expressed constitutively in gill, head kidney, intestine, skin and spleen, while the expression of rtCATH_1 was inducible in gill, head kidney, and spleen after bacterial challenge. Four cathelicidin genes have now been characterized in salmonids and two were identified in hagfish, confirming that cathelicidin genes evolved early and are likely present in all vertebrates. PMID:16377685
Li, Guang-Qi; Zang, Xiao-Nan; Zhang, Xue-Cheng; Lu, Ning; Ding, Yan; Gong, Le; Chen, Wen-Chao
2014-03-15
To study the response of Gracilaria lemaneiformis to heat stress, two key enzymes - ubiquitin-activating enzyme (E1) and ubiquitin-conjugating enzyme (E2) - of the Ubiquitin/26S proteasome pathway (UPP) were studied in three strains of G. lemaneiformis-wild type, heat-tolerant cultivar 981 and heat-tolerant cultivar 07-2. The full length DNA sequence of E1 contained only one exon. The open reading frame (ORF) sequence was 981 nucleotides encoding 326 amino acids, which contained conserved ATP binding sites (LYDRQIRLWGLE, ELAKNVLLAGV, LKEMN, VVCAI) and the ubiquitin-activating domains (VVCAI…LMTEAC, VFLDLGDEYSYQ, AIVGGMWGRE). The gene sequence of E2 contained four exons and three introns. The sum of the four exons gave an open reading frame sequence of 444 nucleotides encoding 147 amino acids, which contained a conserved ubiquitin-activating domain (GSICLDIL), ubiquitin-conjugating domains (RIYHPNIN, KVLLSICSLL, DDPLV) and ubiquitin-ligase (E3) recognition sites (KRI, YPF, WSP). Real-time-PCR analysis of transcription levels of E1 and E2 under heat shock conditions (28°C and 32°C) showed that in wild type, transcriptions of E1 and E2 were up-regulated at 28°C, while at 32°C, transcriptions of the two enzymes were below the normal level. In cultivar 981 and cultivar 07-2 of G. lemaneiformis, the transcription levels of the two enzymes were up-regulated at 32°C, and transcription level of cultivar 07-2 was even higher than that of cultivar 981. These results suggest that the UPP plays an important role in high temperature resistance of G. lemaneiformis and the bioactivity of UPP is directly related to the heat-resistant ability of G. lemaneiformis. Copyright © 2013 Elsevier B.V. All rights reserved.
Toward quantum plasmonic networks
Holtfrerich, M. W.; Dowran, M.; Davidson, R.; ...
2016-08-30
Here, we demonstrate the transduction of macroscopic quantum entanglement by independent, distant plasmonic structures embedded in separate thin silver films. In particular, we show that the plasmon-mediated transmission through each film conserves spatially dependent, entangled quantum images, opening the door for the implementation of parallel quantum protocols, super-resolution imaging, and quantum plasmonic sensing geometries at the nanoscale level. The conservation of quantum information by the transduction process shows that continuous variable multi-mode entanglement is momentarily transferred from entangled beams of light to the space-like separated, completely independent plasmonic structures, thus providing a first important step toward establishing a multichannel quantummore » network across separate solid-state substrates.« less
Lee, Younghee; Han, Seonggyun; Kim, Dongwook; Kim, Dokyoon; Horgousluoglu, Emrin; Risacher, Shannon L; Saykin, Andrew J; Nho, Kwangsik
2018-01-01
Genetic variation in cis-regulatory elements related to splicing machinery and splicing regulatory elements (SREs) results in exon skipping and undesired protein products. We developed a splicing decision model to identify actionable loci among common SNPs for gene regulation. The splicing decision model identified SNPs affecting exon skipping by analyzing sequence-driven alternative splicing (AS) models and by scanning the genome for the regions with putative SRE motifs. We used non-Hispanic Caucasians with neuroimaging, and fluid biomarkers for Alzheimer's disease (AD) and identified 17,088 common exonic SNPs affecting exon skipping. GWAS identified one SNP (rs1140317) in HLA-DQB1 as significantly associated with entorhinal cortical thickness, AD neuroimaging biomarker, after controlling for multiple testing. Further analysis revealed that rs1140317 was significantly associated with brain amyloid-f deposition (PET and CSF). HLA-DQB1 is an essential immune gene and may regulate AS, thereby contributing to AD pathology. SRE may hold potential as novel therapeutic targets for AD.
Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan
2009-01-01
We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624
Rapisuwon, Suthee; Parks, Kellie; Al-Refaie, Waddah; Atkins, Michael B
2014-10-01
Primary mucosal melanomas represent ∼1.3% of all cases of melanoma diagnosed in the USA. The sinonasal location is the most common primary site. Mutations in the KIT gene occur in 10-22% of mucosal melanomas. Tumor response to imatinib mesylate has been reported in about half of the patients with tumors harboring KIT mutations. Responses are almost exclusively restricted to tumors with mutations in KIT exon 9 or 11. We report a case of a patient with a sinonasal mucosal melanoma with a novel exon 8 mutation (C443S) who had marked initial response to imatinib. Somatic exon 8 KIT mutations have not been previously reported in mucosal melanoma or in other human solid tumors; however, such mutations have been reported in canine and feline mast cell tumors. Protein transcripts from exon 8 play an important role in the structural and functional integrity of the extracellular domain of KIT. In preclinical studies, a mutation in exon 8 led to autophosphorylation, independent of KIT ligand, and constitutive activation of the tyrosine kinase. This biology may explain the successful application of imatinib in animals with tumors harboring exon 8 KIT mutations and in our patient with mucosal melanoma. This report expands the population of patients with melanoma who might benefit from imatinib to those with somatic exon 8 KIT mutations. Such mutations should be looked for in patients with mucosal melanoma.
Lentes, K U; Tu, N; Chen, H; Winnikes, U; Reinert, I; Marmann, G; Pirke, K M
1999-01-01
Uncoupling proteins (UCPs) are mitochondrial membrane transporters which are involved in dissipating the proton electrochemical gradient thereby releasing stored energy as heat. This implies a major role of UCPs in energy metabolism and thermogenesis which when deregulated are key risk factors for the development of obesity and other eating disorders. Recent studies have shown that the sympathetic nervous system, via norepinephrine (beta-adrenoceptors) and cAMP, as well as thyroid hormones and PPAR gamma ligands seem to be major regulators of UCP expression. From the three different UCPs identified so far by gene cloning UCP1 is expressed exclusively in brown adipocytes while UCP2 is widely expressed. The third analogue, UCP3, is expressed predominantly in human skeletal muscle and was found to exist in a long and a short form. At the amino acid level UCP2 has about 59% homology to UCP1 while UCP3 is 73% identical to UCP2. Both UCP2 and UCP3 were mapped in close proximity (75-150 kb) to regions of human chromosome 11 (11q13) that have been linked to obesity and hyper-insulinaemia. Furthermore, there is strong evidence that UCP2, by virtue of its ubiquitous expression, may be important for determining basal metabolic rate. Based on the published full-length cDNA sequence we have deduced the genomic structure of the human UCP2 (hUCP2) gene by PCR and direct sequence analysis. The hUCP2 gene spans over 8.4 kb distributed on 8 exons. The localization of the exon/intron boundaries within the coding region matches precisely the one found in the human UCP1 gene and is almost conserved in the recently discovered UCP3 gene as well. However, the size of each of the introns in the hUCP2 gene differs from its UCP1 and UCP3 counterparts. It varies from 81 bp (intron 5) to about 3 kb (intron 2). The high degree of homology at the nucleotide level and the conservation of the exon/intron boundaries among the three UCP genes suggests that they may have evolved from a common ancestor or are the result from gene duplication events. Mutational analysis of the hUCP2 gene in a cohort of 25 children of caucasian origin (aged 7-13) characterized by low BMR values revealed a point mutation in exon 4 (C to T transition at position 164 of the corresponding cDNA resulting in the substitution of an alanine residue by a valine at codon 55) and an insertion polymorphism in exon 8. The insertion polymorphism consists of a 45 bp repeat located 150 bp downstream of the stop codon in the 3'-UTR. The allele frequencies were 0.61 and 0.39 for the alanine and valine encoded alleles, respectively, and 0.71 versus 0.29 for the insertion polymorphism. Expression studies of the wildtype and mutant forms of UCP2 should clarify the functional consequences these mutations may have on energy metabolism and body weight regulation. In addition, mapping of the promoter region and the identification of putative promoter regulatory sequences should give insight into the transcriptional regulation of UCP2 expression--in particular by anyone of the above mentioned factors--in vitro and in vivo.
A national geographic framework for guiding conservation on a landscape scale
Millard, Michael J.; Czarnecki, Craig A.; Morton, John M.; Brandt, Laura A.; Briggs, Jennifer S.; Shipley, Frank S.; Sayre, Roger G.; Sponholtz, Pamela J.; Perkins, David; Simpkins, Darin G.; Taylor, Janith
2012-01-01
The U.S. Fish and Wildlife Service, along with the global conservation community, has recognized that the conservation challenges of the 21st century far exceed the responsibilities and footprint of any individual agency or program. The ecological effects of climate change and other anthropogenic stressors do not recognize geopolitical boundaries and, as such, demand a national geographic framework to provide structure for cross-jurisdictional and landscape-scale conservation strategies. In 2009, a new map of ecologically based conservation regions in which to organize capacity and implement strategic habitat conservation was developed using rapid prototyping and expert elicitation by an interagency team of U.S. Fish and Wildlife Service and U.S. Geological Survey scientists and conservation professionals. Incorporating Bird Conservation Regions, Freshwater Ecoregions, and U.S. Geological Survey hydrologic unit codes, the new geographic framework provides a spatial template for building conservation capacity and focusing biological planning and conservation design efforts. The Department of Interior's Landscape Conservation Cooperatives are being organized in these new conservation regions as multi-stakeholder collaborations for improved conservation science and management.
Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.
2013-01-01
Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170
Tanaka, Arisa; Aoki, Fugaku; Suzuki, Masataka G
2018-05-26
The transformer (tra) gene, which is a female-determining master gene in the housefly Musca domestica, acts as a memory device for sex determination via its auto-regulatory function, i.e., through the contribution of the TRA protein to female-specific splicing of its own pre-mRNA. The TRA protein contains 4 small domains that are specifically conserved among TRA proteins (domains 1-4). Domain 2, also named TRA-CAM domain, is the most conserved, but its function remains unknown. To examine whether these domains are involved in the auto-regulatory function, we performed in vitro splicing assays using a tra minigene containing a partial genomic sequence of the M. domestica tra (Mdtra) gene. Co-transfection of the Mdtra minigene and an MdTRA protein expression vector into cultured insect cells strongly induced female-specific splicing of the minigene. A series of deletion mutation analyses demonstrated that these domains act complementarily to induce female-specific splicing. Domain 1 and the TRA-CAM domain were necessary for the female-specific splicing when the MdTRA protein lacked both domains 3 and 4. In this situation, mutation of the well-conserved 3 amino acids (GEG) in the TRA-CAM domain significantly reduced the female-specific splicing activity of MdTRA. GST-pull down analyses demonstrated that the MdTRA protein specifically enriched on the male-specific exonic region (exon 2b), which contains the putative TRA/TRA-2 binding sites, and that the GEG mutation disrupts this enrichment. Since the MdTRA protein interacts with its own pre-mRNA through TRA-2, our findings suggest that the conserved amino acid residues in the TRA-CAM domain may be crucial for the interaction between MdTRA and TRA-2, enhancing MdTRA recruitment on its pre-mRNA to induce female-specific splicing of tra in the housefly. © 2018 S. Karger AG, Basel.
Simultaneous Multi-Scale Diffusion Estimation and Tractography Guided by Entropy Spectrum Pathways
Galinsky, Vitaly L.; Frank, Lawrence R.
2015-01-01
We have developed a method for the simultaneous estimation of local diffusion and the global fiber tracts based upon the information entropy flow that computes the maximum entropy trajectories between locations and depends upon the global structure of the multi-dimensional and multi-modal diffusion field. Computation of the entropy spectrum pathways requires only solving a simple eigenvector problem for the probability distribution for which efficient numerical routines exist, and a straight forward integration of the probability conservation through ray tracing of the convective modes guided by a global structure of the entropy spectrum coupled with a small scale local diffusion. The intervoxel diffusion is sampled by multi b-shell multi q-angle DWI data expanded in spherical waves. This novel approach to fiber tracking incorporates global information about multiple fiber crossings in every individual voxel and ranks it in the most scientifically rigorous way. This method has potential significance for a wide range of applications, including studies of brain connectivity. PMID:25532167
Shirts, Brian H; Salipante, Stephen J; Casadei, Silvia; Ryan, Shawnia; Martin, Judith; Jacobson, Angela; Vlaskin, Tatyana; Koehler, Karen; Livingston, Robert J; King, Mary-Claire; Walsh, Tom; Pritchard, Colin C
2014-10-01
Single-exon inversions have rarely been described in clinical syndromes and are challenging to detect using Sanger sequencing. We report the case of a 40-year-old woman with adenomatous colon polyps too numerous to count and who had a complex inversion spanning the entire exon 10 in APC (the gene encoding for adenomatous polyposis coli), causing exon skipping and resulting in a frameshift and premature protein truncation. In this study, we employed complete APC gene sequencing using high-coverage next-generation sequencing by ColoSeq, analysis with BreakDancer and SLOPE software, and confirmatory transcript analysis. ColoSeq identified a complex small genomic rearrangement consisting of an inversion that results in translational skipping of exon 10 in the APC gene. This mutation would not have been detected by traditional sequencing or gene-dosage methods. We report a case of adenomatous polyposis resulting from a complex single-exon inversion. Our report highlights the benefits of large-scale sequencing methods that capture intronic sequences with high enough depth of coverage-as well as the use of informatics tools-to enable detection of small pathogenic structural rearrangements.
Jing, Zhaobin; Liu, Zhande
2018-04-01
As one of the largest transcriptional factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, o total of 97 AcWRKY genes have been identified in the kiwifruit genome. An overview of these AcWRKY genes is analyzed, including the phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplication were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.
Li, Xiaonan; Ramchiary, Nirala; Dhandapani, Vignesh; Choi, Su Ryun; Hur, Yoonkang; Nou, Ill-Sup; Yoon, Moo Kyoung; Lim, Yong Pyo
2013-01-01
Brassica rapa is an important crop species that produces vegetables, oilseed, and fodder. Although many studies reported quantitative trait loci (QTL) mapping, the genes governing most of its economically important traits are still unknown. In this study, we report QTL mapping for morphological and yield component traits in B. rapa and comparative map alignment between B. rapa, B. napus, B. juncea, and Arabidopsis thaliana to identify candidate genes and conserved QTL blocks between them. A total of 95 QTL were identified in different crucifer blocks of the B. rapa genome. Through synteny analysis with A. thaliana, B. rapa candidate genes and intronic and exonic single nucleotide polymorphisms in the parental lines were detected from whole genome resequenced data, a few of which were validated by mapping them to the QTL regions. Semi-quantitative reverse transcriptase PCR analysis showed differences in the expression levels of a few genes in parental lines. Comparative mapping identified five key major evolutionarily conserved crucifer blocks (R, J, F, E, and W) harbouring QTL for morphological and yield components traits between the A, B, and C subgenomes of B. rapa, B. juncea, and B. napus. The information of the identified candidate genes could be used for breeding B. rapa and other related Brassica species. PMID:23223793
Claverie, Michel; Dirlewanger, Elisabeth; Bosselut, Nathalie; Van Ghelder, Cyril; Voisin, Roger; Kleinhentz, Marc; Lafargue, Bernard; Abad, Pierre; Rosso, Marie-Noëlle; Chalhoub, Boulos; Esmenjaud, Daniel
2011-01-01
Root-knot nematode (RKN) Meloidogyne species are major polyphagous pests of most crops worldwide, and cultivars with durable resistance are urgently needed because of nematicide bans. The Ma gene from the Myrobalan plum (Prunus cerasifera) confers complete-spectrum, heat-stable, and high-level resistance to RKN, which is remarkable in comparison with the Mi-1 gene from tomato (Solanum lycopersicum), the sole RKN resistance gene cloned. We report here the positional cloning and the functional validation of the Ma locus present at the heterozygous state in the P.2175 accession. High-resolution mapping totaling over 3,000 segregants reduced the Ma locus interval to a 32-kb cluster of three Toll/Interleukin1 Receptor-Nucleotide Binding Site-Leucine-Rich Repeat (LRR) genes (TNL1–TNL3), including a pseudogene (TNL2) and a truncated gene (TNL3). The sole complete gene in this interval (TNL1) was validated as Ma, as it conferred the same complete-spectrum and high-level resistance (as in P.2175) using its genomic sequence and native promoter region in Agrobacterium rhizogenes-transformed hairy roots and composite plants. The full-length cDNA (2,048 amino acids) of Ma is the longest of all Resistance genes cloned to date. Its TNL structure is completed by a huge post-LRR (PL) sequence (1,088 amino acids) comprising five repeated carboxyl-terminal PL exons with two conserved motifs. The amino-terminal region (213 amino acids) of the LRR exon is conserved between alleles and contrasts with the high interallelic polymorphisms of its distal region (111 amino acids) and of PL domains. The Ma gene highlights the importance of these uncharacterized PL domains, which may be involved in pathogen recognition through the decoy hypothesis or in nuclear signaling. PMID:21482634
Thanki, Anil S; Soranzo, Nicola; Haerty, Wilfried; Davey, Robert P
2018-03-01
Gene duplication is a major factor contributing to evolutionary novelty, and the contraction or expansion of gene families has often been associated with morphological, physiological, and environmental adaptations. The study of homologous genes helps us to understand the evolution of gene families. It plays a vital role in finding ancestral gene duplication events as well as identifying genes that have diverged from a common ancestor under positive selection. There are various tools available, such as MSOAR, OrthoMCL, and HomoloGene, to identify gene families and visualize syntenic information between species, providing an overview of syntenic regions evolution at the family level. Unfortunately, none of them provide information about structural changes within genes, such as the conservation of ancestral exon boundaries among multiple genomes. The Ensembl GeneTrees computational pipeline generates gene trees based on coding sequences, provides details about exon conservation, and is used in the Ensembl Compara project to discover gene families. A certain amount of expertise is required to configure and run the Ensembl Compara GeneTrees pipeline via command line. Therefore, we converted this pipeline into a Galaxy workflow, called GeneSeqToFamily, and provided additional functionality. This workflow uses existing tools from the Galaxy ToolShed, as well as providing additional wrappers and tools that are required to run the workflow. GeneSeqToFamily represents the Ensembl GeneTrees pipeline as a set of interconnected Galaxy tools, so they can be run interactively within the Galaxy's user-friendly workflow environment while still providing the flexibility to tailor the analysis by changing configurations and tools if necessary. Additional tools allow users to subsequently visualize the gene families produced by the workflow, using the Aequatus.js interactive tool, which has been developed as part of the Aequatus software project.
Ben Rebeh, Imen; Morinière, Madeleine; Ayadi, Leila; Benzina, Zeineb; Charfedine, Ilhem; Feki, Jamel; Ayadi, Hammadi; Ghorbel, Abdelmonem; Baklouti, Faouzi; Masmoudi, Saber
2010-09-30
Recessive mutations of the myosin VIIA (MYO7A) gene are reported to be responsible for both a deaf-blindness syndrome (Usher type 1B [USH1B] and atypical Usher syndrome) and nonsyndromic hearing loss (HL; Deafness, Neurosensory, Autosomal Recessive 2 [DFNB2]). The existence of DFNB2 is controversial, and often there is no relationship between the type and location of the MYO7A mutations corresponding to the USH1B and DFNB2 phenotype. We investigated the molecular determinant of a mild form of retinopathy in association with a subtle splicing modulation of MYO7A mRNA. Affected members underwent detailed audiologic and ocular characterization. DNA samples from family members were genotyped with polymorphic microsatellite markers. Sequencing of MYO7A was performed. Endogenous lymphoid RNA analysis and a splicing minigene assay were used to study the effect of the c.1935G>A mutation. Funduscopy showed mild retinitis pigmentosa in adults with HL. Microsatellite analysis showed linkage to markers in the region on chromosome 11q13.5. Sequencing of MYO7A revealed a mutation in the last nucleotide of exon 16 (c.1935G>A), which corresponds to a substitution of a methionine to an isoleucine residue at amino acid 645 of the myosin VIIA. However, structural prediction of the molecular model of myosin VIIA shows that this amino acid replacement induces only minor structural changes in the immediate environment of the mutation and thus does not alter the overall native structure. We found that, although predominantly included in mature mRNA, exon 16 is in fact alternatively spliced in control cells and that the mutation at the very last position is associated with a switch toward a predominant exclusion of that exon. This observation was further supported using a splicing minigene transfection assay; the c.1935G>A mutation was found to trigger a partial impairment of the adjacent donor splice site, suggesting that the unique change at the last position of the exon is responsible for the enhanced exon exclusion in this family. This study shows how an exonic mutation that weakens the 5' splice site enhances a minor alternative splicing without abolishing a complete exclusion of the exon and therefore causes a less severe retinitis pigmentosa than the USH1B-associated alleles. It would be interesting to examine a possible correlation between intrafamilial phenotypic variability and the subtle variation in exon 16 inclusion, probably related to genetic background specificities.
Bonasio, Roberto; Li, Qiye; Lian, Jinmin; Mutti, Navdeep S.; Jin, Lijun; Zhao, Hongmei; Zhang, Pei; Wen, Ping; Xiang, Hui; Ding, Yun; Jin, Zonghui; Shen, Steven S.; Wang, Zongji; Wang, Wen; Wang, Jun; Berger, Shelley L.; Liebig, Jürgen; Zhang, Guojie; Reinberg, Danny
2012-01-01
SUMMARY Background Ant societies comprise individuals belonging to different castes characterized by specialized morphologies and behaviors. Because ant embryos can follow different developmental trajectories, epigenetic mechanisms must play a role in caste determination. Ants have a full set of DNA methyltransferase and their genomes contain methylcytosine. To determine the relationship between DNA methylation and phenotypic plasticity in ants, we obtained and compared the genome-wide methylomes of different castes and developmental stages of Camponotus floridanus and Harpegnathos saltator. Results In the ant genomes, methylcytosines are found both in CpG and non-CpG contexts and are strongly enriched at exons of active genes. Changes in exonic DNA methylation correlate with alternative splicing events such as exon skipping and alternative splice site selection. Several genes exhibit caste-specific and developmental changes in DNA methylation that are conserved between the two species, including genes involved in reproduction, telomere maintenance, and noncoding RNA metabolism. Several loci are methylated and expressed monoallelically, and in some cases the choice of methylated allele depends on the caste. Conclusions These first ant methylomes and their intra- and inter-species comparison reveal an exonic methylation pattern that points to a connection between DNA methylation and splicing. The presence of monoallelic DNA methylation and the methylation of non-CpG sites in all samples suggest roles in genome regulation in these social insects, including the intriguing possibility of parental or caste-specific genomic imprinting. PMID:22885060
Complement receptor 1 variants confer protection from severe malaria in Odisha, India.
Panda, Aditya K; Panda, Madhumita; Tripathy, Rina; Pattanaik, Sarit S; Ravindran, Balachandran; Das, Bidyut K
2012-01-01
In Plasmodium falciparum infection, complement receptor-1 (CR1) on erythrocyte's surface and ABO blood group play important roles in formation of rosettes which are presumed to be contributory in the pathogenesis of severe malaria. Although several studies have attempted to determine the association of CR1 polymorphisms with severe malaria, observations remain inconsistent. Therefore, a case control study and meta-analysis was performed to address this issue. Common CR1 polymorphisms (intron 27 and exon 22) and blood group were typed in 353 cases of severe malaria (SM) [97 cerebral malaria (CM), 129 multi-organ dysfunction (MOD), 127 non-cerebral severe malaria (NCSM)], 141 un-complicated malaria and 100 healthy controls from an endemic region of Odisha, India. Relevant publications for meta-analysis were searched from the database. The homozygous polymorphisms of CR1 intron 27 and exon 22 (TT and GG) and alleles (T and G) that are associated with low expression of CR1 on red blood cells, conferred significant protection against CM, MOD and malaria deaths. Combined analysis showed significant association of blood group B/intron 27-AA/exon 22-AA with susceptibility to SM (CM and MOD). Meta-analysis revealed that the CR1 exon 22 low expression polymorphism is significantly associated with protection against severe malaria. The results of the present study demonstrate that common CR1 variants significantly protect against severe malaria in an endemic area.
Robichaux, Jacqulyne P.; Elamin, Yasir Y.; Tan, Zhi; Carter, Brett W.; Zhang, Shuxing; Liu, Shengwu; Li, Shuai; Chen, Ting; Poteete, Alissa; Estrada-Bernal, Adriana; Le, Anh T.; Truini, Anna; Nilsson, Monique B.; Sun, Huiying; Roarty, Emily; Goldberg, Sarah B.; Brahmer, Julie R.; Altan, Mehmet; Lu, Charles; Papadimitrakopoulou, Vassiliki; Politi6, Katerina; Doebele, Robert C.; Wong, Kwok-Kin; Heymach, John V.
2018-01-01
Although most activating mutations of epidermal growth factor receptor (EGFR)-mutant non–small cell lung cancers (NSCLCs) are sensitive to available EGFR tyrosine kinase inhibitors (TKIs), a subset with alterations in exon 20 of EGFR and HER2 are intrinsically resistant and lack an effective therapy. We used in silico, in vitro, and in vivo testing to model structural alterations induced by exon 20 mutations and to identify effective inhibitors. 3D modeling indicated alterations restricted the size of the drug-binding pocket, limiting the binding of large, rigid inhibitors. We found that poziotinib, owing to its small size and flexibility, can circumvent these steric changes and is a potent inhibitor of the most common EGFR and HER2 exon 20 mutants. Poziotinib demonstrated greater activity than approved EGFR TKIs in vitro and in patient-derived xenograft models of EGFR or HER2 exon 20 mutant NSCLC and in genetically engineered mouse models of NSCLC. In a phase 2 trial, the first 11 patients with NSCLC with EGFR exon 20 mutations receiving poziotinib had a confirmed objective response rate of 64%. These data identify poziotinib as a potent, clinically active inhibitor of EGFR and HER2 exon 20 mutations and illuminate the molecular features of TKIs that may circumvent steric changes induced by these mutations. PMID:29686424
Saturation mutagenesis reveals manifold determinants of exon definition.
Ke, Shengdong; Anquetil, Vincent; Zamalloa, Jorge Rojas; Maity, Alisha; Yang, Anthony; Arias, Mauricio A; Kalachikov, Sergey; Russo, James J; Ju, Jingyue; Chasin, Lawrence A
2018-01-01
To illuminate the extent and roles of exonic sequences in the splicing of human RNA transcripts, we conducted saturation mutagenesis of a 51-nt internal exon in a three-exon minigene. All possible single and tandem dinucleotide substitutions were surveyed. Using high-throughput genetics, 5560 minigene molecules were assayed for splicing in human HEK293 cells. Up to 70% of mutations produced substantial (greater than twofold) phenotypes of either increased or decreased splicing. Of all predicted secondary structural elements, only a single 15-nt stem-loop showed a strong correlation with splicing, acting negatively. The in vitro formation of exon-protein complexes between the mutant molecules and proteins associated with spliceosome formation (U2AF35, U2AF65, U1A, and U1-70K) correlated with splicing efficiencies, suggesting exon definition as the step affected by most mutations. The measured relative binding affinities of dozens of human RNA binding protein domains as reported in the CISBP-RNA database were found to correlate either positively or negatively with splicing efficiency, more than could fit on the 51-nt test exon simultaneously. The large number of these functional protein binding correlations point to a dynamic and heterogeneous population of pre-mRNA molecules, each responding to a particular collection of binding proteins. © 2018 Ke et al.; Published by Cold Spring Harbor Laboratory Press.
Veenstra, Jan A; Khammassi, Hela
2017-04-01
RYamides are arthropod neuropeptides with unknown function. In 2011 two RYamides were isolated from D. melanogaster as the ligands for the G-protein coupled receptor CG5811. The D. melanogaster gene encoding these neuropeptides is highly unusual, as there are four RYamide encoding exons in the current genome assembly, but an exon encoding a signal peptide is absent. Comparing the D. melanogaster gene structure with those from other species, including D. virilis, suggests that the gene is degenerating. RNAseq data from 1634 short sequence read archives at NCBI containing more than 34 billion spots yielded numerous individual spots that correspond to the RYamide encoding exons, of which a large number include the intron-exon boundary at the start of this exon. Although 72 different sequences have been spliced onto this RYamide encoding exon, none codes for the signal peptide of this gene. Thus, the RNAseq data for this gene reveal only noise and no signal. The very small quantities of peptide recovered during isolation and the absence of credible RNAseq data, indicates that the gene is very little expressed, while the RYamide gene structure in D. melanogaster suggests that it might be evolving into a pseudogene. Yet, the identification of the peptides it encodes clearly shows it is still functional. Using region specific antisera, we could localize numerous neurons and enteroendocrine cells in D. willistoni, D. virilis and D. pseudoobscura, but only two adult abdominal neurons in D. melanogaster. Those two neurons project to and innervate the rectal papillae, suggesting that RYamides may be involved in the regulation of water homeostasis. Copyright © 2017 Elsevier Ltd. All rights reserved.
Molecular evolution of the crustacean hyperglycemic hormone family in ecdysozoans
2010-01-01
Background Crustacean Hyperglycemic Hormone (CHH) family peptides are neurohormones known to regulate several important functions in decapod crustaceans such as ionic and energetic metabolism, molting and reproduction. The structural conservation of these peptides, together with the variety of functions they display, led us to investigate their evolutionary history. CHH family peptides exist in insects (Ion Transport Peptides) and may be present in all ecdysozoans as well. In order to extend the evolutionary study to the entire family, CHH family peptides were thus searched in taxa outside decapods, where they have been, to date, poorly investigated. Results CHH family peptides were characterized by molecular cloning in a branchiopod crustacean, Daphnia magna, and in a collembolan, Folsomia candida. Genes encoding such peptides were also rebuilt in silico from genomic sequences of another branchiopod, a chelicerate and two nematodes. These sequences were included in updated datasets to build phylogenies of the CHH family in pancrustaceans. These phylogenies suggest that peptides found in Branchiopoda and Collembola are more closely related to insect ITPs than to crustacean CHHs. Datasets were also used to support a phylogenetic hypothesis about pancrustacean relationships, which, in addition to gene structures, allowed us to propose two evolutionary scenarios of this multigenic family in ecdysozoans. Conclusions Evolutionary scenarios suggest that CHH family genes of ecdysozoans originate from an ancestral two-exon gene, and genes of arthropods from a three-exon one. In malacostracans, the evolution of the CHH family has involved several duplication, insertion or deletion events, leading to neuropeptides with a wide variety of functions, as observed in decapods. This family could thus constitute a promising model to investigate the links between gene duplications and functional divergence. PMID:20184761
2014-01-01
Background The forelimb-specific gene tbx5 is highly conserved and essential for the development of forelimbs in zebrafish, mice, and humans. Amongst birds, a single order, Dinornithiformes, comprising the extinct wingless moa of New Zealand, are unique in having no skeletal evidence of forelimb-like structures. Results To determine the sequence of tbx5 in moa, we used a range of PCR-based techniques on ancient DNA to retrieve all nine tbx5 exons and splice sites from the giant moa, Dinornis. Moa Tbx5 is identical to chicken Tbx5 in being able to activate the downstream promotors of fgf10 and ANF. In addition we show that missexpression of moa tbx5 in the hindlimb of chicken embryos results in the formation of forelimb features, suggesting that Tbx5 was fully functional in wingless moa. An alternatively spliced exon 1 for tbx5 that is expressed specifically in the forelimb region was shown to be almost identical between moa and ostrich, suggesting that, as well as being fully functional, tbx5 is likely to have been expressed normally in moa since divergence from their flighted ancestors, approximately 60 mya. Conclusions The results suggests that, as in mice, moa tbx5 is necessary for the induction of forelimbs, but is not sufficient for their outgrowth. Moa Tbx5 may have played an important role in the development of moa’s remnant forelimb girdle, and may be required for the formation of this structure. Our results further show that genetic changes affecting genes other than tbx5 must be responsible for the complete loss of forelimbs in moa. PMID:24885927
Nascimento, Diana S; do Vale, Ana; Tomás, Ana M; Zou, Jun; Secombes, Christopher J; dos Santos, Nuno M S
2007-03-01
Interleukin-12 (IL-12) is a heterodimeric cytokine pivotal in resistance to microbial and viral infections. In the search for immunoregulatory genes in sea bass the genes for the two IL-12 subunits p40 and p35 were cloned and sequenced. Molecular characterization of these two genes was performed at both the cDNA and genomic levels. Sea bass IL-12 p40 and p35 conserve most cysteines involved in the intra-chain disulfide bonds of human IL-12 subunits as well as the important structural residues for human IL-12 heterodimerization. The gene organization of sea bass IL-12 p40 is similar to the human orthologue, whilst the sea bass IL-12 p35 gene structure, as reported for pufferfish, differs from the human one in containing an additional exon and lacking a second copy of a duplicated exon present in the mammalian genes. The promoter analysis of both sea bass and pufferfish IL-12 genes showed the presence of the main cis-acting elements involved in the transcriptional regulation of human and mouse orthologues. The involvement of IL-12 in sea bass anti-bacterial immune responses was demonstrated by investigating the expression profiles of IL-1beta, IL-12 p40 and p35 in the head-kidney and spleen following intraperitoneal injection of UV-killed and live Photobacterium damselae ssp. piscicida (Phdp). Finally, the importance of nuclear factor (NF)-kappaB on UV-killed Phdp-induced IL-12 p40 and p35 gene transcription was shown by the use of pyrrolidine dithiocarbamate (PDTC).
Huang, Wei; Zhang, Jianshe; Liao, Zhi; Lv, Zhenming; Wu, Huifei; Zhu, Aiyi; Wu, Changwen
2016-01-15
Gonadotropin-releasing hormone III (GnRH3) is considered to be a key neurohormone in fish reproduction control. In the present study, the cDNA and genomic sequences of GnRH3 were cloned and characterized from large yellow croaker Larimichthys crocea. The cDNA encoded a protein of 99 amino acids with four functional motifs. The full-length genome sequence was composed of 3797 nucleotides, including four exons and three introns. Higher identities of amino acid sequences and conserved exon-intron organizations were found between LcGnRH3 and other GnRH3 genes. In addition, some special features of the sequences were detected in partial species. For example, two specific residues (V and A) were found in the family Sciaenidae, and the unique 75-72 bp type of the open reading frame 2 and 3 existed in the family Cyprinidae. Analysis of the 2576 bp promoter fragment of LcGnRH3 showed a number of transcription factor binding sites, such as AP1, CREB, GATA-1, HSF, FOXA2, and FOXL1. Promoter functional analysis using an EGFP reporter fusion in zebrafish larvae presented positive signals in the brain, including the olfactory region, the terminal nerve ganglion, the telencephalon, and the hypothalamus. The expression pattern was generally consistent with the endogenous GnRH3 GFP-expressing transgenic zebrafish lines, but the details were different. These results indicate that the structure and function of LcGnRH3 are generally similar to the other teleost GnRH3 genes, but there exist some distinctions among them. Copyright © 2015 Elsevier B.V. All rights reserved.
Xia, Jun Hong; Li, Hong Lian; Li, Bi Jun; Gu, Xiao Hui; Lin, Hao Ran
2018-01-10
Hypoxia is one of the critical environmental stressors for fish in aquatic environments. Although accumulating evidences indicate that gene expression is regulated by hypoxia stress in fish, how genes undergoing differential gene expression and/or alternative splicing (AS) in response to hypoxia stress in heart are not well understood. Using RNA-seq, we surveyed and detected 289 differential expressed genes (DEG) and 103 genes that undergo differential usage of exons and splice junctions events (DUES) in heart of a hypoxia tolerant fish, Nile tilapia, Oreochromis niloticus following 12h hypoxic treatment. The spatio-temporal expression analysis validated the significant association of differential exon usages in two randomly selected DUES genes (fam162a and ndrg2) in 5 tissues (heart, liver, brain, gill and spleen) sampled at three time points (6h, 12h, and 24h) under acute hypoxia treatment. Functional analysis significantly associated the differential expressed genes with the categories related to energy conservation, protein synthesis and immune response. Different enrichment categories were found between the DEG and DUES dataset. The Isomerase activity, Oxidoreductase activity, Glycolysis and Oxidative stress process were significantly enriched for the DEG gene dataset, but the Structural constituent of ribosome and Structural molecule activity, Ribosomal protein and RNA binding protein were significantly enriched only for the DUES genes. Our comparative transcriptomic analysis reveals abundant stress responsive genes and their differential regulation function in the heart tissues of Nile tilapia under acute hypoxia stress. Our findings will facilitate future investigation on transcriptome complexity and AS regulation during hypoxia stress in fish. Copyright © 2017 Elsevier B.V. All rights reserved.
Functional and comparative genomics analyses of pmp22 in medaka fish
Itou, Junji; Suyama, Mikita; Imamura, Yukio; Deguchi, Tomonori; Fujimori, Kazuhiro; Yuba, Shunsuke; Kawarabayasi, Yutaka; Kawasaki, Takashi
2009-01-01
Background Pmp22, a member of the junction protein family Claudin/EMP/PMP22, plays an important role in myelin formation. Increase of pmp22 transcription causes peripheral neuropathy, Charcot-Marie-Tooth disease type1A (CMT1A). The pathophysiological phenotype of CMT1A is aberrant axonal myelination which induces a reduction in nerve conduction velocity (NCV). Several CMT1A model rodents have been established by overexpressing pmp22. Thus, it is thought that pmp22 expression must be tightly regulated for correct myelin formation in mammals. Interestingly, the myelin sheath is also present in other jawed vertebrates. The purpose of this study is to analyze the evolutionary conservation of the association between pmp22 transcription level and vertebrate myelin formation, and to find the conserved non-coding sequences for pmp22 regulation by comparative genomics analyses between jawed fishes and mammals. Results A transgenic pmp22 over-expression medaka fish line was established. The transgenic fish had approximately one fifth the peripheral NCV values of controls, and aberrant myelination of transgenic fish in the peripheral nerve system (PNS) was observed. We successfully confirmed that medaka fish pmp22 has the same exon-intron structure as mammals, and identified some known conserved regulatory motifs. Furthermore, we found novel conserved sequences in the first intron and 3'UTR. Conclusion Medaka fish undergo abnormalities in the PNS when pmp22 transcription increases. This result indicates that an adequate pmp22 transcription level is necessary for correct myelination of jawed vertebrates. Comparison of pmp22 orthologs between distantly related species identifies evolutionary conserved sequences that contribute to precise regulation of pmp22 expression. PMID:19534778
A Mayan founder mutation is a common cause of deafness in Guatemala.
Carranza, C; Menendez, I; Herrera, M; Castellanos, P; Amado, C; Maldonado, F; Rosales, L; Escobar, N; Guerra, M; Alvarez, D; Foster, J; Guo, S; Blanton, S H; Bademci, G; Tekin, M
2015-09-08
Over 5% of the world's population has varying degrees of hearing loss. Mutations in GJB2 are the most common cause of autosomal recessive non-syndromic hearing loss (ARNHL) in many populations. The frequency and type of mutations are influenced by ethnicity. Guatemala is a multi-ethnic country with four major populations: Maya, Ladino, Xinca, and Garifuna. To determine the mutation profile of GJB2 in a ARNHL population from Guatemala, we sequenced both exons of GJB2 in 133 unrelated families. A total of six pathogenic variants were detected. The most frequent pathogenic variant is c.131G>A (p.Trp44*) detected in 21 of 266 alleles. We show that c.131G>A is associated with a conserved haplotype in Guatemala suggesting a single founder. The majority of Mayan population lives in the west region of the country from where all c.131G>A carriers originated. Further analysis of genome-wide variation of individuals carrying the c.131G>A mutation compared with those of Native American, European, and African populations shows a close match with the Mayan population. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Tompkins, Joshua D.; Jung, Marc; Chen, Chang-yi; Lin, Ziguang; Ye, Jingjing; Godatha, Swetha; Lizhar, Elizabeth; Wu, Xiwei; Hsu, David; Couture, Larry A.; Riggs, Arthur D.
2016-01-01
The directed differentiation of human cardiomyocytes (CMs) from pluripotent cells provides an invaluable model for understanding mechanisms of cell fate determination and offers considerable promise in cardiac regenerative medicine. Here, we utilize a human embryonic stem cell suspension bank, produced according to a good manufacturing practice, to generate CMs using a fully defined and small molecule-based differentiation strategy. Primitive and cardiac mesoderm purification was used to remove non-committing and multi-lineage populations and this significantly aided the identification of key transcription factors, lncRNAs, and essential signaling pathways that define cardiomyogenesis. Global methylation profiles reflect CM development and we report on CM exon DNA methylation “memories” persisting beyond transcription repression and marking the expression history of numerous developmentally regulated genes, especially transcription factors. PMID:26981572
Hochbach, Anne; Schneider, Julia; Röser, Martin
2015-06-01
To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 shows the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.
Chromosomal localization and cDNA cloning of the human DBP and TEF genes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Khatib, Z.A.; Inaba, T.; Valentine, M.
1994-09-15
The authors have isolated cDNA and genomic clones and determined the human chromosome positions of two genes encoding transcription factors expressed in the liver and the pituitary gland: albumin D-site-binding protein (DBP) and thyrotroph embryonic factor (TEF). Both proteins have been identified as members of the PAR (proline and acidic amino acid-rich) subfamily of bZIP transcription factors in the rat, but human homologues have not been characterized. Using a fluorescence in situ hybridization technique, the DBP locus was assigned to chromosome 19q13, and TEF to chromosome 22q13. Each assignment was confirmed by means of human chromosome segregation in somatic cellmore » hybrids. Coding sequences of DBP and TEF, extending beyond the bZIP domain to the PAR region, were highly conserved in both human-human and interspecies comparisons. Conservation of the exon-intron boundaries of each bZIP domain-encoding exon suggested derivation from a common ancestral gene. DBP and TEF mRNAs were expressed in all tissues and cell lines examined, including brain, lung, liver, spleen, and kidney. Knowledge of the human chromosome locations of these PAR proteins will facilitate studies to assess their involvement in carcinogenesis and other fundamental biological processes. 37 refs., 5 figs., 1 tab.« less
Bahnsen, U; Oosting, P; Swaab, D F; Nahke, P; Richter, D; Schmale, H
1992-01-01
Familial neurohypophyseal diabetes insipidus in humans is a rare disease transmitted as an autosomal dominant trait. Affected individuals have very low or undetectable levels of circulating vasopressin and suffer from polydipsia and polyuria. An obvious candidate gene for the disease is the vasopressin-neurophysin (AVP-NP) precursor gene on human chromosome 20. The 2 kb gene with three exons encodes a composite precursor protein consisting of the neuropeptide vasopressin and two associated proteins, neurophysin and a glycopeptide. Cloning and nucleotide sequence analysis of both alleles of the AVP-NP gene present in a Dutch ADNDI family reveals a point mutation in one allele of the affected family members. Comparison of the nucleotide sequences shows a G----T transversion within the neurophysin-encoding exon B. This missense mutation converts a highly conserved glycine (Gly17 of neurophysin) to a valine residue. RFLP analysis of six related family members indicates cosegregation of the mutant allele with the DI phenotype. The mutation is not present in 96 chromosomes of an unrelated control group. These data suggest that a single amino acid exchange within a highly conserved domain of the human vasopressin-associated neurophysin is the primary cause of one form of ADNDI. Images PMID:1740104
Alternative splicing and promoter use in TFII-I genes.
Makeyev, Aleksandr V; Bayarsaihan, Dashzeveg
2009-03-15
TFII-I proteins are ubiquitously expressed transcriptional factors involved in both basal transcription and signal transduction activation or repression. TFII-I proteins are detected as early as at two-cell stage and exhibit distinct and dynamic expression patterns in developing embryos as well as mark regional variation in the adult mouse brain. Analysis of atypical small and rare chromosomal deletions at 7q11.23 points to TFII-I genes (GTF2I and GTF2IRD1) as the prime candidates responsible for craniofacial and cognitive abnormalities in the Williams-Beuren syndrome. TFII-I genes are often subjected to alternative splicing, which generates isoforms that show different activities and play distinct biological roles. The coding regions of TFII-I genes are composed of more than 30 exons and are well conserved among vertebrates. However, their 5' untranslated regions are not as well conserved and all poorly characterized. In the present work, we analyzed promoter regions of TFII-I genes and described their additional exons, as well as tested tissue specificity of both previously reported and novel alternatively spliced isoforms. Our comprehensive analysis leads to further elucidation of the functional heterogeneity of TFII-I proteins, provides hints on search for regulatory pathways governing their expression, and opens up possibilities for examining the effect of different haplotypes on their promoter functions.
Expression of SMARCB1 (INI1) mutations in familial schwannomatosis.
Smith, Miriam J; Walker, James A; Shen, Yiping; Stemmer-Rachamimov, Anat; Gusella, James F; Plotkin, Scott R
2012-12-15
Genetic changes in the SMARCB1 tumor suppressor gene have recently been reported in tumors and blood from families with schwannomatosis. Exon scanning of all nine SMARCB1 exons in genomic DNA from our cohort of families meeting the criteria for 'definite' or 'presumptive' schwannomatosis previously revealed constitutional alterations in 13 of 19 families (68%). Screening of four new familial schwannomatosis probands identified one additional constitutional alteration. We confirmed the presence of mRNA transcripts for two missense alterations, four mutations of conserved splice motifs and two additional mutations, in less conserved sequences, which also affect splicing. Furthermore, we found that transcripts for a rare 3'-untranslated region (c.*82C > T) alteration shared by four unrelated families did not produce splice variants but did show unequal allelic expression, suggesting that the alteration is either causative itself or linked to an unidentified causative mutation. Overexpression studies in cells lacking SMARCB1 suggest that mutant SMARCB1 proteins, like wild-type SMARCB1 protein, retain the ability to suppress cyclin D1 activity. These data, together with the expression of SMARCB1 protein in a proportion of cells from schwannomatosis-related schwannomas, suggest that these tumors develop through a mechanism that is distinct from that of rhabdoid tumors in which SMARCB1 protein is completely absent in tumor cells.
The evolution of milk casein genes from tooth genes before the origin of mammals.
Kawasaki, Kazuhiko; Lafont, Anne-Gaelle; Sire, Jean-Yves
2011-07-01
Caseins are among cardinal proteins that evolved in the lineage leading to mammals. In milk, caseins and calcium phosphate (CaP) form a huge complex called casein micelle. By forming the micelle, milk maintains high CaP concentrations, which help altricial mammalian neonates to grow bone and teeth. Two types of caseins are known. Ca-sensitive caseins (α(s)- and β-caseins) bind Ca but precipitate at high Ca concentrations, whereas Ca-insensitive casein (κ-casein) does not usually interact with Ca but instead stabilizes the micelle. Thus, it is thought that these two types of caseins are both necessary for stable micelle formation. Both types of caseins show high substitution rates, which make it difficult to elucidate the evolution of caseins. Yet, recent studies have revealed that all casein genes belong to the secretory calcium-binding phosphoprotein (SCPP) gene family that arose by gene duplication. In the present study, we investigated exon-intron structures and phylogenetic distributions of casein and other SCPP genes, particularly the odontogenic ameloblast-associated (ODAM) gene, the SCPP-Pro-Gln-rich 1 (SCPPPQ1) gene, and the follicular dendritic cell secreted peptide (FDCSP) gene. The results suggest that contemporary Ca-sensitive casein genes arose from a putative common ancestor, which we refer to as CSN1/2. The six putative exons comprising CSN1/2 are all found in SCPPPQ1, although ODAM also shares four of these exons. By contrast, the five exons of the Ca-insensitive casein gene are all reminiscent of FDCSP. The phylogenetic distribution of these genes suggests that both SCPPPQ1 and FDCSP arose from ODAM. We thus argue that all casein genes evolved from ODAM via two different pathways; Ca-sensitive casein genes likely originated directly from SCPPPQ1, whereas the Ca-insensitive casein genes directly differentiated from FDCSP. Further, expression of ODAM, SCPPPQ1, and FDCSP was detected in dental tissues, supporting the idea that both types of caseins evolved as Ca-binding proteins. Based on these findings, we propose two alternative hypotheses for micelle formation in primitive milk. The conserved biochemical characteristics in caseins and their immediate ancestors also suggest that many slight genetic modifications have created modern caseins, proteins vital to the sustained success of mammals.
NASA Astrophysics Data System (ADS)
Terando, A. J.; Collazo, J.
2017-12-01
Boundary organizations, entities that facilitate the co-production and translation of scientific research in decision making processes, have been promoted as a means to assist global change adaptation, particularly in the areas of landscape conservation and natural resource management. However, scientists can and often still must perform a similar role and act as anchoring agents within wicked adaptation problems that involve a myriad of actors, values, scientific uncertainties, governance structures, and multidisciplinary research needs. We illustrate one such case study in Puerto Rico's Bosque Modelo (Model Forest) where we discuss an ongoing scientific effort to undertake a multi-objective landscape conservation design project that intersects with the Bosque Modelo geography and goals. Perspectives are provided from two research ecologists, one with a background in terrestrial ecology who has worked at the intersection of science, conservation, and government for over 30 years, and the other with a multi-disciplinary background in earth sciences, climatology, and terrestrial ecology. We frame our discussion around the learning process that accompanies the development of global change scenarios that are both useful and useable for a wide spectrum of scientists, and the likelihood that scientifically informed adaptive management actions will ultimately be implemented in this complex and changing landscape.
[Genome-wide identification and expression analysis of the WRKY gene family in peach].
Gu, Yan-bing; Ji, Zhi-rui; Chi, Fu-mei; Qiao, Zhuang; Xu, Cheng-nan; Zhang, Jun-xiang; Zhou, Zong-shan; Dong, Qing-long
2016-03-01
The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.
The Evolution of the Secreted Regulatory Protein Progranulin.
Palfree, Roger G E; Bennett, Hugh P J; Bateman, Andrew
2015-01-01
Progranulin is a secreted growth factor that is active in tumorigenesis, wound repair, and inflammation. Haploinsufficiency of the human progranulin gene, GRN, causes frontotemporal dementia. Progranulins are composed of chains of cysteine-rich granulin modules. Modules may be released from progranulin by proteolysis as 6kDa granulin polypeptides. Both intact progranulin and some of the granulin polypeptides are biologically active. The granulin module occurs in certain plant proteases and progranulins are present in early diverging metazoan clades such as the sponges, indicating their ancient evolutionary origin. There is only one Grn gene in mammalian genomes. More gene-rich Grn families occur in teleost fish with between 3 and 6 members per species including short-form Grns that have no tetrapod counterparts. Our goals are to elucidate progranulin and granulin module evolution by investigating (i): the origins of metazoan progranulins (ii): the evolutionary relationships between the single Grn of tetrapods and the multiple Grn genes of fish (iii): the evolution of granulin module architectures of vertebrate progranulins (iv): the conservation of mammalian granulin polypeptide sequences and how the conserved granulin amino acid sequences map to the known three dimensional structures of granulin modules. We report that progranulin-like proteins are present in unicellular eukaryotes that are closely related to metazoa suggesting that progranulin is among the earliest extracellular regulatory proteins still employed by multicellular animals. From the genomes of the elephant shark and coelacanth we identified contemporary representatives of a precursor for short-from Grn genes of ray-finned fish that is lost in tetrapods. In vertebrate Grns pathways of exon duplication resulted in a conserved module architecture at the amino-terminus that is frequently accompanied by an unusual pattern of tandem nearly identical module repeats near the carboxyl-terminus. Polypeptide sequence conservation of mammalian granulin modules identified potential structure-activity relationships that may be informative in designing progranulin based therapeutics.
The Evolution of the Secreted Regulatory Protein Progranulin
Palfree, Roger G. E.; Bennett, Hugh P. J.; Bateman, Andrew
2015-01-01
Progranulin is a secreted growth factor that is active in tumorigenesis, wound repair, and inflammation. Haploinsufficiency of the human progranulin gene, GRN, causes frontotemporal dementia. Progranulins are composed of chains of cysteine-rich granulin modules. Modules may be released from progranulin by proteolysis as 6kDa granulin polypeptides. Both intact progranulin and some of the granulin polypeptides are biologically active. The granulin module occurs in certain plant proteases and progranulins are present in early diverging metazoan clades such as the sponges, indicating their ancient evolutionary origin. There is only one Grn gene in mammalian genomes. More gene-rich Grn families occur in teleost fish with between 3 and 6 members per species including short-form Grns that have no tetrapod counterparts. Our goals are to elucidate progranulin and granulin module evolution by investigating (i): the origins of metazoan progranulins (ii): the evolutionary relationships between the single Grn of tetrapods and the multiple Grn genes of fish (iii): the evolution of granulin module architectures of vertebrate progranulins (iv): the conservation of mammalian granulin polypeptide sequences and how the conserved granulin amino acid sequences map to the known three dimensional structures of granulin modules. We report that progranulin-like proteins are present in unicellular eukaryotes that are closely related to metazoa suggesting that progranulin is among the earliest extracellular regulatory proteins still employed by multicellular animals. From the genomes of the elephant shark and coelacanth we identified contemporary representatives of a precursor for short-from Grn genes of ray-finned fish that is lost in tetrapods. In vertebrate Grns pathways of exon duplication resulted in a conserved module architecture at the amino-terminus that is frequently accompanied by an unusual pattern of tandem nearly identical module repeats near the carboxyl-terminus. Polypeptide sequence conservation of mammalian granulin modules identified potential structure-activity relationships that may be informative in designing progranulin based therapeutics. PMID:26248158
Schwannomatosis associated with multiple meningiomas due to a familial SMARCB1 mutation.
Bacci, Costanza; Sestini, Roberta; Provenzano, Aldesia; Paganini, Irene; Mancini, Irene; Porfirio, Berardino; Vivarelli, Rossella; Genuardi, Maurizio; Papi, Laura
2010-02-01
Schwannomatosis (MIM 162091) is a condition predisposing to the development of central and peripheral schwannomas; most cases are sporadic without a clear family history but a few families with a clear autosomal dominant pattern of transmission have been described. Germline mutations in SMARCB1 are associated with schwannomatosis. We report a family with multiple schwannomas and meningiomas. A SMARCB1 germline mutation in exon 1 was identified. The mutation, c.92A>T (p.Glu31Val), occurs in a highly conserved amino acid in the SMARCB1 protein. In addition, in silico analysis demonstrated that the mutation disrupts the donor consensus sequence of exon 1. RNA studies verified the absence of mRNA transcribed by the mutant allele. This is the first report of a SMARCB1 germline mutation in a family with schwannomatosis characterized by the development of multiple meningiomas.
Further confirmation of the MED13L haploinsufficiency syndrome.
van Haelst, Mieke M; Monroe, Glen R; Duran, Karen; van Binsbergen, Ellen; Breur, Johannes M; Giltay, Jacques C; van Haaften, Gijs
2015-01-01
MED13L haploinsufficiency syndrome has been described in two patients and is characterized by moderate intellectual disability (ID), conotruncal heart defects, facial abnormalities and hypotonia. Missense mutations in MED13L are linked to transposition of the great arteries and non-syndromal intellectual disability. Here we describe two novel patients with de novo MED13L aberrations. The first patient has a de novo mutation in the splice acceptor site of exon 5 of MED13L. cDNA analysis showed this mutation results in an in-frame deletion, removing 15 amino acids in middle of the conserved MED13L N-terminal domain. The second patient carries a de novo deletion of exons 6-20 of MED13L. Both patients show features of the MED13L haploinsufficiency syndrome, except for the heart defects, thus further confirming the existence of the MED13L haploinsufficiency syndrome.
Escher, Pascal; Passarin, Olga; Munier, Francis L; Tran, Viet H; Vaclavik, Veronika
2018-01-01
To expand the genotype/phenotype correlations in patients with autosomal dominant retinitis pigmentosa (adRP) harboring PRPF8 variants. Two patients, a father and his daughter, harboring a novel p.PRPF8-Glu2331* variant, underwent ophthalmic examination at 3-year-interval, including fundus photography, fundus autofluorescence, optical coherence tomography, and ISCEV standard full field ERGs. All reported disease-causing PRPF8 variants were collected and localized in the PRPF8 and PRPF8/SNRNP200 protein structures. The p.PRPF8-Glu2331* variant results in a truncated PRPF8 protein lacking the last five C-terminal amino acids and caused in the two patients a severe clinical phenotype, with the macula being affected from the second decade on. All but two adRP-linked variants are located in the last exon 43 encoding the C-terminal tail of the C-terminal PRPF8 Jab1 domain. The p.PRPF8-Ser2118Phe and -Asn2280Lys variants encoded by exons 39 and 42, respectively, are located at the basis of the C-terminal tail. Frame-shift mutations and nonconservative amino acid changes in PRPF8 typically cause severe clinical phenotypes. The conservative missense variant p.PRPF8-Arg2310Lys that is not altering the global charge of the C-terminal tail, and variants located at the basis of the C-terminal tail show milder clinical phenotypes, in accordance with functional data on PRPF8/SNRNP200 interactions in yeast.
Molecular evaluation of five cardiac genes in Doberman Pinschers with dilated cardiomyopathy.
Meurs, Kathryn M; Hendrix, Kristina P; Norgard, Michelle M
2008-08-01
To sequence the exonic and splice site regions of 5 cardiac genes associated with the human form of familial dilated cardiomyopathy (DCM) in Doberman Pinschers with DCM and to identify a causative mutation. 5 unrelated Doberman Pinschers with DCM and 2 unaffected Labrador Retrievers (control dogs). Exonic and splice site regions of the 5 genes encoding the cardiac proteins troponin C, lamin A/C, cysteine- and glycine-rich protein 3, cardiac troponin T, and the beta-myosin heavy chain were sequenced. Sequences were compared for nucleotide changes between affected dogs and the published canine sequences and 2 control dogs. Base pair changes were considered to be causative for DCM if they were present in an affected dog but not in the control dogs or published sequences and if they involved a conserved amino acid and changed that amino acid to a different polarity, acid-base status, or structure. A causative mutation for DCM in Doberman Pinschers was not identified, although single nucleotide polymorphisms were detected in some dogs in the cysteine- and glycine-rich protein 3, beta-myosin heavy chain, and troponin T genes. Mutations in 5 of the cardiac genes associated with the development of DCM in humans did not appear to be causative for DCM in Doberman Pinschers. Continued evaluation of additional candidate genes or a focused approach with an association analysis is warranted to elucidate the molecular cause of this important cardiac disease in Doberman Pinschers.
Oda, Akifumi; Nakayoshi, Tomoki; Fukuyoshi, Shuichi; Kurimoto, Eiji; Takahashi, Ohgi
2018-07-01
Recently, non-enzymatic stereoinversions of aspartic acid (Asp) residues in proteins and peptides have been reported. Here, we performed replica exchange molecular dynamics (REMD) simulations of model peptides (exon 6, 26A-1, and 26A-2) extracted from elastin to investigate their structural features, thereby revealing the factor that influences stereoinversions. For REMD trajectories, we calculated distances between carboxyl carbon in Asp and amide nitrogen in the (n + 1) residue (CN distances). Because bond formation between carbon and nitrogen is indispensable to the formation of a succinimide intermediate the distance between them seems to play an important role in stereoinversion. Moreover, we calculated polar surface areas (PSAs) for the trajectories, finding that CN distances and PSA were different for each peptide, with the longest CN distance and smallest PSA observed for exon 6 peptide, where stereoinversion of Asp is the slowest. Although the average CN distance was shorter for exon 26A-1 peptide than for exon 26A-2 peptide, the number of conformations with CN distances <3.0 Å was greater for exon 26A-2 peptide than for exon 26A-1 peptide. Furthermore, PSA for amide nitrogen of the (n + 1) residue was larger for exon 26A-2 peptide than for exon 26A-1 peptide. These results indicated that the flexibility of Asp and (n + 1) residues and hydrophilicity of peptides, especially in the (n + 1) residue, play important roles in the stereoinversion of Asp. This article is part of a Special Issue entitled: D-Amino acids: biology in the mirror, edited by Dr. Loredano Pollegioni, Dr. Jean-Pierre Mothet and Dr. Molla Gianluca. Copyright © 2018 Elsevier B.V. All rights reserved.
Margaglione, M; Santacroce, R; Colaizzo, D; Seripa, D; Vecchione, G; Lupone, M R; De Lucia, D; Fortina, P; Grandone, E; Perricone, C; Di Minno, G
2000-10-01
Congenital afibrinogenemia is a rare autosomal recessive disorder characterized by a hemorrhagic diathesis of variable severity. Although more than 100 families with this disorder have been described, genetic defects have been characterized in few cases. An investigation of a young propositus, offspring of a consanguineous marriage, with undetectable levels of functional and quantitative fibrinogen, was conducted. Sequence analysis of the fibrinogen genes showed a homozygous G-to-A mutation at the fifth nucleotide (nt 2395) of the third intervening sequence (IVS) of the gamma-chain gene. Her first-degree relatives, who had approximately half the normal fibrinogen values and showed concordance between functional and immunologic levels, were heterozygtes. The G-to-A change predicts the disappearance of a donor splice site. After transfection with a construct, containing either the wild-type or the mutated sequence, cells with the mutant construct showed an aberrant messenger RNA (mRNA), consistent with skipping of exon 3, but not the expected mRNA. Sequencing of the abnormal mRNA showed the complete absence of exon 3. Skipping of exon 3 predicts the deletion of amino acid sequence from residue 16 to residue 75 and shifting of reading frame at amino acid 76 with a premature stop codon within exon 4 at position 77. Thus, the truncated gamma-chain gene product would not interact with other chains to form the mature fibrinogen molecule. The current findings show that mutations within highly conserved IVS regions of fibrinogen genes could affect the efficiency of normal splicing, giving rise to congenital afibrinogenemia.
Chen, Hao; Kshirsagar, Sarika; Jensen, Ingvill; Lau, Kevin; Simonson, Caitlin; Schluter, Samuel F
2010-02-01
Beta 2 microglobulin (beta2m) is an essential subunit of major histocompatibility complex (MHC) type I molecules. In this report, beta2m cDNAs were identified and sequenced from sandbar shark spleen cDNA library. Sandbar shark beta2m gene encodes one amino acid less than most teleost beta2m genes, and 3 amino acids less than mammal beta2m genes. Although sandbar shark beta2m protein contains one beta sheet less than that of human in the predicted protein structure, the overall structure of beta2m proteins is conserved during evolution. Germline gene for the beta2m in sandbar and nurse shark is present as a single locus. It contains three exons and two introns. CpG sites are evenly distributed in the shark beta2m loci. Several DNA repeat elements were also identified in the shark beta2m loci. Sequence analysis suggests that the beta2m locus is not linked to the MHC I loci in the shark genome.
Sukalo, Maja; Schäflein, Eva; Schanze, Ina; Everman, David B; Rezaei, Nima; Argente, Jesús; Lorda-Sanchez, Isabel; Deshpande, Charu; Takahashi, Tsutomu; Kleger, Alexander; Zenker, Martin
2017-11-01
Johanson-Blizzard syndrome (JBS, MIM #243800) is a very rare autosomal recessive disorder characterized by exocrine pancreatic insufficiency, nasal wing hypoplasia, hypodontia, and other abnormalities. JBS is caused by mutations of the UBR1 gene (MIM *605981), encoding a ubiquitin ligase of the N-end rule pathway. Molecular findings in a total of 65 unrelated patients with a clinical diagnosis of JBS who were previously screened for UBR1 mutations by Sanger sequencing were reviewed and cases lacking a disease-causing UBR1 mutation on either one or both alleles were included in this study. In order to discover mutations that are not detectable by Sanger sequencing, we designed a probe set for multiplex ligation-dependent probe amplification (MLPA) analysis of the UBR1 gene and analyzed the copy number status of all 47 UBR1 exons. Our previous studies using Sanger sequencing could detect mutations in 93.1% of 130 disease-associated UBR1 alleles. Six patients with a highly suggestive clinical diagnosis of JBS and unsolved genotype were included in this study. MLPA analysis detected six alleles harboring exon deletions/duplications, thereby raising the mutation detection rate in the entire cohort to 97.7% (127/130 alleles). We conclude that single or multi-exon deletions or duplications account for a substantial proportion of JBS-associated UBR1 mutations. © 2017 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.
Solving multi-objective optimization problems in conservation with the reference point method
Dujardin, Yann; Chadès, Iadine
2018-01-01
Managing the biodiversity extinction crisis requires wise decision-making processes able to account for the limited resources available. In most decision problems in conservation biology, several conflicting objectives have to be taken into account. Most methods used in conservation either provide suboptimal solutions or use strong assumptions about the decision-maker’s preferences. Our paper reviews some of the existing approaches to solve multi-objective decision problems and presents new multi-objective linear programming formulations of two multi-objective optimization problems in conservation, allowing the use of a reference point approach. Reference point approaches solve multi-objective optimization problems by interactively representing the preferences of the decision-maker with a point in the criteria (objectives) space, called the reference point. We modelled and solved the following two problems in conservation: a dynamic multi-species management problem under uncertainty and a spatial allocation resource management problem. Results show that the reference point method outperforms classic methods while illustrating the use of an interactive methodology for solving combinatorial problems with multiple objectives. The method is general and can be adapted to a wide range of ecological combinatorial problems. PMID:29293650
Bahri, Bochra A; Daverdin, Guillaume; Xu, Xiangyang; Cheng, Jan-Fang; Barry, Kerrie W; Brummer, E Charles; Devos, Katrien M
2018-06-14
Advances in genomic technologies have expanded our ability to accurately and exhaustively detect natural genomic variants that can be applied in crop improvement and to increase our knowledge of plant evolution and adaptation. Switchgrass (Panicum virgatum L.), an allotetraploid (2n = 4× = 36) perennial C4 grass (Poaceae family) native to North America and a feedstock crop for cellulosic biofuel production, has a large potential for genetic improvement due to its high genotypic and phenotypic variation. In this study, we analyzed single nucleotide polymorphism (SNP) variation in 372 switchgrass genotypes belonging to 36 accessions for 12 genes putatively involved in biomass production to investigate signatures of selection that could have led to ecotype differentiation and to population adaptation to geographic zones. A total of 11,682 SNPs were mined from ~ 15 Gb of sequence data, out of which 251 SNPs were retained after filtering. Population structure analysis largely grouped upland accessions into one subpopulation and lowland accessions into two additional subpopulations. The most frequent SNPs were in homozygous state within accessions. Sixty percent of the exonic SNPs were non-synonymous and, of these, 45% led to non-conservative amino acid changes. The non-conservative SNPs were largely in linkage disequilibrium with one haplotype being predominantly present in upland accessions while the other haplotype was commonly present in lowland accessions. Tajima's test of neutrality indicated that PHYB, a gene involved in photoperiod response, was under positive selection in the switchgrass population. PHYB carried a SNP leading to a non-conservative amino acid change in the PAS domain, a region that acts as a sensor for light and oxygen in signal transduction. Several non-conservative SNPs in genes potentially involved in plant architecture and adaptation have been identified and led to population structure and genetic differentiation of ecotypes in switchgrass. We suggest here that PHYB is a key gene involved in switchgrass natural selection. Further analyses are needed to determine whether any of the non-conservative SNPs identified play a role in the differential adaptation of upland and lowland switchgrass.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, X.; Fleischer, D.T.; Whitehead, W.T.
1995-05-15
Hereditary C5 deficiency has been reported in several families of different ethnic backgrounds and from different geographic regions, but the molecular genetic defect causing C5 deficiency has not been delineated in any of them. To examine the molecular basis of C5 deficiency in the African-American population, the exons and intron/exon boundaries of the C5 structural genes from three C5-deficient (C5D) African-American families were sequenced, revealing two nonsense mutations. The nonsense mutations are located in exon 1 (C{sup 84}AG to TAG) in two of the C5D families (Rhode Island and North Carolina) and in exon 36 (C{sup 4521}GA to TGA) inmore » the third C5D family (New York). The exon 1 and 36 mutations are contained in codons that encode the first amino acid of the C5 {beta}-chain (Gln{sup 1} to Stop) and residue 1458 in the {alpha}-chain (Arg{sup 1458} to Stop), respectively. Allele-specific PCR and sequence analyses demonstrated that the exon 1 mutation is present in only one of the C5 null genes in both the Rhode Island and North Carolina families, and the exon 36 mutation is contained in only one C5 null gene in the New York family. Neither of the nonsense mutations was found in the European or Caucasian-American C5D individuals examined. Collectively, these data indicate that: (1) C5 deficiency is caused by several different molecular genetic defects, (2) C5 deficiency in the African-American population can be explained in part by two distinct nonsense mutations in exons 1 and 36, and (3) compound heterozygosity exists in all of the reported African-American C5D families. 44 refs., 5 figs., 1 tab.« less
Ogura, Yukiko; Hoshino, Tyuji; Tanaka, Nobuko; Ailiken, Guzhanuer; Kobayashi, Sohei; Kitamura, Kouichi; Rahmutulla, Bahityar; Kano, Masayuki; Murakami, Kentarou; Akutsu, Yasunori; Nomura, Fumio; Itoga, Sakae; Matsubara, Hisahiro; Matsushita, Kazuyuki
2018-05-01
Overexpression of alternative splicing of far upstream element binding protein 1 (FUBP1) interacting repressor (FIR; poly(U) binding splicing factor 60 [PUF60]) and cyclin E were detected in esophageal squamous cell carcinomas (ESCC). Accordingly, the expression of FBW7 was examined by which cyclin E is degraded as a substrate via the proteasome system. Expectedly, FBW7 expression was decreased significantly in ESCC. Conversely, c-myc gene transcriptional repressor FIR (alias PUF60; U2AF-related protein) and its alternative splicing variant form (FIRΔexon2) were overexpressed in ESCC. Further, anticancer drugs (cis-diaminedichloroplatinum/cisplatin [CDDP] or 5-fluorouracil [5-FU]) and knockdown of FIR by small interfering RNA (siRNA) increased cyclin E while knockdown of FIRΔexon2 by siRNA decreased cyclin E expression in ESCC cell lines (TE1, TE2, and T.Tn) or cervical SCC cells (HeLa cells). Especially, knockdown of SAP155 (SF3b1), a splicing factor required for proper alternative splicing of FIR pre-mRNA, decreased cyclin E. Therefore, disturbed alternative splicing of FIR generated FIR/FIRΔexon2 with cyclin E overexpression in esophageal cancers, indicating that SAP155 siRNA potentially rescued FBW7 function by reducing expression of FIR and/or FIRΔexon2. Remarkably, Three-dimensional structure analysis revealed the hypothetical inhibitory mechanism of FBW7 function by FIR/FIRΔexon2, a novel mechanism of cyclin E overexpression by FIR/FIRΔexon2-FBW7 interaction was discussed. Clinically, elevated FIR expression potentially is an indicator of the number of lymph metastases and anti-FIR/FIRΔexon2 antibodies in sera as cancer diagnosis, indicating chemical inhibitors of FIR/FIRΔexon2-FBW7 interaction could be potential candidate drugs for cancer therapy. In conclusion, elevated cyclin E expression was, in part, induced owing to potential FIR/FIRΔexon2-FBW7 interaction in ESCC.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kerr, J.M.; Fisher, L.W.; Termine, J.D.
The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less
Iwai, Kenichi; Yaguchi, Masahiro; Nishimura, Kazuho; Yamamoto, Yukiko; Tamura, Toshiya; Nakata, Daisuke; Dairiki, Ryo; Kawakita, Yoichi; Mizojiri, Ryo; Ito, Yoshiteru; Asano, Moriteru; Maezaki, Hironobu; Nakayama, Yusuke; Kaishima, Misato; Hayashi, Kozo; Teratani, Mika; Miyakawa, Shuichi; Iwatani, Misa; Miyamoto, Maki; Klein, Michael G; Lane, Wes; Snell, Gyorgy; Tjhen, Richard; He, Xingyue; Pulukuri, Sai; Nomura, Toshiyuki
2018-06-01
The modulation of pre-mRNA splicing is proposed as an attractive anti-neoplastic strategy, especially for the cancers that exhibit aberrant pre-mRNA splicing. Here, we discovered that T-025 functions as an orally available and potent inhibitor of Cdc2-like kinases (CLKs), evolutionally conserved kinases that facilitate exon recognition in the splicing machinery. Treatment with T-025 reduced CLK-dependent phosphorylation, resulting in the induction of skipped exons, cell death, and growth suppression in vitro and in vivo Further, through growth inhibitory characterization, we identified high CLK2 expression or MYC amplification as a sensitive-associated biomarker of T-025. Mechanistically, the level of CLK2 expression correlated with the magnitude of global skipped exons in response to T-025 treatment. MYC activation, which altered pre-mRNA splicing without the transcriptional regulation of CLKs, rendered cancer cells vulnerable to CLK inhibitors with synergistic cell death. Finally, we demonstrated in vivo anti-tumor efficacy of T-025 in an allograft model of spontaneous, MYC-driven breast cancer, at well-tolerated dosage. Collectively, our results suggest that the novel CLK inhibitor could have therapeutic benefits, especially for MYC-driven cancer patients. © 2018 Takeda Pharmaceutical Company Published under the terms of the CC BY 4.0 license.
Raudsepp, Terje; Dobson, Lauren; Vishnoi, Monika; Fritz, Krista L.; Schaefer, Robert; Rendahl, Aaron K.; Derr, James N.; Love, Charles C.; Varner, Dickson D.; Chowdhary, Bhanu P.
2012-01-01
Impaired acrosomal reaction (IAR) of sperm causes male subfertility in humans and animals. Despite compelling evidence about the genetic control over acrosome biogenesis and function, the genomics of IAR is as yet poorly understood, providing no molecular tools for diagnostics. Here we conducted Equine SNP50 Beadchip genotyping and GWAS using 7 IAR–affected and 37 control Thoroughbred stallions. A significant (P<6.75E-08) genotype–phenotype association was found in horse chromosome 13 in FK506 binding protein 6 (FKBP6). The gene belongs to the immunophilins FKBP family known to be involved in meiosis, calcium homeostasis, clathrin-coated vesicles, and membrane fusions. Direct sequencing of FKBP6 exons in cases and controls identified SNPs g.11040315G>A and g.11040379C>A (p.166H>N) in exon 4 that were significantly associated with the IAR phenotype both in the GWAS cohort (n = 44) and in a large multi-breed cohort of 265 horses. All IAR stallions were homozygous for the A-alleles, while this genotype was found only in 2% of controls. The equine FKBP6 was exclusively expressed in testis and sperm and had 5 different transcripts, of which 4 were novel. The expression of this gene in AC/AG heterozygous controls was monoallelic, and we observed a tendency for FKBP6 up-regulation in IAR stallions compared to controls. Because exon 4 SNPs had no effect on the protein structure, it is likely that FKBP6 relates to the IAR phenotype via regulatory or modifying functions. In conclusion, FKBP6 was considered a susceptibility gene of incomplete penetrance for IAR in stallions and a candidate gene for male subfertility in mammals. FKBP6 genotyping is recommended for the detection of IAR–susceptible individuals among potential breeding stallions. Successful use of sperm as a source of DNA and RNA propagates non-invasive sample procurement for fertility genomics in animals and humans. PMID:23284302
Structural characterization of the FKHR gene and its rearrangement in alveolar rhabdomyosarcoma.
Davis, R J; Bennicelli, J L; Macina, R A; Nycum, L M; Biegel, J A; Barr, F G
1995-12-01
The FKHR gene, which contains a forkhead DNA-binding motif, is fused to either PAX3 or PAX7 by the t(2;13) or t(1;13) translocation in alveolar rhabdomyosarcoma,respectively. These tumors express chimeric transcripts encoding the N-terminal portion of either PAX protein fused to the C-terminal portion of FKHR. To understand the structural basis and functional consequences of these translocations, we characterized the wild-type FKHR gene and its rearrangement in alveolar rhabdomyosarcomas. By isolating and analyzing phage, cosmid and YAC clones, we determined that FKHR consists of three exons spanning 140 kb and that several highly similar loci are present in other genomic regions. Exon 1 encodes the N-terminus of the forkhead domain and is embedded within demethylated CpG island. RNA analyses reveal FKHR transcripts initiate from a TATA-less promoter within this island. Exon 2 encodes the C-terminus of the forkhead domain and a transcription activation domain, whereas exon 3 encodes a large 3' untranslated region. The intron 1-exon 2 boundary precisely matches the FHKR fusion point in the chimeric transcripts found in alveolar rhabdomyosarcomas. Using pulsed-field and fluorescence in situ hybridization analyses, we demonstrate that the 130kb FKHR intron 1 is rearranged in t(2;13)-containing alveolar rhabdomyosarcomas. Our findings indicate that FKHR intron 1 provides a large target for DNA rearrangemnt. Rearrangement of this intron with PAX3 produces two important functional consequences: in-frame fusion of N-terminal PAX3 sequences to the FKHR transcriptional activation domain and disruption of the FKHR DNA binding domain.
Lee, Tai-Sung; Ma, Wanlong; Zhang, Xi; Kantarjian, Hagop; Albitar, Maher
2009-01-01
Background The functional relevance of many of the recently detected JAK2 mutations, except V617F and exon 12 mutants, in patients with chronic myeloproliferative neoplasia (MPN) has been significantly overlooked. To explore atomic-level explanations of the possible mutational effects from those overlooked mutants, we performed a set of molecular dynamics simulations on clinically observed mutants, including newly discovered mutations (K539L, R564L, L579F, H587N, S591L, H606Q, V617I, V617F, C618R, L624P, whole exon 14-deletion) and control mutants (V617C, V617Y, K603Q/N667K). Results Simulation results are consistent with all currently available clinical/experimental evidence. The simulation-derived putative interface, not possibly obtained from static models, between the kinase (JH1) and pseudokinase (JH2) domains of JAK2 provides a platform able to explain the mutational effect for all mutants, including presumably benign control mutants, at the atomic level. Conclusion The results and analysis provide structural bases for mutational mechanisms of JAK2, may advance the understanding of JAK2 auto-regulation, and have the potential to lead to therapeutic approaches. Together with recent mutation profiling results demonstrating the breadth of clinically observed JAK2 mutations, our findings suggest that molecular testing/diagnostics of JAK2 should extend beyond V617F and exon 12 mutations, and perhaps should encompass most of the pseudo-kinase domain-coding region. PMID:19744331
A novel presenilin 1 mutation (Ala275Val) as cause of early-onset familial Alzheimer disease.
Luedecke, Daniel; Becktepe, Jos S; Lehmbeck, Jan T; Finckh, Ulrich; Yamamoto, Raina; Jahn, Holger; Boelmans, Kai
2014-04-30
Mutations in the presenilin 1 (PS1) gene (PSEN1) are associated with familial Alzheimer disease (FAD). Here, we report on a 50-year-old patient presenting with progressive deterioration of his short-term memory and a family history of early-onset dementia. Diagnostic workup included a neuropsychological examination, structural magnetic resonance (MR) imaging, cerebrospinal fluid (CSF) biomarkers including total tau, phosphorylated tau, and Aβ42 levels, as well as sequencing relevant fragments of the genes PSEN1, PSEN2, and APP. Additionally, we were able to obtain archival paraffin-embedded cerebellar tissue from the patient's father for cosegregation analysis. Clinical, neuropsychological and MR imaging data were indicative of early-onset Alzheimer disease. Furthermore, CSF biomarkers showed a typical pattern for Alzheimer disease. DNA sequencing revealed a heterozygous nucleotide transition (c.824C>T) in exon 8 of PSEN1, leading to an amino acid change from alanine to valine at codon 275 (Ala275Val). The same mutation was found in an archival brain specimen of the patient's demented father, but not in a blood sample of the non-demented mother. This mutation alters a conserved residue in the large hydrophilic loop of PS1, suggesting pathogenic relevance. Cosegregegation analysis and the structural as well as the presumed functional role of the mutated and highly conserved residue suggest FAD causing characteristics of the novel PSEN1 mutation Ala275Val. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Rincon, Sergio A; Paoletti, Anne
2016-01-01
Unveiling the function of a novel protein is a challenging task that requires careful experimental design. Yeast cytokinesis is a conserved process that involves modular structural and regulatory proteins. For such proteins, an important step is to identify their domains and structural organization. Here we briefly discuss a collection of methods commonly used for sequence alignment and prediction of protein structure that represent powerful tools for the identification homologous domains and design of structure-function approaches to test experimentally the function of multi-domain proteins such as those implicated in yeast cytokinesis.
Lim, S K; Maquat, L E
1992-01-01
Previous studies have demonstrated that nonsense codons within beta zero-thalassemic or in vitro-mutagenized human beta-globin transgenes result in the production of mRNAs that are degraded abnormally rapidly in the cytoplasm of murine erythroid cells. As a consequence, three RNA degradative intermediates are formed that lack sequences from either exon I or exons I and II. We show here that the intermediates, like the full-length mRNA from which they derive and the endogenous murine beta maj-globin mRNA, bind to the anticap monoclonal antibody H-20 in a way that is competed by the cap analogue m7G and eliminated by prior exposure to tobacco acid pyrophosphatase. Furthermore, the intermediates, like the two full-length mRNAs, are resistant to a 5'----3' exonuclease activity isolated from HeLa cell nuclei that degrades uncapped but not capped ribopolymers. Based on these observations, the intermediates appear to possess a structure that is indistinguishable from the cap at the 5' end of mRNA, i.e. a methylated nucleoside that is linked to the RNA by a 5'-5' phosphodiester bond. Detection of the intermediates during murine development was concomitant with detection of full-length thalassemic mRNA. Intermediate production appears to be influenced by RNA structure as indicated by the products that derive from a beta zero-thalassemic beta-globin transgene harboring a structural alteration (a 4 bp deletion) that was larger than any of those previously studied. Images PMID:1324170
Genomic structure and chromosomal mapping of the human CD22 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilson, G.L.; Kozlow, E.; Kehrl, J.H.
1993-06-01
The human CD22 gene is expressed specifically in B lymphocytes and likely has an important function in cell-cell interactions. A nearly full length human CD22 cDNA clone was used to isolate genomic clones that span the CD22 gene. The CD22 gene is spread over 22 kb of DNA and is composed of 15 exons. The first exon contains the major transcriptional start sites. The translation initiation codon is located in exon 3, which also encodes a portion of the signal peptide. Exons 4 to 10 encode the seven Ig domains of CD22, exon 11 encodes the transmembrane domain, exons 12more » to 15 encode the intracytoplasmic domain of CD22, and exon 15 also contains the 3' untranslated region. A minor form of CD22 mRNA likely results from splicing of exon 5 to exon 8, skipping exons 6 and 7. A 4.6-kb Xbal fragment of the CD22 gene was used to map the chromosomal location of CD22 by fluorescence in situ hybridization. The hybridization locus was identified by combining fluorescent images of the probe with the chromosomal banding pattern generated by an Alu probe. The results demonstrate the CD22 is located within the band region q13.1 of chromosome 19. Two closely clustered major transcription start sites and several minor start sites were mapped by primer extension. Similarly to many other lymphoid-specific genes, the CD22 promoter lacks an obvious TATA box. Approximately 4 kb of DNA 5' of the transcription start sites were sequenced and found to contain multiple Alu elements. Potential binding sites for the transcriptional factors NF-kB, AP-1, and Oct-2 are located within 300 bp 5' of the major transcription start sites. A 400-bp fragment (bp -339 through +71) of the CD22 promoter region was subcloned into a pGEM-chloramphenicol acetyltransferase vector and after transfection into B and T cells was found to be active in both B and T cells. 45 refs., 7 figs., 2 tabs.« less
Alegre, Ana Claudia Paiva; Oliveira, Aline Ferreira; Dos Reis Almeida, Fausto Bruno; Roque-Barreira, Maria Cristina; Hanna, Ebert Seixas
2014-01-01
Background Paracoccin is a dual-function protein of the yeast Paracoccidioides brasiliensis that has lectin properties and N-acetylglucosaminidase activities. Proteomic analysis of a paracoccin preparation from P. brasiliensis revealed that the sequence matched that of the hypothetical protein encoded by PADG-3347 of isolate Pb-18, with a polypeptide sequence similar to the family 18 endochitinases. These endochitinases are multi-functional proteins, with distinct lectin and enzymatic domains. Methodology/principal findings The multi-exon assembly and the largest exon of the predicted ORF (PADG-3347), was cloned and expressed in Escherichia coli cells, and the features of the recombinant proteins were compared to those of the native paracoccin. The multi-exon protein was also used for protection assays in a mouse model of paracoccidioidomycosis. Conclusions/Significance Our results showed that the recombinant protein reproduced the biological properties described for the native protein—including binding to laminin in a manner that is dependent on carbohydrate recognition—showed N-acetylglucosaminidase activity, and stimulated murine peritoneal macrophages to produce high levels of TNF-α and nitric oxide. Considering the immunomodulatory potential of glycan-binding proteins, we also investigated whether prophylactic administration of recombinant paracoccin affected the course of experimental paracoccidioidomycosis in mice. In comparison to animals injected with vehicle (controls), mice treated with recombinant paracoccin displayed lower pulmonary fungal burdens and reduced pulmonary granulomas. These protective effects were associated with augmented pulmonary levels of IL-12 and IFN-γ. We also observed that injection of paracoccin three days before challenge was the most efficient administration protocol, as the induced Th1 immunity was balanced by high levels of pulmonary IL-10, which may prevent the tissue damage caused by exacerbated inflammation. The results indicated that paracoccin is the protein encoded by PADG-3347, and we propose that this gene and homologous proteins in other P. brasiliensis strains be called paracoccin. We also concluded that recombinant paracoccin confers resistance to murine P. brasiliensis infection by exerting immunomodulatory effects. PMID:24743161
A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila
Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang
2015-01-01
Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544
Mapping neurofibromatosis 1 homologous loci by fluorescence in situ hybridization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Viskochil, D.; Breidenbach, H.H.; Cawthon, R.
Neurofibromatosis 1 maps to chromosome band 17q11.2 and the NF1 gene is comprised of 59 exons that span approximately 335 kb of genomic DNA. In order to further analyze the structure of NF1 from exons 2 through 27b, we isolated a number of cosmid and bacteriophage P-1 genomic clones using NF1-exon probes under high-stringency hybridization conditions. Using tagged, intron-based primers and DNA from various clones as a template, we PCR-amplified and sequenced individual NF1 exons. The exon sequences in PCR products from several genomic clones differed from the exon sequence derived from cloned NF1 cDNAs. Clones with variant sequences weremore » mapped by fluorescence in situ hybridization under high-stringency conditions. Three clones mapped to chromosome band 15q11.2, one mapped to 14q11.2, one mapped to both 2q14.1-14.3 and 14q11.2, one mapped to 2q33-34, and one mapped to both 18q11.2 and 21q21. Even though some PCR-product sequences retained proper splice junctions and open reading frames, we have yet to identify cDNAs that correspond to the variant exon sequences. We are now sequencing clones that map to NF1-homologous loci in order to develop discriminating primer pairs for the exclusive amplification of NF1-specific sequences in our efforts to develop a comprehensive NF1 mutation screen using genomic DNA as template. The role of NF1-homologous sequences may play in neurofibromatosis 1 is not clear.« less
Zygote arrest 1 (Zar1) is an evolutionarily conserved gene expressed in vertebrate ovaries.
Wu, Xuemei; Wang, Pei; Brown, Christopher A; Zilinski, Carolyn A; Matzuk, Martin M
2003-09-01
Zygote arrest 1 (ZAR1) is an ovary-specific maternal factor that plays essential roles during the oocyte-to-embryo transition. In mice, the Zar1 mRNA is detected as a 1.4-kilobase (kb) transcript that is synthesized exclusively in growing oocytes. To further understand the functions of ZAR1, we have cloned the orthologous Zar1 cDNA and/or genes for mouse, rat, human, frog, zebrafish, and pufferfish. The entire mouse Zar1 gene and a related pseudogene span approximately 4.0 kb, contain four exons, and map to adjacent loci on mouse chromosome 5. The human ZAR1 orthologous gene similarly consists of four exons and resides on human chromosome 4p12, which is syntenic with the mouse Zar1 chromosomal locus. Rat (Rattus norvegicus) and pufferfish (Fugu rubripes) Zar1 genes were recognized by database mining and deduced protein alignment analysis. The rat Zar1 gene also maps to a region that is syntenic with the mouse Zar1 gene locus on rat chromosome 14. Frog (Xenopus laevis) and zebrafish (Danio rerio) Zar1 orthologs were cloned by reverse transcription-polymerase chain reaction and rapid amplification of cDNA ends analysis of ovarian mRNA. Unlike mouse and human, the frog Zar1 is detected in multiple tissues, including lung, muscle, and ovary. The Zar1 mRNA appears in the cytoplasm of oocytes and persists until the tailbud stage during frog embryogenesis. Mouse, rat, human, frog, zebrafish, and pufferfish Zar1 genes encode proteins of 361, 361, 424, 295, 329, and 320 amino acids, respectively, and share 50.8%-88.1% amino acid identity. Regions of the N-termini of these ZAR1 orthologs show high sequence identity among these various proteins. However, the C-terminal 103 amino acids of these proteins, encoded by exons 2-4, contain an atypical eight-cysteine Plant Homeo Domain motif and are highly conserved, sharing 80.6%-98.1% identity among these species. These findings suggest that the carboxyl-termini of these ZAR1 proteins contain an important functional domain that is conserved through vertebrate evolution and that may be necessary for normal female reproduction in the transition from oocyte to embryonic life.
Su, Pen-Hua; Yu, Ju-Shan; Chen, Jia-Yuh; Chen, Suh-Jen; Li, Shuan-Yow; Chen, Hsiao-Neng
2007-10-01
Oculo-auriculo-vertebral spectrum, the exact genetic predisposition of which has not yet been resolved, is characterized by varying degrees of the prevalently unilateral underdevelopment of craniofacial structures and spinal anomalies. Here, we analyzed four cases exhibiting multiple features of oculo-auriculo-vertebral spectrum and one case with Treacher-Collins syndrome. The cranium was analyzed using three-dimensional computed tomography, which reliably identifies craniofacial malformations. We detected one typical oculo-auriculo-vertebral spectrum patient who had a missense mutation in exon 9 of the TCOF1 gene complex and two silent mutations in exons 10 and 23, three partial oculo-auriculo-vertebral spectrum patients who had no detectable mutations in the TCOF1 gene complex, and one Treacher-Collins syndrome patient who had a nonsense mutation in exon 14. All five patients had eight previously reported polymorphic changes in the TCOF1 exons 10, 11, 12, 16, 21, 22, and 23, and four unreported polymorphisms in exons 9, 17, and 22 that were also detected in 51 Taiwanese control patients. These observations strongly suggest that the TCOF1 genetic changes observed in these five patients might be related to oculo-auriculo-vertebral spectrum symptoms.
Friedberg, Felix
2009-05-01
In this paper we examine (restricted to homo sapiens) the products resulting from gene duplication and the subsequent alternative splicing for the members of a multidomain group of proteins which possess the evolutionary conserved calponin homology CH domain, i.e. an "actin binding domain", as a singlet and which, in addition, contain the conserved cysteine rich double Zn finger possessing Lim domain, also as a singlet. Seven genes, resulting from gene duplications, were identified that code for seven group members for which pre-mRNAs appear to have undergone multiple alternative splicing: Mical 1, 2 and 3 are located on chromosomes 6q21, 11p15 and 22q11, respectively. The LMO7 gene is present on chromosome 13q22 and the LIMCH1 gene on chromosome 4p13. Micall1 is mapped to chromosome 22q13 and Micall2 to chromosome 7p22. Translated Gen/Bank ESTs suggest the existence of multiple products alternatively spliced from the pre-mRNAs encoded by these genes. Characteristic indicators of such splicing among the proteins derived from one gene must include containment of some common extensive 100% identical regions. In some instances only one exon might be partly or completely eliminated. Sometimes alternative splicing is also associated with an increased frequency of creation of an exon or part of an exon from an intron. Not only coding regions for the body of the protein but also for its N- or -C ends could be affected by the splicing. If created forms are merely beginning at different starting points but remain identical in sequence thereafter, their existence as products of alternate splicing must be questioned. In the splicings, described in this paper, multiple isoforms rather than a single isoform appear as products during the gene expression.
Nomiyama, H; Kuhara, S; Kukita, T; Otsuka, T; Sakaki, Y
1981-01-01
The 26S ribosomal RNA gene of Physarum polycephalum is interrupted by two introns, and we have previously determined the sequence of one of them (intron 1) (Nomiyama et al. Proc.Natl.Acad.Sci.USA 78, 1376-1380, 1981). In this study we sequenced the second intron (intron 2) of about 0.5 kb length and its flanking regions, and found that one nucleotide at each junction is identical in intron 1 and intron 2, though the junction regions share no other sequence homology. Comparison of the flanking exon sequences to E. coli 23S rRNA sequences shows that conserved sequences are interspersed with tracts having little homology. In particular, the region encompassing the intron 2 interruption site is highly conserved. The E. coli ribosomal protein L1 binding region is also conserved. Images PMID:6171776
Deletions of fetal and adult muscle cDNA in Duchenne and Becker muscular dystrophy patients.
Cross, G S; Speer, A; Rosenthal, A; Forrest, S M; Smith, T J; Edwards, Y; Flint, T; Hill, D; Davies, K E
1987-01-01
We have isolated a cDNA molecule from a human adult muscle cDNA library which is deleted in several Duchenne muscular dystrophy patients. Patient deletions have been used to map the exons across the Xp21 region of the short arm of the X chromosome. We demonstrate that a very mildly affected 61 year old patient is deleted for at least nine exons of the adult cDNA. We find no evidence for differential exon usage between adult and fetal muscle in this region of the gene. There must therefore be less essential domains of the protein structure which can be removed without complete loss of function. The sequence of 2.0 kb of the adult cDNA shows no homology to any previously described protein listed in the data banks although sequence comparison at the amino acid level suggests that the protein has a structure not dissimilar to rod structures of cytoskeletal proteins such as lamin and myosin. There are single nucleotide differences in the DNA sequence between the adult and fetal cDNAs which result in amino acid changes but none that would be predicted to change the structure of the protein dramatically. Images Fig. 1. Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 7. PMID:3428261
Multi-symplectic integrators: numerical schemes for Hamiltonian PDEs that conserve symplecticity
NASA Astrophysics Data System (ADS)
Bridges, Thomas J.; Reich, Sebastian
2001-06-01
The symplectic numerical integration of finite-dimensional Hamiltonian systems is a well established subject and has led to a deeper understanding of existing methods as well as to the development of new very efficient and accurate schemes, e.g., for rigid body, constrained, and molecular dynamics. The numerical integration of infinite-dimensional Hamiltonian systems or Hamiltonian PDEs is much less explored. In this Letter, we suggest a new theoretical framework for generalizing symplectic numerical integrators for ODEs to Hamiltonian PDEs in R2: time plus one space dimension. The central idea is that symplecticity for Hamiltonian PDEs is directional: the symplectic structure of the PDE is decomposed into distinct components representing space and time independently. In this setting PDE integrators can be constructed by concatenating uni-directional ODE symplectic integrators. This suggests a natural definition of multi-symplectic integrator as a discretization that conserves a discrete version of the conservation of symplecticity for Hamiltonian PDEs. We show that this approach leads to a general framework for geometric numerical schemes for Hamiltonian PDEs, which have remarkable energy and momentum conservation properties. Generalizations, including development of higher-order methods, application to the Euler equations in fluid mechanics, application to perturbed systems, and extension to more than one space dimension are also discussed.
Giese, Alexander; Jude, Rony; Kuiper, Heidi; Raudsepp, Terje; Piumi, Francois; Schambony, Alexandra; Guérin, Gérard; Chowdhary, Bhanu P; Distl, Ottmar; Töpfer-Petersen, Edda; Leeb, Tosso
2002-10-16
The cysteine-rich secretory protein (CRISP) family consists of three members called acidic epididymal glycoprotein 1 (AEG1), AEG2, and testis-specific protein 1 (TPX1), which share 16 conserved cysteine residues at their C-termini. The CRISP proteins are primarily expressed in different sections of the male genital tract and are thought to mediate cell-cell interactions of male germ cells with other cells during sperm maturation or during fertilization. Therefore, their genes are of interest as candidate genes for inherited male fertility dysfunctions and as putative quantitative trait loci for male fertility traits. In this report, the cloning and DNA sequence of 137 kb of horse genomic DNA from equine chromosome 20q22 containing the closely linked equine TPX1 and AEG2 genes are described. The equine TPX1 gene consists of ten exons spanning 18 kb while the AEG2 gene consists of eight exons that are spread over 24 kb. The expression of these two genes was investigated in several tissues by reverse transcription polymerase chain reaction analysis and Western blotting. Comparative genome analysis between horse, human, and mouse indicates that all three CRISP genes are clustered on one chromosomal location, which shows conserved synteny between these species.
Lan, Hong; Chen, Hui; Chen, Li-Cheng; Wang, Bei-Bing; Sun, Li; Ma, Mei-Ying; Fang, Sheng-Guo; Wan, Qiu-Hong
2014-01-01
Defensins play a key role in the innate immunity of various organisms. Detailed genomic studies of the defensin cluster have only been reported in a limited number of birds. Herein, we present the first characterization of defensins in a Pelecaniformes species, the crested ibis (Nipponia nippon), which is one of the most endangered birds in the world. We constructed bacterial artificial chromosome libraries, including a 4D-PCR library and a reverse-4D library, which provide at least 40 equivalents of this rare bird's genome. A cluster including 14 β-defensin loci within 129 kb was assigned to chromosome 3 by FISH, and one gene duplication of AvBD1 was found. The ibis defensin genes are characterized by multiform gene organization ranging from two to four exons through extensive exon fusion. Splicing signal variations and alternative splice variants were also found. Comparative analysis of four bird species identified one common and multiple species-specific duplications, which might be associated with high GC content. Evolutionary analysis revealed birth-and-death mode and purifying selection for avian defensin evolution, resulting in different defensin gene numbers among bird species and functional conservation within orthologous genes, respectively. Additionally, we propose various directions for further research on genetic conservation in the crested ibis. PMID:25372018
Alternative splicing and promoter use in TFII-I genes
Makeyev, Aleksandr V.; Bayarsaihan, Dashzeveg
2008-01-01
TFII-I proteins are ubiquitously expressed transcriptional factors involved in both basal transcription and signal transduction activation or repression. TFII-I proteins are detected as early as at two-cell stage and exhibit distinct and dynamic expression patterns in developing embryos as well as mark regional variation in the adult mouse brain. Analysis of atypical small and rare chromosomal deletions at 7q11.23 points to TFII-I genes (GTF2I and GTF2IRD1) as the prime candidates responsible for craniofacial and cognitive abnormalities in the Williams-Beuren syndrome. TFII-I genes are often subjected to alternative splicing, which generates isoforms that that show different activities and play distinct biological roles. The coding regions of TFII-I genes are composed of more than 30 exons and are well conserved among vertebrates. However, their 5′ untranslated regions are not as well conserved and all poorly characterized. In the present work, we analyzed promoter regions of TFII-I genes and described their additional exons, as well as tested tissue specificity of both previously reported and novel alternatively spliced isoforms. Our comprehensive analysis leads to further elucidation of the functional heterogeneity of TFII-I proteins, provides hints on search for regulatory pathways governing their expression, and opens up possibilities for examining the effect of different haplotypes on their promoter functions. PMID:19111598
Chalcone synthase genes from milk thistle (Silybum marianum): isolation and expression analysis.
Sanjari, Sepideh; Shobbar, Zahra Sadat; Ebrahimi, Mohsen; Hasanloo, Tahereh; Sadat-Noori, Seyed-Ahmad; Tirnaz, Soodeh
2015-12-01
Silymarin is a flavonoid compound derived from milk thistle (Silybum marianum) seeds which has several pharmacological applications. Chalcone synthase (CHS) is a key enzyme in the biosynthesis of flavonoids; thereby, the identification of CHS encoding genes in milk thistle plant can be of great importance. In the current research, fragments of CHS genes were amplified using degenerate primers based on the conserved parts of Asteraceae CHS genes, and then cloned and sequenced. Analysis of the resultant nucleotide and deduced amino acid sequences led to the identification of two different members of CHS gene family,SmCHS1 and SmCHS2. Third member, full-length cDNA (SmCHS3) was isolated by rapid amplification of cDNA ends (RACE), whose open reading frame contained 1239 bp including exon 1 (190 bp) and exon 2 (1049 bp), encoding 63 and 349 amino acids, respectively. In silico analysis of SmCHS3 sequence contains all the conserved CHS sites and shares high homology with CHS proteins from other plants.Real-time PCR analysis indicated that SmCHS1 and SmCHS3 had the highest transcript level in petals in the early flowering stage and in the stem of five upper leaves, followed by five upper leaves in the mid-flowering stage which are most probably involved in anthocyanin and silymarin biosynthesis.
Expression of SMARCB1 (INI1) mutations in familial schwannomatosis
Smith, Miriam J.; Walker, James A.; Shen, Yiping; Stemmer-Rachamimov, Anat; Gusella, James F.; Plotkin, Scott R.
2012-01-01
Genetic changes in the SMARCB1 tumor suppressor gene have recently been reported in tumors and blood from families with schwannomatosis. Exon scanning of all nine SMARCB1 exons in genomic DNA from our cohort of families meeting the criteria for ‘definite’ or ‘presumptive’ schwannomatosis previously revealed constitutional alterations in 13 of 19 families (68%). Screening of four new familial schwannomatosis probands identified one additional constitutional alteration. We confirmed the presence of mRNA transcripts for two missense alterations, four mutations of conserved splice motifs and two additional mutations, in less conserved sequences, which also affect splicing. Furthermore, we found that transcripts for a rare 3′-untranslated region (c.*82C > T) alteration shared by four unrelated families did not produce splice variants but did show unequal allelic expression, suggesting that the alteration is either causative itself or linked to an unidentified causative mutation. Overexpression studies in cells lacking SMARCB1 suggest that mutant SMARCB1 proteins, like wild-type SMARCB1 protein, retain the ability to suppress cyclin D1 activity. These data, together with the expression of SMARCB1 protein in a proportion of cells from schwannomatosis-related schwannomas, suggest that these tumors develop through a mechanism that is distinct from that of rhabdoid tumors in which SMARCB1 protein is completely absent in tumor cells. PMID:22949514
Long-range RNA pairings contribute to mutually exclusive splicing
Yue, Yuan; Yang, Yun; Dai, Lanzhi; Cao, Guozheng; Chen, Ran; Hong, Weiling; Liu, Baoping; Shi, Yang; Meng, Yijun; Shi, Feng; Xiao, Mu; Jin, Yongfeng
2016-01-01
Mutually exclusive splicing is an important means of increasing the protein repertoire, by which the Down's syndrome cell adhesion molecule (Dscam) gene potentially generates 38,016 different isoforms in Drosophila melanogaster. However, the regulatory mechanisms remain obscure due to the complexity of the Dscam exon cluster. Here, we reveal a molecular model for the regulation of the mutually exclusive splicing of the serpent pre-mRNA based on competition between upstream and downstream RNA pairings. Such dual RNA pairings confer fine tuning of the inclusion of alternative exons. Moreover, we demonstrate that the splicing outcome of alternative exons is mediated in relative pairing strength-correlated mode. Combined comparative genomics analysis and experimental evidence revealed similar bidirectional structural architectures in exon clusters 4 and 9 of the Dscam gene. Our findings provide a novel mechanistic framework for the regulation of mutually exclusive splicing and may offer potentially applicable insights into long-range RNA–RNA interactions in gene regulatory networks. PMID:26554032
Long-range RNA pairings contribute to mutually exclusive splicing.
Yue, Yuan; Yang, Yun; Dai, Lanzhi; Cao, Guozheng; Chen, Ran; Hong, Weiling; Liu, Baoping; Shi, Yang; Meng, Yijun; Shi, Feng; Xiao, Mu; Jin, Yongfeng
2016-01-01
Mutually exclusive splicing is an important means of increasing the protein repertoire, by which the Down's syndrome cell adhesion molecule (Dscam) gene potentially generates 38,016 different isoforms in Drosophila melanogaster. However, the regulatory mechanisms remain obscure due to the complexity of the Dscam exon cluster. Here, we reveal a molecular model for the regulation of the mutually exclusive splicing of the serpent pre-mRNA based on competition between upstream and downstream RNA pairings. Such dual RNA pairings confer fine tuning of the inclusion of alternative exons. Moreover, we demonstrate that the splicing outcome of alternative exons is mediated in relative pairing strength-correlated mode. Combined comparative genomics analysis and experimental evidence revealed similar bidirectional structural architectures in exon clusters 4 and 9 of the Dscam gene. Our findings provide a novel mechanistic framework for the regulation of mutually exclusive splicing and may offer potentially applicable insights into long-range RNA-RNA interactions in gene regulatory networks. © 2015 Yue et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cheng, J.; Liu, C.; Koopman, W.J.
Ligation of the Fas cell-surface molecule induces apoptosis. Defective Fas-mediated apoptosis has been associated with spontaneous autoimmunity in mice. Using human Fas/Apo-1 cDNA as a probe, the authors have molecularly cloned and characterized the human Fas chromosomal gene. The gene consists of nine exons and spans more than 26 kilobases of DNA. The lengths of introns vary from > 14 kilobases at the 5` end of the gene to 152 base pairs upstream of the exon encoding the transmembrane domain. The domain structure of the human Fas is encoded by an exon or a set of exons. Primer extension analysismore » revealed three major transcription initiation sites. The promoter region lacked canonical {open_quotes}TATA{close_quotes} and {open_quotes}CAAT{close_quotes} boxes but was a {open_quotes}GC-rich{close_quotes} sequence, and contained consensus sequences for AP-1, GF-1, NY-Y, CP-2, EBP20, and c-myb. These data provide the first characterization of the human Fas gene and insight into its regulatory region. 54 refs., 3 figs., 1 tab.« less
Showalter, Aaron D; Smith, Timothy P L; Bennett, Gary L; Sloop, Kyle W; Whitsett, Julie A; Rhodes, Simon J
2002-05-29
The Prophet of Pit-1 (PROP1) gene encodes a paired class homeodomain transcription factor that is exclusively expressed in the developing mammalian pituitary gland. PROP1 function is essential for anterior pituitary organogenesis, and heritable mutations in the gene are associated with combined pituitary hormone deficiency in human patients and animals. By cloning the bovine PROP1 gene and by comparative analysis, we demonstrate that the homeodomains and carboxyl termini of mammalian PROP1 proteins are highly conserved while the amino termini are diverged. Whereas the carboxyl termini of the human and bovine PROP1 proteins contain potent transcriptional activation domains, the amino termini and homeodomains have repressive activities. The bovine PROP1 gene has four exons and three introns and maps to a region of chromosome seven carrying a quantitative trait locus affecting ovulation rate. Two alleles of the bovine gene were found that encode distinct protein products with different DNA binding and transcriptional activities. These experiments demonstrate that mammalian PROP1 genes encode proteins with complex regulatory capacities and that modest changes in protein sequence can significantly alter the activity of this pituitary developmental transcription factor.
Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang
2015-11-23
With the availability of rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two set of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with those evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. Particularly, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. Besides, part of the CSGs and orphan genes expressed responsive to abiotic stress, indicating their potential functions during interaction with environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.
On the formalization of multi-scale and multi-science processes for integrative biology
Díaz-Zuccarini, Vanessa; Pichardo-Almarza, César
2011-01-01
The aim of this work is to introduce the general concept of ‘Bond Graph’ (BG) techniques applied in the context of multi-physics and multi-scale processes. BG modelling has a natural place in these developments. BGs are inherently coherent as the relationships defined between the ‘elements’ of the graph are strictly defined by causality rules and power (energy) conservation. BGs clearly show how power flows between components of the systems they represent. The ‘effort’ and ‘flow’ variables enable bidirectional information flow in the BG model. When the power level of a system is low, BGs degenerate into signal flow graphs in which information is mainly one-dimensional and power is minimal, i.e. they find a natural limitation when dealing with populations of individuals or purely kinetic models, as the concept of energy conservation in these systems is no longer relevant. The aim of this work is twofold: on the one hand, we will introduce the general concept of BG techniques applied in the context of multi-science and multi-scale models and, on the other hand, we will highlight some of the most promising features in the BG methodology by comparing with examples developed using well-established modelling techniques/software that could suggest developments or refinements to the current state-of-the-art tools, by providing a consistent framework from a structural and energetic point of view. PMID:22670211
A family of splice variants of CstF-64 expressed in vertebrate nervous systems
Shankarling, Ganesh S; Coates, Penelope W; Dass, Brinda; MacDonald, Clinton C
2009-01-01
Background Alternative splicing and polyadenylation are important mechanisms for creating the proteomic diversity necessary for the nervous system to fulfill its specialized functions. The contribution of alternative splicing to proteomic diversity in the nervous system has been well documented, whereas the role of alternative polyadenylation in this process is less well understood. Since the CstF-64 polyadenylation protein is known to be an important regulator of tissue-specific polyadenylation, we examined its expression in brain and other organs. Results We discovered several closely related splice variants of CstF-64 – collectively called βCstF-64 – that could potentially contribute to proteomic diversity in the nervous system. The βCstF-64 splice variants are found predominantly in the brains of several vertebrate species including mice and humans. The major βCstF-64 variant mRNA is generated by inclusion of two alternate exons (that we call exons 8.1 and 8.2) found between exons 8 and 9 of the CstF-64 gene, and contains an additional 147 nucleotides, encoding 49 additional amino acids. Some variants of βCstF-64 contain only the first alternate exon (exon 8.1) while other variants contain both alternate exons (8.1 and 8.2). In mice, the predominant form of βCstF-64 also contains a deletion of 78 nucleotides from exon 9, although that variant is not seen in any other species examined, including rats. Immunoblot and 2D-PAGE analyses of mouse nuclear extracts indicate that a protein corresponding to βCstF-64 is expressed in brain at approximately equal levels to CstF-64. Since βCstF-64 splice variant family members were found in the brains of all vertebrate species examined (including turtles and fish), this suggests that βCstF-64 has an evolutionarily conserved function in these animals. βCstF-64 was present in both pre- and post-natal mice and in different regions of the nervous system, suggesting an important role for βCstF-64 in neural gene expression throughout development. Finally, experiments in representative cell lines suggest that βCstF-64 is expressed in neurons but not glia. Conclusion This is the first report of a family of splice variants encoding a key polyadenylation protein that is expressed in a nervous system-specific manner. We propose that βCstF-64 contributes to proteomic diversity by regulating alternative polyadenylation of neural mRNAs. PMID:19284619
Lazebnaia, I V; Lazebnyĭ, O E; Sulimova, G E
2010-03-01
The genetic structure of the Yakutian cattle breed was studied using the following genes: bPRL (RsaI site in exon 3), bGH (AluI site in exon 5), and bPit-1 (HinfI site in exon 6). The values of observed heterozygosity were 0.36 for bPRL, 0.29 for bGH, and 0.16 for bPit-1. These values are within the range of values for this parameter established for a number of Bos taurus breeds. The results obtained show that genetic variation is preserved in this aboriginal Russian breed, despite a catastrophic reduction of the number of animals.
Solution structure of the core SMN–Gemin2 complex
Sarachan, Kathryn L.; Valentine, Kathleen G.; Gupta, Kushol; Moorman, Veronica R.; Gledhill, John M.; Bernens, Matthew; Tommos, Cecilia; Wand, A. Joshua; Van Duyne, Gregory D.
2012-01-01
In humans, assembly of spliceosomal snRNPs (small nuclear ribonucleoproteins) begins in the cytoplasm where the multi-protein SMN (survival of motor neuron) complex mediates the formation of a seven-membered ring of Sm proteins on to a conserved site of the snRNA (small nuclear RNA). The SMN complex contains the SMN protein Gemin2 and several additional Gemins that participate in snRNP biosynthesis. SMN was first identified as the product of a gene found to be deleted or mutated in patients with the neurodegenerative disease SMA (spinal muscular atrophy), the leading genetic cause of infant mortality. In the present study, we report the solution structure of Gemin2 bound to the Gemin2-binding domain of SMN determined by NMR spectroscopy. This complex reveals the structure of Gemin2, how Gemin2 binds to SMN and the roles of conserved SMN residues near the binding interface. Surprisingly, several conserved SMN residues, including the sites of two SMA patient mutations, are not required for binding to Gemin2. Instead, they form a conserved SMN/Gemin2 surface that may be functionally important for snRNP assembly. The SMN–Gemin2 structure explains how Gemin2 is stabilized by SMN and establishes a framework for structure–function studies to investigate snRNP biogenesis as well as biological processes involving Gemin2 that do not involve snRNP assembly. PMID:22607171
Marques, Alexandra T; Antunes, Agostinho; Fernandes, Pedro A; Ramos, Maria J
2006-01-01
Background The Aβ-binding alcohol dehydrogenase/17β-hydroxysteroid dehydrogenase type 10 (ABAD/HSD10) is an enzyme involved in pivotal metabolic processes and in the mitochondrial dysfunction seen in the Alzheimer's disease. Here we use comparative genomic analyses to study the evolution of the HADH2 gene encoding ABAD/HSD10 across several eukaryotic species. Results Both vertebrate and nematode HADH2 genes showed a six-exon/five-intron organization while those of the insects had a reduced and varied number of exons (two to three). Eutherian mammal HADH2 genes revealed some highly conserved noncoding regions, which may indicate the presence of functional elements, namely in the upstream region about 1 kb of the transcription start site and in the first part of intron 1. These regions were also conserved between Tetraodon and Fugu fishes. We identified a conserved alternative splicing event between human and dog, which have a nine amino acid deletion, causing the removal of the strand βF. This strand is one of the seven strands that compose the core β-sheet of the Rossman fold dinucleotide-binding motif characteristic of the short chain dehydrogenase/reductase (SDR) family members. However, the fact that the substrate binding cleft residues are retained and the existence of a shared variant between human and dog suggest that it might be functional. Molecular adaptation analyses across eutherian mammal orthologues revealed the existence of sites under positive selection, some of which being localized in the substrate-binding cleft and in the insertion 1 region on loop D (an important region for the Aβ-binding to the enzyme). Interestingly, a higher than expected number of nonsynonymous substitutions were observed between human/chimpanzee and orangutan, with six out of the seven amino acid replacements being under molecular adaptation (including three in loop D and one in the substrate binding loop). Conclusion Our study revealed that HADH2 genes maintained a reasonable conserved organization across a large evolutionary distance. The conserved noncoding regions identified among mammals and between pufferfishes, the evidence of an alternative splicing variant conserved between human and dog, and the detection of positive selection across eutherian mammals, may be of importance for further research on ABAD/HSD10 function and its implication in the Alzheimer's disease. PMID:16899120
Mulder, Kevin P.; Cortazar-Chinarro, Maria; Harris, D. James; Crottini, Angelica; Grant, Evan H. Campbell; Fleischer, Robert C.; Savage, Anna E.
2017-01-01
The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa.
Nowacka-Woszuk, J; Switonski, M
2010-02-01
Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.
Mulder, Kevin P; Cortazar-Chinarro, Maria; Harris, D James; Crottini, Angelica; Campbell Grant, Evan H; Fleischer, Robert C; Savage, Anna E
2017-11-01
The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa. Copyright © 2017 Elsevier Ltd. All rights reserved.
Vysokovsky, A; Saxena, R; Landau, M; Zivelin, A; Eskaraev, R; Rosenberg, N; Seligsohn, U; Inbal, A
2004-10-01
Hereditary factor (F)XIII deficiency is a rare bleeding disorder mostly due to mutations in FXIII A subunit. We studied the molecular basis of FXIII deficiency in patients from 10 unrelated families originating from Israel, India and Tunisia. Exons 2-15 of genomic DNA consisting of coding regions and intron/exon boundaries were amplified and sequenced. Structural analysis of the mutations was undertaken by computer modeling. Seven novel mutations were identified in the FXIIIA gene. The propositus from the Ethiopian-Jewish family was found to be a compound heterozygote for two novel mutations: a 10-bp deletion in exon 12 at nucleotides 1652-1661 (followed by 22 altered amino acids and termination codon) and Ala318Val mutation. The propositus of the Tunisian family was homozygous for C insertion after nucleotide 863 within a stretch of six cytosines of exon 7. This insertion results in generation of eight altered amino acids followed by a termination codon downstream. The propositus from Indian-Jewish origin was found to be homozygous for G to T substitution at IVS 11 [+1] resulting in skipping of exons 10 and 11. In addition to the Ala318Val mutation, three of the novel mutations identified are missense mutations: Arg260Leu, Thr398Asn and Gly210Arg each occurring in a homozygous state in an Israeli-Arab and two Indian families, respectively. Structure-function correlation analysis by computer modeling of the new missense mutations predicted that Gly210Arg will cause protein misfolding, Ala318Val and Thr398Asn will interfere with the catalytic process or protein stability, and Arg260Leu will impair dimerization.
Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne
2015-02-10
Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
Canine TCOF1; cloning, chromosome assignment and genetic analysis in dogs with different head types.
Haworth, K E; Islam, I; Breen, M; Putt, W; Makrinou, E; Binns, M; Hopkinson, D; Edwards, Y
2001-08-01
We describe the construction of a dog embryonic head/neck cDNA library and the isolation of the dog homolog of the Treacher Collins Syndrome gene, TCOF1. The protein shows a similar three-domain structure to that described for human TCOF1, but the dog gene lacks exon 10 and contains two exons not present in the human sequence. In addition, exon 19 is differentially spliced in the dog. How these structural differences relate to TCOF1 phosphorylation is discussed. Isolation of a genomic clone allowed the exon/intron boundaries to be characterized and the dog TCOF1 gene to be mapped to CF Chr 4q31, a region syntenic to human Chr 5. Genetic analysis of DNA of dogs from 13 different breeds identified nine DNA sequence variants, three of which gave rise to amino acid substitutions. Grouping dogs according to head type showed that a C396T variant, leading to a Pro117Ser substitution, is associated with skull/face shape in our dog panel. The numbers are small, but the association between the T allele and brachycephaly, broad skull/short face, was highly significant (p = 0.000024). The short period of time during which the domestic dog breeds have been established suggests that this mutation has arisen only once in the history of dog domestication.
Chang, Yan-Li; Li, Wen-Yan; Miao, Hai; Yang, Shuai-Qi; Li, Ri; Wang, Xiang; Li, Wen-Qiang; Chen, Kun-Ming
2016-02-23
Plasma membrane NADPH oxidases (NOXs) are key producers of reactive oxygen species under both normal and stress conditions in plants and they form functional subfamilies. Studies of these subfamilies indicated that they show considerable evolutionary selection. We performed a comparative genomic analysis that identified 50 ferric reduction oxidases (FRO) and 77 NOX gene homologs from 20 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots, and eudicots. Phylogenetic and structural analysis classified these FRO and NOX genes into four well-conserved groups represented as NOX, FRO I, FRO II, and FRO III. Further analysis of NOXs of phylogenetic and exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures during the evolution of NOXs family genes and which were classified into four conserved subfamilies which are represented as Sub.I, Sub.II, Sub.III, and Sub.IV. Additionally, both available global microarray data analysis and quantitative real-time PCR experiments revealed that the NOX genes in Arabidopsis and rice (Oryza sativa) have different expression patterns in different developmental stages, various abiotic stresses and hormone treatments. Finally, coexpression network analysis of NOX genes in Arabidopsis and rice revealed that NOXs have significantly correlated expression profiles with genes which are involved in plants metabolic and resistance progresses. All these results suggest that NOX family underscores the functional diversity and divergence in plants. This finding will facilitate further studies of the NOX family and provide valuable information for functional validation of this family in plants. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-08-30
... include new and existing small-scale wind energy facilities, such as single-turbine demonstration projects, as well as large, multi-turbine commercial wind facilities. Covered Species The planning partners are...-FF03E00000] Draft Midwest Wind Energy Multi-Species Habitat Conservation Plan Within Eight-State Planning...
Garuti, R; Lelli, N; Barozzini, M; Tiozzo, R; Ghisellini, M; Simone, M L; Li Volti, S; Garozzo, R; Mollica, F; Vergoni, W; Bertolini, S; Calandra, S
1996-03-01
In the present study we report two novel partial deletions of the LDL-R gene. The first (FH Siracusa), found in an FH-heterozygote, consists of a 20 kb deletion spanning from the 5' flanking region to the intron 2 of the LDL-receptor gene. The elimination of the promoter and the first two exons prevents the transcription of the deleted allele, as shown by Northern blot analysis of LDL-R mRNA isolated from the proband's fibroblasts. The second deletion (FH Reggio Emilia), which eliminates 11 nucleotides of exon 10, was also found in an FH heterozygote. The characterization of this deletion was made possible by a combination of techniques such as single strand conformation polymorphism (SSCP) analysis, direct sequence of exon 10 and cloning of the normal and deleted exon 10 from the proband's DNA. The 11 nt deletion occurs in a region of exon 10 which contains three triplets (CTG) and two four-nucleotides (CTGG) direct repeats. This structural feature might render this region more susceptible to a slipped mispairing during DNA duplication. Since this deletion causes a shift of the BamHI site at the 5' end of exon 10, a method has been devised for its rapid screening which is based on the PCR amplification of exon 10 followed by BamHI digestion. FH Reggio Emilia deletion produces a shift in the reading frame downstream from Lys458, leading to a sequence of 51 novel amino acids before the occurrence of a premature stop codon (truncated receptor). However, since RT-PCR failed to demonstrate the presence of the mutant LDL-R mRNA in proband fibroblasts, it is likely that the amount of truncated receptor produced in these cells is negligible.
Intragenic motifs regulate the transcriptional complexity of Pkhd1/PKHD1
Boddu, Ravindra; Yang, Chaozhe; O’Connor, Amber K.; Hendrickson, Robert Curtis; Boone, Braden; Cui, Xiangqin; Garcia-Gonzalez, Miguel; Igarashi, Peter; Onuchic, Luiz F.; Germino, Gregory G.
2014-01-01
Autosomal recessive polycystic kidney disease (ARPKD) results from mutations in the human PKHD1 gene. Both this gene, and its mouse ortholog, Pkhd1, are primarily expressed in renal and biliary ductal structures. The mouse protein product, fibrocystin/polyductin complex (FPC), is a 445-kDa protein encoded by a 67-exon transcript that spans >500 kb of genomic DNA. In the current study, we observed multiple alternatively spliced Pkhd1 transcripts that varied in size and exon composition in embryonic mouse kidney, liver, and placenta samples, as well as among adult mouse pancreas, brain, heart, lung, testes, liver, and kidney. Using reverse transcription PCR and RNASeq, we identified 22 novel Pkhd1 kidney transcripts with unique exon junctions. Various mechanisms of alternative splicing were observed, including exon skipping, use of alternate acceptor/donor splice sites, and inclusion of novel exons. Bioinformatic analyses identified, and exon-trapping minigene experiments validated, consensus binding sites for serine/arginine-rich proteins that modulate alternative splicing. Using site-directed mutagenesis, we examined the functional importance of selected splice enhancers. In addition, we demonstrated that many of the novel transcripts were polysome bound, thus likely translated. Finally, we determined that the human PKHD1 R760H missense variant alters a splice enhancer motif that disrupts exon splicing in vitro and is predicted to truncate the protein. Taken together, these data provide evidence of the complex transcriptional regulation of Pkhd1/PKHD1 and identified motifs that regulate its splicing. Our studies indicate that Pkhd1/PKHD1 transcription is modulated, in part by intragenic factors, suggesting that aberrant PKHD1 splicing represents an unappreciated pathogenic mechanism in ARPKD. PMID:24984783
Alcaide, Miguel; Liu, Mark
2013-01-01
Genes of the Major Histocompatibility Complex (MHC) have become an important marker for the investigation of adaptive genetic variation in vertebrates because of their critical role in pathogen resistance. However, despite significant advances in the last few years the characterization of MHC variation in non-model species still remains a challenging task due to the redundancy and high variation of this gene complex. Here we report the utility of a single pair of primers for the cross-amplification of the third exon of MHC class I genes, which encodes the more polymorphic half of the peptide-binding region (PBR), in oscine passerines (songbirds; Aves: Passeriformes), a group especially challenging for MHC characterization due to the presence of large and complex MHC multigene families. In our survey, although the primers failed to amplify exon 3 from two suboscine passerine birds, they amplified exon 3 of multiple MHC class I genes in all 16 species of oscine songbirds tested, yielding a total of 120 sequences. The 16 songbird species belong to 14 different families, primarily within the Passerida, but also in the Corvida. Using a conservative approach based on the analysis of cloned amplicons (n = 16) from each species, we found between 3 and 10 MHC sequences per individual. Each allele repertoire was highly divergent, with the overall number of polymorphic sites per species ranging from 33 to 108 (out of 264 sites) and the average number of nucleotide differences between alleles ranging from 14.67 to 43.67. Our survey in songbirds allowed us to compare macroevolutionary dynamics of exon 3 between songbirds and non-passerine birds. We found compelling evidence of positive selection acting specifically upon peptide-binding codons across birds, and we estimate the strength of diversifying selection in songbirds to be about twice that in non-passerines. Analysis using comparative methods suggest weaker evidence for a higher GC content in the 3rd codon position of exon 3 in non-passerine birds, a pattern that contrasts with among-clade GC patterns found in other avian studies and may suggests different mutational mechanisms. Our primers represent a useful tool for the characterization of functional and evolutionarily relevant MHC variation across the hyperdiverse songbirds. PMID:23781408
Martins, Rute; Proença, Daniela; Silva, Bruno; Barbosa, Cristina; Silva, Ana Luísa; Faustino, Paula; Romão, Luísa
2012-01-01
Nonsense-mediated decay (NMD) is an mRNA surveillance pathway that selectively recognizes and degrades defective mRNAs carrying premature translation-termination codons. However, several studies have shown that NMD also targets physiological transcripts that encode full-length proteins, modulating their expression. Indeed, some features of physiological mRNAs can render them NMD-sensitive. Human HFE is a MHC class I protein mainly expressed in the liver that, when mutated, can cause hereditary hemochromatosis, a common genetic disorder of iron metabolism. The HFE gene structure comprises seven exons; although the sixth exon is 1056 base pairs (bp) long, only the first 41 bp encode for amino acids. Thus, the remaining downstream 1015 bp sequence corresponds to the HFE 3′ untranslated region (UTR), along with exon seven. Therefore, this 3′ UTR encompasses an exon/exon junction, a feature that can make the corresponding physiological transcript NMD-sensitive. Here, we demonstrate that in UPF1-depleted or in cycloheximide-treated HeLa and HepG2 cells the HFE transcripts are clearly upregulated, meaning that the physiological HFE mRNA is in fact an NMD-target. This role of NMD in controlling the HFE expression levels was further confirmed in HeLa cells transiently expressing the HFE human gene. Besides, we show, by 3′-RACE analysis in several human tissues that HFE mRNA expression results from alternative cleavage and polyadenylation at four different sites – two were previously described and two are novel polyadenylation sites: one located at exon six, which confers NMD-resistance to the corresponding transcripts, and another located at exon seven. In addition, we show that the amount of HFE mRNA isoforms resulting from cleavage and polyadenylation at exon seven, although present in both cell lines, is higher in HepG2 cells. These results reveal that NMD and alternative polyadenylation may act coordinately to control HFE mRNA levels, possibly varying its protein expression according to the physiological cellular requirements. PMID:22530027
Gaucher disease: molecular heterogeneity and phenotype-genotype correlations.
Theophilus, B; Latham, T; Grabowski, G A; Smith, F I
1989-08-01
Gaucher disease (GD) is the most prevalent lysosomal storage disease. This autosomal recessive trait results from the defective activity of acid beta-glucosidase (beta-Glc). Four different exonic point mutations have been identified as causal alleles for GD. To facilitate screening for these alleles, assays were developed using allele-specific oligonucleotide hybridization to amplified genomic DNA sequences. Specifically, intron bases flanking exons 5, 9, and 10 were determined, and conditions for PCR amplification of these exons were obtained. Two different procedures were developed to distinguish signals obtained from the structural beta-Glc gene exons and those from the pseudogene. These procedures were used to determine the distribution of all known GD alleles in a population of 44 affected patients of varying phenotypes and ethnicity. The high frequency of one of the exon 9 mutations in Ashkenazi Jewish GD type 1 patients was confirmed, and, in addition, this mutation was present in ethnically diverse non-Jewish type 1 GD patients. Homozygotes (N = 5) for this allele were midly affected older individuals, and this mutant allele was not found in any patient with neuronopathic disease. The exon 10 mutation was confirmed as the predominant allele in types 2 and 3 GD. However, several type 1 GD patients, including one of Ashkenazi-Jewish heritage, also were heterozygous for this allele. The presence of this allele in type 1 patients did not correlate with the severity of clinical symptoms. The second exon 9 mutation and the exon 5 mutation were rare, since they occurred only heterozygously either in one type 2 GD patient or in two related Ashkenazi-Jewish GD patients, respectively. Although most GD patients (38 of 44) had at least one of the known mutant alleles, 57% were heterozygotes for only one of these mutations. Fourteen percent of patients were negative for all mutations. A total of 73% of GD patients had at least one unknown allele. The varying clinical phenotypes and ethnic origins of these incompletely characterized patients suggest that multiple other GD alleles exist.
Tabish, M; Clegg, R A; Rees, H H; Fisher, M J
1999-04-01
The cAMP-dependent protein kinase (protein kinase A, PK-A) is multifunctional in nature, with key roles in the control of diverse aspects of eukaryotic cellular activity. In the case of the free-living nematode, Caenorhabditis elegans, a gene encoding the PK-A catalytic subunit has been identified and two isoforms of this subunit, arising from a C-terminal alternative-splicing event, have been characterized [Gross, Bagchi, Lu and Rubin (1990) J. Biol. Chem. 265, 6896-6907]. Here we report the occurrence of N-terminal alternative-splicing events that, in addition to generating a multiplicity of non-myristoylatable isoforms, also generate the myristoylated variant(s) of the catalytic subunit that we have recently characterized [Aspbury, Fisher, Rees and Clegg (1997) Biochem. Biophys. Res. Commun. 238, 523-527]. The gene spans more than 36 kb and is divided into a total of 13 exons. Each of the mature transcripts contains only 7 exons. In addition to the already characterized exon 1, the 5'-untranslated region and first intron actually contain 5 other exons, any one of which may be alternatively spliced on to exon 2 at the 5' end of the pre-mRNA. This N-terminal alternative splicing occurs in combination with either of the already characterized C-terminal alternative exons. Thus, C. elegans expresses at least 12 different isoforms of the catalytic subunit of PK-A. The significance of this unprecedented structural diversity in the family of PK-A catalytic subunits is discussed.
Lakshmi, G. Girija; Ghosh, Sushmita; Jones, Gabriel P.; Parikh, Roshni; Rawlins, Bridgette A.; Vaughn, Jack C.
2014-01-01
Alternative splicing greatly enhances the diversity of proteins encoded by eukaryotic genomes, and is also important in gene expression control. In contrast to the great depth of knowledge as to molecular mechanisms in the splicing pathway itself, relatively little is known about the regulatory events behind this process. The 5′-UTR and 3′-UTR in pre-mRNAs play a variety of roles in controlling eukaryotic gene expression, including translational modulation, and nearly 4,000 of the roughly 14,000 protein coding genes in Drosophila contain introns of unknown functional significance in their 5′-UTR. Here we report the results of an RNA electrophoretic mobility shift analysis of Drosophila rnp-4f 5′-UTR intron 0 splicing regulatory proteins. The pre-mRNA potential regulatory element consists of an evolutionarily-conserved 177-nt stem-loop arising from pairing of intron 0 with part of adjacent exon 2. Incubation of in vitro transcribed probe with embryo protein extract is shown to result in two shifted RNA-protein bands, and protein extract from a dADAR null mutant fly line results in only one shifted band. A mutated stem-loop in which the conserved exon 2 primary sequence is changed but secondary structure maintained by introducing compensatory base changes results in diminished band shifts. To test the hypothesis that dADAR plays a role in intron splicing regulation in vivo, levels of unspliced rnp-4f mRNA in dADAR mutant were compared to wild-type via real-time qRT-PCR. The results show that during embryogenesis unspliced rnp-4f mRNA levels fall by up to 85% in the mutant, in support of the hypothesis. Taken together, these results demonstrate a novel role for dADAR protein in rnp-4f 5′-UTR alternative intron splicing regulation which is consistent with a previously proposed model. PMID:23026215
ALDH1A2 (RALDH2) genetic variation in human congenital heart disease
2009-01-01
Background Signaling by the vitamin A-derived morphogen retinoic acid (RA) is required at multiple steps of cardiac development. Since conversion of retinaldehyde to RA by retinaldehyde dehydrogenase type II (ALDH1A2, a.k.a RALDH2) is critical for cardiac development, we screened patients with congenital heart disease (CHDs) for genetic variation at the ALDH1A2 locus. Methods One-hundred and thirty-three CHD patients were screened for genetic variation at the ALDH1A2 locus through bi-directional sequencing. In addition, six SNPs (rs2704188, rs1441815, rs3784259, rs1530293, rs1899430) at the same locus were studied using a TDT-based association approach in 101 CHD trios. Observed mutations were modeled through molecular mechanics (MM) simulations using the AMBER 9 package, Sander and Pmemd programs. Sequence conservation of observed mutations was evaluated through phylogenetic tree construction from ungapped alignments containing ALDH8 s, ALDH1Ls, ALDH1 s and ALDH2 s. Trees were generated by the Neighbor Joining method. Variations potentially affecting splicing mechanisms were cloned and functional assays were designed to test splicing alterations using the pSPL3 splicing assay. Results We describe in Tetralogy of Fallot (TOF) the mutations Ala151Ser and Ile157Thr that change non-polar to polar residues at exon 4. Exon 4 encodes part of the highly-conserved tetramerization domain, a structural motif required for ALDH oligomerization. Molecular mechanics simulation studies of the two mutations indicate that they hinder tetramerization. We determined that the SNP rs16939660, previously associated with spina bifida and observed in patients with TOF, does not affect splicing. Moreover, association studies performed with classical models and with the transmission disequilibrium test (TDT) design using single marker genotype, or haplotype information do not show differences between cases and controls. Conclusion In summary, our screen indicates that ALDH1A2 genetic variation is present in TOF patients, suggesting a possible causal role for this gene in rare cases of human CHD, but does not support the hypothesis that variation at the ALDH1A2 locus is a significant modifier of the risk for CHD in humans. PMID:19886994
Structure of the horseradish peroxidase isozyme C genes.
Fujiyama, K; Takemura, H; Shibayama, S; Kobayashi, K; Choi, J K; Shinmyo, A; Takano, M; Yamada, Y; Okada, H
1988-05-02
We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HPR C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clone, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)] with additional sequences at the N and C terminal. All three inserts in the cDNA clones, pSK1, pSK2 and pSK3, coded the same size of peptide (308 amino acid residues) if these are processed in the same way, and the amino acid sequence were homologous to each other by 91-94%. Functional amino acids, including His40, His170, Tyr185 and Arg183 and S-S-bond-forming Cys, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were tandem on the chromosomal DNA and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.
Genetic basis of clinical catecholamine disorders
NASA Technical Reports Server (NTRS)
Garland, Emily M.; Hahn, Maureen K.; Ketch, Terry P.; Keller, Nancy R.; Kim, Chun-Hyung; Kim, Kwang-Soo; Biaggioni, Italo; Shannon, John R.; Blakely, Randy D.; Robertson, David
2002-01-01
Norepinephrine and epinephrine are critical determinants of minute-to-minute regulation of blood pressure. Here we review the characterization of two syndromes associated with a genetic abnormality in the noradrenergic pathway. In 1986, we reported a congenital syndrome of undetectable tissue and circulating levels of norepinephrine and epinephrine, elevated levels of dopamine, and absence of dopamine-beta-hydroxylase (DBH). These patients appeared with ptosis and severe orthostatic hypotension and lacked sympathetic noradrenergic function. In two persons with DBH deficiency, we identified seven novel polymorphisms. Both patients are compound heterozygotes for a variant that affects expression of DBH protein via impairment of splicing. Patient 1 also has a missense mutation in DBH exon 2, and patient 2 carries missense mutations in exons 1 and 6. Orthostatic intolerance is a common syndrome affecting young women, presenting with orthostatic tachycardia and symptoms of cerebral hypoperfusion on standing. We tested the hypothesis that abnormal norepinephrine transporter (NET) function might contribute to its etiology. In our proband, we found an elevated plasma norepinephrine with standing that was disproportionate to the increase in levels of dihydroxphenylglycol, as well as impaired norepinephrine clearance and tyramine resistance. Studies of NET gene structure revealed a coding mutation converting a conserved alanine residue in transmembrane domain 9 to proline. Analysis of the protein produced by the mutant cDNA demonstrated greater than 98% reduction in activity relative to normal. The finding of genetic mutations responsible for DBH deficiency and orthostatic intolerance leads us to believe that genetic causes of other autonomic disorders will be found, enabling us to design more effective therapeutic interventions.
Escobar-Aguirre, Matias; Zhang, Hong; Jamieson-Lucy, Allison; Mullins, Mary C
2017-09-01
Animal-vegetal (AV) polarity of most vertebrate eggs is established during early oogenesis through the formation and disassembly of the Balbiani Body (Bb). The Bb is a structure conserved from insects to humans that appears as a large granule, similar to a mRNP granule composed of mRNA and proteins, that in addition contains mitochondria, ER and Golgi. The components of the Bb, which have amyloid-like properties, include germ cell and axis determinants of the embryo that are anchored to the vegetal cortex upon Bb disassembly. Our lab discovered in zebrafish the only gene known to function in Bb disassembly, microtubule-actin crosslinking factor 1a (macf1a). Macf1 is a conserved, giant multi-domain cytoskeletal linker protein that can interact with microtubules (MTs), actin filaments (AF), and intermediate filaments (IF). In macf1a mutant oocytes the Bb fails to dissociate, the nucleus is acentric, and AV polarity of the oocyte and egg fails to form. The cytoskeleton-dependent mechanism by which Macf1a regulates Bb mRNP granule dissociation was unknown. We found that disruption of AFs phenocopies the macf1a mutant phenotype, while MT disruption does not. We determined that cytokeratins (CK), a type of IF, are enriched in the Bb. We found that Macf1a localizes to the Bb, indicating a direct function in regulating its dissociation. We thus tested if Macf1a functions via its actin binding domain (ABD) and plectin repeat domain (PRD) to integrate cortical actin and Bb CK, respectively, to mediate Bb dissociation at the oocyte cortex. We developed a CRISPR/Cas9 approach to delete the exons encoding these domains from the macf1a endogenous locus, while maintaining the open reading frame. Our analysis shows that Macf1a functions via its ABD to mediate Bb granule dissociation and nuclear positioning, while the PRD is dispensable. We propose that Macf1a does not function via its canonical mechanism of linking two cytoskeletal systems together in dissociating the Bb. Instead our results suggest that Macf1a functions by linking one cytoskeletal system, cortical actin, to another structure, the Bb, where Macf1a is localized. Through this novel linking process, it dissociates the Bb at the oocyte cortex, thus specifying the AV axis of the oocyte and future egg. To our knowledge, this is also the first study to use genome editing to unravel the module-dependent function of a cytoskeletal linker.
Zhang, Hong; Jamieson-Lucy, Allison
2017-01-01
Animal-vegetal (AV) polarity of most vertebrate eggs is established during early oogenesis through the formation and disassembly of the Balbiani Body (Bb). The Bb is a structure conserved from insects to humans that appears as a large granule, similar to a mRNP granule composed of mRNA and proteins, that in addition contains mitochondria, ER and Golgi. The components of the Bb, which have amyloid-like properties, include germ cell and axis determinants of the embryo that are anchored to the vegetal cortex upon Bb disassembly. Our lab discovered in zebrafish the only gene known to function in Bb disassembly, microtubule-actin crosslinking factor 1a (macf1a). Macf1 is a conserved, giant multi-domain cytoskeletal linker protein that can interact with microtubules (MTs), actin filaments (AF), and intermediate filaments (IF). In macf1a mutant oocytes the Bb fails to dissociate, the nucleus is acentric, and AV polarity of the oocyte and egg fails to form. The cytoskeleton-dependent mechanism by which Macf1a regulates Bb mRNP granule dissociation was unknown. We found that disruption of AFs phenocopies the macf1a mutant phenotype, while MT disruption does not. We determined that cytokeratins (CK), a type of IF, are enriched in the Bb. We found that Macf1a localizes to the Bb, indicating a direct function in regulating its dissociation. We thus tested if Macf1a functions via its actin binding domain (ABD) and plectin repeat domain (PRD) to integrate cortical actin and Bb CK, respectively, to mediate Bb dissociation at the oocyte cortex. We developed a CRISPR/Cas9 approach to delete the exons encoding these domains from the macf1a endogenous locus, while maintaining the open reading frame. Our analysis shows that Macf1a functions via its ABD to mediate Bb granule dissociation and nuclear positioning, while the PRD is dispensable. We propose that Macf1a does not function via its canonical mechanism of linking two cytoskeletal systems together in dissociating the Bb. Instead our results suggest that Macf1a functions by linking one cytoskeletal system, cortical actin, to another structure, the Bb, where Macf1a is localized. Through this novel linking process, it dissociates the Bb at the oocyte cortex, thus specifying the AV axis of the oocyte and future egg. To our knowledge, this is also the first study to use genome editing to unravel the module-dependent function of a cytoskeletal linker. PMID:28880872
Gatto, Alberto; Torroja-Fungairiño, Carlos; Mazzarotto, Francesco; Cook, Stuart A; Barton, Paul J R; Sánchez-Cabo, Fátima; Lara-Pezzi, Enrique
2014-04-01
Alternative splicing is the main mechanism governing protein diversity. The recent developments in RNA-Seq technology have enabled the study of the global impact and regulation of this biological process. However, the lack of standardized protocols constitutes a major bottleneck in the analysis of alternative splicing. This is particularly important for the identification of exon-exon junctions, which is a critical step in any analysis workflow. Here we performed a systematic benchmarking of alignment tools to dissect the impact of design and method on the mapping, detection and quantification of splice junctions from multi-exon reads. Accordingly, we devised a novel pipeline based on TopHat2 combined with a splice junction detection algorithm, which we have named FineSplice. FineSplice allows effective elimination of spurious junction hits arising from artefactual alignments, achieving up to 99% precision in both real and simulated data sets and yielding superior F1 scores under most tested conditions. The proposed strategy conjugates an efficient mapping solution with a semi-supervised anomaly detection scheme to filter out false positives and allows reliable estimation of expressed junctions from the alignment output. Ultimately this provides more accurate information to identify meaningful splicing patterns. FineSplice is freely available at https://sourceforge.net/p/finesplice/.
FoxP2 in song-learning birds and vocal-learning mammals.
Webb, D M; Zhang, J
2005-01-01
FoxP2 is the first identified gene that is specifically involved in speech and language development in humans. Population genetic studies of FoxP2 revealed a selective sweep in recent human history associated with two amino acid substitutions in exon 7. Avian song learning and human language acquisition share many behavioral and neurological similarities. To determine whether FoxP2 plays a similar role in song-learning birds, we sequenced exon 7 of FoxP2 in multiple song-learning and nonlearning birds. We show extreme conservation of FoxP2 sequences in birds, including unusually low rates of synonymous substitutions. However, no amino acid substitutions are shared between the song-learning birds and humans. Furthermore, sequences from vocal-learning whales, dolphins, and bats do not share the human-unique substitutions. While FoxP2 appears to be under strong functional constraints in mammals and birds, we find no evidence for its role during the evolution of vocal learning in nonhuman animals as in humans.
NASA Technical Reports Server (NTRS)
Kretsinger, R. H.; Nakayama, S.
1993-01-01
In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Kiryu, Hisanori; Kin, Taishin; Asai, Kiyoshi
2007-02-15
Recent transcriptomic studies have revealed the existence of a considerable number of non-protein-coding RNA transcripts in higher eukaryotic cells. To investigate the functional roles of these transcripts, it is of great interest to find conserved secondary structures from multiple alignments on a genomic scale. Since multiple alignments are often created using alignment programs that neglect the special conservation patterns of RNA secondary structures for computational efficiency, alignment failures can cause potential risks of overlooking conserved stem structures. We investigated the dependence of the accuracy of secondary structure prediction on the quality of alignments. We compared three algorithms that maximize the expected accuracy of secondary structures as well as other frequently used algorithms. We found that one of our algorithms, called McCaskill-MEA, was more robust against alignment failures than others. The McCaskill-MEA method first computes the base pairing probability matrices for all the sequences in the alignment and then obtains the base pairing probability matrix of the alignment by averaging over these matrices. The consensus secondary structure is predicted from this matrix such that the expected accuracy of the prediction is maximized. We show that the McCaskill-MEA method performs better than other methods, particularly when the alignment quality is low and when the alignment consists of many sequences. Our model has a parameter that controls the sensitivity and specificity of predictions. We discussed the uses of that parameter for multi-step screening procedures to search for conserved secondary structures and for assigning confidence values to the predicted base pairs. The C++ source code that implements the McCaskill-MEA algorithm and the test dataset used in this paper are available at http://www.ncrna.org/papers/McCaskillMEA/. Supplementary data are available at Bioinformatics online.
Reggiani, Claudio; Coppens, Sandra; Sekhara, Tayeb; Dimov, Ivan; Pichon, Bruno; Lufin, Nicolas; Addor, Marie-Claude; Belligni, Elga Fabia; Digilio, Maria Cristina; Faletra, Flavio; Ferrero, Giovanni Battista; Gerard, Marion; Isidor, Bertrand; Joss, Shelagh; Niel-Bütschi, Florence; Perrone, Maria Dolores; Petit, Florence; Renieri, Alessandra; Romana, Serge; Topa, Alexandra; Vermeesch, Joris Robert; Lenaerts, Tom; Casimir, Georges; Abramowicz, Marc; Bontempi, Gianluca; Vilain, Catheline; Deconinck, Nicolas; Smits, Guillaume
2017-07-19
Tissue-specific integrative omics has the potential to reveal new genic elements important for developmental disorders. Two pediatric patients with global developmental delay and intellectual disability phenotype underwent array-CGH genetic testing, both showing a partial deletion of the DLG2 gene. From independent human and murine omics datasets, we combined copy number variations, histone modifications, developmental tissue-specific regulation, and protein data to explore the molecular mechanism at play. Integrating genomics, transcriptomics, and epigenomics data, we describe two novel DLG2 promoters and coding first exons expressed in human fetal brain. Their murine conservation and protein-level evidence allowed us to produce new DLG2 gene models for human and mouse. These new genic elements are deleted in 90% of 29 patients (public and in-house) showing partial deletion of the DLG2 gene. The patients' clinical characteristics expand the neurodevelopmental phenotypic spectrum linked to DLG2 gene disruption to cognitive and behavioral categories. While protein-coding genes are regarded as well known, our work shows that integration of multiple omics datasets can unveil novel coding elements. From a clinical perspective, our work demonstrates that two new DLG2 promoters and exons are crucial for the neurodevelopmental phenotypes associated with this gene. In addition, our work brings evidence for the lack of cross-annotation in human versus mouse reference genomes and nucleotide versus protein databases.
Fukami, Maki; Naiki, Yasuhiro; Muroya, Koji; Hamajima, Takashi; Soneda, Shun; Horikawa, Reiko; Jinno, Tomoko; Katsumi, Momori; Nakamura, Akie; Asakura, Yumi; Adachi, Masanori; Ogata, Tsutomu; Kanzaki, Susumu
2015-09-01
Pseudoautosomal region 1 (PAR1) contains SHOX, in addition to seven highly conserved non-coding DNA elements (CNEs) with cis-regulatory activity. Microdeletions involving SHOX exons 1-6a and/or the CNEs result in idiopathic short stature (ISS) and Leri-Weill dyschondrosteosis (LWD). Here, we report six rare copy-number variations (CNVs) in PAR1 identified through copy-number analyzes of 245 ISS/LWD patients and 15 unaffected individuals. The six CNVs consisted of three microduplications encompassing SHOX and some of the CNEs, two microduplications in the SHOX 3'-region affecting one or four of the downstream CNEs, and a microdeletion involving SHOX exon 6b and its neighboring CNE. The amplified DNA fragments of two SHOX-containing duplications were detected at chromosomal regions adjacent to the original positions. The breakpoints of a SHOX-containing duplication resided within Alu repeats. A microduplication encompassing four downstream CNEs was identified in an unaffected father-daughter pair, whereas the other five CNVs were detected in ISS patients. These results suggest that microduplications involving SHOX cause ISS by disrupting the cis-regulatory machinery of this gene and that at least some of microduplications in PAR1 arise from Alu-mediated non-allelic homologous recombination. The pathogenicity of other rare PAR1-linked CNVs, such as CNE-containing microduplications and exon 6b-flanking microdeletions, merits further investigation.
Petrak, Borivoj; Bendova, Sarka; Seeman, Tomas; Klein, Tibor; Lisy, Jiri; Zatrapa, Tomas; Marikova, Tana
2007-12-01
Neurofibromatosis von Recklinghausen type 1 (NF1) is an autosomal dominant neurocutaneous disorder affecting one in 3 000-4 000 individuals. Mid-aortic syndrome (MAS) is a rare condition characterized by segmental narrowing of abdominal aorta and stenosis of its major branches - mainly renal arteries, including manifestation of renovascular hypertension. MAS can be caused by different diseases, including NF1. A 9 years old girl with primary diagnosis of NF1 combined with renovascular hypertension due to MAS, suffered of bilateral optic and chiasm glioma, pubertas praecox, speech disorder, light mental retardation and scoliosis. We have found a mutation in exone 34 of the NF1 gene (17q11.2). Her father has been also diagnosed with NF1 and hypertension developed at early age. He has the same mutation in exone 34 of NF1 gene. The girl is currently treated with conservative antihypertensive medication with positive effect. Bilateral optic and chiasm glioma are asymptomatic at the time and they had been without progress over period of time. Any vascular surgery, neurosurgical and oncological therapy are not indicated at the present time. This article is a summary of clinical findings in patient with NF1 due to NF1 gene mutation in exone 34. It confirms the importance of complex multidisciplinar approach to examination and taking care of NF1 patients and their families.
Multiple network alignment via multiMAGNA+.
Vijayan, Vipin; Milenkovic, Tijana
2017-08-21
Network alignment (NA) aims to find a node mapping that identifies topologically or functionally similar network regions between molecular networks of different species. Analogous to genomic sequence alignment, NA can be used to transfer biological knowledge from well- to poorly-studied species between aligned network regions. Pairwise NA (PNA) finds similar regions between two networks while multiple NA (MNA) can align more than two networks. We focus on MNA. Existing MNA methods aim to maximize total similarity over all aligned nodes (node conservation). Then, they evaluate alignment quality by measuring the amount of conserved edges, but only after the alignment is constructed. Directly optimizing edge conservation during alignment construction in addition to node conservation may result in superior alignments. Thus, we present a novel MNA method called multiMAGNA++ that can achieve this. Indeed, multiMAGNA++ outperforms or is on par with existing MNA methods, while often completing faster than existing methods. That is, multiMAGNA++ scales well to larger network data and can be parallelized effectively. During method evaluation, we also introduce new MNA quality measures to allow for more fair MNA method comparison compared to the existing alignment quality measures. MultiMAGNA++ code is available on the method's web page at http://nd.edu/~cone/multiMAGNA++/.
Structure and polymorphism of the mouse myelin/oligodendrocyte glycoprotein gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daubas, P.; Pham-Dinh, D.; Dautigny, A.
1994-09-01
The authors have isolated and characterized genomic clones containing the mouse myelin/oligodendrocyte glycoprotein (MOG) gene. It spans a region of 12.5 kb and consists of eight exons. Its exon-intron structure differs from that of classical MHC-class I genes, with which it is linked in the mouse genome. Nucleotide sequencing of the 5{prime} flanking region revelas that it contains several putative protein-binding sites, some of them in common with other myelin gene promoters. One intragenic polymorphism has been identified: it consists of a GA repeat, defining at least three alleles in mouse inbred strains, and is easily detectable using the polymerasemore » chain reaction method.« less
Das, Dhanjit Kumar; Raha, Sarbani; Sanghavi, Daksha; Maitra, Anurupa; Udani, Vrajesh
2013-02-15
Rett syndrome (RTT) is an X-linked neurodevelopmental disorder, primarily affecting females and characterized by developmental regression, epilepsy, stereotypical hand movements, and motor abnormalities. Its prevalence is about 1 in 10,000 female births. Rett syndrome is caused by mutations within methyl CpG-binding protein 2 (MECP2) gene. Over 270 individual nucleotide changes which cause pathogenic mutations have been reported. However, eight most commonly occurring missense and nonsense mutations account for almost 70% of all patients. We screened 90 individuals with Rett syndrome phenotype. A total of 19 different MECP2 mutations and polymorphisms were identified in 27 patients. Of the 19 mutations, we identified 7 (37%) frameshift, 6 (31%) nonsense, 14 (74%) missense mutations and one duplication (5%). The most frequent pathogenic changes were: missense p.T158M (11%), p.R133C (7.4%), and p.R306C (7.4%) and nonsense p.R168X (11%), p.R255X (7.4%) mutations. We have identified two novel mutations namely p.385-388delPLPP present in atypical patients and p.Glu290AlafsX38 present in a classical patient of Rett syndrome. Sequence homology for p.385-388delPLPP mutation revealed that these 4 amino acids were conserved across mammalian species. This indicated the importance of these 4 amino acids in structure and function of the protein. A novel variant p.T479T has also been identified in a patient with atypical Rett syndrome. A total of 62 (69%) patients remained without molecular genetics diagnosis that necessitates further search for mutations in other genes like CDKL5 and FOXG1 that are known to cause Rett phenotype. The majority of mutations are detected in exon 4 and only one mutation was present in exon 3. Therefore, our study suggests the need for screening exon 4 of MECP2 as first line of diagnosis in these patients. Copyright © 2012 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Strasberg, P.M.; Liede, H.A.; Stein, T.
1994-09-01
Norrie disease (MIM 310600; ND) is an X-linked (Xp11.2-11.3) neurodevelopmental disorder characterized by congenital blindness, retinal dysplasia with pseudoglioma formation, and often associated with progressive mental retardation and deafness. The ND gene, comprised of 3 exons, codes for an evolutionarily conserved protein of 133 amino acids. We have analyzed 8 pedigrees segregating Norrie disease. Although microdeletions have been detected in several typical ND patients, Southern blot analysis with probes L1.28, MAO-A, MAO-B, TIMP-3.9X, pTak8, and M27{beta} failed to detect such deletions in these 8 ND pedigrees. With the cloning of the ND gene, PCR analysis of all 3 exons likewisemore » did not reveal any insertions or deletions. SSCP analysis ({sup 35}S-dNTP PCR) on PCR products of exon 3 showed a band shift for 1 patient. Repeat `cold` SSCP on minigels (3 inches x 4 inches) followed by liver staining was confirmatory. Direct sequencing revealed a G{r_arrow}A transition at nucleotide 610 corresponding to amino acid 65, changing Cys to Tyr. The mutation created an RsaI site, such that the uncut, normal, and mutant PCR products (using the same PCR primers) were 297 bp, 243 and 54 bp, and 177, 72 and 54 bp respectively. Affected males in the relevant pedigree had restricted PCR products of 177, 72 and 54 bp, carrier mothers 243, 177, 72, and 54 bp, and normals, including 30 unrelated individuals, 243 and 54 bp. Recent evidence indicates that the ND gene has a C-terminal domain homologous to that of TGF{beta}, thus identifying it as putative peptide growth factor, providing a monogenic disease model for the family of cystine knot growth factors. This is the first report of a mutation in Cys 2, critical for crosslinking to Cys 5 forming a disulphide bridge which holds the cystine knot growth factor tertiary structure together.« less
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong
2014-10-16
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.
Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C.; Zhang, Baohong
2016-01-01
TCP proteins are plant-specific transcription factors implicated to perform a variety of physiological functions during plant growth and development. In the current study, we performed for the first time the comprehensive analysis of TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributing across all of the thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development. PMID:26857372
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong
2014-01-01
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260
Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C; Zhang, Baohong
2016-02-09
TCP proteins are plant-specific transcription factors implicated to perform a variety of physiological functions during plant growth and development. In the current study, we performed for the first time the comprehensive analysis of TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributing across all of the thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development.
Hong, Soon Gyu; Cramer, Robert A; Lawrence, Christopher B; Pryor, Barry M
2005-02-01
A gene for the Alternaria major allergen, Alt a 1, was amplified from 52 species of Alternaria and related genera, and sequence information was used for phylogenetic study. Alt a 1 gene sequences evolved 3.8 times faster and contained 3.5 times more parsimony-informative sites than glyceraldehyde-3-phosphate dehydrogenase (gpd) sequences. Analyses of Alt a 1 gene and gpd exon sequences strongly supported grouping of Alternaria spp. and related taxa into several species-groups described in previous studies, especially the infectoria, alternata, porri, brassicicola, and radicina species-groups and the Embellisia group. The sonchi species-group was newly suggested in this study. Monophyly of the Nimbya group was moderately supported, and monophyly of the Ulocladium group was weakly supported. Relationships among species-groups and among closely related species of the same species-group were not fully resolved. However, higher resolution could be obtained using Alt a 1 sequences or a combined dataset than using gpd sequences alone. Despite high levels of variation in amino acid sequences, results of in silico prediction of protein secondary structure for Alt a 1 demonstrated a high degree of structural similarity for most of the species suggesting a conservation of function.
Ratnam, Kavitha; Birch, David G.; Sundquist, Sanna M.; Lucero, Anna S.; Zhang, Yuhua; Meltzer, Meira; Smaoui, Nizar; Roorda, Austin
2011-01-01
Purpose. To evaluate macular cone structure in patients with X-linked retinoschisis (XLRS) caused by mutations in exon 6 of the RS1 gene. Methods. High-resolution macular images were obtained with adaptive optics scanning laser ophthalmoscopy (AOSLO) and spectral domain optical coherence tomography (SD-OCT) in two patients with XLRS and 27 age-similar healthy subjects. Retinal structure was correlated with best-corrected visual acuity, kinetic and static perimetry, fundus-guided microperimetry, full-field electroretinography (ERG), and multifocal ERG. The six coding exons and the flanking intronic regions of the RS1 gene were sequenced in each patient. Results. Two unrelated males, ages 14 and 29, with visual acuity ranging from 20/32 to 20/63, had macular schisis with small relative central scotomas in each eye. The mixed scotopic ERG b-wave was reduced more than the a-wave. SD-OCT showed schisis cavities in the outer and inner nuclear and plexiform layers. Cone spacing was increased within the largest foveal schisis cavities but was normal elsewhere. In each patient, a mutation in exon 6 of the RS1 gene was identified and was predicted to change the amino acid sequence in the discoidin domain of the retinoschisin protein. Conclusions. AOSLO images of two patients with molecularly characterized XLRS revealed increased cone spacing and abnormal packing in the macula of each patient, but cone coverage and function were near normal outside the central foveal schisis cavities. Although cone density is reduced, the preservation of wave-guiding cones at the fovea and eccentric macular regions has prognostic and therapeutic implications for XLRS patients with foveal schisis. (Clinical Trials.gov number, NCT00254605.) PMID:22110067
Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes1
Rombauts, Stephane; Florquin, Kobe; Lescot, Magali; Marchal, Kathleen; Rouzé, Pierre; Van de Peer, Yves
2003-01-01
The identification of promoters and their regulatory elements is one of the major challenges in bioinformatics and integrates comparative, structural, and functional genomics. Many different approaches have been developed to detect conserved motifs in a set of genes that are either coregulated or orthologous. However, although recent approaches seem promising, in general, unambiguous identification of regulatory elements is not straightforward. The delineation of promoters is even harder, due to its complex nature, and in silico promoter prediction is still in its infancy. Here, we review the different approaches that have been developed for identifying promoters and their regulatory elements. We discuss the detection of cis-acting regulatory elements using word-counting or probabilistic methods (so-called “search by signal” methods) and the delineation of promoters by considering both sequence content and structural features (“search by content” methods). As an example of search by content, we explored in greater detail the association of promoters with CpG islands. However, due to differences in sequence content, the parameters used to detect CpG islands in humans and other vertebrates cannot be used for plants. Therefore, a preliminary attempt was made to define parameters that could possibly define CpG and CpNpG islands in Arabidopsis, by exploring the compositional landscape around the transcriptional start site. To this end, a data set of more than 5,000 gene sequences was built, including the promoter region, the 5′-untranslated region, and the first introns and coding exons. Preliminary analysis shows that promoter location based on the detection of potential CpG/CpNpG islands in the Arabidopsis genome is not straightforward. Nevertheless, because the landscape of CpG/CpNpG islands differs considerably between promoters and introns on the one side and exons (whether coding or not) on the other, more sophisticated approaches can probably be developed for the successful detection of “putative” CpG and CpNpG islands in plants. PMID:12857799
Peter, Annette; Khandekar, Shaunak; Deakin, Janine E; Stick, Reimer
2015-11-01
Platypus (Ornithorhynchus anatinus) holds a unique phylogenetic position at the base of the mammalian lineage due to an amalgamation of mammalian and sauropsid-like features. Here we describe the set of four lamin genes for platypus. Lamins are major components of the nuclear lamina, which constitutes a main component of the nucleoskeleton and is involved in a wide range of nuclear functions. Vertebrate evolution was accompanied by an increase in the number of lamin genes from a single gene in their closest relatives, the tunicates and cephalochordates, to four genes in the vertebrate lineage. Of the four genes the LIII gene is characterized by the presence of two alternatively spliced CaaX-encoding exons. In amphibians and fish LIII is the major lamin protein in oocytes and early embryos. The LIII gene is conserved throughout the vertebrate lineage, with the notable exception of marsupials and placental mammals, which have lost the LIII gene. Here we show that platypus has retained an LIII gene, albeit with a significantly altered structure and with a radically different expression pattern. The platypus LIII gene contains only a single CaaX-encoding exon and the head domain together with coil 1a and part of coil1b of the platypus LIII protein is replaced by a novel short non-helical N-terminus. It is expressed exclusively in the testis. These features resemble those of male germ cell-specific lamins in placental mammals, in particular those of lamin C2. Our data suggest (i) that the specific functions of LIII, which it fulfills in all other vertebrates, is no longer required in mammals and (ii) once it had been freed from these functions has undergone structural alterations and has adopted a new functionality in monotremes. Copyright © 2015 Elsevier GmbH. All rights reserved.
Chang, M X; Nie, P; Xie, H X; Sun, B J; Gao, Q
2005-01-01
The cDNAs and genes of two different types of leucine-rich repeat-containing proteins from grass carp (Ctenopharyngodon idellus) were cloned. Homology search revealed that the two genes, designated as GC-GARP and GC-LRG, have 37% and 32% deduced amino-acid sequence similarities with human glycoprotein A repetitions predominant precursor (GARP) and leucine-rich alpha2-glycoprotein (LRG), respectively. The cDNAs of GC-GARP and GC-LRG encoded 664 and 339 amino acid residues, respectively. GC-GARP and GC-LRG contain many distinct structural and/or functional motifs of the leucine-rich repeat (LRR) subfamily, such as multiple conserved 11-residue segments with the consensus sequence LxxLxLxxN/CxL (x can be any amino acid). The genes GC-GARP and GC-LRG consist of two exons, with 4,782 bp and 2,119 bp in total length, respectively. The first exon of each gene contains a small 5'-untranslated region and partial open reading frame. The putative promoter region of GC-GARP was found to contain transcription factor binding sites for GATA-1, IRF4, Oct-1, IRF-7, IRF-1, AP1, GATA-box and NFAT, and the promoter region of GC-LRG for MYC-MAX, MEIS1, ISRE, IK3, HOXA9 and C/EBP alpha. Phylogenetic analysis showed that GC-GARP and mammalian GARPs were clustered into one branch, while GC-LRG and mammalian LRGs were in another branch. The GC-GARP gene was only detected in head kidney, and GC-LRG in the liver, spleen and heart in the copepod (Sinergasilus major)-infected grass carp, indicating the induction of gene expression by the parasite infection. The results obtained in the present study provide insight into the structure of fish LRR genes, and further study should be carried out to understand the importance of LRR proteins in host-pathogen interactions.
Nonlinear Conservation Laws and Finite Volume Methods
NASA Astrophysics Data System (ADS)
Leveque, Randall J.
Introduction Software Notation Classification of Differential Equations Derivation of Conservation Laws The Euler Equations of Gas Dynamics Dissipative Fluxes Source Terms Radiative Transfer and Isothermal Equations Multi-dimensional Conservation Laws The Shock Tube Problem Mathematical Theory of Hyperbolic Systems Scalar Equations Linear Hyperbolic Systems Nonlinear Systems The Riemann Problem for the Euler Equations Numerical Methods in One Dimension Finite Difference Theory Finite Volume Methods Importance of Conservation Form - Incorrect Shock Speeds Numerical Flux Functions Godunov's Method Approximate Riemann Solvers High-Resolution Methods Other Approaches Boundary Conditions Source Terms and Fractional Steps Unsplit Methods Fractional Step Methods General Formulation of Fractional Step Methods Stiff Source Terms Quasi-stationary Flow and Gravity Multi-dimensional Problems Dimensional Splitting Multi-dimensional Finite Volume Methods Grids and Adaptive Refinement Computational Difficulties Low-Density Flows Discrete Shocks and Viscous Profiles Start-Up Errors Wall Heating Slow-Moving Shocks Grid Orientation Effects Grid-Aligned Shocks Magnetohydrodynamics The MHD Equations One-Dimensional MHD Solving the Riemann Problem Nonstrict Hyperbolicity Stiffness The Divergence of B Riemann Problems in Multi-dimensional MHD Staggered Grids The 8-Wave Riemann Solver Relativistic Hydrodynamics Conservation Laws in Spacetime The Continuity Equation The 4-Momentum of a Particle The Stress-Energy Tensor Finite Volume Methods Multi-dimensional Relativistic Flow Gravitation and General Relativity References
Molecular cloning, structure, and chromosomal localization of the mouse LIM/homeobox gene Lhx5
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bertuzzi, S.; Sheng, Hui Z.; Westphal, H.
1996-09-01
Lhx5, the mouse ortholog of the Xenopus Xlim-5, is a LIM/homeobox gene expressed in the central nervous system during both embryonic development and adulthood. During development its domain of expression is mainly localized at the most anterior portion of the neural tube, and it precedes the morphological differentiation of the forebrain; for this reason we believe that Lhx5 could play an important role in forebrain patterning. Here we present the structural organization and the chromosomal localization of the Lhx5 gene. The gene is composed of five exons spanning more than 10 kb of genomic sequence. The first and second LIMmore » domains are encoded by the first and second exon, while the codons of the homeobox are split between the third and the fourth exons. The structure of Lhx5 is similar to that of other LIM/homeodomain proteins, Lxh1/lim1 and Lhx3/lim3, but differs from that of other LIM genes, such as mec3 and LMO1/Rbtn1, in which the codons for the LIM domains are interrupted by introns. We have mapped Lhx5 to the central region of mouse chromosome 5. 38 refs., 4 figs.« less
Cheng, Xi; Wang, Yanan; Abdullah, Muhammad; Li, Manli; Li, Dahui; Gao, Junshan
2017-01-01
Plant type III polyketide synthase (PKS) can catalyse the formation of a series of secondary metabolites with different structures and different biological functions; the enzyme plays an important role in plant growth, development and resistance to stress. At present, the PKS gene has been identified and studied in a variety of plants. Here, we identified 11 PKS genes from upland cotton (Gossypium hirsutum) and compared them with 41 PKS genes in Populus tremula, Vitis vinifera, Malus domestica and Arabidopsis thaliana. According to the phylogenetic tree, a total of 52 PKS genes can be divided into four subfamilies (I–IV). The analysis of gene structures and conserved motifs revealed that most of the PKS genes were composed of two exons and one intron and there are two characteristic conserved domains (Chal_sti_synt_N and Chal_sti_synt_C) of the PKS gene family. In our study of the five species, gene duplication was found in addition to Arabidopsis thaliana and we determined that purifying selection has been of great significance in maintaining the function of PKS gene family. From qRT-PCR analysis and a combination of the role of the accumulation of proanthocyanidins (PAs) in brown cotton fibers, we concluded that five PKS genes are candidate genes involved in brown cotton fiber pigment synthesis. These results are important for the further study of brown cotton PKS genes. It not only reveals the relationship between PKS gene family and pigment in brown cotton, but also creates conditions for improving the quality of brown cotton fiber. PMID:29104824
Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun
2016-04-01
The ubiquitin-protein ligases (E3s) directly participate in ubiquitin (Ub) transferring to the target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3s, is characterized as containing a conserved HECT domain of approximately 350 amino acids in the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT gene) in apple. The 13 genes (named as MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the encoding genes in the intron-exon structure and the included additional functional domains were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features and three types of conservative motifs were identified. The promoters of each MdUPL member carried multiple stress-response related elements by cis-acting element analysis. Experimental results demonstrated that the expressions of several MdUPLs were quite sensitive to cold-, drought-, and salt-stresses by qRT-PCR assay. The results of this study helped to elucidate the functions of HECT proteins, especially in Rosaceae plants.
Human homolog of the mouse sperm receptor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chamberlin, M.E.; Dean, J.
1990-08-01
The human zona pellucida, composed of three glycoproteins (ZP1, ZP2, and ZP3), forms an extracellular matrix that surrounds ovulated eggs and mediates species-specific fertilization. The genes that code for at least two of the zona proteins (ZP2 and ZP3) cross-hybridize with other mammalian DNA. The recently characterized mouse sperm receptor gene (Zp-3) was used to isolate its human homolog. The human homolog spans {approx}18.3 kilobase pairs (kbp) (compared to 8.6 kbp for the mouse gene) and contains eight exons, the sizes of which are strictly conserved between the two species. Four short (8-15 bp) sequences within the first 250 bpmore » of the 5{prime} flanking region in the human Zp-3 homolog are also present upstream of mouse Zp-3. These elements may modulate oocyte-specific gene expression. By using the polymerase chain reaction, a full-length cDNA of human ZP3 was isolated from human ovarian poly(A){sup +} RNA and used to deduce the structure of human ZP3 mRNA. Certain features of the human and mouse ZP3 transcripts are conserved. Both have unusually short 5{prime} and 3{prime} untranslated regions, both contain a single open reading frame that is 74% identical, and both code for 424 amino acid polypeptides that are 67% the same. The similarity between the two proteins may define domains that are important in maintaining the structural integrity of the zona pellucida, while the differences may play a role in mediating the species-specific events of mammalian fertilization.« less
NASA Astrophysics Data System (ADS)
Tognetti, Eduardo S.; Oliveira, Ricardo C. L. F.; Peres, Pedro L. D.
2015-01-01
The problem of state feedback control design for discrete-time Takagi-Sugeno (TS) (T-S) fuzzy systems is investigated in this paper. A Lyapunov function, which is quadratic in the state and presents a multi-polynomial dependence on the fuzzy weighting functions at the current and past instants of time, is proposed.This function contains, as particular cases, other previous Lyapunov functions already used in the literature, being able to provide less conservative conditions of control design for TS fuzzy systems. The structure of the proposed Lyapunov function also motivates the design of a new stabilising compensator for Takagi-Sugeno fuzzy systems. The main novelty of the proposed state feedback control law is that the gain is composed of matrices with multi-polynomial dependence on the fuzzy weighting functions at a set of past instants of time, including the current one. The conditions for the existence of a stabilising state feedback control law that minimises an upper bound to the ? or ? norms are given in terms of linear matrix inequalities. Numerical examples show that the approach can be less conservative and more efficient than other methods available in the literature.
Kümmel, D; Heinemann, U
2008-04-01
The term 'tethering factor' has been coined for a heterogeneous group of proteins that all are required for protein trafficking prior to vesicle docking and SNARE-mediated membrane fusion. Two groups of tethering factors can be distinguished, long coiled-coil proteins and multi-subunit complexes. To date, eight such protein complexes have been identified in yeast, and they are required for different trafficking steps. Homologous complexes are found in all eukaryotic organisms, but conservation seems to be less strict than for other components of the trafficking machinery. In fact, for most proposed multi-subunit tethers their ability to actually bridge two membranes remains to be shown. Here we discuss recent progress in the structural and functional characterization of tethering complexes and present the emerging view that the different complexes are quite diverse in their structure and the molecular mechanisms underlying their function. TRAPP and the exocyst are the structurally best characterized tethering complexes. Their comparison fails to reveal any similarity on a struc nottural level. Furthermore, the interactions with regulatory Rab GTPases vary, with TRAPP acting as a nucleotide exchange factor and the exocyst being an effector. Considering these differences among the tethering complexes as well as between their yeast and mammalian orthologs which is apparent from recent studies, we suggest that tethering complexes do not mediate a strictly conserved process in vesicular transport but are diverse regulators acting after vesicle budding and prior to membrane fusion.
Costimulatory receptors in jawed vertebrates: Conserved CD28, odd CTLA4 and multiple BTLAs
Bernard, D.; Hansen, J.D.; Du, Pasquier L.; Lefranc, M.-P.; Benmansour, A.; Boudinot, P.
2007-01-01
CD28 family of costimulatory receptors is comprised of molecules with a single V-type extracellular Ig domain, a transmembrane and an intracytoplasmic region with signaling motifs. CD28 and cytotoxic T lymphocyte antigen-4 (CTLA4) homologs have been recently identified in rainbow trout. Other sequences similar to mammalian CD28 family members have now been identified using teleost, Xenopus and chicken databases. CD28- and CTLA4 homologs were found in all vertebrate classes whereas inducible costimulatory signal (ICOS) was restricted to tetrapods, and programmed cell death-1 (PD1) was limited to mammals and chicken. Multiple B and T Lymphocyte Attenuator (BTLA) sequences were found in teleosts, but not in Xenopus or in avian genomes. The intron/exon structure of btlas was different from that of cd28 and other members of the family. The Ig domain encoded in all the btla genes has features of the C-type structure, which suggests that BTLA does not belong to the CD28 family. The genomic localization of these genes in vertebrate genomes supports the split between the BTLA and CD28 families. ?? 2006 Elsevier Ltd. All rights reserved.
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-01-01
The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts. PMID:26907269
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
2016-02-23
The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
Evolutionary diversification of type-2 HDAC structure, function and regulation in Nicotiana tabacum.
Nicolas-Francès, Valérie; Grandperret, Vincent; Liegard, Benjamin; Jeandroz, Sylvain; Vasselon, Damien; Aimé, Sébastien; Klinguer, Agnès; Lamotte, Olivier; Julio, Emilie; de Borne, François Dorlhac; Wendehenne, David; Bourque, Stéphane
2018-04-01
Type-2 HDACs (HD2s) are plant-specific histone deacetylases that play diverse roles during development and in responses to biotic and abiotic stresses. In this study we characterized the six tobacco genes encoding HD2s that mainly differ by the presence or the absence of a typical zinc finger in their C-terminal part. Of particular interest, these HD2 genes exhibit a highly conserved intron/exon structure. We then further investigated the phylogenetic relationships among the HD2 gene family, and proposed a model of the genetic events that led to the organization of the HD2 family in Solanaceae. Absolute quantification of HD2 mRNAs in N. tabacum and in its precursors, N. tomentosiformis and N. sylvestris, did not reveal any pseudogenization of any of the HD2 genes, but rather specific regulation of HD2 expression in these three species. Functional complementation approaches in Arabidopsis thaliana demonstrated that the four zinc finger-containing HD2 proteins exhibit the same biological function in response to salt stress, whereas the two HD2 proteins without zinc finger have different biological function. Copyright © 2018 Elsevier B.V. All rights reserved.
Long-term 12 year follow-up of X-linked congenital retinoschisis
Kjellström, Sten; Vijayasarathy, Camasamudram; Ponjavic, Vesna; Sieving, Paul A.; Andréasson, Sten
2010-01-01
Purpose To investigate the retinal structure and function during the progression of X-linked retinoschisis (XLRS) from childhood to adulthood. Methods Ten patients clinically diagnosed with XLRS were investigated at 6–15 years of age (mean age 9 years) with a follow-up 8 to 14 years later (mean 12 years). The patients underwent regular ophthalmic examination as well as testing of best corrected visual acuity (BCVA), visual field (VF) and assessment of full-field electroretinography (ERG) during their first visit. During the follow-up, the same clinical protocols were repeated. In addition, macular structure and function was examined with multifocal electroretinography (mfERG) and optical coherence tomography (OCT). The patients were 18–25 years of age (mean age 21 years) at the follow-up examination. All exons and exon-intron boundaries of RS1-gene were sequenced for gene mutations in 9 out of the 10 patients. Results Best corrected VA and VF were stable during this follow-up period. No significant progression in cone or rod function could be measured by full-field ERG. Multifocal electroretinography and OCT demonstrated a wide heterogeneity of macular changes in retinal structure and function at the time of follow-up visit. Three different mutations were detected in these nine patients, including a known nonsense mutation in exon 3, a novel insertion in exon 5 and an intronic mutation at 5' splice site of intron 3. Conclusions Clinical follow-up (mean 12 years) of ten young XLRS patients (mean age of 9 years) with a typical congenital retinoschisis phenotype revealed no significant decline in retinal function during this time period. MfERG and OCT demonstrated a wide variety of macular changes including structure and dysfunction. The XLRS disease was relatively stable during this period of observation and would afford opportunity for therapy studies to judge benefit against baseline and against the fellow eye. PMID:20569020
Ren, H; Stiles, G L
1994-01-01
The human A1 adenosine receptor gene contains six exons with exons 1, 2, 3, 4, and part of 5 representing 5' untranslated regions. Reverse transcription-PCR with exon-specific primers showed two distinct transcripts containing either exons 3, 5, and 6 or exons 4, 5, and 6, with exons 3 and 4 being mutually exclusive. No mature mRNAs containing exons 1 and 2 have been detected. All human tissues that express any A1 receptors contain mRNA with exons 4, 5, and 6. Tissues which express high levels of A1 receptors contain mRNA with exons 3, 5, and 6. Exon 4 contains two upstream ATG codons whereas exon 3 contains none. COS cells transfected with expression vectors containing exon 4 (exons 1-6, 3-6, or Ex4-6) express much lower levels of A1 receptors than vectors without exon 4 (exons 3, 5, and 6). Mutation of upstream ATG codons in exon 4 leads to 3- to 7-fold increased A1 receptor expression, up to the level seen with the construct containing exons 3, 5, and 6. Thus, in human tissues "basal" levels of A1 receptors can be expressed by use of mRNA containing exons 4, 5, and 6, but when high levels are needed, alternative transcripts with exons 3, 5, and 6 are produced. Images PMID:8197148
Böhm, Johann; Vasli, Nasim; Maurer, Marie; Cowling, Belinda; Shelton, G. Diane; Kress, Wolfram; Toussaint, Anne; Prokic, Ivana; Schara, Ulrike; Anderson, Thomas James; Weis, Joachim; Tiret, Laurent; Laporte, Jocelyn
2013-01-01
Amphiphysin 2, encoded by BIN1, is a key factor for membrane sensing and remodelling in different cell types. Homozygous BIN1 mutations in ubiquitously expressed exons are associated with autosomal recessive centronuclear myopathy (CNM), a mildly progressive muscle disorder typically showing abnormal nuclear centralization on biopsies. In addition, misregulation of BIN1 splicing partially accounts for the muscle defects in myotonic dystrophy (DM). However, the muscle-specific function of amphiphysin 2 and its pathogenicity in both muscle disorders are not well understood. In this study we identified and characterized the first mutation affecting the splicing of the muscle-specific BIN1 exon 11 in a consanguineous family with rapidly progressive and ultimately fatal centronuclear myopathy. In parallel, we discovered a mutation in the same BIN1 exon 11 acceptor splice site as the genetic cause of the canine Inherited Myopathy of Great Danes (IMGD). Analysis of RNA from patient muscle demonstrated complete skipping of exon 11 and BIN1 constructs without exon 11 were unable to promote membrane tubulation in differentiated myotubes. Comparative immunofluorescence and ultrastructural analyses of patient and canine biopsies revealed common structural defects, emphasizing the importance of amphiphysin 2 in membrane remodelling and maintenance of the skeletal muscle triad. Our data demonstrate that the alteration of the muscle-specific function of amphiphysin 2 is a common pathomechanism for centronuclear myopathy, myotonic dystrophy, and IMGD. The IMGD dog is the first faithful model for human BIN1-related CNM and represents a mammalian model available for preclinical trials of potential therapies. PMID:23754947
Barriers to Uptake of Conservation Agriculture in southern Africa: Multi-level Analyses from Malawi
NASA Astrophysics Data System (ADS)
Dougill, Andrew; Stringer, Lindsay; Whitfield, Stephen; Wood, Ben; Chinseu, Edna
2015-04-01
Conservation agriculture is a key set of actions within the growing body of climate-smart agriculture activities being advocated and rolled out across much of the developing world. Conservation agriculture has purported benefits for environmental quality, food security and the sustained delivery of ecosystem services. In this paper, new multi-level analyses are presented, assessing the current barriers to adoption of conservation agriculture practices in Malawi. Despite significant donor initiatives that have targeted conservation agriculture projects, uptake rates remain low. This paper synthesises studies from across 3 levels in Malawi: i.) national level- drawing on policy analysis, interviews and a multi-stakeholder workshop; ii.) district level - via assessments of development plans and District Office and extension service support, and; iii) local level - through data gained during community / household level studies in Dedza District that have gained significant donor support for conservation agriculture as a component of climate smart agriculture initiatives. The national level multi-stakeholder Conservation Agriculture workshop identified three areas requiring collaborative research and outlined routes for the empowerment of the National Conservation Agriculture Task Force to advance uptake of conservation agriculture and deliver associated benefits in terms of agricultural development, climate adaptation and mitigation. District level analyses highlight that whilst District Development Plans are now checked against climate change adaptation and mitigation criteria, capacity and knowledge limitations exist at the District level, preventing project interventions from being successfully up-scaled. Community level assessments highlight the need for increased community participation at the project-design phase and identify a pressing requirement for conservation agriculture planning processes (in particular those driven by investments in climate-smart agriculture) to better accommodate, and respond to, the differentiated needs of marginalised groups (e.g. poor, elderly, carers). We identify good practices that can be used to design, plan and implement conservation agriculture projects such that the multiple benefits can be realised. We further outline changes to multi-level policy and institutional arrangements to facilitate greater adoption of conservation agriculture in Malawi, noting the vital importance of District-level institutions and amendments and capacity building required within agricultural extension services. We highlight the need for capacity building and support to ensure conservation agriculture's multiple benefits are realised more widely as a route towards sustainable land management.
Li, Chenhong; Riethoven, Jean-Jack M; Naylor, Gavin J P
2012-09-01
Recent innovations in next-generation sequencing have lowered the cost of genome projects. Nevertheless, sequencing entire genomes for all representatives in a study remains expensive and unnecessary for most studies in ecology, evolution and conservation. It is still more cost-effective and efficient to target and sequence single-copy nuclear gene markers for such studies. Many tools have been developed for identifying nuclear markers, but most of these have focused on particular taxonomic groups. We have built a searchable database, EvolMarkers, for developing single-copy coding sequence (CDS) and exon-primed-intron-crossing (EPIC) markers that is designed to work across a broad range of phylogenetic divergences. The database is made up of single-copy CDS derived from BLAST searches of a variety of metazoan genomes. Users can search the database for different types of markers (CDS or EPIC) that are common to different sets of input species with different divergence characteristics. EvolMarkers can be applied to any taxonomic group for which genome data are available for two or more species. We included 82 genomes in the first version of EvolMarkers and have found the methods to be effective across Placozoa, Cnidaria, Arthropod, Nematoda, Annelida, Mollusca, Echinodermata, Hemichordata, Chordata and plants. We demonstrate the effectiveness of searching for CDS markers within annelids and show how to find potentially useful intronic markers within the lizard Anolis. © 2012 Blackwell Publishing Ltd.
De novo insertion of an intron into the mammalian sex determining gene, SRY
O’Neill, Rachel J. Waugh; Brennan, Francine E.; Delbridge, Margaret L.; Crozier, Ross H.; Graves, Jennifer A. Marshall
1998-01-01
Two theories have been proposed to explain the evolution of introns within eukaryotic genes. The introns early theory, or “exon theory of genes,” proposes that introns are ancient and that recombination within introns provided new exon structure, and thus new genes. The introns late theory, or “insertional theory of introns,” proposes that ancient genes existed as uninterrupted exons and that introns have been introduced during the course of evolution. There is still controversy as to how intron–exon structure evolved and whether the majority of introns are ancient or novel. Although there is extensive evidence in support of the introns early theory, phylogenetic comparisons of several genes indicate recent gain and loss of introns within these genes. However, no example has been shown of a protein coding gene, intronless in its ancestral form, which has acquired an intron in a derived form. The mammalian sex determining gene, SRY, is intronless in all mammals studied to date, as is the gene from which it recently evolved. However, we report here comparisons of genomic and cDNA sequences that now provide evidence of a de novo insertion of an intron into the SRY gene of dasyurid marsupials. This recently (approximately 45 million years ago) inserted sequence is not homologous with known transposable elements. Our data demonstrate that introns may be inserted as spliced units within a developmentally crucial gene without disrupting its function. PMID:9465071
Material optimization of multi-layered enhanced nanostructures
NASA Astrophysics Data System (ADS)
Strobbia, Pietro
The employment of surface enhanced Raman scattering (SERS)-based sensing in real-world scenarios will offer numerous advantages over current optical sensors. Examples of these advantages are the intrinsic and simultaneous detection of multiple analytes, among many others. To achieve such a goal, SERS substrates with throughput and reproducibility comparable to commonly used fluorescence sensors have to be developed. To this end, our lab has discovered a multi-layer geometry, based on alternating films of a metal and a dielectric, that amplifies the SERS signal (multi-layer enhancement). The advantage of these multi-layered structures is to amplify the SERS signal exploiting layer-to-layer interactions in the volume of the structures, rather than on its surface. This strategy permits an amplification of the signal without modifying the surface characteristics of a substrate, and therefore conserving its reproducibility. Multi-layered structures can therefore be used to amplify the sensitivity and throughput of potentially any previously developed SERS sensor. In this thesis, these multi-layered structures were optimized and applied to different SERS substrates. The role of the dielectric spacer layer in the multi-layer enhancement was elucidated by fabricating spacers with different characteristics and studying their effect on the overall enhancement. Thickness, surface coverage and physical properties of the spacer were studied. Additionally, the multi-layered structures were applied to commercial SERS substrates and to isolated SERS probes. Studies on the dependence of the multi-layer enhancement on the thickness of the spacer demonstrated that the enhancement increases as a function of surface coverage at sub-monolayer thicknesses, due to the increasing multi-layer nature of the substrates. For fully coalescent spacers the enhancement decreases as a function of thickness, due to the loss of interaction between proximal metallic films. The influence of the physical properties of the spacer on the multi-layer enhancement were also studied. The trends in Schottky barrier height, interfacial potential and dielectric constant were isolated by using different materials as spacers (i.e., TiO2, HfO2, Ag 2O and Al2O3). The results show that the bulk dielectric constant of the material can be used to predict the relative magnitude of the multi-layer enhancement, with low dielectric constant materials performing more efficiently as spacers. Optimal spacer layers were found to be ultrathin coalescent films (ideally a monolayer) of low dielectric constant materials. Finally, multi-layered structures were observed to be employable to amplify SERS in drastically different substrate geometries. The multi-layered structures were applied to disposable commercial SERS substrates (i.e., Klarite). This project involved the regeneration of the used substrates, by stripping and redepositing the gold coating layer, and their amplification, by using the multi-layer geometry. The latter was observed to amplify the sensitivity of the substrates. Additionally, the multi-layered structures were applied to probes dispersed in solution. Such probes were observed to yield stronger SERS signal when optically trapped and to reduce the background signal. The application of the multi-layered structures on trapped probes, not only further amplified the SERS signal, but also increased the maximum number of applicable layers for the structures.
Splicing factors PTBP1 and PTBP2 promote proliferation and migration of glioma cell lines
Cheung, Hannah C.; Hai, Tao; Zhu, Wen; Baggerly, Keith A.; Tsavachidis, Spiridon; Krahe, Ralf
2009-01-01
Polypyrimidine tract-binding protein 1 (PTBP1) is a multi-functional RNA-binding protein that is aberrantly overexpressed in glioma. PTBP1 and its brain-specific homologue polypyrimidine tract-binding protein 2 (PTBP2) regulate neural precursor cell differentiation. However, the overlapping and non-overlapping target transcripts involved in this process are still unclear. To determine why PTBP1 and not PTBP2 would promote glial cell-derived tumours, both PTBP1 and PTBP2 were knocked down in the human glioma cell lines U251 and LN229 to determine the role of these proteins in cell proliferation, migration, and adhesion. Surprisingly, removal of both PTBP1 and PTBP2 slowed cell proliferation, with the double knockdown having no additive effects. Decreased expression of both proteins individually and in combination inhibited cell migration and increased adhesion of cells to fibronectin and vitronectin. A global survey of differential exon expression was performed following PTBP1 knockdown in U251 cells using the Affymetrix Exon Array to identify PTBP1-specific splicing targets that enhance gliomagenesis. In the PTBP1 knockdown, previously determined targets were unaltered in their splicing patterns. A single gene, RTN4 (Nogo) had significantly enhanced inclusion of exon 3 when PTBP1 was removed. Overexpression of the splice isoform containing exon 3 decreased cell proliferation to a similar degree as the removal of PTBP1. These results provide the first evidence that RNA-binding proteins affect the invasive and rapid growth characteristics of glioma cell lines. Its actions on proliferation appear to be mediated, in part, through alternative splicing of RTN4. PMID:19506066
Phair, Glenn; Agus, Ashley; Normand, Charles; Brazil, Kevin; Burns, Aine; Roderick, Paul; Maxwell, Alexander P; Thompson, Colin; Yaqoob, Magdi; Noble, Helen
2018-05-01
Previous research has explored the cost of providing renal replacement therapies in patients with end-stage kidney disease and their quality of life. This is the first study to examine the healthcare costs of patients receiving conservative care without dialysis for end-stage kidney disease. This alternative to dialysis is an option for patients who prefer a supportive and palliative care approach. Descriptive cost and quality of life analyses alongside a UK-based multi-centre observational study in patients receiving conservative management for end-stage kidney disease. Health service use was recorded up to 12 months after making the decision to receive conservative management. Mean costs were calculated for each 3-month time period. The annual cost was calculated in two ways: by using only patients with complete cost data and by using all available data weighted by the number of patients at each time point. In total, 42 patients who opted for conservative management over dialysis were recruited. Mean costs were £1622 (0-3 months), £1008 (3-6 months), £554 (6-9 months) and £2626 (9-12 months). Mean annual cost based on complete data ( n = 8) was £5511, and the weighted mean annual cost was £5620. The importance of this study is twofold. First, it provides substantive new information for health and social care planning of conservative management by demonstrating where demand exists for services, in both the United Kingdom and other countries with a comparable health service structure. Second, methodologically, it indicates that it is feasible to collect service use data directly from this patient population.
Splicing of designer exons informs a biophysical model for exon definition
Arias, Mauricio A.; Chasin, Lawrence A.
2015-01-01
Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition has been proposed. We studied this exon definition using designer exons (DEs) made up of three prototype modules of our own design: an exonic splicing enhancer (ESE), an exonic splicing silencer (ESS), and a Reference Sequence (R) predicted to be neither. Each DE was examined as the central exon in a three-exon minigene. DEs made of R modules showed a sharp size dependence, with exons shorter than 14 nt and longer than 174 nt splicing poorly. Changing the strengths of the splice sites improved longer exon splicing but worsened shorter exon splicing, effectively displacing the curve to the right. For the ESE we found, unexpectedly, that its enhancement efficiency was independent of its position within the exon. For the ESS we found a step-wise positional increase in its effects; it was most effective at the 3′ end of the exon. To apply these results quantitatively, we developed a biophysical model for exon definition of internal exons undergoing cotranscriptional splicing. This model features commitment to inclusion before the downstream exon is synthesized and competition between skipping and inclusion fates afterward. Collision of both exon ends to form an exon definition complex was incorporated to account for the effect of size; ESE/ESS effects were modeled on the basis of stabilization/destabilization. This model accurately predicted the outcome of independent experiments on more complex DEs that combined ESEs and ESSs. PMID:25492963
Castaings, Loren; Bergonzi, Sara; Albani, Maria C; Kemi, Ulla; Savolainen, Outi; Coupland, George
2014-07-17
Antisense RNA (asRNA) COOLAIR is expressed at A. thaliana FLOWERING LOCUS C (FLC) in response to winter temperatures. Its contribution to cold-induced silencing of FLC was proposed but its functional and evolutionary significance remain unclear. Here we identify a highly conserved block containing the COOLAIR first exon and core promoter at the 3' end of several FLC orthologues. Furthermore, asRNAs related to COOLAIR are expressed at FLC loci in the perennials A. alpina and A. lyrata, although some splicing variants differ from A. thaliana. Study of the A. alpina orthologue, PERPETUAL FLOWERING 1 (PEP1), demonstrates that AaCOOLAIR is induced each winter of the perennial life cycle. Introduction of PEP1 into A. thaliana reveals that AaCOOLAIR cis-elements confer cold-inducibility in this heterologous species while the difference between PEP1 and FLC mRNA patterns depends on both cis-elements and species-specific trans-acting factors. Thus, expression of COOLAIR is highly conserved, supporting its importance in FLC regulation.
Leterrier, Marina; Holappa, Lynn D; Broglie, Karen E; Beckles, Diane M
2008-01-01
Background Starch is of great importance to humans as a food and biomaterial, and the amount and structure of starch made in plants is determined in part by starch synthase (SS) activity. Five SS isoforms, SSI, II, III, IV and Granule Bound SSI, have been identified, each with a unique catalytic role in starch synthesis. The basic mode of action of SSs is known; however our knowledge of several aspects of SS enzymology at the structural and mechanistic level is incomplete. To gain a better understanding of the differences in SS sequences that underscore their specificity, the previously uncharacterised SSIVb from wheat was cloned and extensive bioinformatics analyses of this and other SSs sequences were done. Results The wheat SSIV cDNA is most similar to rice SSIVb with which it shows synteny and shares a similar exon-intron arrangement. The wheat SSIVb gene was preferentially expressed in leaf and was not regulated by a circadian clock. Phylogenetic analysis showed that in plants, SSIV is closely related to SSIII, while SSI, SSII and Granule Bound SSI clustered together and distinctions between the two groups can be made at the genetic level and included chromosomal location and intron conservation. Further, identified differences at the amino acid level in their glycosyltransferase domains, predicted secondary structures, global conformations and conserved residues might be indicative of intragroup functional associations. Conclusion Based on bioinformatics analysis of the catalytic region of 36 SSs and 3 glycogen synthases (GSs), it is suggested that the valine residue in the highly conserved K-X-G-G-L motif in SSIII and SSIV may be a determining feature of primer specificity of these SSs as compared to GBSSI, SSI and SSII. In GBSSI, the Ile485 residue may partially explain that enzyme's unique catalytic features. The flexible 380s Loop in the starch catalytic domain may be important in defining the specificity of action for each different SS and the G-X-G in motif VI could define SSIV and SSIII action particularly. PMID:18826586
Genomic organization of the neurofibromatosis 1 gene (NF1)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Y.; O`Connell, P.; Huntsman Breidenbach, H.
Neurofibromatosis 1 maps to chromosome band 17q11.2, and the NF1 locus has been partially characterized. Even though the full-length NF1 cDNA has been sequenced, the complete genomic structure of the NF1 gene has not been elucidated. The 5{prime} end of NF1 is embedded in a CpG island containing a NotI restriction site, and the remainder of the gene lies in the adjacent 350-kb NotI fragment. In our efforts to develop a comprehensive screen for NF1 mutations, we have isolated genomic DNA clones that together harbor the entire NF1 cDNA sequence. We have identified all intron-exon boundaries of the coding regionmore » and established that it is composed of 59 exons. Furthermore, we have defined the 3{prime}-untranslated region (3{prime}-UTR) of the NF1 gene; it spans approximately 3.5 kb of genomic DNA sequence and is continuous with the stop codon. Oligonucleotide primer pairs synthesized from exon-flanking DNA sequences were used in the polymerase chain reaction with cloned, chromosome 17-specific genomic DNA as template to amplify NF1 exons 1 through 27b and the exon containing the 3{prime}-UTR separately. This information should be useful for implementing a comprehensive NF1 mutation screen using genomic DNA as template. 41 refs., 3 figs., 2 tabs.« less
Hu, Liyan; Pandey, Amit V; Eggimann, Sandra; Rüfenacht, Véronique; Möslinger, Dorothea; Nuoffer, Jean-Marc; Häberle, Johannes
2013-11-29
Argininosuccinic aciduria (ASA) is an autosomal recessive urea cycle disorder caused by deficiency of argininosuccinate lyase (ASL) with a wide clinical spectrum from asymptomatic to severe hyperammonemic neonatal onset life-threatening courses. We investigated the role of ASL transcript variants in the clinical and biochemical variability of ASA. Recombinant proteins for ASL wild type, mutant p.E189G, and the frequently occurring transcript variants with exon 2 or 7 deletions were (co-)expressed in human embryonic kidney 293T cells. We found that exon 2-deleted ASL forms a stable truncated protein with no relevant activity but a dose-dependent dominant negative effect on enzymatic activity after co-expression with wild type or mutant ASL, whereas exon 7-deleted ASL is unstable but seems to have, nevertheless, a dominant negative effect on mutant ASL. These findings were supported by structural modeling predictions for ASL heterotetramer/homotetramer formation. Illustrating the physiological relevance, the predominant occurrence of exon 7-deleted ASL was found in two patients who were both heterozygous for the ASL mutant p.E189G. Our results suggest that ASL transcripts can contribute to the highly variable phenotype in ASA patients if expressed at high levels. Especially, the exon 2-deleted ASL variant may form a heterotetramer with wild type or mutant ASL, causing markedly reduced ASL activity.
[Observation on gene polymorphism of Rh blood group in Chinese Han nationality].
Lan, Jiong-Cai; Wang, Cong-Rong; Wei, Ya-Ming; Zhou, Hua-You; Cao, Qiong; Zhang, Yin-Ze; Jiang, KuReXi; Wu, Da-Lin; Liu, Zhong
2003-12-01
To observe the gene polymorphism of Rh blood group in unrelated random individuals and families for Chinese Han nationality, polymerase chain reaction-sequence specific primer (PCR-SSP) was used to amplify the Rh C/E gene, RhD gene, exons, intron 2 and 10, insert and Rh Box in 160 blood samples of RhD positive unrelated individuals and 71 samples of RhD negative unrelated individuals and 7 samples of families whose probands were RhD-negative. The results showed that RhD genes of RhD-negative individuals with C antigens were polymorphism, three forms were found for D exon including intact, partial deletion and complete deletion exons. Insert fragments and Rh Box were found in most cases of families whose probands were RhD-negative and its inheritance accorded with the Mendel's Law, and it did not affect the expression of RhD gene. "Normal" RhD exon 4 amplifying product was not found in all of the samples. It was concluded that gene structure of the RhD-negative in Chinese was polymorphism, intact, partial deletion and complete deletion exons were found in the individuals with C antigen and probably existed specific D (nf) Ce haplotype. The function of insert was uncertain. The Rh gene sequences of Chinese Han nationality are different from those of Caucasian and the Rh gene library based on Han nationality should be established.
Therapeutic NOTCH3 cysteine correction in CADASIL using exon skipping: in vitro proof of concept.
Rutten, Julie W; Dauwerse, Hans G; Peters, Dorien J M; Goldfarb, Andrew; Venselaar, Hanka; Haffner, Christof; van Ommen, Gert-Jan B; Aartsma-Rus, Annemieke M; Lesnik Oberstein, Saskia A J
2016-04-01
Cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy, or CADASIL, is a hereditary cerebral small vessel disease caused by characteristic cysteine altering missense mutations in the NOTCH3 gene. NOTCH3 mutations in CADASIL result in an uneven number of cysteine residues in one of the 34 epidermal growth factor like-repeat (EGFr) domains of the NOTCH3 protein. The consequence of an unpaired cysteine residue in an EGFr domain is an increased multimerization tendency of mutant NOTCH3, leading to toxic accumulation of the protein in the (cerebro)vasculature, and ultimately reduced cerebral blood flow, recurrent stroke and vascular dementia. There is no therapy to delay or alleviate symptoms in CADASIL. We hypothesized that exclusion of the mutant EGFr domain from NOTCH3 would abolish the detrimental effect of the unpaired cysteine and thus prevent toxic NOTCH3 accumulation and the negative cascade of events leading to CADASIL. To accomplish this NOTCH3 cysteine correction by EGFr domain exclusion, we used pre-mRNA antisense-mediated skipping of specific NOTCH3 exons. Selection of these exons was achieved using in silico studies and based on the criterion that skipping of a particular exon or exon pair would modulate the protein in such a way that the mutant EGFr domain is eliminated, without otherwise corrupting NOTCH3 structure and function. Remarkably, we found that this strategy closely mimics evolutionary events, where the elimination and fusion of NOTCH EGFr domains led to the generation of four functional NOTCH homologues. We modelled a selection of exon skip strategies using cDNA constructs and show that the skip proteins retain normal protein processing, can bind ligand and be activated by ligand. We then determined the technical feasibility of targeted NOTCH3 exon skipping, by designing antisense oligonucleotides targeting exons 2-3, 4-5 and 6, which together harbour the majority of distinct CADASIL-causing mutations. Transfection of these antisense oligonucleotides into CADASIL patient-derived cerebral vascular smooth muscle cells resulted in successful exon skipping, without abrogating NOTCH3 signalling. Combined, these data provide proof of concept for this novel application of exon skipping, and are a first step towards the development of a rational therapeutic approach applicable to up to 94% of CADASIL-causing mutations. © The Author (2016). Published by Oxford University Press on behalf of the Guarantors of Brain. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Li, Chibo; Ding, Xi-Qin; O’Brien, John; Al-Ubaidi, Muayyad R.
2010-01-01
PURPOSE A great deal of information about functionally significant domains of a protein may be obtained by comparison of primary sequences of gene homologues over a broad phylogenetic base. This study was designed to identify evolutionarily conserved domains of the photoreceptor disc membrane protein peripherin/rds by analysis of the homologue in a primitive vertebrate, the skate. METHODS A skate retinal cDNA library was screened using a mouse peripherin/rds clone. The 5′ and 3′ untranslated regions of the skate peripherin/rds (srds) cDNA were isolated by the rapid amplification of cDNA ends (RACE) approach. The gene structure was characterized by PCR amplification and sequencing of genomic fragments. Northern and Western blot analyses were used to identify srds transcript and protein, respectively. RESULTS A new homologue of peripherin/rds was identified from the skate retinal cDNA library. SRDS is a glycoprotein with a predicted molecular mass of 40.2 kDa. The srds gene consists of two exons and one small intron and transcribes into a single 6-kb message. Phylogenetic analysis places SRDS at the base of peripherin/rds family and near the division of that group and the branch leading to rds-like and rom-1 genes. SRDS protein is 54.5% identical with peripherin/rds across species. Identity is significantly higher (73%) in the intradiscal domains. Sequence comparison revealed the conservation of all residues that have been shown, on mutation, to associate with retinitis pigmentosa and showed conservation of most residues associated with macular dystrophies. Comparison with ROM-1 and other rds-like proteins revealed the presence of a highly conserved domain in the large intradiscal loop. CONCLUSIONS Srds represents the skate orthologue of mammalian peripherin/rds genes. Conservation of most of the residues associated with human retinal diseases indicates that these residues serve important functional roles. The high degree of conservation of a short stretch within the large intradiscal loop also suggests an important function for this domain. PMID:12766040
Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon
2014-11-01
The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.
Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong
2009-03-01
Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
RNA structure in splicing: An evolutionary perspective.
Lin, Chien-Ling; Taggart, Allison J; Fairbrother, William G
2016-09-01
Pre-mRNA splicing is a key post-transcriptional regulation process in which introns are excised and exons are ligated together. A novel class of structured intron was recently discovered in fish. Simple expansions of complementary AC and GT dimers at opposite boundaries of an intron were found to form a bridging structure, thereby enforcing correct splice site pairing across the intron. In some fish introns, the RNA structures are strong enough to bypass the need of regulatory protein factors for splicing. Here, we discuss the prevalence and potential functions of highly structured introns. In humans, structured introns usually arise through the co-occurrence of C and G-rich repeats at intron boundaries. We explore the potentially instructive example of the HLA receptor genes. In HLA pre-mRNA, structured introns flank the exons that encode the highly polymorphic β sheet cleft, making the processing of the transcript robust to variants that disrupt splicing factor binding. While selective forces that have shaped HLA receptor are fairly atypical, numerous other highly polymorphic genes that encode receptors contain structured introns. Finally, we discuss how the elevated mutation rate associated with the simple repeats that often compose structured intron can make structured introns themselves rapidly evolving elements.
Palti, Y.; Rodriguez, M.F.; Gahr, S.A.; Purcell, M.K.; Rexroad, C. E.; Wiens, G.D.
2010-01-01
Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5??? UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Palti, Yniv; Rodriguez, M. Fernanda; Gahr, Scott A.; Purcell, Maureen K.; Rexroad, Caird E.; Wiens, Gregory D.
2010-01-01
Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5' UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Another face of the Treacher Collins syndrome (TCOF1) gene: identification of additional exons.
So, Rolando B; Gonzales, Bianca; Henning, Dale; Dixon, Jill; Dixon, Michael J; Valdez, Benigno C
2004-03-17
Treacher Collins syndrome (TCS) is characterized by an abnormality in craniofacial development during early embryogenesis. TCS is caused by mutations in the gene TCOF1, which encodes the nucleolar phosphoprotein treacle. Genetic and proteomic characterizations of TCS/treacle are based on the previously reported 26 exons of TCOF1. Here, we report the identification of 231-nucleotide (nt) exon 6A (between exons 6 and 7) and 108-nt exon 16A (between exons 16 and 17). Isoforms with exon 6A are up to 3.7-fold more abundant than alternatively spliced variants without exon 6A, but only minor isoforms contain exon 16A. Exon 6A encodes a peptide sequence containing basic and acidic domains similar to 10 other exons of TCOF1. Unlike the other exons, exon 6A encodes a nuclear localization signal (NLS) which does not, however, alter the nucleolar localization of full-length treacle. The discovery of exons 6A and 16A is relevant to mutational analysis of the TCOF1 gene in TCS patients, and to functional analysis of its gene product.
Conservation of Fold and Topology of Functional Elements in Thiamin Pyrophosphate Enzymes
NASA Technical Reports Server (NTRS)
Dominiak, P.; Ciszak, E. M.
2005-01-01
Thiamin pyrophosphate (TPP)-dependent enzymes are a highly divergent family of proteins binding both TPP and metal ions. They perform decarboxylation-hydroxyaldehydes. Prior -ketoacids and of a common - (O=)C-C(OH)- fragment of to knowledge of three-dimensional structures of these enzmes, the GDGY25-30NN sequence was used to identify these enzymes. Subsequently, a number of structural studies on those enzymes revealed multi-subunit organization and the features of the two duplicate cofactor binding sites. Analyzing the structures of 44 structurally known enzymes, we found that the common structure of these enzymes is reduced to 180-220 amino acid long fragments of two PP and two PYR domains that form the [PP:PYR]2 binding center of two cofactor molecules. The structures of PP and PYR are arranged in a similar fold-sheet with triplets of helices on both sides.Dconsisting of a six-stranded Residues surrounding the cofactors are not strictly conserved, but they provide the same interatomic contacts required for the catalytic functions that these enzymes perform while maintaining interactive structural integrity. These structural and functional amino acids are topological counterparts located in the same positions of the conserved fold of sets of PP and PYR domains. Additional parallels include short fragments of sequences that link these amino acids to the fold and function. This report on the structural commonalities amongst TPP dependent enzymes is thought to contribute new approaches to annotation that may assist in advancing the functional proteomics of TPP dependent enzymes, and trace their complexity within evolutionary context.
NASA Astrophysics Data System (ADS)
Feng, Wenqiang; Guo, Zhenlin; Lowengrub, John S.; Wise, Steven M.
2018-01-01
We present a mass-conservative full approximation storage (FAS) multigrid solver for cell-centered finite difference methods on block-structured, locally cartesian grids. The algorithm is essentially a standard adaptive FAS (AFAS) scheme, but with a simple modification that comes in the form of a mass-conservative correction to the coarse-level force. This correction is facilitated by the creation of a zombie variable, analogous to a ghost variable, but defined on the coarse grid and lying under the fine grid refinement patch. We show that a number of different types of fine-level ghost cell interpolation strategies could be used in our framework, including low-order linear interpolation. In our approach, the smoother, prolongation, and restriction operations need never be aware of the mass conservation conditions at the coarse-fine interface. To maintain global mass conservation, we need only modify the usual FAS algorithm by correcting the coarse-level force function at points adjacent to the coarse-fine interface. We demonstrate through simulations that the solver converges geometrically, at a rate that is h-independent, and we show the generality of the solver, applying it to several nonlinear, time-dependent, and multi-dimensional problems. In several tests, we show that second-order asymptotic (h → 0) convergence is observed for the discretizations, provided that (1) at least linear interpolation of the ghost variables is employed, and (2) the mass conservation corrections are applied to the coarse-level force term.
NASA Astrophysics Data System (ADS)
Clerici, Nicola; Vogt, Peter
2013-04-01
Riparian zones are of utmost importance in providing a wide range of ecological and societal services. Among these, their role in maintaining landscape connectivity through ecological corridors for animals and plants is of major interest from a conservation and management perspective. This paper describes a methodology to identify European regions as providers of structural riparian corridors, and to rank them with reference to conservation priority. Physical riparian connectors among core habitat patches are identified through a recent segmentation technique, the Morphological Spatial Pattern Analysis. A multi-scale approach is followed by considering different edge distances to identify core and peripheral habitats for a range of hypothetical species. The ranking is performed using a simple set of indices that take into account the degree of environmental pressure and the presence of land protection schemes. An example for environmental reporting is carried out using European administrative regions and major rivers to summarize indices value. The approach is based on freely available software and simple metrics which can be easily reproduced in a GIS environment.
Genomic V exons from whole genome shotgun data in reptiles.
Olivieri, D N; von Haeften, B; Sánchez-Espinel, C; Faro, J; Gambón-Deza, F
2014-08-01
Reptiles and mammals diverged over 300 million years ago, creating two parallel evolutionary lineages amongst terrestrial vertebrates. In reptiles, two main evolutionary lines emerged: one gave rise to Squamata, while the other gave rise to Testudines, Crocodylia, and Aves. In this study, we determined the genomic variable (V) exons from whole genome shotgun sequencing (WGS) data in reptiles corresponding to the three main immunoglobulin (IG) loci and the four main T cell receptor (TR) loci. We show that Squamata lack the TRG and TRD genes, and snakes lack the IGKV genes. In representative species of Testudines and Crocodylia, the seven major IG and TR loci are maintained. As in mammals, genes of the IG loci can be grouped into well-defined IMGT clans through a multi-species phylogenetic analysis. We show that the reptilian IGHV and IGLV genes are distributed amongst the established mammalian clans, while their IGKV genes are found within a single clan, nearly exclusive from the mammalian sequences. The reptilian and mammalian TRAV genes cluster into six common evolutionary clades (since IMGT clans have not been defined for TR). In contrast, the reptilian TRBV genes cluster into three clades, which have few mammalian members. In this locus, the V exon sequences from mammals appear to have undergone different evolutionary diversification processes that occurred outside these shared reptilian clans. These sequences can be obtained in a freely available public repository (http://vgenerepertoire.org).
From conservative to reactive transport under diffusion-controlled conditions
NASA Astrophysics Data System (ADS)
Babey, Tristan; de Dreuzy, Jean-Raynald; Ginn, Timothy R.
2016-05-01
We assess the possibility to use conservative transport information, such as that contained in transit time distributions, breakthrough curves and tracer tests, to predict nonlinear fluid-rock interactions in fracture/matrix or mobile/immobile conditions. Reference simulated data are given by conservative and reactive transport simulations in several diffusive porosity structures differing by their topological organization. Reactions includes nonlinear kinetically controlled dissolution and desorption. Effective Multi-Rate Mass Transfer models (MRMT) are calibrated solely on conservative transport information without pore topology information and provide concentration distributions on which effective reaction rates are estimated. Reference simulated reaction rates and effective reaction rates evaluated by MRMT are compared, as well as characteristic desorption and dissolution times. Although not exactly equal, these indicators remain very close whatever the porous structure, differing at most by 0.6% and 10% for desorption and dissolution. At early times, this close agreement arises from the fine characterization of the diffusive porosity close to the mobile zone that controls fast mobile-diffusive exchanges. At intermediate to late times, concentration gradients are strongly reduced by diffusion, and reactivity can be captured by a very limited number of rates. We conclude that effective models calibrated solely on conservative transport information like MRMT can accurately estimate monocomponent kinetically controlled nonlinear fluid-rock interactions. Their relevance might extend to more advanced biogeochemical reactions because of the good characterization of conservative concentration distributions, even by parsimonious models (e.g., MRMT with 3-5 rates). We propose a methodology to estimate reactive transport from conservative transport in mobile-immobile conditions.
Discovery of Novel Isoforms of Huntingtin Reveals a New Hominid-Specific Exon
Popowski, Melissa; Haremaki, Tomomi; Croft, Gist F.; Deglincerti, Alessia; Brivanlou, Ali H.
2015-01-01
Huntington’s disease (HD) is a devastating neurological disorder that is caused by an expansion of the poly-Q tract in exon 1 of the Huntingtin gene (HTT). HTT is an evolutionarily conserved and ubiquitously expressed protein that has been linked to a variety of functions including transcriptional regulation, mitochondrial function, and vesicle transport. This large protein has numerous caspase and calpain cleavage sites and can be decorated with several post-translational modifications such as phosphorylations, acetylations, sumoylations, and palmitoylations. However, the exact function of HTT and the role played by its modifications in the cell are still not well understood. Scrutiny of HTT function has been focused on a single, full length mRNA. In this study, we report the discovery of 5 novel HTT mRNA splice isoforms that are expressed in normal and HTT-expanded human embryonic stem cell (hESC) lines as well as in cortical neurons differentiated from hESCs. Interestingly, none of the novel isoforms generates a truncated protein. Instead, 4 of the 5 new isoforms specifically eliminate domains and modifications to generate smaller HTT proteins. The fifth novel isoform incorporates a previously unreported additional exon, dubbed 41b, which is hominid-specific and introduces a potential phosphorylation site in the protein. The discovery of this hominid-specific isoform may shed light on human-specific pathogenic mechanisms of HTT, which could not be investigated with current mouse models of the disease. PMID:26010866
Congenital analbuminemia caused by a novel aberrant splicing in the albumin gene.
Caridi, Gianluca; Dagnino, Monica; Erdeve, Omer; Di Duca, Marco; Yildiz, Duran; Alan, Serdar; Atasay, Begum; Arsan, Saadet; Campagnoli, Monica; Galliano, Monica; Minchiotti, Lorenzo
2014-01-01
Congenital analbuminemia is a rare autosomal recessive disorder manifested by the presence of a very low amount of circulating serum albumin. It is an allelic heterogeneous defect, caused by variety of mutations within the albumin gene in homozygous or compound heterozygous state. Herein we report the clinical and molecular characterization of a new case of congenital analbuminemia diagnosed in a female newborn of consanguineous (first degree cousins) parents from Ankara, Turkey, who presented with a low albumin concentration (< 8 g/L) and severe clinical symptoms. The albumin gene of the index case was screened by single-strand conformation polymorphism, heteroduplex analysis, and direct DNA sequencing. The effect of the splicing mutation was evaluated by examining the cDNA obtained by reverse transcriptase - polymerase chain reaction (RT-PCR) from the albumin mRNA extracted from proband's leukocytes. DNA sequencing revealed that the proband is homozygous, and both parents are heterozygous, for a novel G>A transition at position c.1652+1, the first base of intron 12, which inactivates the strongly conserved GT dinucleotide at the 5' splice site consensus sequence of this intron. The splicing defect results in the complete skipping of the preceding exon (exon 12) and in a frame-shift within exon 13 with a premature stop codon after the translation of three mutant amino acid residues. Our results confirm the clinical diagnosis of congenital analbuminemia in the proband and the inheritance of the trait and contribute to shed light on the molecular genetics of analbuminemia.
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, J.K.; Shaw, M.A.; Barton, C.H.
1994-11-15
Recent interest has focused on the region of conserved synteny between mouse chromosome 1 and human 2q33-q37, particularly over the region encoding the murine macrophage resistance gene Ity/Lsh/Bcg (candidate Nramp) and members of the Il8r interleukin-8 (IL8) receptor gene cluster. In this paper, identification of a restriction fragment length polymorphism in the Il8RB gene in 35 pedigrees previously typed for markers in the 2q33-37 interval provided evidence (lod scores > 3) for linkage between Il8RB and the 2q34-135 markers FN1, TNP1, VIL1, and DES. Physical mapping, using yeast artificial chromosomes isolated with VIL1, confirmed that IL8RA, IL8RB and the IL8RBmore » pseudogene map within the NRAMP-VIL1 interval, with the physical distance (155 kb) from 5{prime} LSH to 3{prime} VIL1 representing {approx}3-fold that observed in the mouse. Partial sequencing of NRAMP confirmed the presence of the N-terminal proline/serine-rich putative SH3 binding domain in exon 2 of the human gene. Further analysis of Brazilian leprosy and visceral leishmaniasis pedigrees identified a rare second allele varying in a 9-nucleotide repeat motif of the exon 2 sequence but segregating independently of the disease phenotype. 38 refs., 4 figs., 3 tabs.« less
NASA Astrophysics Data System (ADS)
Hamid, Nur Athirah Abd; Ismail, Ismanizan
2013-11-01
Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Xu; Zhou, Jing; Reeders, S.T.
1996-05-01
Basement membrane (type IV) collagen, a subfamily of the collagen protein family, is encoded by six distinct genes in mammals. Three of those, COL4A3, COL4A4, and COL4A5, are linked with Alport syndrome (hereditary nephritis). Patients with leimoyomatosis associated with Alport syndrome have been shown to have deletions in the 5{prime} end of the COL4A6 gene, in addition to having deletions in COL4A6. The human COL4A6 gene is reported to be 425 kb as determined by mapping of overlapping YAC clones by probes for its 5{prime} and 3{prime} ends. In the present study we describe the complete exon/intron size pattern ofmore » the human COL4A6 gene. The 12 {lambda} phage clones characterized in the study spanned a total of 110 kb, including 85 kb of the actual gene and 25 kb of flanking sequences. The overlapping clones contained all 46 exons of the gene and all introns, except for intron 2. Since the total size of the exons and all introns except for intron 2 is about 85 kb, intron 2 must be about 340 kb. All exons of the gene were assigned to EcoRI restriction fragments to facilitate analysis of the gene in patients with leiomyomatosis associated with Alport syndrome. The exon size pattern of COL4A6 is highly homologous with that of the human and mouse COL4A2 genes, with 27 of the 46 exons of COL4A6 being identical in size between the genes. 42 refs., 2 figs., 3 tabs.« less
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5’- and 3’-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species. PMID:25923814
Pauciullo, Alfredo; Erhardt, Georg
2015-01-01
In the present paper, we report for the first time the characterization of llama (Lama glama) caseins at transcriptomic and genetic level. A total of 288 casein clones transcripts were analysed from two lactating llamas. The most represented mRNA populations were those correctly assembled (85.07%) and they encoded for mature proteins of 215, 217, 187 and 162 amino acids respectively for the CSN1S1, CSN2, CSN1S2 and CSN3 genes. The exonic subdivision evidenced a structure made of 21, 9, 17 and 6 exons for the αs1-, β-, αs2- and κ-casein genes respectively. Exon skipping and duplication events were evidenced. Two variants A and B were identified in the αs1-casein gene as result of the alternative out-splicing of the exon 18. An additional exon coding for a novel esapeptide was found to be cryptic in the κ-casein gene, whereas one extra exon was found in the αs2-casein gene by the comparison with the Camelus dromedaries sequence. A total of 28 putative phosphorylated motifs highlighted a complex heterogeneity and a potential variable degree of post-translational modifications. Ninety-six polymorphic sites were found through the comparison of the lama casein cDNAs with the homologous camel sequences, whereas the first description and characterization of the 5'- and 3'-regulatory regions allowed to identify the main putative consensus sequences involved in the casein genes expression, thus opening the way to new investigations -so far- never achieved in this species.
Tan, Wei; Dean, Michael; Law, Amanda J.
2010-01-01
ErbB4 is a growth factor receptor tyrosine kinase essential for neurodevelopment. Genetic variation in ErbB4 is associated with schizophrenia and risk-associated polymorphisms predict overexpression of ErbB4 CYT-1 isoforms in the brain in the disorder. The molecular mechanism of association is unclear because the polymorphisms flank exon 3 of the gene and reside 700 kb distal to the CYT-1 defining exon. We hypothesized that the polymorphisms are indirectly associated with ErbB4 CYT-1 via splicing of exon 3 on the CYT-1 background. We report via cloning and sequencing of adult and fetal human brain cDNA libraries the identification of novel splice isoforms of ErbB4, whereby exon 3 is skipped (del.3). ErbB4 del.3 transcripts exist as CYT-2 isoforms and are predicted to produce truncated proteins. Furthermore, our data refine the structure of the human ErbB4 gene, clarify that juxtamembrane (JM) splice variants of ErbB4, JM-a and JM-b respectively, are characterized by the replacement of a 75 nucleotide (nt) sequence with a 45-nt insertion, and demonstrate that there are four alternative exons in the gene. Our analyses reveal that novel splice variants of ErbB4 exist in the developing and adult human brain and, given the failure to identify ErbB4 del.3 CYT-1 transcripts, suggest that the association of risk polymorphisms in the ErbB4 gene with CYT-1 transcript levels is not mediated via an exon 3 splicing event. PMID:20886074
Shirak, A; Golik, M; Lee, B-Y; Howe, A E; Kocher, T D; Hulata, G; Ron, M; Seroussi, E
2008-11-01
Lipocalins are involved in the binding of small molecules like sex steroids. We show here that the previously reported tilapia male-specific protein (MSP) is a lipocalin encoded by a variety of paralogous and homologous genes in different tilapia species. Exon-intron boundaries of MSP genes were typical of the six-exon genomic structure of lipocalins, and the transcripts were capable of encoding 200 amino-acid polypeptides that consisted of a putative signal peptide and a lipocalin domain. Cysteine residues are conserved in positions analogous to those forming the three disulfide bonds characteristic of the ligand pocket. The calculated molecular mass of the secreted MSP (20.4 kDa) was less than half of that observed, suggesting that it is highly glycosylated like its homologue tributyltin-binding protein. Analysis of sequence variations revealed three types of paralogs MSPA, MSPB and MSPC. Expression of both MSPA and MSPB was detected in testis. In haploid Oreochromis niloticus embryos, each of these types consisted of two closely related paralogs, and asymmetry between MSP copy numbers on the maternal (six copies) and the paternal (three copies) chromosomes was observed. Using this polymorphism we mapped MSPA and MSPC to linkage group 12 of an F(2) mapping family derived from a cross between O. niloticus and Oreochromis aureus. Females with high MSP copy number were more frequent by more than twofold than males. Gender-MSPC combinations showed significant deviation from expected Mendelian segregation (P=0.009) suggesting elimination of males with MSPC copies. We discuss different hypotheses to explain this elimination, including possibility for allelic conflict resulted by the hybridization.
Li, Ronggai; Wang, Tiehui; Bird, Steve; Zou, Jun; Dooley, Helen; Secombes, Christopher J.
2013-01-01
CD79α (also known as Igα) is a component of the B cell antigen receptor complex and plays an important role in B cell signalling. The CD79α protein is present on the surface of B cells throughout their life cycle, and is absent on all other healthy cells, making it a highly reliable marker for B cells in mammals. In this study the spiny dogfish (Squalus acanthias) CD79α (SaCD79α) is described and its expression studied under constitutive and stimulated conditions. The spiny dogfish CD79α cDNA contains an open reading frame of 618 bp, encoding a protein of 205 amino acids. Comparison of the SaCD79α gene with that of other species shows that the gross structure (number of exons, exon/intron boundaries, etc.) is highly conserved across phylogeny. Additionally, analysis of the 5′ flanking region shows SaCD79α lacks a TATA box and possesses binding sites for multiple transcription factors implicated in its B cell-specific gene transcription in other species. Spiny dogfish CD79α is most highly expressed in immune tissues, such as spleen, epigonal and Leydig organ, and its transcript level significantly correlates with those of spiny dogfish immunoglobulin heavy chains. Additionally, CD79α transcription is up-regulated, to a small but significant degree, in peripheral blood cells following stimulation with pokeweed mitogen. These results strongly indicate that, as in mammals, spiny dogfish CD79α is expressed by shark B cells where it associates with surface-bound immunoglobulin to form a fully functional BCR, and thus may serve as a pan-B cell marker in future shark immunological studies. PMID:23454429
Li, Ronggai; Wang, Tiehui; Bird, Steve; Zou, Jun; Dooley, Helen; Secombes, Christopher J
2013-06-01
CD79α (also known as Igα) is a component of the B cell antigen receptor complex and plays an important role in B cell signalling. The CD79α protein is present on the surface of B cells throughout their life cycle, and is absent on all other healthy cells, making it a highly reliable marker for B cells in mammals. In this study the spiny dogfish (Squalus acanthias) CD79α (SaCD79α) is described and its expression studied under constitutive and stimulated conditions. The spiny dogfish CD79α cDNA contains an open reading frame of 618 bp, encoding a protein of 205 amino acids. Comparison of the SaCD79α gene with that of other species shows that the gross structure (number of exons, exon/intron boundaries, etc.) is highly conserved across phylogeny. Additionally, analysis of the 5' flanking region shows SaCD79α lacks a TATA box and possesses binding sites for multiple transcription factors implicated in its B cell-specific gene transcription in other species. Spiny dogfish CD79α is most highly expressed in immune tissues, such as spleen, epigonal and Leydig organ, and its transcript level significantly correlates with those of spiny dogfish immunoglobulin heavy chains. Additionally, CD79α transcription is up-regulated, to a small but significant degree, in peripheral blood cells following stimulation with pokeweed mitogen. These results strongly indicate that, as in mammals, spiny dogfish CD79α is expressed by shark B cells where it associates with surface-bound immunoglobulin to form a fully functional BCR, and thus may serve as a pan-B cell marker in future shark immunological studies. Copyright © 2013 Elsevier Ltd. All rights reserved.
Distribution of mutations in the PEX gene in families with X-linked hypophosphataemic rickets (HYP).
Rowe, P S; Oudet, C L; Francis, F; Sinding, C; Pannetier, S; Econs, M J; Strom, T M; Meitinger, T; Garabedian, M; David, A; Macher, M A; Questiaux, E; Popowska, E; Pronicka, E; Read, A P; Mokrzycki, A; Glorieux, F H; Drezner, M K; Hanauer, A; Lehrach, H; Goulding, J N; O'Riordan, J L
1997-04-01
Mutations in the PEX gene at Xp22.1 (phosphate-regulating gene with homologies to endopeptidases, on the X-chromosome), are responsible for X-linked hypophosphataemic rickets (HYP). Homology of PEX to the M13 family of Zn2+ metallopeptidases which include neprilysin (NEP) as prototype, has raised important questions regarding PEX function at the molecular level. The aim of this study was to analyse 99 HYP families for PEX gene mutations, and to correlate predicted changes in the protein structure with Zn2+ metallopeptidase gene function. Primers flanking 22 characterised exons were used to amplify DNA by PCR, and SSCP was then used to screen for mutations. Deletions, insertions, nonsense mutations, stop codons and splice mutations occurred in 83% of families screened for in all 22 exons, and 51% of a separate set of families screened in 17 PEX gene exons. Missense mutations in four regions of the gene were informative regarding function, with one mutation in the Zn2+-binding site predicted to alter substrate enzyme interaction and catalysis. Computer analysis of the remaining mutations predicted changes in secondary structure, N-glycosylation, protein phosphorylation and catalytic site molecular structure. The wide range of mutations that align with regions required for protease activity in NEP suggests that PEX also functions as a protease, and may act by processing factor(s) involved in bone mineral metabolism.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shomrat, R.; Gluck, E.; Legum, C.
1994-02-15
Duchenne muscular dystrophy (DMD) and Becker muscular dystrophy (BMD) are allelic disorders caused by mutations in the X-linked dystrophin gene. The most common mutations in western populations are deletions that are spread non-randomly throughout the gene. Molecular analysis of the dystrophin gene structure by hybridization of the full length cDNA to Southern blots and by PCR in 62 unrelated Israeli male DMD/BMD patients showed deletions in 23 (37%). This proportion is significantly lower than that found in European and North American populations (55-65%). Seventy-eight percent of the deletions were confined to exons 44-52, half of these exons 44-45, and themore » remaining 22% to exons 1 and 19. There was no correlation between the size of the deletion and the severity of the disease. All the deletions causing frameshift resulted in the DMD phenotypes. 43 refs., 1 fig., 1 tab.« less
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.
Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T
1993-02-01
An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
Dias, José; Renault, Louis; Pérez, Javier; Mirande, Marc
2013-08-16
In animal cells, nine aminoacyl-tRNA synthetases are associated with the three auxiliary proteins p18, p38, and p43 to form a stable and conserved large multi-aminoacyl-tRNA synthetase complex (MARS), whose molecular mass has been proposed to be between 1.0 and 1.5 MDa. The complex acts as a molecular hub for coordinating protein synthesis and diverse regulatory signal pathways. Electron microscopy studies defined its low resolution molecular envelope as an overall rather compact, asymmetric triangular shape. Here, we have analyzed the composition and homogeneity of the native mammalian MARS isolated from rabbit liver and characterized its overall internal structure, size, and shape at low resolution by hydrodynamic methods and small-angle x-ray scattering in solution. Our data reveal that the MARS exhibits a much more elongated and multi-armed shape than expected from previous reports. The hydrodynamic and structural features of the MARS are large compared with other supramolecular assemblies involved in translation, including ribosome. The large dimensions and non-compact structural organization of MARS favor a large protein surface accessibility for all its components. This may be essential to allow structural rearrangements between the catalytic and cis-acting tRNA binding domains of the synthetases required for binding the bulky tRNA substrates. This non-compact architecture may also contribute to the spatiotemporal controlled release of some of its components, which participate in non-canonical functions after dissociation from the complex.
Large exon size does not limit splicing in vivo.
Chen, I T; Chasin, L A
1994-03-01
Exon sizes in vertebrate genes are, with a few exceptions, limited to less than 300 bases. It has been proposed that this limitation may derive from the exon definition model of splice site recognition. In this model, a downstream donor site enhances splicing at the upstream acceptor site of the same exon. This enhancement may require contact between factors bound to each end of the exon; an exon size limitation would promote such contact. To test the idea that proximity was required for exon definition, we inserted random DNA fragments from Escherichia coli into a central exon in a three-exon dihydrofolate reductase minigene and tested whether the expanded exons were efficiently spliced. DNA from a plasmid library of expanded minigenes was used to transfect a CHO cell deletion mutant lacking the dhfr locus. PCR analysis of DNA isolated from the pooled stable cotransfectant populations displayed a range of DNA insert sizes from 50 to 1,500 nucleotides. A parallel analysis of the RNA from this population by reverse transcription followed by PCR showed a similar size distribution. Central exons as large as 1,400 bases could be spliced into mRNA. We also tested individual plasmid clones containing exon inserts of defined sizes. The largest exon included in mRNA was 1,200 bases in length, well above the 300-base limit implied by the survey of naturally occurring exons. We conclude that a limitation in exon size is not part of the exon definition mechanism.
Multiple sequence alignment using multi-objective based bacterial foraging optimization algorithm.
Rani, R Ranjani; Ramyachitra, D
2016-12-01
Multiple sequence alignment (MSA) is a widespread approach in computational biology and bioinformatics. MSA deals with how the sequences of nucleotides and amino acids are sequenced with possible alignment and minimum number of gaps between them, which directs to the functional, evolutionary and structural relationships among the sequences. Still the computation of MSA is a challenging task to provide an efficient accuracy and statistically significant results of alignments. In this work, the Bacterial Foraging Optimization Algorithm was employed to align the biological sequences which resulted in a non-dominated optimal solution. It employs Multi-objective, such as: Maximization of Similarity, Non-gap percentage, Conserved blocks and Minimization of gap penalty. BAliBASE 3.0 benchmark database was utilized to examine the proposed algorithm against other methods In this paper, two algorithms have been proposed: Hybrid Genetic Algorithm with Artificial Bee Colony (GA-ABC) and Bacterial Foraging Optimization Algorithm. It was found that Hybrid Genetic Algorithm with Artificial Bee Colony performed better than the existing optimization algorithms. But still the conserved blocks were not obtained using GA-ABC. Then BFO was used for the alignment and the conserved blocks were obtained. The proposed Multi-Objective Bacterial Foraging Optimization Algorithm (MO-BFO) was compared with widely used MSA methods Clustal Omega, Kalign, MUSCLE, MAFFT, Genetic Algorithm (GA), Ant Colony Optimization (ACO), Artificial Bee Colony (ABC), Particle Swarm Optimization (PSO) and Hybrid Genetic Algorithm with Artificial Bee Colony (GA-ABC). The final results show that the proposed MO-BFO algorithm yields better alignment than most widely used methods. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Cystinuria Associated with Different SLC7A9 Gene Variants in the Cat
Raj, Karthik; Osborne, Carl; Giger, Urs
2016-01-01
Cystinuria is a classical inborn error of metabolism characterized by a selective proximal renal tubular defect affecting cystine, ornithine, lysine, and arginine (COLA) reabsorption, which can lead to uroliths and urinary obstruction. In humans, dogs and mice, cystinuria is caused by variants in one of two genes, SLC3A1 and SLC7A9, which encode the rBAT and bo,+AT subunits of the bo,+ basic amino acid transporter system, respectively. In this study, exons and flanking regions of the SLC3A1 and SLC7A9 genes were sequenced from genomic DNA of cats (Felis catus) with COLAuria and cystine calculi. Relative to the Felis catus-6.2 reference genome sequence, DNA sequences from these affected cats revealed 3 unique homozygous SLC7A9 missense variants: one in exon 5 (p.Asp236Asn) from a non-purpose-bred medium-haired cat, one in exon 7 (p.Val294Glu) in a Maine Coon and a Sphinx cat, and one in exon 10 (p.Thr392Met) from a non-purpose-bred long-haired cat. A genotyping assay subsequently identified another cystinuric domestic medium-haired cat that was homozygous for the variant originally identified in the purebred cats. These missense variants result in deleterious amino acid substitutions of highly conserved residues in the bo,+AT protein. A limited population survey supported that the variants found were likely causative. The remaining 2 sequenced domestic short-haired cats had a heterozygous variant at a splice donor site in intron 10 and a homozygous single nucleotide variant at a branchpoint in intron 11 of SLC7A9, respectively. This study identifies the first SLC7A9 variants causing feline cystinuria and reveals that, as in humans and dogs, this disease is genetically heterogeneous in cats. PMID:27404572
Zhu, Yuyou; Wang, Juan; Wu, Yuanbo; Wang, Guoping; Hu, Bai
2015-01-01
To investigate the genetic pathogenic causes of cerebral autosomal dominant arteriopathy with subcritical infarct and leucoencephalopathy (CADASIL) in two Chinese families, to provide the molecular basis for genetic counseling and antenatal diagnosis. The genetic mutation of gene NOTCH3 of propositus and family members was analyzed in these two CADASIL families by polymerase chain reaction and DNA sequencing technology directly. At the same time, the NOTCH3 gene mutation point of 100 healthy collators was detected, to explicit the pathogenic mutation by function prediction with Polyphen-2 and SIFT. Both propositus of the two families and patients with symptom were all accorded with the clinical features of CADASIL. It was shown by DNA sequencing that the 19(th) exon [c. 3043 T > A (p.Cys1015Ser)] in gene NOTCH3 of propositus, 2 patients (II3, III7), and a presymptomatic patient (IV1) in Family I all had heterozygosity missense mutation; and the 3(rd) exon [c.316T > G, p. (Cys106Gly)] in gene NOTCH3 of the propositus, a patient (IV3) and two presymptomatic patients (IV5, 6) in Family II all had heterozygosity missense mutation; and no mutations were detected in the 100 healthy collators. It was indicated by analyzing the function prediction that the mutation of [c. 3043 T > A (p.Cys1015Ser)] and [c.316T > G, p. (Cys106Gly)] may both influence encoding protein in NOTCH3. By analysis of the conservatism of mutation point in each species, these two basic groups were highly conserved. The heterozygosity missense mutation of 19(th) exon [c. 3043 T > A (p.Cys1015Ser)] and the 3(rd) exon [c.316T > G, p. (Cys106Gly)] in NOTCH3 gene are the new pathogenic mutations of CADASIL, and enriches the mutation spectrum of NOTCH3 gene.
A novel missense HGD gene mutation, K57N, in a patient with alkaptonuria.
Grasko, Jonathan M; Hooper, Amanda J; Brown, Jeffrey W; McKnight, C James; Burnett, John R
2009-05-01
Alkaptonuria is a rare recessive disorder of phenylalanine/tyrosine metabolism due to a defect in the enzyme homogentisate 1,2-dioxygenase (HGD) caused by mutations in the HGD gene. We report the case of a 38 year-old male with known alkaptonuria who was referred to an adult metabolic clinic after initially presenting to an emergency department with renal colic and subsequently passing black ureteric calculi. He complained of severe debilitating lower back pain, worsening over the last few years. A CT scan revealed marked degenerative changes and severe narrowing of the disc spaces along the entire lumbar spine. Sequencing of the HGD gene revealed that he was a compound heterozygote for a previously described missense mutation in exon 13 (G360R) and a novel missense mutation in exon 3 (K57N). Lys(57) is conserved among species and mutation of this residue is predicted to affect HGD protein function by interfering with substrate traffic at the active site. In summary, we describe an alkaptonuric patient and report a novel missense HGD mutation, K57N.
Li, Ronggai; Dooley, Helen; Wang, Tiehui; Secombes, Christopher J; Bird, Steve
2012-04-01
B-cell activating factor (BAFF), also known as tumour necrosis factor (TNF) ligand superfamily member 13B, is an important immune regulator with critical roles in B-cell survival, proliferation, differentiation and immunoglobulin secretion. A BAFF gene has been cloned from spiny dogfish (Squalus acanthias) and its expression studied. The dogfish BAFF encodes for an anchored type-II transmembrane protein of 288 aa with a putative furin protease cleavage site and TNF family signature as seen in BAFFs from other species. The identity of dogfish BAFF has also been confirmed by conserved cysteine residues, and phylogenetic tree analysis. The dogfish BAFF gene has an extra exon not seen in teleost fish, birds and mammals that encodes for 29 aa and may impact on receptor binding. The dogfish BAFF is highly expressed in immune tissues, such as spleen, and is up-regulated by PWM in peripheral blood leucocytes, suggesting a potentially important role in the immune system. Copyright © 2011 Elsevier Ltd. All rights reserved.
Qin, Zhen; Xiao, Yibei; Yang, Xinbin; Mesters, Jeroen R.; Yang, Shaoqing; Jiang, Zhengqiang
2015-01-01
Glycoside hydrolase (GH) family 3 β-N-acetylglucosaminidases widely exist in the filamentous fungi, which may play a key role in chitin metabolism of fungi. A multi-domain GH family 3 β-N-acetylglucosaminidase from Rhizomucor miehei (RmNag), exhibiting a potential N-acetyltransferase region, has been recently reported to show great potential in industrial applications. In this study, the crystal structure of RmNag was determined at 2.80 Å resolution. The three-dimensional structure of RmNag showed four distinctive domains, which belong to two distinguishable functional regions — a GH family 3 β-N-acetylglucosaminidase region (N-terminal) and a N-acetyltransferase region (C-terminal). From structural and functional analysis, the C-terminal region of RmNag was identified as a unique tandem array linking general control non-derepressible 5 (GCN5)-related N-acetyltransferase (GNAT), which displayed glucosamine N-acetyltransferase activity. Structural analysis of this glucosamine N-acetyltransferase region revealed that a unique glucosamine binding pocket is located in the pantetheine arm binding terminal region of the conserved CoA binding pocket, which is different from all known GNAT members. This is the first structural report of a glucosamine N-acetyltransferase, which provides novel structural information about substrate specificity of GNATs. The structural and functional features of this multi-domain β-N-acetylglucosaminidase could be useful in studying the catalytic mechanism of GH family 3 proteins. PMID:26669854
Qin, Zhen; Xiao, Yibei; Yang, Xinbin; Mesters, Jeroen R; Yang, Shaoqing; Jiang, Zhengqiang
2015-12-16
Glycoside hydrolase (GH) family 3 β-N-acetylglucosaminidases widely exist in the filamentous fungi, which may play a key role in chitin metabolism of fungi. A multi-domain GH family 3 β-N-acetylglucosaminidase from Rhizomucor miehei (RmNag), exhibiting a potential N-acetyltransferase region, has been recently reported to show great potential in industrial applications. In this study, the crystal structure of RmNag was determined at 2.80 Å resolution. The three-dimensional structure of RmNag showed four distinctive domains, which belong to two distinguishable functional regions--a GH family 3 β-N-acetylglucosaminidase region (N-terminal) and a N-acetyltransferase region (C-terminal). From structural and functional analysis, the C-terminal region of RmNag was identified as a unique tandem array linking general control non-derepressible 5 (GCN5)-related N-acetyltransferase (GNAT), which displayed glucosamine N-acetyltransferase activity. Structural analysis of this glucosamine N-acetyltransferase region revealed that a unique glucosamine binding pocket is located in the pantetheine arm binding terminal region of the conserved CoA binding pocket, which is different from all known GNAT members. This is the first structural report of a glucosamine N-acetyltransferase, which provides novel structural information about substrate specificity of GNATs. The structural and functional features of this multi-domain β-N-acetylglucosaminidase could be useful in studying the catalytic mechanism of GH family 3 proteins.
Thirty-seven species identified in the Clark County Multi-Species Habitat Conservation Plan were
previously modeled through the Southwest Regional Gap Analysis Project. Existing SWReGAP habitat
models and modeling databases were used to facilitate the revision of mo...
Multi-Lagrangians for integrable systems
NASA Astrophysics Data System (ADS)
Nutku, Y.; Pavlov, M. V.
2002-03-01
We propose a general scheme to construct multiple Lagrangians for completely integrable nonlinear evolution equations that admit multi-Hamiltonian structure. The recursion operator plays a fundamental role in this construction. We use a conserved quantity higher/lower than the Hamiltonian in the potential part of the new Lagrangian and determine the corresponding kinetic terms by generating the appropriate momentum map. This leads to some remarkable new developments. We show that nonlinear evolutionary systems that admit N-fold first order local Hamiltonian structure can be cast into variational form with 2N-1 Lagrangians which will be local functionals of Clebsch potentials. This number increases to 3N-2 when the Miura transformation is invertible. Furthermore we construct a new Lagrangian for polytropic gas dynamics in 1+1 dimensions which is a free, local functional of the physical field variables, namely density and velocity, thus dispensing with the necessity of introducing Clebsch potentials entirely. This is a consequence of bi-Hamiltonian structure with a compatible pair of first and third order Hamiltonian operators derived from Sheftel's recursion operator.
Schlottfeldt, S; Walter, M E M T; Carvalho, A C P L F; Soares, T N; Telles, M P C; Loyola, R D; Diniz-Filho, J A F
2015-06-18
Biodiversity crises have led scientists to develop strategies for achieving conservation goals. The underlying principle of these strategies lies in systematic conservation planning (SCP), in which there are at least 2 conflicting objectives, making it a good candidate for multi-objective optimization. Although SCP is typically applied at the species level (or hierarchically higher), it can be used at lower hierarchical levels, such as using alleles as basic units for analysis, for conservation genetics. Here, we propose a method of SCP using a multi-objective approach. We used non-dominated sorting genetic algorithm II in order to identify the smallest set of local populations of Dipteryx alata (baru) (a Brazilian Cerrado species) for conservation, representing the known genetic diversity and using allele frequency information associated with heterozygosity and Hardy-Weinberg equilibrium. We worked in 3 variations for the problem. First, we reproduced a previous experiment, but using a multi-objective approach. We found that the smallest set of populations needed to represent all alleles under study was 7, corroborating the results of the previous study, but with more distinct solutions. In the 2nd and 3rd variations, we performed simultaneous optimization of 4 and 5 objectives, respectively. We found similar but refined results for 7 populations, and a larger portfolio considering intra-specific diversity and persistence with populations ranging from 8-22. This is the first study to apply multi-objective algorithms to an SCP problem using alleles at the population level as basic units for analysis.
Hu, Dong Gui; McKinnon, Ross A; Hulin, Julie-Ann; Mackenzie, Peter I; Meech, Robyn
2016-12-27
Nearly 20 different transcripts of the human androgen receptor (AR) are reported with two currently listed as Refseq isoforms in the NCBI database. Isoform 1 encodes wild-type AR (type 1 AR) and isoform 2 encodes the variant AR45 (type 2 AR). Both variants contain eight exons: they share common exons 2-8 but differ in exon 1 with the canonical exon 1 in isoform 1 and the variant exon 1b in isoform 2. Splicing of exon 1 or exon 1b is reported to be mutually exclusive. In this study, we identified a novel exon 1b (1b/TAG) that contains an additional TAG trinucleotide upstream of exon 1b. Moreover, we identified AR transcripts in both normal and cancerous breast and prostate cells that contained either exon 1b or 1b/TAG spliced between the canonical exon 1 and exon 2, generating nine-exon AR transcripts that we have named isoforms 3a and 3b. The proteins encoded by these new AR variants could regulate androgen-responsive reporters in breast and prostate cancer cells under androgen-depleted conditions. Analysis of type 3 AR-GFP fusion proteins showed partial nuclear localization in PC3 cells under androgen-depleted conditions, supporting androgen-independent activation of the AR. Type 3 AR proteins inhibited androgen-induced growth of LNCaP cells. Microarray analysis identified a small set of type 3a AR target genes in LNCaP cells, including genes known to modulate growth and proliferation of prostate cancer ( PCGEM1 , PEG3 , EPHA3 , and EFNB2 ) or other types of human cancers ( TOX3 , ST8SIA4 , and SLITRK3 ), and genes that are diagnostic/prognostic biomarkers of prostate cancer ( GRINA3 , and BCHE ).